
Experiment No. 2
Aim- To implement a morphological parser that accepts or rejects a given string.

Objective- To understand and implement a morphological parser that accepts or rejects a given string.

Outcome- Students will be able to understand morphological parsing and use it to accept or reject a given string.

Theory-

The most sophisticated methods for lemmatization involve complete morphological parsing of the
word. Morphology is the study of the way words are built up from smaller meaning-bearing units
called morphemes. Two broad classes of morphemes can be distinguished: stems—the central
morpheme of the word, supplying the main meaning—and affixes—adding “additional” meanings
of various kinds. So, for example, the word fox consists of one morpheme (the morpheme fox) and
the word cats consists of two: the morpheme cat and the morpheme -s. A morphological parser
takes a word like cats and parses it into the two morphemes cat and s, or parses a Spanish word
like amaren (‘if in the future they would love’) into the morpheme amar ‘to love’, and the
morphological features 3PL and future subjunctive.
The goal of morphological parsing is to find out what morphemes a given word is built from. For
example, a morphological parser should be able to tell us that the word cats is the plural form of
the noun stem cat, and that the word mice is the plural form of the noun stem mouse. So, given the
string cats as input, a morphological parser should produce an output that looks similar to cat N
PL. Here are some more examples:

mouse mouse N SG
mice mouse N PL
foxes fox N PL
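
To make this expected input/output behaviour concrete, the following is a minimal Python sketch (the function name analyze and the hard-coded table are illustrative assumptions, not an actual parser) that simply looks up the example analyses listed above:

# Minimal sketch of the desired behaviour: map a surface form to
# (stem, category, number).  The table just hard-codes the examples
# from the text; a real parser computes this with transducers.
EXAMPLE_ANALYSES = {
    "cats":  ("cat",   "N", "PL"),
    "mouse": ("mouse", "N", "SG"),
    "mice":  ("mouse", "N", "PL"),
    "foxes": ("fox",   "N", "PL"),
}

def analyze(word):
    """Return (stem, category, number) for a known word, or None to reject it."""
    return EXAMPLE_ANALYSES.get(word)

print(analyze("cats"))   # ('cat', 'N', 'PL')
print(analyze("xyz"))    # None -> rejected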

Morphological parsing yields information that is useful in many NLP applications. In
parsing, for example, it helps to know the agreement features of words. Similarly, grammar
checkers need agreement information to detect agreement errors. Morphological
information also helps spell checkers to decide whether something is a possible word or
not, and in information retrieval it is used to search not only for cats, if that is the
user's input, but also for cat.

To get from the surface form of a word to its morphological analysis, we are going to
proceed in two steps. First, we are going to split the words up into its possible
components. So, we will make cat + s out of cats, using + to indicate morpheme
boundaries. In this step, we will also take spelling rules into account, so that there are
two possible ways of splitting up foxes, namely foxe + s and fox + s. The first one
assumes that foxe is a stem and s the suffix, while the second one assumes that the stem
is fox and that the e has been introduced by a spelling rule that inserts an e when the plural s attaches to a stem ending in s, x, or z.

In the second step, we will use a lexicon of stems and affixes to look up the categories
of the stems and the meaning of the affixes. So, cat + s will get mapped to cat N PL,
and fox + s to fox N PL. We will also find out now that foxe is not a legal stem. This
tells us that splitting foxes into foxe + s was actually an incorrect way of splitting foxes,
which should be discarded. But note that for the word houses splitting it into house +
s is correct.

Here is a picture illustrating the two steps of our morphological parser with some
examples.

We will now build two transducers: one to do the mapping from the surface form to
the intermediate form and the other one to do the mapping from the intermediate form
to the underlying form.

1 From the Surface to the Intermediate Form

To do morphological parsing this transducer has to map from the surface form to the
intermediate form. For now, we just want to cover the cases of English singular and
plural nouns that we have seen above. This means that the transducer may or may not
insert a morpheme boundary if the word ends in s. There may be singular words that
end in s (e.g. kiss). That's why we don't want to make the insertion of a morpheme
boundary obligatory. If the word ends in ses, xes or zes, it may furthermore delete
the e when introducing a morpheme boundary. Here is a transducer that does this. The
"other" arc in this transducer stands for a transition that maps all symbols except for s,
z, and x to themselves.
Let's see how this transducer deals with some of our examples. The following graphs
show the possible sequences of states that the transducer can go through given the
surface forms cats and foxes as input.
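
The transducer itself is normally drawn as a state diagram; the following Python sketch (an assumed illustration using plain string operations, not the original transducer) mimics its behaviour by returning every candidate intermediate form for a surface word:

def split_surface(word):
    """Surface form -> set of candidate intermediate forms.

    Mimics the surface-to-intermediate transducer: inserting a morpheme
    boundary before a final 's' is optional, and for words ending in
    'ses', 'xes' or 'zes' the 'e' may additionally be deleted.
    """
    candidates = {word}                       # no boundary inserted at all
    if word.endswith("s") and len(word) > 1:
        candidates.add(word[:-1] + "+s")      # cats -> cat+s, foxes -> foxe+s
        if word.endswith(("ses", "xes", "zes")):
            candidates.add(word[:-2] + "+s")  # foxes -> fox+s (e deleted)
    return candidates

print(split_surface("cats"))    # {'cats', 'cat+s'}
print(split_surface("foxes"))   # {'foxes', 'foxe+s', 'fox+s'}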

2 From the Intermediate Form to the Morphological Structure

Now, we want to take the intermediate form that we produced in the previous section
and map it to the underlying form. The input that this transducer has to accept is of one
of the following forms:

1. regular noun stem, e.g. cat


2. regular noun stem + s, e.g. cat + s
3. singular irregular noun stem, e.g. mouse
4. plural irregular noun stem, e.g. mice

In the first case, the transducer has to map all symbols of the stem to themselves and
then output N and SG. In the second case, it maps all symbols of the stem to themselves,
but then outputs N and replaces the plural morpheme s with PL. In the third case, it does the same as in the
first case. Finally, in the fourth case, the transducer should map the irregular plural noun
stem to the corresponding singular stem (e.g. mice to mouse) and then it should
add N and PL. So, the general structure of this transducer looks like this:
What still needs to be specified is what exactly the parts between state 1 and states 2, 3,
and 4 look like. Here, we need to recognize noun stems and decide whether
they are regular or not. We do this by encoding a lexicon in the following way. The
transducer part that recognizes cat, for instance, looks like this:

And the transducer part mapping mice to mouse can be specified as follows:

Plugging these (partial) transducers into the transducer given above, we get a transducer
that checks that the input has the right form and adds category and number information.
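
The lexicon-encoding part of this transducer can likewise be sketched in Python (the lexicon contents and the function name parse_intermediate are illustrative assumptions); it takes an intermediate form and either returns the underlying form or rejects it:

# Illustrative lexicon; a real system would encode this as transducer arcs.
REGULAR_STEMS = {"cat", "fox", "dog", "house"}
IRREGULAR_SG  = {"mouse"}
IRREGULAR_PL  = {"mice": "mouse"}            # plural form -> singular stem

def parse_intermediate(form):
    """Intermediate form (e.g. 'fox+s') -> underlying form (e.g. 'fox N PL'),
    or None if the form is not licensed by the lexicon."""
    if form in REGULAR_STEMS or form in IRREGULAR_SG:
        return form + " N SG"                # cases 1 and 3: singular stems
    if form in IRREGULAR_PL:
        return IRREGULAR_PL[form] + " N PL"  # case 4: mice -> mouse N PL
    if form.endswith("+s") and form[:-2] in REGULAR_STEMS:
        return form[:-2] + " N PL"           # case 2: regular stem + s
    return None                              # e.g. 'foxe+s' is rejected

print(parse_intermediate("cat+s"))   # cat N PL
print(parse_intermediate("foxe+s"))  # None
print(parse_intermediate("mice"))    # mouse N PL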

3 Combining the two Transducers

If we now let the two transducers for mapping from the surface to the intermediate form
and for mapping from the intermediate to the underlying form run in a cascade (i.e. we
let the second transducer run on the output of the first one), we can do a morphological
parse of (some) English noun phrases. However, we can also use this transducer for
generating a surface form from an underlying form. Remember that we can change the
direction of translation when using a transducer in translation mode.
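
Assuming the split_surface and parse_intermediate sketches given earlier are in scope, cascading them in Python simply means running the second function over every output of the first and keeping the analyses that survive:

def morphological_parse(word):
    """Cascade the two sketched transducers: surface -> intermediate -> underlying.
    Returns the set of accepted analyses; an empty set means the word is rejected."""
    analyses = set()
    for intermediate in split_surface(word):
        underlying = parse_intermediate(intermediate)
        if underlying is not None:
            analyses.add(underlying)
    return analyses

print(morphological_parse("foxes"))   # {'fox N PL'}
print(morphological_parse("foxe"))    # set() -> rejected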

Now, consider the input berries. What will our cascaded transducers make of it?
The first one will return two possible splittings, berries and berrie + s, but the one that
we would want, berry + s, is not one of them. The reason for this is that there is another
spelling rule at work here, which we haven't taken into account at all. This rule says
that "y changes to ie before s". So, in the first step there may be more than one spelling
rule that has to be applied.
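
One quick way to cover this case in the splitting sketch given earlier is to add the "y changes to ie before s" rule as a further candidate split (again an illustrative assumption rather than a real transducer); the more principled, transducer-based solutions are discussed next:

def split_surface_v2(word):
    """Like split_surface, but also applies the 'y -> ie before s' rule,
    so that berries yields the candidate berry+s as well."""
    candidates = {word}
    if word.endswith("s") and len(word) > 1:
        candidates.add(word[:-1] + "+s")       # berries -> berrie+s
        if word.endswith(("ses", "xes", "zes")):
            candidates.add(word[:-2] + "+s")   # e-deletion rule
        if word.endswith("ies"):
            candidates.add(word[:-3] + "y+s")  # ie -> y rule: berries -> berry+s
    return candidates

print(split_surface_v2("berries"))  # {'berries', 'berrie+s', 'berry+s'}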

There are basically two ways of dealing with this. First, we can formulate the
transducers for each of the rules in such a way that they can be run in a cascade. Another
possibility is to specify the transducers in such a way that they can be applied in parallel.

There are algorithms for combining several cascaded transducers, or several transducers
that are supposed to be applied in parallel, into a single transducer. However, these
algorithms only work if the individual transducers obey certain restrictions, so we
have to take some care when specifying them.

Conclusion- Thus, students studied morphological parsing in depth, along with its implementation
in Python/R programming.
PART B
(PART B: TO BE COMPLETED BY STUDENTS)

(Students must submit the soft copy as per the following segments within two hours of the practical. The soft
copy must be uploaded on Blackboard or emailed to the concerned lab in-charge faculty at the end
of the practical in case there is no Blackboard access available.)

Roll No.: BE-C49    Name: Wakhare Amar Sanjay
Class: BE-Comps    Batch: C3
Date of Experiment: 24/07/2023    Date of Submission: 24/07/2023
Grade:

B.1 Software Code written by student:


(Paste your software code completed during the 2 hours of practical in the lab here)
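
Since the original listing is not reproduced here, the following is a small, self-contained Python sketch of one possible implementation that accepts or rejects a given string; the lexicon, the spelling rules, and all function names are illustrative assumptions rather than the actual submitted code.

# Minimal morphological parser sketch: accept a word if it can be analysed
# as a known noun stem plus an optional plural suffix, otherwise reject it.
STEMS = {"cat": "cat", "fox": "fox", "dog": "dog", "berry": "berry",
         "house": "house", "mouse": "mouse", "mice": "mouse"}
IRREGULAR_PLURALS = {"mice"}

def candidate_splits(word):
    """Generate (stem, suffix) candidates using simple spelling rules."""
    yield word, ""                       # bare stem
    if word.endswith("s"):
        yield word[:-1], "s"             # cats -> cat + s
    if word.endswith(("ses", "xes", "zes")):
        yield word[:-2], "s"             # foxes -> fox + s
    if word.endswith("ies"):
        yield word[:-3] + "y", "s"       # berries -> berry + s

def parse(word):
    """Return an analysis string if the word is accepted, else None."""
    for stem, suffix in candidate_splits(word):
        if stem in IRREGULAR_PLURALS and suffix:
            continue                     # block forms like 'mices'
        if stem in STEMS:
            number = "PL" if suffix == "s" or stem in IRREGULAR_PLURALS else "SG"
            return f"{STEMS[stem]} N {number}"
    return None

if __name__ == "__main__":
    for w in ["cats", "foxes", "berries", "mice", "foxe", "catss"]:
        result = parse(w)
        print(w, "->", "ACCEPTED: " + result if result else "REJECTED")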
B.2 Input and Output:
(Not Required)

B.3 Observations and learning:


(Students are expected to comment on the output obtained with clear observations and learning for each task/ sub
part assigned)
B.4 Conclusion:
(Students must write the conclusion as per the attainment of individual outcome listed above and learning/observation
noted in section B.3)

B.5 Question of Curiosity


(To be answered by student based on the practical performed and learning/observations)
Q1: What is morphology? Why do we need to do Morphological Analysis? Discuss various
application domains of Morphological Analysis.

Ans: Morphology deals with parts of words called morphemes. Morphological analysis looks at
how morphemes can be combined or separated to make different words with different meanings.
The most common examples are plural nouns. Usually a noun's root word alone means the singular
version; for example, for the morpheme cat, the root word cat means "one cat." To talk about two
or more cats, we take the morpheme cat and add an -s to the end; this is because spelling plurals
with -s or -es is common in English. Understanding the relationship between cat, cats, and the
suffix -s is all part of morphology.

Morphological analysis can be used to reduce the size of the lexicon, and it also plays an important
role in determining the pronunciation of a homograph. It further helps in handling schwa deletion.
As noted in the theory above, typical application domains include syntactic parsing, grammar
checking, spell checking, and information retrieval.

Q2: Explain derivational and inflectional morphology with suitable example.


Ans: Inflectional morphemes never change the grammatical category (part of speech) of a word; for
example, the plural -s in cats and the past-tense -ed in walked only add grammatical information
such as number or tense. Derivational morphemes, by contrast, often change the part of speech of
a word: the verb read becomes the noun reader when we add the derivational morpheme -er. However,
some derivational morphemes do not change the grammatical category of a word (e.g. un- in unhappy,
which is still an adjective).
Q3: What is the role of FSA in Morphological analysis? Explain FST in details
Ans: A finite-state transducer (FST) is a finite-state machine with two memory tapes, following
the terminology for Turing machines: an input tape and an output tape. This contrasts with an
ordinary finite-state automaton (FSA), which has a single tape. An FST is a type of finite-state
automaton that maps between two sets of symbols, and it is more general than an FSA: an FSA defines
a formal language by defining a set of accepted strings, while an FST defines relations between
sets of strings. An FST reads a set of strings on the input tape and generates a set of relations
on the output tape, so it can be thought of as a translator or relater between strings in a set.
In morphological analysis, an FSA can only accept or reject a word, i.e. recognize whether it is a
well-formed word of the language, whereas an FST can additionally produce an analysis: given a
string of letters on its input tape, it outputs the corresponding string of morphemes.
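
As a concrete illustration of the two-tape idea, here is a toy deterministic FST in Python with an explicit transition table (states, alphabet, and transitions are all made up for illustration): it reads an input word followed by the end marker '#' on the input tape and writes, on the output tape, the same word with a morpheme boundary '+' inserted before a word-final 's'. Unlike the transducer described in the theory above, which treats boundary insertion as optional, this toy version always inserts it.

# (state, input symbol) -> (next state, output string)
def step(state, ch):
    if state == "q0":
        if ch == "s":
            return "hold_s", ""       # possibly a final 's': hold it back
        if ch == "#":
            return "accept", ""
        return "q0", ch               # copy an ordinary character
    if state == "hold_s":
        if ch == "#":
            return "accept", "+s"     # the held 's' was word-final
        if ch == "s":
            return "hold_s", "s"      # emit the previous 's', hold the new one
        return "q0", "s" + ch         # the held 's' was not final after all
    raise ValueError("no transition defined")

def transduce(word):
    state, output = "q0", ""
    for ch in word + "#":             # read the input tape with an end marker
        state, out = step(state, ch)
        output += out                 # write to the output tape
    return output if state == "accept" else None

print(transduce("cats"))    # cat+s
print(transduce("mouse"))   # mouse (no word-final 's', nothing inserted)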
