02 - Morphological Analysis

NLP-Final sem study material

Uploaded by

Khushi khokhar


Unit # 1

Morphological Analysis
Typical use case
"Absolutely loving the new update to the app. Great job!" → Positive review
"Very disappointed with the customer service, not helpful at all." → Negative review
"I noticed the store has extended its hours. Interesting move." → Neutral comment
"Does anyone know if this product is available in blue?" → Enquiry
"Just tried the new cafe downtown, and it's amazing!" → Praise, positive feedback
"I'm having trouble logging into my account, can you assist me?" → Support request
"My order has been delayed for two weeks now, what's going on?" → Complaint
...
"What are your store hours on weekends?"
"Can I get more information about the warranty on the laptop models you sell?"
Suggestions Service Enquiry Complaint Top Mgmt
0.45 0.72 0.35 0.85 0.15
What is Morphology ?
In linguistics, Morphology is the study of the internal structure of words. It is the study
of words, how they are formed, and their relationship to other words in the same
language. It analyzes the structure of words and parts of words such as stems, root
words, prefixes, and suffixes. Morphology also looks at parts of speech, intonation
and stress, and the ways context can change a word's pronunciation and meaning.

It focuses on how the components within a word (stems, root words, prefixes,
suffixes, etc.) are arranged or modified to create different meanings.

Morphology varies greatly between languages. In languages such as Russian, word
endings indicate the role of a word in a sentence. As a result, morphological analysis
depends heavily on the source language, and an understanding of what is supported
within that language plays a vital role in developing an NLP application.
The Google Natural Language API, for example, uses morphological analysis to infer
grammatical information about words.
Types of Morphemes
Some important terms used in morphology are:
Stem:
The part of a word responsible for its lexical meaning. It is the main part of a
word to which affixes (prefixes, suffixes, infixes, circumfixes) are added, and the base
form that remains after removing all the affixes that modify its meaning or create new
words. Examples:

In the word "unbelievable" the stem is "believe."
Prefix: "un-" (meaning not)
Stem: "believe" (basic meaning: accept as true)
Suffix: "-able" (meaning able to be)
The word "unbelievable" thus means 'not able to be believed.'

For the word "runner," the stem is "run."
Stem: "run" (basic action)
Suffix: "-er" (one who does the action; the final "n" of the stem is doubled in spelling)
"Runner" refers to 'one who runs.'

Root:
The most basic, irreducible part that carries the core meaning of the word. Unlike
stems, roots cannot be broken down into smaller parts and typically do not have
prefixes, suffixes, or infixes attached to them in their most basic form. Roots form the
base upon which stems and ultimately full words are built. In many cases, the root is
the same as the stem.
For the word "reaction," the root is "act."
Prefix: "re-" (meaning again or back)
Root: "act" (basic action or doing)
Suffix: "-ion" (denoting the action or condition of)
"Reaction" refers to 'the action of doing something again or in response.'

In "writer" the root is "write."
Root: "write" (basic action: to form letters or words)
Suffix: "-er" (one who does the action)
"Writer" refers to 'one who writes.'

Part of Speech:
Is a category of words in a language that have similar grammatical properties.
Common parts of speech include nouns, verbs, adjectives, adverbs, pronouns,
prepositions, conjunctions, and interjections. Each part of speech plays a specific role
in a sentence, contributing to the sentence's overall meaning and structure.
Understanding parts of speech is crucial for analyzing and constructing sentences
effectively.
Nouns: Words that name people, places, things, or ideas.
Example: "Computer," "Paris," "happiness."
Verbs: Words that express actions, occurrences, or states of being.
Example: "run," "is," "think."
Adjectives: Words that describe or modify nouns.
Example: "red," "quick," "intelligent."
Adverbs: Words that modify verbs, adjectives, or other adverbs, often indicating
manner, place, time, or degree.
Example: "quickly," "there," "very."
Pronouns: Words that take the place of nouns.
Example: "he," "they," "it."
Prepositions: Words that show the relationship between a noun (or pronoun)
and other words in a sentence, often indicating time, place, or direction.
Example: "in," "at," "by."
Conjunctions: Words that join words, phrases, or clauses.
Example: "and," "but," "because."
Interjections: Words used to express emotions or sudden bursts of feeling.
Example: "Wow!," "Ouch!," "Hey!"

Inflectional morphology
Adds information to a word consistent with its context within a sentence.
Examples
• Number (singular versus plural)
  automaton → automata
  walk → walks
• Case (nominative versus accusative versus…)
  he, him, his, …
Morphology Analysis Approaches
Morphological analysis may be defined as the process of obtaining grammatical
information from tokens, given their suffix information. Morphological analysis can be
performed in three ways:
1. Morpheme-based morphology (or an item and arrangement approach),
2. Word-based morphology (or a word and paradigm approach), and
3. Lexeme-based morphology (or an item and process approach).

1. Morpheme-based morphology
Morpheme-based morphology analyzes and describes the structure of words by
breaking them down into their smallest meaningful units, called morphemes. There
are two main types of morphemes in morpheme-based morphology.
Free Morphemes: These can stand alone as words (e.g., "book", "go").
Bound Morphemes: These cannot stand alone and must be attached to a free
morpheme (e.g., prefixes like "un-", suffixes like "-ing"). Words are formed by
combining these morphemes in a linear arrangement.
Word: "Unhappiness"
Structure: [Prefix "Un-"] + [Root "happy"] + [Suffix "-ness"]

This structure shows that the word "unhappiness" is composed of three morphemes:
"un-" (a prefix), "happy" (a root), and "-ness" (a suffix). Each morpheme contributes to
the overall meaning of the word.
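
The linear arrangement described above can be sketched in a few lines of Python. This is only a toy illustration: the affix lists, the function name segment, and the single spelling rule are assumptions made for the example, not a complete account of English.

```python
# Toy item-and-arrangement segmenter. The affix lists and the spelling
# rule are illustrative assumptions, not a full description of English.
PREFIXES = ("un", "re", "dis")
SUFFIXES = ("ness", "able", "ing", "er")


def segment(word):
    """Split a word into (prefix, root, suffix); missing parts are ''."""
    word = word.lower()
    # Take the first listed prefix that matches the start of the word.
    prefix = next((p for p in PREFIXES if word.startswith(p)), "")
    rest = word[len(prefix):]
    # Take the first listed suffix that matches the end of what is left.
    suffix = next((s for s in SUFFIXES if rest.endswith(s)), "")
    root = rest[:len(rest) - len(suffix)] if suffix else rest
    # Undo the y -> i spelling change before a suffix ("happi" -> "happy").
    if suffix and root.endswith("i"):
        root = root[:-1] + "y"
    return prefix, root, suffix


print(segment("unhappiness"))
print(segment("book"))
```

A free morpheme such as "book" comes back with an empty prefix and suffix. Because the sketch matches strings blindly, it will also cut words that merely look affixed (for example "under"), which is why practical analyzers check candidate roots against a lexicon.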
2. Word-based morphology
Word-based morphology treats words, rather than morphemes, as the central units of
morphological analysis. It works with the full forms of words instead of segmenting
them into constituent morphemes, in contrast to morpheme-based morphology, which
breaks words down into the smallest units of meaning. Words are treated as indivisible
wholes or as bases to which processes are applied, and the approach looks at how they
change as whole units through processes like inflection, derivation, and compounding.
There is less focus on dividing the word into prefixes, stems, and suffixes. Instead,
the processes that affect the word as a whole are examined.

Base Word: "Run" → Past Tense Process → Result: "Ran"
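
One way to picture the whole-word view is a paradigm table that stores complete forms per grammatical cell and never splits them. The sketch below assumes a tiny hand-written table; the names PARADIGMS, apply_process, and analyze are hypothetical and chosen only for this example.

```python
# Toy word-and-paradigm table: whole word forms are stored per cell,
# with no attempt to cut them into morphemes. Entries are illustrative.
PARADIGMS = {
    "run": {"base": "run", "past": "ran", "3sg": "runs", "progressive": "running"},
    "walk": {"base": "walk", "past": "walked", "3sg": "walks", "progressive": "walking"},
}


def apply_process(base, process):
    """Return the whole-word form for a process, or None if unknown."""
    return PARADIGMS.get(base, {}).get(process)


def analyze(form):
    """Find every (base, process) cell that holds this surface form."""
    return [(base, cell)
            for base, cells in PARADIGMS.items()
            for cell, value in cells.items()
            if value == form]


print(apply_process("run", "past"))
print(analyze("ran"))
```

Irregular forms such as "ran" need no special handling here, because the table relates whole words to each other rather than deriving them from parts.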

3. Lexeme-based morphology
Lexeme-based morphology is a theoretical framework in linguistics, which
separates morphological processes into two layers: the lexical layer and the
inflectional layer.
- The lexical layer consists of lexemes, which are the abstract, minimal units of
meaning without any inflectional endings or derivational affixes. They represent
the set of words that are often listed as "dictionary entries."
- The inflectional layer involves the addition of affixes to lexemes to express
grammatical relationships and features, such as tense, number, gender, etc.,
without changing the core meaning or word class (e.g., "walk" to "walked").
[ Lexeme "walk" ] → [ Derivation (N/A in this case) ]
        ↓
[ Inflection ] → [ "walk" (base) | "walks" (3rd person singular) | "walked" (past) | "walking" (progressive) ]
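
The two layers can be illustrated with a small function that takes a lexeme and applies inflectional processes to it. This is a minimal sketch under stated assumptions: it covers only regular English verbs, and the only spelling adjustment is dropping a final "e" before "-ed" and "-ing". The function name inflect is hypothetical.

```python
def inflect(lexeme):
    """Generate the regular inflected forms of an English verb lexeme.

    Only plain suffixation and the final silent "e" are handled; irregular
    verbs and other spelling changes are outside this sketch.
    """
    # Drop a final "e" so that "bake" gives "baked" and "baking".
    stem = lexeme[:-1] if lexeme.endswith("e") else lexeme
    return {
        "base": lexeme,
        "3rd person singular": lexeme + "s",
        "past": stem + "ed",
        "progressive": stem + "ing",
    }


for cell, form in inflect("walk").items():
    print(cell, "->", form)
```

The lexeme itself stays unchanged in every cell, which mirrors the point that inflection adds grammatical information without altering the core meaning or word class.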
Morphology Analysis (contd)
A morphological analyzer is a program that analyzes the morphology of a given input
token and outputs morphological information such as its root, stem, prefix, and so
on.
During morphological analysis, each word is analyzed individually and assigned a
syntactic category to resolve its ambiguity. Non-word tokens such as punctuation are
removed from the input.

Stemming
Stemming algorithms aim to remove the affixes that mark, e.g., grammatical role,
tense, or derivational morphology, leaving only the stem of the word. This is a difficult
problem due to irregular words (e.g., common verbs in English), complicated
morphological rules, and part-of-speech and sense ambiguities.
NLTK stemmers
- PorterStemmer
- SnowballStemmer
- LancasterStemmer
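
To make the idea of affix removal concrete, here is a deliberately crude suffix stripper written with the standard library only. It is not the Porter, Snowball, or Lancaster algorithm; the suffix list, the function name crude_stem, and the minimum stem length are assumptions made for illustration.

```python
# A deliberately crude suffix stripper. It is NOT the Porter, Snowball or
# Lancaster algorithm; the suffix list and length check are assumptions.
SUFFIXES = ("ing", "ed", "es", "s")


def crude_stem(word, min_stem_len=3):
    """Strip the first matching suffix if enough of the word remains."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[:-len(suffix)]
    return word


for w in ("walking", "walked", "walks", "running", "ran"):
    print(w, "->", crude_stem(w))
```

The regular forms of "walk" collapse to one stem, while "running" becomes "runn" and the irregular "ran" is left untouched. These are the kinds of errors that make stemming hard, as noted above.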
Lemmatization
Lemmatization is another technique used to reduce inflected words to their root
word. It describes the algorithmic process of identifying an inflected word’s “lemma”
(dictionary form) based on its intended meaning.
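
A dictionary lookup keyed by word form and part of speech shows why the intended meaning matters. The sketch below uses a small hand-written table; the LEMMAS entries, the coarse part-of-speech labels, and the function name lemmatize are assumptions for illustration and do not reproduce any particular library.

```python
# Toy lemma dictionary keyed by (word form, part of speech).
# The entries are hand-written examples, not taken from a real lexicon.
LEMMAS = {
    ("ran", "VERB"): "run",
    ("running", "VERB"): "run",
    ("better", "ADJ"): "good",
    ("meeting", "VERB"): "meet",
    ("meeting", "NOUN"): "meeting",
    ("automata", "NOUN"): "automaton",
}


def lemmatize(word, pos):
    """Look up the lemma; fall back to the word itself if it is unknown."""
    word = word.lower()
    return LEMMAS.get((word, pos), word)


print(lemmatize("meeting", "VERB"))
print(lemmatize("meeting", "NOUN"))
print(lemmatize("ran", "VERB"))
```

The same surface form "meeting" maps to different lemmas depending on its part of speech, which a pure suffix stripper cannot distinguish.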

POS
Part of natural language processing is determining the role of each word or token in
a body of text. In the world of NLP, we call this process part-of-speech (POS)
tagging. The NLTK package comes with a function pos_tag() that makes this job
relatively seamless, and gives us a good starting point.
VB verb, base form – take
VBD verb, past tense – took
VBG verb, gerund/present participle – taking
VBN verb, past participle – taken
VBP verb, non-3rd person singular present – take
VBZ verb, 3rd person singular present – takes
NN noun, singular – desk
NNS noun, plural – desks
NNP proper noun, singular – America
NNPS proper noun, plural – Americans
RB adverb – very, silently
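
NLTK's pos_tag() returns a list of (token, tag) pairs. The sketch below shows one way such pairs could be interpreted using the tags listed above. It does not call NLTK: the tagged input is written by hand for illustration, and the names TAG_DESCRIPTIONS, describe, and coarse_category are hypothetical.

```python
# Descriptions based on the tag list above (a subset of the Penn Treebank tags).
TAG_DESCRIPTIONS = {
    "VB": "verb, base form",
    "VBD": "verb, past tense",
    "VBG": "verb, gerund/present participle",
    "VBN": "verb, past participle",
    "VBP": "verb, non-3rd person singular present",
    "VBZ": "verb, 3rd person singular present",
    "NN": "noun, singular",
    "NNS": "noun, plural",
    "NNP": "proper noun, singular",
    "NNPS": "proper noun, plural",
    "RB": "adverb",
}

# The first two letters of these tags identify the broad word class.
COARSE = {"VB": "verb", "NN": "noun", "RB": "adverb"}


def describe(tagged_tokens):
    """Attach a readable description to each (token, tag) pair."""
    return [(word, tag, TAG_DESCRIPTIONS.get(tag, "unknown tag"))
            for word, tag in tagged_tokens]


def coarse_category(tag):
    """Collapse a fine-grained tag to its broad word class."""
    return COARSE.get(tag[:2], "other")


# Hand-written pairs in the (token, tag) shape; not real tagger output.
tagged = [("America", "NNP"), ("took", "VBD"), ("desks", "NNS"), ("silently", "RB")]
for word, tag, meaning in describe(tagged):
    print(word, tag, meaning, coarse_category(tag))
```

Collapsing the fine-grained tags to a broad class is often enough when only the word category is needed, for example when choosing which lemma dictionary to consult.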

Stemming vs Lemmatization
Stemming and lemmatization are both text-processing techniques that aim to
reduce inflected words to a common base form. Despite sharing this overarching
objective, the two techniques are not the same.
Stemming algorithms attempt to find the common base of various inflections by
cutting off the endings or beginnings of the word. This crude heuristic approach
typically makes them fast and efficient but not always accurate.
Lemmatization algorithms, on the other hand, attempt to find the common base of
inflected words by conducting a fuller morphological analysis. To reduce inflections
accurately, a detailed dictionary must be kept so that the algorithm can link an
inflected word back to its lemma. Lemmatization algorithms therefore sacrifice speed
and efficiency for accuracy, but they tend to return more meaningful base forms than
stemming algorithms.
Popular NLP Tools
NLTK
NLTK is a leading platform for building Python programs to work with human
language data. It provides easy-to-use interfaces to over 50 corpora and lexical
resources such as WordNet, along with a suite of text processing libraries for
classification, tokenization, stemming, tagging, parsing, and semantic reasoning,
and wrappers for industrial-strength NLP libraries.

Google Natural Language API

The Google Natural Language API is an easy-to-use interface to a set of powerful NLP
models which have been pre-trained by Google to perform various tasks. As these
models have been trained on very large document corpora, their performance
is usually quite good as long as they are used on datasets that do not use
highly idiosyncratic language.
The Natural Language API comprises five different services:

Syntax Analysis
Sentiment Analysis
Entity Analysis
Entity Sentiment Analysis
Text Classification
The analyzeSyntax method returns details about the linguistic structure of the given
text. For each token in the text, the Natural Language API provides information about
its internal structure (morphology) and its role in the sentence (syntax).

Google AutoML Natural Language

• If the Natural Language API is not flexible enough for business purposes, then
AutoML Natural Language is the next choice. AutoML is a new Google Cloud
Service (still in beta) that enables the user to create customized machine learning
models. In contrast to the Natural Language API, the AutoML models are
trained on the user's data and therefore fit a specific task. The AutoML service
requires a bit more effort from the user, mainly because the user has to provide a
dataset to train the model.
• The AutoML service covers three use cases. All of these use cases support solely
the English language for now.
1. AutoML Text Classification
2. AutoML Entity Extraction
3. AutoML Sentiment Analysis
Thanks