NLP Unit 03

The document discusses key concepts in Natural Language Processing (NLP), focusing on grammars and parsing techniques such as top-down and bottom-up parsing. It explains the role of parsers in validating sentence structures and introduces morphological analysis, augmented transition networks, and common parsing issues like ambiguity and error propagation. Additionally, it covers the importance of feature systems in capturing linguistic details beyond basic syntax.

Uploaded by

jitendrayadav91001


Natural Language Processing (KOE-088)
Unit - 03 (Grammars And Parsing)

Dr. Abdul Kalam Technical University, Lucknow

Agenda

Grammar, Parser
Basic Methods of Searching
The top-down chart parsing algorithm
Feature System & Augmented Grammar
Morphological Analysis
Issues in Parsing & Various Techniques
Augmented Transition Network

Grammar
In Natural Language Processing (NLP),
grammar refers to a set of rules that define the
structure of sentences in a language. These rules
specify how words can be combined to form
phrases, clauses, and sentences that are
syntactically correct.

Key Points About Grammar in NLP

Structure
Syntax
Formal Representation
Context-Free Grammar (CFG)
Probabilistic Context-Free Grammar (PCFG)
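
As a rough illustration of the last two points, a CFG can be written as a table of rewrite rules, and a PCFG as the same table with a probability on each rule. The rules and the probabilities below are invented for illustration and are not taken from any corpus; `is_proper` and `derivation_probability` are hypothetical helper names.

```python
import math

# A CFG maps each non-terminal to its possible right-hand sides.
CFG = {
    "S": [("NP", "VP")],
    "NP": [("Det", "N"), ("N",)],
    "VP": [("V", "NP"), ("V",)],
}

# A PCFG attaches a probability to every rule (values here are made up).
PCFG = {
    "S": {("NP", "VP"): 1.0},
    "NP": {("Det", "N"): 0.7, ("N",): 0.3},
    "VP": {("V", "NP"): 0.6, ("V",): 0.4},
}


def is_proper(pcfg):
    """Check that the rule probabilities for each non-terminal sum to 1."""
    return all(math.isclose(sum(rules.values()), 1.0) for rules in pcfg.values())


def derivation_probability(pcfg, rules_used):
    """Multiply the probabilities of the rules used in one derivation."""
    probability = 1.0
    for lhs, rhs in rules_used:
        probability *= pcfg[lhs][rhs]
    return probability


derivation = [
    ("S", ("NP", "VP")),
    ("NP", ("Det", "N")),
    ("VP", ("V", "NP")),
    ("NP", ("N",)),
]
print(is_proper(PCFG))
print(derivation_probability(PCFG, derivation))
```

The product of rule probabilities is what lets a PCFG rank competing parses of the same sentence.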
Parser
A parser is a software component that takes input text and
checks it against the rules of a grammar. If the input is valid,
the parser generates a parse tree.

How is it used in NLP?

Grammar Checking
Intermediate Stage of semantic Analysis
Concept of Parser

A parse tree is a graphical representation of a derivation.

The start symbol is the root of the parse tree.

Leaf nodes are terminals.

Interior nodes are non-terminals.

Read from left to right, the leaves of a correct parse tree reproduce the input text.
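
The properties above can be checked on a small tree. The sketch below assumes a nested-tuple encoding (label first, children after) and a toy sentence; both the encoding and the `leaves` helper are illustrative choices, not a standard format.

```python
# Each interior node is (label, child, child, ...); each leaf is a plain string.
TREE = (
    "S",
    ("NP", ("Det", "the"), ("N", "cat")),
    ("VP", ("V", "sat on"), ("NP", ("Det", "the"), ("N", "mat"))),
)


def leaves(node):
    """Collect the terminals of a tree from left to right."""
    if isinstance(node, str):  # a leaf is a terminal
        return [node]
    _label, *children = node  # an interior node is a non-terminal
    words = []
    for child in children:
        words.extend(leaves(child))
    return words


print(TREE[0])                 # root label, i.e. the start symbol
print(" ".join(leaves(TREE)))  # the leaves spell out the input text
```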

Basic Methods of Searching

Top Down Parsing

Bottom Up Parsing
Top Down Parsing

1. It is a parsing strategy that starts at the highest level of the
parse tree and works down the tree by using the rules of the grammar.
2. It attempts to find a leftmost derivation for the input string.
3. Parsing starts from the start symbol at the top and proceeds down to the words.
4. The technique used is leftmost derivation.
5. The main decision is which production rule to use in order to
construct the string.
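
One common way to realize the points above is a recursive-descent recognizer, sketched here under some assumptions: the grammar is a toy fragment, "sat on" is treated as a single token, and the grammar has no left-recursive rules (which would make this approach loop forever). The names `expand` and `accepts` are hypothetical.

```python
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["Det", "N"]],
    "VP": [["V", "NP"]],
    "Det": [["the"]],
    "N": [["cat"], ["mat"]],
    "V": [["sat on"]],
}


def expand(symbol, tokens, pos):
    """Yield every position reachable after deriving `symbol` from tokens[pos:]."""
    if symbol not in GRAMMAR:  # terminal: must match the next token
        if pos < len(tokens) and tokens[pos] == symbol:
            yield pos + 1
        return
    # Non-terminal: try each production in turn (the "which rule?" decision).
    for production in GRAMMAR[symbol]:
        positions = [pos]
        for part in production:  # expand the leftmost symbol first
            positions = [q for p in positions for q in expand(part, tokens, p)]
        yield from positions


def accepts(tokens):
    """True if the start symbol S derives exactly the whole token list."""
    return len(tokens) in expand("S", tokens, 0)


print(accepts(["the", "cat", "sat on", "the", "mat"]))
print(accepts(["cat", "the"]))
```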
Bottom Up Parsing

1. It is a parsing strategy that starts at the lowest level of the
parse tree and works up the tree by using the rules of the grammar.
2. Bottom-up parsing can be defined as an attempt to
reduce the input string to the start symbol of the grammar.
3. Parsing starts from the words at the bottom and proceeds up to the
start symbol.
4. This parsing technique traces a rightmost derivation in reverse.
5. The main decision is when to use a production
rule to reduce the string toward the start symbol.
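
A shift-reduce recognizer is one simple bottom-up scheme. The sketch below is greedy (it reduces whenever the top of the stack matches a rule), which happens to work for this toy grammar but is not a general strategy; the grammar, the lexicon, and the single-token treatment of "sat on" are assumptions for illustration.

```python
RULES = [
    ("S", ("NP", "VP")),
    ("NP", ("Det", "N")),
    ("VP", ("V", "NP")),
]
LEXICON = {"the": "Det", "cat": "N", "mat": "N", "sat on": "V"}


def shift_reduce(tokens):
    """Return True if the tokens can be reduced to the start symbol S."""
    stack = []
    for word in tokens:
        # Shift: push the category of the next word (unknown words stay as-is).
        stack.append(LEXICON.get(word, word))
        # Reduce: replace a matching right-hand side on top of the stack.
        reduced = True
        while reduced:
            reduced = False
            for lhs, rhs in RULES:
                if tuple(stack[-len(rhs):]) == rhs:
                    stack[-len(rhs):] = [lhs]
                    reduced = True
                    break
    return stack == ["S"]


print(shift_reduce(["the", "cat", "sat on", "the", "mat"]))
print(shift_reduce(["the", "cat"]))
```

A practical bottom-up parser needs a policy for choosing between shifting and reducing, which is exactly the "when to reduce" decision in point 5.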
Top Down Chart Parsing Algorithm

Top-down chart parsing is a parsing technique used in Natural Language
Processing (NLP) to analyze the syntactic structure of a sentence.

1. Initialization
2. Prediction
3. Scanning
4. Completion
5. Repetition

Initialization
Start with an empty chart (a table used to store intermediate parsing results).
Initialize the chart with the start symbol of the grammar at the root.
Prediction
For each non-terminal in the chart, use the grammar rules to predict possible
expansions (productions) of that non-terminal.
Add these expansions to the chart.
Scanning

Compare the next word in the input sentence with the terminals in the chart.
If there’s a match, add this information to the chart.

Completion
Once all parts of a rule match the input, mark this rule as completed in the chart.
Use completed rules to complete higher-level rules that depend on them.
Repetition
Repeat the prediction, scanning, and completion steps until the entire input is
parsed or no more expansions are possible.

Example
Consider a simple grammar for a fragment of English:

S → NP VP
NP → Det N
VP → V NP
Det → 'the'
N → 'cat' | 'mat'
V → 'sat on'

Input sentence: "the cat sat on the mat"
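
The five steps can be sketched as an Earley-style recognizer run on this grammar and sentence. This is one possible reading of the algorithm, not a reference implementation: the word rules are folded into a `LEXICON` table, "sat on" is assumed to arrive as a single token, and `State` and `recognize` are hypothetical names.

```python
from collections import namedtuple

# A chart entry: rule lhs -> rhs, how far the dot has advanced, and where it began.
State = namedtuple("State", "lhs rhs dot start")

GRAMMAR = {"S": [("NP", "VP")], "NP": [("Det", "N")], "VP": [("V", "NP")]}
LEXICON = {"the": "Det", "cat": "N", "mat": "N", "sat on": "V"}


def recognize(tokens, start="S"):
    chart = [[] for _ in range(len(tokens) + 1)]

    def add(state, i):
        if state not in chart[i]:
            chart[i].append(state)

    # Initialization: seed the chart with the rules of the start symbol.
    for rhs in GRAMMAR[start]:
        add(State(start, rhs, 0, 0), 0)

    # Repetition: process every position; each list grows while it is scanned.
    for i in range(len(tokens) + 1):
        for state in chart[i]:
            if state.dot < len(state.rhs):
                expected = state.rhs[state.dot]
                if expected in GRAMMAR:
                    # Prediction: add the expansions of the expected non-terminal.
                    for rhs in GRAMMAR[expected]:
                        add(State(expected, rhs, 0, i), i)
                elif i < len(tokens) and LEXICON.get(tokens[i]) == expected:
                    # Scanning: the next word has the expected category.
                    add(state._replace(dot=state.dot + 1), i + 1)
            else:
                # Completion: advance every rule that was waiting for this one.
                for prev in chart[state.start]:
                    if prev.dot < len(prev.rhs) and prev.rhs[prev.dot] == state.lhs:
                        add(prev._replace(dot=prev.dot + 1), i)

    return any(
        s.lhs == start and s.dot == len(s.rhs) and s.start == 0
        for s in chart[len(tokens)]
    )


print(recognize(["the", "cat", "sat on", "the", "mat"]))
```

The sentence is accepted when a completed S rule spanning the whole input appears in the last chart column.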

Feature System

A feature system in Natural Language Processing (NLP) is a way to
represent additional information about words and phrases to
capture linguistic details that go beyond basic syntactic structure.

Features: Attributes or properties of linguistic elements.

Example: In the sentence "The dogs are running":
"dogs" has features: {number: plural}
"are" has features: {tense: present, number: plural}

Purpose: Helps in disambiguating and understanding the finer details of language.
Example: Differentiating between "he" (singular, male) and "they" (plural).

Represents additional linguistic information to capture nuances in language.
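
One way to use such features is an agreement check: two feature sets are compatible if they do not disagree on any shared feature. The sketch below assumes features are flat Python dictionaries; `unify` is a hypothetical helper, and the entry for "is" is added for contrast and does not come from the slide.

```python
def unify(features_a, features_b):
    """Merge two feature sets; return None if a shared feature clashes."""
    merged = dict(features_a)
    for name, value in features_b.items():
        if name in merged and merged[name] != value:
            return None  # e.g. plural subject with singular verb
        merged[name] = value
    return merged


dogs = {"number": "plural"}
are = {"tense": "present", "number": "plural"}
is_ = {"tense": "present", "number": "singular"}  # assumed entry for "is"

print(unify(dogs, are))  # compatible: "The dogs are running"
print(unify(dogs, is_))  # clash on number: "The dogs is running"
```

An augmented grammar can attach a check like this to a rule such as S → NP VP so that the rule applies only when subject and verb agree.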

Morphological Analysis

Morphological analysis involves breaking down
words into their smallest meaningful units, called
morphemes, and understanding how these units
combine to form words.

Key Concepts in Morphological Analysis

Morphemes
Free & Bound
Word Formation
Inflection
Derivation
Compounding
Steps in Morphological Analysis

Identification of Morphemes:

Segmenting Words: Dividing words into their constituent morphemes.
Example: "unhappiness" → "un-" + "happy" + "-ness"
Classifying Morphemes: Determining whether morphemes are
free or bound, and identifying their roles (prefix, suffix, root).

Analyzing Word Structure:

Root Identification: Finding the base morpheme that carries the primary meaning.
Example: In "disapproval", the root is "approve".
Affix Identification: Identifying any prefixes, suffixes, or
infixes that modify the root.
Example: In "disapproval", the prefix is "dis-" and
the suffix is "-al".
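
A minimal affix-stripping sketch of these steps follows. It assumes tiny hand-written lists of prefixes, suffixes, and roots that cover only the two examples above, plus two spelling repairs (restoring a dropped "e" and turning "i" back into "y"); a real analyzer would need a full lexicon and rule set. All names are hypothetical.

```python
PREFIXES = ["un", "dis"]
SUFFIXES = ["ness", "al"]
ROOTS = {"happy", "approve"}


def restore_root(stem):
    """Undo simple spelling changes so the stem can be found among the roots."""
    candidates = [stem, stem + "e"]          # approv -> approve
    if stem.endswith("i"):
        candidates.append(stem[:-1] + "y")   # happi -> happy
    for candidate in candidates:
        if candidate in ROOTS:
            return candidate
    return None


def segment(word):
    """Split a word into prefix, root, and suffix; None if no root is found."""
    morphemes = []
    for prefix in PREFIXES:
        if word.startswith(prefix):
            morphemes.append(prefix + "-")
            word = word[len(prefix):]
            break
    suffix = None
    for candidate in SUFFIXES:
        if word.endswith(candidate):
            suffix = "-" + candidate
            word = word[:-len(candidate)]
            break
    root = restore_root(word)
    if root is None:
        return None
    morphemes.append(root)
    if suffix:
        morphemes.append(suffix)
    return morphemes


print(segment("unhappiness"))
print(segment("disapproval"))
```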

Understanding Morphological Rules:

Inflectional Rules: Rules for adding inflectional morphemes to indicate
grammatical features.
Example: Adding "-s" to form plurals ("cat" → "cats").
Derivational Rules: Rules for adding
derivational morphemes to create new words
or change word classes.
Example: Adding "-ly" to form adverbs ("quick" → "quickly").
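
The two kinds of rule can be sketched as small functions. The spelling adjustments ("-es" after sibilants, "y" to "ies" or "ily") are common English patterns added here for illustration and go beyond the slide's examples; irregular forms are not handled. `pluralize` and `to_adverb` are hypothetical names.

```python
VOWELS = "aeiou"


def pluralize(noun):
    """Inflectional rule: add the plural morpheme to a regular noun."""
    if noun.endswith(("s", "x", "z", "ch", "sh")):
        return noun + "es"
    if noun.endswith("y") and len(noun) > 1 and noun[-2] not in VOWELS:
        return noun[:-1] + "ies"
    return noun + "s"


def to_adverb(adjective):
    """Derivational rule: turn an adjective into an adverb with -ly."""
    if adjective.endswith("y"):
        return adjective[:-1] + "ily"
    return adjective + "ly"


print(pluralize("cat"))
print(to_adverb("quick"))
```

The inflectional rule keeps the word class (noun stays noun), while the derivational rule changes it (adjective becomes adverb).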
Examples of Morphological Analysis

Simple Inflection (dogs): "dog" + "-s"

Derivation (happiness): "happy" + "-ness"

Complex Inflection and Derivation (unbelievably): "un-" + "believe" + "-able" + "-ly"

Compounding (sunflower): "sun" + "flower"
Augmented Transition Network

An Augmented Transition Network (ATN) is a type of computational model used in Natural Language
Processing (NLP) for parsing sentences. An ATN is like a flowchart that processes sentences by moving
through different states according to specific rules, augmented with additional capabilities to handle complex
language.

Key Components

1. States: Points in the network representing different stages in the parsing process.

2. Transitions: Arrows connecting states, representing the rules for moving from one state to another.

3. Registers: Memory slots used to store information during parsing.

4. Tests and Actions: Conditions that must be met to follow a transition and actions to be taken (like
storing or manipulating data in registers).


How It Works

Start State: The initial state where the parsing begins.

End State: The final state representing the completion of parsing.

Arc: A transition that can involve:

Word Tests: Checking if the next word in the input matches specific criteria (e.g., part of speech).

Push: Handling recursive structures by temporarily moving to a subroutine or another part of the
network.

Pop: Returning from a subroutine or another part of the network after processing a nested structure.

Actions: Operations like storing parts of the sentence in registers for later use.
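
The pieces above can be put together in a small sketch. The two networks, the state names, the register names, and the lexicon are all invented for illustration; the only test on a word arc is a part-of-speech check, and the only action is storing the word (or a sub-network's registers) in a register.

```python
LEXICON = {"the": "Det", "cat": "N", "mat": "N", "saw": "V"}

# Arcs are (kind, label, register, next_state); a POP arc marks an end state.
NETWORKS = {
    "S": {
        "S0": [("PUSH", "NP", "subject", "S1")],
        "S1": [("CAT", "V", "verb", "S2")],
        "S2": [("PUSH", "NP", "object", "S3")],
        "S3": [("POP",)],
    },
    "NP": {
        "NP0": [("CAT", "Det", "det", "NP1")],
        "NP1": [("CAT", "N", "head", "NP2")],
        "NP2": [("POP",)],
    },
}
START = {"S": "S0", "NP": "NP0"}


def traverse(net, state, tokens, pos, registers):
    """Follow arcs from `state`; return (registers, position) or None."""
    for arc in NETWORKS[net][state]:
        if arc[0] == "POP":  # end of this network: return to the caller
            return registers, pos
        kind, label, register, next_state = arc
        if kind == "CAT":
            # Word test: does the next word have the required part of speech?
            if pos < len(tokens) and LEXICON.get(tokens[pos]) == label:
                updated = {**registers, register: tokens[pos]}  # action: store
                result = traverse(net, next_state, tokens, pos + 1, updated)
                if result is not None:
                    return result
        elif kind == "PUSH":
            # Push: run the sub-network with fresh registers, then continue.
            sub = traverse(label, START[label], tokens, pos, {})
            if sub is not None:
                sub_registers, new_pos = sub
                updated = {**registers, register: sub_registers}
                result = traverse(net, next_state, tokens, new_pos, updated)
                if result is not None:
                    return result
    return None


def parse(tokens):
    result = traverse("S", START["S"], tokens, 0, {})
    if result is not None and result[1] == len(tokens):
        return result[0]
    return None


print(parse(["the", "cat", "saw", "the", "mat"]))
```

The registers returned at the end hold the sentence's subject, verb, and object, which is the kind of structure a later semantic stage could consume.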
Issues in Parsing

Parsing in NLP involves analyzing the grammatical structure of sentences to derive their syntactic
structure. Several issues can arise during parsing:

1. Ambiguity:

Lexical Ambiguity: A word can have multiple meanings.

Example: "bank" can mean a financial institution or the side of a river.

Syntactic Ambiguity: A sentence can have multiple valid parse trees.

Example: "I saw the man with the telescope" can mean either "I used a telescope to see the
man" or "I saw a man who had a telescope."

2. Complex Sentences:

Sentences with nested or long structures can be difficult to parse accurately.

Example: Sentences with multiple clauses or embedded phrases.


3. Non-Standard Grammar:

Informal or colloquial language, including slang and incomplete sentences, can be
challenging to parse.

Example: "Gonna go now."

4. Error Propagation:

Mistakes in tokenization or part-of-speech tagging can lead to parsing errors.

Example: Misidentifying a word's part of speech can lead to incorrect parse
trees.

5. Incomplete or Noisy Data:

Incomplete sentences or sentences with errors (e.g., typos) can complicate parsing.

Example: "She want to go" instead of "She wants to go."
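
The syntactic ambiguity in issue 1 can be made concrete by counting parse trees. The sketch below uses a CKY-style table, a bottom-up technique that is not described in these slides, with a small hand-written grammar in binary form; the grammar, the lexicon, and the `count_parses` helper are assumptions for illustration.

```python
from collections import defaultdict

# Binary rules as (parent, left child, right child).
BINARY = [
    ("S", "NP", "VP"),
    ("VP", "V", "NP"),
    ("VP", "VP", "PP"),   # the PP modifies the seeing
    ("NP", "Det", "N"),
    ("NP", "NP", "PP"),   # the PP modifies the man
    ("PP", "P", "NP"),
]
LEXICON = {
    "I": ["NP"], "saw": ["V"], "the": ["Det"],
    "man": ["N"], "telescope": ["N"], "with": ["P"],
}


def count_parses(tokens, start="S"):
    """Count the distinct parse trees rooted in `start` over all tokens."""
    n = len(tokens)
    # table[(i, j)][A] = number of trees with root A covering tokens[i:j]
    table = defaultdict(lambda: defaultdict(int))
    for i, word in enumerate(tokens):
        for category in LEXICON.get(word, []):
            table[(i, i + 1)][category] += 1
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):  # every split point of the span
                for parent, left, right in BINARY:
                    table[(i, j)][parent] += (
                        table[(i, k)][left] * table[(k, j)][right]
                    )
    return table[(0, n)][start]


print(count_parses("I saw the man with the telescope".split()))
print(count_parses("I saw the man".split()))
```

Under this grammar the first sentence has two trees, one for each reading given in the example, while the shorter sentence has one.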

Thank you!
Do you have any questions?
