0% found this document useful (0 votes)

35 views10 pages

Lab 06 - Parse Tree Tutorial

lab 6

Uploaded by

Don Pablo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views10 pages

Lab 06 - Parse Tree Tutorial

lab 6

Uploaded by

Don Pablo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

Lab 6: CFG & Parse Tree

References:
1. Natural Language Processing with Python, by Steven Bird, Ewan Klein and Edward Loper,
2014.

QUICK REVIEW

CFG has been the most influential grammar formalism for describing language syntax. This is not
because CFG has been generally adopted as such for linguistic description, but rather because most
grammar formalisms are derived from or can somehow be related to CFG. For this reason, CFG is
often used as a base formalism when parsing algorithms are described.
The standard way to represent the syntactic structure of a grammatical sentence is as a syntax tree,
or a parse tree, which is a representation of all the steps in the derivation of the sentence from the
root node. This means that each internal node in the tree represents an application of a grammar
rule.

PRACTICES

Parse Tree 01

import nltk

text2 = nltk.CFG.fromstring("""
S -> NP VP
PP -> P NP
NP -> Det N | PP NP | Det N PP | 'I'
VP -> V NP | VP PP | V
Det -> 'a'
N -> 'book'
V -> 'write'
""")
text1 = nltk.tokenize.word_tokenize("I write a book")
print(text1)
parser = nltk.ChartParser(text2)
for tree in parser.parse(text1):
print(tree)
tree.draw()

Output

Level 3 Asia Pacific University (APU) Page 1 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

['I', 'write', 'a', 'book']

(S (NP I) (VP (V write) (NP (Det a) (N book))))
Parse Tree 02

sent = ['I', 'shot', 'an', 'elephant', 'in', 'my', 'pajamas']

parser = nltk.ChartParser(groucho_grammar)
for tree in parser.parse(sent):
tree.draw()
print(tree)

Parse Tree 03

import nltk

Level 3 Asia Pacific University (APU) Page 2 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

text2 = nltk.CFG.fromstring("""
S -> NP VP
PP -> P NP
NP -> Det N | PP NP | Det N PP
VP -> V NP | VP PP | V
N -> 'Alice' | 'Bob'
V -> 'loves'
Det ->
P ->
""")
text1 = nltk.tokenize.word_tokenize("Alice loves Bob")
print(text1)
print()
parser = nltk.ChartParser(text2)
for tree in parser.parse(text1):
print(tree)
tree.draw()

Parse Tree 04 – Adjective Phrase

The little bear saw the fine fat trout in the brook

Clue:

Level 3 Asia Pacific University (APU) Page 3 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

NP  DT Nom
Nom  Adj N | Adj Adj N

text1 = nltk.tokenize.word_tokenize("the little bear saw the fine

fat trout in the brook")
print(text1)
print()
parser = nltk.ChartParser(text2)
for tree1 in parser.parse(text1):
tree1.draw()
print(tree1)

Level 3 Asia Pacific University (APU) Page 4 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

Parse Tree 05 – Adjective Phrase

sent = ['the', 'angry', 'bear', 'chased', 'the', 'frightened',

'little', 'squirrel']
parser = nltk.ChartParser(grammar2)
for tree in parser.parse(sent):
tree.draw()
print(tree)

Level 3 Asia Pacific University (APU) Page 5 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

Parse Tree 06 – Adverb Phrases (AdvP)

E.g.: Ken snores very loudly

import nltk

sentence = "Ken snores very loudly"

gram = nltk.CFG.fromstring("""
S -> NP VP
NP -> N
VP -> V ADV
N -> 'Ken'
V -> 'snores'
DEG -> 'very'
ADV -> DEG ADV | 'loudly'
""")

Level 3 Asia Pacific University (APU) Page 6 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

token = nltk.tokenize.word_tokenize(sentence)
print(token)
parser = nltk.ChartParser(gram)
for tree in parser.parse(token):
print(tree)
tree.draw()

import nltk
from nltk.tokenize import word_tokenize

sents = [
"unfortunately the cat killed the mouse",
"the cat unfortunately killed the mouse",
"the cat killed the mouse unfortunately"
]

grammar = nltk.CFG.fromstring("""
S -> ADV NP VP | NP VP
NP -> DT N
VP -> ADV VP | VP ADV | V NP
DT -> 'the'
N -> 'cat' | 'mouse'

Level 3 Asia Pacific University (APU) Page 7 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

V -> 'killed'
ADV -> 'unfortunately'
""")

parser = nltk.ChartParser(grammar)

for sent in sents:

print(sent)
for tree in parser.parse(word_tokenize(sent)):
tree.draw()
print(tree)

Unfortunately the cat killed the mouse

The cat unfortunately killed the mouse

Level 3 Asia Pacific University (APU) Page 8 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

The cat killed the mouse unfortunately

Draw Parse Tree using COLAB

Run the following set of codes to set the COLAB platform

import nltk
nltk.download('punkt')

### CREATE VIRTUAL DISPLAY ###

!apt-get install -y xvfb # Install X Virtual Frame Buffer
import os
os.system('Xvfb :1 -screen 0 1600x1200x16 &') # create virtual display w
ith size 1600x1200 and 16 bit color. Color can be changed to 24 or 8
os.environ['DISPLAY']=':1.0' # tell X clients to use our virtual DISPLAY
:1.0.

%matplotlib inline

### INSTALL GHOSTSCRIPT (Required to display NLTK trees) ###

!apt install ghostscript python3-tk

Example Program ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

import nltk
from IPython.display import display

Level 3 Asia Pacific University (APU) Page 9 of 10

CT107-3-3-TXSA - Text Analytics and Sentiment Analysis Parse Tree

""")
text1 = nltk.tokenize.word_tokenize("I write a book")
print(text1)
parser = nltk.ChartParser(text2)
for tree in parser.parse(text1):
display(tree) # tree.draw()
# print(tree)

Level 3 Asia Pacific University (APU) Page 10 of

Stuttering Cheat Sheets
No ratings yet
Stuttering Cheat Sheets
3 pages
4.chapter5 - Syntactic and Semantic Representations
No ratings yet
4.chapter5 - Syntactic and Semantic Representations
47 pages
NLP Unit 2
No ratings yet
NLP Unit 2
20 pages
Machine 22
No ratings yet
Machine 22
5 pages
Lecture 2
No ratings yet
Lecture 2
28 pages
I041 - NLP - Assignment1.ipynb - Colaboratory
No ratings yet
I041 - NLP - Assignment1.ipynb - Colaboratory
11 pages
1 Motivation: Setting Up To Use Pstone
No ratings yet
1 Motivation: Setting Up To Use Pstone
9 pages
CH 08
No ratings yet
CH 08
31 pages
NLP Chapter 3
No ratings yet
NLP Chapter 3
23 pages
Mod - 3
No ratings yet
Mod - 3
51 pages
Sample Program Using Python 3
No ratings yet
Sample Program Using Python 3
5 pages
SPR Solution Ex 08 CFG
No ratings yet
SPR Solution Ex 08 CFG
6 pages
14 Syntax 1
No ratings yet
14 Syntax 1
22 pages
CS6120 35650 - Spring2025 - Assignment - 2-1
No ratings yet
CS6120 35650 - Spring2025 - Assignment - 2-1
5 pages
Constituency Parsing PPT 2
No ratings yet
Constituency Parsing PPT 2
33 pages
Week 3 - Probablistic Context Free Grammars
No ratings yet
Week 3 - Probablistic Context Free Grammars
18 pages
Module No. 3: Parsing Structure in Text
No ratings yet
Module No. 3: Parsing Structure in Text
54 pages
Luận Văn Lexicalized Statistical Parsing for Vietnamese
No ratings yet
Luận Văn Lexicalized Statistical Parsing for Vietnamese
16 pages
NLP Unit-Ii
No ratings yet
NLP Unit-Ii
71 pages
NLP M3 SPP
No ratings yet
NLP M3 SPP
53 pages
What Is Parsing
No ratings yet
What Is Parsing
47 pages
423/723 Natural Language Processing: Assignment 1
No ratings yet
423/723 Natural Language Processing: Assignment 1
4 pages
NLP Programming
No ratings yet
NLP Programming
39 pages
NLP Unit-Ii
No ratings yet
NLP Unit-Ii
42 pages
NLP Module 3
No ratings yet
NLP Module 3
41 pages
13-Dependency Grammar-03-09-2024
No ratings yet
13-Dependency Grammar-03-09-2024
31 pages
Unit 3
No ratings yet
Unit 3
19 pages
Natural Language Processing Unit 3
No ratings yet
Natural Language Processing Unit 3
55 pages
2024 CD-Ch03 Syntaxx Analysis
No ratings yet
2024 CD-Ch03 Syntaxx Analysis
28 pages
Unit 2 - Lecture 1
No ratings yet
Unit 2 - Lecture 1
19 pages
Natural Language Processing: Parsing
No ratings yet
Natural Language Processing: Parsing
18 pages
NLP Unit-2
No ratings yet
NLP Unit-2
18 pages
Longsem2024-25 Cse3015 Eth Ap2024256000125 Reference-material-III
No ratings yet
Longsem2024-25 Cse3015 Eth Ap2024256000125 Reference-material-III
89 pages
NLTK Cheatsheet
No ratings yet
NLTK Cheatsheet
27 pages
CH4 Syntax Directed Analysis
No ratings yet
CH4 Syntax Directed Analysis
39 pages
Introduction To PEG (Parsing Expression Grammar) in Python
50% (2)
Introduction To PEG (Parsing Expression Grammar) in Python
71 pages
Module 3 NLP
No ratings yet
Module 3 NLP
32 pages
Shubham Jade MSC It 31031420010 NLP Practical Journal
No ratings yet
Shubham Jade MSC It 31031420010 NLP Practical Journal
17 pages
Lecture 07
No ratings yet
Lecture 07
35 pages
CCS369-Text and Speech Analysis Lab (1-9)
No ratings yet
CCS369-Text and Speech Analysis Lab (1-9)
37 pages
Natural Language Toolkit NLTK PDF
No ratings yet
Natural Language Toolkit NLTK PDF
23 pages
Gabbar 2025 Update
No ratings yet
Gabbar 2025 Update
15 pages
Chapter 3 (Part 1)
No ratings yet
Chapter 3 (Part 1)
33 pages
Sem:U: Btecht in
No ratings yet
Sem:U: Btecht in
5 pages
Background
No ratings yet
Background
18 pages
NLP Unit 2
No ratings yet
NLP Unit 2
13 pages
NLTK: The Natural Language Toolkit: Steven Bird Edward Loper
No ratings yet
NLTK: The Natural Language Toolkit: Steven Bird Edward Loper
4 pages
03LexicalAndSyntaxAnalysis 1
No ratings yet
03LexicalAndSyntaxAnalysis 1
25 pages
NLP Unit-Ii
No ratings yet
NLP Unit-Ii
45 pages
Compiler Design Notes, IIT Delhi
No ratings yet
Compiler Design Notes, IIT Delhi
147 pages
NLP Unit 2
No ratings yet
NLP Unit 2
20 pages
Unit-2 F&CD
No ratings yet
Unit-2 F&CD
31 pages
Chapter 02
No ratings yet
Chapter 02
67 pages
Lecture15 Parsing
No ratings yet
Lecture15 Parsing
37 pages
UNIT 2
No ratings yet
UNIT 2
53 pages
Lesson Outline, Practice Session, November'21 Class-KG: Subject: Bangla Teacher: Rowshon Ara Khandoker
No ratings yet
Lesson Outline, Practice Session, November'21 Class-KG: Subject: Bangla Teacher: Rowshon Ara Khandoker
2 pages
ELT Secondary
No ratings yet
ELT Secondary
27 pages
English Grammar
No ratings yet
English Grammar
1 page
Authentic Assessment #1
100% (1)
Authentic Assessment #1
2 pages
Strategies Applied by Ngoc Thu Lang in English-Vietnamese Translation of Slang in "The Godfather"
100% (5)
Strategies Applied by Ngoc Thu Lang in English-Vietnamese Translation of Slang in "The Godfather"
82 pages
Functional Styles of The English Language
No ratings yet
Functional Styles of The English Language
7 pages
Assignment 11
No ratings yet
Assignment 11
3 pages
ELT Notes
No ratings yet
ELT Notes
8 pages
Lesson 2 Communication and Globalization
No ratings yet
Lesson 2 Communication and Globalization
13 pages
ESP For Tourism
No ratings yet
ESP For Tourism
34 pages
Hallet Whats A Digital Classroom-1
No ratings yet
Hallet Whats A Digital Classroom-1
6 pages
Unit Vocabula RY Grammar Reading Listening Speaking Culture Clil Writing
No ratings yet
Unit Vocabula RY Grammar Reading Listening Speaking Culture Clil Writing
6 pages
Project Proposal For SEO Service PDF
No ratings yet
Project Proposal For SEO Service PDF
14 pages
50 Graphic Organizers Compress
No ratings yet
50 Graphic Organizers Compress
113 pages
Gpcom-Module 1 - Unit 2
No ratings yet
Gpcom-Module 1 - Unit 2
2 pages
Reading Response For Ann Johns
No ratings yet
Reading Response For Ann Johns
4 pages
INGLES PPT Semana 10
No ratings yet
INGLES PPT Semana 10
12 pages
Quarter 3 Module 3 Reading and Writing
No ratings yet
Quarter 3 Module 3 Reading and Writing
10 pages
B1 Self-Assessment Units 3,4
No ratings yet
B1 Self-Assessment Units 3,4
21 pages
13 - Chapter 5 PDF
No ratings yet
13 - Chapter 5 PDF
47 pages
ҚМЖ Фатима 10
No ratings yet
ҚМЖ Фатима 10
1 page
Iep Impact Examples 3
100% (2)
Iep Impact Examples 3
3 pages
SLMR FlegeBohn 2021
No ratings yet
SLMR FlegeBohn 2021
42 pages
Full Easy English Fluency
No ratings yet
Full Easy English Fluency
20 pages
Barriers
67% (3)
Barriers
4 pages
Rhythm in Translation, With Two Accounts of Leconte de Lisle's Midi'
No ratings yet
Rhythm in Translation, With Two Accounts of Leconte de Lisle's Midi'
15 pages
RPH Bi Year 2 Usm 2 Utm (Week 13)
No ratings yet
RPH Bi Year 2 Usm 2 Utm (Week 13)
13 pages
Basic 3 Term 1 Scheme
No ratings yet
Basic 3 Term 1 Scheme
188 pages
Why Learn
No ratings yet
Why Learn
2 pages