
Practice Questions

Q. Convert the following Context Free Grammar (CFG) into Chomsky Normal Form (CNF)

S -> a B B B | b A A A
A -> a | A s | b B B
B -> b | b S | A a a
where S is the start symbol, A and B are non-terminal symbols, and a, b, and s are terminal symbols.
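A hedged sketch of the standard first steps (the pre-terminals Ta, Tb, Ts and the symbols X1, X2, X3 are fresh names introduced here for illustration): first replace terminals that appear in rules of length two or more with pre-terminals (TERM), then binarize rules longer than two symbols (BIN). Applied to the first rule:

Ta -> a
Tb -> b
S -> Ta X1     (from S -> a B B B)
X1 -> B X2
X2 -> B B

The remaining rules follow the same pattern, e.g. A -> A s becomes A -> A Ts with Ts -> s, and B -> A a a becomes B -> A X3 with X3 -> Ta Ta.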

Q. Apply the CYK parsing algorithm to generate the parsing table for the input sentence “the pilot flew the plane to Delhi” using the given grammar in CNF.
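The CNF grammar for this question is not reproduced above, so the following is only a minimal CYK recognizer sketch in Python; the toy rules and lexicon are assumptions, not the question's grammar, and the triple loop is the table-filling the question asks you to carry out by hand.

from itertools import product

# Hypothetical CNF grammar: binary rules map a pair of child categories
# to the set of parent categories; lexical rules map words to categories.
binary = {("NP", "VP"): {"S"}, ("Det", "N"): {"NP"}, ("V", "NP"): {"VP"}}
lexical = {"the": {"Det"}, "pilot": {"N"}, "plane": {"N"}, "flew": {"V"}}

def cyk(words):
    n = len(words)
    # table[i][j] holds every non-terminal that derives words[i..j]
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, w in enumerate(words):
        table[i][i] = set(lexical.get(w, set()))
    for span in range(2, n + 1):          # width of the cell being filled
        for i in range(n - span + 1):     # left edge of the span
            j = i + span - 1              # right edge of the span
            for k in range(i, j):         # split point
                for b, c in product(table[i][k], table[k + 1][j]):
                    table[i][j] |= binary.get((b, c), set())
    return "S" in table[0][n - 1]

print(cyk("the pilot flew the plane".split()))  # True under this toy grammar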

Q. Consider the sentence “Sing loudly to Dance joyfully” that needs to be tagged using a Hidden Markov Model
(HMM). Two stochastic probability matrices are given: matrix A for state transitions and matrix B for emission
probabilities, with two states being Adjective (Adj) and Adverb (Adv).

The state transition matrix A and the emission probability matrix B are provided as follows:
Matrix A (State Transition Probabilities):
From \ To    Adj    Adv
Adj          0.6    0.4
Adv          0.3    0.7
Matrix B (Emission Probabilities):
State \ Obs.  Adj    Adv
Adj           0.5    0.5
Adv           0.4    0.6

1. Draw the HMM transition model with transition probabilities Aij and emission probabilities Bij for the
two ambiguous words of the sentence W1=Sing and W2=Dance as states of the model.

2. Determine the state (Adj or Adv) if the observed output is “Adverb”, and calculate the corresponding probability for W1=Sing and W2=Dance.

3. If the sequence of observations or output states is “Adv-Adj-Adv”, calculate the probability of each possible case for the word sequence and write the most likely word sequence (a worked sketch follows this question).
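For parts 2 and 3, the standard tool is the Viterbi algorithm (brute-force enumeration of all state sequences gives the same numbers on a problem this small). Below is a minimal sketch using the matrices A and B above; the uniform initial distribution pi is an assumption, since the question does not specify one.

import numpy as np

states = ["Adj", "Adv"]
A = np.array([[0.6, 0.4],   # row i, column j: P(next state j | current state i)
              [0.3, 0.7]])
B = np.array([[0.5, 0.5],   # row i, column o: P(observation o | state i)
              [0.4, 0.6]])
pi = np.array([0.5, 0.5])   # assumed uniform initial distribution

def viterbi(obs):
    # obs is a list of column indices into B (0 = first symbol, 1 = second)
    v = pi * B[:, obs[0]]                    # best score ending in each state
    back = []
    for o in obs[1:]:
        scores = v[:, None] * A * B[:, o][None, :]
        back.append(scores.argmax(axis=0))   # best predecessor for each state
        v = scores.max(axis=0)
    path = [int(v.argmax())]                 # trace the best path backwards
    for ptr in reversed(back):
        path.append(int(ptr[path[-1]]))
    path.reverse()
    return [states[i] for i in path], float(v.max())

print(viterbi([1, 0, 1]))   # observation sequence Adv-Adj-Adv for part 3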

Q. Write notes on the following:

• Morphological ambiguity
• Labeled Bracketing
• Bag of Words
• Transition-based Discourse Analysis
Q. Explain the difference between Tokenization and Segmentation in the context of Natural Language
Processing. Provide a suitable example to illustrate these concepts.
Q. What are the key differences between Syntax and Semantics in language processing? Give an example that
highlights these differences.
Q. Compare and contrast Part-of-Speech Tagging and Named Entity Recognition in NLP. Use examples
to demonstrate how these processes differ in handling text data.
Q. What is the distinction between a Corpus and a Lexicon in linguistic studies? Illustrate your answer with an
example of how each would be used in language analysis.

Q. The following shows a simple context-free grammar (CFG) for a fragment of English.

Show the parse tree for the sentence “the dog is angry at the cat”.

Q. Show all possible parse trees for the sentence "bronze pots clatter" and calculate the probability
of each tree using PCFG.
S -> Noun VP [0.5]
S -> NP Verb [0.5]
VP -> Verb Noun [1.0]
NP -> Adj Noun [1.0]
Adj -> "bronze" [1.0]
Noun -> "bronze" [0.4]
Noun -> "pots" [0.3]
Noun -> "clatter" [0.3]
Verb -> "bronze" [0.3]
Verb -> "pots" [0.5]
Verb -> "clatter" [0.2]
Answer
Parse Tree 1:
S -> Noun VP
Noun -> "bronze"
VP -> Verb Noun
Verb -> "pots"
Noun -> "clatter"

Probability Calculation:

P(T1) = P(S→Noun VP) × P(Noun→"bronze") × P(VP→Verb Noun) × P(Verb→"pots") × P(Noun→"clatter")

P(T1) = 0.5 × 0.4 × 1.0 × 0.5 × 0.3 = 0.03

Parse Tree 2:
S -> NP Verb
NP -> Adj Noun
Adj -> "bronze"
Noun -> "pots"
Verb -> "clatter"
Probability Calculation:
P(T2)=P(S→NP Verb)×P(NP→Adj Noun)×P(Adj→"bronze")×P(Noun→"pots")×P(Verb→"clatter")
P(T2)=0.5×1.0×1.0×0.3×0.2
P(T2)=0.03
Both parse trees T1 and T2 are possible for the sentence "bronze pots clatter" with the provided
PCFG, and both have the same probability of 0.03. This illustrates how natural language sentences
can have multiple valid parses, and how PCFGs can be used to calculate the probability of each
parse, which can be useful in choosing the most likely parse in natural language processing
applications.
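The calculation above can be checked mechanically by multiplying rule probabilities; the dictionary below is just one convenient encoding of the question's grammar.

rules = {
    ("S", "Noun VP"): 0.5, ("S", "NP Verb"): 0.5,
    ("VP", "Verb Noun"): 1.0, ("NP", "Adj Noun"): 1.0,
    ("Adj", "bronze"): 1.0, ("Noun", "bronze"): 0.4,
    ("Noun", "pots"): 0.3, ("Noun", "clatter"): 0.3,
    ("Verb", "pots"): 0.5, ("Verb", "clatter"): 0.2,
}

def tree_prob(derivation):
    # The probability of a tree is the product of its rule probabilities.
    p = 1.0
    for step in derivation:
        p *= rules[step]
    return p

t1 = [("S", "Noun VP"), ("Noun", "bronze"), ("VP", "Verb Noun"),
      ("Verb", "pots"), ("Noun", "clatter")]
t2 = [("S", "NP Verb"), ("NP", "Adj Noun"), ("Adj", "bronze"),
      ("Noun", "pots"), ("Verb", "clatter")]
print(tree_prob(t1), tree_prob(t2))   # 0.03 0.03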

Q. Given the following PCFG rules, show all possible parse trees for the sentence "red birds sing"
and calculate the probability of each tree.
S -> Noun VP [0.6]
S -> NP Verb [0.4]
VP -> Verb Noun [0.9]
VP -> Verb Adj [0.1]
NP -> Adj Noun [1.0]
Adj -> "red" [1.0]
Noun -> "red" [0.2]
Noun -> "birds" [0.5]
Noun -> "sing" [0.3]
Verb -> "red" [0.1]
Verb -> "birds" [0.2]
Verb -> "sing" [0.7]

Q. Using the PCFG provided, construct all possible parse trees for the sentence "green fish swim"
and compute their respective probabilities.

S -> NP VP [0.7]
S -> Noun VP [0.3]
VP -> Verb NP [0.8]
VP -> Verb Noun [0.2]
NP -> Adj Noun [1.0]
Adj -> "green" [1.0]
Noun -> "green" [0.3]
Noun -> "fish" [0.4]
Noun -> "swim" [0.3]
Verb -> "green" [0.2]
Verb -> "fish" [0.3]
Verb -> "swim" [0.5]

Q. Create a unigram and bigram word model for the following corpus:
Have fun on the school trip.
Ask the teacher if you have any problem.
I have fun just looking around.
(i) Predict the probability of occurrence of the next word ‘any’ after the given word ‘have’.
(ii) Predict the probability of occurrence of the next word ‘fun’ after the given word ‘have’.
(iii) Predict the probability of sentence ‘Ask the teacher if you have any problem’ considering Bigram.
(iv) Predict the probability of sentence ‘Ask the teacher if you have any problem’ considering Trigram.
Answer

Unigram model
The unigram model considers each word independently.

To calculate the probability of a word, we use:


P(word) = Count(word) / Total number of words

Total number of words in the corpus = 20
Count("have") = 2 (counting the lowercase form only; "Have" at the start of the first sentence is treated as a distinct token)
Count("any") = 1
Count("fun") = 2

P("have") = 2/20 = 1/10
P("any") = 1/20
P("fun") = 2/20 = 1/10

Bigram Model
The bigram model considers a pair of words. To calculate the probability of a word given the previous
word, we use:
P(word2 | word1) = Count(word1 word2) / Count(word1)

Count("have any") = 1
Count("have fun") = 1 (again case-sensitive: only the third sentence contains "have fun"; "Have fun" in the first sentence is a distinct bigram)

P("any" | "have") = 1/2
P("fun" | "have") = 1/2

(i) The probability of occurrence of the next word ‘any’ after the given word ‘have’:
P("any" | "have") = 0.5
(ii) The probability of occurrence of the next word ‘fun’ after the given word ‘have’:
P("fun" | "have") = 0.5
(iii) Predict the probability of the sentence ‘Ask the teacher if you have any problem’ considering Bigram:
P(Sentence | Bigram) = P(Ask | START) × P(the | Ask) × P(teacher | the) × P(if | teacher) × P(you | if) × P(have | you) × P(any | have) × P(problem | any) × P(END | problem)
P(Ask | START) = Count(START Ask) / Count(START) = 1/3 (since "Ask" is the starting word in one of the three sentences)
P(the | Ask) = Count(Ask the) / Count(Ask) = 1/1
P(teacher | the) = Count(the teacher) / Count(the) = 1/2
P(if | teacher) = Count(teacher if) / Count(teacher) = 1/1
P(you | if) = Count(if you) / Count(if) = 1/1
P(have | you) = Count(you have) / Count(you) = 1/1
P(any | have) = Count(have any) / Count(have) = 1/2
P(problem | any) = Count(any problem) / Count(any) = 1/1
P(END | problem) = Count(problem END) / Count(problem) = 1/1 (assuming every sentence ending with "problem" is followed by an END token)

P(Sentence | Bigram) = (1/3) × 1 × (1/2) × 1 × 1 × 1 × (1/2) × 1 × 1
P(Sentence | Bigram) = 1/12
So, the probability of the sentence "Ask the teacher if you have any problem" under the bigram
model and given the provided corpus is 1/12 or approximately 0.0833.

(iv) To calculate the probability of this sentence using the trigram model, we need the probability of each word given the two previous words, i.e., P(word_n | word_{n-2}, word_{n-1}). Every trigram in this sentence occurs exactly once in the corpus, and each two-word history also occurs exactly once, so each conditional probability is 1 (ignoring sentence-boundary tokens; with the START convention used above, the first factor would instead be P(Ask | START, START) = 1/3):

P(Sentence | Trigram) = P(teacher | Ask the) × P(if | the teacher) × P(you | teacher if) × P(have | if you) × P(any | you have) × P(problem | have any) = 1 × 1 × 1 × 1 × 1 × 1
P(Sentence | Trigram) = 1
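The bigram numbers above can be reproduced with a few lines of Python. The <s>/</s> markers play the role of START/END, and tokens are kept case-sensitive to match the counting convention used in this answer (that convention is an assumption of this sketch).

from collections import Counter

corpus = [
    "Have fun on the school trip",
    "Ask the teacher if you have any problem",
    "I have fun just looking around",
]

unigrams, bigrams = Counter(), Counter()
for sentence in corpus:
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    unigrams.update(tokens)
    bigrams.update(zip(tokens, tokens[1:]))

def p(w2, w1):
    # P(w2 | w1) = Count(w1 w2) / Count(w1)
    return bigrams[(w1, w2)] / unigrams[w1]

print(p("any", "have"), p("fun", "have"))   # 0.5 0.5

sent = ["<s>"] + "Ask the teacher if you have any problem".split() + ["</s>"]
prob = 1.0
for w1, w2 in zip(sent, sent[1:]):
    prob *= p(w2, w1)
print(prob)   # 0.0833... = 1/12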

Q. Questions on N-Gram Model

Corpus:
Time flies like an arrow.
Fruit flies like a banana.
She likes to have fruit for breakfast.

Question 1:
Create a unigram model for the above corpus and use it to:
(i) Predict the probability of occurrence of the word 'like'.
(ii) Predict the probability of occurrence of the word 'flies'.
Question 2:
Create a bigram model for the same corpus and use it to:
(i) Predict the probability of the next word being 'flies' given the current word is 'Time'.
(ii) Predict the probability of the next word being 'an' given the current word is 'like'.
Question 3:
Using the bigram model, predict the probability of the sentence 'Fruit flies like a banana'
occurring in the corpus.
Question 4:
Create a trigram model for the corpus and use it to:
(i) Predict the probability of the next word being 'arrow' given the previous two words are
'Time flies'.
(ii) Predict the probability of the next word being 'for' given the previous two words are
'to have'.
Question 5:
Using the trigram model, predict the probability of the sentence 'She likes to have fruit for
breakfast' occurring in the corpus.

Instructions for Solving Q1-Q5:

• Unigram Model: count the frequency of each word and divide by the total number of words in the corpus.
• Bigram Model: count the frequency of each pair of consecutive words (bigram) and divide by the frequency of the first word in the pair.
• Trigram Model: similar to the bigram model, but for sequences of three words (a short sketch follows this list).
• Sentence Probability Calculation:
  For bigrams: multiply the probabilities of each bigram in the sentence.
  For trigrams: multiply the probabilities of each trigram in the sentence.
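As a starting sketch for the trigram parts (no START/END padding is used here, which is a simplifying assumption; the probe below checks a trigram that is not one of the exercise's own queries):

from collections import Counter

corpus = [
    "Time flies like an arrow",
    "Fruit flies like a banana",
    "She likes to have fruit for breakfast",
]

bigrams, trigrams = Counter(), Counter()
for sentence in corpus:
    t = sentence.split()
    bigrams.update(zip(t, t[1:]))
    trigrams.update(zip(t, t[1:], t[2:]))

def p(w3, w1, w2):
    # P(w3 | w1 w2) = Count(w1 w2 w3) / Count(w1 w2)
    return trigrams[(w1, w2, w3)] / bigrams[(w1, w2)]

print(p("like", "Time", "flies"))   # 1.0: "Time flies" is always followed by "like"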
