0% found this document useful (0 votes)

362 views10 pages

Practice Exam and Solution For Natural Language Processing

This document provides instructions and information for a practice midterm exam in natural language processing. It states that the practice test is designed to take longer than the actual exam, which will be 1 hour and 15 minutes. It allows the use of open notes, textbooks, and online resources during the exam and provides examples of doctors' names to identify in a later question.

Uploaded by

Saumya Rai Parkash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

362 views10 pages

Practice Exam and Solution For Natural Language Processing

Uploaded by

Saumya Rai Parkash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Practice Midterm Exam for Natural Language Processing

Name:
Net ID

Instructions
In the actual midterm there will be 7 questions, each will be worth 15 points. You also get 10 point for signing
your name on all test materials, seriously, because when students forget to sign their names, I have to somehow
figure out whose test a particular piece of paper belongs to. The maximum score on the test will be 115. You
will have approximately 1:15 minutes to complete this test.
This practice test will have a different number of problems that are intended to be of the same basic type
of question as on the actual midterm. THE PRACTICE TEST IS DESIGNED TO TAKE LONGER TO
COMPLETE THAN THE ACTUAL TEST WOULD (AROUND 2 HOURS, RATHER THAN 1:15).
The test materials will include this printout and one blank test booklet. I suggest that you fill in all answers
directly on this printout and use the blank test booklet as scrap paper. However, if you run out of space, you
have the option of using the test booklet. If you do this, please include a clear note on the test so I know where
to look for your answer.
This test is an open book/open notes test: Please feel free to bring your text book, your notes, copies of class
lectures and other reading material to the test. A calculator is also permitted and it is OK to look at materials
on the web in order to read helpful information, being mindful of the time limit. Just don’t use a program that
solves a problem for you, e.g., do not find a part of speech tagger and run it if asked to manually annotate
mark parts of speech – that WOULD be cheating.

Answer all questions on the test. If you show your work and you make a simple arithmetic mistake, but
it is clear you knew how to do it, you will get partial credit.
• William R. Breakey M.D.

• Pamela J. Fischer M.D.

• Leighton E. Cluff M.D.

• James S. Thompson, M.D.

• C.M. Franklin, M.D.

• Atul Gawande, M.D.

• Dr. Talcott

• Dr. J. Gordon Melton

• Dr. Etienne-Emile Baulieu

• Dr. Karl Thomae

• Dr. Alan D. Lourie

• Dr. Xiaotong Fei

• Doctor Dre

• Doctor Dolittle

• Doctor William Archibald Spooner

• Doctor No

Figure 1: Correct Instances of Doctors in Our Corpus

Question 1. Write a regular expression for identifying names of doctors in text. Your regular expression
should match the examples in figure 1, but should not recognize either non-names (words lacking capital
letters) or names that do not include the identifying title information (Dr., Doctor, M.D.). Do your best to
include information about spaces, hyphens, commas and periods, as per the examples.

((Doctor|Dr\.)( [A-Z][a-z\.]+)+)|(([A-Z][a-z\.]+ )+M\.D\.)

Tag Description Tag Description
CC Coordinating conjunction RB Adverb
CD Cardinal number RBR Adverb, comparative
DT Determiner RBS Adverb, superlative
EX Existential there RP Particle
FW Foreign word SYM Symbol
IN Preposition or subordinating conjunction TO to
JJ Adjective UH Interjection
JJR Adjective, comparative VB Verb, base form
JJS Adjective, superlative VBD Verb, past tense
LS List item marker VBG Verb, gerund or present participle
MD Modal VBN Verb, past participle
NN Noun, singular or mass VBP Verb, non-3rd person singular present
NNS Noun, plural VBZ Verb, 3rd person singular present
NNP Proper noun, singular WDT Wh-determiner
NNPS Proper noun, plural WP Wh-pronoun
PDT Predeterminer WP$ Possessive wh-pronoun
POS Possessive ending WRB Wh-adverb
PRP Personal pronoun PU Punctuation
PRP$ Possessive pronoun

Table 1: Penn Treebank POS tags

Question 2. Assign Penn parts of speech tags (as per Table 1) to all the words in the following two sentences
using the notation word/POS:

a. John/NNP and/CC Mary/NNP bought/VBD a/DT refrigerator/NN with/IN three/CD doors/NNS ./PU

b. It/PRP was/VBD purchased/VBN from/IN a/DT very/RB small/JJ store/NN near/IN their/PRP$ house/NN
./PU

Question 3. Mark the noun groups in the following sentence using BIO (beginning, intermediate, other) tags.

Mary B
has O
a B
room I
with O
a B
view I
and O
a B
bottle I
of O
beer B
S

NP VP

NP NP VBD NP

NNP CC NNP DT NN PP
bought

John and Mary a refrigerator IN NP

with CD NNS

three doors

Figure 2: Possible Answer to Question 4

Question 4. Draw a Phrase Structure Tree representing one parse of the following sentence. Make a list of
the phrase structure rules that you are assuming.
John and Mary bought a refrigerator with three doors .

1. S → N P V P 9. N N P → M ary

2. N P → N P CC N P 10. N N → ref rigerator

3. N P → DT N N P P 11. N N S → doors

4. N P → CD N N S 12. CC → and

13. V BD → bought
5. N P → N N P
14. DT → a
6. V P → V BD N P
15. CD → three
7. P P → IN N P
16. IN → with
8. N N P → John
Question 5. Calculate precision, recall and f-measure in order to score the following system against the
answer key. Assume any item reported by the system and found in the answer key is correct.
The system reports that the following strings of words describing attack events:

1. Jay Leno attacked Conan O’brien.

2. attacks by the U.S.-backed rebels Correct

3. the latest in a series of attacks in the 10-year-old civil war. Correct

4. Mr. Baldwin is also attacking the greater problem: lack of ringers.

5. the criminals were convicted for bombings. Correct

6. The broadway musical “Bridges of Madison County” bombed.

7. Groupon fires CEO Andrew Mason.

The answer key includes the following strings of words describing attack events:

1. the martians bombarded the Earth with death rays

2. attacks by the U.S.-backed rebels

3. the latest in a series of attacks in the 10-year-old civil war.

4. the criminals were convicted for bombings.

5. the allies launched a missile at the enemy stronghold.

Precision = 3/7 ≈ .429

Recall = 3/5 = .6
2
F-measure = 1 2+ 3 = 7/3+5/3 = .5
3/7 5
Question 6. Fill in the CKY chart below for sentence The rain rains down assuming the following rules:

1. S → NP VP

2. NP → N

3. NP → DT N

4. VP → V ADVP

5. VP → V

6. ADVP → ADV

7. DT → the

8. N → rain

9. N → rains

10. V → rain

11. V → rains

12. ADV → down

The rain rains down

1 2 3 4
0 DT NP S S

1 N, V, NP, VP S S

2 N, V, NP, VP VP

3 ADV, ADVP
Question 7. Some defining characteristics of organization and facility as per the ACE guidelines are as
follows:

• An Organization entity must have some formally established association. Typical examples are busi-
nesses, government units, sports teams, and formally organized music groups. Industrial sectors and
industries are also treated as Organization entities. (ACE Entity Guidelines v6.6, page 7)

• A facility is a functional, primarily man-made structure. These include buildings and similar facilities
designed for human habitation, such as houses, factories, stadiums, office buildings, gymnasiums, pris-
ons, museums, and space stations; objects of similar size designed for storage, such as barns, parking
garages and airplane hangars; elements of transportation infrastructure, including streets, highways,
airports, ports, train stations, bridges, and tunnels. Roughly speaking, facilities are artifacts falling
under the domains of architecture and civil engineering. (ACE Entity Guidelines v6.6, page 22)

In the following text from the May 3, 2012 New York Times (A House Tour: Yes, That House) mark
the organizations by underlining them and writing an ORG immediately above them; mark the facilities by
underlining them and writing FAC immediately above. If a particular piece of text is difficult to mark only
ORG or only FAC, mark it ORG/FAC. Mark noun groups ignoring determiners including both names and
common nouns representing FAC and ORG constiuents. Do not mark pronouns.

After the 9/11 attacks, the system changed radically. Now, anyone who
wants to tour the White House/FAC must apply through the office/ORG of his
or
her representative in Congress/ORG, which forwards the names to the White
House/ORG for clearance...
Once they get the green light, visitors show up at the appointed time
on 15th Street/FAC between E/FAC and F Streets/FAC and join the line to enter
through the southeast gate/FAC.
Anyone who has flown on an airline/ORG in recent years will recognize the
familiar territory of identity checks and electronic scans, although
here you do get to keep your shoes on. At the head of the line,
rangers from the National Park Service/ORG check photo IDs against a list
of names.
Question 8. Assuming that the following sentence is at the beginning of a file, fill in the table below listing
each token (word and punctuation), along with its start character offset and its end character offset. Note that
there are more blank lines in the table than there are tokens. So it is expected that you will leave one or more
line blank.

This sentence contains words, characters, spaces and punctuation.

Token Start Offset End Offset

This 0 4

sentence 5 13

contains 14 22

words 23 28

, 28 29

characters 30 40

, 40 41

spaces 42 48

and 49 52

punctuation 53 64

. 64 65
1 VBZ

PRP .25
NNP
.50 1
.25
.25
.5 VBG

.25
End
Start .25 NNS .50 .50
.50
.33
.25

JJ .66

Figure 3: Prior Probability for Question 9

Question 9. Given the training data below, execute the following 3 steps: (a) calculate the likelihood
probabilities for each word given each POS; (b) draw a finite state machine where states are POS and edges
are labeled with transition probabilities; (c) draw a chart where the columns are positions in the sentence and
the rows are names of states (start, end, POS tags) and fill in the probability scores assigned by the Viterbi
algorithm assigning POS tags to the string flying planes.
Training Data:

• buffalo/NNS flying/VBG is/VBZ dangerous/JJ

• flying/JJ planes/NNS are/VBZ numerous/JJ

• I/PRP saw/VBZ Mary/NNP flying/VBG planes/NNS

• He/PRP planes/VBZ shelves/NNS

Likelihood for Question 9

JJ dangerous: .33 flying: .33 numerous: .33

NNP Mary: 1
NNS buffalo: .25 planes: .5 shelves: .25
PRP I: .5 he: .5
VBG flying: 1
VBZ is: .25 are: .25 saw: .25 planes: .25
BEGIN flying planes END
BEGIN 1.0

JJ .33 * .25

NNP

NNS (from JJ) .33 * .25 * .5 * .33

(from VBG) 0
PRP

VBG 1*0

VBZ (from JJ) .33 * .25 * .25 * 0

(from VBG) 0
END (from NNS) .33 * .25 * .5 * .33 * .5 ≈ .0068

Figure 4: Viterbi for Question 9

Question 10. Calculate the TFIDF for the terms listed below for documents 1 to 4. There are 10,000
documents in a collection. The number of times each of these terms occur in documents 1 to 4 as well as the
number of documents in the collections are listed below. Use this information to fill in the TFIDF scores in
the table below.
Number of Documents Containing Terms:
• reverse cascade: 3 IDF = log(10000/3) ≈ 8.11
• full shower: 50 IDF = log(10000/50) ≈ 5.30
• half bath: 10 IDF = log(10000/10) ≈ 6.91
• multiplex: 3 IDF = log(10000/3) ≈ 8.11

Term Frequencies
Documents
Doc 1 Doc 2 Doc 3 Doc 4
reverse cascade 8 10 0 0
full shower 3 1 2 2
half bath 0 0 8 7
multiplex 2 2 2 9

TFIDF for terms in documents

Documents
Doc 1 Doc 2 Doc 3 Doc 4
reverse cascade 8.11 * 8 = 64.88 8.11 * 10 = 81.10 0 0

full shower 5.30 * 3 + 15.90 5.30 * 1 = 5.30 5.30 * 2 = 10.60 5.30 * 2 = 10.60

half bath 0 0 6.91 * 8 = 55.28 6.91 * 7 = 48.37

multiplex 8.11 * 2 = 16.22 8.11 * 2 = 16.22 8.11 * 2 = 16.22 8.11 * 9 = 72.99

The White Knight - Eric Nicol
89% (9)
The White Knight - Eric Nicol
3 pages
CS-602 Computer Networks Lab Manual Updated
No ratings yet
CS-602 Computer Networks Lab Manual Updated
62 pages
Design and Analysis of Algorithm Questions and Answers
No ratings yet
Design and Analysis of Algorithm Questions and Answers
81 pages
Wa0030.
No ratings yet
Wa0030.
36 pages
CS606 FinalTerm 2016 MCQs
No ratings yet
CS606 FinalTerm 2016 MCQs
38 pages
Rich Automata Solns
100% (1)
Rich Automata Solns
187 pages
Return To Labyrinth, Vol 4
No ratings yet
Return To Labyrinth, Vol 4
212 pages
FDS Unit 1
No ratings yet
FDS Unit 1
21 pages
1694601295-Unit 3.6 Generalized Discriminant Analysis CU 2.0
100% (1)
1694601295-Unit 3.6 Generalized Discriminant Analysis CU 2.0
15 pages
Java MCQ
No ratings yet
Java MCQ
24 pages
AP PGECET CS and IT (CS-2015) Question Paper & Answer Key. Download All Previous Years Computer Science & Information Technology Sample & Model Question Papers.
100% (2)
AP PGECET CS and IT (CS-2015) Question Paper & Answer Key. Download All Previous Years Computer Science & Information Technology Sample & Model Question Papers.
16 pages
Pride and Predujice Q&A
100% (2)
Pride and Predujice Q&A
12 pages
Math Lnig2
No ratings yet
Math Lnig2
449 pages
MCQ On Knowledge Representation 5eea6a0e39140f30f369e525
No ratings yet
MCQ On Knowledge Representation 5eea6a0e39140f30f369e525
21 pages
Data Warehousing Mining MCQs
No ratings yet
Data Warehousing Mining MCQs
12 pages
VTU Exam Question Paper With Solution of 17CS73 Machine Learning Jan-2021-Swathi Y
No ratings yet
VTU Exam Question Paper With Solution of 17CS73 Machine Learning Jan-2021-Swathi Y
7 pages
Chapter Three
No ratings yet
Chapter Three
37 pages
Theory of Computation - Question Bank
No ratings yet
Theory of Computation - Question Bank
19 pages
Syllabus
No ratings yet
Syllabus
9 pages
Chapter 1:-: Basics of An Algorithm and Mathematics
100% (1)
Chapter 1:-: Basics of An Algorithm and Mathematics
34 pages
Quiz # 2 - Writing Bibliography
100% (2)
Quiz # 2 - Writing Bibliography
4 pages
Tutorial 1
100% (1)
Tutorial 1
23 pages
Compiler Mcqs Last Updated Solved Complete
No ratings yet
Compiler Mcqs Last Updated Solved Complete
57 pages
Assignment 11
100% (1)
Assignment 11
4 pages
CS8691 Artificial Intelligence MCQ Quest
No ratings yet
CS8691 Artificial Intelligence MCQ Quest
47 pages
NLP MCQ 153 Out of 427 - Part One
No ratings yet
NLP MCQ 153 Out of 427 - Part One
30 pages
Trie and Redblack Tree Mcqs
No ratings yet
Trie and Redblack Tree Mcqs
9 pages
Compiler Design Quiz Test-1 - Question - Paper
No ratings yet
Compiler Design Quiz Test-1 - Question - Paper
10 pages
Matlab File - Deepak - Yadav - Bca - 4TH - Sem - A50504819015
No ratings yet
Matlab File - Deepak - Yadav - Bca - 4TH - Sem - A50504819015
59 pages
Code Optimization
0% (1)
Code Optimization
90 pages
MCQs On Tree With Answers
100% (2)
MCQs On Tree With Answers
8 pages
Advanced Database Management System-Mcq
No ratings yet
Advanced Database Management System-Mcq
8 pages
Design and Analysis of Algorithms Solved MCQs (Set-11)
No ratings yet
Design and Analysis of Algorithms Solved MCQs (Set-11)
7 pages
Unit 3 Big Data MCQ AKTU: Royal Brinkman Gartenbaubedarf
No ratings yet
Unit 3 Big Data MCQ AKTU: Royal Brinkman Gartenbaubedarf
17 pages
Mid Term Date Sheet 2022 Fall Foc Iub
No ratings yet
Mid Term Date Sheet 2022 Fall Foc Iub
53 pages
NLP Questions and Answers MCQ
No ratings yet
NLP Questions and Answers MCQ
7 pages
Question Bank of Applied Machine Learning
No ratings yet
Question Bank of Applied Machine Learning
2 pages
hw7 Sol 2
No ratings yet
hw7 Sol 2
10 pages
02 - Data Types - MCQ
No ratings yet
02 - Data Types - MCQ
4 pages
MCQ On Trees
No ratings yet
MCQ On Trees
2 pages
AoA Important Question
100% (1)
AoA Important Question
3 pages
1-NLP - Lab Manual
No ratings yet
1-NLP - Lab Manual
15 pages
NLP Asgn2
No ratings yet
NLP Asgn2
7 pages
DSF Unit IV MCQ Notes
No ratings yet
DSF Unit IV MCQ Notes
6 pages
Important Questions and Answers of Big Data Course
No ratings yet
Important Questions and Answers of Big Data Course
4 pages
Compiler Design Final Question Bank
No ratings yet
Compiler Design Final Question Bank
5 pages
Practice Questions Logic
No ratings yet
Practice Questions Logic
15 pages
Data Warehouse and Data Mining Question Bank R13 PDF
No ratings yet
Data Warehouse and Data Mining Question Bank R13 PDF
12 pages
AI310 & CS361 AI (Mainstream) Midterm Exam (Fall 2023) ANSWER KEY
No ratings yet
AI310 & CS361 AI (Mainstream) Midterm Exam (Fall 2023) ANSWER KEY
2 pages
DAABits
No ratings yet
DAABits
4 pages
FINC 614 Introduction To Data Science Mid Term Exam Do The Following in R and Turn in A Word or PDF Document Generated With Knitr, Via Blackboard
No ratings yet
FINC 614 Introduction To Data Science Mid Term Exam Do The Following in R and Turn in A Word or PDF Document Generated With Knitr, Via Blackboard
1 page
Oracle Technical Questions
No ratings yet
Oracle Technical Questions
14 pages
Week-1 Assessment-1 Answers
No ratings yet
Week-1 Assessment-1 Answers
3 pages
AI Unit II All Topics
No ratings yet
AI Unit II All Topics
114 pages
440 Sample Questions Dec
No ratings yet
440 Sample Questions Dec
7 pages
Unit 5 MCQ It 8074 Soa
No ratings yet
Unit 5 MCQ It 8074 Soa
13 pages
Data Structure For GATE
No ratings yet
Data Structure For GATE
5 pages
Mid-Term Test COSC3101.03: Design & Analysis of Algorithms
No ratings yet
Mid-Term Test COSC3101.03: Design & Analysis of Algorithms
5 pages
Faculty of Engineering Scit B. Tech It/Cse/Cce VI Semester First Mid Term Examination: 2021-22 Data Mining and Warehousing (IT3240)
No ratings yet
Faculty of Engineering Scit B. Tech It/Cse/Cce VI Semester First Mid Term Examination: 2021-22 Data Mining and Warehousing (IT3240)
2 pages
List of D100 Games (With Affiliate Links!)
100% (1)
List of D100 Games (With Affiliate Links!)
8 pages
Cp5151 Advanced Data Structures and Algorithims
No ratings yet
Cp5151 Advanced Data Structures and Algorithims
3 pages
Unit 5a
No ratings yet
Unit 5a
31 pages
Subject-Distributed Computing: Question Bank For Oral Exam
No ratings yet
Subject-Distributed Computing: Question Bank For Oral Exam
1 page
Basic Chinese Grammar and Sentence Patterns
No ratings yet
Basic Chinese Grammar and Sentence Patterns
114 pages
ST Microelectronics Interview Questions
No ratings yet
ST Microelectronics Interview Questions
4 pages
Practice Final Exam For Natural Language Processing
No ratings yet
Practice Final Exam For Natural Language Processing
9 pages
Masters Level Literature Review Sample
100% (3)
Masters Level Literature Review Sample
7 pages
Explanatory Writing
No ratings yet
Explanatory Writing
26 pages
Quoting, Paraphrasing, & Summarizing - UAGC Writing Center
No ratings yet
Quoting, Paraphrasing, & Summarizing - UAGC Writing Center
9 pages
G U G U R e D A S A M U K A
No ratings yet
G U G U R e D A S A M U K A
3 pages
ENG 210 Study Guide 2025
No ratings yet
ENG 210 Study Guide 2025
55 pages
Edgar Allan Poe Collected Works of Poe, Volume III Websters Korean Thesaurus Edition 2006 PDF
No ratings yet
Edgar Allan Poe Collected Works of Poe, Volume III Websters Korean Thesaurus Edition 2006 PDF
332 pages
Dandelion Wine Chart Final
No ratings yet
Dandelion Wine Chart Final
2 pages
University of Delhi: Semester Examination Nov-Dec 2020 Statement of Marks/Grades
No ratings yet
University of Delhi: Semester Examination Nov-Dec 2020 Statement of Marks/Grades
2 pages
Of Studies by Francis Bacon
No ratings yet
Of Studies by Francis Bacon
23 pages
Power System Analysis and Design SI Edition Fifth Edition J. Duncan Glover PDF Download
No ratings yet
Power System Analysis and Design SI Edition Fifth Edition J. Duncan Glover PDF Download
48 pages
3
No ratings yet
3
4 pages
Decameron
No ratings yet
Decameron
56 pages
Handbook of Digital Games and Entertainment Technologies 1st Edition Ryohei Nakatsu
No ratings yet
Handbook of Digital Games and Entertainment Technologies 1st Edition Ryohei Nakatsu
49 pages
Tennessee Williams Plastic Theater A Formulation of Dramaturgy
No ratings yet
Tennessee Williams Plastic Theater A Formulation of Dramaturgy
107 pages
Model Exam March 2025.docx-1
No ratings yet
Model Exam March 2025.docx-1
2 pages
Old English Literature
No ratings yet
Old English Literature
2 pages
Flashcards 2 G
No ratings yet
Flashcards 2 G
15 pages
English Multiple Choice Questions - Grade 6 - Part 1a
No ratings yet
English Multiple Choice Questions - Grade 6 - Part 1a
2 pages
A Fighter's Lines Poem (Comprehension Questions)
No ratings yet
A Fighter's Lines Poem (Comprehension Questions)
2 pages
The Road Not Taken
No ratings yet
The Road Not Taken
4 pages
Term - Wise Syllabus Break - Up Class 12,11
No ratings yet
Term - Wise Syllabus Break - Up Class 12,11
1 page
The Lady and The Tiger Questions
No ratings yet
The Lady and The Tiger Questions
1 page
1.1 Quotes About Friendship: Student A, Student B, or Student C
No ratings yet
1.1 Quotes About Friendship: Student A, Student B, or Student C
2 pages
Mary Shelley S Frankenstein Genesis of A
No ratings yet
Mary Shelley S Frankenstein Genesis of A
4 pages

Practice Exam and Solution For Natural Language Processing

Uploaded by

Practice Exam and Solution For Natural Language Processing

Uploaded by

Practice Midterm Exam for Natural Language Processing

• Pamela J. Fischer M.D.

• Leighton E. Cluff M.D.

• James S. Thompson, M.D.

• C.M. Franklin, M.D.

• Atul Gawande, M.D.

• Dr. J. Gordon Melton

• Dr. Etienne-Emile Baulieu

• Dr. Karl Thomae

• Dr. Alan D. Lourie

• Dr. Xiaotong Fei

• Doctor William Archibald Spooner

Figure 1: Correct Instances of Doctors in Our Corpus

((Doctor|Dr\.)( [A-Z][a-z\.]+)+)|(([A-Z][a-z\.]+ )+M\.D\.)

Table 1: Penn Treebank POS tags

John and Mary a refrigerator IN NP

Figure 2: Possible Answer to Question 4

2. N P → N P CC N P 10. N N → ref rigerator

1. Jay Leno attacked Conan O’brien.

2. attacks by the U.S.-backed rebels Correct

3. the latest in a series of attacks in the 10-year-old civil war. Correct

4. Mr. Baldwin is also attacking the greater problem: lack of ringers.

5. the criminals were convicted for bombings. Correct

6. The broadway musical “Bridges of Madison County” bombed.

7. Groupon fires CEO Andrew Mason.

1. the martians bombarded the Earth with death rays

2. attacks by the U.S.-backed rebels

3. the latest in a series of attacks in the 10-year-old civil war.

4. the criminals were convicted for bombings.

5. the allies launched a missile at the enemy stronghold.

Precision = 3/7 ≈ .429

12. ADV → down

The rain rains down

This sentence contains words, characters, spaces and punctuation.

Token Start Offset End Offset

Figure 3: Prior Probability for Question 9

• buffalo/NNS flying/VBG is/VBZ dangerous/JJ

• flying/JJ planes/NNS are/VBZ numerous/JJ

• I/PRP saw/VBZ Mary/NNP flying/VBG planes/NNS

• He/PRP planes/VBZ shelves/NNS

Likelihood for Question 9

JJ dangerous: .33 flying: .33 numerous: .33

NNS (from JJ) .33 * .25 * .5 * .33

VBZ (from JJ) .33 * .25 * .25 * 0

Figure 4: Viterbi for Question 9

TFIDF for terms in documents

half bath 0 0 6.91 * 8 = 55.28 6.91 * 7 = 48.37

multiplex 8.11 * 2 = 16.22 8.11 * 2 = 16.22 8.11 * 2 = 16.22 8.11 * 9 = 72.99

You might also like