NLP Final
1.1 (0 P) Just to be sure: Write your first (given) name, your last (family) name, and your matriculation number (just as a sanity check).
1.2 (3 P) Describe / list the patterns that may be detected by the regular expression
o+u?h|a+h+|hm+
(1 P) What is the possible purpose of this RegEx?
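Not part of the original exam sheet: a minimal sketch, assuming Python's standard re module, of how one could probe which strings this expression fully matches; the sample strings are illustrative assumptions, not taken from the exam.

```python
import re

# The regular expression from question 1.2.
pattern = re.compile(r"o+u?h|a+h+|hm+")

# Illustrative probe strings (assumed, not from the exam sheet).
for s in ["oh", "oooh", "ouh", "ah", "aahh", "hm", "hmmm", "hello"]:
    print(f"{s!r}: {bool(pattern.fullmatch(s))}")
```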
1.3 (2 P) You fit Heaps' law |V| = k·N^β to two different documents and you get the following values:
1. β = 0.99, k = 1
2. β = 0.71, k = 80
Provide a reasonable suggestion of the nature of the documents!
1.4 (2 P) You fit Heaps' law |V| = k·N^β to the Java source code of the JDK class library. Provide a meaningful guess for the value of β you would get and give a short reasoning!
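Not part of the original sheet: a minimal sketch, assuming Python with NumPy, of how values for k and β could be obtained in practice; the whitespace tokenizer and the toy text are placeholder assumptions, and the log-log least-squares fit is just one common estimation method.

```python
import numpy as np

def fit_heaps(tokens):
    """Fit Heaps' law |V| = k * N^beta by linear regression in log-log space."""
    vocab, N, V = set(), [], []
    for i, tok in enumerate(tokens, start=1):
        vocab.add(tok)
        N.append(i)           # number of tokens read so far
        V.append(len(vocab))  # vocabulary size so far
    # log|V| = log k + beta * log N  ->  ordinary least squares on the logs
    beta, log_k = np.polyfit(np.log(N), np.log(V), deg=1)
    return np.exp(log_k), beta

# Placeholder text; in the exam setting this would be the document's token stream.
tokens = "to be or not to be that is the question".split()
k, beta = fit_heaps(tokens)
print(f"k = {k:.2f}, beta = {beta:.2f}")
```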
1.5 (2 P) What are the mathematical consequences in terms of conditional independence when applying a tri-gram approximation for a language model for four-word sequences P(w1 w2 w3 w4) (using no beginning- or end-of-sequence markers)?
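As a reminder of the definitions the question builds on (not part of the original sheet): the chain rule for a word sequence and the general trigram (second-order Markov) approximation of each factor.

```latex
P(w_1 \dots w_n) = \prod_{i=1}^{n} P(w_i \mid w_1 \dots w_{i-1}),
\qquad
P(w_i \mid w_1 \dots w_{i-1}) \approx P(w_i \mid w_{i-2}\, w_{i-1})
```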
Problem 2 Language Models (ctd.), Naive Bayes vs. Logistic Regression, Embeddings (10 credits)
2.1 (3 P) Language models: Absolute Discounting. We have seen: introducing priors → adjusted ("discounted") counts are smaller than the original counts (shifting some probability mass to unseen words / N-grams ("zeros")). Church & Gale 1991: What would the second column of the table ("number of bigrams in first half of data that occurred n times") look like for the second half of the data?
[Slide figure, not fully reproduced here: the Church & Gale (1991) table with columns "n", "number of bigrams in first half of data that occurred n times", "total number of occurrences of ...", "average no. of occurrences of ...", together with the remark "Suppose we wanted to subtract a little from a count of 4 to save ...".]
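Not part of the original sheet: a minimal sketch, assuming Python, of the held-out counting procedure the question refers to (split the data into two halves; for the bigrams occurring n times in the first half, look at their counts in the second half). The toy token stream is a placeholder; Church & Gale worked with two halves of a large newswire corpus.

```python
from collections import Counter, defaultdict

def heldout_table(tokens):
    """For each first-half count n: how many bigrams have it, and their average held-out count."""
    half = len(tokens) // 2
    bigrams = lambda toks: list(zip(toks, toks[1:]))
    c_first = Counter(bigrams(tokens[:half]))
    c_second = Counter(bigrams(tokens[half:]))

    by_n = defaultdict(list)
    for bg, n in c_first.items():
        by_n[n].append(c_second[bg])  # count of the same bigram in the held-out half

    return {n: (len(cs), sum(cs) / len(cs)) for n, cs in sorted(by_n.items())}

tokens = "the cat sat on the mat and the cat sat on the hat".split()
for n, (num_bigrams, avg_heldout) in heldout_table(tokens).items():
    print(f"n = {n}: {num_bigrams} bigrams in first half, average held-out count {avg_heldout:.2f}")
```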
2.2 (3 P) Naive Bayes vs. Logistic Regression: How is P(x, y|θ) mathematically decomposed for a generative classifier, and how is it decomposed for a discriminative classifier? (In your answer, you can e.g. write θ as "theta".)
2.3 (2 P) What is an advantage of Logistic Regression compared to Naive Bayes? State and explain one advantage (not more than one)!
"
2.4 (2 P) Your new PhD student suggests to train Word2Vec Skip-Gram embeddings, replacing the inner-product-based similarity t · c in P(+|t, c) = 1/(1 + exp(−t · c)) with a similarity measure based on the Jensen-Shannon divergence between the vectors t / Σ_i t_i and c / Σ_i c_i. Provide one reasonable counter-argument for doing that (not more than one)!
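For reference (not part of the sheet): the standard Skip-Gram-with-negative-sampling probability that the question starts from, as a minimal NumPy sketch; the example vectors are arbitrary assumptions.

```python
import numpy as np

def p_positive(t, c):
    """Skip-Gram probability that (t, c) is a genuine target/context pair: sigma(t . c)."""
    return 1.0 / (1.0 + np.exp(-np.dot(t, c)))

# Arbitrary illustrative embeddings; note that entries may be negative,
# so t / sum_i t_i is not in general a probability distribution.
t = np.array([0.5, -1.2, 0.3])
c = np.array([0.4, -0.9, 0.1])
print(p_positive(t, c))
```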
"
Problem 3 CCG parsing, Constituency Grammars (10 credits)
3.1 (3 P) In the following incomplete CCG parse, provide the missing three categories!
[Figure, not reproduced here: an incomplete CCG derivation with three blank categories.]
3.2 (4 P) CCG parsing with the A* algorithm: provide expressions for w, x, y and z in terms of the a_i, b_i and c_i, assuming that a1 < a2 < a3 < a4, b1 < b2 and c1 < c2 < c3!
[Figure, not fully reproduced here: an A* parsing chart over "Bayern beats Schalke" with cost axis −log P, chart items such as Bayern: N/N and Bayern beats: N[0,2], the entries w, x, y, z, the goal state, and the initial agenda N/N: a1, NP: a2, S/S: a3, S\S: a4, (S\NP)/NP: b1, N: b2, NP: c1, N/N: c2, S/(S\NP): c3.]
3.3 (3 P) Explain the motivation for subcategorization of verb phrases, especially for training machine-…
"
"
Problem 4 GloVe embeddings, LSTM neurons (10 credits)
4.1 (3 P) Motivation for GloVe embeddings: Given the following table, sort the numbers a1, a2, a3, a4 in ascending order (e.g. a2 = a4 < a1 < a3)!
4.2 (2 P) Motivation for GloVe embeddings: aside from symmetry or group-homomorphism considerations, motivate why setting u_i^T v_k = log P(i|k) also makes intuitive sense!
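For reference (not part of the sheet): in the GloVe setting, P(i|k) is the co-occurrence probability estimated from the word-word count matrix X.

```latex
P(i \mid k) = \frac{X_{ki}}{X_k}, \qquad X_k = \sum_{j} X_{kj}
```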
4.3 (3 P) Assuming you had never heard of contextual embeddings (such as BERT), how can static embeddings (GloVe, Word2Vec etc.) deal with different word senses? Why is that not really practical?
4.4 (2 P) LSTM neurons: why do we use the Hadamard product in connection with the gates and not the …?
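Not part of the original sheet: a minimal NumPy sketch of the LSTM cell-state update under standard notation (f_t and i_t are gate activations, c_t the cell state; biases and the output gate are omitted for brevity). The * operator below is the element-wise (Hadamard) product the question refers to, and all sizes and weights are illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Toy dimensions and randomly initialised parameters (illustrative assumptions).
rng = np.random.default_rng(0)
d_h, d_x = 4, 3
W_f, W_i, W_c = (rng.normal(size=(d_h, d_h + d_x)) for _ in range(3))
h_prev, c_prev, x_t = rng.normal(size=d_h), rng.normal(size=d_h), rng.normal(size=d_x)

z = np.concatenate([h_prev, x_t])
f_t = sigmoid(W_f @ z)               # forget gate activation
i_t = sigmoid(W_i @ z)               # input gate activation
c_tilde = np.tanh(W_c @ z)           # candidate cell state
c_t = f_t * c_prev + i_t * c_tilde   # Hadamard products: each gate scales each component
print(c_t)
```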
"
"
Problem 5 Modern neural models (10 credits)
5.1 (2 P) Some systems use hybrid combinations of word-based approaches and character-based approaches for neural machine translation. Provide one pro-argument for these approaches and provide one counter-argument for these approaches! Do not provide more than one argument each!
5.2 (2 P) What is the problem with just sticking to the standard word-based softmax architectures when vocabularies become large?
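Not part of the sheet: a tiny NumPy sketch making the setup concrete. The output layer has to produce and normalise one logit per vocabulary entry, so this step grows with |V|; the hidden size and vocabulary size below are arbitrary assumptions.

```python
import numpy as np

d_h, vocab_size = 512, 100_000       # illustrative sizes
W_out = np.zeros((vocab_size, d_h))  # output projection: one row per vocabulary word
h = np.zeros(d_h)                    # decoder hidden state

logits = W_out @ h                   # |V| x d_h matrix-vector product per time step
probs = np.exp(logits - logits.max())
probs /= probs.sum()                 # normalisation touches all |V| entries
print(probs.shape)
```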
5.3 (3 P) Another idea to deal with the problems associated with large vocabularies is combining standard word-based softmax architectures with Pointer Networks. What do pointer networks and basic attention have in common?
5.4 (3 P) Paper "Attention is all you need" (Vaswani et al., 2017): Multi-Head Attention is defined as

MultiHead(Q, K, V) = Concat(head_1, ..., head_h) W^O    (5.1)
where head_i = Attention(Q W_i^Q, K W_i^K, V W_i^V)     (5.2)

Someone states: "This is very similar to CNN approaches as in the paper Kim, Y. (2014), 'Convolutional neural networks for sentence classification'."
Provide one argument supporting the statement and one counter-argument! Do not provide more than one argument each!
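Not part of the original sheet: a minimal NumPy sketch of equations (5.1) and (5.2), with scaled dot-product attention as the Attention(...) building block; dimensions and the random projections are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

def multihead(Q, K, V, h=4, d_model=32):
    d_k = d_model // h
    heads = []
    for _ in range(h):
        W_Q, W_K, W_V = (rng.normal(size=(d_model, d_k)) for _ in range(3))
        heads.append(attention(Q @ W_Q, K @ W_K, V @ W_V))  # eq. (5.2)
    W_O = rng.normal(size=(h * d_k, d_model))
    return np.concatenate(heads, axis=-1) @ W_O             # eq. (5.1)

X = rng.normal(size=(5, 32))     # toy sequence of 5 token representations
print(multihead(X, X, X).shape)  # (5, 32)
```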
"
"
Problem 6 BERT and GPT (10 credits)
6.1 (3 P) Explain why BERT is not immediately usable for generation (e.g. as a language model)!
6.2 (3 P) Someone suggests using BERT as an encoder and GPT as a decoder in a seq-to-seq architecture. The decoder would attend to the encoder in a similar way as in the original Transformer model (Vaswani et al. 2017). Motivate the usefulness of this architecture for seq-to-seq tasks!
6.3 (4 P) How could such a system as suggested in the previous assignment be trained and used for …
"