NLP Unit-5

This document presents the department's vision and mission, the course objectives and outcomes for a Natural Language Processing course, the course scheme and syllabus (morphology, part-of-speech tagging, syntax, semantics, language modelling, and probabilistic parsing), the recommended text and reference books, and the Unit-5 material on probabilistic parsing and disambiguation.

Index

1. Department Vision and Mission


2. Course Objectives & Course Outcome
3. Scheme & Syllabus
4. Text Book & Reference Book
5. Unit - 5 Information

Department Vision and Mission

Our Vision
• To continually improve the educational environment in order to develop graduates with the strong academic and technical background needed to achieve distinction in the discipline. This excellence is expected across domains such as the workforce, higher studies, and lifelong learning.
• To strengthen links with industry through partnerships and collaborative development work.

Our Mission
• To develop a strong foundation in the theory and practice of computer science among students, enabling them to grow into knowledgeable, responsible professionals and lifelong learners who apply the latest computing technologies for the betterment of society.
Course Objectives & Course Outcomes

Course Objectives
I. To familiarize students with the concepts and techniques of Natural Language Processing for analyzing words based on morphology and corpus.
II. To relate mathematical foundations and probability theory to linguistic essentials such as syntactic and semantic analysis of text.
III. To apply statistical learning methods and cutting-edge research models to solve NLP problems.

Course Outcomes
After completing the course, the student will be able to:
1. Apply the principles and processes of human languages using computers.
2. Demonstrate state-of-the-art algorithms and techniques for text-based processing of natural languages with respect to morphology.
3. Perform POS tagging for a given natural language.
4. Create a linguistic corpus based on the text-corpus method.
5. Realize semantics and pragmatics of natural languages for text processing.
6. Develop statistical methods for real-world NLP applications.
Scheme & Syllabus
Load: 3 hrs (Theory) + 0 hr (Tutorial) | Credits: 3 | Total Marks: 100 | Continuous Assessment: 40 | ESE Marks: 60

UNIT I: Introduction to NLP, Morphology: Introduction to NLP, Stages of NLP, Ambiguity, Information Theory Essentials, Linguistic Essentials: Parts of Speech and Morphology, Morphological analysis and generation using Finite State Automata and Finite State Transducers.

UNIT II: Markov Model and POS Tagging: Markov Model: Hidden Markov Model, Fundamentals, Probability of properties, Parameter estimation, Variants, Multiple input observation. The Information Sources in Tagging: Markov model taggers, Viterbi algorithm, Applying HMMs to POS tagging, Applications of Tagging.

UNIT III: Syntax and Semantics: Shallow Parsing and Chunking, Shallow Parsing with Conditional Random Fields (CRF), Lexical Semantics, WordNet, Thematic Roles, Semantic Role Labelling with CRFs.
UNIT IV: Language Modelling: Corpus-based work, Statistical Inference: n-gram Models over Sparse Data, Methodological Preliminaries, Supervised Disambiguation: Bayesian classification, An information-theoretic approach, Dictionary-Based Disambiguation: Disambiguation based on sense, Thesaurus-based disambiguation, Disambiguation based on translations in a second-language corpus.

UNIT V: Probabilistic Parsing and Disambiguation: Probabilistic Context-Free Grammars and Probabilistic Parsing, The Probability of a String, Problems with the Inside-Outside Algorithm, Parsing for disambiguation, Treebanks, Parsing models vs. language models, Phrase structure grammars and dependency, Lexicalized models using derivational histories, Dependency-based models.

UNIT VI: NLP Applications: Statistical Alignment and Machine Translation, Text alignment, Word alignment, Information extraction, Text mining, Information Retrieval, NL interfaces, Sentiment Analysis, Question Answering Systems, Social network analysis.
Text & Reference Book

Text Books:

1. Christopher D. Manning and Hinrich Schütze, “Foundations of Statistical Natural Language Processing”, The MIT Press, Cambridge, Massachusetts / London, England, 2003 (6th printing).
2. Daniel Jurafsky and James H. Martin, “Speech and Language Processing”, 2nd Edition, Prentice Hall, 2009.

Reference Books:

1. James Allen, “Natural Language Understanding”, 2nd Edition, Pearson Publication, 2012.
UNIT 5: Probabilistic Parsing and Disambiguation

Probabilistic Context Free Grammars and Probabilistic parsing


The Probability of a String,
Problems with the Inside-Outside Algorithm,
Parsing for disambiguation,
Treebanks,
Parsing models vs. language models,
Phrase structure grammars and dependency,
Lexicalized models using derivational histories,
Dependency-based models

Remember Unit 3: Parsing (Top-down Parsing)
Grammar

S → NP VP
NP → N N
VP → V NP
NP → N

Lexicon

Fed : N
interest : N, V
rates : N
raises : N, V
UNIT 5: Probabilistic CFGs
Handling Ambiguities
• Ambiguity-handling parsing algorithms can represent ambiguities efficiently, but they are not equipped to resolve them.
• Methods available for resolving ambiguities include:
• Semantics (choose the parse that makes sense).
• Statistics (choose the parse that is most likely).
• Probabilistic context-free grammars (PCFGs) offer a solution.

UNIT 5: Probabilistic CFGs
• A context-free grammar is a tuple <N, T, S, R>
• N: the set of non-terminals
• Phrasal categories: S, NP, VP, ADJ, etc.
• Parts-of-speech (pre-terminals): NN, JJ, DT, VB
• T: the set of terminals (the words)
• S: the start symbol
• Often written as ROOT or TOP
• Not usually the sentence non-terminal S
• R: the set of rules
• Of the form X → Y1 Y2 … Yk, with X, Yi ∈ N
• Examples: S → NP VP, VP → VP CC VP
• Also called rewrites, productions, or local trees
• A PCFG adds:
• A top-down production probability per rule, P(Y1 Y2 … Yk | X)
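To make the definition concrete, here is a minimal sketch (my own illustration in Python, using the small grammar that appears later in this unit) of a PCFG stored as a mapping from rules to probabilities, together with the consistency check that the probabilities of all rules sharing a left-hand side sum to 1:

    # Minimal sketch: a PCFG as a dictionary mapping (LHS, RHS) rules to probabilities.
    from collections import defaultdict

    pcfg = {
        ("S",  ("NP", "VP")): 1.0,
        ("VP", ("V", "NP")): 0.7,  ("VP", ("VP", "PP")): 0.3,
        ("PP", ("P", "NP")): 1.0,  ("NP", ("NP", "PP")): 0.4,
        ("NP", ("astronomers",)): 0.1, ("NP", ("ears",)): 0.18, ("NP", ("saw",)): 0.04,
        ("NP", ("stars",)): 0.18,      ("NP", ("telescope",)): 0.1,
        ("P",  ("with",)): 1.0,        ("V",  ("saw",)): 1.0,
    }

    # Consistency check: for every non-terminal X, the probabilities of rules X -> ... sum to 1.
    totals = defaultdict(float)
    for (lhs, rhs), p in pcfg.items():
        totals[lhs] += p
    assert all(abs(total - 1.0) < 1e-9 for total in totals.values())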
UNIT 5: Probabilistic CFGs
• The probabilistic model
• Assigning probabilities to parse trees
• Getting the probabilities for the model
• Parsing with probabilities
• Slight modification to dynamic programming approach
• Task is to find the max probability tree for an input
• Getting the Probabilities
• From an annotated database (a treebank)
• Learned from a corpus
• Assume PCFG is in Chomsky Normal Form
• (production is either A → B C or A → a)
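For instance, here is a hedged sketch (the toy rule counts are invented for illustration) of the standard relative-frequency estimate used when reading probabilities off a treebank, P(A → β) = Count(A → β) / Count(A):

    # Sketch: maximum-likelihood estimation of PCFG rule probabilities from treebank counts.
    from collections import Counter

    # Hypothetical list of rule occurrences collected from parsed treebank sentences.
    observed_rules = [
        ("S", ("NP", "VP")),
        ("NP", ("DT", "NN")), ("NP", ("DT", "NN")), ("NP", ("NP", "PP")),
        ("VP", ("V", "NP")),  ("PP", ("P", "NP")),
    ]

    rule_counts = Counter(observed_rules)                   # Count(A -> beta)
    lhs_counts = Counter(lhs for lhs, _ in observed_rules)  # Count(A)

    rule_prob = {rule: count / lhs_counts[rule[0]] for rule, count in rule_counts.items()}
    print(rule_prob[("NP", ("DT", "NN"))])   # 2/3, since NP occurs 3 times as a left-hand side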
UNIT 5: Probabilistic CFGs

Chomsky Normal Form (CNF)


All rules have one of two forms:

A → B C   (Non-Terminal → Non-Terminal Non-Terminal)
A → a     (Non-Terminal → terminal)
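A small sketch (my own check, not from the slides) that tests whether a single rule fits one of these two CNF patterns:

    # Sketch: is a rule (lhs, rhs) in Chomsky Normal Form, given the set of non-terminals?
    def is_cnf(lhs, rhs, nonterminals):
        if len(rhs) == 2:                                   # A -> B C: both must be non-terminals
            return all(symbol in nonterminals for symbol in rhs)
        if len(rhs) == 1:                                   # A -> a: must be a single terminal
            return rhs[0] not in nonterminals
        return False                                        # anything longer or empty is not CNF

    print(is_cnf("S", ("A", "S"), {"S", "A"}))       # True
    print(is_cnf("S", ("A", "A", "S"), {"S", "A"}))  # False: three symbols on the right
    print(is_cnf("A", ("a", "a"), {"S", "A"}))       # False: two terminals on the right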


UNIT 5: Some Features of PCFGs

• A PCFG gives some idea of the plausibility of different parses.
• However, the probabilities are based on structural factors, not lexical ones.
• PCFGs are good for grammar induction.
• PCFGs are robust.
• PCFGs give a probabilistic language model for English.
• The predictive power of a PCFG (measured by entropy) tends to be greater than that of an HMM.
• PCFGs are not good models on their own, but they can be combined with a trigram model.
• PCFGs have certain biases which may not be appropriate.

UNIT 5: Probabilistic CFGs
Examples:

S  AS S  AS
S a S  AAS
A  SA A  SA
Ab A  aa

Chomsky Normal Form Not Chomsky Normal Form ???

UNIT 5: A Simple PCFG (in CNF)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

astronomers saw stars with ears

Draw all parse tree for this sentence, (Top Down)

UNIT 5: A Simple PCFG (in CNF)

astronomers saw stars with ears

[This slide showed the two parse trees for the sentence: t1, in which the PP "with ears" attaches to the NP "stars" (via NP → NP PP), and t2, in which it attaches to the VP (via VP → VP PP).]
UNIT 5: A Simple PCFG (in CNF)
Tree and String Probabilities

S → NP VP    1.0        NP → NP PP        0.4
VP → V NP    0.7        NP → astronomers  0.1
VP → VP PP   0.3        NP → ears         0.18
PP → P NP    1.0        NP → saw          0.04
P → with     1.0        NP → stars        0.18
V → saw      1.0        NP → telescope    0.1

• w15 = astronomers saw stars with ears (the word string w1 … w5)

• P(t1) = 1.0 × 0.1 × 0.7 × 1.0 × 0.4 × 0.18 × 1.0 × 1.0 × 0.18 = 0.0009072
• P(t2) = 1.0 × 0.1 × 0.3 × 0.7 × 1.0 × 0.18 × 1.0 × 1.0 × 0.18 = 0.0006804
• P(w15) = P(t1) + P(t2) = 0.0009072 + 0.0006804 = 0.0015876
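As a sketch of the same calculation in code (my own illustration; the rule lists for t1 and t2 are read off the two parse trees above), the probability of a tree is the product of the probabilities of the rules in its derivation, and the probability of the string is the sum over its trees:

    # Sketch: P(tree) = product of rule probabilities; P(string) = sum over its parse trees.
    from math import prod

    rule_prob = {
        ("S", "NP VP"): 1.0, ("VP", "V NP"): 0.7, ("VP", "VP PP"): 0.3,
        ("PP", "P NP"): 1.0, ("NP", "NP PP"): 0.4,
        ("NP", "astronomers"): 0.1, ("NP", "ears"): 0.18, ("NP", "saw"): 0.04,
        ("NP", "stars"): 0.18, ("NP", "telescope"): 0.1,
        ("P", "with"): 1.0, ("V", "saw"): 1.0,
    }

    # t1: the PP "with ears" attaches to the NP "stars"; t2: it attaches to the VP.
    t1 = [("S", "NP VP"), ("NP", "astronomers"), ("VP", "V NP"), ("V", "saw"),
          ("NP", "NP PP"), ("NP", "stars"), ("PP", "P NP"), ("P", "with"), ("NP", "ears")]
    t2 = [("S", "NP VP"), ("NP", "astronomers"), ("VP", "VP PP"), ("VP", "V NP"),
          ("V", "saw"), ("NP", "stars"), ("PP", "P NP"), ("P", "with"), ("NP", "ears")]

    p_t1 = prod(rule_prob[rule] for rule in t1)   # 0.0009072
    p_t2 = prod(rule_prob[rule] for rule in t2)   # 0.0006804
    print(p_t1 + p_t2)                            # 0.0015876, the string probability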
UNIT 5: A Simple PCFG (in CNF)

Grammar (rule, probability):
S → NP VP              0.8
S → Aux NP VP          0.1
S → VP                 0.1
NP → Pronoun           0.2
NP → Proper-Noun       0.2
NP → Det Nominal       0.6
Nominal → Noun         0.3
Nominal → Nominal Noun 0.2
Nominal → Nominal PP   0.5
VP → Verb              0.2
VP → Verb NP           0.5
VP → VP PP             0.3
PP → Prep NP           1.0

Lexicon (word, probability):
Det → the (0.6) | a (0.2) | that (0.1) | this (0.1)
Noun → book (0.1) | flight (0.5) | meal (0.2) | money (0.2)
Verb → book (0.5) | include (0.2) | prefer (0.3)
Pronoun → I (0.5) | he (0.1) | she (0.1) | me (0.3)
Proper-Noun → Houston (0.8) | NWA (0.2)
Aux → does (1.0)
Prep → from (0.25) | to (0.25) | on (0.1) | near (0.2) | through (0.2)

(The probabilities of the rules for each non-terminal sum to 1.0.)

book the flight through Houston


UNIT 5: A Simple PCFG (in CNF)
• Assume productions for each node are chosen independently.
• Probability of derivation is the product of the probabilities of its productions.
book the flight through Houston

Derivation D1 (the PP "through Houston" attached to the Nominal "flight"):

P(D1) = 0.1 × 0.5 × 0.5 × 0.6 × 0.6 × 0.5 × 0.3 × 1.0 × 0.2 × 0.2 × 0.5 × 0.8 = 0.0000216

(These twelve factors are the probabilities of the rules used in D1: S → VP, VP → Verb NP, Verb → book, NP → Det Nominal, Det → the, Nominal → Nominal PP, Nominal → Noun, Noun → flight, PP → Prep NP, Prep → through, NP → Proper-Noun, Proper-Noun → Houston.)
UNIT 5: A Simple PCFG (in CNF)
• Resolve ambiguity by picking the most probable parse tree.

book the flight through Houston

Derivation D2 (the PP "through Houston" attached to the VP):

P(D2) = 0.1 × 0.3 × 0.5 × 0.6 × 0.5 × 0.6 × 0.3 × 1.0 × 0.5 × 0.2 × 0.2 × 0.8 = 0.00001296

(These twelve factors are the probabilities of the rules used in D2: S → VP, VP → VP PP, VP → Verb NP, Verb → book, NP → Det Nominal, Det → the, Nominal → Noun, Noun → flight, PP → Prep NP, Prep → through, NP → Proper-Noun, Proper-Noun → Houston.)

• The probability of a sentence is the sum of the probabilities of all of its derivations:

P("book the flight through Houston") = P(D1) + P(D2) = 0.0000216 + 0.00001296 = 0.00003456

• Since P(D1) > P(D2), a parser disambiguating by maximum probability chooses D1, the reading in which "through Houston" modifies "flight".
UNIT 5: A Simple PCFG (in CNF)

can you book TWA flights


UNIT 5: A Simple PCFG (in CNF)
can you book TWA flights

P(S) = 3.2 × 10^-6


UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4 John Cocke, Daniel Younger, Tadao
VP  V NP 0.7 NP  astronomers 0.1
Kasami, and Jacob Schwartz (1961)
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18 astronomers saw stars with ears
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1
2 NP = 0.04
V = 1.0
3 NP = 0. 18
4 P = 1.0
5 NP = 0. 18
astronomers saw stars with ears

UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1 astronomers saw stars with ears
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1 ----
2 NP = 0.04 VP = 0.126
V = 1.0 (1 x 0.7 x 0. 18)
3 NP = 0. 18
4 P = 1.0
5 NP = 0. 18
astronomers saw stars with ears

UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1 astronomers saw stars with ears
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1 ---- S = 0.0126
(0.1 x 1.0 x 0.126 )
2 NP = 0.04 VP = 0.126
V = 1.0 (1 x 0.7 x 0. 18)
3 NP = 0. 18
4 P = 1.0
5 NP = 0. 18
astronomers saw stars with ears
UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1
astronomers saw stars with ears
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1 S = 0.0126 ----
(0.1 x 1.0 x 0.126 )
2 NP = 0.04 VP = 0.126 ----
V = 1.0 (1 x 0.7 x 0. 18)
3 NP = 0. 18 ----
4 P = 1.0
5 NP = 0. 18
astronomers saw stars with ears
UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1
astronomers saw stars with ears
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1 S = 0.0126 ----
(0.1 x 1.0 x 0.126 )
2 NP = 0.04 VP = 0.126 ----
V = 1.0 (1 x 0.7 x 0. 18)
3 NP = 0. 18 ----
4 P = 1.0 PP = 0. 18
(1.0 x 1.0 x 0.18)
5 NP = 0. 18
astronomers saw stars with ears

UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 NP  NP PP 0.4
VP  V NP 0.7 NP  astronomers 0.1
astronomers saw stars with ears
VP  VP PP 0.3 NP  ears 0.18
PP  P NP 1.0 NP  saw 0.04
P  with 1.0 NP  stars 0.18
V  saw 1.0 NP  telescope 0.1

1 2 3 4 5
1 NP = 0.1 S = 0.0126 ---- S = 0.0015876
(0.1 x 1.0 x 0.126 ) (ref next slide for calculation)
2 NP = 0.04 VP = 0.126 ---- VP = 0.015876
V = 1.0 (1 x 0.7 x 0. 18) (ref next slide for calculation)

3 NP = 0. 18 ---- NP = 0.01296


(0.18 x 0.4 x 0.18)
4 P = 1.0 PP = 0. 18
(1.0 x 0.18)
5 NP = 0. 18
astronomers saw stars with ears
UNIT 5: Example of Inside Probabilities (CYK Algorithm)

Filling the last two cells (grammar as above):

[2,5] VP — two ways to build a VP over "saw stars with ears":
      VP → V NP  : 0.7 × 1.0 × 0.01296 = 0.009072   (V = "saw", NP = "stars with ears")
      VP → VP PP : 0.3 × 0.126 × 0.18  = 0.006804   (VP = "saw stars", PP = "with ears")
      VP = 0.009072 + 0.006804 = 0.015876

[1,5] S — S → NP VP : 1.0 × 0.1 × 0.015876 = 0.0015876

Overall probability of the sentence:
P(w15) = P(t1) + P(t2) = 0.0015876
(This matches the tree-by-tree calculation on the earlier "Tree and String Probabilities" slide.)
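The chart computation above can be summarised in a short program. The following is my own sketch (not course code) of the inside/CYK dynamic-programming recursion for a PCFG in CNF, using the astronomers grammar:

    # Sketch: inside probabilities via a CYK-style chart for a PCFG in CNF.
    # chart[(i, j)][X] = total probability that X derives words i..j (1-based, inclusive).
    from collections import defaultdict

    binary_rules = {                      # X -> Y Z : probability
        ("S", "NP", "VP"): 1.0, ("VP", "V", "NP"): 0.7, ("VP", "VP", "PP"): 0.3,
        ("PP", "P", "NP"): 1.0, ("NP", "NP", "PP"): 0.4,
    }
    lexical_rules = {                     # X -> word : probability
        ("NP", "astronomers"): 0.1, ("NP", "ears"): 0.18, ("NP", "saw"): 0.04,
        ("NP", "stars"): 0.18, ("NP", "telescope"): 0.1,
        ("P", "with"): 1.0, ("V", "saw"): 1.0,
    }

    def inside_chart(words):
        n = len(words)
        chart = defaultdict(lambda: defaultdict(float))
        for i, w in enumerate(words, start=1):            # diagonal: apply lexical rules
            for (X, word), p in lexical_rules.items():
                if word == w:
                    chart[(i, i)][X] += p
        for span in range(2, n + 1):                      # wider spans, bottom-up
            for i in range(1, n - span + 2):
                j = i + span - 1
                for k in range(i, j):                     # split into i..k and k+1..j
                    for (X, Y, Z), p in binary_rules.items():
                        left = chart[(i, k)].get(Y, 0.0)
                        right = chart[(k + 1, j)].get(Z, 0.0)
                        if left and right:
                            chart[(i, j)][X] += p * left * right
        return chart

    chart = inside_chart("astronomers saw stars with ears".split())
    print(chart[(2, 5)]["VP"])   # 0.015876
    print(chart[(1, 5)]["S"])    # 0.0015876, the string probability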
UNIT 5: Example of Inside Probabilities (CYK Algorithm)

S  NP VP 0.8 V  includes 0.05


NP  DT N 0.3 DT  the | a 0.4
VP  V NP 0.2 N  price 0.01
N  facemask 0.02

the price includes a facemask
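A worked check for this exercise (my own arithmetic in Python, assuming DT → the and DT → a each carry the listed probability 0.4); the sentence has a single parse:

    # Sketch: probability of the single parse of "the price includes a facemask".
    np_the_price  = 0.3 * 0.4 * 0.01      # NP -> DT N, DT -> the, N -> price
    np_a_facemask = 0.3 * 0.4 * 0.02      # NP -> DT N, DT -> a,  N -> facemask
    vp = 0.2 * 0.05 * np_a_facemask       # VP -> V NP, V -> includes
    print(0.8 * np_the_price * vp)        # S -> NP VP: about 2.3e-08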

UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 0.8 V  includes 0.05
NP  DT N 0.3 DT  the | a 0.4
VP  V NP 0.2 N  meals 0.01
N  flight 0.02

the flight includes the meals

UNIT 5: Example of Inside Probabilities (CYK Algorithm)
S  NP VP 1.0 Vi  sleeps 1.0
VP  Vi 0.3 Vt  saw 1.0
VP  Vt NP 0.5 NN  man | woman 0.1
VP  VP PP 0.2 NN  telescope 0.3
NP  DT NN 0.8 NN  dog 0.5
NP  NP PP 0.2 DT  the 1.0
PP  IN NP 1.0 IN  with 0.6
IN  in 0.4

the dog saw the man with the telescope
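A worked sketch for this exercise (my own computation from the grammar above): the sentence has two parses, one attaching "with the telescope" to the verb phrase and one attaching it to "the man", and with these particular rule probabilities the two readings come out equal:

    # Sketch: the two PP-attachment parses of "the dog saw the man with the telescope".
    np_dog = 0.8 * 1.0 * 0.5              # NP -> DT NN: "the dog"
    np_man = 0.8 * 1.0 * 0.1              # "the man"
    np_tel = 0.8 * 1.0 * 0.3              # "the telescope"
    pp = 1.0 * 0.6 * np_tel               # PP -> IN NP: "with the telescope"

    vp_attach = 1.0 * np_dog * (0.2 * (0.5 * 1.0 * np_man) * pp)   # VP -> VP PP
    np_attach = 1.0 * np_dog * (0.5 * 1.0 * (0.2 * np_man * pp))   # NP -> NP PP
    print(vp_attach, np_attach)           # both about 0.00046: the grammar does not prefer either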

UNIT 5: Example of Inside Probabilities (CYK Algorithm)

(Grammar and lexicon as on the earlier "A Simple PCFG" slide.)

book the flight through Houston


UNIT 5: Example of Inside Probabilities (CYK Algorithm)

can you book TWA flights


UNIT 5: Problems with the inside-outside algorithm
• Remember: the CYK algorithm is a recognizer (it decides whether a string is valid under the grammar), while the inside algorithm goes further, computing probabilities over all possible sub-structures of the string under the grammar.

• Extremely slow: for each sentence, each iteration of training is O(m³n³) (cubic both in the sentence length and in the number of non-terminals).

• Local maxima are much more of a problem than in HMMs.

• Satisfactory learning requires many more non-terminals than are theoretically needed to describe the language.

• There is no guarantee that the learned non-terminals will be linguistically motivated.
UNIT 5: Treebanks

• English Penn Treebank: the standard corpus for testing syntactic parsing; it consists of about 1.2 million words of text from the Wall Street Journal (WSJ).
• It is typical to train on about 40,000 parsed sentences and test on an additional standard disjoint test set of 2,416 sentences.
• Chinese Penn Treebank: 100K words from the Xinhua news service.
• Other corpora exist in many languages; see the Wikipedia article "Treebank".
• Treebanks are also used to improve machine translation systems.
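For example, a PCFG can be induced directly from treebank trees. The following is a hedged sketch; it assumes NLTK with its bundled 10% sample of the WSJ portion of the Penn Treebank (available via nltk.download('treebank')):

    # Sketch: induce a PCFG from the Penn Treebank sample shipped with NLTK.
    import nltk
    from nltk.corpus import treebank

    productions = []
    for tree in treebank.parsed_sents()[:500]:     # a few hundred parsed WSJ sentences
        productions.extend(tree.productions())     # every rule used in the tree

    grammar = nltk.induce_pcfg(nltk.Nonterminal("S"), productions)
    print(grammar.productions()[:5])               # rules with relative-frequency probabilities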

Other Resources to study NLP
• https://onlinecourses.nptel.ac.in/noc21_cs19
• By Prof. Sourav Mukhopadhyay | IIT Kharagpur
• https://onlinecourses.nptel.ac.in/noc20_cs87
• By Prof. Ramaseshan R | Chennai Mathematical Institute (CMI)
• https://docs.microsoft.com/en-us/learn/paths/explore-natural-language-processing/
• https://www.upgrad.com/machine-learning-nlp-pgc-iiitb
• https://www.amazon.science/latest-news/machine-learning-course-free-online-from-amazon-machine-learning-university
• https://online.stanford.edu/courses/xcs224n-natural-language-processing-deep-learning
• https://courses.cs.washington.edu/courses/cse517/
Other Resources to study NLP
• https://cse.iitk.ac.in/users/cs671/2013/resources.html
• https://home.cs.colorado.edu/~martin/slp.html
• https://web.stanford.edu/~jurafsky/NLPCourseraSlides.html
• https://nlp.stanford.edu/teaching/
• https://tildesites.bowdoin.edu/~allen/nlp/
• https://nlp-iiith.vlabs.ac.in/
• www.purenlp.com
• https://www.nlpworks.com/
• www.compendiumdev.co.uk/nlp

Other Resources to study NLP
• Natural Language Processing, IIT Kharagpur
• Prof. Pawan Goyal
• https://nptel.ac.in/courses/106105158 (Enrolment Opens: 2023-11-09 to 2024-01-29)

• Natural Language Processing, IIT Bombay


• Prof. Pushpak Bhattacharyya
• https://nptel.ac.in/courses/106101007 (Completed and videos are available for self-learning)

• Applied Natural Language Processing, Chennai Mathematical Institute


• Prof. Ramaseshan R
• https://nptel.ac.in/courses/106106211 (Completed and videos are available for self-learning)

• Principles and Parameters in Natural Language, IIT Madras


• Prof. Rajesh Kumar
• https://nptel.ac.in/courses/109106083 (Completed and videos are available for self-learning)
End of Unit-5

Thank You!