0% found this document useful (0 votes)

13 views

CYK Parsing Notes

The CYK algorithm is a parsing technique for context-free grammar that checks if a string can be derived from a given grammar, requiring the grammar to be in Chomsky Normal Form (CNF). It utilizes dynamic programming to construct a parsing table, allowing for efficient parsing with a complexity of O(n³) and the ability to handle ambiguous grammars. The document outlines the steps to convert a grammar to CNF, construct a CYK parsing table, and highlights the advantages of the CYK algorithm in various applications.

Uploaded by

bhowmikpinki59

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

CYK Parsing Notes

Uploaded by

bhowmikpinki59

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

CYK PARSING

• CYK algorithm is a parsing algorithm for context free grammar.

• It was independently developed by three Russian scientists
named Cocke, Younger, and Kasami, hence the name CYK!
• t is used to check if a particular string can be derived from the language
generated by a given grammar. It is also called the membership algorithm as it
tells whether the given string is a member of the given grammar or not.
• To apply CYK algorithm to a grammar, it must be in Chomsky Normal Form.
• It uses a dynamic programming algorithm to tell whether a string is in the
language of a grammar.

Basics

In a CYK algorithm, the structure of grammar should be in Chomsky normal form.

In addition, the CYK algorithm uses the dynamic programming or table filling algorithm.

CNF

• Chomsky Normal Form (CNF) is a way to simplify context-free grammars (CFGs)

so that all production rules follow specific patterns.
• In CNF, each rule either produces two non-terminal symbols, a single terminal
symbol, or, in some cases, the empty string.
• Converting a CFG to CNF is an important step in many parsing algorithms, like
the CYK algorithm, and helps in understanding the structure of languages.

CNF Rules

• It has starting symbol

• It has LHS and RHS
• LHS will have non-terminal
• RHS can contain:
▪ Single terminal
▪ Two non-terminal
• RHS should not contain:
▪ Combination of terminal and non-terminal
▪ Single non-terminal

The grammar will be in CNF if each rule has one of the following forms:

• A→BC (at most two variables on the right-hand side)

• A→ a (a single terminal on the right-hand side)

• S→Ø (null string)

If the given Grammar is not in the CNF form, convert it to the CNF form before applying
the CYK Algorithm.

Step 1:

Original Grammar

S → NP VP

NP → Det N | N

VP → V NP

Det → 'the' | 'a'

N → 'cat' | 'dog'

V → 'chased'

Step 2: Convert to CNF

S → NP VP

NP → Det N

VP → V NP

Det → 'the' | 'a'

N → 'cat' | 'dog'

V → 'chased'

The above is already in CNF form

CYK Parsing Table Construction

We construct a CYK table where row i, column j represents the non-terminals that
can derive the substring from position iii to jjj.

Sentence: "The cat chased a dog"

Count the number of words in the sentence and construct a matrix accordingly.

i.e., 5 x 5 matrix (rows from 1 to 5 and columns from 0 to 4)

0 the 1 cat 2 chased 3 a 4 dog 5

1 2 3 4 5
[0,1] [1,2]
0 Det
1 N
2
3
4

Check the CNF rule and find if there are any non-terminal contains Det and V;

Yes, we have :

NP → Det N

1 2 3 4 5
[0,1] [1,2]
0 Det NP

1
N
2

Next, we have ‘chased’ comes in between 2 and 3

1 2 3 4 5
[0,1] [1,2] [2,3]
0 Det NP

1
N
2 V

4
Next , we have ‘a’ which occurs between 3 and 4 and similarly ‘dog’ occurs between 4
and 5:

1 2 3 4 5
[0,1] [1,2] [2,3] [3,4] [4,5]

0 Det NP

1
N
2 V

3 Det NP

4
N

Finally go up one by one, V and NP is there in the rule, so we can combine

1 2 3 4 5
[0,1] [1,2] [2,3] [3,4] [4,5]

0 Det NP

1
N

2 V VP

3 Det NP

4
N
Now, check VP and NP in the rule, which is there in S→ NP VP

1 2 3 4 5
[0,1] [1,2] [2,3] [3,4] [4,5]

0 Det NP S

1
N

2 V VP

3 Det NP

4
N

Finally S is obtained, so can end here.

Advantages:

• Guaranteed Completeness – Ensures all valid parses are found.

• Efficient (O(n³) Complexity) – Uses dynamic programming for faster parsing.
• Handles Ambiguous Grammars – Finds all possible parses for a sentence.
• Supports Probabilistic Parsing (PCFGs) – Determines the most probable
parse.
• Foundation for Modern Parsers – Used in NLP and machine learning models.
• Useful in Compiler Design – Helps in syntax analysis and syntax tree
generation.
• Works Well for Large Sentences – More efficient than top-down approaches.

YSQ S3 Scoring Sheet Finala
86% (22)
YSQ S3 Scoring Sheet Finala
2 pages
OSPF Demystified With RFC: Request For Comments Translated Into Practice
From Everand
OSPF Demystified With RFC: Request For Comments Translated Into Practice
Redouane MEDDANE
5/5 (1)
GP 43-46-DRAFT - Pipeline Pre-Commissioning
100% (2)
GP 43-46-DRAFT - Pipeline Pre-Commissioning
33 pages
Risk Management Plan
100% (7)
Risk Management Plan
32 pages
NLPPR6
No ratings yet
NLPPR6
6 pages
CYK Algorithm
No ratings yet
CYK Algorithm
6 pages
TIC 2151 - Theory of Computation: Context-Free Grammars (CFG)
No ratings yet
TIC 2151 - Theory of Computation: Context-Free Grammars (CFG)
23 pages
CKY) Cocke-Kasami-Younger) Earley Parsing Algorithms: Dor Altshuler
No ratings yet
CKY) Cocke-Kasami-Younger) Earley Parsing Algorithms: Dor Altshuler
81 pages
Chomshky Notes
No ratings yet
Chomshky Notes
8 pages
NATURAL LANGUAGE PROCESSING
No ratings yet
NATURAL LANGUAGE PROCESSING
5 pages
Thuật toán NLP
No ratings yet
Thuật toán NLP
57 pages
Lesson 44
No ratings yet
Lesson 44
43 pages
Constituency Parsing Ppt 2
No ratings yet
Constituency Parsing Ppt 2
33 pages
Parsing
No ratings yet
Parsing
27 pages
Lecture7 PDF
No ratings yet
Lecture7 PDF
40 pages
lec26-dynamic-programming-7
No ratings yet
lec26-dynamic-programming-7
57 pages
Unit 4 Earley Parser
No ratings yet
Unit 4 Earley Parser
56 pages
Pda Annotated 10 12 2021
No ratings yet
Pda Annotated 10 12 2021
37 pages
Normal Forms For Context-Free Grammars
No ratings yet
Normal Forms For Context-Free Grammars
57 pages
Lecture 10
No ratings yet
Lecture 10
24 pages
Unit 3 NLP 8 Question
No ratings yet
Unit 3 NLP 8 Question
6 pages
Unit 4 CYK Algo Slides
No ratings yet
Unit 4 CYK Algo Slides
60 pages
CYK Algorithm - A Haskell Implementation
No ratings yet
CYK Algorithm - A Haskell Implementation
3 pages
Xu-Ly-Ngon-Ngu-Tu-Nhien - Kai-Wei-Chang - 16-Cky - (Cuuduongthancong - Com)
No ratings yet
Xu-Ly-Ngon-Ngu-Tu-Nhien - Kai-Wei-Chang - 16-Cky - (Cuuduongthancong - Com)
61 pages
Pumping Lemma (Bar Hillel Lemma)
No ratings yet
Pumping Lemma (Bar Hillel Lemma)
49 pages
Cky CNF
No ratings yet
Cky CNF
112 pages
The CYK Algorithm
No ratings yet
The CYK Algorithm
9 pages
Theory of Computation: Automata Theory (CFG, CFL, CNF)
No ratings yet
Theory of Computation: Automata Theory (CFG, CFL, CNF)
39 pages
Addis Ababa University College of Natural and Computational Science
No ratings yet
Addis Ababa University College of Natural and Computational Science
3 pages
Chapter 05 - Pushdown Automata
No ratings yet
Chapter 05 - Pushdown Automata
32 pages
NLP Module 3
No ratings yet
NLP Module 3
11 pages
CFG & PCFG
No ratings yet
CFG & PCFG
15 pages
TOC 3IS(cs)
No ratings yet
TOC 3IS(cs)
24 pages
Lecture 09 CNF and DFA minimization
No ratings yet
Lecture 09 CNF and DFA minimization
14 pages
4 Predctive Parser
No ratings yet
4 Predctive Parser
59 pages
NPTEL_NLP_Assignment_5 (1)
No ratings yet
NPTEL_NLP_Assignment_5 (1)
4 pages
SLoSP 2007 2
No ratings yet
SLoSP 2007 2
45 pages
Homework 5 Solutions
No ratings yet
Homework 5 Solutions
7 pages
Chapter 05 - Pushdown Automata
No ratings yet
Chapter 05 - Pushdown Automata
31 pages
Lecture 3: Text Processing & Minimum Edit Distance Algorithm
No ratings yet
Lecture 3: Text Processing & Minimum Edit Distance Algorithm
57 pages
TOC II Updated
No ratings yet
TOC II Updated
41 pages
Chapter Three
No ratings yet
Chapter Three
37 pages
unit-4
No ratings yet
unit-4
45 pages
Chapter 3 - Context Free Languages
No ratings yet
Chapter 3 - Context Free Languages
59 pages
Homework 5 Solutions
No ratings yet
Homework 5 Solutions
6 pages
201 2018 2 b-22 PDF
No ratings yet
201 2018 2 b-22 PDF
21 pages
Lect 11
No ratings yet
Lect 11
7 pages
EECE 338 - Course Summary
No ratings yet
EECE 338 - Course Summary
16 pages
Flat Module 3
No ratings yet
Flat Module 3
18 pages
13-Shallow Parsing-05-09-2024
No ratings yet
13-Shallow Parsing-05-09-2024
62 pages
5c-partB-CFG and PDA
No ratings yet
5c-partB-CFG and PDA
57 pages
Module-4 Normal Forms
No ratings yet
Module-4 Normal Forms
63 pages
CYK Parsing 93de85edf167a22e7bfc2086b85b56e2
No ratings yet
CYK Parsing 93de85edf167a22e7bfc2086b85b56e2
60 pages
Unit-3 Aim 502
No ratings yet
Unit-3 Aim 502
14 pages
Context Free Languages: Context Free Grammars Parsing Arithmetic Expression Removing λ-productions Normal forms
No ratings yet
Context Free Languages: Context Free Grammars Parsing Arithmetic Expression Removing λ-productions Normal forms
24 pages
Semiring Parsing
No ratings yet
Semiring Parsing
34 pages
NLP Session 16 - Post Midsem Review
No ratings yet
NLP Session 16 - Post Midsem Review
189 pages
Hwsoln 05
No ratings yet
Hwsoln 05
6 pages
CS372 Formal Languages & The Theory of Computation
No ratings yet
CS372 Formal Languages & The Theory of Computation
33 pages
Assignment 5 (COPY)
No ratings yet
Assignment 5 (COPY)
5 pages
The Cocke-Younger-Kasami Algorithm
No ratings yet
The Cocke-Younger-Kasami Algorithm
12 pages
Lecture 09
No ratings yet
Lecture 09
34 pages
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
From Everand
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
POONAM DEVI
No ratings yet
50HE-3020 Waterjet Cutting Machine Specification
No ratings yet
50HE-3020 Waterjet Cutting Machine Specification
12 pages
Dasdasdasd 1
No ratings yet
Dasdasdasd 1
3 pages
Download ebooks file Clinical Bioinformatics 2nd Edition Ronald Trent (Eds.) all chapters
No ratings yet
Download ebooks file Clinical Bioinformatics 2nd Edition Ronald Trent (Eds.) all chapters
25 pages
CRN6593289507
No ratings yet
CRN6593289507
3 pages
Invoice
No ratings yet
Invoice
1 page
Roi Interactive Price List
No ratings yet
Roi Interactive Price List
72 pages
APPEA Guidelines For Lifting Equipment APPENDICIES
100% (4)
APPEA Guidelines For Lifting Equipment APPENDICIES
60 pages
Wi Cswip 3.1 Part 4
No ratings yet
Wi Cswip 3.1 Part 4
10 pages
Lecture 13 Forage Conservation
No ratings yet
Lecture 13 Forage Conservation
29 pages
RSOvam2201v1 3
No ratings yet
RSOvam2201v1 3
40 pages
Electric Motor Cooling Systems: Welkon Limited
100% (1)
Electric Motor Cooling Systems: Welkon Limited
6 pages
Anna Mulcahy Leadership
No ratings yet
Anna Mulcahy Leadership
10 pages
Risk Assessment for Soil Test.
No ratings yet
Risk Assessment for Soil Test.
7 pages
Numerical Methods and Optimization APR-2015 Course 2012 in Sem-II 30 Marks (TE MECHANICAL)
No ratings yet
Numerical Methods and Optimization APR-2015 Course 2012 in Sem-II 30 Marks (TE MECHANICAL)
2 pages
The Mismeasure of Man Gould PDF
0% (2)
The Mismeasure of Man Gould PDF
3 pages
1 Quantity Estimation (Foundation and Load Bearing Wall
No ratings yet
1 Quantity Estimation (Foundation and Load Bearing Wall
30 pages
Solving Quadratics Using Graphs
No ratings yet
Solving Quadratics Using Graphs
2 pages
Kia L. Steele: Intern
100% (1)
Kia L. Steele: Intern
1 page
Roland BK-9 Addendum V 1.05
No ratings yet
Roland BK-9 Addendum V 1.05
8 pages
Teks Story Telling Maling Kundang
No ratings yet
Teks Story Telling Maling Kundang
2 pages
BOM Assignment
No ratings yet
BOM Assignment
6 pages
4.2 OECD: Wilson, K., Ch. 5: Entrepreneurship Education in Europe
No ratings yet
4.2 OECD: Wilson, K., Ch. 5: Entrepreneurship Education in Europe
1 page
CH 02 - PL
No ratings yet
CH 02 - PL
92 pages
TDB01 R Jameco Valuepro PDF
No ratings yet
TDB01 R Jameco Valuepro PDF
1 page
Kyle Rhayne D. Caliwag Michaella Kenaizan S. Bacani Millen Grace C. Deang Ma. Bench Niña T. Delacruz Killua G. Pilapil
No ratings yet
Kyle Rhayne D. Caliwag Michaella Kenaizan S. Bacani Millen Grace C. Deang Ma. Bench Niña T. Delacruz Killua G. Pilapil
46 pages
Brand EMI Activation by PSA App
No ratings yet
Brand EMI Activation by PSA App
10 pages
Zoe Chuia Resort: Bathroom Cleaning Checklist
100% (1)
Zoe Chuia Resort: Bathroom Cleaning Checklist
6 pages

CYK Parsing Notes

Uploaded by

CYK Parsing Notes

Uploaded by

CYK PARSING

• CYK algorithm is a parsing algorithm for context free grammar.

In a CYK algorithm, the structure of grammar should be in Chomsky normal form.

• Chomsky Normal Form (CNF) is a way to simplify context-free grammars (CFGs)

• It has starting symbol

• A→BC (at most two variables on the right-hand side)

• A→ a (a single terminal on the right-hand side)

• S→Ø (null string)

Det → 'the' | 'a'

Step 2: Convert to CNF

Det → 'the' | 'a'

The above is already in CNF form

CYK Parsing Table Construction

Sentence: "The cat chased a dog"

i.e., 5 x 5 matrix (rows from 1 to 5 and columns from 0 to 4)

Next, we have ‘chased’ comes in between 2 and 3

Finally go up one by one, V and NP is there in the rule, so we can combine

Finally S is obtained, so can end here.

• Guaranteed Completeness – Ensures all valid parses are found.

You might also like