
FLAT

Module 2

Context-free languages and pushdown automata

2.1 Context-free grammars (CFG) and Context-free languages (CFL)

CFLs are used by compilers in the parsing phase, as they define the syntax of a programming language; they are also used in many editors.

There are four important components in a grammatical description of a language:

 There is a set of symbols that form the strings of the language being defined. They are called terminal symbols, represented by Vt.
 There is a finite set of variables, called non-terminals. These are represented by Vn.
 One of the variables represents the language being defined; it is called the start symbol. It is represented by S.
 There is a finite set of rules called productions that represent the recursive definition of a language. Each production consists of:

1. A variable that is being defined by the production. This variable is often called the head of the production.
2. The production symbol ->.
3. A string of zero or more terminals and variables.

Formal definition

A context-free grammar (CFG) is a 4-tuple G = (Vn, Vt, S, P), where Vn and Vt are disjoint finite sets, S is an element of Vn, and P is a finite set of productions of the form A -> α, where A ϵ Vn and α ϵ (Vn U Vt)*.
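The 4-tuple can be written down directly as a data structure. A minimal sketch in Python (the example grammar, for {a^n b^n : n > 0}, is an illustrative choice, not from the text):

```python
# A CFG as a 4-tuple G = (Vn, Vt, S, P): non-terminals, terminals,
# start symbol, and productions mapping each head A to its bodies.
Vn = {"S"}
Vt = {"a", "b"}
S = "S"
P = {"S": [("a", "S", "b"), ("a", "b")]}  # S -> aSb | ab

def is_valid_cfg(Vn, Vt, S, P):
    """Check the formal conditions: Vn and Vt disjoint, S in Vn, and
    every production A -> alpha with A in Vn and alpha in (Vn U Vt)*."""
    if Vn & Vt or S not in Vn:
        return False
    return all(head in Vn and all(sym in Vn | Vt for sym in body)
               for head, bodies in P.items() for body in bodies)

print(is_valid_cfg(Vn, Vt, S, P))  # True
```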
2.2 Chomsky and Greibach normal forms

Chomsky normal forms

A context-free grammar (CFG) is in Chomsky Normal Form (CNF) if all production rules satisfy one of the following conditions:

 A non-terminal generating a single terminal (e.g., X -> x)
 A non-terminal generating two non-terminals (e.g., X -> YZ)
 The start symbol generating ε (e.g., S -> ε)

Consider the following grammars,

G1 = {S->a, S->AZ, A->a, Z->z}

G2 = {S->a, S->aZ, Z->a}

The grammar G1 is in CNF, as its production rules satisfy the conditions specified for CNF. However, the grammar G2 is not in CNF, as the production rule S->aZ contains a terminal followed by a non-terminal, which does not satisfy the conditions specified for CNF.

Note –
 CNF is a preprocessing step used in various algorithms.
 Generating a string x of length 'm' requires exactly '2m-1' productions (derivation steps) in CNF.
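The three CNF conditions can be checked mechanically. A small sketch in Python, using the production representation from the formal definition (bodies as tuples of symbols):

```python
def is_cnf(P, Vn, Vt, start):
    """Return True if every production has one of the CNF forms:
    A -> x (single terminal), A -> YZ (two non-terminals),
    or S -> epsilon (empty body, start symbol only)."""
    for head, bodies in P.items():
        for body in bodies:
            if len(body) == 0:                    # A -> epsilon
                if head != start:
                    return False
            elif len(body) == 1:                  # A -> x
                if body[0] not in Vt:
                    return False
            elif len(body) == 2:                  # A -> YZ
                if not all(sym in Vn for sym in body):
                    return False
            else:                                 # body too long for CNF
                return False
    return True

# G1 and G2 from the text
G1 = {"S": [("a",), ("A", "Z")], "A": [("a",)], "Z": [("z",)]}
G2 = {"S": [("a",), ("a", "Z")], "Z": [("a",)]}
print(is_cnf(G1, {"S", "A", "Z"}, {"a", "z"}, "S"))  # True
print(is_cnf(G2, {"S", "Z"}, {"a"}, "S"))            # False: S -> aZ
```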

Greibach normal forms

A CFG is in Greibach Normal Form (GNF) if the productions are in the following forms −
A → b
A → bD1…Dn
S → ε
where A, D1, ..., Dn are non-terminals and b is a terminal.
Algorithm to Convert a CFG into Greibach Normal Form
Step 1 − If the start symbol S occurs on some right side, create
a new start symbol S’ and a new production S’ → S.
Step 2 − Remove Null productions. (Using the Null production
removal algorithm discussed earlier)
Step 3 − Remove unit productions. (Using the Unit production
removal algorithm discussed earlier)
Step 4 − Remove all direct and indirect left-recursion.
Step 5 − Do proper substitutions of productions to convert it into
the proper form of GNF.

Problem
Convert the following CFG into GNF
S → XY | Xn | p
X → mX | m
Y → Xn | o

Solution
Here, S does not appear on the right side of any production and
there are no unit or null productions in the production rule set.
So, we can skip Step 1 to Step 3.
Step 4
Now after replacing
X in S → XY | Xn | p
with
mX | m
we obtain
S → mXY | mY | mXn | mn | p.
And after replacing
X in Y → Xn | o
with the right side of
X → mX | m
we obtain
Y → mXn | mn | o.
Step 5
The bodies mXn and mn still contain the terminal n in a non-leading position, so a new production N → n is added to the production set, and we arrive at the final GNF as the following −
S → mXY | mY | mXN | mN | p
X → mX | m
Y → mXN | mN | o
N → n
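The substitution in Step 4 can be mechanized. A minimal sketch in Python on this grammar, assuming productions are stored as lists of symbol tuples (the representation is illustrative, not from the text):

```python
def substitute_leading(P, target):
    """Replace each production body that starts with the non-terminal
    `target` by one copy per right-hand side of `target` (GNF substitution)."""
    new_P = {}
    for head, bodies in P.items():
        out = []
        for body in bodies:
            if body and body[0] == target:
                # prepend each of target's bodies to the rest of this body
                out.extend(tb + body[1:] for tb in P[target])
            else:
                out.append(body)
        new_P[head] = out
    return new_P

P = {
    "S": [("X", "Y"), ("X", "n"), ("p",)],
    "X": [("m", "X"), ("m",)],
    "Y": [("X", "n"), ("o",)],
}
P = substitute_leading(P, "X")
print(P["S"])  # S -> mXY | mY | mXn | mn | p
print(P["Y"])  # Y -> mXn | mn | o
```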

2.3 Non-deterministic pushdown automata (PDA) and equivalence with CFG

The non-deterministic pushdown automaton (NPDA) is very similar to the NFA. We will discuss some CFLs that are accepted by NPDAs.

Every language accepted by a deterministic PDA is accepted by a non-deterministic PDA as well. However, there are some CFLs which can be accepted only by an NPDA and not by a DPDA. Thus the NPDA is more powerful than the DPDA.

Example:

Design a PDA for palindrome strings.

Solution:

Suppose the language consists of the strings L = {aba, aa, bb, bab, bbabb, aabaa, ...}. A string can be an odd palindrome or an even palindrome. The logic for constructing the PDA is that we push each symbol onto the stack until half of the string has been read; then, for each remaining symbol read, we perform a pop operation and compare whether the symbol popped matches the symbol read. When we reach the end of the input, we expect the stack to be empty.
This PDA is a non-deterministic PDA, because guessing the midpoint of the given string and matching the left half against the right half (in reverse) leads to non-deterministic moves. Here is the ID (instantaneous description).

Simulation of abaaba

δ(q1, abaaba, Z)      Apply rule 1
⊢ δ(q1, baaba, aZ)    Apply rule 5
⊢ δ(q1, aaba, baZ)    Apply rule 4
⊢ δ(q1, aba, abaZ)    Apply rule 7
⊢ δ(q2, ba, baZ)      Apply rule 8
⊢ δ(q2, a, aZ)        Apply rule 7
⊢ δ(q2, ε, Z)         Apply rule 11
⊢ δ(q2, ε)
Accept

2.4 Parse trees

Figure Parse trees

A parse tree is an entity which represents the structure of the derivation of a terminal string from some non-terminal. Key features to define are the root ∈ V and yield ∈ Σ* of each tree.

 For each σ ∈ Σ, there is a tree with root σ and no children; its yield is σ.
 For each rule A → ε, there is a tree with root A and one child ε; its yield is ε.
 If t1, t2, ..., tn are parse trees with roots r1, r2, ..., rn and respective yields y1, y2, ..., yn, and A → r1r2...rn is a production, then there is a parse tree with root A whose children are t1, t2, ..., tn. Its root is A and its yield is the concatenation of yields: y1y2...yn.

Here, parse trees are constructed from the bottom up, not top down. The actual construction of "adding children" should be made more precise, but we intuitively know what's going on.

As an example, here are all the parse (sub)trees used to build the parse tree for the arithmetic expression 4 + 2 * 3 using the expression grammar

E → E + T | E - T | T

T → T * F | F

F → a | (E)

where a represents an operand of some type, be it a number or a variable. The trees are grouped by height.
Figure Example of Parse trees
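The root/yield definitions translate directly into code. A small sketch in Python, representing a tree as a (root, children) pair and computing the yield bottom-up for one parse tree of a + a * a (with a standing for the operands 4, 2, 3):

```python
def tree(root, *children):
    """Build a parse tree node as a (root, children) pair."""
    return (root, list(children))

def yield_of(t):
    """A leaf's yield is its root; an inner node's yield is the
    concatenation of its children's yields."""
    root, children = t
    if not children:
        return root
    return "".join(yield_of(c) for c in children)

# Parse tree for a + a * a under E -> E+T | T, T -> T*F | F, F -> a
t = tree("E",
         tree("E", tree("T", tree("F", tree("a")))),
         tree("+"),
         tree("T",
              tree("T", tree("F", tree("a"))),
              tree("*"),
              tree("F", tree("a"))))
print(yield_of(t))  # a+a*a
```

Note how the * sits lower in the tree than the +, reflecting the precedence that the grammar encodes.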

Parse Trees and Derivations

A derivation is a sequence of strings in V* which starts with a non-terminal in V−Σ and ends with a string in Σ*.

Let's consider the sample grammar

E → E+E | a
We write:

E ⇒ E+E ⇒ E+E+E ⇒a+E+E⇒a+a+E⇒a+a+a

but this is incomplete, because it doesn't tell us where the replacement rules are applied.

We actually need "marked" strings which indicate which non-terminal is replaced in all but the first and last step:

E ⇒ Ě+E ⇒ Ě+E+E ⇒a+Ě+E ⇒a+a+Ě ⇒a+a+a

In this case, the marking is only necessary in the second step; however, it is crucial, because we want to distinguish between this derivation and the following one:

E ⇒ E+Ě ⇒ Ě+E+E ⇒a+Ě+E ⇒a+a+Ě ⇒a+a+a

We want to characterize two derivations as "coming from the same parse tree."

The first step is to define the relation among derivations as being "more left-oriented at one step". Assume we have two equal length derivations of length n > 2:

D: x1⇒ x2⇒ ... ⇒xn

D′: x1′ ⇒ x2′ ⇒ ... ⇒xn′

where x1 = x1′ is a non-terminal and xn = xn′ ∈ Σ*.

Namely they start with the same non-terminal and end at the
same terminal string and have at least two intermediate steps.
Let’s say D < D′ if the two derivations differ in only one step in
which there are 2 non-terminals, A and B, such that D replaces
the left one before the right one and D′ does the opposite.
Formally:

D < D′ if there exists k, 1 < k < n such that

xi = xi′ for all i ≠ k (equal strings, same marked position)

xk-1 = uǍvBw, for u, v, w ∈ V*

xk-1′ = uAvB̌w, for u, v, w ∈ V*

xk =uyvB̌w, for production A → y

xk′ = uǍvzw, for production B → z

xk+1 = xk+1′ = uyvzw (marking not shown)

Two derivations are said to be similar if they belong to the reflexive, symmetric, transitive closure of <.

2.5 Ambiguity in CFG


Suppose we have a context free grammar G with production rules: S-
>aSb|bSa|SS|ɛ

Left most derivation (LMD) and Derivation Tree:

Leftmost derivation of a string from the start symbol S is done by replacing the leftmost non-terminal symbol by the RHS of the corresponding production rule.
For example: The leftmost derivation of the string abab from grammar G above is done as:

S => aSb => abSab => abab

The symbols in bold are replaced using production rules.


Derivation tree: It explains how a string is derived using production rules from S and is shown in Figure.

Figure Derivation tree

Right most derivation (RMD):

It is done by replacing the rightmost non-terminal symbol by the RHS of the corresponding production rule.
For example: The rightmost derivation of the string abab from grammar G above is done as:

S => SS => SaSb => Sab => aSbab => abab

The symbols in bold are replaced using production rules.


The derivation tree for abab using rightmost derivation is shown in Figure.

Figure Right most derivation

A derivation can be either LMD or RMD or both or none. For example:

S => aSb => abSab => abab is LMD as well as RMD,

but S => SS => SaSb => Sab => aSbab => abab is RMD but not LMD.

Ambiguous Context Free Grammar:

 A context free grammar is called ambiguous if there exists more than one LMD or RMD for some string generated by the grammar.
 There will also be more than one derivation tree for such a string in an ambiguous grammar.
 The grammar described above is ambiguous because there are two derivation trees for the string abab.
 There is more than one RMD for the string abab:

S => SS => SaSb => Sab => aSbab => abab

S => aSb => abSab => abab
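For a single string, ambiguity can be checked by brute force: enumerate all leftmost derivations of abab under G and count them. A bounded-depth sketch in Python (the step limit and the pruning strategy are illustrative choices):

```python
GRAMMAR = ["aSb", "bSa", "SS", ""]  # the productions of S

def leftmost_derivations(form, target, steps=8):
    """Yield complete leftmost derivations (as lists of sentential forms)
    of `target` from `form`, using at most `steps` rule applications."""
    i = form.find("S")
    if i == -1:                       # no non-terminal left: done?
        if form == target:
            yield [form]
        return
    if steps == 0:
        return
    # prune: the terminal prefix must match the target, and the form
    # may not already contain more terminals than the target
    if not target.startswith(form[:i]) or len(form) - form.count("S") > len(target):
        return
    for rhs in GRAMMAR:               # replace the LEFTMOST S
        new_form = form[:i] + rhs + form[i + 1:]
        for rest in leftmost_derivations(new_form, target, steps - 1):
            yield [form] + rest

derivs = list(leftmost_derivations("S", "abab"))
print(len(derivs) > 1)                    # True: several LMDs => ambiguous
print(" => ".join(min(derivs, key=len)))  # S => aSb => abSab => abab
```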

2.6 Pumping lemma for context-free languages

The Pumping Lemma for CFLs states that for every context-free language L there is a constant n such that every string z ∈ L with |z| ≥ n can be written as z = uvwxy, with |vwx| ≤ n and |vx| ≥ 1, so that uv^i wx^i y ∈ L for every i ≥ 0.

Lemma: The language L = {a^m b^m c^m : m ≥ 0} is not context free.

Proof (By contradiction)

Assume that this language is context-free; hence it will have a context-free grammar.
Let n be the constant of the Pumping Lemma.
Consider the string z = a^n b^n c^n, whose length is greater than n.

By the Pumping Lemma this is represented as z = uvwxy, such that all uv^i wx^i y are also in L, which is not possible, as:

either v or x would contain letters of more than one kind, so that the pumped string uv^2 wx^2 y has its letters in the wrong order;

or v and x each consist only of a's, only of b's, or only of c's, in which case pumping cannot maintain the balance amongst the three letters.
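The argument can be checked exhaustively for a small instance: with z = a^4 b^4 c^4 and constant n = 4, no decomposition z = uvwxy with |vwx| ≤ n and |vx| ≥ 1 pumps. A brute-force sketch in Python:

```python
def in_L(s):
    """Membership in {a^m b^m c^m : m >= 0}."""
    m = len(s) // 3
    return s == "a" * m + "b" * m + "c" * m

def some_decomposition_pumps(z, n):
    """Search for u, v, w, x, y with |vwx| <= n and |vx| >= 1 such that
    u v^i w x^i y stays in L for i = 0 and i = 2."""
    for i in range(len(z)):                               # vwx starts at i
        for k in range(i + 1, min(i + n, len(z)) + 1):    # vwx = z[i:k]
            u, vwx, y = z[:i], z[i:k], z[k:]
            for p in range(len(vwx) + 1):                 # v = vwx[:p]
                for q in range(p, len(vwx) + 1):          # w = vwx[p:q]
                    v, w, x = vwx[:p], vwx[p:q], vwx[q:]
                    if not (v or x):                      # need |vx| >= 1
                        continue
                    if all(in_L(u + v * t + w + x * t + y) for t in (0, 2)):
                        return True
    return False

z = "aaaabbbbcccc"                    # a^4 b^4 c^4
print(some_decomposition_pumps(z, 4))  # False: no decomposition pumps
```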

Lemma: The language L = {a^m b^k c^l : m < k and m < l} is not context free.

Proof (By contradiction)

Assume that this language is context-free; hence it will have a context-free grammar.
Let n be the constant of the Pumping Lemma.
Consider the string z = a^n b^(n+1) c^(n+1), whose length is greater than n.
By the Pumping Lemma this must be represented as z = uvwxy, such that all uv^i wx^i y are also in L.

- As mentioned previously, neither v nor x may contain a mixture of symbols.

- Suppose v consists of a's. Then pumping up increases the number of a's, and x cannot generate enough letters to keep both the b's and the c's more numerous than the a's (it can do it for one or the other of them, not both). Similarly, x cannot consist of just a's.

- So suppose then that v or x contains only b's or only c's. Consider the string uwy (the case i = 0), which must be in L. Since we have dropped both v and x, we have at least one b or one c less than we had in z, which was a^n b^(n+1) c^(n+1). Consequently, this string no longer has enough b's or enough c's to be a member of L.

2.7 Deterministic pushdown automata


 Machine transitions are based on the current state and input symbol,
and also the current topmost symbol of the stack.
 Symbols lower in the stack are not visible and have no immediate
effect. Machine actions include pushing, popping, or replacing the stack
top.
 A deterministic pushdown automaton has at most one legal transition
for the same combination of input symbol, state, and top stack symbol.
 This is where it differs from the nondeterministic pushdown automaton.
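Determinism is visible in a concrete machine: keying the transition table by (state, input symbol, stack top) allows at most one move per combination. A sketch in Python of a DPDA for {a^n b^n : n ≥ 1} (the machine itself is an illustrative example, not from the text):

```python
# delta: (state, input_symbol, stack_top) -> (next_state, push_string).
# The stack is a string whose last character is the top; Z is the bottom marker.
DELTA = {
    ("q0", "a", "Z"): ("q0", "ZA"),   # push an A for each a
    ("q0", "a", "A"): ("q0", "AA"),
    ("q0", "b", "A"): ("q1", ""),     # first b: start popping
    ("q1", "b", "A"): ("q1", ""),     # pop one A per b
}

def dpda_accepts(w):
    """Run the DPDA; accept if the input is consumed in q1 with only Z left.
    Because DELTA is a dict keyed by (state, input, top), there is at most
    one legal transition for any combination: the machine is deterministic."""
    state, stack = "q0", "Z"
    for c in w:
        key = (state, c, stack[-1])
        if key not in DELTA:
            return False               # no legal move: reject
        state, push = DELTA[key]
        stack = stack[:-1] + push      # pop the top, push the replacement
    return state == "q1" and stack == "Z"

for s in ["ab", "aaabbb", "aab", "abab"]:
    print(s, dpda_accepts(s))
```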
2.8 Closure properties of CFLs
They are closed under −
 Union
 Concatenation
 Kleene Star operation

Union

Let A1 and A2 be two context free languages. Then A1 ∪ A2 is also context free.

Example

Let A1 = { a^n b^n , n > 0}. The corresponding grammar G1 will have P: S1 → aS1b | ab
Let A2 = { c^m d^m , m ≥ 0}. The corresponding grammar G2 will have P: S2 → cS2d | ε
Union of A1 and A2: A = A1 ∪ A2 = { a^n b^n } ∪ { c^m d^m }
The corresponding grammar G will have the additional production S → S1 | S2

Concatenation

If A1 and A2 are context free languages, then A1A2 is also context free.

Example

Concatenation of the languages A1 and A2: A = A1A2 = { a^n b^n c^m d^m }

The corresponding grammar G will have the additional production S → S1 S2

Kleene Star

If A is a context free language, then A* is also context free.

Example

Let A = { a^n b^n , n ≥ 0}. The corresponding grammar G will have P: S → aSb | ε

Kleene Star: L1 = { a^n b^n }*

The corresponding grammar G1 will have the additional productions
S1 → S S1 | ε
where S1 is the new start symbol.
1

Context-free languages are not closed under −

 Intersection − If A1 and A2 are context free languages, then A1 ∩ A2 is not necessarily context free.
 Complement − If A1 is a context free language, then A1′ (the complement of A1) may not be context free.

However, CFLs are closed under intersection with a regular language: if A1 is a regular language and A2 is a context free language, then A1 ∩ A2 is a context free language.
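The three positive closure constructions can be written out on a dict-of-productions representation. A minimal sketch in Python (assuming the two grammars' non-terminal sets are already disjoint, e.g. by renaming):

```python
def union(P1, S1, P2, S2, S="S"):
    """Grammar for A1 ∪ A2: all productions plus S -> S1 | S2."""
    return {**P1, **P2, S: [(S1,), (S2,)]}

def concat(P1, S1, P2, S2, S="S"):
    """Grammar for A1 A2: all productions plus S -> S1 S2."""
    return {**P1, **P2, S: [(S1, S2)]}

def star(P, S_old, S="S"):
    """Grammar for A*: all productions plus S -> S_old S | epsilon."""
    return {**P, S: [(S_old, S), ()]}

P1 = {"S1": [("a", "S1", "b"), ("a", "b")]}   # A1 = {a^n b^n, n > 0}
P2 = {"S2": [("c", "S2", "d"), ()]}           # A2 = {c^m d^m, m >= 0}

G = union(P1, "S1", P2, "S2")
print(G["S"])   # the new start symbol chooses S1 or S2
```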

Reference books

1. Harry R. Lewis and Christos H. Papadimitriou, Elements of the Theory of Computation, Pearson Education Asia.
2. Dexter C. Kozen, Automata and Computability, Undergraduate Texts in Computer Science, Springer.
3. Michael Sipser, Introduction to the Theory of Computation, PWS Publishing.
4. John Martin, Introduction to Languages and the Theory of Computation, Tata McGraw Hill.
