0% found this document useful (0 votes)

41 views56 pages

CMP3008 LN5 ContextFreeGrammars

The document provides information about non-regular languages, the pumping lemma, and context-free grammars. It discusses how languages that are not regular require recognizers beyond finite automata. The pumping lemma is introduced as a tool to prove that a language is not regular by showing it does not satisfy the pumping property. Context-free grammars are presented as a way to describe languages larger than regular languages using recursive rules. Examples are given to demonstrate applying the pumping lemma and generating strings from context-free grammars.

Uploaded by

Ammar Jagadhita

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views56 pages

CMP3008 LN5 ContextFreeGrammars

Uploaded by

Ammar Jagadhita

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 56

CMP3008

Formal Languages
and Automata Theory
Lecture Notes 5
Nonregular Languages, Pumping Lemma and
Context Free Grammars
Sources
https://eecs.wsu.edu/~ananth/CptS317/Lectures/index.htm
"Introduction to automata theory, languages and
computation" by JE Hopcroft, R Motwani and JD Ullman.
" An Introduction to Formal Languages and Automata Theory" by
Peter Linz 1
Content
• Non-Regular Languages
• Pumping Lemma
• Context-Free Grammars
• Ambiguity in CFG
• Chomsky Normal Form

2
Not all languages are regular
• So what happens to the languages which are not regular?

• Can we still come up with a language recognizer?

• i.e., something that will accept (or reject) strings that belong (or do not
belong) to the language?

3
Non-Regular Languages
• Question: What are the limitations of finite automata, i.e. DFAs (or
NFAs)?
• Can we find a DFA for the language B = {0n1m| n ≥ 0, m ≥ 0}
• What about B = {0n1n| n ≥ 0}?
• The language B = {0n1n| n ≥ 0} is nonregular because the number of
0s isn’t limited, the machine will have to keep track of an unlimited
number of possibilities. But it cannot do so with any finite number of
states.
Non-Regular Languages
• We need a proof to show that a given language is not regular
• Question: Doesn’t the argument already given prove nonregularity
because the number of 0s is unlimited?
• No 
• A language requiring unbounded memory doesn’t mean that it is not
regular
• There are languages seem to require an unlimited number of
possibilities, yet actually they are regular
Non-Regular Languages
• For example, consider two languages over the alphabet Σ = {0,1}:
• C = {w| w has an equal number of 0s and 1s},
• D = {w| w has an equal number of occurrences of 01 and 10 as substrings
• Can we design a DFA for C and/or D?
• For C, no
• But for D, yes! -> DFA or NFA?
• So, we need a proof!
Pumping Lemma
• Pumping lemma theorem states that all regular languages have a
special property.
• If we can show that a language does not have this property, we are
guaranteed that it is not regular.
• The property states that all strings in the language can be “pumped”
if they are at least as long as a certain special value, called the
pumping length.
• That means each such string contains a section that can be repeated
any number of times with the resulting string remaining in the
language
Formal Definition of Pumping Lemma
Formal Definition of Pumping Lemma
• When s is divided into xyz, either x or z may be ε, but condition 2 says
that y ≠ ε.
• Observe that without condition 2 the theorem would be trivially true.
• Condition 3 states that the pieces x and y together have length at
most p.
• It is an extra technical condition that we occasionally find useful when
proving certain languages to be nonregular
How to use Pumping Lemma?
• To use the pumping lemma to prove that a language B is not regular,
• First assume that B is regular in order to obtain a contradiction.
• Then use the pumping lemma to guarantee the existence of a
pumping length p such that all strings of length p or greater in B
can be pumped.
• Next, find a string s in B that has length p or greater but that
cannot be pumped.
• Finally, demonstrate that s cannot be pumped by considering all
ways of dividing s into x, y, and z (taking condition 3 of the
pumping lemma into account if convenient) and, for each such
division, finding a value i where xyi z is not a member of B.
Example 1
• Let B = {0n1n}| n ≥ 0}. We use the pumping lemma to prove that B is
not regular. The proof is by contradiction.
• Assume to the contrary that B is regular. Let p be the pumping length
given by the pumping lemma. Choose s to be the string 0p1p.
• Because s is a member of B and s has length more than p, the
pumping lemma guarantees that s can be split into three pieces, s =
xyz, where for any i ≥ 0 the string xyiz is in B. We consider three cases
to show that this result is impossible.
Example 1 (cont’d)
• The string y consists only of 0s. In this case, the string xyyz has more
0s than 1s and so is not a member of B, violating condition 1 of the
pumping lemma. This case is a contradiction.
• The string y consists only of 1s. This case also gives a contradiction.
• The string y consists of both 0s and 1s. In this case, the string xyyz
may have the same number of 0s and 1s, but they will be out of order
with some 1s before 0s. Hence it is not a member of B, which is a
contradiction.
Example 2
• C = {w | w has an equal number of 0s and 1s}
• Assume C is regular
• Let s be the string 0p1p.
• With s being a member of C and having length more than p, the
pumping lemma guarantees that s can be split into three pieces,
s = xyz, where for any i ≥ 0 the string xyiz is in C.
• Let’s show that this is not possible!
Example 2
• If we let x and z be the empty string and y be the string 0p1p, then xyiz
always has an equal number of 0s and 1s and hence is in C. So it
seems that s can be pumped.
• But! Here condition 3 in the pumping lemma is useful.
• It stipulates that when pumping s, it must be divided so that |xy| ≤ p.
• If |xy| ≤ p, then y must consist only of 0s, so xyyz is not in C.
• Therefore, s cannot be pumped. That gives us the desired
contradiction.
Example 2
• Can we show the same for s = (01)p which is also a member of C?
• Can we pump it?
• x = ε, y = 01, and z = (01)p−1. Then xyiz ∈ C for every value of i.
Example 3
• F = {ww | w ∈ {0,1}*}
• Assume that F is regular
• s = 0p1p0p1p
• 00000111110000011111
• It is not possible to find a y in the first p number of 0’s such that if we
pump y the resulting string is in F.
• s = 0p10p1 is another good choice
• s = 0p0p not a good choice
Example 4
• E = {0i1j| i > j}
• Assume that E is regular
• s = 0p+11p
• 0000 0 011111 (if p is 5)
• When y = 0 or y = (0)p , removing y (xy0z) will reduce the number of
zeros and hence, the resulting string will not be in E, so we have a
contradiction.
Example 5
• A nonregular unary language:

• D contains all strings of 1s whose length is a perfect square.

Note the growing gap between successive members of this sequence.

Large members of this sequence cannot be near each other 18
Example 5
• A nonregular unary language:

19
Not all languages are regular
• So what happens to the languages which are not regular?

• Can we still come up with a language recognizer?

• i.e., something that will accept (or reject) strings that belong (or do not
belong) to the language?

20
Context-Free Languages
• A language class larger than the class of regular languages

• Supports natural, recursive notation called “context-free grammar”

• Applications:
• Parse trees, compilers
• XML
Context-
Regular free
(FA/RE)
(PDA/CFG)

21
An Example
• A palindrome is a word that reads identical from both ends
• E.g., madam, redivider, malayalam, 010010010
• Let L = { w | w is a binary palindrome}
• Is L regular?
• No.
• Proof:
• Let w=0p10p (assuming N to be the p/l constant)

• By Pumping lemma, w can be rewritten as xyz, such that xyiz is also L (for any i≥0)
• But |xy|≤p and y≠
• ==> y=0+
• ==> xyiz will NOT be in L for i=0
• ==> Contradiction

22
But the language of palindromes…
is a CFL, because it supports recursive substitution (in the form of a
CFG)
• This is because we can construct a “grammar” like this:
1. A ==> 
2. A ==> 0 Terminal
Same as:
Productions 3. A ==> 1 A => 0A0 | 1A1 | 0 | 1 | 
4. A ==> 0A0
5. A ==> 1A1 Variable or non-terminal

How does this grammar work?

23
How does the CFG for palindromes work?
An input string belongs to the language (i.e., accepted) iff it can be
generated by the CFG
G:
• Example: w=01110
A => 0A0 | 1A1 | 0 | 1 | 
• G can generate w as follows:

Generating a string from a grammar:

1. A => 0A0
1. Pick and choose a sequence
2. => 01A10 of productions that would
3. => 01110 allow us to generate the
string.
2. At every step, substitute one variable
with one of its productions.

24
Example
• Example context free grammar G1:

A → 0A1
A→B
B→#

• 3 Substitution rules (productions)

• Variables = {A, B}
• Terminals = {0, 1, #}
• Start variable = A
Derivation
• For example, grammar G1 generates the string 000#111.
A → 0A1
A→B
B →#

• The sequence of substitutions to obtain a string is called a derivation.

A derivation of string 000#111 in grammar G1 is
• A ⇒ 0A1 ⇒ 00A11 ⇒ 000A111 ⇒ 000B111 ⇒ 000#111.
Parse Trees
• Each CFG can be represented using a parse tree:
• Each internal node is labeled by a variable in V
• Each leaf is terminal symbol
• For a production, A==>X1X2…Xk, then any internal node labeled A has k
children which are labeled from X1,X2,…Xk from left to right

Parse tree for production and all other subsequent productions:

A ==> X1..Xi..Xk A

X1 … Xi … Xk

27
Examples

Recursive inference
A
E + E
0 A 0
F F

Derivation
1 A 1
a 1


Parse tree for 0110

Parse tree for a + 1
G: G:
E => E+E | E*E | (E) | F A => 0A0 | 1A1 | 0 | 1 | 
F => aF | bF | 0F | 1F | 0 | 1 | a | b
28
Parse Tree
Examples
• Can the following strings be derived from G1:

0#1 A ⇒ 0A1 ⇒ 0B1 ⇒ 0#1 A → 0A1

A→B
0#11 Cannot be derived. B→#

# A⇒B⇒#
Language of the grammar.
• All strings generated in this way constitute the language of the
grammar. We write L(G1) for the language of grammar G1.
• Some experimentation with the grammar G1 shows us that L(G1) is:
A → 0A1
A→B
B→#

{0n#1n| n ≥ 0}
“|” symbol
For convenience when presenting a context-free grammar, we
abbreviate several rules with the same left-hand variable, such as

A → 0A1 and A → B

into a single line

A → 0A1 | B

using the symbol “|” as an “or”.

Grammar G2
Examples
Strings in L(G2) include:
• a boy sees
• the boy sees a flower
• a girl with a flower likes the boy
Derivation of “a boy sees”
FORMAL DEFINITION OF A CONTEXT-FREE
GRAMMAR
Example
Design a CFG for the following language:
L = {w | w is a properly nested parentheses}
(), (()), (()())(), ()()()() are in L
()), (()(), ))(( are not in L

G3 = ({S}, {(, )}, R, S). The set of rules, R, is

S → (S) | SS | ε
Example
• A grammar for L = {0m1n | m≥n}

• CFG?
G:
S => 0S1 | A
A => 0A | 

How would you interpret the string “00000111”

using this grammar?

38
Examples
DESIGNING CONTEXT-FREE GRAMMARS
As with the design of finite automata the design of context-free
grammars requires creativity.

But there are some useful techniques

Technique I: Merging Grammars
Technique II: DFA to CFG
• You can convert any DFA into an equivalent CFG as follows.
• Make a variable Ri for each state qi of the DFA.
• Add the rule Ri → aRj to the CFG if δ(qi,a) = qj is a transition in the DFA.
• Add the rule Ri → ε if qi is an accept state of the DFA.
• Make R0 the start variable of the grammar, where q0 is the start state of the
machine
Technique II: DFA to CFG
E → 0E
E → 1O
O → 0O
O → 1E
O→ε

Example Derivation:
E ⇒ 0E ⇒ 00E ⇒ 001O ⇒ 0010O ⇒ 00101E ⇒ 001011O ⇒ 001011
Ambiguity
Ambiguity
Example Derivations
• E -> E + E | E x E | (E) | a
a+a
• E⇒E+E⇒a+E⇒a+a
((a + a) x a)
• E ⇒ (E) ⇒(E x E) ⇒ ((E) x E) ⇒ ((E+E) x E) ⇒ ((a + a) x a)
a+axa
• E⇒E+E⇒E+ExE⇒a+axa
• E⇒ExE⇒E+ExE⇒a+axa
Ambiguity

the girl touches the boy with the flower

(a) (b)
Leftmost derivation
• A derivation of a string w in a grammar G is a leftmost derivation if at
every step the leftmost remaining variable is the one replaced. The
derivation below is a leftmost derivation.
Ambiguity
Chomsky Normal Form
Theorem
Example

This change guarantees that the start variable

doesn’t occur on the right-hand side of a rule.
Example con’t.

• Second, we take care of all ε-rules.

• We remove an ε-rule A → ε, where A is not the start variable.
• Then for each occurrence of an A on the right-hand side of a rule, we add a new rule with that
occurrence deleted.
• In other words, if R → uAv is a rule in which u and v are strings of variables and terminals, we add
rule R → uv.
• We do so for each occurrence of an A, so the rule R → uAvAw causes us to add R → uvAw, R →
uAvw, and R → uvw.
• If we have the rule R → A, we add R → ε unless we had previously removed the rule R → ε.
• We repeat these steps until we eliminate all ε-rules not involving the start variable.
Example con’t.
Example con’t.
Example con’t.

Pumping Lemma For Regular Languages
No ratings yet
Pumping Lemma For Regular Languages
60 pages
Chapters (5 - 8) TOC BOOK by Adesh K Pandey
No ratings yet
Chapters (5 - 8) TOC BOOK by Adesh K Pandey
95 pages
Decision Properties of Regular Language
100% (1)
Decision Properties of Regular Language
29 pages
Theory of Computation Long Type of Questions-1
100% (1)
Theory of Computation Long Type of Questions-1
22 pages
Unit-III Combinational Logic Circuits
No ratings yet
Unit-III Combinational Logic Circuits
260 pages
Formal Methods in Computer Science CS1502 Pumping Lemma: Patchrawat Uthaisombut University of Pittsburgh
No ratings yet
Formal Methods in Computer Science CS1502 Pumping Lemma: Patchrawat Uthaisombut University of Pittsburgh
27 pages
Module 3 RE&CFG
No ratings yet
Module 3 RE&CFG
108 pages
hw3 Tex
No ratings yet
hw3 Tex
5 pages
Pumping Lema
No ratings yet
Pumping Lema
97 pages
Lecture 10
No ratings yet
Lecture 10
73 pages
Non-Regular Languages: Md. Rafsan Jani Assistant Professor Department of CSE Jahangirnagar University
No ratings yet
Non-Regular Languages: Md. Rafsan Jani Assistant Professor Department of CSE Jahangirnagar University
56 pages
RegularLanguageProperties Myppt
No ratings yet
RegularLanguageProperties Myppt
68 pages
9 Pumping Lemma
No ratings yet
9 Pumping Lemma
36 pages
Wa0012.
No ratings yet
Wa0012.
50 pages
CS402 Short Notes: For More Visit
No ratings yet
CS402 Short Notes: For More Visit
39 pages
7-Pumping Lemma For Regular Languages
No ratings yet
7-Pumping Lemma For Regular Languages
46 pages
The Pumping Lemma For Context Free Grammars
No ratings yet
The Pumping Lemma For Context Free Grammars
14 pages
Regular Language Properties
No ratings yet
Regular Language Properties
65 pages
Pumping Lemma
No ratings yet
Pumping Lemma
38 pages
Lecture 10 Nonregular Languages
No ratings yet
Lecture 10 Nonregular Languages
26 pages
Pumping Lemma
No ratings yet
Pumping Lemma
74 pages
Lec 3
No ratings yet
Lec 3
34 pages
Pumping Lemma For Regular Languages
No ratings yet
Pumping Lemma For Regular Languages
45 pages
Non-Regular Languages: (Pumping Lemma)
No ratings yet
Non-Regular Languages: (Pumping Lemma)
76 pages
The Pumping Lemma
No ratings yet
The Pumping Lemma
40 pages
Pumping Lemma Regular
No ratings yet
Pumping Lemma Regular
15 pages
4c-Regular Expressions
No ratings yet
4c-Regular Expressions
36 pages
CS5371 Theory of Computation: Lecture 5: Automata Theory III (Non-Regular Language, Pumping Lemma, Regular Expression)
No ratings yet
CS5371 Theory of Computation: Lecture 5: Automata Theory III (Non-Regular Language, Pumping Lemma, Regular Expression)
19 pages
CS5371 Theory of Computation: Lecture 9: Automata Theory VII (Pumping Lemma, Non-CFL)
No ratings yet
CS5371 Theory of Computation: Lecture 9: Automata Theory VII (Pumping Lemma, Non-CFL)
24 pages
Regular Language Properties
No ratings yet
Regular Language Properties
33 pages
Cfls and The Pumping Lemma
No ratings yet
Cfls and The Pumping Lemma
24 pages
Syntax Tree
0% (1)
Syntax Tree
3 pages
Properties of Regular Languages: Reading: Chapter 4
No ratings yet
Properties of Regular Languages: Reading: Chapter 4
58 pages
Pumping Lemma For CFG
No ratings yet
Pumping Lemma For CFG
15 pages
Automata Lectuee4 0
No ratings yet
Automata Lectuee4 0
26 pages
Solution To All Theory Questions From Q Bank
No ratings yet
Solution To All Theory Questions From Q Bank
23 pages
Lec 13
No ratings yet
Lec 13
14 pages
Lecture 3
No ratings yet
Lecture 3
18 pages
Tutorial 2
No ratings yet
Tutorial 2
15 pages
Group: ZU059: Students: Habibillayeva Gulchin, Kazymov Ruslan, Salimov Khanbala, Sadygov Baylar
No ratings yet
Group: ZU059: Students: Habibillayeva Gulchin, Kazymov Ruslan, Salimov Khanbala, Sadygov Baylar
19 pages
Lecture#8 (PL)
No ratings yet
Lecture#8 (PL)
12 pages
CS340 Theory of Computation VI
No ratings yet
CS340 Theory of Computation VI
13 pages
Flat 2021 Pyq Solution
No ratings yet
Flat 2021 Pyq Solution
10 pages
5-Pumping Lemma
No ratings yet
5-Pumping Lemma
10 pages
Infiniteness Test The Pumping Lemma Nonregular Languages
No ratings yet
Infiniteness Test The Pumping Lemma Nonregular Languages
8 pages
Pumping Lemma
No ratings yet
Pumping Lemma
12 pages
Extra On Regular Languages and Non-Regular Languages
No ratings yet
Extra On Regular Languages and Non-Regular Languages
34 pages
22-Pumping Lemma For CFL-11-03-2024
No ratings yet
22-Pumping Lemma For CFL-11-03-2024
6 pages
Non Regular Language
No ratings yet
Non Regular Language
25 pages
HW 2
No ratings yet
HW 2
5 pages
Pumping Lemma For Regular Languages: Sipser Pages 77 - 82
No ratings yet
Pumping Lemma For Regular Languages: Sipser Pages 77 - 82
21 pages
Pumping Lemma Exer ALRsol
No ratings yet
Pumping Lemma Exer ALRsol
6 pages
What Is Pumping Lemma Useful For?
No ratings yet
What Is Pumping Lemma Useful For?
4 pages
Automata Pumping Lemma
No ratings yet
Automata Pumping Lemma
3 pages
Operations Research: CT-4-BCA-601
No ratings yet
Operations Research: CT-4-BCA-601
2 pages
9 Pumping Lemma Examples
No ratings yet
9 Pumping Lemma Examples
2 pages
Assignment Pumping Lemma For CFL
No ratings yet
Assignment Pumping Lemma For CFL
4 pages
Group2 Assignment2 TOC
No ratings yet
Group2 Assignment2 TOC
6 pages
Theory of Computation
No ratings yet
Theory of Computation
7 pages
Pigeon Hole
No ratings yet
Pigeon Hole
5 pages
Pumping Lemma For RG
No ratings yet
Pumping Lemma For RG
13 pages
Pumping Lemma R
No ratings yet
Pumping Lemma R
7 pages
FALLSEM2020-21 CSE2002 TH VL2020210104550 Reference Material I 17-Aug-2020 Pumping Lemma Example 2
No ratings yet
FALLSEM2020-21 CSE2002 TH VL2020210104550 Reference Material I 17-Aug-2020 Pumping Lemma Example 2
3 pages
Assignment 4 Data
No ratings yet
Assignment 4 Data
11 pages
Stanford Dsa
No ratings yet
Stanford Dsa
52 pages
ML 2 (Mainly KNN)
100% (1)
ML 2 (Mainly KNN)
12 pages
CMP3008 LN1 CourseOverview Introduction
No ratings yet
CMP3008 LN1 CourseOverview Introduction
49 pages
CMP3008 LN3 NonDeterminism
No ratings yet
CMP3008 LN3 NonDeterminism
40 pages
Cooks Theorem
No ratings yet
Cooks Theorem
33 pages
Knowledge Representation Using Predicate Logic
No ratings yet
Knowledge Representation Using Predicate Logic
14 pages
VLSI Physical Design
No ratings yet
VLSI Physical Design
15 pages
Data Structures & Java
No ratings yet
Data Structures & Java
5 pages
SQL With R
100% (1)
SQL With R
12 pages
Compiler Design
100% (2)
Compiler Design
17 pages
Recursion
No ratings yet
Recursion
50 pages
Class 12th IP Project
No ratings yet
Class 12th IP Project
134 pages
Training Feed Forward Networks With The Marquardt Algorithm
No ratings yet
Training Feed Forward Networks With The Marquardt Algorithm
5 pages
CMP2003 LectureNotes Week3 4
No ratings yet
CMP2003 LectureNotes Week3 4
83 pages
Introduction To R
No ratings yet
Introduction To R
33 pages
New File
No ratings yet
New File
52 pages
Sorting Part 2
No ratings yet
Sorting Part 2
69 pages
MA207 Chap2
No ratings yet
MA207 Chap2
22 pages
RUSHIL Combined
No ratings yet
RUSHIL Combined
58 pages
Week1 Slides 20221004
No ratings yet
Week1 Slides 20221004
62 pages
Red Black Trees
No ratings yet
Red Black Trees
41 pages
Simulating Chemistry On A Quantum Computer
No ratings yet
Simulating Chemistry On A Quantum Computer
52 pages
Syllabus
No ratings yet
Syllabus
2 pages
CMP2003 Lecturenotes Week9
No ratings yet
CMP2003 Lecturenotes Week9
25 pages
Analysis of Algorithms
No ratings yet
Analysis of Algorithms
19 pages
Initialization
No ratings yet
Initialization
16 pages
Genetic Algorithm For Project Scheduling With Resource Allocation and Time Constraints
No ratings yet
Genetic Algorithm For Project Scheduling With Resource Allocation and Time Constraints
11 pages
CH 18
No ratings yet
CH 18
12 pages
Adsw 3
No ratings yet
Adsw 3
4 pages
13BTECPC303CH54761691834456BTECPC303PDSApdfpdf
No ratings yet
13BTECPC303CH54761691834456BTECPC303PDSApdfpdf
1 page
Assignment1 MFML
No ratings yet
Assignment1 MFML
2 pages
Convolutional Neural Network With An Optimized Backpropagation Technique
No ratings yet
Convolutional Neural Network With An Optimized Backpropagation Technique
5 pages
Multilayer Percef'Tron Structures Applied To Adaptive Equalisers For Data Communications
No ratings yet
Multilayer Percef'Tron Structures Applied To Adaptive Equalisers For Data Communications
4 pages
HW 10
No ratings yet
HW 10
2 pages

CMP3008 LN5 ContextFreeGrammars

Uploaded by

CMP3008 LN5 ContextFreeGrammars

Uploaded by

CMP3008

• Can we still come up with a language recognizer?

• D contains all strings of 1s whose length is a perfect square.

Note the growing gap between successive members of this sequence.

• Can we still come up with a language recognizer?

• Supports natural, recursive notation called “context-free grammar”

How does this grammar work?

Generating a string from a grammar:

• 3 Substitution rules (productions)

• The sequence of substitutions to obtain a string is called a derivation.

Parse tree for production and all other subsequent productions:

Parse tree for 0110

0#1 A ⇒ 0A1 ⇒ 0B1 ⇒ 0#1 A → 0A1

into a single line

using the symbol “|” as an “or”.

G3 = ({S}, {(, )}, R, S). The set of rules, R, is

How would you interpret the string “00000111”

But there are some useful techniques

the girl touches the boy with the flower

This change guarantees that the start variable

• Second, we take care of all ε-rules.

You might also like