l5 CFG

The document introduces context-free grammars (CFG), which represent a larger class of languages than regular languages and are essential in compiler technology and XML document formats. It explains the structure of CFGs, including their components, productions, and the methods of derivation, while also discussing applications and the concept of ambiguous grammars. The document provides examples, such as the language of palindromes, to illustrate the principles of CFGs and their derivations.

Uploaded by

sripathisneha221826

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views21 pages

l5 CFG

Uploaded by

sripathisneha221826

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Context-Free Grammars

Notes on Automata and Computation Theory

Chia-Ping Chen

Department of Computer Science and Engineering

National Sun Yat-Sen University
Kaohsiung, Taiwan ROC

Context-Free Grammars – p. 1
Introduction
We will introduce a larger class of languages than
the regular languages. They are called the
context-free languages.
These languages have a natural, recursive formal
representation called context-free grammar (CFG).
CFG plays a central role in compiler technology.
More recently, it is used to describe document
formats in XML (extensible markup language).
CFG is equivalent to another class of automata
called pushdown automata.

Context-Free Grammars – p. 2
An Informal Example
Consider the language of palindromes, where a
palindrome is a string that read the same forwards
and backwards, Lp = {w | w = w R }.
There is a natural, recursive definition for Lp :
basis: , 0 and 1 are in Lp
induction: if w is in Lp , so are 0w0 and 1w1.
A context-free grammar is a formal notation to
express such a recursive definition for languages.
Variables are used to represent classes of strings and
the relations between variables are specified.

Context-Free Grammars – p. 3
CFG for the Palindromes
For Lp , we need only one variable S to represent the
set of palindromes.
With S, the set of rules P of the CFG is


 S→

S → 0


P : S→1
S → 0S0




S → 1S1


Note that the rules use symbols in the alphabet and

the variables.
Context-Free Grammars – p. 4
Formal Definition
There are four components in a CFG
a finite set T of alphabet symbols, also known as
terminals
a finite set V of variables, a.k.a. nonterminals
the variable S (a.k.a. the start symbol)
representing the language being defined
a finite set P of productions, a.k.a. rules
We can represent a CFG G by its components,
G = (V, T, P, S). For the palindromes,
Gp = ({S}, {0, 1}, P, S)

Context-Free Grammars – p. 5
Productions of CFG
Each rule in a CFG consists of a head, an arrow and
a body, in the form head → body.
The head must be a single variable.
The body is a string of zero or more terminals
and variables. It represents one way to form
strings in the language of the head variable.
The notation for productions can be more compact.
We can group all productions headed by variable A,
and call them A-productions. Suppose the bodies are
α1 , . . . , αn , we may rewrite the A-productions as
A → α1 | α2 | . . . | α n

Context-Free Grammars – p. 6
Derivations
There are two basic methods to use the productions:
recursive inference, where we use the rules to go
from body to head.
derivations, where we use the rules to go from
head to body.
The language of a CFG is the set of all terminal
strings that can be obtained by derivations, starting
with the start symbol S.

Context-Free Grammars – p. 7
Extending Derivation Rules
For convenience, we define a new symbol ⇒. If
G
A → γ is a production, then αAβ ⇒ αγβ. (G is
G
often omitted in the notation if it is obvious.)
We may extend the ⇒ relationship to represent zero,
∗
one or more derivation steps, by ⇒. More precisely,
∗
basis: α ⇒ α for any α ∈ (V ∪ T )∗ .
∗ ∗
induction: if α ⇒ β and β ⇒ γ, then α ⇒ γ.
∗
Put in another way, if α ⇒ β, then there exists a
sequence γ1 , . . . , γn such that α = γ1 , β = γn and
γi ⇒ γi+1 for all i = 1, . . . , n − 1.
Context-Free Grammars – p. 8
Leftmost and Rightmost Derivations
To turn a variable into a string of terminals, the
leftmost derivation requires that in each step, we
replace the leftmost variable by one of its bodies.
∗
This is denoted by ⇒ or ⇒. Similarly for the
lm lm
rightmost derivation.
Any derivation has an equivalent leftmost (and
rightmost) derivation. That is,
∗ ∗ ∗
A ⇒ w iff A ⇒ w (and A ⇒ w).
lm rm

Context-Free Grammars – p. 9
The Language of a Grammar
Let G = (V, T, P, S) be a CFG, the language of G,
denoted by L(G) is the set of terminal strings that
have derivations from the start symbol. That is,
∗ ∗
L(G) = {w ∈ T | S ⇒ w}.
G

L is said to be a context-free language if L = L(G)

for some CFG G.
The set of palindromes Lp is a context-free language
because the context-free grammar Gp defines it.
That is, Lp = L(Gp ).

Context-Free Grammars – p. 10
Sentential Forms
Derivations from the start symbol produce strings
called “sentential forms”. That is, any α ∈ (V ∪ T )∗
∗
is a sentential form if S ⇒ α.
Note that L(G) is the set of sentential forms in T ∗ .
For example, for the CFG from Fig 5.2,
E ⇒ E ∗ E ⇒ E ∗ (E) ⇒ E ∗ (E + E)
So E ∗ E, E ∗ (E) and E ∗ (E + E) are all
sentential forms.

Context-Free Grammars – p. 11
Parse Trees
The derivation of a sentence can be represented by a
(parse) tree. It shows clearly how the symbols of a
terminal string are grouped into substrings, each of
which belongs to a variable in the grammar.
When used in a compiler, this “parse tree” is the data
structure to represent the source program. It enables
a natural translation of source code to the executable.
The matter of ambiguity will also be studied, where
a terminal string can have more than one parse tree.

Context-Free Grammars – p. 12
Construction of Parse Trees
A parse tree for grammar G = (V, T, P, S) satisfies
Each interior node is a variable.
Each leaf is either a variable, a terminal, or . If
it is , then it must be the only child of its parent.
An interior node A can have its children labeled
by X1 , . . . , Xk from left to right only if
A → X1 , . . . , X k
is a production in P .

Context-Free Grammars – p. 13
The Yield of a Parse Tree
If we look at the leaves of a parse tree from left to
right, we get a string called the yield of the tree.
It is always a string that is derived by the root node.
The yields of those trees with S as root and terminal
symbols or as leaves are strings in the language of
the underlying grammar.

Context-Free Grammars – p. 14
Inference, Derivation and Parse Tree
Given G = (V, T, P, S), the following are equivalent:
1. The recursive inference determines that w ∈ L(A),
the language of variable A.
∗
2. A ⇒ w.
∗
3. A ⇒ w.
lm
∗
4. A ⇒ w.
rm
5. There is a parse tree with root A and yield w.
Except for the recursive inference, all conditions are also
equivalent when w is a string with some variables.
Context-Free Grammars – p. 15
From Inference to Tree
We will follow Figure 5.7 to show the equivalence.
First, (1) ⇒ (5). We will prove by induction on the
number of steps to infer w ∈ L(A).
basis: one step. Only the basis of the inference
has been used. So there must be a production of
A → w. The tree is trivial.
induction: n + 1 steps. Suppose the last step is
the production A → X1 . . . Xk in the inference
for w ∈ A. We can break w into w1 . . . wk , where
wi = Xi if Xi is a terminal or wi ∈ L(Xi ) if Xi
is a variable and there is a parse tree for Xi by
the induction hypothesis. Connecting these parse
trees with root A gives the parse tree for w.
Context-Free Grammars – p. 16
From Tree to Derivation
The second step in showing equivalence is to
construct a leftmost derivation from a parse tree.
(5) ⇒ (3) is shown by induction on the tree height.
basis: height 1. The tree is rooted by A with
terminal string w. A ⇒ w by A → w.
lm
induction: height n + 1. There is a root A with
children X1 . . . Xk from left to right. We can
partition the string w into w1 . . . wk where
∗
Xi ⇒ wi by the induction hypothesis. Applying
lm
for each Xi , i = 1, . . . , k, we have the leftmost
∗
derivation A ⇒ w.
lm
Context-Free Grammars – p. 17
From Derivation to Recur. Inference
Finally (2) ⇒ (4), and the cycle is completed. Note
(3) ⇒ (2) is trivial.
∗
The induction is on the length of derivation A ⇒ w.
basis: one step. A → w must be a production,
and w ∈ L(A) will be concluded by inference.
induction: n + 1 steps. Singling out the first
∗
derivation, we can write A ⇒ X1 . . . Xk ⇒ w.
We can break w into w1 . . . wk , where wi = Xi if
∗
Xi is a terminal and Xi ⇒ wi if Xi is a variable.
By the induction hypothesis, wi ∈ L(Xi ) is
concluded by the recursive inference. Then
recursive inference concludes w ∈ L(A).
Context-Free Grammars – p. 18
Applications of CFG
Grammars are used to describe programming
languages. There is actually a mechanical way to
turn the language description as a CFG to a parser.
This parser is used in compiler to recognize the
structure of a source program and represent that
structure as a parse tree.
Grammars are used in XML for document type
definition (DTD) to describe the allowable tags and
the ways to use these tags.

Context-Free Grammars – p. 19
Ambiguous Grammars
If any string w in L(G) have more than one parse
tree, then G is called an ambiguous grammar. It is a
fact that there is no algorithm to decide whether a
CFG is ambiguous.
For some ambiguous G, it may be possible to
redesign the grammar to make the parse tree unique
for every string in L(G), i.e., to create an equivalent
unambiguous grammar.
However, the creation of an equivalent unambiguous
grammar may not be possible for some CFL. Such a
CFL is called inherently ambiguous, or simply
ambiguous.
Context-Free Grammars – p. 20
Ambiguous Languages
Given a CFl L, if every CFG G with L(G) = L is
ambiguous, then L is ambiguous.
Here we give an ambiguous CFL. Let
L = {an bn cm dm | n ≥ 1, m ≥ 1}∪{an bm cm dn | n ≥ 1, m ≥ 1}

L is a CFL as there is a CFG (Figure 5.22) for L.

We can argue that all strings with equal numbers of
a, b, c, d are generated in two different ways: one
ensures that a, b are equal and c, d are equal, and the
other ensures that a, d are equal and b, c are equal, as
shown in Figure 5.23.

Context-Free Grammars – p. 21

Context Free Grammar CFG
No ratings yet
Context Free Grammar CFG
71 pages
Chương 3. Phân Tích Cú Pháp
No ratings yet
Chương 3. Phân Tích Cú Pháp
103 pages
Unit 4 ContextFreeLanguage
No ratings yet
Unit 4 ContextFreeLanguage
58 pages
Flat CH 3
No ratings yet
Flat CH 3
74 pages
FLAT - Ch. 3
No ratings yet
FLAT - Ch. 3
69 pages
2nd Phase Syntax Analyzer - 1
No ratings yet
2nd Phase Syntax Analyzer - 1
136 pages
ch5ـcontextـfreeـgrammars
No ratings yet
ch5ـcontextـfreeـgrammars
49 pages
CS242 - Module 5
No ratings yet
CS242 - Module 5
42 pages
Module 3 CFG - Final
No ratings yet
Module 3 CFG - Final
40 pages
Unit-4 Context Free Grammar
No ratings yet
Unit-4 Context Free Grammar
106 pages
FLAT - Ch. 3 (Lecture Notes)
No ratings yet
FLAT - Ch. 3 (Lecture Notes)
23 pages
ContextFreeGrammars
No ratings yet
ContextFreeGrammars
28 pages
Lecture 6 (6-2-23)
No ratings yet
Lecture 6 (6-2-23)
9 pages
Context-Free Grammar (CFG) : Dr. Nadeem Akhtar
No ratings yet
Context-Free Grammar (CFG) : Dr. Nadeem Akhtar
56 pages
Chapter 6 Context Free Grammar and Context Free Language
No ratings yet
Chapter 6 Context Free Grammar and Context Free Language
55 pages
Chapter 3
No ratings yet
Chapter 3
57 pages
Unit3 Toc
No ratings yet
Unit3 Toc
97 pages
Handout 11 CFG
No ratings yet
Handout 11 CFG
22 pages
Chapter3 CFG
No ratings yet
Chapter3 CFG
67 pages
Module-3 Notes
No ratings yet
Module-3 Notes
28 pages
Chapter 3 - Context Free Languages
No ratings yet
Chapter 3 - Context Free Languages
59 pages
Parsing Bun
No ratings yet
Parsing Bun
48 pages
UNIT-2 TOc by Krishnendu
No ratings yet
UNIT-2 TOc by Krishnendu
44 pages
Compiler 8
No ratings yet
Compiler 8
28 pages
Chapter - 2 - Finite State Automata - Part - 3
No ratings yet
Chapter - 2 - Finite State Automata - Part - 3
50 pages
(Week 3) Syntax Analysis (Derivation)
No ratings yet
(Week 3) Syntax Analysis (Derivation)
46 pages
Gramatici Exemplu
No ratings yet
Gramatici Exemplu
45 pages
SDC - Grammar - CFG
No ratings yet
SDC - Grammar - CFG
46 pages
Grammar
No ratings yet
Grammar
57 pages
Chapter 3 Context Free Language
No ratings yet
Chapter 3 Context Free Language
84 pages
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
No ratings yet
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
13 pages
Chapter4 CFG
No ratings yet
Chapter4 CFG
43 pages
2CFL
No ratings yet
2CFL
20 pages
UNIT IV CONTEXT FREE GRAMMARS and LANGUAGES
No ratings yet
UNIT IV CONTEXT FREE GRAMMARS and LANGUAGES
69 pages
Context-Free Languages & Grammars (Cfls & CFGS) : Reading: Chapter 5
No ratings yet
Context-Free Languages & Grammars (Cfls & CFGS) : Reading: Chapter 5
38 pages
Context-Free Grammar (CFG)
No ratings yet
Context-Free Grammar (CFG)
27 pages
Automata Theory Lec-03
No ratings yet
Automata Theory Lec-03
58 pages
Compiler CFG Slides of PowerPoint
No ratings yet
Compiler CFG Slides of PowerPoint
66 pages
Context Free Language
No ratings yet
Context Free Language
31 pages
Chapter 4 - Context-Free Grammars and Languages
No ratings yet
Chapter 4 - Context-Free Grammars and Languages
60 pages
CS 373: Theory of Computation: Manoj Prabhakaran Mahesh Viswanathan Fall 2008
No ratings yet
CS 373: Theory of Computation: Manoj Prabhakaran Mahesh Viswanathan Fall 2008
64 pages
Unit-3 Context Free Grammar
No ratings yet
Unit-3 Context Free Grammar
57 pages
Theory of Computation Notes
No ratings yet
Theory of Computation Notes
4 pages
Oscp Preparation
83% (6)
Oscp Preparation
39 pages
Unit-3 Flat
No ratings yet
Unit-3 Flat
29 pages
Motivation For Formal Grammars
No ratings yet
Motivation For Formal Grammars
15 pages
08 CFG
No ratings yet
08 CFG
27 pages
Samir CFG
No ratings yet
Samir CFG
105 pages
FL&T Unit 3 - 1 - 1724732026415
No ratings yet
FL&T Unit 3 - 1 - 1724732026415
17 pages
Lecture Notes On Context-Free Grammars: 15-411: Compiler Design Frank Pfenning September 15, 2009
No ratings yet
Lecture Notes On Context-Free Grammars: 15-411: Compiler Design Frank Pfenning September 15, 2009
9 pages
Unit IV Context Free Languages
No ratings yet
Unit IV Context Free Languages
89 pages
Formal Languages and Automata Theory: CH 4: Context Free Languages
No ratings yet
Formal Languages and Automata Theory: CH 4: Context Free Languages
59 pages
Toc 3
No ratings yet
Toc 3
65 pages
USAMO
No ratings yet
USAMO
7 pages
Class 18 Context Free Grammar
No ratings yet
Class 18 Context Free Grammar
35 pages
Unit Iv Context Free Languages
No ratings yet
Unit Iv Context Free Languages
74 pages
Notes 4x
No ratings yet
Notes 4x
3 pages
Context Free Grammars: Bachelor of Technology Computer Science and Engineering
No ratings yet
Context Free Grammars: Bachelor of Technology Computer Science and Engineering
10 pages
Theory of Computation: Lecture 7: Context-Free Grammar
No ratings yet
Theory of Computation: Lecture 7: Context-Free Grammar
21 pages
Lect 11
No ratings yet
Lect 11
7 pages
Early On Kenpo History
No ratings yet
Early On Kenpo History
4 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
Healthy BLDC Motor Simulation Using Finite Element Analysis-IJAERDV04I1073133
No ratings yet
Healthy BLDC Motor Simulation Using Finite Element Analysis-IJAERDV04I1073133
12 pages
0-02-Oct-2017-05-10-50English Self Learning Material PDF
No ratings yet
0-02-Oct-2017-05-10-50English Self Learning Material PDF
258 pages
Nike - Final Report
No ratings yet
Nike - Final Report
13 pages
1.1 Apogamy, Apospory and Parthenogenesis
No ratings yet
1.1 Apogamy, Apospory and Parthenogenesis
21 pages
Sree Kaala Hastiswara Satakam in Telugu PDF
No ratings yet
Sree Kaala Hastiswara Satakam in Telugu PDF
21 pages
External Environment
No ratings yet
External Environment
54 pages
A Guilted Age Apologies For The Past Ashraf A H Rushdy PDF Download
No ratings yet
A Guilted Age Apologies For The Past Ashraf A H Rushdy PDF Download
77 pages
Success Against The Odds
No ratings yet
Success Against The Odds
194 pages
Workers Compensation Practice and Procedure Guide
No ratings yet
Workers Compensation Practice and Procedure Guide
84 pages
Tale of High Elf and Futa Oni
No ratings yet
Tale of High Elf and Futa Oni
1 page
Contraception Today A Pocketbook For General Practitioners and Practice Nurses 7th Edition John Guillebaud
No ratings yet
Contraception Today A Pocketbook For General Practitioners and Practice Nurses 7th Edition John Guillebaud
55 pages
IC Engine L1
No ratings yet
IC Engine L1
8 pages
Inglesina Zippy Free Manual
No ratings yet
Inglesina Zippy Free Manual
44 pages
XML Tutorial For Beginners
No ratings yet
XML Tutorial For Beginners
28 pages
How Rentomojo and Furlenco Got Buried Under The Weight of Their Own Furniture - The Ken
No ratings yet
How Rentomojo and Furlenco Got Buried Under The Weight of Their Own Furniture - The Ken
2 pages
The Reign of Terror
No ratings yet
The Reign of Terror
11 pages
Corporate Bridge Internship Proposal
No ratings yet
Corporate Bridge Internship Proposal
5 pages
Acromegaly Poster
No ratings yet
Acromegaly Poster
1 page
How To Setup A Kali Linux Hacking Station On Raspberry Pi 3 Model B+
No ratings yet
How To Setup A Kali Linux Hacking Station On Raspberry Pi 3 Model B+
11 pages
Deduction in Respect of Health Insurance Premia. 80D
No ratings yet
Deduction in Respect of Health Insurance Premia. 80D
2 pages
Armance V The State 2020 SCJ 148
No ratings yet
Armance V The State 2020 SCJ 148
9 pages
Units 15 16 - Exercises
No ratings yet
Units 15 16 - Exercises
4 pages
Lista de Libros 2024
No ratings yet
Lista de Libros 2024
2 pages
Case Daka
No ratings yet
Case Daka
7 pages
Jesse
No ratings yet
Jesse
4 pages
SCFR1 JHS Currhead Consolidation
No ratings yet
SCFR1 JHS Currhead Consolidation
2 pages
6.1comprehensive Interviews
No ratings yet
6.1comprehensive Interviews
2 pages

l5 CFG

Uploaded by

l5 CFG

Uploaded by

Context-Free Grammars

Notes on Automata and Computation Theory

Department of Computer Science and Engineering

Note that the rules use symbols in the alphabet and

L is said to be a context-free language if L = L(G)

L is a CFL as there is a CFG (Figure 5.22) for L.

You might also like