0% found this document useful (0 votes)

15 views18 pages

ModuleIII

The document provides an overview of Context-Free Grammars (CFG), detailing their components such as terminals, non-terminals, production rules, and the start symbol. It explains the significance of parse trees in analyzing syntactic structures, applications of CFGs in programming languages, natural language processing, and compiler design, as well as issues of ambiguity in grammars. Additionally, it discusses normal forms for CFGs, including Chomsky Normal Form and Greibach Normal Form, and presents examples illustrating the concepts.

Uploaded by

bayyapureddigaritejaswi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views18 pages

ModuleIII

Uploaded by

bayyapureddigaritejaswi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Module 3

Context Free Grammars

3.1 CFG

A CFG is a formal grammar used to describe the syntax or structure of a language in terms of
production rules. These rules define how strings of symbols can be generated in the language.
Context-free grammars are widely used in computer science for tasks such as defining the syntax
of programming languages, parsing natural language, and modeling biological sequences.

Components of CFG:

Symbols (Terminals):

These are basic units of language being generated. Terminals are symbols that appear in strings
generated by CFG.

Non-terminals (Variables):

Non-terminals are placeholders representing syntactic categories or groups of symbols.

Production Rules:

Production rules specify how non-terminals can be expanded into sequences of terminals and/or
other non-terminals. Each production rule consists of a non-terminal symbol (left-hand side) and
a sequence of symbols (right-hand side).

Start Symbol:

It is a special non-terminal that represents initial symbol from which derivation of strings begin.
It serves as root of derivation tree and indicates starting point for generating strings in language.

Formal Definition of a CFG:

Parsing:

Context-free grammars are used in parsing algorithms to analyze and recognize the syntactic
structure of strings according to the CFG rules. Parsing involves determining whether given
string can be generated by CFG and constructing a derivation tree to represent syntactic structure
of string.

3.2 Parse trees

Parse trees play a crucial role in understanding and analyzing the syntactic structure of strings
generated by formal grammars.

A parse tree (PT), also known as a derivation tree, illustrates the syntactic structure of a string
according to the production rules of a formal grammar. Each node in PT corresponds to a symbol
in the input string, and each edge represents one production rule application during derivation
process.

Components of PT:

Root Node:

The topmost node of PT represents start symbol of CFG, from which derivation of string begins.
Internal Nodes:

Internal nodes of parse tree represent non-terminal symbols of CFG. Each internal node is
labeled with a non-terminal symbol. PT’s children correspond to the symbols derived from that
non-terminal.

Leaf Nodes:

Leaf nodes of PT are terminal symbols of CFG. Each leaf node is a terminal symbol from input
string.

Edges:

Edges connecting nodes in PT represent application of production rules during derivation

process. Each edge is labeled with the production rule used to derive the child node from the
parent node.

Construction of Parse Trees:

Start Symbol: It forms the root node.

Expansion: Apply productions recursively to expand non-terminal symbols into sequences of

terminals / non-terminals until all symbols in string are derived.

Terminal Placement: Label leaf nodes with corresponding terminal symbols from input.

Derivation Path: Each path from root to any leaf in PT represents a derivation of input from start
symbol.
3.3 Applications of CFG

Language Recognition:

PTs are used in parsing algorithms to recognize whether a given string belongs to the language
generated by CFG.

Ambiguity Detection:

PTS help identify ambiguity in grammars by revealing multiple valid interpretations or

derivations of the same string.

Syntax Analysis:

Parse trees provide insights into the syntactic structure of strings, aiding in the understanding and
analysis of programming languages and natural languages.

Compiler Design:

PTs are utilized in syntax analysis phase of compilers. The purpose is to validate and parse
source code a/c grammar of PL.

Context-free grammars (CFGs) find applications in various fields, primarily in computer science,
linguistics, and related areas. Here are some of the key applications of context-free grammars:

Programming Languages:

CFGs are extensively used to define syntax of PLs. The structure of valid programs is described
by the rules for constructing statements, expressions, and other language constructs.
Parser generators like Yacc/Bison and ANTLR use CFGs to generate parsers for programming
languages, allowing developers to write compilers, interpreters, and other language-processing
tools.

Compiler Design:

In compiler construction, CFGs are used in the syntax analysis phase (parsing) to analyze
structure of source code in order to build PT.

PT which is intermediate representation of source code, is subsequently used in later phases of

compilation process, such as semantic analysis and code generation.

Natural Language Processing (NLP):

CFGs are employed in NLP to model the syntax of natural languages. They describe the
grammatical rules governing the formation of sentences, phrases, and other linguistic structures.

CFG-based parsers can be used to parse and analyze text for tasks.

Syntax Highlighting and Code Analysis:

Text editors and integrated development environments (IDEs) use CFG-based grammars to
perform syntax highlighting, which visually distinguishes different language constructs in source
code based on their syntactic roles.

CFG-based static analysis tools can analyze source code for potential errors, code smells, and
style violations by parsing the code and checking it against predefined grammar rules.

Data Validation and Parsing:

CFGs are employed in data validation and parsing tasks across various domains, including
markup languages (e.g., XML, HTML), configuration files, log files, and network protocols.

By defining a grammar for the expected structure of data formats, CFG-based parsers can
validate input data for correctness and extract relevant information for further processing.

3.4 Ambiguity in grammars and languages

Ambiguity in grammars refers to situations where a single string in the language can be derived
by more than one PT. This can lead to confusion in parsing and interpretation, as there may be
multiple valid interpretations of the same input. Ambiguity can arise in both natural and formal
languages.
Parse Tree 1:

/ \

* 4

/ \

2 3

According to this interpretation, "2 * 3" is evaluated first, resulting in 6, which is then added to 4
to produce the final result of 10.

Parse Tree 2:

/ \

2 +

/ \

3 4
In this interpretation, "3 + 4" is evaluated first, resulting in 7, which is then multiplied by 2 to
produce the final result of 14.

To resolve ambiguity, the grammar can be modified to explicitly specify the precedence and
associativity of operators. For example, adding separate production rules for addition and
multiplication with appropriate precedence levels can clarify the intended parsing behavior:

In this grammar:

S represents a statement.

E represents an expression.

a represents some arbitrary terminal symbol.

Ambiguity: Let's look at the sentence "if E1 then if E2 then a else a". This sentence can be parsed
in two different ways:

Parse Tree 1:

/ | \

if E1 S

/ | | \

then if S else

/ \ | |
E2 a a a

In this interpretation, the "else" clause belongs to the inner "if" statement.

In this interpretation, the "else" clause belongs to the outer "if" statement.

The ambiguity arises because the grammar does not specify the associativity of the "if-then-else"
construct. As a result, there are multiple valid ways to interpret the nesting of "if-then-else"
statements, leading to different parse trees and interpretations of the sentence.

To resolve ambiguity, the grammar can be modified to explicitly specify the associativity of the
"if-then-else" construct. For example, adding parentheses to indicate the associativity can clarify
the intended parsing behavior:

With this modified grammar, the ambiguity in parsing the sentence "if E1 then if E2 then a else
a" would be eliminated, as the parentheses would enforce a specific grouping of the "if-then-
else" constructs.
Example:

Consider the following context-free grammar for arithmetic expressions with explicit precedence
rules:

Sentence:

Let's consider the sentence "2 * 3 + 4".

Parse Tree:

The unambiguous parse tree for the sentence "2 * 3 + 4" would be as follows:

* 4

2 3

In this parse tree, the multiplication operation ("2 * 3") is evaluated first, and then the addition
operation ("result of 2 * 3 + 4") is performed. This unambiguous interpretation follows the
precedence rules specified in the grammar, where multiplication takes precedence over addition.

This grammar specifies explicit precedence rules for + and * operations. * has higher precedence
than +, which means that * operations are evaluated before + operations. Additionally, the
grammar enforces left associativity for both operations, meaning that when there are multiple
operators of the same precedence level, they are evaluated from left to right.

Benefits:

Using an unambiguous grammar with explicit precedence rules ensures that there is only one
valid interpretation of a given sentence, eliminating ambiguity and ensuring predictable parsing
behavior. This clarity is crucial for language processing tasks such as compiler design, syntax
analysis, and natural language processing, where unambiguous interpretations are essential for
correct program execution or understanding of natural language expressions.

3.5 Normal forms for CFG

CFGs can be transformed into various normal forms to simplify their analysis and processing.

The two most common normal forms for context-free grammars are the Chomsky Normal Form
(CNF) and the Greibach Normal Form (GNF).

Chomsky Normal Form (CNF):

These are the transformations needed to convert a CFG into CNF and GNF, respectively. Each
form has its advantages and is useful in different contexts, depending on the specific
requirements of the application or parsing algorithm being used.
B -> FF | b

C -> XY | a

X -> AB

Y -> AB

D -> FF

E -> FF

F -> C

Resulting CNF Grammar is:

S -> XY | BC

A -> BA | a

B -> FF | b

C -> XY | a

X -> AB

Y -> AB

D -> FF

E -> FF

F -> C
Example:

Let's consider the language L={anbncn ∣ n≥0}, which consists of strings of the form anbncn . We
can use the pumping lemma to prove that L is not context-free.

Assume L is context-free.

Let p be the pumping length given by the pumping lemma.

Consider the string s=apbpcp in L.

According to the pumping lemma, s can be divided into five substrings:

u,v,w,x,y, satisfying the conditions of the lemma.

Pumping down or up by one (i.e., setting i=0 or i=2) leads to a string that does not belong to

L, since the number of a's, b's, and c's will no longer be equal.

Therefore, L cannot be context-free.

2. Simple Syntax Directed Translation
No ratings yet
2. Simple Syntax Directed Translation
51 pages
Business Logic Notes
No ratings yet
Business Logic Notes
7 pages
CC lec 7
No ratings yet
CC lec 7
16 pages
cfg (1)
No ratings yet
cfg (1)
8 pages
Our DSA lab 12[1]
No ratings yet
Our DSA lab 12[1]
5 pages
DAA
No ratings yet
DAA
4 pages
Lecture 05
No ratings yet
Lecture 05
58 pages
6277
No ratings yet
6277
65 pages
1 s2.0 0012365X9190350B Main
No ratings yet
1 s2.0 0012365X9190350B Main
25 pages
FLAT - Ch. 3 (Lecture Notes)
No ratings yet
FLAT - Ch. 3 (Lecture Notes)
23 pages
COSC3054 Lec 03 I Grammars (4)
No ratings yet
COSC3054 Lec 03 I Grammars (4)
96 pages
Module 4
No ratings yet
Module 4
7 pages
2019-11-29_04_41_39CS_V_sem_Compiler_design
No ratings yet
2019-11-29_04_41_39CS_V_sem_Compiler_design
10 pages
Parsing Part - 1
No ratings yet
Parsing Part - 1
53 pages
Truth Tables
No ratings yet
Truth Tables
2 pages
Context-free Grammar
No ratings yet
Context-free Grammar
22 pages
Slide Set 5 Parsing
No ratings yet
Slide Set 5 Parsing
18 pages
Production System (Computer Science)
No ratings yet
Production System (Computer Science)
5 pages
Lec4 SyntaxAnalysis
No ratings yet
Lec4 SyntaxAnalysis
41 pages
CSC508 - Pass Year 2022
No ratings yet
CSC508 - Pass Year 2022
7 pages
APznzabvYKoN4zDY71onQwxNN3R5YXoFXjgna4I0XurpAH1XE77GlYeHrkYJx-bE96PPeJntwqzIfNBvguewq_9dNxjJHAPsi5CaMk-Pv6X530i-KQDKh3JuMvyl95bEO1TR_fC6I6zJQhW0qb1oPgi21XiXcoliVzRGGVn66Gsj5rdWsJ7DYhv9_bPuB3iUXcsUAVwQmrEsvBAIIrycUz
No ratings yet
APznzabvYKoN4zDY71onQwxNN3R5YXoFXjgna4I0XurpAH1XE77GlYeHrkYJx-bE96PPeJntwqzIfNBvguewq_9dNxjJHAPsi5CaMk-Pv6X530i-KQDKh3JuMvyl95bEO1TR_fC6I6zJQhW0qb1oPgi21XiXcoliVzRGGVn66Gsj5rdWsJ7DYhv9_bPuB3iUXcsUAVwQmrEsvBAIIrycUz
44 pages
ContextFreeGrammars_myppt
No ratings yet
ContextFreeGrammars_myppt
41 pages
Context-Free Grammar (CFG)
No ratings yet
Context-Free Grammar (CFG)
27 pages
Automata Theory Assignment Adam
No ratings yet
Automata Theory Assignment Adam
5 pages
Emth637 U02
No ratings yet
Emth637 U02
17 pages
Church - A Set of Postulates For The Foundation of Logic
100% (1)
Church - A Set of Postulates For The Foundation of Logic
22 pages
New DOC Document
No ratings yet
New DOC Document
17 pages
Unit1_Discrete Structure
No ratings yet
Unit1_Discrete Structure
70 pages
Function - Placemats
No ratings yet
Function - Placemats
4 pages
Memoization
No ratings yet
Memoization
4 pages
Compiler Construction Week 04 Syntax Analysis I)
No ratings yet
Compiler Construction Week 04 Syntax Analysis I)
41 pages
Chapter 4 intro_to_parsing
No ratings yet
Chapter 4 intro_to_parsing
53 pages
Java Regular Expressions Cheat Sheet
No ratings yet
Java Regular Expressions Cheat Sheet
1 page
B: Ü.:: A:::::: A:: Xxy XXX Yxx: Acb. A:I - , 2I, B: ("R - I3 X: 2 - Xy
No ratings yet
B: Ü.:: A:::::: A:: Xxy XXX Yxx: Acb. A:I - , 2I, B: ("R - I3 X: 2 - Xy
9 pages
Compiler Design - Syntax Analysis
No ratings yet
Compiler Design - Syntax Analysis
6 pages
[Week 3] Syntax Analysis (Derivation)
No ratings yet
[Week 3] Syntax Analysis (Derivation)
46 pages
Parse Trees, Ambiguity, and Chomsky Normal Form
No ratings yet
Parse Trees, Ambiguity, and Chomsky Normal Form
11 pages
Toa-Handout-22 CFG More Examples
No ratings yet
Toa-Handout-22 CFG More Examples
16 pages
CS6109-MODULE-4
No ratings yet
CS6109-MODULE-4
36 pages
Dynamic Capacitated Lot-Sizing Problems: A Classification and Review of Solution Approaches
No ratings yet
Dynamic Capacitated Lot-Sizing Problems: A Classification and Review of Solution Approaches
31 pages
Figure 1two Parse Trees For 9-5+2
No ratings yet
Figure 1two Parse Trees For 9-5+2
3 pages
Binary Decision Diagram
No ratings yet
Binary Decision Diagram
23 pages
Ch03 Recursion PDF
No ratings yet
Ch03 Recursion PDF
43 pages
Microkanren
No ratings yet
Microkanren
12 pages
Parsing 120903115324 Phpapp02
No ratings yet
Parsing 120903115324 Phpapp02
20 pages
7.1 Context Free Grammars: 7.1.1 The Basics
No ratings yet
7.1 Context Free Grammars: 7.1.1 The Basics
5 pages
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
No ratings yet
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
13 pages
Theme
No ratings yet
Theme
11 pages
Gramatici Exemplu
No ratings yet
Gramatici Exemplu
45 pages
Parsing Bun
No ratings yet
Parsing Bun
48 pages
toc mod3
No ratings yet
toc mod3
72 pages
Ramsey Theory Presentation
No ratings yet
Ramsey Theory Presentation
20 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
Formal Methods in Software Engineering: (FMSE)
No ratings yet
Formal Methods in Software Engineering: (FMSE)
23 pages
Lecture 03
No ratings yet
Lecture 03
36 pages
Lecture 9
No ratings yet
Lecture 9
22 pages
Ba Computer Science Fs Fi Bacs en
No ratings yet
Ba Computer Science Fs Fi Bacs en
403 pages
ContextFreeGrammars
No ratings yet
ContextFreeGrammars
28 pages
CS242_Module 5
No ratings yet
CS242_Module 5
42 pages
SE Compiler Chapter 3-Parser
No ratings yet
SE Compiler Chapter 3-Parser
27 pages
Rules of Inference
No ratings yet
Rules of Inference
17 pages
TOC II Updated
No ratings yet
TOC II Updated
41 pages
CHP 3 Part One
No ratings yet
CHP 3 Part One
57 pages
Lecture05-Syntax Analysis-CFG
No ratings yet
Lecture05-Syntax Analysis-CFG
19 pages
Introduction To Automata Theory, Languages, and Computation: Solutions For Chapter 4 Solutions For Section 4.1
No ratings yet
Introduction To Automata Theory, Languages, and Computation: Solutions For Chapter 4 Solutions For Section 4.1
5 pages
2ND Term S2 Mathematics
No ratings yet
2ND Term S2 Mathematics
57 pages
Parsing: Programming Language Principles
No ratings yet
Parsing: Programming Language Principles
33 pages
CGF and CFL
No ratings yet
CGF and CFL
45 pages
Module 3 CFG - Final
No ratings yet
Module 3 CFG - Final
40 pages
Unit 3 Syntax - Analyzer
No ratings yet
Unit 3 Syntax - Analyzer
56 pages
UNIT IV CONTEXT FREE GRAMMARS and LANGUAGES
No ratings yet
UNIT IV CONTEXT FREE GRAMMARS and LANGUAGES
69 pages
Unit II PDF
No ratings yet
Unit II PDF
7 pages
CPSC 388 - Compiler Design and Construction: Parsers - Context Free Grammars
No ratings yet
CPSC 388 - Compiler Design and Construction: Parsers - Context Free Grammars
35 pages
Entrepreneurship Process
No ratings yet
Entrepreneurship Process
22 pages
Compiler 3
No ratings yet
Compiler 3
11 pages
Context Free Grammars
No ratings yet
Context Free Grammars
39 pages
2nd Phase Syntax Analyzer -1
No ratings yet
2nd Phase Syntax Analyzer -1
136 pages
Lecture Notes: 0N1 (MATH19861) Mathematics For Foundation Year
No ratings yet
Lecture Notes: 0N1 (MATH19861) Mathematics For Foundation Year
177 pages
Principles of Programming Languages: Syntax Analysis
No ratings yet
Principles of Programming Languages: Syntax Analysis
51 pages
Chapter 12: Context-Free Grammars
No ratings yet
Chapter 12: Context-Free Grammars
13 pages
Context Free Grammar and Parsing
0% (1)
Context Free Grammar and Parsing
138 pages
M7 Q1 W1
100% (2)
M7 Q1 W1
16 pages
Context-Free Grammar (CFG) : Dr. Nadeem Akhtar
No ratings yet
Context-Free Grammar (CFG) : Dr. Nadeem Akhtar
56 pages
2014-CD Ch-03 SAn
No ratings yet
2014-CD Ch-03 SAn
21 pages
Formal Languages and Automata Theory: CH 4: Context Free Languages
No ratings yet
Formal Languages and Automata Theory: CH 4: Context Free Languages
59 pages
Regular Expressions Demystified: A Practical Guide with Examples
From Everand
Regular Expressions Demystified: A Practical Guide with Examples
William E. Clark
No ratings yet
DLD Lab Manual - 240605 - 203418
No ratings yet
DLD Lab Manual - 240605 - 203418
49 pages
Python Regular Expressions Explained: A Practical Guide with Examples
From Everand
Python Regular Expressions Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
Gene Expression Programming: Fundamentals and Applications
From Everand
Gene Expression Programming: Fundamentals and Applications
Fouad Sabry
No ratings yet

ModuleIII

Uploaded by

ModuleIII

Uploaded by

Module 3

Context Free Grammars

Non-terminals are placeholders representing syntactic categories or groups of symbols.

Formal Definition of a CFG:

3.2 Parse trees

Edges connecting nodes in PT represent application of production rules during derivation

Construction of Parse Trees:

Start Symbol: It forms the root node.

Expansion: Apply productions recursively to expand non-terminal symbols into sequences of

PTS help identify ambiguity in grammars by revealing multiple valid interpretations or

PT which is intermediate representation of source code, is subsequently used in later phases of

Natural Language Processing (NLP):

Syntax Highlighting and Code Analysis:

Data Validation and Parsing:

3.4 Ambiguity in grammars and languages

a represents some arbitrary terminal symbol.

Let's consider the sentence "2 * 3 + 4".

3.5 Normal forms for CFG

Chomsky Normal Form (CNF):

Resulting CNF Grammar is:

Let p be the pumping length given by the pumping lemma.

Consider the string s=apbpcp in L.

According to the pumping lemma, s can be divided into five substrings:

u,v,w,x,y, satisfying the conditions of the lemma.

Therefore, L cannot be context-free.

You might also like