CPSC 388 - Compiler Design and Construction: Parsers - Context Free Grammars

This document discusses parsers and context-free grammars (CFGs) for compiler design. It introduces parsers as tools for recognizing more types of languages than regular expressions or finite state automata. CFGs are used to define parsers and consist of terminals, non-terminals, productions, and a start symbol. Productions specify rewrite rules with non-terminals on the left and sequences of terminals/non-terminals on the right. The document provides examples of CFGs for boolean expressions, assignments, and IF statements. It discusses ambiguous grammars and how to avoid ambiguity through precedence and associativity rules in the grammar.

Uploaded by

Kashif Raffat


CPSC 388 – Compiler Design and Construction

Parsers – Context Free Grammars

Announcements
• HW3 due via Sakai
• Solution to HW3
• Homework: HW4 assigned today, due next Friday, via Sakai
• Reading: 4.1-4.4
• Progress Report Grades due Sept 28th

Compilers
Source Code
→ Lexical Analyzer (Scanner)
→ Token Stream
→ Syntax Analyzer (Parser)
→ Abstract Syntax Tree
→ Semantic Analyzer
→ Intermediate Code Generator
→ Optimizer
→ Code Generator
Scanner
Source Code:

position = initial + rate * 60 ;

Corresponding Tokens:

IDENT(position)
ASSIGN
IDENT(initial)
PLUS
IDENT(rate)
TIMES
INT-LIT(60)
SEMI-COLON
Example Parse
Source Code:
position = initial + rate * 60 ;

Abstract-Syntax Tree:

            =
          /   \
   position     +
              /   \
       initial      *
                  /   \
               rate    60

• Interior nodes are operators.
• A node's children are operands.
• Each subtree forms a "logical unit", e.g., the subtree with * at its root shows that, because multiplication has higher precedence than addition, this operation must be performed as a unit (not initial+rate).
Limitations of RE and FSA
• Regular Expressions and Finite State Automata cannot express all languages
• For example, the language that consists of all balanced parentheses: () and ((())) and (((((())))))
• Parsers can recognize more types of languages than RE or FSA
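Recognizing balanced parentheses requires remembering how many parentheses are still open, and that count has no upper bound, which is what a finite number of states cannot track. A minimal Python sketch of such a recognizer follows; the function name is_balanced is hypothetical and chosen only for illustration, and the sketch treats the empty string as balanced.

```python
def is_balanced(text: str) -> bool:
    """Return True if text consists only of balanced parentheses."""
    depth = 0  # number of currently open parentheses; unbounded in general
    for ch in text:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:  # a ")" with no matching "("
                return False
        else:
            return False  # only parentheses belong to this language
    return depth == 0  # every "(" must have been closed
```

The counter plays the role of the memory that an FSA lacks.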
Parsers
• Input: Sequence of Tokens
• Output: a representation of the program
• Often an AST, but could be other things
• Also finds syntax errors
• CFGs (Context Free Grammars) are used to define parsers
Context Free Grammars

stmt → if ( expr ) stmt else stmt

The whole line is a rule (production). Terminals: if, (, ), else. Non-terminals: stmt, expr.

Context Free Grammar
CFGs consist of:

Σ – Set of Terminals (use tokens from scanner)

N – Set of Non-Terminals (variables)
P – Set of Productions (also called rules)
S – the Start Non-Terminal (the one on the left of the first rule if not specified)
Productions

stmt → if ( expr ) stmt else stmt

Left side: a single non-terminal
Right side: a sequence of zero or more terminals and non-terminals
Example with Boolean Expressions
• “true” and “false” are boolean expressions
• If exp1 and exp2 are boolean expressions, then so are:
  • exp1 || exp2
  • exp1 && exp2
  • ! exp1
  • ( exp1 )
Corresponding CFG
bexp → TRUE
bexp → FALSE
bexp → bexp OR bexp
bexp → bexp AND bexp
bexp → NOT bexp
bexp → LPAREN bexp RPAREN
UPPERCASE names represent tokens (thus terminals)
lowercase names represent non-terminals
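One way to hold this grammar in a program is a mapping from each non-terminal to its list of right-hand sides. The sketch below is only an illustration: the names PRODUCTIONS, START, NON_TERMINALS and TERMINALS are assumptions, not part of the course material. It recovers the four parts of the CFG (Σ, N, P, S) from the productions alone.

```python
# P: productions of the boolean-expression grammar, left side -> right sides.
PRODUCTIONS = {
    "bexp": [
        ["TRUE"],
        ["FALSE"],
        ["bexp", "OR", "bexp"],
        ["bexp", "AND", "bexp"],
        ["NOT", "bexp"],
        ["LPAREN", "bexp", "RPAREN"],
    ],
}

# S: the start non-terminal (the left side of the first rule).
START = "bexp"

# N: non-terminals are exactly the symbols that appear on a left side.
NON_TERMINALS = set(PRODUCTIONS)

# Sigma: terminals are all other symbols that appear on a right side.
TERMINALS = {
    symbol
    for alternatives in PRODUCTIONS.values()
    for rhs in alternatives
    for symbol in rhs
    if symbol not in NON_TERMINALS
}
```

With the naming convention of the slides, every computed terminal is uppercase and every non-terminal is lowercase.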
CFG for Assignments
• Here is a CFG for simple assignment statements (can only assign boolean expressions to identifiers):
stmt → ID ASSIGN bexp SEMICOLON
CFG for simple IF statements
Combine these CFGs and add 2 more rules for
simple IF statements:
1. stmt → IF LPAREN bexp RPAREN stmt
2. stmt → IF LPAREN bexp RPAREN stmt ELSE stmt
3. stmt → ID ASSIGN bexp SEMICOLON
4. bexp → TRUE
5. bexp → FALSE
6. bexp → bexp OR bexp
7. bexp → bexp AND bexp
8. bexp → NOT bexp
9. bexp → LPAREN bexp RPAREN
You Try It
Write a context-free grammar for the
language of very simple while loops
(in which the loop body only contains
one statement) by adding a new
production with nonterminal stmt on
the left-hand side.
CFG Languages
• The language defined by a context-free grammar is the set of strings (sequences of terminals) that can be derived from the start nonterminal.
• Think of productions as rewriting rules:

Set cur_seq = starting non-terminal
While (non-terminal, X, exists in cur_seq):
    Select production with X on left of “→”
    Replace X with right portion of selected production

• Try it with the given CFG
What Strings are in Language
1. stmt → IF LPAREN bexp RPAREN stmt
2. stmt → IF LPAREN bexp RPAREN stmt ELSE stmt
3. stmt → ID ASSIGN bexp SEMICOLON
4. bexp → TRUE
5. bexp → FALSE
6. bexp → bexp OR bexp
7. bexp → bexp AND bexp
8. bexp → NOT bexp
9. bexp → LPAREN bexp RPAREN

Set cur_seq = starting non-terminal
While (non-terminal, X, exists in cur_seq):
    Select production with X on left of “→”
    Replace X with right portion of selected production
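The rewriting loop can be run mechanically to generate strings of the language. The Python sketch below is one possible reading of it, under assumptions that the slides do not make: it always rewrites the leftmost non-terminal, picks productions at random, and after max_steps rewrites switches to the shortest production so that the loop is guaranteed to end. The names GRAMMAR, random_derivation, seed and max_steps are hypothetical.

```python
import random

# The nine productions of the statement grammar, grouped by left side.
GRAMMAR = {
    "stmt": [
        ["IF", "LPAREN", "bexp", "RPAREN", "stmt"],
        ["IF", "LPAREN", "bexp", "RPAREN", "stmt", "ELSE", "stmt"],
        ["ID", "ASSIGN", "bexp", "SEMICOLON"],
    ],
    "bexp": [
        ["TRUE"],
        ["FALSE"],
        ["bexp", "OR", "bexp"],
        ["bexp", "AND", "bexp"],
        ["NOT", "bexp"],
        ["LPAREN", "bexp", "RPAREN"],
    ],
}


def random_derivation(start="stmt", seed=0, max_steps=50):
    """Rewrite non-terminals until only terminals remain; return the string."""
    rng = random.Random(seed)
    cur_seq = [start]  # Set cur_seq = starting non-terminal
    steps = 0
    while any(sym in GRAMMAR for sym in cur_seq):
        # Pick the leftmost non-terminal X in cur_seq.
        i = next(k for k, sym in enumerate(cur_seq) if sym in GRAMMAR)
        options = GRAMMAR[cur_seq[i]]
        # Select a production with X on the left; once the step budget is
        # used up, take the shortest one so the derivation terminates.
        rhs = rng.choice(options) if steps < max_steps else min(options, key=len)
        cur_seq[i:i + 1] = rhs  # Replace X with the right side
        steps += 1
    return cur_seq
```

Calling random_derivation with different seed values yields different members of the language; every result consists only of terminals and ends with SEMICOLON, because each stmt production ends either in stmt or in SEMICOLON.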
Try It Again
exp → exp PLUS term
exp → exp MINUS term
exp → term
term → term TIMES factor
term → term DIVIDE factor
term → factor
factor → LPAREN exp RPAREN
factor → ID
What is the language?
Leftmost and Rightmost Derivations
• A derivation is a leftmost derivation if it is always the leftmost nonterminal that is chosen to be replaced.
• It is a rightmost derivation if it is always the rightmost one.
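As an illustration, the sketch below derives ID PLUS ID TIMES ID from the exp/term/factor grammar twice, once replacing the leftmost non-terminal at every step and once the rightmost. The helper apply_productions and the constant names are hypothetical; the helper raises an error if a listed production does not match the non-terminal that the chosen strategy selects.

```python
NON_TERMINALS = {"exp", "term", "factor"}

# Productions used in this example, taken from the expression grammar.
E_PLUS = ("exp", ["exp", "PLUS", "term"])
E_TERM = ("exp", ["term"])
T_TIMES = ("term", ["term", "TIMES", "factor"])
T_FACTOR = ("term", ["factor"])
F_ID = ("factor", ["ID"])


def apply_productions(productions, leftmost=True):
    """Apply each production to the leftmost (or rightmost) non-terminal.

    Returns every intermediate sequence of the derivation as a string.
    """
    seq = ["exp"]
    steps = [" ".join(seq)]
    for lhs, rhs in productions:
        positions = [i for i, sym in enumerate(seq) if sym in NON_TERMINALS]
        i = positions[0] if leftmost else positions[-1]
        if seq[i] != lhs:
            raise ValueError(f"cannot apply a {lhs} production to {seq[i]}")
        seq[i:i + 1] = rhs
        steps.append(" ".join(seq))
    return steps


LEFTMOST = [E_PLUS, E_TERM, T_FACTOR, F_ID, T_TIMES, T_FACTOR, F_ID, F_ID]
RIGHTMOST = [E_PLUS, T_TIMES, F_ID, T_FACTOR, F_ID, E_TERM, T_FACTOR, F_ID]

for step in apply_productions(LEFTMOST, leftmost=True):
    print(step)
```

Both derivations use the same eight productions in a different order and end in the same string; they correspond to the same parse tree.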
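Derivation Notation
• E => a : E derives a in exactly one step
• E =>* a : E derives a in zero or more steps
• E =>+ a : E derives a in one or more steps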
Parse Trees
Start with the start nonterminal.
Repeat:
    choose a leaf nonterminal X
    choose a production X → alpha
    the symbols in alpha become the children of X in the tree
until there are no more leaf nonterminals left.
The derived string is formed by reading the leaf nodes from left to right.
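A parse tree can be sketched as a small recursive data structure. In the Python example below, the class Node and the function leaves are hypothetical names; the tree is built by hand for the string ID ASSIGN NOT TRUE SEMICOLON using the statement grammar from earlier in the lecture, and the sketch does not handle ε-productions.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    symbol: str                                    # terminal or non-terminal
    children: list = field(default_factory=list)   # empty for leaf nodes


def leaves(node):
    """Read the leaf symbols of the tree from left to right."""
    if not node.children:
        return [node.symbol]
    result = []
    for child in node.children:
        result.extend(leaves(child))
    return result


# stmt → ID ASSIGN bexp SEMICOLON, bexp → NOT bexp, bexp → TRUE
tree = Node("stmt", [
    Node("ID"),
    Node("ASSIGN"),
    Node("bexp", [
        Node("NOT"),
        Node("bexp", [Node("TRUE")]),
    ]),
    Node("SEMICOLON"),
])

derived = leaves(tree)
```

Here derived is the list of terminals ID, ASSIGN, NOT, TRUE, SEMICOLON, which is the derived string.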
Ambiguous Grammars
• Consider the grammar
exp → exp PLUS exp
exp → exp MINUS exp
exp → exp TIMES exp
exp → exp DIVIDE exp
exp → INT_LIT
• Construct a parse tree for 3-4/2
• Is there more than one parse tree?
• If there is more than one parse tree for a string, then the grammar is ambiguous
• Ambiguity causes problems with parsing (what is the correct structure?)
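For this particular grammar the number of parse trees can be counted directly: every tree for a token sequence has some operator at its root, with a parse tree for the tokens to its left and one for the tokens to its right. The sketch below counts trees that way; count_parse_trees and OPERATORS are hypothetical names, and the approach is specific to this grammar rather than a general parsing algorithm.

```python
from functools import lru_cache

OPERATORS = {"PLUS", "MINUS", "TIMES", "DIVIDE"}


def count_parse_trees(tokens):
    """Count the parse trees of tokens under exp → exp OP exp | INT_LIT."""
    tokens = tuple(tokens)

    @lru_cache(maxsize=None)
    def count(lo, hi):
        # Number of ways exp can derive tokens[lo:hi].
        if hi - lo == 1:
            return 1 if tokens[lo] == "INT_LIT" else 0
        total = 0
        # Try every operator strictly inside the span as the root of the tree.
        for k in range(lo + 1, hi - 1):
            if tokens[k] in OPERATORS:
                total += count(lo, k) * count(k + 1, hi)
        return total

    return count(0, len(tokens))


# Token sequence for 3-4/2
trees = count_parse_trees(["INT_LIT", "MINUS", "INT_LIT", "DIVIDE", "INT_LIT"])
```

For 3-4/2 the count is 2: one tree groups the input as (3-4)/2 and the other as 3-(4/2), so the grammar is ambiguous.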
Precedence in Grammars
To write a grammar whose parse trees
express precedence correctly, use a
different nonterminal for each
precedence level. Start by writing a
rule for the operator(s) with the
lowest precedence ("-" in our case),
then write a rule for the operator(s)
with the next lowest precedence, etc:
Precedence in Grammars
exp → exp MINUS exp | term
term → term DIVIDE term | factor
factor → INT_LIT | LPAREN exp RPAREN
• Now try constructing multiple parse trees for 3-4/2
• The grammar is still ambiguous. Look at associativity: construct 2 parse trees for 5-3-2.
Recursion on CFGs
• A grammar is recursive in nonterminal X if: X derives a sequence of symbols that includes an X.
• A grammar is left recursive in X if: X derives a sequence of symbols that starts with an X.
• A grammar is right recursive in X if: X derives a sequence of symbols that ends with an X.
• For left associativity, use left recursion.
• For right associativity, use right recursion.
Ambiguity Removed in CFG
exp → exp MINUS term | term
term → term DIVIDE factor | factor
factor → INT_LIT | LPAREN exp RPAREN
• One level for each order of operation
• Left recursion for left associativity
• Now try constructing 2 parse trees for 5-3-2.
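A small hand-written parser shows the single tree that this grammar allows. The sketch below is an assumption-laden illustration, not the method the course prescribes: it uses one function per non-terminal, realizes each left-recursive rule with a loop (a common technique that builds the same left-leaning trees), and represents trees as nested tuples. The names tokenize and parse are hypothetical, and the tokenizer silently skips characters it does not know.

```python
import re


def tokenize(text):
    """Split the input into integer literals and the symbols - / ( )."""
    return re.findall(r"\d+|[-/()]", text)


def parse(text):
    """Return the parse of text as nested tuples (operator, left, right)."""
    tokens = tokenize(text)
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def advance():
        nonlocal pos
        pos += 1
        return tokens[pos - 1]

    def factor():  # factor → INT_LIT | LPAREN exp RPAREN
        tok = peek()
        if tok == "(":
            advance()
            tree = exp()
            if peek() != ")":
                raise SyntaxError("expected )")
            advance()
            return tree
        if tok is None or not tok.isdigit():
            raise SyntaxError(f"unexpected token {tok!r}")
        return int(advance())

    def term():  # term → term DIVIDE factor | factor
        tree = factor()
        while peek() == "/":
            advance()
            tree = ("/", tree, factor())  # earlier operands nest deeper
        return tree

    def exp():  # exp → exp MINUS term | term
        tree = term()
        while peek() == "-":
            advance()
            tree = ("-", tree, term())
        return tree

    tree = exp()
    if pos != len(tokens):
        raise SyntaxError(f"unexpected token {tokens[pos]!r}")
    return tree
```

parse("5-3-2") gives ("-", ("-", 5, 3), 2), the left-associative grouping (5-3)-2, and parse("3-4/2") gives ("-", 3, ("/", 4, 2)), so division binds tighter than subtraction.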
You Try it
• Construct a grammar for arithmetic expressions with addition, multiplication, exponentiation (right assoc.), subtraction, division, and unary negative.
exp → exp PLUS exp |
exp MINUS exp |
exp TIMES exp |
exp DIVIDE exp |
exp POW exp |
MINUS exp |
LPAREN exp RPAREN |
INT_LIT
Solution
exp → exp PLUS term |
exp MINUS term |
term
term → term TIMES factor |
term DIVIDE factor |
factor
factor → exponent POW factor |
exponent
exponent → MINUS exponent |
final
final → INT_LIT |
LPAREN exp RPAREN
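The effect of each level of this grammar can be checked with a small evaluator that follows the rules one function at a time. The sketch below is illustrative only: evaluate is a hypothetical name, the characters + - * / ^ stand in for the tokens PLUS, MINUS, TIMES, DIVIDE and POW, division is Python's true division, and the left-recursive rules are written as loops while the right-recursive POW rule is written as a recursive call.

```python
import re


def evaluate(text):
    """Evaluate an arithmetic expression following the solution grammar."""
    tokens = re.findall(r"\d+|[-+*/^()]", text)
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def advance():
        nonlocal pos
        pos += 1
        return tokens[pos - 1]

    def exp():  # exp → exp PLUS term | exp MINUS term | term
        value = term()
        while peek() in ("+", "-"):
            if advance() == "+":
                value += term()
            else:
                value -= term()
        return value

    def term():  # term → term TIMES factor | term DIVIDE factor | factor
        value = factor()
        while peek() in ("*", "/"):
            if advance() == "*":
                value *= factor()
            else:
                value /= factor()
        return value

    def factor():  # factor → exponent POW factor | exponent
        base = exponent()
        if peek() == "^":
            advance()
            return base ** factor()  # right recursion: right associative
        return base

    def exponent():  # exponent → MINUS exponent | final
        if peek() == "-":
            advance()
            return -exponent()
        return final()

    def final():  # final → INT_LIT | LPAREN exp RPAREN
        tok = peek()
        if tok == "(":
            advance()
            value = exp()
            if peek() != ")":
                raise SyntaxError("expected )")
            advance()
            return value
        if tok is None or not tok.isdigit():
            raise SyntaxError(f"unexpected token {tok!r}")
        return int(advance())

    value = exp()
    if pos != len(tokens):
        raise SyntaxError(f"unexpected token {tokens[pos]!r}")
    return value
```

evaluate("2^3^2") is 512 because POW groups to the right, and evaluate("10-4-3") is 3 because MINUS groups to the left. Note that in this grammar unary minus sits below POW, so evaluate("-2^2") is 4, that is (-2)^2; Python's own -2**2 evaluates to -4.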
You Try It
• Write a grammar for the language of boolean expressions, with two possible operands (true and false) and three possible operators (and, or, not). Add nonterminals so that or has lowest precedence, then and, then not. Finally, change the grammar to reflect the fact that both and and or are left associative.
bexp → TRUE
bexp → FALSE
bexp → bexp OR bexp
bexp → bexp AND bexp
bexp → NOT bexp
bexp → LPAREN bexp RPAREN
List Grammars
• Several types of lists can be created using CFGs
• One or more x's (without any separator or terminator):
1. xList → X | xList xList
2. xList → X | xList X
3. xList → X | X xList
• One or more x's, separated by commas:
1. xList → X | xList COMMA xList
2. xList → X | xList COMMA X
3. xList → X | X COMMA xList
List Grammars
• One or more x's, each x terminated by a semi-colon:
• You Try It
1. xList → X SEMICOLON | xList xList
2. xList → X SEMICOLON | xList X SEMICOLON
3. xList → X SEMICOLON | X SEMICOLON xList
• Zero or more x's (without any separator or terminator):
1. xList → ε | X | xList xList
2. xList → ε | X | xList X
3. xList → ε | X | X xList
List Grammars
• Zero or more x's, each terminated by a semi-colon:
1. xList → ε | X SEMICOLON | xList xList
2. xList → ε | X SEMICOLON | xList X SEMICOLON
3. xList → ε | X SEMICOLON | X SEMICOLON xList
• Zero or more x's, separated by commas:
• You Try It
1. xList → ε | nonEmptyXList
   nonEmptyXList → X | X COMMA nonEmptyXList
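The comma-separated grammar needs the extra nonterminal so that a list is either empty or has commas only between items. A minimal recognizer that mirrors the two rules is sketched below; is_x_list and non_empty_x_list are hypothetical names, and the input is assumed to be a list of token names.

```python
def is_x_list(tokens):
    """Recognize xList → ε | nonEmptyXList over a list of token names."""

    def non_empty_x_list(pos):
        """Return the position just past a nonEmptyXList, or -1 on failure."""
        if pos >= len(tokens) or tokens[pos] != "X":
            return -1
        if pos + 1 < len(tokens) and tokens[pos + 1] == "COMMA":
            # nonEmptyXList → X COMMA nonEmptyXList
            return non_empty_x_list(pos + 2)
        # nonEmptyXList → X
        return pos + 1

    if not tokens:  # xList → ε
        return True
    return non_empty_x_list(0) == len(tokens)
```

A trailing comma, as in X COMMA, is rejected because the rule after COMMA requires another nonEmptyXList.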
CFGs for Whole Languages
• To write a grammar for a whole programming language, break down the problem into pieces. For example, think about a Java program: a program consists of one or more classes:
program → classlist
classlist → class | class classlist
CFGs for Whole Languages
• A class is the word "class", optionally preceded by the word "public", followed by an identifier, followed by an open curly brace, followed by the class body, followed by a closing curly brace:
class → PUBLIC CLASS ID LCURLY classbody RCURLY |
CLASS ID LCURLY classbody RCURLY
CFGs for Whole Languages
• A class body is a list of zero or more field and/or method definitions:
classbody → ε | deflist
deflist → def | def deflist

And So On…
