Chapter 3-Syntax Analysis-II

This document discusses top-down parsing techniques. It covers recursive descent parsing which uses backtracking, and predictive parsing which does not use backtracking but requires grammars to be in a special form called LL(1). Predictive parsing techniques include recursive predictive parsing and non-recursive (table-driven) predictive parsing, also known as LL(1) parsing. LL(1) parsing uses a parsing table and stack to parse inputs based on the grammar without backtracking. Examples of recursive predictive and LL(1) parsing are provided.

Uploaded by

Feraol Negera

Principles of Compiler Design

Chapter 3
Lecture 2
Top-Down Parsing

Top-Down Parsing
• The parse tree is created from top to bottom.
• Top-down parsers
– Recursive-Descent Parsing
• Backtracking is needed (if a choice of production rule does not work, we backtrack and try the other alternatives).
• It is a general parsing technique, but it is not widely used.
• It is not efficient.
– Predictive Parsing
• No backtracking.
• Efficient.
• Needs a special form of grammar (LL(1) grammars).
• Recursive Predictive Parsing is a special form of Recursive-Descent Parsing without backtracking.
• A Non-Recursive (Table-Driven) Predictive Parser is also known as an LL(1) parser.

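Recursive-Descent Parsing (uses Backtracking)
• Backtracking is needed.
• It tries to find the left-most derivation.

S → aBc
B → bc | b

input: abc

First attempt (B → bc): S ⇒ aBc ⇒ abcc, which does not match abc, so the parser fails and backtracks.
Second attempt (B → b): S ⇒ aBc ⇒ abc, which matches the input.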
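
The backtracking behaviour on this grammar can be sketched in Python. This is a minimal illustration, not the slides' own code: the names GRAMMAR, parse and accepts are assumptions, and the generator simply tries each alternative in order, falling back to the next one when a later match fails.

```python
# Grammar from the slide: S -> a B c,  B -> b c | b
# (GRAMMAR, parse and accepts are illustrative names, not from the slides.)
GRAMMAR = {
    "S": [["a", "B", "c"]],
    "B": [["b", "c"], ["b"]],
}

def parse(symbols, text, pos):
    """Yield every input position reachable after matching `symbols` from `pos`."""
    if not symbols:
        yield pos
        return
    head, rest = symbols[0], symbols[1:]
    if head in GRAMMAR:
        # Non-terminal: try each alternative in order; a failure further on
        # resumes this loop, which is the backtracking step.
        for alternative in GRAMMAR[head]:
            for mid in parse(alternative, text, pos):
                yield from parse(rest, text, mid)
    elif pos < len(text) and text[pos] == head:
        # Terminal: it must equal the current input symbol.
        yield from parse(rest, text, pos + 1)

def accepts(text):
    """True if the whole input can be derived from S."""
    return any(end == len(text) for end in parse(["S"], text, 0))

print(accepts("abc"))
```

For the input abc the sketch first tries B → bc, finds no c left for the final symbol of S → aBc, and then succeeds with B → b. Note that this approach would loop forever on a left-recursive grammar.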

Predictive Parser

a grammar → (eliminate left recursion) → (left factor) → a grammar suitable for predictive parsing (an LL(1) grammar), with no 100% guarantee.

• When rewriting a non-terminal in a derivation step, a predictive parser can uniquely choose a production rule by just looking at the current symbol in the input string.

A → α1 | ... | αn        input: ... a .......   (a is the current token)

Left Factoring

• A predictive parser (a top-down parser without backtracking) insists that the grammar must be left-factored.

grammar → a new equivalent grammar suitable for predictive parsing

stmt → if expr then stmt else stmt | if expr then stmt

• When we see if, we cannot know which production rule to choose to rewrite stmt in the derivation.

• In general,
A → αβ1 | αβ2   where α is non-empty and the first symbols of β1 and β2 (if they have one) are different.
• When processing α, we cannot know whether to expand A to αβ1 or to αβ2. But if we rewrite the grammar as follows
A → αA’
A’ → β1 | β2
we can immediately expand A to αA’.
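
The rewrite into A → αA’ and A’ → β1 | β2 can be sketched as a small Python function. This is only an illustration under stated assumptions: the names common_prefix and left_factor are hypothetical, productions are lists of symbols, an empty list stands for ε, and the function performs a single round of factoring (the new non-terminal may itself need factoring again).

```python
def common_prefix(alternatives):
    """Longest sequence of symbols shared by the start of every alternative."""
    prefix = []
    for column in zip(*alternatives):
        if any(sym != column[0] for sym in column):
            break
        prefix.append(column[0])
    return prefix

def left_factor(name, alternatives):
    """One round of left factoring for the non-terminal `name`.

    Alternatives that start with the same symbol are merged into
    name -> prefix name', name' -> remainders (an empty list means epsilon).
    """
    groups = {}
    for alt in alternatives:
        key = alt[0] if alt else None          # group by first symbol
        groups.setdefault(key, []).append(alt)
    result = {name: []}
    fresh = name
    for key, group in groups.items():
        if key is None or len(group) == 1:     # nothing to factor
            result[name].extend(group)
            continue
        prefix = common_prefix(group)
        fresh += "'"                           # new non-terminal A', A'', ...
        result[name].append(prefix + [fresh])
        result[fresh] = [alt[len(prefix):] for alt in group]
    return result

stmt = [
    ["if", "expr", "then", "stmt", "else", "stmt"],
    ["if", "expr", "then", "stmt"],
]
print(left_factor("stmt", stmt))
```

For the stmt grammar this gives stmt → if expr then stmt stmt' and stmt' → else stmt | ε.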

Predictive Parser (example)

stmt → if ...... |
       while ...... |
       begin ...... |
       for .....

• When we are trying to rewrite the non-terminal stmt and the current token is if, we have to choose the first production rule.
• When we are trying to rewrite the non-terminal stmt, we can uniquely choose the production rule by just looking at the current token.
• Even after we eliminate the left recursion in the grammar and left factor it, the result may still not be suitable for predictive parsing (it may not be an LL(1) grammar).

Recursive Predictive Parsing
• Each non-terminal corresponds to a procedure.

Ex: A → aBb (This is the only production rule for A)

proc A {
- match the current token with a, and move to the next token;
- call ‘B’;
- match the current token with b, and move to the next token;
}

Recursive Predictive Parsing (cont.)
A → aBb | bAB

proc A {
case of the current token {
‘a’: - match the current token with a, and move to the next token;
- call ‘B’;
- match the current token with b, and move to the next token;
‘b’: - match the current token with b, and move to the next token;
- call ‘A’;
- call ‘B’;
}
}

Recursive Predictive Parsing (cont.)
• When to apply ε-productions.

A → aA | bB | ε

• If all other productions fail, we should apply an ε-production. For example, if the current token is not a or b, we may apply the ε-production.
• Most correct choice: we should apply an ε-production for a non-terminal A when the current token is in the follow set of A (the terminals that can follow A in sentential forms).

Recursive Predictive Parsing (Example)
A → aBe | cBd | C
B → bB | ε
C → f

proc A {
  case of the current token {
    a: - match the current token with a, and move to the next token;
       - call B;
       - match the current token with e, and move to the next token;
    c: - match the current token with c, and move to the next token;
       - call B;
       - match the current token with d, and move to the next token;
    f: - call C        (f is the first set of C)
  }
}

proc B {
  case of the current token {
    b: - match the current token with b, and move to the next token;
       - call B
    e,d: do nothing    (e and d are the follow set of B)
  }
}

proc C { match the current token with f, and move to the next token; }
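
The three procedures can be turned into a runnable Python sketch. The class name Parser, the ParseError exception, the helper accepts, and the use of $ as an end-of-input marker are assumptions made for illustration; each token is taken to be a single character.

```python
class ParseError(Exception):
    """Raised when the current token fits no production."""

class Parser:
    """Recursive predictive parser for A -> aBe | cBd | C, B -> bB | ε, C -> f."""

    def __init__(self, text):
        self.tokens = list(text) + ["$"]   # "$" marks the end of the input (assumption)
        self.pos = 0

    @property
    def current(self):
        return self.tokens[self.pos]

    def match(self, expected):
        """Match the current token with `expected` and move to the next token."""
        if self.current != expected:
            raise ParseError(f"expected {expected!r}, found {self.current!r}")
        self.pos += 1

    def A(self):
        if self.current == "a":
            self.match("a"); self.B(); self.match("e")
        elif self.current == "c":
            self.match("c"); self.B(); self.match("d")
        elif self.current == "f":          # f is in the first set of C
            self.C()
        else:
            raise ParseError(f"unexpected token {self.current!r} in A")

    def B(self):
        if self.current == "b":
            self.match("b"); self.B()
        elif self.current in ("e", "d"):   # follow set of B: apply B -> ε
            pass
        else:
            raise ParseError(f"unexpected token {self.current!r} in B")

    def C(self):
        self.match("f")

def accepts(text):
    """True if `text` is derived from A with no input left over."""
    parser = Parser(text)
    try:
        parser.A()
        parser.match("$")
    except ParseError:
        return False
    return True

print(accepts("abbe"), accepts("cd"), accepts("abd"))
```

No procedure ever undoes a choice: the current token alone selects the branch, which is what distinguishes this from the backtracking parser.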

Non-Recursive Predictive Parsing -- LL(1) Parser
• Non-recursive predictive parsing is table-driven.
• It is a top-down parser.
• It is also known as the LL(1) parser.

[Figure: the non-recursive predictive parser reads the input buffer, works with a stack, consults the parsing table, and produces the output.]

LL(1) Parser
input buffer
– the string to be parsed. We will assume that its end is marked with a special symbol $.

output
– a production rule representing a step of the derivation sequence (left-most derivation) of the string in the input buffer.

stack
– contains the grammar symbols
– at the bottom of the stack, there is a special end marker symbol $.
– initially the stack contains only the symbol $ and the starting symbol S ($S is the initial stack).
– when the stack is emptied (i.e., only $ is left in the stack), the parsing is completed.

parsing table
– a two-dimensional array M[A,a]
– each row is a non-terminal symbol
– each column is a terminal symbol or the special symbol $
– each entry holds a production rule.

LL(1) Parser – Parser Actions
• The symbol at the top of the stack (say X) and the current symbol in the input string (say a) determine the parser action.
• There are four possible parser actions.
1. If X and a are both $: the parser halts (successful completion).
2. If X and a are the same terminal symbol (different from $): the parser pops X from the stack and moves to the next symbol in the input buffer.
3. If X is a non-terminal: the parser looks at the parsing table entry M[X,a]. If M[X,a] holds a production rule X → Y1Y2...Yk, it pops X from the stack and pushes Yk,Yk-1,...,Y1 onto the stack. The parser also outputs the production rule X → Y1Y2...Yk to represent a step of the derivation.
4. None of the above: error.
– All empty entries in the parsing table are errors.
– If X is a terminal symbol different from a, this is also an error case.
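
The four actions map directly onto a short driver loop. The sketch below is illustrative only: the function name ll1_parse, the dictionary layout of the table, and the use of Python's SyntaxError for the error case are assumptions. The table is the one for the small grammar S → aBa, B → bB | ε, with an empty list standing for ε.

```python
EPS, END = "ε", "$"

# Parsing table M[X, a] for the grammar S -> aBa, B -> bB | ε.
# Each entry is the right-hand side to push; [] stands for ε.
TABLE = {
    ("S", "a"): ["a", "B", "a"],
    ("B", "b"): ["b", "B"],
    ("B", "a"): [],
}
NONTERMINALS = {"S", "B"}

def ll1_parse(tokens, table, nonterminals, start):
    """Return the productions output by the parser, or raise SyntaxError."""
    stack = [END, start]                 # initial stack: $S
    tokens = list(tokens) + [END]        # input buffer ends with $
    pos = 0
    output = []
    while True:
        top, a = stack[-1], tokens[pos]
        if top == END and a == END:      # action 1: successful completion
            return output
        if top in nonterminals:          # action 3: consult M[X, a]
            body = table.get((top, a))
            if body is None:             # empty table entry
                raise SyntaxError(f"no entry M[{top},{a}]")
            stack.pop()
            stack.extend(reversed(body)) # push Yk ... Y1 so Y1 ends on top
            output.append(f"{top} → {' '.join(body) or EPS}")
        elif top == a:                   # action 2: matching terminal
            stack.pop()
            pos += 1
        else:                            # action 4: terminal mismatch
            raise SyntaxError(f"expected {top!r}, found {a!r}")

print(ll1_parse("abba", TABLE, NONTERMINALS, "S"))
```

The right-hand side is pushed in reverse so that its first symbol is on top of the stack, which keeps the derivation left-most.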

LL(1) Parser – Example1
S → aBa
B → bB | ε

LL(1) Parsing Table
      a          b          $
S     S → aBa
B     B → ε      B → bB

stack   input   output
$S      abba$   S → aBa
$aBa    abba$
$aB     bba$    B → bB
$aBb    bba$
$aB     ba$     B → bB
$aBb    ba$
$aB     a$      B → ε
$a      a$
$       $       accept, successful completion

LL(1) Parser – Example1 (cont.)

Outputs: S → aBa   B → bB   B → bB   B → ε

Derivation (left-most): S ⇒ aBa ⇒ abBa ⇒ abbBa ⇒ abba

parse tree:
        S
      / | \
     a  B  a
       / \
      b   B
         / \
        b   B
            |
            ε
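
The link between the parser's outputs and the left-most derivation can be checked mechanically. The following sketch (the function name leftmost_derivation and the tuple encoding of productions are assumptions for illustration) replays the output productions, always rewriting the left-most non-terminal.

```python
def leftmost_derivation(start, productions, nonterminals):
    """Replay (head, body) productions on the left-most non-terminal.

    Returns the list of sentential forms, starting with the start symbol.
    """
    form = [start]
    steps = ["".join(form)]
    for head, body in productions:
        # Position of the left-most non-terminal in the current sentential form.
        index = next(i for i, sym in enumerate(form) if sym in nonterminals)
        if form[index] != head:
            raise ValueError(f"expected to rewrite {form[index]}, got {head}")
        form[index:index + 1] = body     # an empty body applies an ε-production
        steps.append("".join(form))
    return steps

# Outputs of the parser for the input abba, in order.
OUTPUTS = [
    ("S", ["a", "B", "a"]),
    ("B", ["b", "B"]),
    ("B", ["b", "B"]),
    ("B", []),
]
steps = leftmost_derivation("S", OUTPUTS, {"S", "B"})
print(" ⇒ ".join(steps))
```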
LL(1) Parser – Example2

E → TE’
E’ → +TE’ | ε
T → FT’
T’ → *FT’ | ε
F → (E) | id

       id          +            *            (           )          $
E      E → TE’                               E → TE’
E’                 E’ → +TE’                             E’ → ε     E’ → ε
T      T → FT’                               T → FT’
T’                 T’ → ε       T’ → *FT’                T’ → ε     T’ → ε
F      F → id                                F → (E)

LL(1) Parser – Example2
stack       input    output
$E          id+id$   E → TE’
$E’T        id+id$   T → FT’
$E’T’F      id+id$   F → id
$E’T’id     id+id$
$E’T’       +id$     T’ → ε
$E’         +id$     E’ → +TE’
$E’T+       +id$
$E’T        id$      T → FT’
$E’T’F      id$      F → id
$E’T’id     id$
$E’T’       $        T’ → ε
$E’         $        E’ → ε
$           $        accept

Constructing LL(1) Parsing Tables
• Two functions are used in the construction of LL(1) parsing tables:
– FIRST and FOLLOW

• FIRST(α) is the set of terminal symbols that occur as first symbols in strings derived from α, where α is any string of grammar symbols.
• If α derives ε, then ε is also in FIRST(α).

• FOLLOW(A) is the set of terminals that occur immediately after (follow) the non-terminal A in the strings derived from the starting symbol.
– a terminal a is in FOLLOW(A) if S ⇒* αAaβ
– $ is in FOLLOW(A) if S ⇒* αA

Compute FIRST for Any String X
• If X is a terminal symbol, then FIRST(X) = {X}.
• If X is a non-terminal symbol and X → ε is a production rule, then ε is in FIRST(X).
• If X is a non-terminal symbol and X → Y1Y2..Yn is a production rule:
– if a terminal a is in FIRST(Yi) and ε is in all FIRST(Yj) for j=1,...,i-1, then a is in FIRST(X).
– if ε is in all FIRST(Yj) for j=1,...,n, then ε is in FIRST(X).
• If X is ε, then FIRST(X) = {ε}.
• If X is Y1Y2..Yn:
– if a terminal a is in FIRST(Yi) and ε is in all FIRST(Yj) for j=1,...,i-1, then a is in FIRST(X).
– if ε is in all FIRST(Yj) for j=1,...,n, then ε is in FIRST(X).
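
These rules amount to a fixed-point computation, sketched below in Python for the expression grammar E → TE’, E’ → +TE’ | ε, T → FT’, T’ → *FT’ | ε, F → (E) | id. The names GRAMMAR, first_of_string and compute_first are assumptions for illustration; non-terminals are written with an ASCII apostrophe and an empty list stands for ε.

```python
EPS = "ε"

# Expression grammar; [] is the ε-production.
GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], []],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], []],
    "F":  [["(", "E", ")"], ["id"]],
}

def first_of_string(symbols, first):
    """FIRST of a string Y1 Y2 .. Yn, given FIRST of every non-terminal."""
    result = set()
    for sym in symbols:
        # A terminal's FIRST set is the terminal itself.
        sym_first = first[sym] if sym in first else {sym}
        result |= sym_first - {EPS}
        if EPS not in sym_first:
            return result          # Yi cannot vanish, so stop here
    result.add(EPS)                # every Yi can derive ε (or the string is empty)
    return result

def compute_first(grammar):
    """Apply the rules repeatedly until no FIRST set grows."""
    first = {nt: set() for nt in grammar}
    changed = True
    while changed:
        changed = False
        for nt, alternatives in grammar.items():
            for alt in alternatives:
                new = first_of_string(alt, first)
                if not new <= first[nt]:
                    first[nt] |= new
                    changed = True
    return first

FIRST = compute_first(GRAMMAR)
for nt, terminals in FIRST.items():
    print(nt, sorted(terminals))
```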

FIRST Example
E → TE’
E’ → +TE’ | ε
T → FT’
T’ → *FT’ | ε
F → (E) | id

FIRST(F) = {(, id}        FIRST(TE’) = {(, id}
FIRST(T’) = {*, ε}        FIRST(+TE’) = {+}
FIRST(T) = {(, id}        FIRST(ε) = {ε}
FIRST(E’) = {+, ε}        FIRST(FT’) = {(, id}
FIRST(E) = {(, id}        FIRST(*FT’) = {*}
                          FIRST(ε) = {ε}
                          FIRST((E)) = {(}
                          FIRST(id) = {id}

Compute FOLLOW (for non-terminals)
• If S is the start symbol, then $ is in FOLLOW(S).

• If A → αBβ is a production rule, then everything in FIRST(β) except ε is in FOLLOW(B).

• If (A → αB is a production rule) or (A → αBβ is a production rule and ε is in FIRST(β)), then everything in FOLLOW(A) is in FOLLOW(B).

We apply these rules until nothing more can be added to any follow set.
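
The three rules can be sketched as another fixed-point loop. To keep the example short, the FIRST sets of the expression grammar's non-terminals are written out by hand rather than computed; the names compute_follow and first_of_string are assumptions for illustration, and an empty list stands for ε.

```python
EPS, END = "ε", "$"

GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], []],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], []],
    "F":  [["(", "E", ")"], ["id"]],
}
# FIRST sets of the non-terminals, written out by hand for this grammar.
FIRST = {
    "E": {"(", "id"}, "E'": {"+", EPS},
    "T": {"(", "id"}, "T'": {"*", EPS},
    "F": {"(", "id"},
}

def first_of_string(symbols):
    """FIRST of a string of grammar symbols (terminals map to themselves)."""
    result = set()
    for sym in symbols:
        sym_first = FIRST.get(sym, {sym})
        result |= sym_first - {EPS}
        if EPS not in sym_first:
            return result
    result.add(EPS)
    return result

def compute_follow(grammar, start):
    follow = {nt: set() for nt in grammar}
    follow[start].add(END)                       # rule 1: $ follows the start symbol
    changed = True
    while changed:                               # repeat until nothing is added
        changed = False
        for head, alternatives in grammar.items():
            for alt in alternatives:
                for i, sym in enumerate(alt):
                    if sym not in grammar:       # only non-terminals have FOLLOW
                        continue
                    beta_first = first_of_string(alt[i + 1:])
                    new = beta_first - {EPS}     # rule 2: FIRST(β) without ε
                    if EPS in beta_first:        # rule 3: β is empty or derives ε
                        new |= follow[head]
                    if not new <= follow[sym]:
                        follow[sym] |= new
                        changed = True
    return follow

FOLLOW = compute_follow(GRAMMAR, "E")
for nt, terminals in FOLLOW.items():
    print(nt, sorted(terminals))
```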

FOLLOW Example
E → TE’
E’ → +TE’ | ε
T → FT’
T’ → *FT’ | ε
F → (E) | id

FOLLOW(E) = { $, ) }
FOLLOW(E’) = { $, ) }
FOLLOW(T) = { +, ), $ }
FOLLOW(T’) = { +, ), $ }
FOLLOW(F) = {+, *, ), $ }

Constructing LL(1) Parsing Table -- Algorithm
• for each production rule A → α of a grammar G
– for each terminal a in FIRST(α): add A → α to M[A,a]
– if ε is in FIRST(α): for each terminal a in FOLLOW(A), add A → α to M[A,a]
– if ε is in FIRST(α) and $ is in FOLLOW(A): add A → α to M[A,$]

• All other undefined entries of the parsing table are error entries.
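
A minimal sketch of this algorithm in Python, using the expression grammar with its FIRST and FOLLOW sets written out by hand. The names build_table and first_of_string and the dictionary layout are assumptions for illustration; each table entry is kept as a list so that multiply-defined entries would be visible.

```python
EPS, END = "ε", "$"

GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], []],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], []],
    "F":  [["(", "E", ")"], ["id"]],
}
# FIRST and FOLLOW of the non-terminals, written out by hand for this grammar.
FIRST = {
    "E": {"(", "id"}, "E'": {"+", EPS},
    "T": {"(", "id"}, "T'": {"*", EPS},
    "F": {"(", "id"},
}
FOLLOW = {
    "E": {END, ")"}, "E'": {END, ")"},
    "T": {"+", ")", END}, "T'": {"+", ")", END},
    "F": {"+", "*", ")", END},
}

def first_of_string(symbols):
    """FIRST of a right-hand side (terminals map to themselves)."""
    result = set()
    for sym in symbols:
        sym_first = FIRST.get(sym, {sym})
        result |= sym_first - {EPS}
        if EPS not in sym_first:
            return result
    result.add(EPS)
    return result

def build_table(grammar):
    """M[A, a] -> list of right-hand sides; a missing key is an error entry."""
    table = {}
    for head, alternatives in grammar.items():
        for alt in alternatives:
            alt_first = first_of_string(alt)
            lookaheads = alt_first - {EPS}       # terminals in FIRST(α)
            if EPS in alt_first:
                lookaheads |= FOLLOW[head]       # FOLLOW(A), which may include $
            for a in lookaheads:
                table.setdefault((head, a), []).append(alt)
    return table

TABLE = build_table(GRAMMAR)
for (head, a), bodies in sorted(TABLE.items()):
    print(f"M[{head},{a}] = {bodies}")
```

Because FOLLOW sets here already contain $ where appropriate, the second and third bullets collapse into one step in the sketch.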

Constructing LL(1) Parsing Table -- Example
E → TE’       FIRST(TE’) = {(, id}     E → TE’ goes into M[E,(] and M[E,id]
E’ → +TE’     FIRST(+TE’) = {+}        E’ → +TE’ goes into M[E’,+]

E’ → ε        FIRST(ε) = {ε}           no entry from FIRST
              but since ε is in FIRST(ε) and FOLLOW(E’) = {$, )}, E’ → ε goes into M[E’,$] and M[E’,)]

T → FT’       FIRST(FT’) = {(, id}     T → FT’ goes into M[T,(] and M[T,id]

T’ → *FT’     FIRST(*FT’) = {*}        T’ → *FT’ goes into M[T’,*]

T’ → ε        FIRST(ε) = {ε}           no entry from FIRST
              but since ε is in FIRST(ε) and FOLLOW(T’) = {$, ), +}, T’ → ε goes into M[T’,$], M[T’,)] and M[T’,+]

F → (E)       FIRST((E)) = {(}         F → (E) goes into M[F,(]

F → id        FIRST(id) = {id}         F → id goes into M[F,id]

LL(1) Grammars
• A grammar whose parsing table has no multiply-defined entries is said to be an LL(1) grammar.

LL(1):
– first L: the input is scanned from left to right
– second L: a left-most derivation is produced
– 1: one input symbol is used as a look-ahead symbol to determine the parser action

• An entry of the parsing table of a grammar may contain more than one production rule. In this case, we say that it is not an LL(1) grammar.

A Grammar which is not LL(1)
S → iCtSE | a        FOLLOW(S) = { $, e }
E → eS | ε           FOLLOW(E) = { $, e }
C → b                FOLLOW(C) = { t }

FIRST(iCtSE) = {i}
FIRST(a) = {a}
FIRST(eS) = {e}
FIRST(ε) = {ε}
FIRST(b) = {b}

     a        b        e                  i              t     $
S    S → a                                S → iCtSE
E                      E → eS, E → ε                            E → ε
C             C → b

two production rules for M[E,e]

Problem: ambiguity

A Grammar which is not LL(1) (cont.)
• What do we have to do if the resulting parsing table contains multiply-defined entries?
– If we didn’t eliminate left recursion, eliminate the left recursion in the grammar.
– If the grammar is not left factored, we have to left factor the grammar.
– If the new grammar’s parsing table still contains multiply-defined entries, that grammar is ambiguous or it is inherently not an LL(1) grammar.
• A left-recursive grammar cannot be an LL(1) grammar.
– A → Aα | β
– any terminal that appears in FIRST(β) also appears in FIRST(Aα) because Aα ⇒ βα.
– If β is ε, any terminal that appears in FIRST(α) also appears in FIRST(Aα) and FOLLOW(A).

• A grammar that is not left factored cannot be an LL(1) grammar.
– A → αβ1 | αβ2
– any terminal that appears in FIRST(αβ1) also appears in FIRST(αβ2).

• An ambiguous grammar cannot be an LL(1) grammar.

Properties of LL(1) Grammars
• A grammar G is LL(1) if and only if the following conditions hold for every two distinct production rules A → α and A → β:

1. α and β cannot both derive strings starting with the same terminal.

2. At most one of α and β can derive ε.

3. If β can derive ε, then α cannot derive any string starting with a terminal in FOLLOW(A).
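
The three conditions can be checked pairwise once the FIRST set of each alternative and the FOLLOW sets are known. The sketch below applies them to the grammar S → iCtSE | a, E → eS | ε, C → b, with the sets copied by hand. The name ll1_violations and the data layout are assumptions for illustration; "α can derive ε" is tested as "ε is in FIRST(α)".

```python
from itertools import combinations

EPS = "ε"

# FIRST of each alternative and FOLLOW of each non-terminal, copied by hand
# for the grammar S -> iCtSE | a, E -> eS | ε, C -> b.
ALTERNATIVES = {
    "S": {"iCtSE": {"i"}, "a": {"a"}},
    "E": {"eS": {"e"}, EPS: {EPS}},
    "C": {"b": {"b"}},
}
FOLLOW = {"S": {"$", "e"}, "E": {"$", "e"}, "C": {"t"}}

def ll1_violations(alternatives, follow):
    """Return (non-terminal, α, β, condition number) for every violated condition."""
    problems = []
    for head, alts in alternatives.items():
        for (alpha, first_a), (beta, first_b) in combinations(alts.items(), 2):
            if (first_a & first_b) - {EPS}:            # condition 1
                problems.append((head, alpha, beta, 1))
            if EPS in first_a and EPS in first_b:      # condition 2
                problems.append((head, alpha, beta, 2))
            # Condition 3 is not symmetric, so it is checked in both directions.
            if EPS in first_b and first_a & follow[head]:
                problems.append((head, alpha, beta, 3))
            if EPS in first_a and first_b & follow[head]:
                problems.append((head, beta, alpha, 3))
    return problems

print(ll1_violations(ALTERNATIVES, FOLLOW))
```

The single violation reported is condition 3 for E → eS and E → ε: e is in FOLLOW(E), which is the same conflict that shows up as two production rules in M[E,e].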
