Lecture04 Week06 TopDownParsing 1 - Compilers

This document summarizes a chapter about top-down parsing from a textbook on compiler construction. It discusses two forms of top-down parsers: predictive parsers and backtracking parsers. It also describes two top-down parsing algorithms: recursive descent parsing and LL(1) parsing. The chapter covers using EBNF notation to represent grammars for recursive descent parsing and how to construct a recursive descent parser for expressions. It provides an example of a working recursive descent parser for a simple calculator written in C code.

Uploaded by

Shehab Khaled Gad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views

Lecture04 Week06 TopDownParsing 1 - Compilers

Uploaded by

Shehab Khaled Gad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

COMPILER CONSTRUCTION

Principles and Practice

Kenneth C. Louden
4. Top-Down Parsing

PART ONE
The outline of this chapter
Concept of Top-Down Parsing(1)
• It parses an input string of tokens by tracing out
the steps in a leftmost derivation.
– And the implied traversal of the parse tree is a preorder
traversal and, thus, occurs from the root to the leaves.
• The example:
– number + number, and corresponds to the parse tree
exp

exp op exp

number + number
Concept of Top-Down Parsing(2)
The example: number + number, and corresponds to the
parse tree
• The above parse tree is corresponds to the leftmost
derivations:
(1) exp => exp op exp
(2) => number op exp
(3) => number + exp
(4) => number + number
exp

exp op exp

number + number
Two forms of Top-Down Parsers
• Predictive parsers:
– attempts to predict the next construction in the input
string using one or more look-ahead tokens
• Backtracking parsers:
– try different possibilities for a parse of the input,
backing up an arbitrary amount in the input if one
possibility fails.
– It is more powerful but much slower, unsuitable for
practical compilers.
Two kinds of Top-Down parsing
algorithms
• Recursive-descent parsing:
– is quite versatile and suitable for a handwritten parser.
• LL(1) parsing:
– The first “L” refers to the fact that it processes the input
from left to right;
– The second “L” refers to the fact that it traces out a
leftmost derivation for the input string;
– The number “1” means that it uses only one symbol of
input to predict the direction of the parse.
Other Contents
• Look-Ahead Sets
– First and Follow sets: are required by both recursive-
descent parsing and LL(1) parsing.
• A TINY Parser
– It is constructed by recursive-descent parsing algorithm.
• Error recovery methods
– The error recovery methods used in Top-Down parsing
will be described.
Contents
PART ONE
4.1 Top-Down Parsing by Recursive-Descent [More]
4.2 LL(1) Parsing [More]

PART TWO
4.3 First and Follow Sets
4.4 A Recursive-Descent Parser for the TINY
Language
4.5 Error Recovery in Top-Down Parsers
4.1 Top-Down Parsing by
Recursive-Descent
4.1.1 The Basic Method of
Recursive-Descent
The idea of Recursive-Descent
Parsing
• Viewing the grammar rule for a non-terminal A as
a definition for a procedure to recognize an A
• The right-hand side of the grammar for A specifies
the structure of the code for this procedure
• The Expression Grammar:
exp → exp addop term∣term
addop → + ∣-
term → term mulop factor ∣ factor
mulop →*
factor →(exp) ∣ number
A recursive-descent procedure that
recognizes a factor
procedure factor • The token keeps the current
begin next token in the input (one
case token of symbol of look-ahead)
( : match( ( );
exp;
match( )); • The Match procedure
number: matches the current next
match (number); token with its parameters,
else error; advances the input if it
end case; succeeds, and declares error
end factor if it does not
Match Procedure
• The Match procedure matches the current next token with
its parameters,
– advances the input if it succeeds, and declares error if it does not

procedure match( expectedToken);

begin
if token = expectedToken then
getToken;
else
error;
end if;
end match
Requiring the Use of EBNF
• The corresponding EBNF is
exp → term { addop term }
addop→ + | -
term → factor { mulop factor }
mulop→ *
factor → ( exp ) | number

• Writing recursive-decent procedure for the

remaining rules in the expression grammar is not
as easy for factor
The corresponding syntax diagrams
+
exp
addop
term

term addop -

term mulop
factor *

factor mulop

( exp )

factor

number
4.1.2 Repetition and Choice:
Using EBNF
An Example
procedure ifstmt; • The grammar rule for an if-
begin statement:
match( if ); If-stmt → if ( exp ) statement
match( ( ); ∣ if ( exp ) statement else statement
exp;
match( ) );
statement; • Could not immediately
if token = else then distinguish the two choices
match (else); because the both start with the
statement; token if
end if; • Put off the decision until we see
end ifstmt; the token else in the input
The EBNF of the if-statement
• If-stmt → if ( exp ) statement [ else statement]
Square brackets of the EBNF are translated into a test in the code for
ifstmt.
• if token = else then
• match (else);
• statement;
• endif;
• Notes
– EBNF notation is designed to mirror closely the actual code of a
recursive-descent parser,
– So a grammar should always be translated into EBNF if recursive-
descent is to be used.
• It is natural to write a parser that matches each else token
as soon as it is encountered in the input
EBNF for Simple Arithmetic
Grammar(1)
• The EBNF rule for exp → exp addop term∣term
– exp → term {addop term}
– Where, the curly bracket expressing repetition can be
translated into the code for a loop:
procedure exp;
begin
term;
while token = + or token = - do
match(token);
term;
end while;
end exp;
EBNF for Simple Arithmetic
Grammar(2)
• The EBNF rule for term:
– term → factor {mulop factor}
Becomes the code

procedure term;
begin
factor;
while token = * do
match(token);
factor;
end while;
end exp;
Left associatively implied by the
curly bracket
• The left associatively implied by the curly
bracket (and explicit in the original BNF) can
still be maintained within this code
function exp: integer; case token of
+ : match(+);
var temp: integer;
temp:=temp+term;
begin -:match(-);
temp:=term; temp:=temp-term;
while token=+ or token = - do end case;
end while;
return temp;
end exp;
A working simple calculator in C
code(1)

/*Simple integer arithmetic calculator according to the EBNF;

<exp> → <term> { <addop> <term>}
<addop> → + ∣ -
<term>→ <factor> { <mulop> <factor> }
<mulop> → *
<factor> → ( <exp> ) ∣ Number
inputs a line of text from stdin
outputs “error” or the result.
*/
A working simple calculator in C
code(2)
#include <stdio.h>
#include <stdio.h>
char token; /* global token variable */
/*function prototype for recursive calls*/
int exp(void);
int term(void);
int factor(void);
void error(void)
{fprint(stderr, “error\n”);
exit(1);
}
A working simple calculator in C
code(3)
void match(char expectedToken)
{if (token==expectedToken) token=getchar();
else error();
}
main()
{ int result;
token=getchar();/*load token with first character for lookahead*/
result=exp();
if (token==’\n’) /*check for end of line*/
printf(“Result = %d\n”, result);
else error(); /*extraneous chars on line*/
return 0;
}
A working simple calculator in C
code(4)
int exp(void)
{ int temp =term();
while ((token==’+’) || token==’-‘))
switch (token) {
case ‘+’: match (‘+’);
temp+=term();
break;
case ‘-‘: match (‘-‘);
temp-=term();
break;
}
return temp;
}
A working simple calculator in C
code(5)
int term(void)
{int temp=factor();
while (token==’*’){
match(‘*’);
temp*=factor();
}
return temp;
}
A working simple calculator in C
code(5)
int factor(void)
{ int temp;
if (token==’(‘) {
match (‘(‘);
temp = exp();
match(‘)’);
}
else if (isdigit(token)){
ungetc(token,stdin);
scanf(“%d”,&temp);
token = getchar();
}
else error();
return temp;
}
Some Notes
• The method of turning grammar rule in EBNF into
code is quite powerful.
• There are a few pitfalls, and care must be taken in
scheduling the actions within the code.
• In the previous pseudo-code for exp:
(1) The match of operation should be before repeated calls
to term;
(2) The global token variable must be set before the parse
begins;
(3) The getToken must be called just after a successful
test of a token
Construction of the syntax tree
• The expression: 3+4+5

+ 5

3 4
The pseudo-code for constructing
the syntax tree(1)
function exp : syntaxTree;
var temp, newtemp: syntaxTree;
begin
temp:=term;
while token=+ or token = - do
case token of
+ : match(+);
newtemp:=makeOpNode(+);
leftChild(newtemp):=temp;
rightChild(newtemp):=term;
temp=newtemp;
The pseudo-code for constructing
the syntax tree(2)

-:match(-);
newtemp:=makeOpNode(-);
leftChild(newtemp):=temp;
rightChild(newtemp):=term;
temp=newtemp;
end case;
end while;
return temp;
end exp;
A simpler one
function exp : syntaxTree;
var temp, newtemp: syntaxTree;
begin
temp:=term;
while token=+ or token = - do
newtemp:=makeOpNode(token);
match(token);
leftChild(newtemp):=temp;
rightChild(newtemp):=term;
temp=newtemp;
end while;
return temp;
end exp;
The pseudo-code for the if-statement
procedure (1)
function ifstatement: syntaxTree;
var temp:syntaxTree;
begin
match(if);
match(();
temp:= makeStmtNode(if);
testChild(temp):=exp;
match());
thenChild(temp):=statement;
The pseudo-code for the if-statement
procedure (2)
if token= else then
match(else);
elseChild(temp):=statement;
else
ElseChild(temp):=nil;
end if;
end ifstatement
4.1.3 Further Decision Problems
More formal methods to deal with
complex situation
(1) It may be difficult to convert a grammar in
BNF into EBNF form;
(2) It is difficult to decide when to use the
choice A →αand the choice A →β;
if both α andβ begin with non-terminals.
Such a decision problem requires the
computation of the First Sets.
More formal methods to deal with
complex situation
(3) It may be necessary to know what token legally
coming from the non-terminal A, in writing the
code for an ε-production: A→ε. Such tokens
indicate A may disappear at this point in the
parse. This set is called the Follow Set of A.
(4) It requires computing the First and Follow sets in
order to detect the errors as early as possible.
Such as “)3-2)”, the parse will descend from exp
to term to factor before an error is reported.
4.2 LL(1) PARSING
4.2.1 The Basic Method of LL(1)
Parsing
Main idea
• LL(1) Parsing uses an explicit stack rather than
recursive calls to perform a parse
• An example:
– a simple grammar for the strings of balanced
parentheses:
S→(S) S∣ε
• The following table shows the actions of a top-
down parser given this grammar and the string ( )
Table of Actions
Steps Parsing Stack Input Action
1 $S ()$ S→(S) S
2 $S)S( ()$ match
3 $S)S )$ S→ε
4 $S) )$ match
5 $S $ S→ε
6 $ $ accept
General Schematic
• A top-down parser begins by pushing the start symbol onto
the stack
• It accepts an input string if, after a series of actions, the
stack and the input become empty
• A general schematic for a successful top-down parse:
$ StartSymbol Inputstring$
… … //one of the
two actions
… … //one of the two actions
$ $ accept
Two Actions
• The two actions
– Generate: Replace a non-terminal A at the top of the stack by a
string α(in reverse) using a grammar rule A →α, and
– Match: Match a token on top of the stack with the next input token.
• The list of generating actions in the above table:
S => (S)S [S→(S) S]
=> ( )S [S→ε]
=> ( ) [S→ε]
• Which corresponds precisely to the steps in a leftmost
derivation of string ( ).
• This is the characteristic of top-down parsing.
4.2.2 The LL(1) Parsing Table
and Algorithm
4.2.3 Left Recursion Removal
and Left Factoring
4.2.4 Syntax Tree Construction in
LL(1) Parsing
End of Part One

THANKS

Toefl Exercise 6-8
100% (7)
Toefl Exercise 6-8
5 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
201 Arabic Verbs - Fully Conjugated in All The Forms
86% (7)
201 Arabic Verbs - Fully Conjugated in All The Forms
212 pages
Chapter 4 Top-Down Parsing: Outline
No ratings yet
Chapter 4 Top-Down Parsing: Outline
17 pages
Compiler Principle and Technology: Mr. Aruna Malik BIT (Mesra) Ranchi, Off Campus NOIDA
No ratings yet
Compiler Principle and Technology: Mr. Aruna Malik BIT (Mesra) Ranchi, Off Campus NOIDA
86 pages
BKS Unit II - V - Recursive Decent Parser
No ratings yet
BKS Unit II - V - Recursive Decent Parser
28 pages
Top Down Translation
No ratings yet
Top Down Translation
96 pages
Top Down Parsing
No ratings yet
Top Down Parsing
27 pages
4 - Top-Down
No ratings yet
4 - Top-Down
67 pages
Parsing
No ratings yet
Parsing
9 pages
7- Parsing Techniques- Top Down Parsing
No ratings yet
7- Parsing Techniques- Top Down Parsing
47 pages
5.ll-lr
No ratings yet
5.ll-lr
53 pages
CD Unit3
No ratings yet
CD Unit3
74 pages
L13Parsing 5 PDF
No ratings yet
L13Parsing 5 PDF
25 pages
Pec 31 Acd Material
No ratings yet
Pec 31 Acd Material
12 pages
Ch02 Programming Language Syntax 4e 2
No ratings yet
Ch02 Programming Language Syntax 4e 2
64 pages
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
No ratings yet
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
9 pages
Recap: Mooly Sagiv
No ratings yet
Recap: Mooly Sagiv
42 pages
Parsing, Lexical Analysis, and Tools: William Cook
No ratings yet
Parsing, Lexical Analysis, and Tools: William Cook
16 pages
Chapter 5 Intro to Top Down Parsing
No ratings yet
Chapter 5 Intro to Top Down Parsing
50 pages
CSC-437 Chapter 4
No ratings yet
CSC-437 Chapter 4
65 pages
CSE 4102 Syntax Analysis or Parsing
No ratings yet
CSE 4102 Syntax Analysis or Parsing
73 pages
Parser Final
No ratings yet
Parser Final
19 pages
Chapter 3
No ratings yet
Chapter 3
96 pages
Chapter 2 - Simple Syntax Directed Translator
No ratings yet
Chapter 2 - Simple Syntax Directed Translator
39 pages
Top Down Parsing
No ratings yet
Top Down Parsing
37 pages
Parsers
No ratings yet
Parsers
24 pages
Chapter 4 - Syntax Analysis
No ratings yet
Chapter 4 - Syntax Analysis
82 pages
Predetive Parse
No ratings yet
Predetive Parse
7 pages
04 Syntax Analysis
No ratings yet
04 Syntax Analysis
112 pages
Lexical and syntax analysis
No ratings yet
Lexical and syntax analysis
63 pages
Chapter 4 - Syntax Analysis CIE1
No ratings yet
Chapter 4 - Syntax Analysis CIE1
69 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
Compiler Design Unit-2
No ratings yet
Compiler Design Unit-2
29 pages
Top Down Parser
No ratings yet
Top Down Parser
111 pages
Syntax Analysis: CD: Compiler Design
No ratings yet
Syntax Analysis: CD: Compiler Design
90 pages
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
No ratings yet
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
16 pages
Session 3
No ratings yet
Session 3
18 pages
Chapter 4 - Syntax Analysis
No ratings yet
Chapter 4 - Syntax Analysis
73 pages
Syntax Analysis Parsing (1)
No ratings yet
Syntax Analysis Parsing (1)
9 pages
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
No ratings yet
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
26 pages
Unit 2-Part B
No ratings yet
Unit 2-Part B
73 pages
Predictive Parsing: Recall The Main Idea of Top-Down Parsing
No ratings yet
Predictive Parsing: Recall The Main Idea of Top-Down Parsing
19 pages
Predictive Parsing: Recall The Main Idea of Top-Down Parsing
No ratings yet
Predictive Parsing: Recall The Main Idea of Top-Down Parsing
19 pages
Parser
No ratings yet
Parser
40 pages
Lec 09-Left Recursion Removal
No ratings yet
Lec 09-Left Recursion Removal
23 pages
4 Parsing
No ratings yet
4 Parsing
55 pages
Chapter-4 - CS-411 Compiler Construction
No ratings yet
Chapter-4 - CS-411 Compiler Construction
8 pages
Parsing
No ratings yet
Parsing
33 pages
Top Down PDF
No ratings yet
Top Down PDF
49 pages
Unit III
No ratings yet
Unit III
29 pages
Context Free Grammar & Parser
No ratings yet
Context Free Grammar & Parser
10 pages
Chapter – 3
No ratings yet
Chapter – 3
46 pages
parsing technique baar baar
No ratings yet
parsing technique baar baar
29 pages
Compiler Design Syntax Analysis Top Down
No ratings yet
Compiler Design Syntax Analysis Top Down
34 pages
CSC 4181 Compiler Construction Parsing
No ratings yet
CSC 4181 Compiler Construction Parsing
53 pages
Unit 2 Basic Parsing Techniques
No ratings yet
Unit 2 Basic Parsing Techniques
34 pages
Top-Down Parsing PDF
No ratings yet
Top-Down Parsing PDF
6 pages
Unit - Ii 2.1 Syntax Analysis
No ratings yet
Unit - Ii 2.1 Syntax Analysis
122 pages
Presented by Jyoti Thakur
No ratings yet
Presented by Jyoti Thakur
31 pages
CD UNIT II
No ratings yet
CD UNIT II
11 pages
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Modal Verbs Exercises
No ratings yet
Modal Verbs Exercises
3 pages
S2 English2-1
No ratings yet
S2 English2-1
15 pages
10 кл ҚМЖ
No ratings yet
10 кл ҚМЖ
316 pages
Computron Libro Basico 4 Interiores
No ratings yet
Computron Libro Basico 4 Interiores
32 pages
L4 Grammar Revision 4 Student
No ratings yet
L4 Grammar Revision 4 Student
7 pages
(PDF) Factsheets On Subject Agreements SASE English Reviewer - Compress PDF
No ratings yet
(PDF) Factsheets On Subject Agreements SASE English Reviewer - Compress PDF
3 pages
PP For Sem II GR Vi Mar-Apr 2023-24
No ratings yet
PP For Sem II GR Vi Mar-Apr 2023-24
3 pages
Gerund Dzakwan 12 Ips 1
No ratings yet
Gerund Dzakwan 12 Ips 1
3 pages
Don Bosco Splendid Home 2 Terminal Examination-2017 Class-VIII English - Ii
No ratings yet
Don Bosco Splendid Home 2 Terminal Examination-2017 Class-VIII English - Ii
3 pages
PREPOSITION
No ratings yet
PREPOSITION
2 pages
Infinitives
No ratings yet
Infinitives
10 pages
Observing Rules in Constructing An Inverted Word Order
No ratings yet
Observing Rules in Constructing An Inverted Word Order
14 pages
Find The Pieces Predicate Adjective Noun and Verb
No ratings yet
Find The Pieces Predicate Adjective Noun and Verb
2 pages
There Is: TH Ere I S - TH Ere A Re
No ratings yet
There Is: TH Ere I S - TH Ere A Re
9 pages
Comparative and Superlative
100% (1)
Comparative and Superlative
3 pages
Unit 1 Grammar
No ratings yet
Unit 1 Grammar
1 page
A Course in English Lexicology
100% (4)
A Course in English Lexicology
63 pages
BSQ-1 Gr.5, Var.A
No ratings yet
BSQ-1 Gr.5, Var.A
3 pages
B2 Prepositions
No ratings yet
B2 Prepositions
8 pages
Irregular Verbs
No ratings yet
Irregular Verbs
2 pages
Speech Act
No ratings yet
Speech Act
3 pages
Generic Struktur Narrative
No ratings yet
Generic Struktur Narrative
3 pages
English Step by Step
No ratings yet
English Step by Step
145 pages
B2 art
No ratings yet
B2 art
2 pages
GCSE German Textbook AQA
No ratings yet
GCSE German Textbook AQA
44 pages
Direct Indirect Speech Ppt Ws Ans Key.pptx
No ratings yet
Direct Indirect Speech Ppt Ws Ans Key.pptx
39 pages
Sample Detailed Lesson Plan in English For Teaching Demonstration
No ratings yet
Sample Detailed Lesson Plan in English For Teaching Demonstration
7 pages
Simple Past or Present Perfect - Test: A - Put in The Verbs in Brackets Into The Gaps
No ratings yet
Simple Past or Present Perfect - Test: A - Put in The Verbs in Brackets Into The Gaps
4 pages

Lecture04 Week06 TopDownParsing 1 - Compilers

Uploaded by

Lecture04 Week06 TopDownParsing 1 - Compilers

Uploaded by

COMPILER CONSTRUCTION

Principles and Practice

procedure match( expectedToken);

• Writing recursive-decent procedure for the

/*Simple integer arithmetic calculator according to the EBNF;

You might also like