Cs1622 Parsing Part2 Bun

Uploaded by

Ungureanu Ioana Mădălina

9/18/2012

Context Free Grammars

Derivations vs Parses
A grammar is used either to derive strings or to construct a parser.

A derivation is a sequence of applications of rules
• Starting from the start symbol
• S ⇒ ... ⇒ ... ⇒ ... ⇒ (sentence)

Leftmost and rightmost derivations
• At each derivation step, a leftmost derivation always replaces the leftmost non-terminal symbol
• A rightmost derivation always replaces the rightmost one

Example

E → E * E | E + E | ( E ) | id

Leftmost derivation:
E ⇒ E + E ⇒ E * E + E ⇒ id * E + E ⇒ id * id + E ⇒ ... ⇒ id * id + id * id

Rightmost derivation:
E ⇒ E + E ⇒ E + E * E ⇒ E + E * id ⇒ E + id * id ⇒ ... ⇒ id * id + id * id

Parse Tree

Parse tree:
• Internal nodes are non-terminals
• Leaves are terminals

It filters out the order of replacement and describes the hierarchy.

The same parse tree results from both the rightmost and leftmost derivations in the previous example:

              E
           /  |  \
        E     +     E
      / | \       / | \
     E  *  E     E  *  E
     |     |     |     |
     id    id    id    id
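
The two derivations above can be replayed mechanically. The following is a minimal Python sketch, not part of the original slides: the helper names `derive_step` and `derive` are hypothetical, and each sentential form is simply a list of symbols. The same sequence of rule choices is applied twice, once always rewriting the leftmost non-terminal and once the rightmost.

```python
# Running example grammar: E -> E * E | E + E | ( E ) | id
NONTERMINALS = {"E"}

def derive_step(form, rhs, leftmost=True):
    """Replace the leftmost (or rightmost) non-terminal in `form` with `rhs`."""
    positions = [i for i, sym in enumerate(form) if sym in NONTERMINALS]
    if not positions:
        raise ValueError("no non-terminal left to replace")
    i = positions[0] if leftmost else positions[-1]
    return form[:i] + rhs + form[i + 1:]

def derive(start, rules, leftmost=True):
    """Apply the right-hand sides in `rules` in order; return every sentential form."""
    forms = [[start]]
    for rhs in rules:
        forms.append(derive_step(forms[-1], rhs, leftmost))
    return [" ".join(form) for form in forms]

PLUS, TIMES, ID = ["E", "+", "E"], ["E", "*", "E"], ["id"]
CHOICES = [PLUS, TIMES, ID, ID, TIMES, ID, ID]

leftmost_forms = derive("E", CHOICES, leftmost=True)
rightmost_forms = derive("E", CHOICES, leftmost=False)

for form in leftmost_forms:
    print(form)
```

Both runs end in the same sentence and correspond to the same parse tree; only the order in which the non-terminals are replaced differs.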

Different Parse Trees

Although the two derivations above share one parse tree, the grammar admits several different parse trees for id * id + id * id (five in total, one for each way of grouping the three operators). Three of them:

     E
   / | \
  E  *  E
  |   / | \
  id E  +  E
     |   / | \
     id E  *  E
        |     |
        id    id

              E
            / | \
           E  *  E
         / | \   |
        E  +  E  id
      / | \   |
     E  *  E  id
     |     |
     id    id

              E
           /  |  \
        E     +     E
      / | \       / | \
     E  *  E     E  *  E
     |     |     |     |
     id    id    id    id

Ambiguity

A grammar G is ambiguous if there exists a string str ∈ L(G) that has more than one parse tree.

We prefer unambiguous grammars.

Ambiguity is a property of the grammar, not of the language.

It is often possible to rewrite the grammar to remove ambiguity.
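
To see how many parse trees the ambiguous grammar really allows, one can enumerate them. This is a small Python sketch under stated assumptions: the function name `bracketings` is hypothetical, and each parse tree is written as a fully parenthesized string rather than drawn as a tree. Because both operators come from rules of the shape E → E op E, any operator in the string may sit at the root.

```python
def bracketings(tokens):
    """Return one fully parenthesized string per parse tree of `tokens`
    under E -> E * E | E + E | id (operands at even positions, operators at odd)."""
    if len(tokens) == 1:
        return [tokens[0]]                      # E -> id
    results = []
    for i in range(1, len(tokens), 2):          # choose the operator at the root
        for left in bracketings(tokens[:i]):
            for right in bracketings(tokens[i + 1:]):
                results.append(f"({left} {tokens[i]} {right})")
    return results

trees = bracketings("id * id + id * id".split())
for tree in trees:
    print(tree)
print(len(trees), "parse trees")
```

Only one of these groupings, ((id * id) + (id * id)), matches the usual precedence of * over +, which is what the rewriting methods that follow are meant to enforce.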


Removing Ambiguity

Method 1: Specify precedence.

You can build precedence into the grammar by having a different non-terminal for each precedence level:
• Lowest level: highest in the tree (lowest precedence)
• Highest level: lowest in the tree
• Same level: same precedence

For the previous example,
E → E * E | E + E | ( E ) | id
rewrite it to:
E → E + T | T
T → T * F | F
F → id | ( E )

Parse tree for id * id + id * id under the rewritten grammar:

                E
           /    |    \
          E     +     T
          |         / | \
          T        T  *  F
        / | \      |     |
       T  *  F     F     id
       |     |     |
       F     id    id
       |
       id

Removing Ambiguity

Method 2: Specify associativity.
• When recursion is allowed, we need to specify associativity

For the previous example,
E → E * E
allows both right and left associativity.

We can rewrite it to force it either way:
Left associative:
E → E * T
Right associative:
E → T * E

In a programming language, most operators are left associative.
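
The effect of the stratified grammar E → E + T | T, T → T * F | F, F → id | ( E ) can be checked with a small recognizer that returns the grouping it finds. This Python sketch rests on assumptions that are not in the slides: the left-recursive rules are realized as loops (a common hand-coding trick), and the result is a nested tuple showing the grouping rather than a full parse tree. The name `parse` and its helpers are hypothetical.

```python
def parse(tokens):
    """Group `tokens` according to E -> E + T | T, T -> T * F | F, F -> id | ( E )."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def eat(expected):
        nonlocal pos
        if peek() != expected:
            raise SyntaxError(f"expected {expected!r}, found {peek()!r}")
        pos += 1

    def parse_F():
        if peek() == "(":
            eat("(")
            node = parse_E()
            eat(")")
            return node
        eat("id")
        return "id"

    def parse_T():
        node = parse_F()
        while peek() == "*":            # T -> T * F, built left to right
            eat("*")
            node = (node, "*", parse_F())
        return node

    def parse_E():
        node = parse_T()
        while peek() == "+":            # E -> E + T, built left to right
            eat("+")
            node = (node, "+", parse_T())
        return node

    tree = parse_E()
    if peek() is not None:
        raise SyntaxError(f"unexpected token {peek()!r}")
    return tree

print(parse("id * id + id * id".split()))
print(parse("id * id * id".split()))
```

Each input now has exactly one grouping: * binds tighter than + because T sits below E, and repeated operators group to the left because the recursion is on the left.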

Syntax Analysis

We’ve only discussed grammars from the point of view of derivation.

What is syntax analysis?
• To process an input string for a given grammar, and compose the derivation if the string is in the language
• Two subtasks:
  • to determine whether the string is in the language
  • to construct the parse tree

Is it possible to construct such a parser?

Types of Parsers

Universal parser
• Can parse any CFG (e.g. Earley’s algorithm)
• Powerful, but too inefficient for use in production compilers

Top-down parser
• It is goal-directed: it expands the start symbol to the given sentence
• Only works for a certain class of grammars
• Starts from the root of the parse tree and works down to the leaves
• Finds a leftmost derivation
• Can be implemented efficiently by hand

Types of Parsers

Bottom-up parser
• It tries to reduce the input string to the start symbol
• Works for a wider class of grammars
• Starts at the leaves and builds the tree in bottom-up fashion
• Finds a rightmost derivation in reverse order
• Usually generated automatically by a tool

Parser Output

We have a choice of outputs from the parser:
• A parse tree (concrete syntax tree), or
• An abstract syntax tree

Example grammar:
E → int | ( E ) | E + E
and an input:
5 + ( 2 + 3 )
After lexical analysis, we have a sequence of tokens:
INT:5 ‘+’ ‘(’ INT:2 ‘+’ INT:3 ‘)’
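
The lexical-analysis step for this example can be sketched in a few lines. This is an illustrative Python stand-in, not the course's lexer: the name `tokenize` and the (kind, value) tuple layout are assumptions, and only integers, '+', '(' and ')' are recognized.

```python
import re

# Skip whitespace, then match either a run of digits or one other character.
TOKEN_RE = re.compile(r"\s*(?:(\d+)|(\S))")

def tokenize(text):
    """Turn the input text into a list of (kind, value) tokens."""
    tokens = []
    for number, other in TOKEN_RE.findall(text):
        if number:
            tokens.append(("INT", int(number)))
        elif other in "+()":
            tokens.append((other, None))
        else:
            raise ValueError(f"unexpected character {other!r}")
    return tokens

print(tokenize("5 + ( 2 + 3 )"))
```

The parser then works on this token sequence and never sees the raw characters.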


Parser Output

The parse tree traces the operation of the parser.

It captures the nested structure but contains too much information:
• Parentheses (precedence is already encoded in the tree hierarchy)
• Single-successor nodes (could be collapsed or omitted)

           E
        /  |  \
       E   +   E
       |     / | \
     INT:5  (  E  )
             / | \
            E  +  E
            |     |
          INT:2 INT:3

We prefer an Abstract Syntax Tree (AST):
• AST also captures the nested structure.
• AST abstracts from the concrete syntax.
• AST is more compact and easier to use.

     PLUS
     /  \
    5   PLUS
        /  \
       2    3

Summary

We specify the syntax structure using a CFG even if the programming language itself is not context free.

A parser can:
• Answer whether an input str ∈ L(G)
• and build a parse tree
• or build an AST instead
• and pass it to the rest of the compiler.
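
The step from the parse tree of 5 + ( 2 + 3 ) to its AST can be written as a short tree walk. This is a minimal Python sketch under assumptions of its own: trees are nested tuples whose first element is the node label, and the function name `to_ast` is hypothetical. It drops the parentheses and collapses the single-successor E → int nodes, the two kinds of excess information listed for the parse tree.

```python
# Parse tree for 5 + ( 2 + 3 ) under E -> int | ( E ) | E + E.
# Interior nodes are ("E", child, ...); leaves are (kind, value) tokens.
parse_tree = (
    "E",
    ("E", ("INT", 5)),
    ("+", None),
    ("E",
        ("(", None),
        ("E", ("E", ("INT", 2)), ("+", None), ("E", ("INT", 3))),
        (")", None)),
)

def to_ast(node):
    """Convert a concrete parse tree into a compact AST."""
    kind, *children = node
    if kind == "INT":
        return children[0]                  # leaf: keep only the value
    if kind != "E":
        raise ValueError(f"unexpected node {kind!r}")
    if len(children) == 1:                  # E -> int: collapse the single-successor node
        return to_ast(children[0])
    if children[0][0] == "(":               # E -> ( E ): drop the parentheses
        return to_ast(children[1])
    left, _plus, right = children           # E -> E + E
    return ("PLUS", to_ast(left), to_ast(right))

print(to_ast(parse_tree))
```

The nesting of the result carries the grouping that the parentheses expressed in the source text, so nothing the rest of the compiler needs is lost.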

Parsing

We will study two approaches:

Top-down
• Easier to understand and implement manually

Bottom-up
• More powerful, can be implemented automatically

Top Down Parsers

Recursive descent
• Simple to implement, uses backtracking

Predictive parser
• Predicts the rule based on the first m symbols, without backtracking
• Places restrictions on the grammar to avoid backtracking

LL(k): predictive parser for LL(k) grammars
• Non-recursive, with only k symbols of lookahead
• Table driven, efficient

Parsing Using Backtracking

Approach: For a non-terminal in the derivation, productions are tried in some order until
• A production is found that generates a portion of the input, or
• No production is found that generates a portion of the input, in which case backtrack to the previous non-terminal.

Parsing fails if no production for the start symbol generates the entire input.

Terminals of the derivation are compared against the input.
• Match: advance input, continue parsing
• Mismatch: backtrack, or fail


Parsing Using Backtracking

Grammar:
E → T + E | T
T → int * T | int | ( E )

Input string:
int * int

Start symbol: E

Assume:
• When there are alternative rules, try the rightmost rule first

Parsing Using Backtracking

Input       Derivation                          Action
int * int   E                                   Pick rightmost rule E → T
int * int   E ⇒ T                               Pick rightmost rule T → ( E )
int * int   E ⇒ T ⇒ ( E )                       “(” does not match “int”
int * int   E ⇒ T                               Failure, backtrack one level.
int * int   E ⇒ T ⇒ int                         Pick next rule T → int
int * int   E ⇒ T ⇒ int                         “int” matches input “int”
int * int   E ⇒ T                               We have more tokens, so this is failure too. Backtrack.
int * int   E ⇒ T ⇒ int * T                     Pick next rule T → int * T. Match “int *”. Expand T.
int * int   E ⇒ T ⇒ int * T ⇒ int * ( E )       Pick rightmost rule T → ( E )
int * int   E ⇒ T ⇒ int * T ⇒ int * ( E )       “(” does not match input “int”
int * int   E ⇒ T ⇒ int * T                     Failure, backtrack one level.
int * int   E ⇒ T ⇒ int * T ⇒ int * int         Pick next rule T → int
int * int   E ⇒ T ⇒ int * T ⇒ int * int         Match whole input. Accept.
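
The trace above can be reproduced by a small backtracking recognizer. This is a Python sketch under stated assumptions, not the course's implementation: the grammar is stored in a dictionary, the names `expand` and `accepts` are hypothetical, and backtracking is expressed with generators, so a failed alternative simply yields nothing and the next one is tried.

```python
# Alternatives are listed in grammar order; the parser tries them right to left,
# matching the assumption "try the rightmost rule first".
GRAMMAR = {
    "E": [["T", "+", "E"], ["T"]],
    "T": [["int", "*", "T"], ["int"], ["(", "E", ")"]],
}

def expand(symbols, tokens, pos):
    """Yield every input position reachable after matching `symbols` from `pos`."""
    if not symbols:
        yield pos
        return
    first, rest = symbols[0], symbols[1:]
    if first in GRAMMAR:                               # non-terminal: try each rule
        for rhs in reversed(GRAMMAR[first]):
            for mid in expand(rhs, tokens, pos):
                yield from expand(rest, tokens, mid)
    elif pos < len(tokens) and tokens[pos] == first:   # terminal: match, advance input
        yield from expand(rest, tokens, pos + 1)
    # otherwise: mismatch, this branch yields nothing and the caller backtracks

def accepts(tokens):
    """True if the start symbol E generates exactly the whole token list."""
    return any(end == len(tokens) for end in expand(["E"], tokens, 0))

print(accepts("int * int".split()))
```

For int * int the recognizer first finds the partial match E ⇒ T ⇒ int, rejects it because input remains, and then succeeds through T → int * T. This terminates only because the grammar has no left recursion.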

Implementation

Create a procedure for each non-terminal:
1. Checks if the input symbol matches a terminal symbol in the grammar rule
2. Calls other procedures when non-terminals are part of the rule
3. If the end of the procedure is reached, success is reported to the caller

E → int | ( E ) | E + E

void E() {
  switch(lexer.yylex()) {
    case INT: eat(INT); break;
    case LPAREN: eat(LPAREN); E(); eat(RPAREN); break;
    case ???: E(); eat(PLUS); E(); break;
  }
}

Problems

Unclear what to label the last case with.

What if we don’t label it at all and make it the default?

Consider parsing 5 + 5:
We’d find INT and be done with the parse with more input to consume. We’d want to backtrack, but there’s no prior function call to return to.

What if we put the call to E() prior to the switch/case?

Then E() would always make a recursive call to E() with no end case for the recursion.
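
The runaway recursion is easy to reproduce. The following is a deliberately broken Python sketch (a stand-in for the Java-style procedure, with hypothetical names `parse_E` and `recursion_never_ends`): the procedure for E → E + E | int | ( E ) tries the left-recursive rule first, so it calls itself before consuming any token.

```python
def parse_E(tokens, pos=0):
    """Naive procedure for E that tries E -> E + E before anything else."""
    pos = parse_E(tokens, pos)          # left-recursive call: no input consumed yet
    if pos < len(tokens) and tokens[pos] == "+":
        return parse_E(tokens, pos + 1)
    return pos

def recursion_never_ends(tokens):
    """Return True if parse_E exhausts the call stack on `tokens`."""
    try:
        parse_E(tokens)
    except RecursionError:
        return True
    return False

print(recursion_never_ends(["INT", "+", "INT"]))
```

The first statement of `parse_E` never reaches a base case, so Python stops it with a RecursionError regardless of the input.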

Left Recursion

A production is left recursive if the same nonterminal that appears on the LHS appears first on the RHS of the production.

Recursive descent parsers cannot deal with left recursion.

However, we can rewrite the grammar to represent the same language without the need for left recursion.

Removing Left Recursion

In general, we can eliminate all immediate left recursion:
A → A x | y

By changing the grammar to:
A → y A’
A’ → x A’ | ε

Not all left recursion is immediate; it may be hidden across multiple production rules:
A → BC | D
B → AE | F

There is a general approach for removing indirect left recursion, but we’ll not worry about it for this course.
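
The rewrite for immediate left recursion is mechanical enough to code directly. This is a minimal Python sketch, assuming a representation that is not in the slides: each alternative is a list of symbols, the empty list stands for ε, and the new non-terminal is named by appending an apostrophe. The function name is hypothetical, and indirect left recursion is not handled.

```python
def remove_immediate_left_recursion(nonterminal, alternatives):
    """Rewrite A -> A x | y as A -> y A', A' -> x A' | ε."""
    new_nt = nonterminal + "'"
    # Tails x of the left-recursive alternatives A x
    recursive = [alt[1:] for alt in alternatives if alt and alt[0] == nonterminal]
    # Alternatives y that do not start with A
    others = [alt for alt in alternatives if not alt or alt[0] != nonterminal]
    if not recursive:
        return {nonterminal: alternatives}      # nothing to rewrite
    return {
        nonterminal: [alt + [new_nt] for alt in others],
        new_nt: [tail + [new_nt] for tail in recursive] + [[]],
    }

print(remove_immediate_left_recursion("A", [["A", "x"], ["y"]]))
print(remove_immediate_left_recursion("E", [["int"], ["(", "E", ")"], ["E", "+", "E"]]))
```

Applied to E → int | ( E ) | E + E, this gives E → int E' | ( E ) E' and E' → + E E' | ε, which a recursive descent procedure can follow without calling itself before consuming input.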


Recursive Descent Summary

Recursive descent is a simple and general parsing strategy
• Left-recursion must be eliminated first
• But this can be done automatically

It is not popular because of its inefficiency:

• Backtracking re-parses the string
• Undoing semantic actions (actions taken upon matching a production, much like the actions from our lexer) may be difficult!

Techniques used in practice do no backtracking, at the cost of restricting the class of grammars.
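
A parser that never backtracks can be sketched for a restricted grammar. The grammar used here, E → P E', E' → + P E' | ε, P → int | ( E ), is an assumption chosen for illustration (a left-recursion-free variant of the earlier example), and the function and helper names are hypothetical. One token of lookahead is enough to pick each rule.

```python
def accepts(tokens):
    """Predictive recognizer for E -> P E', E' -> + P E' | ε, P -> int | ( E )."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else "$"    # "$" marks end of input

    def eat(expected):
        nonlocal pos
        if peek() != expected:
            raise SyntaxError(f"expected {expected!r}, found {peek()!r}")
        pos += 1

    def parse_E():
        parse_P()
        parse_E_rest()

    def parse_E_rest():
        if peek() == "+":           # lookahead "+" selects E' -> + P E'
            eat("+")
            parse_P()
            parse_E_rest()
        # any other lookahead selects E' -> ε, which consumes nothing

    def parse_P():
        if peek() == "int":         # lookahead "int" selects P -> int
            eat("int")
        elif peek() == "(":         # lookahead "(" selects P -> ( E )
            eat("(")
            parse_E()
            eat(")")
        else:
            raise SyntaxError(f"unexpected {peek()!r}")

    try:
        parse_E()
        return peek() == "$"
    except SyntaxError:
        return False

print(accepts("int + ( int + int )".split()))
```

Every token is examined once and no choice is ever undone, which is the efficiency gain bought by restricting the grammar.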
