slides08-lr-parsing

The document discusses Bottom-Up LR Parsing, contrasting it with Top-Down parsing methods. It explains the mechanics of LR parsing, including the use of LR items and the handling of shift-reduce conflicts, particularly in the context of if-then-else statements. Additionally, it touches on error reporting and recovery strategies, as well as other parsing tools like GLR and PEG parsers.

Bottom-Up LR Parsing

17-363/17-663: Programming Language Pragmatics

Reading: PLP section 2.3

Copyright © 2016 Elsevier


Prof. Jonathan Aldrich
Top-Down vs. Bottom-Up Parsing

• Top-Down/LL Parsing Intuition

  program              Start trying to parse a program
  stmt_list $$         Based on lookahead, refine to stmt_list,
  stmt stmt_list $$    then to stmt stmt_list
  ...                  Stack tracks predicted future parsing

• Bottom-Up/LR Parsing Intuition

  read A               Start by shifting a few tokens
  stmt                 Reduce tokens to a stmt, then to a stmt_list
  stmt_list            Continue to shift and reduce tokens
  stmt_list read B     to recognize another stmt
  stmt_list stmt       Stack shows what constructs have been recognized so far
Example Program and SLR(1) Grammar

read A
read B
sum := A + B
write sum
write sum / 2
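
The grammar itself appears as a figure on this slide. For reference, here is a reconstruction based on PLP's bottom-up calculator-language grammar (section 2.3); treat it as an approximation of the slide, not a copy:

program → stmt_list $$
stmt_list → stmt_list stmt | stmt
stmt → id := expr | read id | write expr
expr → term | expr add_op term
term → factor | term mult_op factor
factor → ( expr ) | id | number
add_op → + | -
mult_op → * | /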
Modeling a Parse with LR Items

• Initial parse state captured by an item

– includes start symbol, production, and current location

• What we see next might be inside stmt_list


– So we expand stmt_list and get a set of items:
Modeling a Parse with LR Items

• We can likewise expand stmt to get the item set:

• This is an SLR parser state


– We’ll call it state 0
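
The item set itself is a figure on the slide; assuming the calculator grammar reconstructed above, state 0 would contain roughly:

program → . stmt_list $$
stmt_list → . stmt_list stmt
stmt_list → . stmt
stmt → . id := expr
stmt → . read id
stmt → . write expr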
Modeling a Parse with LR Items

• Our starting stack has state 0 on it:


0
• Input: read A read B …

• From state 0, we shift read onto the stack and
  move to state 1:
0 read 1

• State 1 represents the following item:
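
The item is shown as a figure on the slide; assuming the grammar above, it would be stmt → read . id, with the dot marking how far into the production we have shifted.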


Modeling a Parse with LR Items

• stack / item: 0 read 1


• input: A read B …

• From state 1, we shift id onto the stack


• stack / item: 0 read 1 id 1’
• input: read B …

• Now we reduce to stmt, and put stmt into the input


• stack / item: 0
• input: stmt read B …
Modeling a Parse with LR Items

• stack / item: 0
• input: stmt read B …

• We now shift stmt


• stack / item: 0 stmt 0’
• input: read B …

• Next we reduce to stmt_list


• stack / item: 0
• input: stmt_list read B …
Modeling a Parse with LR Items

• stack / item: 0
• input: stmt_list read B …

• Now we shift stmt_list


• stack / item: 0 stmt_list 2
• input: read B …
The Characteristic Finite State Machine (CFSM)

There are also shift-reduce actions, so our states 0’ and 1’ aren’t shown
here: they are “in between” states within a shift-reduce action
The CFSM as a Table
A Detailed Explanation of the CFSM
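
The table and its explanation are diagrams on the slides. To make the mechanics concrete, here is a minimal Python sketch of the table-driven loop an LR parser runs; the action/goto tables and production encoding are placeholders standing in for the CFSM, not the actual tables from the slides.

def lr_parse(tokens, action, goto_table, productions):
    """Minimal LR driver sketch (illustrative only).

    action[(state, terminal)] is ('shift', next_state), ('reduce', prod_index),
    or ('accept',); goto_table[(state, nonterminal)] gives the state to enter
    after a reduce; productions[prod_index] is (lhs_nonterminal, rhs_length).
    """
    stack = [0]                        # stack of CFSM states; 0 is the start state
    tokens = list(tokens) + ["$$"]     # append the end-of-input marker
    i = 0
    while True:
        state, tok = stack[-1], tokens[i]
        act = action.get((state, tok))
        if act is None:
            raise SyntaxError(f"unexpected {tok!r} in state {state}")
        if act[0] == "shift":
            stack.append(act[1])       # consume the token and enter the new state
            i += 1
        elif act[0] == "reduce":
            lhs, rhs_len = productions[act[1]]
            del stack[len(stack) - rhs_len:]            # pop one state per RHS symbol
            stack.append(goto_table[(stack[-1], lhs)])  # then follow the GOTO edge
        else:                          # ('accept',)
            return True

The "shift-reduce" actions mentioned above correspond to a shift immediately followed by a reduce collapsed into a single table entry, which is why the in-between states 0’ and 1’ never appear explicitly.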
Exercise: LR Parsing

• Assume you are in parsing state 0
  and the token stream is write sum / 2
• Show how the parse stack changes as the token
stream is consumed
• We’ll do the first action together
Parsing if-then-else Statements

• A famous parsing challenge (from Algol) involves if-then-else,
  where the else is optional:

  stmt ::= if exp then stmt
         | if exp then stmt else stmt

• Consider the phrase:

if exp then if exp then stmt else stmt

• Which then does the else belong to?


Shift/Reduce Conflicts

• This is a shift-reduce conflict

  if exp then if exp then stmt . else stmt

• When the else appears
  • we can shift, treating it as part of the inner if statement, or
  • we can reduce the inner if statement,
    treating the else as part of the outer if statement
• How to solve?
  – Many existing tools prioritize shift over reduce
    • This corresponds to the traditional solution to the if problem
  – You can declare productions with precedence
    • E.g. giving the if-then-else production higher precedence
      than the if-then production
  – Rewrite the grammar to make it LR(1)
An LR(0) If-Then-Else Grammar
stmt → balanced_stmt | unbalanced_stmt
balanced_stmt → if cond then balanced_stmt else balanced_stmt
              | other_stuff
unbalanced_stmt → if cond then stmt
                | if cond then balanced_stmt else unbalanced_stmt

Invariant: balanced_stmts may appear inside unbalanced_stmts
– but not vice versa
Unfortunately this grammar is LR(0) but not LL(1)
– Have to use precedence in LL parsers,
  or custom code in a recursive-descent parser (sketched below)
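
As an illustration of that custom code, here is a sketch (in Python, with invented token and tree shapes) of a hand-written recursive-descent parser that attaches each else to the nearest unmatched then, which is exactly what shifting instead of reducing does:

def parse_stmt(toks):
    # toks is a mutable list of tokens; conditions and "other" statements
    # are simplified to single tokens for this sketch.
    if toks and toks[0] == "if":
        toks.pop(0)                       # consume 'if'
        cond = toks.pop(0)
        assert toks.pop(0) == "then"
        then_branch = parse_stmt(toks)
        else_branch = None
        if toks and toks[0] == "else":    # greedily claim the else here,
            toks.pop(0)                   # i.e. for the innermost open if
            else_branch = parse_stmt(toks)
        return ("if", cond, then_branch, else_branch)
    return ("other", toks.pop(0))

For example, parse_stmt(["if", "c1", "then", "if", "c2", "then", "s1", "else", "s2"]) returns ('if', 'c1', ('if', 'c2', ('other', 's1'), ('other', 's2')), None): the else ends up attached to the inner if.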
Connections to Theory
• A scanner is a Deterministic Finite Automaton (DFA)
– it can be specified with a state diagram

• An LL or LR parser is a Pushdown Automaton (PDA)


– a PDA can be specified with a state diagram and a stack
• the state diagram looks just like a DFA state diagram, except the arcs
are labeled with <input symbol, top-of-stack symbol> pairs, and in
addition to moving to a new state the PDA has the option of pushing
or popping a finite number of symbols onto/off the stack
• For LL(1) parsers the state machine has only two states:
processing and accepted
• All the action is in the input symbol and top of stack
• LR(1) parsers are richer (and more expressive)
Error Reporting
• Error reporting is relatively simple
• If the next token has no entry for the current parsing state (LR)
  or top-of-stack element (LL), signal an error
• You can tell the user which tokens would be OK here (see the sketch below)
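
With the driver sketched earlier, that "which tokens would be OK" list can be read straight off the action table; the helper below is hypothetical, not part of any particular tool.

def expected_tokens(action, state):
    # Every terminal with an ACTION entry in this state is acceptable here.
    return sorted(tok for (s, tok) in action if s == state)

# In the driver's error branch, one might then report:
#   raise SyntaxError(f"unexpected {tok!r} in state {state}; "
#                     f"expected one of {expected_tokens(action, state)}")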
Error Recovery
• Nice to report more than one error to the user
• Rather than stopping after the first one
• Simple idea: Panic mode
• In C-like languages, semicolons are good recovery spots
• So on an error:
• read tokens until you get to a semicolon
• discard the parser’s stack (predictions in an LL parser, states in an LR
parser) until you come to a production that has a semicolon
• assume you’ve parsed the semicolon-containing construct,
and continue parsing
• There are ways to do substantially better – see the online
supplement to the textbook
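
A minimal sketch of the panic-mode idea above, phrased against the LR driver sketched earlier (the helper and its name are hypothetical; real generators such as yacc/bison expose an error token for this instead):

def panic_recover(stack, tokens, i, goto_table):
    # 1. Discard input up to and including the next semicolon.
    while i < len(tokens) and tokens[i] != ";":
        i += 1
    if i < len(tokens):
        i += 1                            # skip the semicolon itself
    # 2. Pop states until one of them can continue after a completed stmt.
    while stack and (stack[-1], "stmt") not in goto_table:
        stack.pop()
    if not stack:
        raise SyntaxError("unable to recover from syntax error")
    # 3. Pretend the erroneous text was a complete stmt and resume parsing.
    stack.append(goto_table[(stack[-1], "stmt")])
    return i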
Other Parsing Tools
• Generalized LR (GLR) parser generators
• Accept any grammar – even ambiguous ones!
• This can be good if you have grammars written by nonexperts, as in
SASyLF
• But for a compiler-writer it is dangerous—you may not even know
your grammar is ambiguous, and then your poor users get ambiguity
errors when the parser runs
• Works like an LR parser, but on ambiguity considers all
possible parses in parallel
• Still O(n) if the grammar is LR (or “close”)
Other Parsing Tools
• Parsing Expression Grammar (PEG) parser generators
• Sidestep ambiguity by always favoring the first production
• Same danger as GLR parsers – you may not know your
grammar is ambiguous
• Still used in practice (e.g. Python's own parser since version 3.9)
• About as efficient as LL or LR in practice
• Like LR, PEG grammars can be cleaner than LL grammars
• Requires extreme care to get right – must think algorithmically
instead of declaratively
• Guido van Rossum, the developer of Python, saw this as an advantage
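
The "always favor the first production" rule is PEG's ordered choice. Here is a minimal sketch of its semantics as a Python parser combinator (the combinator name and interface are invented, not taken from any particular PEG library):

def ordered_choice(*alternatives):
    """Try each alternative parser in order and commit to the first success.
    Later alternatives are never reconsidered once one succeeds, so the
    grammar can never be ambiguous -- but rule order silently matters."""
    def parse(tokens, pos):
        for alt in alternatives:
            result = alt(tokens, pos)      # each alt returns (value, new_pos) or None
            if result is not None:
                return result              # commit to this alternative
        return None
    return parse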
