0% found this document useful (0 votes)

14 views21 pages

3 parser-Intro-L5

This document discusses syntax analysis and parsing. It begins by listing the functions of a parser as testing for membership in a language and generating parse trees. It then discusses context-free grammars and how they are used to precisely define language syntax. The document covers topics such as derivation, parse trees, ambiguity, and the two main types of parsing: top-down and bottom-up. It provides examples to illustrate concepts like derivation and parsing strategies. The key takeaway is that parsing checks if tokens conform to language syntax by analyzing them based on the grammar rules.

Uploaded by

PRANJAL SHARMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views21 pages

3 parser-Intro-L5

Uploaded by

PRANJAL SHARMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

BITS Pilani

BITS Pilani Prof.Aruna Malapati

Hyderabad Campus Department of CSIS
BITS Pilani
Hyderabad Campus

Syntax Analysis / Parser

Today’s Learning Objectives

• List the functions of a Parser

• Perform grammar transformations

BITS Pilani, Hyderabad Campus

Parser

BITS Pilani, Hyderabad Campus

Where Are We?

Source code: if (b==0) a = “Hi”;

Lexical Analysis

Token Stream: if ( b == 0)a = “Hi”;

Syntactic Analysis
Abstract Syntax Tree
if
(AST)
Semantic Analysis
== = ;

b 0 a “Hi”
Do tokens conform to the language syntax?
BITS Pilani, Hyderabad Campus
Functions of Syntax Analysis

• Test for membership of w belongs to L(G).

• Additional functionalities

• Generate parse trees

• Handle errors if the string does not belong to the

language.

• Form of the grammar is not important

• Many grammars generate the same language.

BITS Pilani, Hyderabad Campus

What syntax analysis cannot
do?
• To check whether variables are of types on which
operations are allowed.

• To check whether a variable has been declared before

use.

• To check whether a variable has been initialized.

• These issues will be handled in semantic analysis.

BITS Pilani, Hyderabad Campus

Limitations of regular
languages
• How to describe language syntax precisely and
conveniently. Can regular expressions be used?

• Many languages are not regular for example string of

balanced parentheses.
• ((((.))))

• {(i)i|i=0}

• There is no regular expression for this language.

Syntax definition - Context free grammars

BITS Pilani, Hyderabad Campus

Syntax definition

• Context free grammars is a four tuple <V,T,P,S>

– a finite set of non terminal symbols / Variables (V)
– A finite set of terminals (T) V ∩ T = Φ
– a finite set of productions of the form A -> w where A € V and w € ( V U T)*
– a start symbol / Variable (S) S € V

• A parser derives strings by beginning with start symbol and

repeatedly replacing a non terminal by the right hand side of
a production for that non terminal.

• The strings that can be derived from the start symbol of a

grammar G from the language L(G) defined by the grammar.

BITS Pilani, Hyderabad Campus

Derivation

list -> list + digit | list - digit | digit

digit -> 0 | 1 | . | 9

Consists of the language which is a list of digit separated

by + or -.
Therefore, the string 9-5+2 belongs to the language
specified by the grammar.
list -> list + digit
-> list - digit + digit
-> digit - digit + digit The name context free comes from the fact
-> 9 - digit + digit
that use of a production X does not depend
on the context of X.
-> 9 - 5 + digit
-> 9 - 5 + 2

BITS Pilani, Hyderabad Campus

Derivation

• If in a sentential form only the leftmost non terminal is

replaced then it becomes leftmost derivation.

• Every leftmost step can we written as wAγ ->lm* wδγ,

where w is a string of terminals and A-> δ is a
production.

• Similarly rightmost derivation can also be defined

accordingly when the rightmost non terminal is replaced.

BITS Pilani, Hyderabad Campus

Parse tree

• It shows how the start symbol of a grammar derives a

string in the language.
• Root is labeled by the start symbol.
• Leaf nodes are labeled by tokens.

• Each internal node is labeled by a non terminal.

• if A is a non-terminal labeling an internal node and x 1 , x
2 , .x n are labels of children of that node then A à x 1 x 2
. x n is a production.

BITS Pilani, Hyderabad Campus

Ambiguity
• A Grammar can have more than one parse tree for a
string.

• Consider grammar
string -> string + string | string - string | 0 | 1 | . | 9

• String 9-5+2 has two parse trees

BITS Pilani, Hyderabad Campus

Ambiguity

• Ambiguity is problematic because meaning of the

programs can be incorrect.

• Ambiguity can be handled in several ways.

– Enforce associativity and precedence.

– Rewrite the grammar (cleanest way).

• There are no general techniques for handling ambiguity.

• It is impossible to convert automatically an ambiguous

grammar to an unambiguous one.

BITS Pilani, Hyderabad Campus

Example

• String of balanced parentheses

S -> ( S ) S | ε

• For example, consider the string: (( )). It can be derived

as:

S -> (S) S Replacing inner S with (S)S

S -> ((S)S) S Replacing all S with Empty
S -> (())

BITS Pilani, Hyderabad Campus

Parsing

• Process of determination whether a string can be

generated by a grammar.

• Parsing falls in two categories:

– Top-down parsing:

• Construction of the parse tree starts at the root (from the start symbol) and
proceeds towards leaves (token or terminals)

– Bottom-up parsing:

• Constructions of the parse tree starts from the leaf nodes (tokens or terminals of
the grammar) and proceeds towards root (start symbol) 1.

BITS Pilani, Hyderabad Campus

The Parsing Problem

• Two categories of parsers

– Top down - Construction of the parse tree starts at the root (from the start
symbol) and proceeds towards leaves (token or terminals) Order is that of a
leftmost derivation.

• Does a preorder traversal of tree

– node then branches

– Branches followed left-to-right

BITS Pilani, Hyderabad Campus

Top-down parsing

S –> AB
A –> aA | ε S
B –> b | bB
A B

Here is a top-down parse of aaab. a A b

S
a A
AB S –> AB
aAB A –> aA
a A
aaAB A –> aA
aaaAB A –> aA
aaaεB A –> ε ϵ
aaab B –> b

BITS Pilani, Hyderabad Campus

The Parsing Problem

• Bottom up - Construction of the parse tree starts from

the leaf nodes (tokens or terminals of the grammar) and
proceeds towards root (start symbol)

• Order is that of the reverse of a rightmost

derivation

• Parsers look only one token ahead in the input

BITS Pilani, Hyderabad Campus

Bottom - Up parsing
E→T+E|T
T → int * T | int | (E)

Consider the string: int * int + int

Bottom-up parsing reduces a string to the start symbol
by inverting productions:

BITS Pilani, Hyderabad Campus

Take home message
• Parsing checks if tokens conform to the language
syntax?

• Often generates parse trees or an error if the input string

does not confirm.

• Grammar Transformations help in improving

performance of parser.

• Top down parsers cannot handle left recursive grammar

and grammars with common prefixes.

BITS Pilani, Hyderabad Campus

NP 286 (1) United Kingdom and Europe PDF
No ratings yet
NP 286 (1) United Kingdom and Europe PDF
444 pages
Candlesticks Report A Guide To Candlesticks
100% (2)
Candlesticks Report A Guide To Candlesticks
19 pages
CEO Database
100% (2)
CEO Database
176 pages
Cage Trim Valves
100% (1)
Cage Trim Valves
57 pages
Chapter 10 Strategy Implementation Organizing and Structure
100% (1)
Chapter 10 Strategy Implementation Organizing and Structure
28 pages
Car Safety Comprehension
100% (1)
Car Safety Comprehension
9 pages
IS15477 - 2019 Tile Adhesive
No ratings yet
IS15477 - 2019 Tile Adhesive
21 pages
Wave On A String
100% (1)
Wave On A String
25 pages
Design Proposal of An Automatic Smart MultiInsect Mosquito Killing System IEEE
No ratings yet
Design Proposal of An Automatic Smart MultiInsect Mosquito Killing System IEEE
6 pages
"The Electoral Reforms Law of 1987" Sec. 27. Election Offenses. - in Addition To The Prohibited Acts and Election Offenses Enumerated in
100% (1)
"The Electoral Reforms Law of 1987" Sec. 27. Election Offenses. - in Addition To The Prohibited Acts and Election Offenses Enumerated in
24 pages
Aero Seal
No ratings yet
Aero Seal
14 pages
Part 1.2
100% (1)
Part 1.2
88 pages
Amplifier Build and Design: Faculty of Engineering and Applied Science
No ratings yet
Amplifier Build and Design: Faculty of Engineering and Applied Science
21 pages
Bob Hunt Sheeting Wing
100% (3)
Bob Hunt Sheeting Wing
36 pages
B.tech Eeee Syllabus
No ratings yet
B.tech Eeee Syllabus
12 pages
Presumption of Constitutionality
No ratings yet
Presumption of Constitutionality
17 pages
Package Desire': R Topics Documented
No ratings yet
Package Desire': R Topics Documented
22 pages
INSPI - Yaoure-ESIA-Appendix-34-Cultural-Heritage-Management-Plan
100% (1)
INSPI - Yaoure-ESIA-Appendix-34-Cultural-Heritage-Management-Plan
7 pages
Open The Dor
No ratings yet
Open The Dor
9 pages
Instructables Com FAN Repair
No ratings yet
Instructables Com FAN Repair
9 pages
07820100024353
No ratings yet
07820100024353
20 pages
Zenit Mataplast P.Ltd. vs. State of Maharashtra & Ors PDF
No ratings yet
Zenit Mataplast P.Ltd. vs. State of Maharashtra & Ors PDF
3 pages
G9 DLL Q1 Week4
No ratings yet
G9 DLL Q1 Week4
3 pages
Department of Management Presentation
No ratings yet
Department of Management Presentation
84 pages
Shareholders & Stakehoders
No ratings yet
Shareholders & Stakehoders
9 pages
Charles Oman
No ratings yet
Charles Oman
49 pages
Translation Certification: Form H-1
No ratings yet
Translation Certification: Form H-1
2 pages
Finance Services.3
No ratings yet
Finance Services.3
10 pages
MINERALS
No ratings yet
MINERALS
4 pages
Page 5 A&A May 5, 2025 - Barclay Page 5
No ratings yet
Page 5 A&A May 5, 2025 - Barclay Page 5
1 page
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (643)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1856)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4103)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1175)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1139)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (836)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (629)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2885)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)

3 parser-Intro-L5

Uploaded by

3 parser-Intro-L5

Uploaded by

BITS Pilani

BITS Pilani Prof.Aruna Malapati

Syntax Analysis / Parser

• List the functions of a Parser

• Perform grammar transformations

BITS Pilani, Hyderabad Campus

BITS Pilani, Hyderabad Campus

Source code: if (b==0) a = “Hi”;

Token Stream: if ( b == 0)a = “Hi”;

• Test for membership of w belongs to L(G).

• Generate parse trees

• Handle errors if the string does not belong to the

• Form of the grammar is not important

• Many grammars generate the same language.

BITS Pilani, Hyderabad Campus

• To check whether a variable has been declared before

• To check whether a variable has been initialized.

• These issues will be handled in semantic analysis.

BITS Pilani, Hyderabad Campus

• Many languages are not regular for example string of

• There is no regular expression for this language.

Syntax definition - Context free grammars

BITS Pilani, Hyderabad Campus

• Context free grammars is a four tuple <V,T,P,S>

• A parser derives strings by beginning with start symbol and

• The strings that can be derived from the start symbol of a

BITS Pilani, Hyderabad Campus

list -> list + digit | list - digit | digit

Consists of the language which is a list of digit separated

BITS Pilani, Hyderabad Campus

• If in a sentential form only the leftmost non terminal is

• Every leftmost step can we written as wAγ ->lm* wδγ,

• Similarly rightmost derivation can also be defined

BITS Pilani, Hyderabad Campus

• It shows how the start symbol of a grammar derives a

• Each internal node is labeled by a non terminal.

BITS Pilani, Hyderabad Campus

• String 9-5+2 has two parse trees

BITS Pilani, Hyderabad Campus

• Ambiguity is problematic because meaning of the

• Ambiguity can be handled in several ways.

– Rewrite the grammar (cleanest way).

• There are no general techniques for handling ambiguity.

• It is impossible to convert automatically an ambiguous

BITS Pilani, Hyderabad Campus

• String of balanced parentheses

• For example, consider the string: (( )). It can be derived

S -> (S) S Replacing inner S with (S)S

BITS Pilani, Hyderabad Campus

• Process of determination whether a string can be

• Parsing falls in two categories:

BITS Pilani, Hyderabad Campus

• Two categories of parsers

• Does a preorder traversal of tree

– Branches followed left-to-right

BITS Pilani, Hyderabad Campus

Here is a top-down parse of aaab. a A b

BITS Pilani, Hyderabad Campus

• Bottom up - Construction of the parse tree starts from

• Order is that of the reverse of a rightmost

• Parsers look only one token ahead in the input

BITS Pilani, Hyderabad Campus

Consider the string: int * int + int

BITS Pilani, Hyderabad Campus

• Often generates parse trees or an error if the input string

• Grammar Transformations help in improving

• Top down parsers cannot handle left recursive grammar

BITS Pilani, Hyderabad Campus

You might also like