Parser Lec1

Syntax Analysis

Contents

• Syntax Analysis
• Introduction
• The Role of the Parser
• Representative Grammars
• Context-Free Grammars
• Formal Definition of a CFG
• Notational Conventions
• Derivations

Syntax Analysis
• Grammars offer significant benefits for both language designers and compiler writers.

• A grammar gives a precise syntactic specification of a programming language.

• From certain classes of grammars, we can automatically construct an efficient parser that determines the syntactic structure of a source program.

• The structure imparted to a language by a properly designed grammar is useful for translating source programs into correct object code and for detecting errors.
Role of the Parser
• In the compiler model, the parser obtains a string of tokens from the lexical analyzer and verifies that the string of token names can be generated by the grammar for the source language. A minimal sketch of this hand-off is shown below.
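As a minimal sketch of the lexer-to-parser interface just described (not from the slides; the Token type and the token names ID, PLUS, TIMES are illustrative assumptions), the parser can be viewed as consuming a stream of token names produced on demand by the lexical analyzer:

```python
# Minimal sketch of the lexer-to-parser interface described above.
# The Token type and the token names (ID, PLUS, TIMES) are illustrative
# assumptions, not part of the lecture.
from collections import namedtuple

Token = namedtuple("Token", ["name", "lexeme"])

def tokenize(source):
    """A toy lexical analyzer: yields tokens for id, + and *."""
    for lexeme in source.split():
        if lexeme == "+":
            yield Token("PLUS", lexeme)
        elif lexeme == "*":
            yield Token("TIMES", lexeme)
        elif lexeme.isidentifier():
            yield Token("ID", lexeme)
        else:
            raise SyntaxError(f"unexpected lexeme {lexeme!r}")

# The parser works on the resulting string of token names:
print([t.name for t in tokenize("x + y * z")])
# ['ID', 'PLUS', 'ID', 'TIMES', 'ID']
```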

Role of the Parser..
• There are three general types of parsers for grammars: universal, top-down, and bottom-up.

• Universal parsing methods such as the Cocke-Younger-Kasami algorithm and Earley's algorithm can parse any grammar. These general methods are, however, too inefficient to use in production compilers.

• The methods commonly used in compilers are either top-down or bottom-up.

• Top-down methods build parse trees from the top (root) to the bottom (leaves), while bottom-up methods start from the leaves and work their way up to the root.
Representative Grammars
• Expressions with + and *

E → E + T | T
T → T * F | F
F → ( E ) | id

• This takes care of precedence but, as we saw before, it gives us trouble for top-down parsing because it is left-recursive.

• So we use the following non-left-recursive grammar, which generates the same language (a recursive-descent sketch of it follows the grammar).
Representative Grammars
E → T E'
E' → + T E' | ε
T → F T'
T' → * F T' | ε
F → ( E ) | id
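To make the connection to top-down parsing concrete, here is a minimal recursive-descent sketch for this grammar (not part of the slides); it assumes the input has already been turned into a list of token names such as "id", "+", "*", "(" and ")".

```python
# Minimal recursive-descent sketch for the non-left-recursive grammar above.
# Token handling is an illustrative assumption: input is a list of token
# names such as ["id", "+", "id", "*", "id"].
class Parser:
    def __init__(self, tokens):
        self.tokens = tokens
        self.pos = 0

    def lookahead(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def match(self, expected):
        if self.lookahead() != expected:
            raise SyntaxError(f"expected {expected}, got {self.lookahead()}")
        self.pos += 1

    def E(self):            # E  -> T E'
        self.T()
        self.E_prime()

    def E_prime(self):      # E' -> + T E' | epsilon
        if self.lookahead() == "+":
            self.match("+")
            self.T()
            self.E_prime()
        # epsilon: otherwise do nothing

    def T(self):            # T  -> F T'
        self.F()
        self.T_prime()

    def T_prime(self):      # T' -> * F T' | epsilon
        if self.lookahead() == "*":
            self.match("*")
            self.F()
            self.T_prime()

    def F(self):            # F  -> ( E ) | id
        if self.lookahead() == "(":
            self.match("(")
            self.E()
            self.match(")")
        else:
            self.match("id")

# Accepts id + id * id without error:
Parser(["id", "+", "id", "*", "id"]).E()
```

Had we coded the left-recursive production E → E + T directly, E() would call itself before consuming any input and recurse forever; eliminating left recursion is what makes this top-down style work.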
• The following ambiguous grammar will also be used for illustration, but in general we try to avoid ambiguity.

E → E + E | E * E | ( E ) | id

• This grammar does not enforce precedence, and it does not specify left versus right associativity. For example, id + id + id and id * id + id each have two parse trees.
CFG
• Grammars are used to systematically describe the syntax of programming-language constructs like expressions and statements.

stmt → if ( expr ) stmt else stmt

• A syntactic variable stmt is used to denote statements, and the variable expr to denote expressions.

• Other productions then define precisely what an expr is and what else a stmt can be.

• A language generated by a (context-free) grammar is called a context-free language.
CFG Definition
• A context-free grammar (or simply a grammar) consists of terminals, non-terminals, a start symbol, and productions.

• Terminals:
• The basic symbols produced by the lexer.
• They are sometimes called token names, i.e., the first component of the token as produced by the lexer.

• Non-terminals:
• Syntactic variables that denote sets of strings.
• The sets of strings denoted by non-terminals help define the language generated by the grammar.
CFG Definition..
• Start Symbol:
• A non-terminal that forms the root of the parse tree.
• Conventionally, the productions for the start symbol are listed first.

• Productions:
• The productions of a grammar specify the manner in which the terminals and non-terminals can be combined to form strings.

• Each production consists of:

1. A non-terminal called the head or left side of the production; the production defines some of the strings denoted by the head.
CFG Definition..
2. The symbol →. Sometimes ::= has been used in place of the arrow.

3. A body or right side consisting of zero or more terminals and non-terminals.

The components of the body describe one way in which strings of the non-terminal at the head can be constructed.
CFG Definition..
• Example grammar:

expression → expression + term | expression - term | term
term → term * factor | term / factor | factor
factor → ( expression ) | id

• Terminals: id + - * / ( )
• Non-terminals: expression, term, factor
• Start symbol: expression
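As one possible sketch (the encoding below is an assumption, not from the slides), the four components of this grammar can be written down directly as data, with each production body stored as a tuple of grammar symbols:

```python
# One possible encoding of the example grammar as its four components.
# Each production body is a tuple of grammar symbols.
grammar = {
    "terminals":    {"id", "+", "-", "*", "/", "(", ")"},
    "nonterminals": {"expression", "term", "factor"},
    "start":        "expression",
    "productions": {
        "expression": [("expression", "+", "term"),
                       ("expression", "-", "term"),
                       ("term",)],
        "term":       [("term", "*", "factor"),
                       ("term", "/", "factor"),
                       ("factor",)],
        "factor":     [("(", "expression", ")"),
                       ("id",)],
    },
}
```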
Notational Conventions
• Notational conventions for grammars:

• These symbols are terminals:

(a) Lowercase letters early in the alphabet, such as a, b, c.
(b) Operator symbols such as +, *, and so on.
(c) Punctuation symbols such as parentheses, comma, and so on.
(d) The digits 0, 1, ..., 9.
(e) Boldface strings such as id or if, each of which represents a single terminal symbol.

Notational Conventions..

• These symbols are non-terminals:

(a) Uppercase letters early in the alphabet, such as A, B, C.
(b) The letter S, which, when it appears, is usually the start symbol.
(c) Lowercase, italic names such as expr or stmt.
(d) When discussing programming constructs, uppercase letters
may be used to represent non-terminals for the constructs.
For example, non-terminals for expressions, terms, and factors
are often represented by E, T, and F, respectively.

Notational Conventions…
• Uppercase letters late in the alphabet, such as X, Y, Z, represent grammar symbols, that is, either non-terminals or terminals.

• Lowercase letters late in the alphabet, chiefly u, v, ..., z, represent (possibly empty) strings of terminals.

• Lowercase Greek letters, such as α, β, γ, represent (possibly empty) strings of grammar symbols.

• A set of productions A → α1, A → α2, ..., A → αk with a common head A (call them A-productions) may be written as

A → α1 | α2 | ... | αk
Notational Conventions…

• The grammar we defined earlier, written using these conventions:

E → E + T | E - T | T
T → T * F | T / F | F
F → ( E ) | id
Derivations
• Assume we have a production A → α.
• We would then say that A derives α and write A ⇒ α.

• We generalize this: if, in addition, β and γ are strings, we say that βAγ derives βαγ and write

βAγ ⇒ βαγ

• We generalize further: if α derives β and β derives γ, we say α derives γ and write

α ⇒* γ

• Here ⇒* means derives in zero or more steps.
Derivations
• Formal definition of derives in zero or more steps (⇒*):
1. α ⇒* α, for any string α.
2. If α ⇒* β and β ⇒ γ, then α ⇒* γ.

• If S is the start symbol and S ⇒* α, we say α is a sentential form of the grammar.

• A sentential form may contain both non-terminals and terminals.

• If it contains only terminals, it is a sentence of the grammar, and the language generated by a grammar G, written L(G), is the set of its sentences.

• Two grammars that generate the same language are called equivalent.
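The difference between a sentential form and a sentence can be illustrated with a tiny check (an illustrative sketch, not from the slides; it assumes a set of non-terminal names like the encoding used earlier):

```python
# Sketch: a sentential form is a sentence exactly when it contains no non-terminals.
NONTERMINALS = {"E"}

def is_sentence(sentential_form):
    """True if the sentential form consists of terminals only."""
    return all(symbol not in NONTERMINALS for symbol in sentential_form)

print(is_sentence(["id", "+", "E"]))   # False: a sentential form with a non-terminal
print(is_sentence(["id", "+", "id"]))  # True: a sentence of the grammar
```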
Derivations
• Ex: E → E + E | E * E | ( E ) | id

• We see that id + id is a sentence. Indeed, it can be derived in two ways from the start symbol E:

E ⇒ E + E ⇒ id + E ⇒ id + id
E ⇒ E + E ⇒ E + id ⇒ id + id

• In the first derivation, we replaced the leftmost non-terminal by the body of a production having that non-terminal as its head. This is called a leftmost derivation.
• Similarly, the second derivation, in which the rightmost non-terminal is replaced at each step, is called a rightmost derivation or a canonical derivation.
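As a small illustrative sketch (not from the slides), the leftmost-derivation idea can be mechanized by repeatedly replacing the leftmost non-terminal with a chosen production body; the particular sequence of bodies below is hand-picked just to reproduce the derivation of id + id, since choosing them automatically is exactly the parser's job.

```python
# Sketch: apply a leftmost derivation for E -> E + E | E * E | ( E ) | id.
# The sequence of production bodies to use is supplied by hand.
NONTERMINALS = {"E"}

def leftmost_step(sentential_form, body):
    """Replace the leftmost non-terminal in the sentential form with `body`."""
    for i, symbol in enumerate(sentential_form):
        if symbol in NONTERMINALS:
            return sentential_form[:i] + body + sentential_form[i + 1:]
    raise ValueError("no non-terminal left: this is already a sentence")

form = ["E"]
for body in (["E", "+", "E"], ["id"], ["id"]):
    form = leftmost_step(form, body)
    print(" ".join(form))
# Prints:
#   E + E
#   id + E
#   id + id
```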

Thank You
