Compiler Design Syntax Analysis
Syntax analysis, or parsing, is the second phase of a compiler. In this chapter, we shall learn
the basic concepts used in the construction of a parser.
We have seen that a lexical analyzer can identify tokens with the help of regular expressions
and pattern rules. But a lexical analyzer cannot check the syntax of a given sentence due to
the limitations of regular expressions. Regular expressions cannot check balanced
tokens, such as parentheses. Therefore, this phase uses context-free grammar (CFG), which
is recognized by pushdown automata.
CFG, on the other hand, is a superset of Regular Grammar: the set of regular languages sits inside the set of context-free languages.
It implies that every Regular Grammar is also context-free, but there exist some problems
that are beyond the scope of Regular Grammar. CFG is a helpful tool in describing the
syntax of programming languages.
Context-Free Grammar
In this section, we will first see the definition of context-free grammar and introduce
the terminology used in parsing technology.
A context-free grammar has four components:
A set of nonterminals (V). Nonterminals are syntactic variables that denote sets
of strings. The nonterminals define sets of strings that help define the language
generated by the grammar.
A set of tokens, known as terminal symbols (Σ). Terminals are the basic symbols
from which strings are formed.
A set of productions (P). The productions of a grammar specify the manner in
which the terminals and nonterminals can be combined to form strings. Each
production consists of a nonterminal called the left side of the production, an
arrow, and a sequence of tokens and/or nonterminals, called the right side of the
production.
One of the nonterminals is designated as the start symbol (S), from which the
derivation begins.
Strings are derived from the start symbol by repeatedly replacing a nonterminal (initially
the start symbol itself) by the right side of a production for that nonterminal.
Example
We take the problem of the palindrome language, which cannot be described by means of
a regular expression. That is, L = { w | w = wᴿ } is not a regular language. But it can be
described by means of a CFG, as illustrated below:
G = ( V, Σ, P, S )
Where:
V = { Q, Z, N }
Σ = { 0, 1 }
P = { Q → Z | Q → N | Q → 0 | Q → 1 | Q → ℇ | Z → 0Q0 | N → 1Q1 }
S = { Q }
This grammar describes the palindrome language, containing strings such as 1001, 11100111,
00100, 1010101, 11111, etc.
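The grammar above can be turned into a small recursive recognizer: each call consumes a matching pair of outer symbols, mirroring the productions Z → 0Q0 and N → 1Q1. This is a minimal sketch; the function name and the use of the empty string for ℇ are our own choices, not part of the grammar.

```python
def derives(w: str) -> bool:
    """Return True if Q can derive w under the palindrome grammar:
    Q -> 0Q0 | 1Q1 | 0 | 1 | e   (e is the empty string)
    """
    if len(w) <= 1:                 # Q -> e, Q -> 0, Q -> 1
        return all(c in "01" for c in w)
    # Z -> 0Q0 or N -> 1Q1: the two outer symbols must match
    return w[0] in "01" and w[0] == w[-1] and derives(w[1:-1])

print(derives("1001"), derives("00100"), derives("10"))
```

Each recursive step strips one symbol from each end, so the recursion depth is half the string length.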
Syntax Analyzers
A syntax analyzer or parser takes the input from a lexical analyzer in the form of token
streams. The parser analyzes the source code (token stream) against the production rules
to detect any errors in the code. The output of this phase is a parse tree.
This way, the parser accomplishes two tasks: parsing the code while looking for errors, and
generating a parse tree as the output of the phase.
Parsers are expected to parse the whole code even if some errors exist in the program.
Parsers use error recovery strategies, which we will learn later in this chapter.
Derivation
A derivation is basically a sequence of production rule applications used to obtain the input string.
During parsing, we make two decisions for some sentential form of the input:
Deciding which nonterminal is to be replaced.
Deciding the production rule by which the nonterminal will be replaced.
To decide which nonterminal to replace, we have two options.
Leftmost Derivation
If the sentential form of an input is scanned and replaced from left to right, it is called a
leftmost derivation. The sentential form derived by a leftmost derivation is called a left-
sentential form.
Rightmost Derivation
If we scan and replace the input with production rules from right to left, it is known as a
rightmost derivation. The sentential form derived from a rightmost derivation is called a right-
sentential form.
Example
Production rules:
E → E + E
E → E * E
E → id
Input string: id + id * id
The leftmost derivation is:
E → E * E
E → E + E * E
E → id + E * E
E → id + id * E
E → id + id * id
Notice that the leftmost side nonterminal is always processed first.
The rightmost derivation is:
E → E + E
E → E + E * E
E → E + E * id
E → E + id * id
E → id + id * id
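The two derivations can be reproduced mechanically. The sketch below (our own helper, not from the text) represents a sentential form as a list of symbols and replaces either the leftmost or the rightmost occurrence of a nonterminal:

```python
def replace(form, nt, body, leftmost=True):
    """Replace one occurrence of nonterminal nt in the sentential form."""
    if leftmost:
        i = form.index(nt)
    else:
        i = len(form) - 1 - form[::-1].index(nt)
    return form[:i] + body + form[i + 1:]

# Leftmost derivation of id + id * id, following the steps above
form = ["E"]
for body in (["E", "*", "E"], ["E", "+", "E"], ["id"], ["id"], ["id"]):
    form = replace(form, "E", body, leftmost=True)
    print(" ".join(form))
```

Running this prints each left-sentential form in turn, ending with id + id * id; passing leftmost=False and the rightmost sequence of bodies reproduces the rightmost derivation instead.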
Parse Tree
A parse tree is a graphical depiction of a derivation. It is convenient to see how strings are
derived from the start symbol. The start symbol of the derivation becomes the root of the
parse tree. Let us see this by an example from the last topic.
We take the leftmost derivation of id + id * id from the last topic:
E → E * E
E → E + E * E
E → id + E * E
E → id + id * E
E → id + id * id
The parse tree grows one level at each step of this derivation: Step 1 creates the root E with
children E, *, and E; each later step expands the leftmost remaining E in the same way, until
Step 5 completes the tree for id + id * id.
In a parse tree:
All leaf nodes are terminals.
All interior nodes are nonterminals.
Reading the leaf nodes from left to right gives the original input string.
A parse tree depicts the associativity and precedence of operators. The deepest subtree is
traversed first; therefore the operator in that subtree takes precedence over the operator
in its parent nodes.
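The tree for the derivation above can be written out explicitly. The Node class below is a hypothetical helper of our own; reading the leaves left to right recovers the input string:

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    symbol: str
    children: list = field(default_factory=list)

    def leaves(self):
        """Left-to-right leaf symbols (the terminals of the input)."""
        if not self.children:
            return [self.symbol]
        return [t for c in self.children for t in c.leaves()]

# Parse tree from the derivation E => E * E => E + E * E => ... => id + id * id
tree = Node("E", [
    Node("E", [Node("E", [Node("id")]), Node("+"), Node("E", [Node("id")])]),
    Node("*"),
    Node("E", [Node("id")]),
])
print(" ".join(tree.leaves()))  # id + id * id
```

Note that in this particular tree the + sits in the deeper subtree, so this derivation would evaluate the addition first; resolving that conflict is exactly what the precedence rules below are for.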
Ambiguity
A grammar G is said to be ambiguous if it has more than one parse tree (equivalently, more
than one leftmost or rightmost derivation) for at least one string.
Example
E → E + E
E → E – E
E → id
For the string id + id – id, the above grammar generates two parse trees:
A language is said to be inherently ambiguous only if every grammar that generates it is ambiguous; an ambiguous grammar by itself does not make its language inherently ambiguous.
Ambiguity in a grammar is not good for compiler construction. No method can detect and
remove ambiguity automatically, but it can be removed by either rewriting the whole
grammar without ambiguity, or by setting and following associativity and precedence
constraints.
Associativity
If an operand has operators on both sides, the side on which the operator takes this
operand is decided by the associativity of those operators. If the operation is left-
associative, the operand will be taken by the left operator; if the operation is right-
associative, the right operator will take the operand.
Example
Operations such as addition, multiplication, subtraction, and division are left-associative. If
the expression contains:
id op id op id
it will be evaluated as:
(id op id) op id
For example, (id + id) + id
Operations like exponentiation are right-associative, i.e., the order of evaluation of the same
expression will be:
id op (id op id)
For example, id ^ (id ^ id)
Precedence
If two different operators share a common operand, the precedence of operators decides
which will take the operand. That is, 2+3*4 can have two different parse trees, one
corresponding to (2+3)*4 and another corresponding to 2+(3*4). By setting precedence
among operators, this problem can be easily removed. As in the previous example,
mathematically * (multiplication) has precedence over + (addition), so the expression 2+3*4
will always be interpreted as:
2 + (3 * 4)
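Precedence and associativity can be enforced directly while parsing. The sketch below is a standard precedence-climbing evaluator (our own illustration, not from the text), with * and / binding tighter than + and -, and all operators left-associative:

```python
import re

PREC = {"+": 1, "-": 1, "*": 2, "/": 2}   # larger number = binds tighter

def tokenize(s):
    return re.findall(r"\d+|[+\-*/]", s)

def parse(tokens, min_prec=1):
    """Precedence climbing for left-associative binary operators."""
    lhs = int(tokens.pop(0))
    while tokens and PREC[tokens[0]] >= min_prec:
        op = tokens.pop(0)
        # parse the right side at one level higher => left associativity
        rhs = parse(tokens, PREC[op] + 1)
        lhs = {"+": lhs + rhs, "-": lhs - rhs,
               "*": lhs * rhs, "/": lhs // rhs}[op]
    return lhs

print(parse(tokenize("2+3*4")))  # 14: parsed as 2 + (3 * 4)
print(parse(tokenize("2-3-4")))  # -5: parsed as (2 - 3) - 4
```

Because * has a higher precedence number than +, the recursive call claims 3 * 4 before the addition is performed, giving 2 + (3 * 4) = 14 rather than (2 + 3) * 4 = 20.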
These methods decrease the chances of ambiguity in a language or its grammar.
Left Recursion
A grammar becomes left-recursive if it has any nonterminal 'A' whose derivation contains
'A' itself as the leftmost symbol. A left-recursive grammar is considered a problematic
situation for top-down parsers. Top-down parsers start parsing from the start symbol, which
in itself is a nonterminal. So, when the parser encounters the same nonterminal in its
derivation, it cannot judge when to stop expanding the left nonterminal and it
goes into an infinite loop.
Example:
(1) A => Aα | β
(2) S => Aα | β
A => Sd
(1) is an example of immediate left recursion, where A is a nonterminal symbol and α
represents a string of terminals and nonterminals.
(2) is an example of indirect left recursion.
A top-down parser will first try to expand A, which in turn yields a string beginning with A
itself, so the parser may go into a loop forever.
Removal of Left Recursion
One way to remove left recursion is to use the following technique:
The production
A => Aα | β
is converted into following productions
A => βA'
A'=> αA' | ε
This does not impact the strings derived from the grammar, but it removes immediate left
recursion.
The second method is to use the following algorithm, which eliminates all direct and
indirect left recursion.
START
Arrange the nonterminals in some order: A1, A2, A3, …, An
for each i from 1 to n
{
   for each j from 1 to i-1
   {
      replace each production of the form Ai → Aj γ
      with Ai → δ1 γ | δ2 γ | … | δk γ
      where Aj → δ1 | δ2 | … | δk are the current Aj productions
   }
   eliminate immediate left recursion among the Ai productions
}
END
Example
The production set
S => Aα | β
A => Sd
after applying the above algorithm, should become
S => Aα | β
A => Aαd | βd
and then, remove immediate left recursion using the first technique.
A => βdA'
A' => αdA' | ε
Now none of the productions has either direct or indirect left recursion.
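The first technique (A → Aα | β becomes A → βA', A' → αA' | ε) can be sketched in code. The representation below (productions as lists of symbols, "e" standing for ε) and the function name are our own conventions, and every alternative is assumed non-empty:

```python
def eliminate_immediate(nt, productions):
    """A -> A a1 | ... | b1 | ...  becomes  A -> b1 A' | ...,  A' -> a1 A' | ... | e.

    Each production is a list of symbols; "e" stands for epsilon.
    """
    recursive = [p[1:] for p in productions if p[0] == nt]   # the alphas
    others = [p for p in productions if p[0] != nt]          # the betas
    if not recursive:
        return {nt: productions}                             # nothing to do
    new = nt + "'"
    return {
        nt: [b + [new] for b in others],
        new: [a + [new] for a in recursive] + [["e"]],
    }

# A -> A a | b   ==>   A -> b A',  A' -> a A' | e
print(eliminate_immediate("A", [["A", "a"], ["b"]]))
```

Applying it to the example above, A → Aαd | βd becomes A → βdA' and A' → αdA' | ε, matching the result in the text.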
Left Factoring
If more than one grammar production rule has a common prefix string, then a top-down
parser cannot make a choice as to which of the productions it should take to parse the string
in hand.
Example
If a top-down parser encounters a production like
A → αβ | αγ | …
then it cannot determine which production to follow to parse the string, as both productions
start from the same terminal (or nonterminal). To remove this confusion, we use a
technique called left factoring.
Left factoring transforms the grammar to make it useful for top-down parsers. In this
technique, we make one production for each common prefix, and the rest of the derivation
is added by new productions.
Example
The above productions can be written as
A → αA'
A' → β | γ | …
Now the parser has only one production per prefix, which makes it easier to make decisions.
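This transformation can also be sketched in code. The helpers below are our own (symbols as list items, "e" for an empty tail), and this simple version factors only one shared prefix across all alternatives:

```python
def common_prefix(prods):
    """Longest sequence of symbols shared by the front of every alternative."""
    n = 0
    while all(len(p) > n for p in prods) and len({p[n] for p in prods}) == 1:
        n += 1
    return prods[0][:n]

def left_factor(nt, prods):
    """A -> a b | a g  ==>  A -> a A',  A' -> b | g  ("e" for an empty tail)."""
    prefix = common_prefix(prods)
    if not prefix:
        return {nt: prods}          # no common prefix: grammar is unchanged
    new = nt + "'"
    tails = [p[len(prefix):] or ["e"] for p in prods]
    return {nt: [prefix + [new]], new: tails}

# A -> a b | a g   ==>   A -> a A',  A' -> b | g
print(left_factor("A", [["a", "b"], ["a", "g"]]))
```

A production set where only some alternatives share a prefix would first need those alternatives grouped; this sketch assumes all alternatives share the prefix being factored.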
First and Follow Sets
An important part of parser table construction is creating the FIRST and FOLLOW sets. These
sets tell us which terminal symbols can appear at a given position in a derivation. They are
used to fill the parsing table, where the entry T[A, t] records the production rule by which
the nonterminal A should be replaced when the lookahead terminal is t.
First Set
This set is created to know which terminal symbols can be derived in the first position by a
nonterminal. For example, if
α → t β
then α derives t (a terminal) in the very first position, so t ∈ FIRST(α).
Algorithm for calculating the First set
Look at the definition of the FIRST(α) set:
if α is a terminal, then FIRST(α) = { α }.
if α is a nonterminal and α → ℇ is a production, then ℇ ∈ FIRST(α).
if α is a nonterminal and α → γ1 γ2 γ3 … γn is a production, then a terminal t is in FIRST(α)
whenever t ∈ FIRST(γ1), or t ∈ FIRST(γi) and all of γ1 … γi−1 can derive ℇ; if every γi can
derive ℇ, then ℇ ∈ FIRST(α) as well.
The First set can be seen as: FIRST(α) = { t | α ⇒* t β } ∪ { ℇ | α ⇒* ℇ }
Follow Set
Likewise, we calculate which terminal symbol immediately follows a nonterminal α in
production rules. We do not consider what the nonterminal can generate, but instead we
see what would be the next terminal symbol that follows the productions of a nonterminal.
Algorithm for calculating the Follow set:
if α is the start symbol, then $ ∈ FOLLOW(α).
if α is a nonterminal and has a production α → AB, then everything in FIRST(B) except ℇ is
in FOLLOW(A).
if α is a nonterminal and has a production α → AB, where B can derive ℇ (or B is absent),
then everything in FOLLOW(α) is in FOLLOW(A).
The Follow set can be seen as: FOLLOW(A) = { t | S ⇒* α A t β }
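The FIRST computation can be implemented as a fixed-point iteration: keep applying the rules above until no set grows. The sketch below uses our own conventions (a grammar as a dict of symbol lists, "e" for ℇ), and the toy expression grammar at the bottom is a hypothetical example:

```python
def first_sets(grammar, terminals):
    """Iteratively compute FIRST for every nonterminal; "e" denotes epsilon."""
    FIRST = {nt: set() for nt in grammar}
    changed = True
    while changed:
        changed = False
        for nt, prods in grammar.items():
            for prod in prods:
                add = set()
                for sym in prod:
                    if sym == "e":
                        add.add("e")
                        break
                    if sym in terminals:
                        add.add(sym)          # first terminal ends the scan
                        break
                    add |= FIRST[sym] - {"e"}
                    if "e" not in FIRST[sym]:
                        break                 # sym cannot vanish: stop here
                else:
                    add.add("e")              # every symbol in the body can vanish
                if not add <= FIRST[nt]:
                    FIRST[nt] |= add
                    changed = True
    return FIRST

# Hypothetical toy grammar: E -> T E',  E' -> + T E' | e,  T -> id
g = {"E": [["T", "E'"]],
     "E'": [["+", "T", "E'"], ["e"]],
     "T": [["id"]]}
print(first_sets(g, {"+", "id"}))
```

FOLLOW sets can be computed with the same fixed-point pattern, seeding FOLLOW of the start symbol with $ and propagating across production bodies using the FIRST sets just computed.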
Limitations of Syntax Analyzers
Syntax analyzers receive their input, in the form of tokens, from lexical analyzers. Lexical
analyzers are responsible for the validity of the tokens they supply to the syntax analyzer.
Syntax analyzers have the following drawbacks:
they cannot determine whether a token is valid,
they cannot determine whether a token is declared before it is used,
they cannot determine whether a token is initialized before it is used,
they cannot determine whether an operation performed on a token type is valid or not.
These tasks are accomplished by the semantic analyzer, which we shall study in Semantic
Analysis.