CD Unit 2

The document discusses syntax analysis in programming languages, focusing on the role of parsers, specifically top-down and bottom-up parsing methods. It explains recursive descent parsing, the computation of FIRST and FOLLOW sets, and the construction of predictive parsing tables for LL(1) grammars. Additionally, it covers error recovery techniques in predictive parsing, emphasizing the importance of synchronizing sets for effective error handling.

Uploaded by

vyshnavivadlamani

UNIT-II

SYNTAX ANALYSIS

ROLE OF THE PARSER

The parser obtains a string of tokens from the lexical analyzer and verifies that the string
can be generated by the grammar of the source language. The parser should report any syntax
errors in an intelligible fashion. The two types of parsers employed are:
1. Top-down parsers, which build parse trees from the top (root) to the bottom (leaves).
2. Bottom-up parsers, which build parse trees from the leaves and work up to the root.
Accordingly, there are two families of parsing methods: top-down parsing and bottom-up parsing.

TOP-DOWN PARSING
A program that performs syntax analysis is called a parser. A syntax analyzer takes tokens as
input and outputs an error message if the program syntax is wrong. The parser discussed here
uses symbol lookahead and an approach called top-down parsing without backtracking. Top-down
parsers check whether a string can be generated by a grammar by creating a parse tree starting
from the start symbol and working down. Bottom-up parsers, in contrast, check whether a string
can be generated by a grammar by creating a parse tree from the leaves and working up. Early
parser generators such as YACC create bottom-up parsers, whereas many Java parser
generators such as JavaCC create top-down parsers.

RECURSIVE DESCENT PARSING

Typically, top-down parsers are implemented as a set of recursive functions that descend
through a parse tree for a string. This approach is known as recursive descent parsing. When
it works without backtracking, it is a form of LL(k) parsing, where the first L stands for
left-to-right scanning of the input, the second L stands for leftmost derivation, and k
indicates k-symbol lookahead. Therefore, a parser using single-symbol lookahead and top-down
parsing without backtracking is called an LL(1) parser. In the following sections, we will
also use an extended BNF notation in which some regular expression operators are incorporated.
A syntax expression α | β defines sentences of the form α or β. A syntax of the form α β γ
defines sentences that consist of a sentence of the form α followed by a sentence of the form β
followed by a sentence of the form γ. A syntax of the form [ α ] defines zero or one occurrence
of the form α. A syntax of the form { α } defines zero or more occurrences of the form α.
A usual implementation of an LL(1) parser is to:
1. initialize its data structures,
2. get the lookahead token by calling scanner routines, and
3. call the routine that implements the start symbol.

Here is an example.
proc syntaxAnalysis()
begin
    initialize();   // initialize global data and structures
    nextToken();    // get the lookahead token
    program();      // parser routine that implements the start symbol
end;
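
The same three steps can be sketched as a small recursive descent parser in Python. This is only an illustration under stated assumptions: it uses the expression grammar from the FIRST and FOLLOW example in these notes (E -> TE', E' -> +TE' | ε, T -> FT', T' -> *FT' | ε, F -> (E) | id), it takes tokens as a plain list of strings, and the names Parser, ParseError and syntax_analysis are hypothetical.

```python
class ParseError(Exception):
    """Raised when the lookahead token cannot continue any production."""


class Parser:
    def __init__(self, tokens):
        # Step 1: initialize data structures; "$" marks the end of input.
        self.tokens = list(tokens) + ["$"]
        self.pos = 0
        self.lookahead = None

    def next_token(self):
        self.lookahead = self.tokens[self.pos]
        self.pos += 1

    def match(self, expected):
        if self.lookahead != expected:
            raise ParseError(f"expected {expected!r}, found {self.lookahead!r}")
        self.next_token()

    def E(self):                      # E -> T E'
        self.T()
        self.E_prime()

    def E_prime(self):                # E' -> + T E' | ε
        if self.lookahead == "+":
            self.match("+")
            self.T()
            self.E_prime()
        # otherwise choose E' -> ε and consume nothing

    def T(self):                      # T -> F T'
        self.F()
        self.T_prime()

    def T_prime(self):                # T' -> * F T' | ε
        if self.lookahead == "*":
            self.match("*")
            self.F()
            self.T_prime()

    def F(self):                      # F -> ( E ) | id
        if self.lookahead == "(":
            self.match("(")
            self.E()
            self.match(")")
        elif self.lookahead == "id":
            self.match("id")
        else:
            raise ParseError(f"unexpected {self.lookahead!r}")


def syntax_analysis(tokens):
    parser = Parser(tokens)           # initialize
    parser.next_token()               # step 2: get the lookahead token
    parser.E()                        # step 3: routine for the start symbol
    if parser.lookahead != "$":
        raise ParseError(f"unexpected {parser.lookahead!r} after expression")
    return True
```

Each nonterminal becomes one function, and each function chooses a production by inspecting only the single lookahead token, which is what makes the parser LL(1).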

FIRST AND FOLLOW

To compute FIRST(X) for all grammar symbols X, apply the following rules until
no more terminals or ε can be added to any FIRST set. Here ε denotes the empty string
(also written e).
1. If X is a terminal, then FIRST(X) is {X}.
2. If X -> ε is a production, then add ε to FIRST(X).
3. If X is a nonterminal and X -> Y1 Y2 ... Yk is a production, then place a in FIRST(X) if, for
some i, a is in FIRST(Yi) and ε is in all of FIRST(Y1), ..., FIRST(Yi-1); that is,
Y1 ... Yi-1 =*> ε. If ε is in FIRST(Yj) for all j = 1, 2, ..., k, then add ε to FIRST(X). For
example, everything in FIRST(Y1) except ε is surely in FIRST(X). If Y1 does not derive ε, then
we add nothing more to FIRST(X), but if Y1 =*> ε, then we add FIRST(Y2), and so on.
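
The three FIRST rules can be applied mechanically by iterating until no set changes. The sketch below is one possible way to do that in Python, not a definitive implementation. It assumes the grammar is stored as a dictionary from nonterminal to a list of right sides, uses the expression grammar from the worked example in these notes, and the names GRAMMAR, EPS and compute_first are hypothetical.

```python
EPS = "ε"  # assumed marker for the empty string

# Expression grammar from the notes; any symbol that is not a key is a terminal.
GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], [EPS]],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], [EPS]],
    "F":  [["(", "E", ")"], ["id"]],
}


def compute_first(grammar):
    first = {nt: set() for nt in grammar}

    def first_of(symbol):
        # Rule 1: FIRST of a terminal is the terminal itself.
        return first[symbol] if symbol in grammar else {symbol}

    changed = True
    while changed:                      # repeat until nothing can be added
        changed = False
        for head, bodies in grammar.items():
            for body in bodies:
                before = len(first[head])
                if body == [EPS]:
                    first[head].add(EPS)            # rule 2
                else:
                    for sym in body:                # rule 3
                        f = first_of(sym)
                        first[head] |= f - {EPS}
                        if EPS not in f:
                            break                   # Yi does not derive ε
                    else:
                        first[head].add(EPS)        # every Yj derives ε
                if len(first[head]) != before:
                    changed = True
    return first
```

Calling compute_first(GRAMMAR) gives {(, id} for E, T and F, and adds ε to the sets for E' and T', which agrees with the sets listed in the example that follows.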
To compute FOLLOW(A) for all nonterminals A, apply the following rules until nothing
can be added to any FOLLOW set. Here α and β stand for arbitrary strings of grammar symbols,
and ε is the empty string.
1. Place $ in FOLLOW(S), where S is the start symbol and $ is the input right endmarker.
2. If there is a production A -> αBβ, then everything in FIRST(β) except ε is placed in
FOLLOW(B).
3. If there is a production A -> αB, or a production A -> αBβ where FIRST(β) contains ε, then
everything in FOLLOW(A) is in FOLLOW(B).
Consider the following example to understand the concept of FIRST and FOLLOW. Find the FIRST
and FOLLOW sets of all nonterminals in the grammar below (ε denotes the empty string):
E  -> TE'
E' -> +TE' | ε
T  -> FT'
T' -> *FT' | ε
F  -> (E) | id
Then:
FIRST(E) = FIRST(T) = FIRST(F) = {(, id}
FIRST(E') = {+, ε}
FIRST(T') = {*, ε}
FOLLOW(E) = FOLLOW(E') = {), $}
FOLLOW(T) = FOLLOW(T') = {+, ), $}
FOLLOW(F) = {+, *, ), $}
For example, id and left parenthesis are added to FIRST(F) by rule 3 in the definition of FIRST
with i = 1 in each case, since FIRST(id) = {id} and FIRST('(') = {(} by rule 1. Then by rule 3
with i = 1, the production T -> FT' implies that id and left parenthesis belong to FIRST(T)
also.
To compute FOLLOW, we put $ in FOLLOW(E) by rule 1 for FOLLOW. By rule 2 applied
to production F -> (E), right parenthesis is also in FOLLOW(E). By rule 3 applied to
production E -> TE', $ and right parenthesis are in FOLLOW(E').
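
The FOLLOW rules can be checked against this example with a short Python sketch. It is an illustration only: it takes the FIRST sets listed above as given input instead of recomputing them, and the names GRAMMAR, FIRST, first_of_string and compute_follow are hypothetical.

```python
EPS, END = "ε", "$"   # assumed markers for the empty string and the endmarker

GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], [EPS]],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], [EPS]],
    "F":  [["(", "E", ")"], ["id"]],
}

# FIRST sets of the nonterminals, copied from the example in the notes.
FIRST = {
    "E": {"(", "id"}, "T": {"(", "id"}, "F": {"(", "id"},
    "E'": {"+", EPS}, "T'": {"*", EPS},
}


def first_of_string(symbols):
    """FIRST of a sequence of grammar symbols (ε if the sequence is empty)."""
    result = set()
    for sym in symbols:
        f = FIRST.get(sym, {sym})        # for a terminal, FIRST is {sym}
        result |= f - {EPS}
        if EPS not in f:
            return result
    result.add(EPS)                      # every symbol can derive ε
    return result


def compute_follow(grammar, start):
    follow = {nt: set() for nt in grammar}
    follow[start].add(END)                               # rule 1
    changed = True
    while changed:                                       # iterate to a fixed point
        changed = False
        for head, bodies in grammar.items():
            for body in bodies:
                for i, sym in enumerate(body):
                    if sym not in grammar:
                        continue                         # only nonterminals have FOLLOW
                    before = len(follow[sym])
                    rest = first_of_string(body[i + 1:])
                    follow[sym] |= rest - {EPS}          # rule 2
                    if EPS in rest:
                        follow[sym] |= follow[head]      # rule 3
                    if len(follow[sym]) != before:
                        changed = True
    return follow
```

Running compute_follow(GRAMMAR, "E") reproduces the FOLLOW sets given above, for instance {+, *, ), $} for F.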
CONSTRUCTION OF PREDICTIVE PARSING TABLES
For any grammar G, the following algorithm can be used to construct the predictive parsing
table.
Input: Grammar G
Output: Parsing table M
Method:
1. For each production A -> α of the grammar (α being the right side), do steps 2 and 3.
2. For each terminal a in FIRST(α), add A -> α to M[A,a].
3. If ε (the empty string) is in FIRST(α), add A -> α to M[A,b] for each terminal b in
FOLLOW(A). If ε is in FIRST(α) and $ is in FOLLOW(A), add A -> α to M[A,$].
4. Make each undefined entry of M an error entry.
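
As a rough illustration of this method, the Python sketch below fills the table for the expression grammar used in the FIRST and FOLLOW example. It assumes the FIRST and FOLLOW sets from that example as given input, stores each entry as a list of right sides so that a multiply defined entry would show up as a list longer than one, and treats a missing key as an error entry. All names (GRAMMAR, FIRST, FOLLOW, build_table) are hypothetical.

```python
EPS, END = "ε", "$"   # assumed markers for the empty string and the endmarker

GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], [EPS]],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], [EPS]],
    "F":  [["(", "E", ")"], ["id"]],
}
FIRST = {
    "E": {"(", "id"}, "T": {"(", "id"}, "F": {"(", "id"},
    "E'": {"+", EPS}, "T'": {"*", EPS},
}
FOLLOW = {
    "E": {")", END}, "E'": {")", END},
    "T": {"+", ")", END}, "T'": {"+", ")", END},
    "F": {"+", "*", ")", END},
}


def first_of_string(symbols):
    """FIRST of a right side, built from the FIRST sets of its symbols."""
    result = set()
    for sym in symbols:
        f = FIRST.get(sym, {sym})        # for a terminal, FIRST is {sym}
        result |= f - {EPS}
        if EPS not in f:
            return result
    result.add(EPS)
    return result


def build_table(grammar):
    table = {}
    for head, bodies in grammar.items():          # step 1
        for body in bodies:
            f = first_of_string(body)
            targets = f - {EPS}                   # step 2
            if EPS in f:
                targets |= FOLLOW[head]           # step 3, covers $ as well
            for terminal in targets:
                table.setdefault((head, terminal), []).append(body)
    return table                                  # step 4: absent key = error
```

For this grammar every entry holds exactly one production, so the check all(len(v) == 1 for v in build_table(GRAMMAR).values()) succeeds.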

LL(1) GRAMMAR
The above algorithm can be applied to any grammar G to produce a parsing table M. For
some grammars, for example if G is left recursive or ambiguous, M will have at least
one multiply defined entry. A grammar whose parsing table has no multiply defined entries
is said to be LL(1). It can be shown that, for every LL(1) grammar G, the above algorithm
produces a parsing table M that parses all and only the sentences of G. LL(1)
grammars have several distinctive properties. No ambiguous or left recursive grammar can
be LL(1). There remains the question of what should be done when a table has multiply defined
entries. One easy approach is to transform the grammar by eliminating all left recursion and
then left factoring, hoping to obtain a grammar whose parsing table has no multiply defined
entries. Unfortunately, there are some grammars for which no amount of alteration will yield
an LL(1) grammar. In general, there are no universal rules for converting multiply defined
entries into single-valued entries without affecting the language recognized by the parser.
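
To make the left recursion step concrete, here is a minimal Python sketch of the standard transformation for immediate left recursion: A -> Aα | β becomes A -> βA' and A' -> αA' | ε. It is a simplified illustration that does not handle indirect left recursion, and the function name and data layout are hypothetical.

```python
EPS = "ε"  # assumed marker for the empty string


def eliminate_immediate_left_recursion(head, bodies):
    """Rewrite A -> A α | β as A -> β A' and A' -> α A' | ε."""
    # Right sides that start with the head itself, with the head stripped off.
    recursive = [b[1:] for b in bodies if b and b[0] == head]
    # All remaining right sides.
    others = [b for b in bodies if not b or b[0] != head]
    if not recursive:
        return {head: bodies}            # nothing to do
    new = head + "'"                     # fresh nonterminal A'
    return {
        head: [[s for s in beta if s != EPS] + [new] for beta in others],
        new: [alpha + [new] for alpha in recursive] + [[EPS]],
    }
```

Applied to E -> E + T | T, the call eliminate_immediate_left_recursion("E", [["E", "+", "T"], ["T"]]) returns the productions E -> TE' and E' -> +TE' | ε, which is the form of the expression grammar used throughout these notes.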

The main difficulty in using predictive parsing is in writing a grammar for the source
language such that a predictive parser can be constructed from the grammar. Although left
recursion elimination and left factoring are easy to do, they make the resulting grammar hard
to read and difficult to use for translation purposes. To alleviate some of this difficulty, a
common organization for a parser in a compiler is to use a predictive parser for control
constructs and to use operator precedence for expressions. However, if an LR parser generator
is available, one can get all the benefits of predictive parsing and operator precedence
automatically.
ERROR RECOVERY IN PREDICTIVE PARSING
The stack of a nonrecursive predictive parser makes explicit the terminals and nonterminals
that the parser hopes to match with the remainder of the input. We shall therefore refer to
symbols on the parser stack in the following discussion. An error is detected during
predictive parsing when the terminal on top of the stack does not match the next input
symbol or when nonterminal A is on top of the stack, a is the next input symbol, and the
parsing table entry M[A,a] is empty.
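
The two error conditions can be seen in a small table-driven parser. The Python sketch below is an illustration under stated assumptions: the table is written out by hand for the expression grammar from the FIRST and FOLLOW example, a missing key stands for an empty entry, the parser stops at the first error without attempting recovery, and the names TABLE, NONTERMINALS and parse are hypothetical.

```python
EPS, END = "ε", "$"   # assumed markers for the empty string and the endmarker
NONTERMINALS = {"E", "E'", "T", "T'", "F"}

# Predictive parsing table M for the expression grammar; absent key = empty entry.
TABLE = {
    ("E", "("): ["T", "E'"],        ("E", "id"): ["T", "E'"],
    ("E'", "+"): ["+", "T", "E'"],  ("E'", ")"): [EPS],  ("E'", END): [EPS],
    ("T", "("): ["F", "T'"],        ("T", "id"): ["F", "T'"],
    ("T'", "*"): ["*", "F", "T'"],  ("T'", "+"): [EPS],
    ("T'", ")"): [EPS],             ("T'", END): [EPS],
    ("F", "("): ["(", "E", ")"],    ("F", "id"): ["id"],
}


def parse(tokens, start="E"):
    tokens = list(tokens) + [END]
    stack = [END, start]              # the top of the stack is the end of the list
    pos = 0
    while stack:
        top = stack.pop()
        a = tokens[pos]
        if top in NONTERMINALS:
            body = TABLE.get((top, a))
            if body is None:
                # Second error condition: M[A, a] is empty.
                return f"error: M[{top}, {a}] is empty"
            # Push the right side in reverse so its first symbol ends up on top.
            stack.extend(sym for sym in reversed(body) if sym != EPS)
        elif top == a:
            pos += 1                  # terminal matches the input symbol
        else:
            # First error condition: terminal on top does not match the input.
            return f"error: expected {top}, found {a}"
    return "accept"
```

With this table, parse(["id", "+", "*", "id"]) stops with an empty-entry error at M[T, *], while parse(["(", "id"]) stops because the terminal ) on the stack does not match $.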
Panic-mode error recovery is based on the idea of skipping symbols on the input until a token
in a selected set of synchronizing tokens appears. Its effectiveness depends on the choice of
synchronizing set. The sets should be chosen so that the parser recovers quickly from errors
that are likely to occur in practice. Some heuristics are as follows:

1. As a starting point, we can place all symbols in FOLLOW(A) into the synchronizing
set for nonterminal A. If we skip tokens until an element of FOLLOW(A) is seen and
pop A from the stack, it is likely that parsing can continue.
2. It is not enough to use FOLLOW(A) as the synchronizing set for A. For example, if
semicolons terminate statements, as in C, then keywords that begin statements may
not appear in the FOLLOW set of the nonterminal generating expressions. A missing
semicolon after an assignment may therefore result in the keyword beginning the next
statement being skipped. Often, there is a hierarchical structure on constructs in a
language; e.g., expressions appear within statements, which appear within blocks, and
so on. We can add to the synchronizing set of a lower construct the symbols that
begin higher constructs. For example, we might add keywords that begin statements
to the synchronizing sets for the nonterminals generating expressions.
3. If we add symbols in FIRST(A) to the synchronizing set for nonterminal A, then it
may be possible to resume parsing according to A if a symbol in FIRST(A) appears in
the input.
4. If a nonterminal can generate the empty string, then the production deriving
ε (the empty string) can be used as a default. Doing so may postpone some error
detection, but cannot cause an error to be missed. This approach reduces the number of
nonterminals that have to be considered during error recovery.
5. If a terminal on top of the stack cannot be matched, a simple idea is to pop
the terminal, issue a message saying that the terminal was inserted, and
continue parsing. In effect, this approach takes the synchronizing set of a
token to consist of all other tokens.
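
A minimal Python sketch of panic-mode recovery is given below. It is only an illustration and applies just two of the heuristics: FOLLOW(A) is used as the synchronizing set for each nonterminal A, and an unmatched terminal on the stack is popped and reported as inserted. The table and FOLLOW sets are those of the expression grammar from the FIRST and FOLLOW example, and the names TABLE, FOLLOW and parse_with_recovery are hypothetical.

```python
EPS, END = "ε", "$"   # assumed markers for the empty string and the endmarker

TABLE = {
    ("E", "("): ["T", "E'"],        ("E", "id"): ["T", "E'"],
    ("E'", "+"): ["+", "T", "E'"],  ("E'", ")"): [EPS],  ("E'", END): [EPS],
    ("T", "("): ["F", "T'"],        ("T", "id"): ["F", "T'"],
    ("T'", "*"): ["*", "F", "T'"],  ("T'", "+"): [EPS],
    ("T'", ")"): [EPS],             ("T'", END): [EPS],
    ("F", "("): ["(", "E", ")"],    ("F", "id"): ["id"],
}
# Synchronizing sets: here simply FOLLOW(A) for each nonterminal A.
FOLLOW = {
    "E": {")", END}, "E'": {")", END},
    "T": {"+", ")", END}, "T'": {"+", ")", END},
    "F": {"+", "*", ")", END},
}


def parse_with_recovery(tokens, start="E"):
    """Parse to the end of input and return the list of error messages."""
    tokens = list(tokens) + [END]
    stack, pos, errors = [END, start], 0, []
    while stack:
        top, a = stack[-1], tokens[pos]
        if top in FOLLOW:                          # top is a nonterminal
            body = TABLE.get((top, a))
            if body is not None:
                stack.pop()
                stack.extend(s for s in reversed(body) if s != EPS)
            elif a in FOLLOW[top] or a == END:
                errors.append(f"missing {top} before {a}")
                stack.pop()                        # synchronize: give up on A
            else:
                errors.append(f"skipped {a}")
                pos += 1                           # skip the input symbol
        elif top == a:
            stack.pop()
            pos += 1
        elif top == END:
            errors.append(f"skipped {a}")          # extra input after the sentence
            pos += 1
        else:
            errors.append(f"inserted missing {top}")
            stack.pop()                            # act as if the terminal was present
    return errors
```

For the input id + * id, the parser skips the stray * and then finishes normally, so a single message is reported instead of the parse being abandoned at the first error.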

*****
