0% found this document useful (0 votes)

14 views46 pages

CH 04

Uploaded by

SARANYA M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views46 pages

CH 04

Uploaded by

SARANYA M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 46

Chapter 4

Lexical and Syntax

Analysis

ISBN 0-321-49362-1
Chapter 4 Topics

• Introduction
• Lexical Analysis
• The Parsing Problem
• Recursive-Descent Parsing
• Bottom-Up Parsing

Copyright © 2009 Addison-Wesley. All rights reserved. 1-2

Introduction

• Language implementation systems must

analyze source code, regardless of the
specific implementation approach
• Nearly all syntax analysis is based on a
formal description of the syntax of the
source language (BNF)

Copyright © 2009 Addison-Wesley. All rights reserved. 1-3

Syntax Analysis

• The syntax analysis portion of a language

processor nearly always consists of two
parts:
– A low-level part called a lexical analyzer
(mathematically, a finite automaton based on
a regular grammar)
– A high-level part called a syntax analyzer, or
parser (mathematically, a push-down
automaton based on a context-free grammar,
or BNF)

Copyright © 2009 Addison-Wesley. All rights reserved. 1-4

Advantages of Using BNF to Describe
Syntax

• Provides a clear and concise syntax

description
• The parser can be based directly on the
BNF
• Parsers based on BNF are easy to
maintain

Copyright © 2009 Addison-Wesley. All rights reserved. 1-5

Reasons to Separate Lexical and
Syntax Analysis

• Simplicity - less complex approaches can

be used for lexical analysis; separating
them simplifies the parser
• Efficiency - separation allows optimization
of the lexical analyzer
• Portability - parts of the lexical analyzer
may not be portable, but the parser
always is portable

Copyright © 2009 Addison-Wesley. All rights reserved. 1-6

Lexical Analysis

• A lexical analyzer is a pattern matcher for

character strings
• A lexical analyzer is a “front-end” for the
parser
• Identifies substrings of the source
program that belong together - lexemes
– Lexemes match a character pattern, which is
associated with a lexical category called a
token
– sum is a lexeme; its token may be IDENT

Copyright © 2009 Addison-Wesley. All rights reserved. 1-7

Lexical Analysis (continued)

• The lexical analyzer is usually a function that is

called by the parser when it needs the next token
• Three approaches to building a lexical analyzer:
– Write a formal description of the tokens and use a
software tool that constructs table-driven lexical
analyzers given such a description
– Design a state diagram that describes the tokens and
write a program that implements the state diagram
– Design a state diagram that describes the tokens and
hand-construct a table-driven implementation of the
state diagram

Copyright © 2009 Addison-Wesley. All rights reserved. 1-8

State Diagram Design

– A naïve state diagram would have a transition

from every state on every character in the
source language - such a diagram would be
very large!

Copyright © 2009 Addison-Wesley. All rights reserved. 1-9

Lexical Analysis (cont.)

• In many cases, transitions can be

combined to simplify the state diagram
– When recognizing an identifier, all uppercase
and lowercase letters are equivalent
• Use a character class that includes all letters
– When recognizing an integer literal, all digits
are equivalent - use a digit class

Copyright © 2009 Addison-Wesley. All rights reserved. 1-10

Lexical Analysis (cont.)

• Reserved words and identifiers can be

recognized together (rather than having a
part of the diagram for each reserved
word)
– Use a table lookup to determine whether a
possible identifier is in fact a reserved word

Copyright © 2009 Addison-Wesley. All rights reserved. 1-11

Lexical Analysis (cont.)

• Convenient utility subprograms:

– getChar - gets the next character of input,
puts it in nextChar, determines its class and
puts the class in charClass
– addChar - puts the character from nextChar
into the place the lexeme is being
accumulated, lexeme
– lookup - determines whether the string in
lexeme is a reserved word (returns a code)

Copyright © 2009 Addison-Wesley. All rights reserved. 1-12

State Diagram

Copyright © 2009 Addison-Wesley. All rights reserved. 1-13

Lexical Analyzer

Implementation:
 SHOW front.c (pp. 176-181)

- Following is the output of the lexical analyzer

of
front.c when used on (sum + 47) / total

Next token is: 25 Next lexeme is (

Next token is: 11 Next lexeme is sum
Next token is: 21 Next lexeme is +
Next token is: 10 Next lexeme is 47
Next token is: 26 Next lexeme is )
Next token is: 24 Next lexeme is /
Next token is: 11 Next lexeme is total
Next token is: -1 Next lexeme is EOF
Copyright © 2009 Addison-Wesley. All rights reserved. 1-14
The Parsing Problem

• Goals of the parser, given an input

program:
– Find all syntax errors; for each, produce an
appropriate diagnostic message and recover
quickly
– Produce the parse tree, or at least a trace of
the parse tree, for the program

Copyright © 2009 Addison-Wesley. All rights reserved. 1-15

The Parsing Problem (cont.)

• Two categories of parsers

– Top down - produce the parse tree, beginning
at the root
• Order is that of a leftmost derivation
• Traces or builds the parse tree in preorder
– Bottom up - produce the parse tree, beginning
at the leaves
• Order is that of the reverse of a rightmost
derivation
• Useful parsers look only one token ahead
in the input
Copyright © 2009 Addison-Wesley. All rights reserved. 1-16
The Parsing Problem (cont.)

• Top-down Parsers
– Given a sentential form, xA , the parser must
choose the correct A-rule to get the next
sentential form in the leftmost derivation,
using only the first token produced by A
• The most common top-down parsing
algorithms:
– Recursive descent - a coded implementation
– LL parsers - table driven implementation

Copyright © 2009 Addison-Wesley. All rights reserved. 1-17

The Parsing Problem (cont.)

• Bottom-up parsers
– Given a right sentential form, , determine
what substring of  is the right-hand side of
the rule in the grammar that must be reduced
to produce the previous sentential form in the
right derivation
– The most common bottom-up parsing
algorithms are in the LR family

Copyright © 2009 Addison-Wesley. All rights reserved. 1-18

The Parsing Problem (cont.)

• The Complexity of Parsing

– Parsers that work for any unambiguous
grammar are complex and inefficient ( O(n3),
where n is the length of the input )
– Compilers use parsers that only work for a
subset of all unambiguous grammars, but do it
in linear time ( O(n), where n is the length of
the input )

Copyright © 2009 Addison-Wesley. All rights reserved. 1-19

Recursive-Descent Parsing

• There is a subprogram for each

nonterminal in the grammar, which can
parse sentences that can be generated by
that nonterminal
• EBNF is ideally suited for being the basis
for a recursive-descent parser, because
EBNF minimizes the number of
nonterminals

Copyright © 2009 Addison-Wesley. All rights reserved. 1-20

Recursive-Descent Parsing (cont.)

• A grammar for simple expressions:

<expr>  <term> {(+ | -) <term>}

<term>  <factor> {(* | /) <factor>}
<factor>  id | int_constant | ( <expr> )

Copyright © 2009 Addison-Wesley. All rights reserved. 1-21

Recursive-Descent Parsing (cont.)

• Assume we have a lexical analyzer named

lex, which puts the next token code in
nextToken
• The coding process when there is only one
RHS:
– For each terminal symbol in the RHS, compare
it with the next input token; if they match,
continue, else there is an error
– For each nonterminal symbol in the RHS, call
its associated parsing subprogram

Copyright © 2009 Addison-Wesley. All rights reserved. 1-22

Recursive-Descent Parsing (cont.)
/* Function expr
Parses strings in the language
generated by the rule:
<expr> → <term> {(+ | -) <term>}
*/

void expr() {

/* Parse the first term */

term();
/* As long as the next token is + or -, call
lex to get the next token and parse the
next term */

while (nextToken == ADD_OP ||

nextToken == SUB_OP){
lex();
term();
}
}

Copyright © 2009 Addison-Wesley. All rights reserved. 1-23

Recursive-Descent Parsing (cont.)

• This particular routine does not detect errors

• Convention: Every parsing routine leaves the
next token in nextToken

Copyright © 2009 Addison-Wesley. All rights reserved. 1-24

Recursive-Descent Parsing (cont.)

• A nonterminal that has more than one

RHS requires an initial process to
determine which RHS it is to parse
– The correct RHS is chosen on the basis of the
next token of input (the lookahead)
– The next token is compared with the first
token that can be generated by each RHS until
a match is found
– If no match is found, it is a syntax error

Copyright © 2009 Addison-Wesley. All rights reserved. 1-25

Recursive-Descent Parsing (cont.)

/* term
Parses strings in the language generated by the rule:
<term> -> <factor> {(* | /) <factor>)
*/
void term() {
printf("Enter <term>\n");
/* Parse the first factor */
factor();
/* As long as the next token is * or /,
next token and parse the next factor */
while (nextToken == MULT_OP || nextToken == DIV_OP) {
lex();
factor();
}
printf("Exit <term>\n");
} /* End of function term */

Copyright © 2009 Addison-Wesley. All rights reserved. 1-26

Recursive-Descent Parsing (cont.)

/* Function factor
Parses strings in the language
generated by the rule:
<factor> -> id | (<expr>) */

void factor() {

/* Determine which RHS */

if (nextToken) == ID_CODE || nextToken == INT_CODE)

/* For the RHS id, just call lex */

lex();

/* If the RHS is (<expr>) – call lex to pass over the left parenthesis,
call expr, and check for the right parenthesis */
else if (nextToken == LP_CODE) {
lex();
expr();
if (nextToken == RP_CODE)
lex();
else
error();
} /* End of else if (nextToken == ... */

else error(); /* Neither RHS matches */

}

Copyright © 2009 Addison-Wesley. All rights reserved. 1-27

Recursive-Descent Parsing (cont.)
- Trace of the lexical and syntax analyzers on (sum + 47) / total

Next token is: 25 Next lexeme is ( Next token is: 11 Next lexeme is total
Enter <expr> Enter <factor>
Enter <term> Next token is: -1 Next lexeme is EOF
Enter <factor> Exit <factor>
Next token is: 11 Next lexeme is sum Exit <term>
Enter <expr> Exit <expr>
Enter <term>
Enter <factor>
Next token is: 21 Next lexeme is +
Exit <factor>
Exit <term>
Next token is: 10 Next lexeme is 47
Enter <term>
Enter <factor>
Next token is: 26 Next lexeme is )
Exit <factor>
Exit <term>
Exit <expr>
Next token is: 24 Next lexeme is /
Exit <factor>

Copyright © 2009 Addison-Wesley. All rights reserved. 1-28

Recursive-Descent Parsing (cont.)

• The LL Grammar Class

– The Left Recursion Problem
• If a grammar has left recursion, either direct or
indirect, it cannot be the basis for a top-down
parser
– A grammar can be modified to remove left recursion
For each nonterminal, A,
1. Group the A-rules as A → Aα1 | … | Aαm | β1 | β2 | … |
βn
where none of the β‘s begins with A
2. Replace the original A-rules with
A → β1A’ | β2A’ | … | βnA’
A’ → α1A’ | α2A’ | … | αmA’ | ε
Copyright © 2009 Addison-Wesley. All rights reserved. 1-29
Recursive-Descent Parsing (cont.)

• The other characteristic of grammars that

disallows top-down parsing is the lack of
pairwise disjointness
– The inability to determine the correct RHS on
the basis of one token of lookahead
– Def: FIRST() = {a |  =>* a }
(If  =>* ,  is in FIRST())

Copyright © 2009 Addison-Wesley. All rights reserved. 1-30

Recursive-Descent Parsing (cont.)

• Pairwise Disjointness Test:

– For each nonterminal, A, in the grammar that
has more than one RHS, for each pair of rules,
A  i and A  j, it must be true that
FIRST(i) ⋂ FIRST(j) = 
• Examples:
A  a | bB | cAb
A  a | aB

Recursive-Descent Parsing (cont.)

• Left factoring can resolve the problem

Replace
<variable>  identifier | identifier
[<expression>]
with
<variable>  identifier <new>
<new>   | [<expression>]
or
<variable>  identifier [[<expression>]]
(the outer brackets are metasymbols of EBNF)

Bottom-up Parsing

• The parsing problem is finding the correct

RHS in a right-sentential form to reduce to
get the previous right-sentential form in
the derivation

Bottom-up Parsing (cont.)

•Intuition about handles:

– Def:  is the handle of the right sentential form
 = w if and only if S =>*rm Aw =>rm w

– Def:  is a phrase of the right sentential form

 if and only if S =>*  = 1A2 =>+ 12

– Def:  is a simple phrase of the right sentential

form  if and only if S =>*  = 1A2 => 12

Bottom-up Parsing (cont.)

• Intuition about handles (continued):

– The handle of a right sentential form is its
leftmost simple phrase
– Given a parse tree, it is now easy to find the
handle
– Parsing can be thought of as handle pruning

Bottom-up Parsing (cont.)

• Shift-Reduce Algorithms
– Reduce is the action of replacing the handle
on the top of the parse stack with its
corresponding LHS
– Shift is the action of moving the next token to
the top of the parse stack

Bottom-up Parsing (cont.)

• Advantages of LR parsers:
– They will work for nearly all grammars that
describe programming languages.
– They work on a larger class of grammars than
other bottom-up algorithms, but are as
efficient as any other bottom-up parser.
– They can detect syntax errors as soon as it is
possible.
– The LR class of grammars is a superset of the
class parsable by LL parsers.

Bottom-up Parsing (cont.)

• LR parsers must be constructed with a

tool
• Knuth’s insight: A bottom-up parser could
use the entire history of the parse, up to
the current point, to make parsing
decisions
– There were only a finite and relatively small
number of different parse situations that could
have occurred, so the history could be stored
in a parser state, on the parse stack

Bottom-up Parsing (cont.)

• An LR configuration stores the state of an

LR parser

(S0X1S1X2S2…XmSm, aiai+1…an$)

Bottom-up Parsing (cont.)

• LR parsers are table driven, where the

table has two components, an ACTION
table and a GOTO table
– The ACTION table specifies the action of the
parser, given the parser state and the next
token
• Rows are state names; columns are terminals
– The GOTO table specifies which state to put
on top of the parse stack after a reduction
action is done
• Rows are state names; columns are
nonterminals
Copyright © 2009 Addison-Wesley. All rights reserved. 1-40
Structure of An LR Parser

Bottom-up Parsing (cont.)

• Initial configuration: (S0, a1…an$)

• Parser actions:
– If ACTION[Sm, ai] = Shift S, the next
configuration is:
(S0X1S1X2S2…XmSmaiS, ai+1…an$)
– If ACTION[Sm, ai] = Reduce A   and S =
GOTO[Sm-r, A], where r = the length of , the
next configuration is
(S0X1S1X2S2…Xm-rSm-rAS, aiai+1…an$)

Bottom-up Parsing (cont.)

• Parser actions (continued):

– If ACTION[Sm, ai] = Accept, the parse is
complete and no errors were found.
– If ACTION[Sm, ai] = Error, the parser calls an
error-handling routine.

LR Parsing Table

Bottom-up Parsing (cont.)

• A parser table can be generated from a

given grammar with a tool, e.g., yacc

Summary

• Syntax analysis is a common part of language

implementation
• A lexical analyzer is a pattern matcher that
isolates small-scale parts of a program
– Detects syntax errors
– Produces a parse tree
• A recursive-descent parser is an LL parser
– EBNF
• Parsing problem for bottom-up parsers: find the
substring of current sentential form
• The LR family of shift-reduce parsers is the most
common bottom-up parsing approach

Chapter 3 - Lexical Analysis
100% (3)
Chapter 3 - Lexical Analysis
51 pages
Class 9 (Moral Science)
No ratings yet
Class 9 (Moral Science)
27 pages
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
41 pages
Lecture 02
No ratings yet
Lecture 02
150 pages
Compiler-Lexical Analysis
100% (1)
Compiler-Lexical Analysis
59 pages
SP Unit III-2024-25
No ratings yet
SP Unit III-2024-25
126 pages
pl9ch4 Backup
No ratings yet
pl9ch4 Backup
55 pages
Unit 2
No ratings yet
Unit 2
14 pages
Microprocessor Tutorial
No ratings yet
Microprocessor Tutorial
46 pages
pl12ch4 061259
No ratings yet
pl12ch4 061259
46 pages
L4 Syntax-Analysis
No ratings yet
L4 Syntax-Analysis
50 pages
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Lexical and Syntax Analysis: CSE 325/CSE 425: Concepts of Programming Language
41 pages
Sebesta Chapter 4 With Additions
No ratings yet
Sebesta Chapter 4 With Additions
46 pages
Lexical Analysis and Lexical Analyzer Generators: COP5621 Compiler Construction
No ratings yet
Lexical Analysis and Lexical Analyzer Generators: COP5621 Compiler Construction
52 pages
CH 4
No ratings yet
CH 4
46 pages
CSC 461 Final
No ratings yet
CSC 461 Final
170 pages
Chapter 3 - Lexical Analysis and Lexical Analyzer Generators
No ratings yet
Chapter 3 - Lexical Analysis and Lexical Analyzer Generators
52 pages
Ch3 1
No ratings yet
Ch3 1
52 pages
Lexical and Syntax Analysis-4
No ratings yet
Lexical and Syntax Analysis-4
54 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
34 pages
Lexical Analyzer (Compiler Contruction)
100% (1)
Lexical Analyzer (Compiler Contruction)
6 pages
Saudi Aramco Inspection Checklist
100% (2)
Saudi Aramco Inspection Checklist
2 pages
4 Lexical Analysis
No ratings yet
4 Lexical Analysis
60 pages
03LexicalAndSyntaxAnalysis 1
No ratings yet
03LexicalAndSyntaxAnalysis 1
25 pages
SSC Module2 LexicalAnalysis
No ratings yet
SSC Module2 LexicalAnalysis
26 pages
Hist SN T1 e ST
No ratings yet
Hist SN T1 e ST
58 pages
Lexical and Syntax Analysis
No ratings yet
Lexical and Syntax Analysis
63 pages
Chapter 3 Lexical Analysis
No ratings yet
Chapter 3 Lexical Analysis
5 pages
Chapter 2
No ratings yet
Chapter 2
77 pages
Chap 04
No ratings yet
Chap 04
15 pages
PL Özet (1,3,4)
No ratings yet
PL Özet (1,3,4)
8 pages
Lecture 7 (Slide)
No ratings yet
Lecture 7 (Slide)
14 pages
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
No ratings yet
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
9 pages
Lexical Analysis: Dr. Murali Krishna Enduri Department of CSE
No ratings yet
Lexical Analysis: Dr. Murali Krishna Enduri Department of CSE
88 pages
Ch02 Programming Language Syntax 4e 2
No ratings yet
Ch02 Programming Language Syntax 4e 2
64 pages
Chapter 3 - Lexical Analysis
No ratings yet
Chapter 3 - Lexical Analysis
52 pages
Compiler Course: Lexical Analysis
No ratings yet
Compiler Course: Lexical Analysis
50 pages
IS 4308 Product Manual
No ratings yet
IS 4308 Product Manual
7 pages
02 Lexical Analysis
No ratings yet
02 Lexical Analysis
86 pages
CS3304 9 LanguageSyntax 2 PDF
No ratings yet
CS3304 9 LanguageSyntax 2 PDF
39 pages
002 - ManualC - G - 47-50 - ING Rev.2 20.10.11
No ratings yet
002 - ManualC - G - 47-50 - ING Rev.2 20.10.11
13 pages
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part2
No ratings yet
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part2
62 pages
Unit 1 Notes1
No ratings yet
Unit 1 Notes1
30 pages
Compiler Rewind
No ratings yet
Compiler Rewind
52 pages
Chapter 3
No ratings yet
Chapter 3
43 pages
Compler
No ratings yet
Compler
35 pages
CD KCS502 Unit 1 B
No ratings yet
CD KCS502 Unit 1 B
12 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
33 pages
Table Morgan Sample Thesis
86% (7)
Table Morgan Sample Thesis
1 page
2019 February Iat 1 Te CMPN Sem Vi SPCC
No ratings yet
2019 February Iat 1 Te CMPN Sem Vi SPCC
12 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
Compiler Design Lexical Analysis
No ratings yet
Compiler Design Lexical Analysis
24 pages
04 Lexi Cal A Analysis
No ratings yet
04 Lexi Cal A Analysis
39 pages
Lexical and Syntax Analysis - Updated
No ratings yet
Lexical and Syntax Analysis - Updated
5 pages
Unit 1 (B)
No ratings yet
Unit 1 (B)
69 pages
NLP Lect Unit I
100% (1)
NLP Lect Unit I
140 pages
A Typical Lexical Analyzer Generator Nfa To Dfa DFA Analysis
No ratings yet
A Typical Lexical Analyzer Generator Nfa To Dfa DFA Analysis
64 pages
Chapter 2 - Lexical Analysis
No ratings yet
Chapter 2 - Lexical Analysis
56 pages
Chapter 2 Lexical Analysis (Scanning) Edited
No ratings yet
Chapter 2 Lexical Analysis (Scanning) Edited
46 pages
Lecture 3
No ratings yet
Lecture 3
22 pages
Operation Guide 3294: About This Manual
No ratings yet
Operation Guide 3294: About This Manual
3 pages
Comp Chap2
No ratings yet
Comp Chap2
36 pages
Lecture 4 Lexical Analysis
No ratings yet
Lecture 4 Lexical Analysis
23 pages
AMP Microproject Grp-12
No ratings yet
AMP Microproject Grp-12
16 pages
Compiler Designnotes
No ratings yet
Compiler Designnotes
18 pages
Yellowstripe Scad
No ratings yet
Yellowstripe Scad
7 pages
Unit-2 F&CD
No ratings yet
Unit-2 F&CD
31 pages
Cognitive Computing
No ratings yet
Cognitive Computing
5 pages
One Hundred Years of Solitude-The Story of Mankind Re-Visited
No ratings yet
One Hundred Years of Solitude-The Story of Mankind Re-Visited
5 pages
Lesson Plan in Science 6
100% (1)
Lesson Plan in Science 6
6 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
14 pages
AI Introduction
100% (1)
AI Introduction
3 pages
Pico Interactive Instruction Manual
No ratings yet
Pico Interactive Instruction Manual
200 pages
Danfoss Refrigeration Basics - ESSENTIAL
100% (1)
Danfoss Refrigeration Basics - ESSENTIAL
24 pages
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part1
No ratings yet
21CS51 ATCD MODULE 2 - 2 Lexical Analyser Part1
63 pages
CSM Notes Print
No ratings yet
CSM Notes Print
97 pages
Internet Case Study For Chapter 13: Aggregate Planning Cornwell Glass
No ratings yet
Internet Case Study For Chapter 13: Aggregate Planning Cornwell Glass
5 pages
Basfiber For Construction Market (US Customary Units) .
No ratings yet
Basfiber For Construction Market (US Customary Units) .
4 pages
CSM PPT Unit 1
No ratings yet
CSM PPT Unit 1
83 pages
Lexical Analysis
No ratings yet
Lexical Analysis
6 pages
Ex Inspections - A Journey For Maintenance Engineers: Shailesh Chauhan Shell Project &technology Stavanger Norway
No ratings yet
Ex Inspections - A Journey For Maintenance Engineers: Shailesh Chauhan Shell Project &technology Stavanger Norway
4 pages
Motherboard Labeling Designed by Fujitsu
No ratings yet
Motherboard Labeling Designed by Fujitsu
3 pages
Unit 1 Notes2
No ratings yet
Unit 1 Notes2
15 pages
Unit 1 Network Security
No ratings yet
Unit 1 Network Security
22 pages
Logical Programming Language
No ratings yet
Logical Programming Language
39 pages
The Recycling Folded Cascode A General Enhancement of The Folded Cascode Amplifier
No ratings yet
The Recycling Folded Cascode A General Enhancement of The Folded Cascode Amplifier
8 pages
3.-GE11 EntrepreneurialMind FINAL
100% (4)
3.-GE11 EntrepreneurialMind FINAL
15 pages
Cis & QB
No ratings yet
Cis & QB
14 pages
Syntax Semantics
No ratings yet
Syntax Semantics
6 pages
IMU (V) 2012 13 Detail Brochure
No ratings yet
IMU (V) 2012 13 Detail Brochure
6 pages
SYLL
No ratings yet
SYLL
2 pages
Description Manufacturer Reference Footprint Designation QNT Farnell Digikey RS
No ratings yet
Description Manufacturer Reference Footprint Designation QNT Farnell Digikey RS
2 pages
20it511 Ui & Ux Design
No ratings yet
20it511 Ui & Ux Design
40 pages
Scala Multi
No ratings yet
Scala Multi
48 pages
Non Core - Ganai
No ratings yet
Non Core - Ganai
2 pages
Logic Programming and PROLOG
No ratings yet
Logic Programming and PROLOG
18 pages
Context Free Grammar
No ratings yet
Context Free Grammar
5 pages
Case Study BARGAIN CITY
No ratings yet
Case Study BARGAIN CITY
1 page
Content Beyond The Syllabus
No ratings yet
Content Beyond The Syllabus
7 pages
Future of Ai
No ratings yet
Future of Ai
7 pages
E-Rickshaws + E-Carts
No ratings yet
E-Rickshaws + E-Carts
12 pages
Attitude Defines Our Altitude
No ratings yet
Attitude Defines Our Altitude
3 pages
1 Essay3
No ratings yet
1 Essay3
2 pages
Activities For AI - Activities
No ratings yet
Activities For AI - Activities
6 pages
AI - Possible 2 Marks With Answers
No ratings yet
AI - Possible 2 Marks With Answers
11 pages
AI - Content Beyond Syllabus
No ratings yet
AI - Content Beyond Syllabus
17 pages
Syllabus
No ratings yet
Syllabus
2 pages
AI - Possible 16 Marks
No ratings yet
AI - Possible 16 Marks
12 pages
Distribution of Public Keys
No ratings yet
Distribution of Public Keys
4 pages
Modeep
No ratings yet
Modeep
13 pages
SMCG PPT Unit-1
No ratings yet
SMCG PPT Unit-1
41 pages
22IT520 NETWORK SECURITY Syll
No ratings yet
22IT520 NETWORK SECURITY Syll
2 pages
ChuteDesignFormulas Paper43
No ratings yet
ChuteDesignFormulas Paper43
11 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)

CH 04

Uploaded by

CH 04

Uploaded by

Chapter 4

Lexical and Syntax

Copyright © 2009 Addison-Wesley. All rights reserved. 1-2

• Language implementation systems must

Copyright © 2009 Addison-Wesley. All rights reserved. 1-3

• The syntax analysis portion of a language

Copyright © 2009 Addison-Wesley. All rights reserved. 1-4

• Provides a clear and concise syntax

Copyright © 2009 Addison-Wesley. All rights reserved. 1-5

• Simplicity - less complex approaches can

Copyright © 2009 Addison-Wesley. All rights reserved. 1-6

• A lexical analyzer is a pattern matcher for

Copyright © 2009 Addison-Wesley. All rights reserved. 1-7

• The lexical analyzer is usually a function that is

Copyright © 2009 Addison-Wesley. All rights reserved. 1-8

– A naïve state diagram would have a transition

Copyright © 2009 Addison-Wesley. All rights reserved. 1-9

• In many cases, transitions can be

Copyright © 2009 Addison-Wesley. All rights reserved. 1-10

• Reserved words and identifiers can be

Copyright © 2009 Addison-Wesley. All rights reserved. 1-11

• Convenient utility subprograms:

Copyright © 2009 Addison-Wesley. All rights reserved. 1-12

Copyright © 2009 Addison-Wesley. All rights reserved. 1-13

- Following is the output of the lexical analyzer

Next token is: 25 Next lexeme is (

• Goals of the parser, given an input

Copyright © 2009 Addison-Wesley. All rights reserved. 1-15

• Two categories of parsers

Copyright © 2009 Addison-Wesley. All rights reserved. 1-17

Copyright © 2009 Addison-Wesley. All rights reserved. 1-18

• The Complexity of Parsing

Copyright © 2009 Addison-Wesley. All rights reserved. 1-19

• There is a subprogram for each

Copyright © 2009 Addison-Wesley. All rights reserved. 1-20

• A grammar for simple expressions:

<expr>  <term> {(+ | -) <term>}

Copyright © 2009 Addison-Wesley. All rights reserved. 1-21

• Assume we have a lexical analyzer named

Copyright © 2009 Addison-Wesley. All rights reserved. 1-22

/* Parse the first term */

while (nextToken == ADD_OP ||

Copyright © 2009 Addison-Wesley. All rights reserved. 1-23

• This particular routine does not detect errors

Copyright © 2009 Addison-Wesley. All rights reserved. 1-24

• A nonterminal that has more than one

Copyright © 2009 Addison-Wesley. All rights reserved. 1-25

Copyright © 2009 Addison-Wesley. All rights reserved. 1-26

/* Determine which RHS */

/* For the RHS id, just call lex */

else error(); /* Neither RHS matches */

Copyright © 2009 Addison-Wesley. All rights reserved. 1-27

Copyright © 2009 Addison-Wesley. All rights reserved. 1-28

• The LL Grammar Class

• The other characteristic of grammars that

Copyright © 2009 Addison-Wesley. All rights reserved. 1-30

• Pairwise Disjointness Test:

Copyright © 2009 Addison-Wesley. All rights reserved. 1-31

• Left factoring can resolve the problem

Copyright © 2009 Addison-Wesley. All rights reserved. 1-32

• The parsing problem is finding the correct

Copyright © 2009 Addison-Wesley. All rights reserved. 1-33

•Intuition about handles:

– Def:  is a phrase of the right sentential form

– Def:  is a simple phrase of the right sentential

Copyright © 2009 Addison-Wesley. All rights reserved. 1-34

• Intuition about handles (continued):

Copyright © 2009 Addison-Wesley. All rights reserved. 1-35

Copyright © 2009 Addison-Wesley. All rights reserved. 1-36

Copyright © 2009 Addison-Wesley. All rights reserved. 1-37

• LR parsers must be constructed with a

Copyright © 2009 Addison-Wesley. All rights reserved. 1-38

• An LR configuration stores the state of an

Copyright © 2009 Addison-Wesley. All rights reserved. 1-39

• LR parsers are table driven, where the

Copyright © 2009 Addison-Wesley. All rights reserved. 1-41