0% found this document useful (0 votes)

46 views39 pages

Ch2 Modified

1) The document describes building a simple compiler by defining a programming language syntax using context-free grammar, developing a predictive parser, and implementing syntax-directed translation to generate intermediate code. 2) A context-free grammar consists of tokens, nonterminals, productions, and a start symbol. Productions specify rewriting rules to derive strings from the grammar. 3) Derivations and parse trees represent the structure of strings according to the grammar. Derivations apply productions to replace nonterminals, and parse trees visually depict the structure.

Uploaded by

Hassnain Abbas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views39 pages

Ch2 Modified

Uploaded by

Hassnain Abbas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

1

A SIMPLE SYNTAX-
DIRECTED TRANSLATOR

Chapter 2
2

Building a Simple Compiler

• Building our compiler involves:
– Defining the syntax of a programming language
– Develop a source code parser: for our compiler
we will use predictive parsing
– Implementing syntax directed translation to
generate intermediate code
3

Syntax Definition
• Context-free grammar is a 4-tuple with
– A set of tokens (terminal symbols)
– A set of nonterminals
– A set of productions
– A designated start symbol
4

Example Grammar

Context-free grammar for simple expressions:

G = <{list,digit}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, list>

with productions P =

list  list + digit

list  list - digit

list  digit

digit  0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
5

Derivation
• Given a CF grammar we can determine the
set of all strings (sequences of tokens)
generated by the grammar using derivation
– We begin with the start symbol
– In each step, we replace one nonterminal in the
current sentential form with one of the right-
hand sides of a production for that nonterminal
6

Derivation for the Example

Grammar

list
 list + digit
 list - digit + digit
 digit - digit + digit
 9 - digit + digit
 9 - 5 + digit
9-5+2

This is an example leftmost derivation, because we replaced

the leftmost nonterminal (underlined) in each step.
7

Derivation for the Example

Rightmost Grammar
Likewise, a rightmost derivation replaces the rightmost
nonterminal in each step
list
P=  digit - list
list  digit + list  digit - digit + list
 digit - digit + digit
list  digit - list
 digit - digit + 2
list  digit  digit - 5 + 2
digit  0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 9-5+2
8

Parse Trees
• The root of the tree is labeled by the start symbol
• Each leaf of the tree is labeled by a terminal
(=token) or 
• Each interior node is labeled by a nonterminal
• If A  X1 X2 … Xn is a production, then node A has
immediate children X1, X2, …, Xn where Xi is a
(non)terminal or  ( denotes the empty string)
9

Parse Tree for the Example

Grammar
Parse tree of the string 9-5+2 using grammar G

list

list digit

digit
The sequence of
9 - 5 + 2 leafs is called the
yield of the parse tree
The Two Derivations for x – 2 * y
Rule Sentential Form Rule Sentential Form
— Expr — Expr
1 Expr Op Expr 1 Expr Op Expr
3 <id,x> Op Expr 3 Expr Op <id,y>
5 <id,x> – Expr 6 Expr * <id,y>
1 <id,x> – Expr Op Expr 1 Expr Op Expr * <id,y>
2 <id,x> – <num,2> Op Expr 2 Expr Op <num,2> * <id,y>
6 <id,x> – <num,2> * Expr 5 Expr – <num,2> * <id,y>
3 <id,x> – <num,2> * <id,y> 3 <id,x> – <num,2> * <id,y>

Leftmost derivation Rightmost derivation

In both cases, Expr * id – num * id

• The two derivations produce different parse trees
• The parse trees imply different evaluation orders!
Derivations and Parse Trees
G
Leftmost derivation
Rule Sentential Form
— Expr E
1 Expr Op Expr
3 <id,x> Op Expr
5 <id,x> – Expr E Op E
1 <id,x> – Expr Op Expr
2 <id,x> – <num,2> Op Expr
6 <id,x> – <num,2> * Expr x – Op E
E
3 <id,x> – <num,2> * <id,y>

This evaluates as x – ( 2 * y ) 2 y
*
Derivations and Parse Trees
G
Rightmost derivation
Rule Sentential Form
— Expr E
1 Expr Op Expr
3 Expr Op <id,y>
6 Expr * <id,y> E Op E
1 Expr Op Expr * <id,y>
2 Expr Op <num,2> * <id,y>
5 Expr – <num,2> * <id,y> E Op E * y
3 <id,x> – <num,2> * <id,y>

x – 2
This evaluates as ( x – 2 ) * y
13

Ambiguity

Consider the following context-free grammar:

G = <{string}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, string>

with production P =

string  string + string | string - string | 0 | 1 | … | 9

This grammar is ambiguous, because more than one parse tree
represents the string 9-5+2
14

Ambiguity (cont’d)

string string

string string string

string string string string string

9 - 5 + 2 9 - 5 + 2
15

Associativity of Operators
Left-associative operators have left-recursive productions
left  left + term | term
String a+b+c has the same meaning as (a+b)+c
Right-associative operators have right-recursive productions
right  term = right | term
String a=b=c has the same meaning as a=(b=c)
Operators on the same line have the same associativity and
precedence:
left-associative: + -

left-associative: */
16

Syntax of Statements

Syntax-Directed Translation
• Uses a CF grammar to specify the syntactic
structure of the language
• AND associates a set of attributes with the
terminals and nonterminals of the grammar
• AND associates with each production a set of
semantic rules to compute values of attributes
• A parse tree is traversed and semantic rules
applied: after the tree traversal(s) are completed,
the attribute values on the nonterminals contain
the translated form of the input
19

Synthesized and Inherited

Attributes
• An attribute is said to be …
– synthesized if its value at a parse-tree node is
determined from the attribute values at the children of
the node
– Suppose a node N in a parse tree is labeled by the
grammar symbol X . We write X.a to denote the value
of attribute a of X at that node.
– inherited if its value at a parse-tree node is determined
by the parent (by enforcing the parent’s semantic rules)
20

Example Attribute Grammar

Syntax-directed definition for infix to postfix translation

String concat operator

Production Semantic Rule
expr  expr1 + term expr.t := expr1.t // term.t // “+”
expr  expr1 - term expr.t := expr1.t // term.t // “-”
expr  term expr.t := term.t
term  0 term.t := “0”
term  1 term.t := “1”
… …
term  9 term.t := “9”
21

Example Annotated Parse Tree

expr.t = “95-2+”

expr.t = “95-” term.t = “2”

expr.t = “9” term.t = “5”

term.t = “9”

9 - 5 + 2
22

Depth-First Traversals
procedure visit(n : node);
begin
for each child m of n, from left to right do
visit(m);
evaluate semantic rules at node n
end
23

Depth-First Traversals (Example)

expr.t = “95-2+”

expr.t = “95-” term.t = “2”

expr.t = “9” term.t = “5”

term.t = “9”

9 - 5 + 2 Note: all attributes are

of the synthesized type
24

Translation Schemes
• A translation scheme is a CF grammar embedded
with semantic actions
• When drawing a parse tree for a translation scheme,
we indicate an action by constructing an extra child
for it, connected by a dashed line to the node that
corresponds to the head of the production.

rest  + term { print(“+”) } rest

rest
Embedded
semantic action
+ term { print(“+”) } rest
25

Example Translation Scheme

expr  expr + term { print(“+”) }

expr  expr - term { print(“-”) }
expr  term
term  0 { print(“0”) }
term  1 { print(“1”) }
… …
term  9 { print(“9”) }
26

Example Translation Scheme

(cont’d)

expr
{ print(“+”) }
expr + term
{ print(“2”) }
{ print(“-”) }
- term 2
expr
{ print(“5”) }
term 5
{ print(“9”) }
9
Translates 9-5+2 into postfix 95-2+
27

Parsing
• Parsing = process of determining if a string of
tokens can be generated by a grammar
• For any CF grammar there is a parser that takes at
most O(n3) time to parse a string of n tokens
• Top-down parsing “constructs” a parse tree from
root to leaves
• Bottom-up parsing “constructs” a parse tree from
leaves to root
28

Predictive Parsing
• Recursive descent parsing is a top-down parsing
method
– Each nonterminal has one (recursive) procedure that is
responsible for parsing the nonterminal’s syntactic
category of input tokens
– When a nonterminal has multiple productions, each
production is implemented in a branch of a selection
statement based on input look-ahead information
• Predictive parsing is a special form of recursive
descent parsing where we use one lookahead
token to unambiguously determine the parse
operations
29

Example Predictive Parser

(Grammar)

type  simple
| ^ id
| array [ simple ] of type
simple  integer
| char
| num dotdot num
30

Example Predictive Parser

(Execution Step 1)

Check lookahead
type()
and call match

match(‘array’)

Input: array [ num dotdot num ] of integer

lookahead
31

Example Predictive Parser

(Execution Step 2)
type()

match(‘array’) match(‘[’)

Input: array [ num dotdot num ] of integer

lookahead
32

Example Predictive Parser

(Execution Step 3)
type()

match(‘array’) match(‘[’) simple()

match(‘num’)

Input: array [ num dotdot num ] of integer

lookahead
33

Example Predictive Parser

(Execution Step 4)
type()

match(‘array’) match(‘[’) simple()

match(‘num’) match(‘dotdot’)

Input: array [ num dotdot num ] of integer

lookahead
34

Example Predictive Parser

(Execution Step 5)
type()

match(‘array’) match(‘[’) simple()

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

lookahead
35

Example Predictive Parser

(Execution Step 6)
type()

match(‘array’) match(‘[’) simple() match(‘]’)

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

lookahead
36

Example Predictive Parser

(Execution Step 7)
type()

match(‘array’) match(‘[’) simple() match(‘]’) match(‘of’)

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

lookahead
37

Example Predictive Parser

(Execution Step 8)
type()

match(‘array’) match(‘[’) simple() match(‘]’) match(‘of’) type()

match(‘num’) match(‘dotdot’) match(‘num’) simple()

match(‘integer’)
Input: array [ num dotdot num ] of integer

lookahead
38

Adding a Lexical Analyzer

• Typical tasks of the lexical analyzer:
– Remove white space and comments
– Encode constants as tokens
– Recognize keywords
– Recognize identifiers and store identifier names
in a global symbol table
39

The Lexical Analyzer “lexer”

Lexical analyzer
y := 31 + 28*x
lexan()

<id, “y”> <assign, > <num, 31> <‘+’, > <num, 28> <‘*’, > <id, “x”>

token
(lookahead)
tokenval Parser
(token attribute) parse()

CS401-Midterm Solved Mcqs With References by Moaaz
75% (4)
CS401-Midterm Solved Mcqs With References by Moaaz
29 pages
Chapter 3 - Syntax Analysis Part One
No ratings yet
Chapter 3 - Syntax Analysis Part One
17 pages
Compiler Design Chapter-3
0% (1)
Compiler Design Chapter-3
177 pages
Compiler 2
100% (1)
Compiler 2
45 pages
Vitrea Installation and Setup Guide PDF
No ratings yet
Vitrea Installation and Setup Guide PDF
38 pages
A Simple One - Pass Compiler
No ratings yet
A Simple One - Pass Compiler
62 pages
Parsing - 1
No ratings yet
Parsing - 1
59 pages
Simple One Pass Compiler
No ratings yet
Simple One Pass Compiler
62 pages
CD 2,3 Unit's Material
100% (1)
CD 2,3 Unit's Material
170 pages
CW GAT - Preparation and Tips
No ratings yet
CW GAT - Preparation and Tips
4 pages
CH03
No ratings yet
CH03
57 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
BCA Syllabus Sambalpur University
0% (1)
BCA Syllabus Sambalpur University
8 pages
Software Requirements
100% (1)
Software Requirements
66 pages
Design and Implementation of Computerized Property Valuation System
100% (1)
Design and Implementation of Computerized Property Valuation System
25 pages
Module1 1
No ratings yet
Module1 1
20 pages
CH2 1
No ratings yet
CH2 1
27 pages
Lecture 5
No ratings yet
Lecture 5
28 pages
Dos Notes
No ratings yet
Dos Notes
12 pages
Entrepreneurship Process
No ratings yet
Entrepreneurship Process
22 pages
2024 CD-Ch03 Syntaxx Analysis
No ratings yet
2024 CD-Ch03 Syntaxx Analysis
28 pages
Chapter-02 (Part-II) PDF
No ratings yet
Chapter-02 (Part-II) PDF
23 pages
Chapter 3 Syntax Analysis Full Reading Material
No ratings yet
Chapter 3 Syntax Analysis Full Reading Material
76 pages
CH2 2
No ratings yet
CH2 2
30 pages
CC-Lec 5 Week 5 Cfgs
No ratings yet
CC-Lec 5 Week 5 Cfgs
29 pages
Chapter 2 - Simple Syntax Directed Translator
No ratings yet
Chapter 2 - Simple Syntax Directed Translator
39 pages
CS 4300: Compiler Theory A Simple Syntax-Directed Translator
No ratings yet
CS 4300: Compiler Theory A Simple Syntax-Directed Translator
70 pages
8 Notes
No ratings yet
8 Notes
12 pages
Chapter 3 - Syntax Analysis
No ratings yet
Chapter 3 - Syntax Analysis
16 pages
Lecture 1 Introduction DR Raheel 19022024 032426pm
No ratings yet
Lecture 1 Introduction DR Raheel 19022024 032426pm
32 pages
Chapter 3
No ratings yet
Chapter 3
77 pages
Compiler Construction Week 04 Syntax Analysis I)
No ratings yet
Compiler Construction Week 04 Syntax Analysis I)
41 pages
(Week 3) Syntax Analysis (Derivation)
No ratings yet
(Week 3) Syntax Analysis (Derivation)
46 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
CSC 409 Note 2
No ratings yet
CSC 409 Note 2
12 pages
Compiler Design - Syntax Analysis
No ratings yet
Compiler Design - Syntax Analysis
14 pages
2-Role of Parser and Parse Tree-02!08!2024
No ratings yet
2-Role of Parser and Parse Tree-02!08!2024
69 pages
Lec4 SyntaxAnalysis
No ratings yet
Lec4 SyntaxAnalysis
41 pages
CH2-1 To CH2-3
No ratings yet
CH2-1 To CH2-3
79 pages
Compiler 2
No ratings yet
Compiler 2
45 pages
4 Parsing
No ratings yet
4 Parsing
32 pages
Labview Books
No ratings yet
Labview Books
3 pages
Ex 6
50% (2)
Ex 6
2 pages
Compiler Design Lec-Three Syntax Analysis
No ratings yet
Compiler Design Lec-Three Syntax Analysis
60 pages
Chapter-3 So Far
No ratings yet
Chapter-3 So Far
50 pages
BCS 324 Compiler Design Notes - Unit2
No ratings yet
BCS 324 Compiler Design Notes - Unit2
37 pages
(Week 4) Syntax Analysis (CFG)
No ratings yet
(Week 4) Syntax Analysis (CFG)
50 pages
Chapter 3
No ratings yet
Chapter 3
41 pages
Chapter 2 (Part 1)
No ratings yet
Chapter 2 (Part 1)
32 pages
G52Cmp Compilers: Syntax Analysis
No ratings yet
G52Cmp Compilers: Syntax Analysis
36 pages
Compiler 3
No ratings yet
Compiler 3
11 pages
Compiler Theory: (A Simple Syntax-Directed Translator)
No ratings yet
Compiler Theory: (A Simple Syntax-Directed Translator)
50 pages
CD Chapter-3
No ratings yet
CD Chapter-3
105 pages
Chapter - Three
No ratings yet
Chapter - Three
139 pages
Compiler Design 3
No ratings yet
Compiler Design 3
140 pages
2014-CD Ch-03 SAn
No ratings yet
2014-CD Ch-03 SAn
21 pages
SAS® 9.3 SQL Query WindowUser's GuideSAS®
No ratings yet
SAS® 9.3 SQL Query WindowUser's GuideSAS®
104 pages
Chapter - Three: Syntax Analysis
No ratings yet
Chapter - Three: Syntax Analysis
100 pages
Chapter 2
No ratings yet
Chapter 2
47 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
CD Unit 3
No ratings yet
CD Unit 3
76 pages
Syntax Analysis
No ratings yet
Syntax Analysis
90 pages
A Simple One-Pass Compiler (To Generate Code For The JVM)
No ratings yet
A Simple One-Pass Compiler (To Generate Code For The JVM)
70 pages
Lecture2 PDF
No ratings yet
Lecture2 PDF
45 pages
Chapter 3
No ratings yet
Chapter 3
180 pages
Chapter-3-Syntax Analysis
No ratings yet
Chapter-3-Syntax Analysis
126 pages
Syntax Analysis: EECS 483 - Lecture 4 University of Michigan Monday, September 17, 2006
No ratings yet
Syntax Analysis: EECS 483 - Lecture 4 University of Michigan Monday, September 17, 2006
28 pages
Resource-Allocation Graph
No ratings yet
Resource-Allocation Graph
15 pages
Topic #4: Syntactic Analysis (Parsing) : INF 524 Compiler Construction Spring 2011
No ratings yet
Topic #4: Syntactic Analysis (Parsing) : INF 524 Compiler Construction Spring 2011
44 pages
HCI in The Software Process
No ratings yet
HCI in The Software Process
15 pages
UHF+RFID+Reader+UHFReader18+User's+Manual+V2 0
No ratings yet
UHF+RFID+Reader+UHFReader18+User's+Manual+V2 0
39 pages
Selenium and Grinder: A Powerful Duo For Load Testing: Eric Pugh Principle, Opensource Connections
No ratings yet
Selenium and Grinder: A Powerful Duo For Load Testing: Eric Pugh Principle, Opensource Connections
37 pages
SPARC T2 Systems Installation and Maintenance
No ratings yet
SPARC T2 Systems Installation and Maintenance
7 pages
What's New in IBM BPM 8.6 EXTERNAL (Paul Pacholski) PDF
No ratings yet
What's New in IBM BPM 8.6 EXTERNAL (Paul Pacholski) PDF
91 pages
Quality Assurance (Qa) : Manual Testing Process
No ratings yet
Quality Assurance (Qa) : Manual Testing Process
6 pages
ESO211 (Data Structures and Algorithms) Lectures 1 To 3: 1 Random Access Machine
No ratings yet
ESO211 (Data Structures and Algorithms) Lectures 1 To 3: 1 Random Access Machine
5 pages
Queuing Theory by Kumkum Sultana
No ratings yet
Queuing Theory by Kumkum Sultana
5 pages
Android Operating System
No ratings yet
Android Operating System
23 pages
Motherboard
No ratings yet
Motherboard
2 pages
Cyber - Unit 31
No ratings yet
Cyber - Unit 31
110 pages
What Are The Difference Between DDL, DML and DCL Commands - Oracle FAQ
No ratings yet
What Are The Difference Between DDL, DML and DCL Commands - Oracle FAQ
4 pages
Computer Science Foundation Exam: Solutions
No ratings yet
Computer Science Foundation Exam: Solutions
5 pages
X137daE560 IEC61850 Host R1
No ratings yet
X137daE560 IEC61850 Host R1
75 pages
SmileBasic Manual
No ratings yet
SmileBasic Manual
52 pages
4 1 e A Softwaremodelingintroductionvideo
No ratings yet
4 1 e A Softwaremodelingintroductionvideo
7 pages
Senthil Kumar Muthuvel: Highcharts Api
No ratings yet
Senthil Kumar Muthuvel: Highcharts Api
2 pages
CertDumps 70-564
No ratings yet
CertDumps 70-564
5 pages
Stack Implementation With Its Application
No ratings yet
Stack Implementation With Its Application
7 pages
How To Install Antenna Magus
No ratings yet
How To Install Antenna Magus
1 page
IST MLT Configuration
No ratings yet
IST MLT Configuration
58 pages
From Bahrain With Love: FinFisher's Spy Kit Exposed
No ratings yet
From Bahrain With Love: FinFisher's Spy Kit Exposed
117 pages
Pacman Project
No ratings yet
Pacman Project
13 pages
Human Computer Interaction
No ratings yet
Human Computer Interaction
15 pages
Ch4b Modified
No ratings yet
Ch4b Modified
64 pages
Run-Time Environments: COP5621 Compiler Construction
No ratings yet
Run-Time Environments: COP5621 Compiler Construction
21 pages
Design Rules
No ratings yet
Design Rules
23 pages
HCI in The Software Process
No ratings yet
HCI in The Software Process
18 pages
Approaches of Machine Intelligence
No ratings yet
Approaches of Machine Intelligence
11 pages
"Software Engineering" Assignment 4 "Use Case Descriptions and Diagram"
No ratings yet
"Software Engineering" Assignment 4 "Use Case Descriptions and Diagram"
10 pages
SE Class Diagram
No ratings yet
SE Class Diagram
3 pages
Process Model
No ratings yet
Process Model
2 pages
Government College University, Lahore: Operating Systems Lab. Semester: 6th Session: 2015-19
No ratings yet
Government College University, Lahore: Operating Systems Lab. Semester: 6th Session: 2015-19
1 page
Challenging Prime Number Problems
From Everand
Challenging Prime Number Problems
Gerald Patterson
No ratings yet
The Genetic Code of All Languages,(Part 2.1; Numerals)
From Everand
The Genetic Code of All Languages,(Part 2.1; Numerals)
Moni Kanchan Panda
No ratings yet

Ch2 Modified

Uploaded by

Ch2 Modified

Uploaded by

1

Building a Simple Compiler

Context-free grammar for simple expressions:

G = <{list,digit}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, list>

list  list + digit

list  list - digit

Derivation for the Example

This is an example leftmost derivation, because we replaced

Derivation for the Example

Parse Tree for the Example

Leftmost derivation Rightmost derivation

In both cases, Expr * id – num * id

Consider the following context-free grammar:

G = <{string}, {+,-,0,1,2,3,4,5,6,7,8,9}, P, string>

string  string + string | string - string | 0 | 1 | … | 9

string string string

string string string string string

Synthesized and Inherited

Example Attribute Grammar

String concat operator

Example Annotated Parse Tree

expr.t = “95-” term.t = “2”

expr.t = “9” term.t = “5”

Depth-First Traversals (Example)

expr.t = “95-” term.t = “2”

expr.t = “9” term.t = “5”

9 - 5 + 2 Note: all attributes are

rest  + term { print(“+”) } rest

Example Translation Scheme

expr  expr + term { print(“+”) }

Example Translation Scheme

Example Predictive Parser

Example Predictive Parser

Input: array [ num dotdot num ] of integer

Example Predictive Parser

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple()

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple()

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple()

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple() match(‘]’)

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple() match(‘]’) match(‘of’)

match(‘num’) match(‘dotdot’) match(‘num’)

Input: array [ num dotdot num ] of integer

Example Predictive Parser

match(‘array’) match(‘[’) simple() match(‘]’) match(‘of’) type()

match(‘num’) match(‘dotdot’) match(‘num’) simple()

Adding a Lexical Analyzer

The Lexical Analyzer “lexer”

You might also like