CSE309N

Chapter 2
A Simple One-Pass Compiler

The Entire Compilation Process
• Grammars for Syntax Definition
• Syntax-Directed Translation
• Parsing - Top Down & Predictive
• Pulling Together the Pieces
• The Lexical Analysis Process
• Symbol Table Considerations
• A Brief Look at Code Generation
• Concluding Remarks/Looking Ahead

Overview

A programming language can be defined by describing:

1. The syntax of the language
   a. What its programs look like
   b. We use a CFG or BNF (Backus-Naur Form)
2. The semantics of the language
   a. What its programs mean
   b. Difficult to describe
   c. Use informal descriptions and suggestive examples

Grammars for Syntax Definition
• A Context-free Grammar (CFG) Is Utilized to Describe the Syntactic Structure of a Language
• A CFG Is Characterized By:
  1. A Set of Tokens or Terminal Symbols
  2. A Set of Non-terminals
  3. A Set of Production Rules
     Each Rule Has the Form
        NT → {T, NT}*
  4. A Non-terminal Designated As the Start Symbol

Grammars for Syntax Definition
Example CFG

list → list + digit
list → list - digit
list → digit
digit → 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

(The "|" means OR, so we could have written
list → list + digit | list - digit | digit )
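
The four components of a CFG can be written down directly as data. The sketch below does this for the example grammar; the names TERMINALS, NONTERMINALS, PRODUCTIONS, START and is_well_formed are illustrative choices, not part of the slides.

```python
# Illustrative encoding of the example grammar (names are assumptions).
TERMINALS = set("+-0123456789")          # 1. tokens / terminal symbols
NONTERMINALS = {"list", "digit"}         # 2. non-terminals
PRODUCTIONS = [                          # 3. production rules as (head, body)
    ("list", ("list", "+", "digit")),
    ("list", ("list", "-", "digit")),
    ("list", ("digit",)),
] + [("digit", (str(d),)) for d in range(10)]
START = "list"                           # 4. start symbol


def is_well_formed():
    """Check that every rule has the form NT -> {T, NT}*."""
    symbols = TERMINALS | NONTERMINALS
    return (
        START in NONTERMINALS
        and all(head in NONTERMINALS for head, _ in PRODUCTIONS)
        and all(sym in symbols for _, body in PRODUCTIONS for sym in body)
    )


print(is_well_formed(), len(PRODUCTIONS))
```

Writing each alternative of a "|" rule as its own (head, body) pair mirrors the expanded form of the grammar shown first on the slide.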

Information
• A string of tokens is a sequence of zero or more tokens.
• The string containing zero tokens, written as ε, is called the empty string.
• A grammar derives strings by beginning with the start symbol and repeatedly replacing a non-terminal by the right side of a production for that non-terminal.
• The token strings that can be derived from the start symbol form the language defined by the grammar.
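
The derivation process described above can be sketched in a few lines. This is a minimal illustration, assuming the list/digit example grammar and always rewriting the leftmost non-terminal; GRAMMAR and derive are hypothetical names.

```python
# Example grammar: each non-terminal maps to its list of right sides.
GRAMMAR = {
    "list": [["list", "+", "digit"], ["list", "-", "digit"], ["digit"]],
    "digit": [[str(d)] for d in range(10)],
}


def derive(start, choices):
    """Start from the start symbol and, for each entry in `choices`,
    replace the leftmost non-terminal by the chosen right side.
    Returns every intermediate string of the derivation."""
    form = [start]
    steps = [" ".join(form)]
    for choice in choices:
        # index of the leftmost non-terminal in the current string
        i = next(k for k, sym in enumerate(form) if sym in GRAMMAR)
        form[i:i + 1] = GRAMMAR[form[i]][choice]
        steps.append(" ".join(form))
    return steps


# Production choices (by index) that derive 9 - 5 + 2.
for step in derive("list", [0, 1, 2, 9, 5, 2]):
    print(step)
```

The final string printed contains only tokens, so it belongs to the language defined by the grammar.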

Grammars are Used to Derive Strings:

Using the CFG defined on the earlier slide, we can derive the string 9 - 5 + 2 as follows:

list ⇒ list + digit              P1 : list → list + digit
     ⇒ list - digit + digit      P2 : list → list - digit
     ⇒ digit - digit + digit     P3 : list → digit
     ⇒ 9 - digit + digit         P4 : digit → 9
     ⇒ 9 - 5 + digit             P4 : digit → 5
     ⇒ 9 - 5 + 2                 P4 : digit → 2

Grammars are Used to Derive Strings:

This derivation could also be represented via a Parse Tree
(in each production, the parent is on the left and its children are on the right)

list ⇒ list + digit
     ⇒ list - digit + digit
     ⇒ digit - digit + digit
     ⇒ 9 - digit + digit
     ⇒ 9 - 5 + digit
     ⇒ 9 - 5 + 2

              list
            /  |  \
        list   +   digit
      /  |  \        |
  list   -   digit   2
    |          |
  digit        5
    |
    9

Defining a Parse Tree

• A parse tree pictorially shows how the start symbol of a grammar derives a string in the language.
• More Formally, a Parse Tree for a CFG Has the Following Properties:
  - Root Is Labeled With the Start Symbol
  - Leaf Node Is a Token or ε
  - Interior Node Is a Non-Terminal
  - If A → x1 x2 … xn, Then A Is an Interior Node; x1, x2, …, xn Are the Children of A (Left to Right) and May Be Non-Terminals or Tokens
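
As a rough illustration of these properties, the sketch below stores a parse tree as nested (label, children) pairs and checks it against the list/digit example grammar. The representation and the names leaf, is_parse_tree and frontier are assumptions made for this example, and ε leaves are not handled.

```python
GRAMMAR = {
    "list": [["list", "+", "digit"], ["list", "-", "digit"], ["digit"]],
    "digit": [[str(d)] for d in range(10)],
}


def leaf(token):
    return (token, [])


def _node_ok(node):
    label, children = node
    if not children:                       # leaf: must be a token
        return label not in GRAMMAR
    child_labels = [c[0] for c in children]
    # interior node: a non-terminal whose children spell out one production
    return (label in GRAMMAR
            and child_labels in GRAMMAR[label]
            and all(_node_ok(c) for c in children))


def is_parse_tree(tree, start="list"):
    """Root is the start symbol and every node satisfies the properties."""
    return tree[0] == start and _node_ok(tree)


def frontier(node):
    """Leaves read left to right: the string the tree derives."""
    label, children = node
    if not children:
        return [label]
    return [tok for c in children for tok in frontier(c)]


def digit(d):
    return ("digit", [leaf(d)])


# Parse tree for 9 - 5 + 2
tree = ("list", [
    ("list", [("list", [digit("9")]), leaf("-"), digit("5")]),
    leaf("+"),
    digit("2"),
])
print(is_parse_tree(tree), " ".join(frontier(tree)))
```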

Other Important Concepts
Ambiguity
Two derivations (Parse Trees) for the same token string.

            string                     string
          /   |   \                  /   |   \
     string   +   string        string   -   string
    /   |   \        |             |        /   |   \
string  -  string    2             9   string   +   string
   |          |                           |            |
   9          5                           5            2

Grammar:
string → string + string | string - string | 0 | 1 | … | 9

Why is this a Problem?
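
One way to see the problem: the two trees group the operands differently, so they give different results, (9 - 5) + 2 = 6 versus 9 - (5 + 2) = 2. The sketch below enumerates every parse tree the ambiguous grammar allows for a token list and evaluates each one; all_trees and value are hypothetical helper names.

```python
def all_trees(tokens):
    """Every parse tree for `tokens` under
    string -> string + string | string - string | digit,
    written as nested (left, op, right) tuples."""
    if len(tokens) == 1:
        return [tokens[0]]
    trees = []
    for i in range(1, len(tokens), 2):          # operators sit at odd positions
        for left in all_trees(tokens[:i]):
            for right in all_trees(tokens[i + 1:]):
                trees.append((left, tokens[i], right))
    return trees


def value(tree):
    """Evaluate a tree built by all_trees."""
    if isinstance(tree, str):
        return int(tree)
    left, op, right = tree
    return value(left) + value(right) if op == "+" else value(left) - value(right)


trees = all_trees(list("9-5+2"))
for t in trees:
    print(t, "=", value(t))
```

A translator driven by the parse tree would therefore produce different output depending on which tree the parser happens to build.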

Other Important Concepts
Associativity of Operators
Left vs. Right

              list                     right
            /  |  \                  /   |   \
        list   +   digit        letter   =   right
      /  |  \        |             |        /   |   \
  list   -   digit   2             a   letter   =   right
    |          |                          |            |
  digit        5                          b         letter
    |                                                  |
    9                                                  c

list → list + digit | list - digit | digit        right → letter = right | letter
digit → 0 | 1 | 2 | … | 9                         letter → a | b | c | … | z

Embedding Associativity
• The language of arithmetic expressions with + -
• (ambiguous) grammar that does not enforce associativity
    string → string + string | string - string | 0 | 1 | … | 9
• non-ambiguous grammar enforcing left associativity (parse tree will grow to the left)
    string → string + digit | string - digit | digit
    digit → 0 | 1 | 2 | … | 9
• non-ambiguous grammar enforcing right associativity (parse tree will grow to the right)
    string → digit + string | digit - string | digit
    digit → 0 | 1 | 2 | … | 9
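
To make the difference concrete, the sketch below evaluates the same token list under each of the two non-ambiguous grammars. It is only an illustration: eval_left and eval_right are made-up names, and the input is assumed to be a well-formed alternation of single digits and + / - operators.

```python
def eval_left(tokens):
    """string -> string + digit | string - digit | digit
    The tree grows to the left, so operators apply left to right."""
    result = int(tokens[0])
    for i in range(1, len(tokens), 2):
        d = int(tokens[i + 1])
        result = result + d if tokens[i] == "+" else result - d
    return result


def eval_right(tokens):
    """string -> digit + string | digit - string | digit
    The tree grows to the right, so the rest is evaluated first."""
    d = int(tokens[0])
    if len(tokens) == 1:
        return d
    rest = eval_right(tokens[2:])
    return d + rest if tokens[1] == "+" else d - rest


tokens = list("9-5+2")
print(eval_left(tokens))    # (9 - 5) + 2
print(eval_right(tokens))   # 9 - (5 + 2)
```

For 9-5+2 the left-associative reading gives 6 and the right-associative reading gives 2, which is why the grammar has to commit to one of them.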

Other Important Concepts
Operator Precedence
What does 9+5*2 mean?
Typically the precedence order is: ( ), then * /, then + -

This can be incorporated into a grammar via rules:
    expr → expr + term | expr - term | term
    term → term * factor | term / factor | factor
    factor → digit | ( expr )
    digit → 0 | 1 | 2 | 3 | … | 9

Precedence Achieved by:
    expr & term for each precedence level
    Rules for each are left recursive or associate to the left
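
The effect of the expr / term / factor layering can be seen in a small hand-written parser. This is a sketch under one stated assumption: the left-recursive rules are implemented as loops (a common way to code them by hand), which keeps the left-associative grouping. The function name parse and the nested-tuple tree format are illustrative.

```python
def parse(text):
    """Return the grouping of `text` as nested (left, op, right) tuples."""
    tokens = [c for c in text if not c.isspace()]
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def advance():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        return tok

    def factor():                      # factor -> digit | ( expr )
        tok = peek()
        if tok is not None and tok.isdigit():
            return advance()
        if tok == "(":
            advance()
            node = expr()
            if peek() != ")":
                raise SyntaxError("expected ')'")
            advance()
            return node
        raise SyntaxError(f"unexpected token {tok!r}")

    def term():                        # term -> term * factor | term / factor | factor
        node = factor()
        while peek() in ("*", "/"):
            op = advance()
            node = (node, op, factor())
        return node

    def expr():                        # expr -> expr + term | expr - term | term
        node = term()
        while peek() in ("+", "-"):
            op = advance()
            node = (node, op, term())
        return node

    tree = expr()
    if peek() is not None:
        raise SyntaxError(f"unexpected token {peek()!r}")
    return tree


print(parse("9+5*2"))
print(parse("(9+5)*2"))
```

Because term is one level below expr, 5*2 is grouped before the + is applied; parentheses route back up to expr through factor.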

Syntax for Statements

stmt → id := expr
     | if expr then stmt
     | if expr then stmt else stmt
     | while expr do stmt
     | begin opt_stmts end

Ambiguous Grammar?

Syntax-Directed Translation
• Associate Attributes With Grammar Rules and Translate as Parsing occurs
• The translation will follow the parse tree structure (and as a result the structure and form of the parse tree will affect the translation).
• First example: Inductive Translation.
• Infix to Postfix Notation Translation for Expressions
• Translation defined inductively as: Postfix(E) where E is an Expression.

Rules
1. If E is a variable or constant then Postfix(E) = E
2. If E is E1 op E2 then Postfix(E) = Postfix(E1 op E2) = Postfix(E1) Postfix(E2) op
3. If E is (E1) then Postfix(E) = Postfix(E1)

Examples

Postfix( ( 9 - 5 ) + 2 )
= Postfix( ( 9 - 5 ) ) Postfix( 2 ) +
= Postfix( 9 - 5 ) Postfix( 2 ) +
= Postfix( 9 ) Postfix( 5 ) - Postfix( 2 ) +
= 9 5 - 2 +

Postfix( 9 - ( 5 + 2 ) )
= Postfix( 9 ) Postfix( ( 5 + 2 ) ) -
= Postfix( 9 ) Postfix( 5 + 2 ) -
= Postfix( 9 ) Postfix( 5 ) Postfix( 2 ) + -
= 9 5 2 + -
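
The three inductive rules for Postfix(E) translate almost line for line into a recursive function. In this sketch the expression encoding is an assumption made for illustration: a string is a variable or constant, a 3-tuple (E1, op, E2) is a binary expression, and a 1-tuple (E1,) stands for a parenthesized expression.

```python
def postfix(e):
    """Postfix(E), defined by induction on the structure of E."""
    if isinstance(e, str):          # rule 1: variable or constant
        return e
    if len(e) == 1:                 # rule 3: E is (E1)
        return postfix(e[0])
    e1, op, e2 = e                  # rule 2: E is E1 op E2
    return f"{postfix(e1)} {postfix(e2)} {op}"


# ( 9 - 5 ) + 2
print(postfix(((("9", "-", "5"),), "+", "2")))
# 9 - ( 5 + 2 )
print(postfix(("9", "-", (("5", "+", "2"),))))
```

The two calls reproduce the worked examples: the parentheses disappear, and the position of each operator in the output records the grouping.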

Syntax-Directed Definition

• Each Production Has a Set of Semantic Rules
• Each Grammar Symbol Has a Set of Attributes
• For the Following Example, String Attribute "t" is Associated With Each Grammar Symbol

    expr → expr - term | expr + term | term
    term → 0 | 1 | 2 | 3 | … | 9

• Recall: What is a Derivation for 9 + 5 - 2?

    list ⇒ list - digit ⇒ list + digit - digit ⇒ digit + digit - digit
         ⇒ 9 + digit - digit ⇒ 9 + 5 - digit ⇒ 9 + 5 - 2

Syntax-Directed Definition (2)

• Each Production Rule of the CFG Has a Semantic Rule

    Production              Semantic Rule
    expr → expr1 + term     expr.t := expr1.t || term.t || '+'
    expr → expr1 - term     expr.t := expr1.t || term.t || '-'
    expr → term             expr.t := term.t
    term → 0                term.t := '0'
    term → 1                term.t := '1'
    …                       …
    term → 9                term.t := '9'

  (expr1 is the expr on the right side of the production; || is string concatenation)

• Note: Semantic Rules for expr define t as a "synthesized attribute", i.e., the various copies of t obtain their values from "children t's"
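
A synthesized attribute can be computed by visiting the children of a node before the node itself. The sketch below applies the semantic rules above to a hand-built parse tree for 9 - 5 + 2; the Node class, the annotate function and the term helper are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str                                   # 'expr', 'term', or a token
    children: list = field(default_factory=list)
    t: str = ""                                  # the synthesized attribute


def annotate(node):
    """Fill in node.t bottom-up: children first, then the node itself."""
    for child in node.children:
        annotate(child)
    kids = node.children
    if node.label == "term":                     # term -> d : term.t := 'd'
        node.t = kids[0].label
    elif node.label == "expr" and len(kids) == 1:
        node.t = kids[0].t                       # expr -> term
    elif node.label == "expr":                   # expr -> expr op term
        node.t = kids[0].t + kids[2].t + kids[1].label
    return node


def term(d):
    return Node("term", [Node(d)])


# Parse tree for 9 - 5 + 2
tree = Node("expr", [
    Node("expr", [Node("expr", [term("9")]), Node("-"), term("5")]),
    Node("+"),
    term("2"),
])
annotate(tree)
print(tree.t)
```

After annotate runs, every expr and term node carries its own t value, which is the information an annotated parse tree displays.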

Semantic Rules are Embedded in Parse Tree

                   expr.t = 95-2+
                 /        |        \
        expr.t = 95-      +      term.t = 2
       /      |      \                |
expr.t = 9    -    term.t = 5         2
     |                  |
term.t = 9              5
     |
     9

• Attribute evaluation starts at the root and recursively visits the children of each node in left-to-right order.
• The semantic rules at a given node are evaluated once all descendants of that node have been visited.
• A parse tree showing all the attribute values at each node is called an annotated parse tree.

Translation Schemes
Embedded Semantic Actions into the right sides of the productions.

A translation scheme is like a syntax-directed definition except the order of evaluation of the semantic rules is explicitly shown.

expr → expr + term {print('+')}
     | expr - term {print('-')}
     | term
term → 0 {print('0')}
term → 1 {print('1')}
…
term → 9 {print('9')}

Parse tree with embedded actions for 9 - 5 + 2:

expr
├─ expr
│  ├─ expr
│  │  └─ term
│  │     ├─ 9
│  │     └─ {print('9')}
│  ├─ -
│  ├─ term
│  │  ├─ 5
│  │  └─ {print('5')}
│  └─ {print('-')}
├─ +
├─ term
│  ├─ 2
│  └─ {print('2')}
└─ {print('+')}
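
The order in which the embedded actions fire can be simulated with a small translator. This is a sketch, not the slides' implementation: the left-recursive expr rule is coded as a loop, output is collected in a list instead of being printed immediately, and the input is assumed to contain only single digits and + / - with no spaces. The name translate is illustrative.

```python
def translate(text):
    """Run the translation scheme on `text` and return what the
    print actions would have emitted, in order."""
    out = []
    pos = 0

    def term():
        nonlocal pos
        ch = text[pos] if pos < len(text) else ""
        if not ch.isdigit():
            raise SyntaxError(f"digit expected at position {pos}")
        out.append(ch)               # term -> d {print('d')}
        pos += 1

    term()                           # expr -> term
    while pos < len(text) and text[pos] in "+-":
        op = text[pos]
        pos += 1
        term()
        out.append(op)               # expr -> expr op term {print(op)}
    if pos != len(text):
        raise SyntaxError(f"unexpected character at position {pos}")
    return "".join(out)


print(translate("9-5+2"))
```

Each action runs as soon as the symbols to its left in the production have been processed, so the postfix form is produced in a single left-to-right pass without building the tree.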