0% found this document useful (0 votes)

12 views23 pages

Unit-4-2

The document discusses intermediate code generation in compilers. It describes different intermediate representations like syntax trees, postfix notation, and three-address code. It also provides examples of different types of three-address statements and how a syntax-directed definition can be used to generate three-address code from a source program.

Uploaded by

Jefferson Aaron

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views23 pages

Unit-4-2

Uploaded by

Jefferson Aaron

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

18CSC304J- COMPILER DESIGN

UNIT-4

SRMIST, Vadapalani Campus

UNIT-IV SYLLABUS
1. Intermediate Code Generation 10. Code Generation
2. Intermediate Languages - prefix - postfix11. Issues in the design of code generator
3. Quadruple - triple - indirect triples 12. The target machine – Runtime Storage
Representation management
4. Syntax tree- Evaluation of 10. A simple Code generator
expression-three-address code 11. Code Generation Algorithm
5. Synthesized attributes – Inherited 12. Register and Address Descriptors
attributes 13. Generating Code of Assignment Statements
6. Intermediate languages – Declarations 14. Cross Compiler – T diagrams
7. Assignment Statements 15. Issues in Cross compilers
8. Boolean Expressions, Case Statements
9. Back patching – Procedure calls

2
SRMIST, Vadapalani Campus
Intermediate Code Generation
● In the analysis-synthesis model of a compiler, the front end translates a source
program into an intermediate representation from which the back end generates
target code.
● A source program can be translated directly into the target language, some
beneﬁts of using a machine-independent Intermediate form are:
1. Retargeting is facilitated; a complier for a diﬀerent machine can be created
by attaching a back end for the new machine to an existing front end.
2. A machine-independent code optimizer can be applied to the intermediate
representation.

3
● It can be used to translate into an intermediate code programming language
constructs such as declarations, assignments, and ﬂow-of-control
statements.
● Assume that the source program has already been parsed and statically
checked
● Most of the SDD can be implemented during either bottom-up or top-down
parsing

4
Intermediate Languages
● Syntax trees and postﬁx notation, are two kinds of intermediate
representations.
● A third, called three-address code, will be used here.
● The semantic rules for generating three-address code from common
programming language constructs are similar to those for constructing
syntax trees or for generating postﬁx notation.

CS416 Compiler Design

5
Graphical Representations

● A syntax tree depicts the

natural hierarchical
structure of a source
program.
● A dag gives the same
information but in a more
compact way because
common subexpressions
are identiﬁed.
● A syntax tree and dag for
the assignment statement
a := b* -c + b* -c
CS416 Compiler Design
6
Postﬁx notation

● It is a linearized representation of s syntax tree; it is a list of the nodes of the tree

in which a node appears immediately after its children.
● The postfix notation for the syntax tree in previous slide is
a b c uminus * b c uminus * + assign
● The edges in a syntax tree do not appear explicitly in postfix notation
● They can be recovered from the order in which the nodes appear and the number
of operands that the operator at a node expects.
● The recovery of edges is similar to the evaluation using a stack, of an expression in
postfix notation.
● Syntax trees for assignment statements are produced by the SDD
● It is an extension of SDD.
● Nonterminal S generates an assignment statement.
● The two binary operators + and * are examples of the full operator set in a typical
7
language.
● Operator associativities and precedence's are the usual ones; even though they
have not been put into the grammar.
● This definition constructs the SDD from the input a := b* - c + b* -c.
● This same SDD will produce the dag representation if the functions mkunode (op,
child) and mknodr(op, left, right) return a pointer to an existing node
● The token id has an attribute place that points to the symbol-table entry for the
identifier.

8
● Two representations of the
syntax tree
● Each node is represented as
a record with a ﬁeld for its
operator and additional
ﬁelds for pointers to its
children.
● In Fig (b), nodes are
allocated from an array of
records and the index or
position of the node serves
as the pointer to the node.
● All the nodes in the syntax
tree can be visited by
following pointers, starting
from the root at position 10.
9
Three-Address Code

● Three address code is a sequence of statements of the general form

● where x, y, and z are names, constants, or compiler-generated temporaries;
● op stands for any operator, such as a ﬁxed- or ﬂoating-point arithmetic operator,
or a logical operator on boolean-valued data.
● A source language expression like x + y * z might be translated into a sequence

● where tl and t2 are compiler-generated temporary names.

● The use of names for the intermediate values computed by a program allows
three-address code to be easily rearranged - unlike postﬁx notation.

10
• Three-address code is a linearized representation of a syntax tree or a dag in which explicit names
correspond to the interior nodes of the graph.
• The syntax tree and dag are represented by the three-address code sequences as given below
• Variable names can appear directly in three-address statements, and has no statements
corresponding to the leaves

The reason for the term "three-address code" is that each statement usually contains three
addresses, two for the operands and one for the result.
11
Types of Three Address Statements

● Three-address statements are similar to assembly code.

● Statements can have symbolic labels and there are statements for flow of control.
● A symbolic label represents the index of a three-address statement in the array
holding intermediate code.
● Actual indices can- be substituted for the labels either by making a separate pass,
or by using "backpatching“
Some of the common three-address statements used are:
1. Assignment statements of the form x := y op Z, where op is a binary arithmetic or
logical operation.
2. Assignment instructions of the form x : = op y, where op is a unary operation.
Essential unary operations include unary minus, logical negation, shift operators,
and conversion operators that,
Eg: convert a fixed-point number to a floating-point number
12
3. Copy statement of the form x : = y where the value of y is assigned to x.
4. The unconditional jump goto L. The three-address statement with label L is the
next to be executed
5. Conditional jumps such as if x relop y goto L, This instruction applies a relational
operator (<,=,>=,etc.,) to x and y and executes the statement with label L next if x
stands in relation relop to y
6. param x and call p, n for procedure calls and return y, where y representing a
returned value is optional. The sequence of three-address statements generated as
part of a call of the procedure p( xl , x2, . . . , xn )
The integer n indicating the number of actual-parameters in
''call p , n" is not redundant because calls can be nested.

13
7. Indexed assignments of the form x:=y[i] and x[i] :=y.
The ﬁrst statement x:=y[i] 🡪sets x to the value in the location i memory units beyond
location y.
The second statement x[i] :=y 🡪sets the contents of the location i units beyond x to the
value of y.
Where instructions, x, y, and i refer to data objects.
8. Address and pointer assignments of the form x := &y, x := *y and *x := y
Statement x := &y 🡪sets the value of x to be the location of y.
Here y is a name, a temporary, that denotes an expression and x is a pointer name or
temporary.
Statement x : = *y🡪 sets y is a pointer or a temporary whose r-value is a location.
Statement *x := y 🡪sets the r-value of the object pointed to by x to the r-value of y.

14
SDD into Three-Address Code

• When three-address code is generated, temporary names are made up for the interior nodes of
a syntax tree.
• The value of nonterminal E on the left side of E🡪El +E, will be computed into a new temporary t,
• The three address code for id : = E consists of code to evaluate E into some temporary t,
followed by the assignment id.place : = t.

The S-attributed definition generates three-address code for assignment statements.

Given input a := b * - c + b * - c, it produces the code

15
The synthesized attribute
S.code represents the three
address code for the
assignment S.
The nonterminal E has two
attributes:
1. E.place the name that will
hold the value of E,
2. E.code the sequence of
three-address statements
evaluating E.
The function newtemp returns
a sequence of distinct names
t1,t2,…,tn in response to
successive calls.
For convenience, the notation gen(x ':=' y '+' z) is used
to represent the three-address statement x : = y + z.

16
Flow-of-control statements can be added to the language of assignments by productions and
semantic rules.
The code for S->while E do S1 is generated using new attributes S.begin and S,after to mark
the first statement in the code for E and the statement following the code S

17
● These attributes represent labels created by a function newlabel that returns a
new label every time it is called.
● Note that S.after becomes the label of the statement that comes after the code for
the while statement.
● Assume that a non-zero expression represents true; i.e. when the value of E
becomes zero, control leaves the while statement
● Expressions that govern the ﬂow of control may in general be boolean expressions
containing relational and logical operators

● Postﬁx notation can be obtained by adapting the semantic rules .

● The postfix notation for an identifier is the identifier itself,
● The rules for the other productions concatenate only the operator after the code
for the operands.
● Eg: Associated with the production E🡪-E 1 is the semantic rule

18
Implementation of Three-Address Statements
● A three address statement is an abstract form of intermediate code.
● In a compiler, these statements can be implemented as records with fields for the
operator and the operands.
● There are 3 such representations: quadruples, triples, and indirect triples.
Quadruples
● A quadruple is a record structure with four fields, op, arg1, arg2, and result
● The op field contains an internal code for the operator
● The three-address statement x : = y op z is represented by placing y in arg1, z in
arg2, and x in result.
● Statements with unary operators like x : = -y or x : = y do not use arg2.
● Operators like param use neither arg2 nor result.
● Conditional and unconditional jumps put the target label in result
● The quadruples for the assignment a : = b * - c + b * - c is given by (next slide)
● The contents of fields arg1, arg2, and result are normally pointers to the
19
symbol-table entries for the names represented by these fields
Triples
● To avoid entering temporary names into the symbol table; We refer it as temporary
value by the position of the statement that computes it.
● Here three-address statements can be represented by records with only three
fields: op, arg1 and arg2
● The fields arg1 and arg2, for the arguments of op, are either pointers to the symbol
table or pointers into the triple structure (for temporary values).
● Since three fields are used, this intermediate code format is known as triples
Parenthesized numbers
represent pointers into the triple
structure, while symbol-table
pointers are represented by the
names themselves.

The copy statement a : = t5 is

encoded in the triple
representation by placing a in
the arg1 field and using the
operator assign. 20
● A ternary operation like x[i] : = y requires two entries in the triple structure,
while x := y[i] is naturally represented us two operations

Indirect triples
● Another implementation of three-address
code is listing pointers to triples, rather than
listing the triples themselves.
● This implementation is naturally called
indirect triples
● Eg: An array statement to list pointers to
triples in the desired order.
21
22
23

Course Syllabus
No ratings yet
Course Syllabus
4 pages
Python Notes
No ratings yet
Python Notes
373 pages
Data Structures and Problem Solving Using Java 4th, Intern. Edition Weiss PDF Download
No ratings yet
Data Structures and Problem Solving Using Java 4th, Intern. Edition Weiss PDF Download
38 pages
ch04 Notes
No ratings yet
ch04 Notes
13 pages
Assignment 3
No ratings yet
Assignment 3
8 pages
21cs15it Problem Solving and Python Revision Questions and Answers
No ratings yet
21cs15it Problem Solving and Python Revision Questions and Answers
33 pages
Pattara Python 01204111
No ratings yet
Pattara Python 01204111
233 pages
Build A Technical Documentation Page
No ratings yet
Build A Technical Documentation Page
10 pages
Wrong Email ID
No ratings yet
Wrong Email ID
10 pages
CHAPTER 11 - Introduction To Programming Languages
No ratings yet
CHAPTER 11 - Introduction To Programming Languages
17 pages
Exam 2023s1 Main Solutions
No ratings yet
Exam 2023s1 Main Solutions
18 pages
Behavioural Modelling Verilog HDL
No ratings yet
Behavioural Modelling Verilog HDL
36 pages
Unit-4-4
No ratings yet
Unit-4-4
59 pages
Cse3077 CD m3
No ratings yet
Cse3077 CD m3
74 pages
T314-11 Structured Text - RevC
No ratings yet
T314-11 Structured Text - RevC
20 pages
Revision Tour of Python Class XII
No ratings yet
Revision Tour of Python Class XII
19 pages
17-Three Address Code
No ratings yet
17-Three Address Code
27 pages
ITE 186 Day 3
No ratings yet
ITE 186 Day 3
10 pages
Chapter 6 Code Generation and Optimization
No ratings yet
Chapter 6 Code Generation and Optimization
34 pages
BCS 324 Topic 5
No ratings yet
BCS 324 Topic 5
35 pages
TSR - Class Cd-Unit 3
No ratings yet
TSR - Class Cd-Unit 3
111 pages
18CSC305J AI Unit-5
No ratings yet
18CSC305J AI Unit-5
138 pages
Unit-Iii
No ratings yet
Unit-Iii
19 pages
Big Data - Iv Bda
No ratings yet
Big Data - Iv Bda
143 pages
Lecture Notes Compiler Design Chapter-6
No ratings yet
Lecture Notes Compiler Design Chapter-6
55 pages
Unit 1 Java
No ratings yet
Unit 1 Java
36 pages
Python Full Notes - Working
100% (4)
Python Full Notes - Working
645 pages
CD - CH5 - Intermediate Code Generation
No ratings yet
CD - CH5 - Intermediate Code Generation
54 pages
Compiler Design Lec-Six Intermediate Languages
No ratings yet
Compiler Design Lec-Six Intermediate Languages
21 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
8 pages
2-Data Types (E-Next - In)
No ratings yet
2-Data Types (E-Next - In)
19 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
2024 CD Ch06 Intermidiate & Ch07 Runtime & Ch08 Code Optimization
No ratings yet
2024 CD Ch06 Intermidiate & Ch07 Runtime & Ch08 Code Optimization
29 pages
CD Imp Ques 2
No ratings yet
CD Imp Ques 2
29 pages
Basics of CPP Objective Questions MCQs
No ratings yet
Basics of CPP Objective Questions MCQs
23 pages
Advanced JavaScript
No ratings yet
Advanced JavaScript
1,130 pages
Text Chapter 2 Basic Objects
No ratings yet
Text Chapter 2 Basic Objects
126 pages
Mod 4
No ratings yet
Mod 4
39 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
11 pages
18CSC305J - UNIT-4.pptx - 18CSC305J - UNIT-4
No ratings yet
18CSC305J - UNIT-4.pptx - 18CSC305J - UNIT-4
77 pages
PPL UNIT 2 Notes
No ratings yet
PPL UNIT 2 Notes
66 pages
Salesforce Flow Quick Reference Guide
No ratings yet
Salesforce Flow Quick Reference Guide
4 pages
MUUnit 4
No ratings yet
MUUnit 4
63 pages
C Programming Language
From Everand
C Programming Language
Younish Pathan
No ratings yet
CSE-303 Chapter-06 Final
No ratings yet
CSE-303 Chapter-06 Final
97 pages
CD Unit4 - PPT
No ratings yet
CD Unit4 - PPT
28 pages
18 Unit-4
No ratings yet
18 Unit-4
16 pages
F77 Reference Manual
No ratings yet
F77 Reference Manual
200 pages
Lecture 209
No ratings yet
Lecture 209
42 pages
Notes For C Language Part 1
No ratings yet
Notes For C Language Part 1
36 pages
CD Unoit 3
No ratings yet
CD Unoit 3
16 pages
4th Phase New Intermediate Code Generator
No ratings yet
4th Phase New Intermediate Code Generator
24 pages
Unit-Iii: Intermediate Code Generation
No ratings yet
Unit-Iii: Intermediate Code Generation
47 pages
Compiler Design Unit 3
No ratings yet
Compiler Design Unit 3
14 pages
CD Unit 5
No ratings yet
CD Unit 5
49 pages
Lec05 Intermediate Code Generation
No ratings yet
Lec05 Intermediate Code Generation
40 pages
StructuringState Answers
No ratings yet
StructuringState Answers
8 pages
Compiler Engineering
No ratings yet
Compiler Engineering
24 pages
CC 6
No ratings yet
CC 6
30 pages
Chapter 6 Intermediate Code Generation
No ratings yet
Chapter 6 Intermediate Code Generation
47 pages
CH-6 Intermediate Code Generator
No ratings yet
CH-6 Intermediate Code Generator
54 pages
UNIT-4 Notes
No ratings yet
UNIT-4 Notes
27 pages
CD Unit-Iii
No ratings yet
CD Unit-Iii
20 pages
UNIT-3 Odg
No ratings yet
UNIT-3 Odg
17 pages
Instructions in C/C++
No ratings yet
Instructions in C/C++
44 pages
Learn Dart Tutorial - W3adda PDF
No ratings yet
Learn Dart Tutorial - W3adda PDF
228 pages
Oracle HRMS Fast Formula
100% (1)
Oracle HRMS Fast Formula
17 pages
Unit 3 Compiler
No ratings yet
Unit 3 Compiler
27 pages
CD Unit3
No ratings yet
CD Unit3
17 pages
Chapter 6 - Intermediate Code Generation
No ratings yet
Chapter 6 - Intermediate Code Generation
5 pages
Chapter 5 - Intermediate Code Generation
No ratings yet
Chapter 5 - Intermediate Code Generation
27 pages
Poc Unit 3
No ratings yet
Poc Unit 3
22 pages
Unit 3 TAC Intermidiate Code Generator
No ratings yet
Unit 3 TAC Intermidiate Code Generator
27 pages
CD Module 8 Print
No ratings yet
CD Module 8 Print
58 pages
Intermediate Code Generation: CD: Compiler Design
No ratings yet
Intermediate Code Generation: CD: Compiler Design
41 pages
006chapter 6 - Intermediate Code Generation
No ratings yet
006chapter 6 - Intermediate Code Generation
23 pages
Chapter 6
No ratings yet
Chapter 6
28 pages
Unit-Iii: Figure 4.1: Intermediate Code Generator
No ratings yet
Unit-Iii: Figure 4.1: Intermediate Code Generator
33 pages
PLC 2 Unity Reference
No ratings yet
PLC 2 Unity Reference
698 pages
CH 5 - Intermediate Code Generation
No ratings yet
CH 5 - Intermediate Code Generation
16 pages
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
21 pages
Python Programming Guide Book
100% (19)
Python Programming Guide Book
323 pages
CS602PC - Compiler - Design - Lecture Notes - Unit - 3
No ratings yet
CS602PC - Compiler - Design - Lecture Notes - Unit - 3
27 pages
TAC
No ratings yet
TAC
25 pages
CS 346: Intermediate Code Generation: Resource
No ratings yet
CS 346: Intermediate Code Generation: Resource
60 pages
3 Intermediate Code Generation
No ratings yet
3 Intermediate Code Generation
20 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
42 pages
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
23 pages
Compiler Design Chapter-6
No ratings yet
Compiler Design Chapter-6
83 pages
Intermediate Code Generation 1
No ratings yet
Intermediate Code Generation 1
56 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
62 pages
Intermediate
No ratings yet
Intermediate
29 pages
Unit 3 Compiler
No ratings yet
Unit 3 Compiler
27 pages
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
No ratings yet
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
11 pages
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
No ratings yet
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
29 pages

Unit-4-2

Uploaded by

Unit-4-2

Uploaded by

18CSC304J- COMPILER DESIGN

SRMIST, Vadapalani Campus

CS416 Compiler Design

● A syntax tree depicts the

● It is a linearized representation of s syntax tree; it is a list of the nodes of the tree

● Three address code is a sequence of statements of the general form

● where tl and t2 are compiler-generated temporary names.

● Three-address statements are similar to assembly code.

The S-attributed definition generates three-address code for assignment statements.

● Postﬁx notation can be obtained by adapting the semantic rules .

The copy statement a : = t5 is

You might also like