0% found this document useful (0 votes)

11 views

3unit cd IntermediateCode_Part1

The document discusses intermediate code generation in compilers, focusing on syntax trees, three-address code, type checking, and control flow. It explains the construction of Directed Acyclic Graphs (DAGs) for expressions, the value-number method for DAG construction, and various forms of three-address code including quadruples and triples. Additionally, it covers type expressions, type equivalence, and the storage layout for local names in programming languages.

Uploaded by

Akhila Athinarapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

3unit cd IntermediateCode_Part1

Uploaded by

Akhila Athinarapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 38

Intermediate Code Generation

Dr. N. Kalyani
Professor, CSE
Contents
Intermediate-Code Generation:
Variants of Syntax Trees
Three-Address Code
Types and Declarations
Type checking
Control Flow
Back Patching
Switch Statements
Intermediate Code for Procedures
Parsing & Intermediate Code generation
• In the analysis-synthesis model of a compiler, the front end
analyzes a source program and creates an intermediate
representation, from which the back end generates target code.
• Source language is confined to the front end, and details of the
target machine to the back end.
• While parsing, static checking, and intermediate-code generation
are done sequentially; All these can be combined and folded into
parsing.
Intermediate Code Representation
• Intermediate representations, including syntax trees and three-
address code.
• Syntax trees are high level; they depict the natural hierarchical
structure of the source program and are well suited to tasks
like static type checking /evaluation ordering.
• A low-level representation is suitable for machine-dependent
tasks like register allocation and instruction selection.
• “Three-address code" comes from instructions of the general
form x = y op z with three addresses: two for the operands y
and z and one for the result x.
• Three-address code can range from high to low-level,
depending on the choice of operators.
Variants of Syntax Trees
1. Directed Acyclic Graphs for Expressions
2. The Value-Number Method for Constructing DAG's
1. Directed Acyclic Graphs for Expressions

Like the syntax tree for an expression,

• A DAG has leaves corresponding to atomic operands and
interior nodes corresponding to operators.
• The difference is that a node N in a DAG has more than one
parent if N represents a common subexpression.
• In a syntax tree, the tree for the common subexpression
would be replicated as many times as the subexpression
appears in the original expression.
• A DAG gives the compiler important clues regarding the
generation of efficient code to evaluate the expressions.
DAG for the expression

a + a * (b - c) + (b - c) * d
Construction of DAG

 The SDD can construct either syntax trees or DAG’s.

 It was used to construct syntax trees, where functions

Leaf and Node created a fresh node each time they
were called.
 It will construct a DAG if, before creating a new node,
these functions first check whether an identical node
already exists.
 If a previously created identical node exists, the
existing node is returned.
SDD to produce Syntax tree or DAG
2. The Value-Number Method for Constructing DAG's
2. The Value-Number Method for Constructing DAG's
Algorithm : The value-number method for constructing the nodes of a
DAG.
INPUT : Label op, node l, and node r.
OUTPUT : The value number of a node in the array with signature
(op, l, r).
METHOD :
• Search the array for a node M with label op, left child I, and right
child r.
• If there is such a node, return the value number of M.
• If not, create in the array a new node N with label op, left child I,
and right child r, and return its value number.
Essential data structure to construct DAG
• The hash table is one of several data structures that support
dictionaries efficiently.
• To construct a hash table for the nodes of a DAG, we need a
hash function h that computes the index of the bucket for a
signature (op, I, r).
• The bucket index h(op, I, r) is computed deterministically from
op, I, and r, so that we may repeat the calculation and always get
to the same bucket index for node (op, I, r).
Exercise

Construct the DAG for the expression

((a + y)- ((x + y)(x- y))) + ((x + y) (x - y))

Three-Address Code

1. Addresses and Instructions

2. Quadruples

3. Triples

4. Static Single-Assignment Form

Three-Address Code

• Three-address code - One operator and two operands on right side.

• For expression like x+y*z might be translated into the sequence of
three-address instructions

where t1 and t2 are compiler-generated temporary names.

• The use of names for the intermediate values computed by a
program allows three-address code to be rearranged easily.
DAG for the expression

a + a * (b - c) + (b - c) * d
Addresses and Instructions to build Three-address code
Three -address code can be implemented using records called
quadruples and triples
An address can be one of the following:
A name : For convenience, we allow source-program names
(pointer to its symbol-table entry) to appear as addresses in
three-address code.
A constant : In practice, a compiler must deal with many
different types of constants and variables.
A compiler-generated temporary : Useful, especially in
optimizing compilers, to create a distinct name each time a
temporary is needed. These temporaries can be combined, if
possible, when registers are allocated to variables.
Addresses and Instructions to build Three-address code

A symbolic label represents the index of a three-address instruction in

the sequence of instructions. Actual indexes can be substituted for the
labels, either by making a separate pass or by "backpatching,"
1. Assignment instructions of the form x = y op z, where op is a
binary arithmetic or logical operation, and x, y, and z are
addresses.
2. Assignments of the form x = op y, where op is a unary operation.
Essential unary operations include unary minus, logical negation,
shift operators, and conversion operators (int, float, etc).
3. Copy instructions of the form x = y, where x is assigned the
value of y.
4. An unconditional jump go to L. The three-address instruction
with label L is the next to be executed.
Addresses and Instructions to build Three-address code

5. Conditional jumps of the form if x goto L and if False x goto L.

These instructions execute the instruction with label L next if x is
true and false, respectively. Otherwise, the following three-
address instruction in sequence is executed next, as usual.
6. Conditional jumps such as if x relop y goto L, which apply a
relational operator (<, ==, >=, etc.) to x and y, and execute the
instruction with label L next if x stands in relation relop to y. If
not, the three-address instruction following if x relop y goto L is
executed next, in sequence.
7. Procedure calls and returns are implemented using the following
instructions: param x for parameters; call p , n and y = call p , n
for procedure and function calls, respectively; and return y, where
y, representing a returned value, is optional. Their typical use is as
the sequence of three-address instructions
Addresses and Instructions to build Three-address code
8. Indexed copy instructions of the form x = y[i] and x[i]
= y. The instruction x = y[i] sets x to the value in the
location i memory units beyond location y. The
instruction x[i] =y sets the contents of the location i
units beyond x to the value of y.
9. Address and pointer assignments of the form x = &y, x =
* y, and * x = y.
• x = &y sets the r-value of x to be the location (l-value) of y.

• x = *y, The r-value of x is made equal to the contents of the

r-value of location y.
• *x = y sets the r-value of the object pointed to by x to the r-
value of y.
Addresses and Instructions to build Three-address code

Consider the statement do i = i + 1 ; while ( a [ i ] < v ) ;

2. Quadruples
A quadruple (or just "quad!') has four fields op, arg1, arg2, result.
• op field contains an internal code for the operator, agr 1 and arg2
are the arguments and the fourth field is the result.
For instance, the three-address instruction x = y + z is represented by
placing + in op, y in arg1, z in arg2, and x in result.

The following are some exceptions to this rule:

1. Instructions with unary operators like x = minus y or x = y do not
use arg2. Note that for a copy statement like x = y, op is =, while for
most other operations, the assignment operator is implied.

2. Operators like param use neither arg2 nor result.

3. Conditional and unconditional jumps put the target label in result.

2. Quadruples
Three-address code for the assignment a = b * - c + b * - c
3. Triples
• A triple has only three fields, which we call op, arg1, and arg2.
• Using triples, we refer to the result of an operation x op y by its
position, rather than by an explicit temporary name.
• A triple representation would refer to position (0). Parenthesized
numbers represent pointers into the triple structure itself.
• Triples are equivalent to signatures in Value numbering method.
The DAG and triple representations of expressions are
equivalent.
• The equivalence ends with expressions, since syntax-tree
variants and three-address code represent control flow quite
differently.
3. Triples

a = (b * - c) + (b * - c)
Indirect triples
• Indirect triples consist of a listing of pointers to triples, rather than
a listing of triples themselves.
• With indirect triples, an optimizing compiler can move an
instruction by reordering the instruction list, without affecting the
triples themselves.
• When implemented in Java, an array of instruction objects is
analogous to an indirect triple representation, since Java treats the
array elements as references to objects.
Translate the arithmetic expression
a) a + - (a -b- c)
b) a = b[i] + c[j]

1. A syntax tree.
2. Quadruples.
3. Triples.
4. Indirect triples.
Static Single-Assignment Form
• Static single-assignment form (SSA) is an intermediate representation
that facilitates certain code optimizations.
• Two distinctive aspects distinguish SSA from three-address code.
• The first is that all assignments in SSA are to variables with distinct
names; hence the term static single-assignment.
• Note that subscripts distinguish each definition of variables p and q in the
SSA representation.
• The same variable may be defined in two different control-flow paths in
a program.
Types and Declarations

1 Type Expressions

2 Type Equivalence

3 Declarations

4 Storage Layout for Local Names

5 Sequences of Declarations
Types and Declaration
Type checking
• It uses logical rules to reason about the behavior of a program at
run time.
• Specifically, it ensures that the types of the operands match the
type expected by an operator.
• Example : && operator expects its two operands to be Booleans,
the result is also of type Boolean.

Translation Applications
• From the type of a name, a compiler can determine the storage
that will be needed for that name at run time.
• Type information is also needed to calculate the address denoted
by an array reference, to insert explicit type conversions.
1. Type Expressions
• Types have structure, which we shall represent using type
expressions.
• A type expression is either a basic type or is formed by
applying an operator called a type constructor to a type
expression.
• The sets of basic types and constructors depend on the
language to be checked.
1. Type Expressions
Definition of type expressions:

• A basic type is a type expression. Typical basic types for a

language include Boolean, char, integer, float, and void.
• A type name is a type expression.
• A type expression can be formed by applying the array type
constructor to a number and a type expression.
• A record is a data structure with named fields. A type expression
can be formed by applying the record type constructor to the
field names and their types.
• A type expression can be formed by using the type constructor
→• for function types. We write s —»• t for "function from type
s to type t."
Type Names and Recursive Types
Once a class is defined, its name can be used as a type name
Example: consider Node in the program fragment
public class Node { • • • }
public Node n;
Names can be used to define recursive types, which are needed for
data structures such as linked lists.
class Cell { int info; Cell next; ••• }
Similar recursive types can be defined using records and pointers.
If s and t are type expressions, then their Cartesian product s x t is
a type expression.
2. Type Equivalence
When type expressions are represented by graphs, two types are
structurally equivalent if and only if one of the following conditions
is true:
• They are the same basic type.
• They are formed by applying the same constructor to structurally
equivalent types.
• One is a type name that denotes the other.
• If type names are treated as standing for themselves, then the
first two conditions in the above definition lead to name
equivalence of type expressions.
• Name-equivalent expressions are assigned the same value
number, if used.
3. Declarations
A simplified grammar that declares just one name at a time and
declarations with lists of names.

The above grammar that deals with basic and array types was used to
illustrate inherited attributes.
• Nonterminal D generates a sequence of declarations.
• Nonterminal T generates basic, array, or record types.
• Nonterminal B generates one of the basic types int and float.
• Nonterminal C, for "component," generates strings of zero or more
integers, each integer surrounded by brackets.
4. Storage Layout for Local Names
4. Storage Layout for Local Names

parse tree for the type int [2][3]

5. Sequences of Declarations

Chapter-6 (Compiler Design and Construction)
100% (1)
Chapter-6 (Compiler Design and Construction)
14 pages
1 Intermediate Code Generation VNM
No ratings yet
1 Intermediate Code Generation VNM
17 pages
CSE-303 Chapter-06 Final (1)
No ratings yet
CSE-303 Chapter-06 Final (1)
97 pages
Chapter 6
No ratings yet
Chapter 6
28 pages
18 UNIT-4(1)
No ratings yet
18 UNIT-4(1)
16 pages
Module 5 Chapter 6 ICG
No ratings yet
Module 5 Chapter 6 ICG
44 pages
UNIT-4 Notes
No ratings yet
UNIT-4 Notes
27 pages
Chapter 6 - Intermediate Code Generation
No ratings yet
Chapter 6 - Intermediate Code Generation
5 pages
Unit-Iv: Intermediate Code Generation
No ratings yet
Unit-Iv: Intermediate Code Generation
19 pages
1 Unit 4 Complete
No ratings yet
1 Unit 4 Complete
92 pages
24-Module 4_ Variants of Syntax Trees - Three Address Code-10!09!2024
100% (1)
24-Module 4_ Variants of Syntax Trees - Three Address Code-10!09!2024
44 pages
Cs 3007 Inter Code Gen
No ratings yet
Cs 3007 Inter Code Gen
42 pages
Unit-4 LMD CD
No ratings yet
Unit-4 LMD CD
34 pages
CS6109-MODULE-8
No ratings yet
CS6109-MODULE-8
42 pages
CD Unit 5
No ratings yet
CD Unit 5
49 pages
Chapter 6 - ICG
No ratings yet
Chapter 6 - ICG
14 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
62 pages
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
No ratings yet
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
29 pages
CD Unit-Iii
No ratings yet
CD Unit-Iii
20 pages
UNIT 3 - Chapter 2 in Compiler Design
No ratings yet
UNIT 3 - Chapter 2 in Compiler Design
38 pages
4th Phase New Intermediate code generator
No ratings yet
4th Phase New Intermediate code generator
24 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
13 pages
cd_3rd unit _15
No ratings yet
cd_3rd unit _15
58 pages
CD Module 8 Print
No ratings yet
CD Module 8 Print
58 pages
3 Intermediate Code Generation
No ratings yet
3 Intermediate Code Generation
20 pages
Intermediate Code Generation: CD: Compiler Design
No ratings yet
Intermediate Code Generation: CD: Compiler Design
41 pages
6837
No ratings yet
6837
47 pages
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
21 pages
CD Unit3
No ratings yet
CD Unit3
17 pages
Lec05 Intermediate Code Generation
No ratings yet
Lec05 Intermediate Code Generation
40 pages
CS 346: Intermediate Code Generation: Resource
No ratings yet
CS 346: Intermediate Code Generation: Resource
60 pages
Compiler Construction Week 14
No ratings yet
Compiler Construction Week 14
23 pages
Construction of Syntax Trees
No ratings yet
Construction of Syntax Trees
15 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
23 pages
Compiler Construction: A Compulsory Module For Students in
No ratings yet
Compiler Construction: A Compulsory Module For Students in
34 pages
Lecture 08
No ratings yet
Lecture 08
36 pages
Subject Code: 6CS63/06IS662 NO. of Lectures Per Week: 04 Total No. of Lecture HRS: 52 IA Marks: 25 Exam HRS: 03 Exam Marks:100
No ratings yet
Subject Code: 6CS63/06IS662 NO. of Lectures Per Week: 04 Total No. of Lecture HRS: 52 IA Marks: 25 Exam HRS: 03 Exam Marks:100
38 pages
Compiler Design Chapter-6
No ratings yet
Compiler Design Chapter-6
83 pages
FALLSEM2023-24 BCSE307L TH VL2023240100900 2023-06-14 Reference-Material-I
No ratings yet
FALLSEM2023-24 BCSE307L TH VL2023240100900 2023-06-14 Reference-Material-I
33 pages
Chapter 6 - Intermediate Code Generation
No ratings yet
Chapter 6 - Intermediate Code Generation
42 pages
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
No ratings yet
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
11 pages
006chapter 6 - Intermediate Code Generation
No ratings yet
006chapter 6 - Intermediate Code Generation
23 pages
Intermediate Code Generation in Compiler Design
No ratings yet
Intermediate Code Generation in Compiler Design
29 pages
Chapter 6 Intermediate Code Generation
No ratings yet
Chapter 6 Intermediate Code Generation
47 pages
Three Address Code Report
No ratings yet
Three Address Code Report
11 pages
UNIT-3 Part-A:Semantic Analysis 1. Intermediate Code Forms
No ratings yet
UNIT-3 Part-A:Semantic Analysis 1. Intermediate Code Forms
26 pages
Three Address Code (TAC) : Addresses and Instructions
No ratings yet
Three Address Code (TAC) : Addresses and Instructions
28 pages
Chapter 5 - Intermediate Code Generation
No ratings yet
Chapter 5 - Intermediate Code Generation
27 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
11 pages
Intermediate Code Generator-20241219073843
No ratings yet
Intermediate Code Generator-20241219073843
40 pages
Compiler Design Unit 3
No ratings yet
Compiler Design Unit 3
14 pages
INTERMEDIATE CODE GENERATION & RUNTIME ENVIRNOMENTS
No ratings yet
INTERMEDIATE CODE GENERATION & RUNTIME ENVIRNOMENTS
35 pages
Module 4 - Intermediate Code Generation
No ratings yet
Module 4 - Intermediate Code Generation
61 pages
CS602PC - Compiler - Design - Lecture Notes - Unit - 3
No ratings yet
CS602PC - Compiler - Design - Lecture Notes - Unit - 3
27 pages
Three Address Code
No ratings yet
Three Address Code
21 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
42 pages
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
23 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
7 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Software ConceptS Class XI
No ratings yet
Software ConceptS Class XI
7 pages
Unit 3 MainMemory Solutions
No ratings yet
Unit 3 MainMemory Solutions
10 pages
# Tutorial 9 & 10
No ratings yet
# Tutorial 9 & 10
6 pages
Atmel White Paper Introducing New Breed of Microcontrollers For 8-16-Bit Applications
No ratings yet
Atmel White Paper Introducing New Breed of Microcontrollers For 8-16-Bit Applications
15 pages
Chapter Four
No ratings yet
Chapter Four
66 pages
Heat Transfer Computer Design
No ratings yet
Heat Transfer Computer Design
62 pages
A Simulated Mano Machine An Novel Project For Computer Architecture Class
No ratings yet
A Simulated Mano Machine An Novel Project For Computer Architecture Class
28 pages
R.Nageswara Rao.: in Tsolv Solutions
No ratings yet
R.Nageswara Rao.: in Tsolv Solutions
52 pages
Banking System Project
No ratings yet
Banking System Project
94 pages
Lecture 1
No ratings yet
Lecture 1
12 pages
Assembly Language Assignment 1
No ratings yet
Assembly Language Assignment 1
14 pages
Lecture Notes
No ratings yet
Lecture Notes
12 pages
Ec8691 MPMC Question Bank
No ratings yet
Ec8691 MPMC Question Bank
41 pages
Fundamentals of Programming
No ratings yet
Fundamentals of Programming
49 pages
Unit II
No ratings yet
Unit II
28 pages
Icjecapu 09
No ratings yet
Icjecapu 09
7 pages
Operand Storage in The CPU
No ratings yet
Operand Storage in The CPU
3 pages
CD Ict Worksheet La5 Form 5
No ratings yet
CD Ict Worksheet La5 Form 5
29 pages
PPS-1 Unit - 1
No ratings yet
PPS-1 Unit - 1
52 pages
Java Introduction
No ratings yet
Java Introduction
9 pages
PLC Programming 1
No ratings yet
PLC Programming 1
46 pages
Computer Architecture and Organization Mcqs
No ratings yet
Computer Architecture and Organization Mcqs
9 pages
1S00252 F.Y.B.sc - I.T Sem. II Choice Base 77502 Microprocessor Architecture. Q.P.code 33407
No ratings yet
1S00252 F.Y.B.sc - I.T Sem. II Choice Base 77502 Microprocessor Architecture. Q.P.code 33407
7 pages
Implementation Issue For Super Instructions in Gforth
No ratings yet
Implementation Issue For Super Instructions in Gforth
9 pages
My Notes PPS
No ratings yet
My Notes PPS
101 pages
DISA Review Questions-May-18
No ratings yet
DISA Review Questions-May-18
15 pages
Lecture 3 - Introduction To Computer Data Processing Using Python
No ratings yet
Lecture 3 - Introduction To Computer Data Processing Using Python
22 pages
Introduction To Computer Programming Concepts
No ratings yet
Introduction To Computer Programming Concepts
22 pages
Class IX Chapter 2
No ratings yet
Class IX Chapter 2
36 pages
Pattara Python 01204111
No ratings yet
Pattara Python 01204111
233 pages

3unit cd IntermediateCode_Part1

Uploaded by

3unit cd IntermediateCode_Part1

Uploaded by

Intermediate Code Generation

Like the syntax tree for an expression,

 The SDD can construct either syntax trees or DAG’s.

 It was used to construct syntax trees, where functions

Construct the DAG for the expression

((a + y)- ((x + y)*(x- y))) + ((x + y) * (x - y))

1. Addresses and Instructions

4. Static Single-Assignment Form

• Three-address code - One operator and two operands on right side.

where t1 and t2 are compiler-generated temporary names.

A symbolic label represents the index of a three-address instruction in

5. Conditional jumps of the form if x goto L and if False x goto L.

• x = *y, The r-value of x is made equal to the contents of the

Consider the statement do i = i + 1 ; while ( a [ i ] < v ) ;

The following are some exceptions to this rule:

2. Operators like param use neither arg2 nor result.

3. Conditional and unconditional jumps put the target label in result.

4 Storage Layout for Local Names

• A basic type is a type expression. Typical basic types for a

parse tree for the type int [2][3]

You might also like

((a + y)- ((x + y)(x- y))) + ((x + y) (x - y))