Lecture 08

The document discusses the process of intermediate code generation in compilers, highlighting the role of intermediate representations (IR) in optimizing code and enabling platform independence. It covers the advantages of using IR, such as retargeting and optimization opportunities, and explains various forms of intermediate code like three-address code and quadruples. Additionally, it addresses the importance of directed acyclic graphs (DAGs) in representing expressions and optimizing code generation.

Uploaded by

nihafahima9

Intermediate Code Generation

12/8/2024 1
Summary of Front End

Front End stages:
Lexical Analyzer (Scanner) + Syntax Analyzer (Parser) + Semantic Analyzer
-> Abstract Syntax Tree w/Attributes
-> Intermediate-code Generator
-> Non-optimized Intermediate Code

Side output: Error Message

12/8/2024 \course\cpeg621-10F\Topic-1a.ppt 2
Component-Based Approach to Building Compilers

Source program in Language-1 -> Language-1 Front End -> Non-optimized Intermediate Code
Source program in Language-2 -> Language-2 Front End -> Non-optimized Intermediate Code

Non-optimized Intermediate Code -> Intermediate-code Optimizer -> Optimized Intermediate Code

Optimized Intermediate Code -> Target-1 Code Generator -> Target-1 machine code
Optimized Intermediate Code -> Target-2 Code Generator -> Target-2 machine code
Intermediate Representation (IR)

A kind of abstract machine language that can express
the target machine operations without committing to
too many machine details.

• Why IR?
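Without IR

Source languages: C, Pascal, FORTRAN, C++
Target machines: SPARC, HP PA, x86, IBM PPC

[Figure: the source languages are connected directly to the target machines, with no shared IR between them.]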
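With IR

Source languages: C, Pascal, FORTRAN, C++
Target machines: SPARC, HP PA, x86, IBM PPC

[Figure: the source languages are translated to a single IR, and the IR is translated to the target machines.]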
With IR

Source languages: Pascal, FORTRAN, C++

[Figure: the source languages feed one IR, which feeds a box labeled "Common Backend ?".]
Advantages of Using an Intermediate Language

1. Retargeting - Build a compiler for a new machine by
attaching a new code generator to an existing front-end.
2. Optimization - Reuse intermediate code optimizers in
compilers for different languages and different machines.

Note: the terms “intermediate code”, “intermediate
language”, and “intermediate representation” are all
used interchangeably.
Issues in Designing an IR

❖ Whether to use an existing IR

▪ if target machine architecture is similar
▪ if the new language is similar
❖ Whether the IR is appropriate for the kind of
optimizations to be performed
▪ e.g. speculation and prediction
▪ some transformations may take much
longer than they would on a different IR

Issues in Designing an IR

❖ Designing a new IR needs to consider

▪ Level (how machine dependent it is)
▪ Structure
▪ Expressiveness
▪ Appropriateness for general and special
optimizations
▪ Appropriateness for code generation
▪ Whether multiple IRs should be used
Source code can be translated directly into
machine code, so why do we also need
intermediate code? In other words, what are
the advantages of having intermediate code?
Importance of Intermediate Code
Generation

• Platform Independence: The intermediate code can be
further translated into machine code for different target
platforms.
• Optimization Opportunities: Optimization techniques
can be applied to intermediate code, improving efficiency
before generating the final machine code.
• Simplifies Code Generation: By breaking down the
process into two phases (intermediate and final code
generation), the complexity of translating high-level code
directly into machine code is reduced.
Characteristics of Intermediate Code

• Abstraction Level: It’s an abstraction between

the source and target code.
• Portable: Intermediate code is independent of
specific machine architectures.
• Efficient: It allows for optimizations that are
difficult to apply to high-level or machine-level
code directly.
Directed Acyclic Graph (DAG)
• Definition: A Directed Acyclic Graph (DAG) is a graph
with directed edges and no cycles. In compiler
optimization, DAGs are used to represent expressions
and control dependencies.
• Purpose: DAGs provide a way to simplify the
representation of expressions and detect common
subexpressions, which can be eliminated to optimize the
code.
• Key Features:
– Nodes represent operations or operands.
– Edges represent dependencies between operations.
– It is acyclic, meaning no node can have a path back
to itself, ensuring a clear order of execution.
DAG for Expressions
• A DAG has leaves corresponding to atomic operands and
interior nodes corresponding to operators.
❑ Difference between DAG & Syntax Tree
• A node N in a DAG has more than one parent if N
represents a common subexpression.
• In a syntax tree, the tree for the common
subexpression would be replicated as many times as the
subexpression appears in the original expression.
Thus, a DAG not only represents expressions more
succinctly, it also gives the compiler important clues
regarding the generation of efficient code to evaluate
the expressions.
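Example: a + a * (b-c) + (b-c) * d

[Figure: DAG for this expression. Interior nodes are labeled a + a*(b-c), (b-c) * d, a * (b-c), and (b-c); the node (b-c) is shared by a * (b-c) and (b-c) * d.]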
Value Number Method for Constructing DAGs

• Often, the nodes of a syntax tree or DAG are
stored in an array of records.
• Each row of the array represents one record,
and therefore one node.
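
The array-of-records idea can be sketched in a few lines of Python. This is only an illustrative sketch, not the lecture's implementation: the class name `DagBuilder`, its methods, and the dictionary used to find existing records are all assumptions. Each record is an (op, left, right) triple, its index in the array serves as its value number, and a node is created only if an identical record does not already exist.

```python
class DagBuilder:
    """Hypothetical helper: builds a DAG using value numbers."""

    def __init__(self):
        self.nodes = []   # array of records; the index is the value number
        self.index = {}   # record -> value number, used to detect duplicates

    def node(self, op, left=None, right=None):
        """Return the value number of (op, left, right), creating it if new."""
        record = (op, left, right)
        if record not in self.index:
            self.nodes.append(record)
            self.index[record] = len(self.nodes) - 1
        return self.index[record]

    def leaf(self, name):
        """A leaf is a record with no children."""
        return self.node(name)


def build_example():
    """Build the DAG for a + a * (b-c) + (b-c) * d."""
    b = DagBuilder()
    a = b.leaf("a")
    first = b.node("-", b.leaf("b"), b.leaf("c"))    # first (b-c)
    prod1 = b.node("*", a, first)                    # a * (b-c)
    sum1 = b.node("+", a, prod1)                     # a + a * (b-c)
    second = b.node("-", b.leaf("b"), b.leaf("c"))   # second (b-c): reused
    prod2 = b.node("*", second, b.leaf("d"))         # (b-c) * d
    root = b.node("+", sum1, prod2)
    return b, root, first, second
```

In this sketch both occurrences of (b-c) receive the same value number, so the array holds nine records rather than the thirteen nodes a syntax tree would need.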
DAG Representation
A variant of syntax tree.
DAG: Directed Acyclic Graph
Example: D = ((A+B*C) + (A*B*C))/ -C

[Figure: DAG for the example. The root = has children D and /; the node for B*C is shared by A+B*C and A*B*C.]
Postfix Notation (PN)
A mathematical notation wherein every
operator follows all of its operands.
Examples:

The PN of expression 9 * (5+2) is 9 5 2 + *

How about (a+b)/(c-d)? a b + c d - /
Postfix Notation (PN) – Cont’d

Form Rules:
1. If E is a variable/constant, the PN of E is E
itself
2. If E is an expression of the form E1 op E2, the
PN of E is E1’E2’op (E1’ and E2’ are the PN of E1
and E2, respectively.)
3. If E is a parenthesized expression of form
(E1), the PN of E is the same as the PN of E1.
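
The three rules translate almost directly into a recursive function. The sketch below is illustrative only and assumes a made-up encoding: a binary expression is a tuple `(op, E1, E2)`, a parenthesized expression is `("paren", E1)`, and anything else is a variable or constant.

```python
def to_postfix(e):
    """Return the postfix form of e as a list of tokens."""
    # Rule 1: a variable or constant is its own postfix form.
    if not isinstance(e, tuple):
        return [str(e)]
    # Rule 3: parentheses disappear; use the postfix form of the inner expression.
    if e[0] == "paren":
        return to_postfix(e[1])
    # Rule 2: E1 op E2 becomes E1' E2' op.
    op, e1, e2 = e
    return to_postfix(e1) + to_postfix(e2) + [op]


nine_times = ("*", 9, ("paren", ("+", 5, 2)))
quotient = ("/", ("paren", ("+", "a", "b")), ("paren", ("-", "c", "d")))
print(" ".join(to_postfix(nine_times)))   # 9 5 2 + *
print(" ".join(to_postfix(quotient)))     # a b + c d - /
```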

Three Address Code
• In three-address code, there is at most
one operator on the right side of an
instruction.
• That is, no built-up arithmetic expressions
are permitted.
• The source-language expression x + y * z
might be translated into
t1 = y * z
t2 = x + t1
where t1 & t2 are compiler-generated temporary
names.
Three-Address Statements

A popular form of intermediate code used in optimizing
compilers is three-address statements.
Source statement:
x = a + b * c + d
Three-address statements with temporaries t1 and t2:
t1 = b * c
t2 = a + t1
x = t2 + d
Three Address Code
The general form
x := y op z
x, y, and z are names, constants, or
compiler-generated temporaries
op stands for any operator such as +, -, …
x*5-y might be translated as
t1 := x * 5
t2 := t1 - y
Syntax-Directed Translation Into
Three-Address Code

Temporaries
• In general, when generating three-address
statements, the compiler has to create new temporary
variables (temporaries) as needed.
• We use a function newtemp( ) that returns a new
temporary each time it is called.
• Recall the discussion of this topic in Topic-2.
Syntax-Directed Translation Into
Three-Address Code

• The syntax-directed definition for E in a production
id := E has two attributes:
1. E.place - the location (variable name or offset) that
holds the value corresponding to the nonterminal
2. E.code - the sequence of three-address statements
representing the code for the nonterminal
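
The two attributes can be illustrated with a small recursive translator. This is a rough sketch under stated assumptions, not the lecture's translation scheme: expressions are encoded as nested tuples `(op, E1, E2)` with strings for identifiers, the class `TacGen` is hypothetical, and only `newtemp`, `place`, and `code` correspond to names used on the slides.

```python
import itertools


class TacGen:
    """Hypothetical generator: computes (place, code) for each expression."""

    def __init__(self):
        self._counter = itertools.count(1)

    def newtemp(self):
        """Return a fresh temporary name each time it is called."""
        return f"t{next(self._counter)}"

    def expr(self, e):
        """Return (E.place, E.code) for expression e."""
        if isinstance(e, str):
            # An identifier already has a place and needs no code.
            return e, []
        op, left, right = e
        lplace, lcode = self.expr(left)
        rplace, rcode = self.expr(right)
        place = self.newtemp()
        code = lcode + rcode + [f"{place} := {lplace} {op} {rplace}"]
        return place, code

    def assign(self, target, e):
        """Translate the production id := E."""
        place, code = self.expr(e)
        return code + [f"{target} := {place}"]


# x = a + b * c + d
tree = ("+", ("+", "a", ("*", "b", "c")), "d")
for line in TacGen().assign("x", tree):
    print(line)
```

This naive scheme always copies E.place into the target, so it emits `t3 := t2 + d` followed by `x := t3`, one instruction more than the hand-written translation of the same statement, which assigns `t2 + d` to x directly.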

Syntax tree vs. Three address code

Expression: (A+B*C) + (-B*A) - B

[Figure: syntax tree for the expression, with root -, whose operands are the subtree for (A+B*C) + (-B*A) and the leaf B.]

T1 := B * C
T2 := A + T1
T3 := - B
T4 := T3 * A
T5 := T2 + T4
T6 := T5 - B

Three address code is a linearized representation
of a syntax tree (or a DAG) in which explicit names
(temporaries) correspond to the interior nodes of the graph.
DAG vs. Three address code

Expression: D = ((A+B*C) + (A*B*C))/ -C

[Figure: DAG for the expression, in which the node for B*C is shared.]

IR code sequence 1:
T1 := A
T2 := C
T3 := B * T2
T4 := T1 + T3
T5 := T1 * T3
T6 := T4 + T5
T7 := - T2
T8 := T6 / T7
D := T8

IR code sequence 2:
T1 := B * C
T2 := A + T1
T3 := A * T1
T4 := T2 + T3
T5 := - C
T6 := T4 / T5
D := T6

Question: Which IR code sequence is better?
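
One way to explore the question is to generate code for the same expression with and without sharing of repeated subexpressions. The sketch below is only an illustration: the function `emit`, its `share` flag, and the nested-tuple encoding are assumptions, and A*B*C is grouped as A*(B*C) to match the sequence that begins with T1 := B * C.

```python
def emit(target, expr, share):
    """Emit three-address code for a nested-tuple expression.

    share=True reuses the temporary of a repeated subexpression (DAG-like);
    share=False recomputes it every time it appears (tree-like).
    """
    code, cache = [], {}

    def walk(e):
        if isinstance(e, str):
            return e                       # leaf: use the name directly
        if share and e in cache:
            return cache[e]                # common subexpression: reuse
        args = [walk(a) for a in e[1:]]
        temp = f"T{len(code) + 1}"
        if len(args) == 1:
            code.append(f"{temp} := {e[0]} {args[0]}")
        else:
            code.append(f"{temp} := {args[0]} {e[0]} {args[1]}")
        cache[e] = temp
        return temp

    code.append(f"{target} := {walk(expr)}")
    return code


bc = ("*", "B", "C")
expr = ("/", ("+", ("+", "A", bc), ("*", "A", bc)), ("-", "C"))
print(len(emit("D", expr, share=False)))   # 8 instructions
print(len(emit("D", expr, share=True)))    # 7 instructions
```

With sharing enabled, this sketch reproduces the sequence that begins with T1 := B * C. The tree-like output is not identical to the other sequence, which also copies the leaves A and C into temporaries, but it shows the same effect: B * C is computed twice.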

Three-address Implementation
Technique
• The description of three-address instructions
specifies the components of each type of
instruction, but it does not specify the
representation of these instructions in a data
structure.
• In a compiler, these instructions can be
implemented as objects or as records with
fields for the operator and the operands.
❑ quadruples
❑ triples
❑ indirect triples
Quadruples
• A quadruple (or “quad”) has four fields:
op, arg1, arg2 & result.
• The op field contains an internal code for
the operator.

• x = y + z is represented by placing + in op,

y in arg1, z in arg2 , and x in result.
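• There are a few exceptions to this rule; for example, a unary
operator does not use arg2.
Example: a = b * -c + b * -c
[Figure: the three-address code for this assignment and the corresponding quadruples.]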
Triples
• A triple has only three fields: op, arg1 , & arg2
• Using triples, we refer to the result of an
operation x op y by its position, rather than
by an explicit temporary name.
• Thus, instead of the temporary t1, a triple
representation would refer to position (0).
• Parenthesized numbers represent pointers
into the triple structure itself (value numbers)
Example: a = b * -c + b * -c
[Figure: three-address code for this assignment.]
Implementation of Three Address
Code
• Quadruples
• Four fields: op, arg1, arg2, result
¤ Array of struct {op, *arg1, *arg2, *result}
• x:=y op z is represented as op y, z, x
• arg1, arg2 and result are usually pointers to
symbol table entries.
• May need to use many temporary names.
• Many assembly instructions are like
quadruples, but arg1, arg2, and result are real
registers.
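
As a rough illustration, the quadruples for a = b * -c + b * -c can be written as an array of four-field records. Several things here are assumptions made for the sketch: plain strings stand in for symbol-table pointers, the operator names `minus` and `=` are illustrative, and the small interpreter `run` exists only to check that the sequence computes the expected value.

```python
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    op: str
    arg1: str
    arg2: Optional[str]   # None when the operator takes a single operand
    result: str


# a = b * -c + b * -c
QUADS = [
    Quad("minus", "c", None, "t1"),
    Quad("*", "b", "t1", "t2"),
    Quad("minus", "c", None, "t3"),
    Quad("*", "b", "t3", "t4"),
    Quad("+", "t2", "t4", "t5"),
    Quad("=", "t5", None, "a"),
]


def run(quads, env):
    """Interpret the quadruples over a copy of env and return the final values."""
    env = dict(env)
    for q in quads:
        x = env[q.arg1]
        if q.op == "minus":
            env[q.result] = -x
        elif q.op == "=":
            env[q.result] = x
        elif q.op == "*":
            env[q.result] = x * env[q.arg2]
        elif q.op == "+":
            env[q.result] = x + env[q.arg2]
        else:
            raise ValueError(f"unknown operator: {q.op}")
    return env


print(run(QUADS, {"b": 3, "c": 2})["a"])   # -12
```

Because every result has an explicit name, the quadruples can be moved around without touching the instructions that use those names, at the cost of the many temporaries noted above.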
Implementation of Three Address Code
(Cont’d)
• Triples
• Three fields: op, arg1, and arg2. The result becomes implicit.
• arg1 and arg2 are either pointers to the symbol table or
indexes/pointers into the triple structure.
Example: d = a + (b*c)
1 * b, c
2 + a, (1)
3 assign d, (2)
Question: what problem arises when the code is reordered?
• No explicit temporary names used.
• Need more than one entry for ternary operations such
as x:=y[i], a=b+c, x[i]=y, … etc.
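
The triple form of d = a + (b*c) can be sketched the same way. The encoding is an assumption made for illustration: a string operand names a symbol, an integer operand is the position of an earlier triple, and positions start at 0 because Python lists are 0-based (the example above numbers them from 1). The `evaluate` function is a hypothetical checker, not part of any compiler.

```python
from typing import NamedTuple, Union

Arg = Union[str, int]   # str: symbol name; int: position of an earlier triple


class Triple(NamedTuple):
    op: str
    arg1: Arg
    arg2: Arg


# d = a + (b*c)
TRIPLES = [
    Triple("*", "b", "c"),       # (0)
    Triple("+", "a", 0),         # (1) refers to the result of (0)
    Triple("assign", "d", 1),    # (2) stores the result of (1) in d
]


def evaluate(triples, env):
    """Evaluate the triples over a copy of env and return the final values."""
    env = dict(env)
    results = []   # results[i] holds the value computed by triple i

    def value(arg):
        return results[arg] if isinstance(arg, int) else env[arg]

    for t in triples:
        if t.op == "assign":
            env[t.arg1] = value(t.arg2)
            results.append(env[t.arg1])
        elif t.op == "*":
            results.append(value(t.arg1) * value(t.arg2))
        elif t.op == "+":
            results.append(value(t.arg1) + value(t.arg2))
        else:
            raise ValueError(f"unknown operator: {t.op}")
    return env


print(evaluate(TRIPLES, {"a": 1, "b": 2, "c": 3})["d"])   # 7
```

Because operands refer to positions rather than names, moving a triple changes its position, and every reference to it has to be renumbered. This is the cost of dropping explicit temporary names.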