Intermediate code
generation
Compiler construction
Compiler Passes
Analysis of input program (front-end), then synthesis of output program (back-end):

character stream → Lexical Analysis → token stream → Syntactic Analysis → abstract syntax tree → Semantic Analysis → annotated AST → Intermediate Code Generation → intermediate form → Optimization → intermediate form → Code Generation → target language
• The intermediate code generator is a phase in the
compiler that translates the source code into an
intermediate representation, which is closer to
the target machine code but still independent of
the target machine.
• This intermediate code serves as a bridge
between the high-level source code and the low-
level machine code.
• It simplifies the process of code optimization and
target code generation.
• The main goal of the intermediate code generator
is to produce an efficient and easily transformable
intermediate representation of the source code.
Intermediate code generation
Outline
• Why front end and back end in compilers?
• Different forms of intermediate code generation
• Postfix notation
• Abstract Syntax Tree
• Three Address Code
Recall
• The objective of a compiler is to analyze a source
program and produce target code.
• Front end analyzes the source program and generates
an intermediate code.
• Back end takes the intermediate code as input and
generates the target code.
Need for Intermediate code
Without an intermediate code, each source/target pair needs its own full compiler:

C Program → C Compiler for 80X86 System → Machine instructions for 80X86 systems
C Program → C Compiler for SPARC System → Machine instructions for SPARC systems
Need for Intermediate code
With an intermediate code, one front end serves multiple back ends:

C Program → C Compiler front end → Intermediate code
Intermediate code → Compiler back end for 80X86 System → Machine instructions for 80X86 systems
Intermediate code → Compiler back end for SPARC System → Machine instructions for SPARC systems
Advantages of using Intermediate Code
• Retargeting to a different machine.
• Optimization of the code at intermediate level.
Language 1, Language 2, Language 3 → Front ends → Intermediate Representation → Back ends → Target machine 1, Target machine 2, Target machine 3

With m source languages and n target machines, only m front ends and n back ends are needed, instead of m x n full compilers.
Recall
Source program → Front-end → Intermediate code (AST/DAG, three-address code, postfix) → Back-end → Target machine code

Enables machine-independent code optimization
Intermediate Representations
• We could translate the source program directly into the
target language.
• However, there are benefits to having an intermediate,
machine-independent representation.
• A clear distinction between the machine-independent and machine-dependent
parts of the compiler
• Retargeting is facilitated: implementing language processors for new
machines requires replacing only the back-end
• We can apply machine-independent code optimization techniques
Intermediate Representations
• Intermediate representations span the gap between the
source and target languages.
• High Level Representations
• closer to the source language
• easy to generate from an input program
• code optimizations may not be straightforward
• Low Level Representations
• closer to the target machine
• suitable for register allocation and instruction selection
• easier for optimizations, final code generation
Options for intermediate code
• There are several options for intermediate code.
• Specific to the language being implemented
• P-code for Pascal
• Object code for C
• Bytecode for Java
• Language independent:
• 3-address code
Intermediate forms
• Postfix notation
• Syntax tree (Graphical representation of statements)
• Abstract Syntax Tree
• Parse Tree
• Directed acyclic graph (DAG)
• Three-address code
• Quadruple
Postfix Notation:
• Any expression can be written unambiguously, without
parentheses or any need to state operator precedence.
• Ideally suited for source languages that deal primarily
with expressions, like SNOBOL.
• We can easily build interpreters for these expressions,
using a stack.
• This is the procedure followed by most assemblers.
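The stack-based interpretation described above can be sketched as follows (a minimal illustration, not from the slides; the token format and operator set are assumptions):

```python
# Minimal sketch of a stack-based postfix interpreter.
def eval_postfix(tokens):
    """Evaluate a postfix token list using a stack."""
    stack = []
    ops = {
        "+": lambda a, b: a + b,
        "-": lambda a, b: a - b,
        "*": lambda a, b: a * b,
        "/": lambda a, b: a / b,
    }
    for tok in tokens:
        if tok in ops:
            b = stack.pop()           # right operand is on top
            a = stack.pop()
            stack.append(ops[tok](a, b))
        else:
            stack.append(float(tok))  # operand: push its value
    return stack.pop()

# a*(9+d) with a=2, d=1  ->  postfix: a 9 d + *
print(eval_postfix(["2", "9", "1", "+", "*"]))  # 20.0
```

Each operator pops its two operands and pushes the result, which is why postfix needs no parentheses or precedence rules.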
Register Allocator
• A register allocator assigns temporary variables (which are
often represented in the intermediate code) to a limited
number of CPU registers in the target machine. Since most
modern CPUs have a limited number of registers, the
allocator must decide which variables should be stored in
registers and which should be stored in memory.
• The goal of register allocation is to minimize register spilling,
where variables are stored in slower memory because there
aren't enough registers available. Efficient register allocation
can have a significant impact on the performance of the
generated code.
How to generate the postfix code?
• A semantic stack is used to represent the postfix code being
generated.
• This stack is initially empty.
• Semantic actions are connected with each production (as
seen in semantic analysis).
• Only one semantic action is used to create the semantic
stack:
• push <value> : place a value (address or operator) on
the semantic stack
How to generate the postfix code?
All logic or arithmetic operations are assumed to be directly
supported by the machine:
+, *, /, -, and, or
How to generate the postfix code?
An example ‘semantic grammar’:

E -> E+T {push +}
E -> E-T {push -}
E -> T
T -> T*F {push *}
T -> T/F {push /}
T -> F
F -> i {push i}
F -> (E)

Example: a*(9+d) → a9d+*
Parentheses have no effect on the resulting postfix code.
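The {push ...} actions above can be sketched as a small translator (an illustrative sketch: the function name to_postfix, single-character tokens, and the loop-based handling of left recursion are assumptions, not from the slides):

```python
# Recursive-descent translator emitting postfix via the 'push' actions,
# with the grammar's left recursion replaced by iteration.
def to_postfix(expr):
    toks = list(expr.replace(" ", ""))  # single-character tokens only
    out = []
    pos = 0

    def peek():
        return toks[pos] if pos < len(toks) else None

    def parse_E():
        nonlocal pos
        parse_T()
        while peek() in ("+", "-"):
            op = toks[pos]; pos += 1
            parse_T()
            out.append(op)            # {push +} / {push -}

    def parse_T():
        nonlocal pos
        parse_F()
        while peek() in ("*", "/"):
            op = toks[pos]; pos += 1
            parse_F()
            out.append(op)            # {push *} / {push /}

    def parse_F():
        nonlocal pos
        if peek() == "(":
            pos += 1                  # F -> (E): parentheses emit nothing
            parse_E()
            pos += 1                  # consume ')'
        else:
            out.append(toks[pos])     # F -> i: {push i}
            pos += 1

    parse_E()
    return "".join(out)

print(to_postfix("a*(9+d)"))  # a9d+*
```

Note how the operand is pushed as soon as it is parsed, while each operator is pushed only after both of its operands, exactly as the semantic actions prescribe.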
Suffix notation
a*(b+c/a)
The Result
• We can treat the stack generated by the prior process as the intermediate representation
of the input, in postfix notation form: a b c a / + *
Extending to other structures:
• It is clear that postfix notation can be used to represent
mathematical expressions.
• Can it also be extended to handle all other programming
constructs?
E.g., assignment? Control-flow structures?
• If feasible, this notation would offer a good candidate for
intermediate code representation:
• It is simple
• It is easy to interpret
• It is unambiguous
Unary Operators:
• Some operators, such as ‘-’, can be a unary (single
argument) or binary operator.
• If we just map these operators into the postfix notation, it
will be ambiguous whether they operate on one or two
arguments.
• Two solutions:
• 1. Convert to a binary op:
• Map: ‘-a’ to ‘0a-’
• 2. Create a new operator for the unary use of ‘-’:
• map: ‘-a’ to ‘a_’
How to generate the postfix code:
An example ‘semantic grammar’:

E -> E+T {push +}
E -> E-T {push -}
E -> T
T -> T*F {push *}
T -> T/F {push /}
T -> F
F -> i {push i}
F -> (E)
F -> -F {push _}

Note that the unary ‘-’ has its own distinct operator, ‘_’.
Extending to other Structures
Assignment:
• An assignment statement V=a+b can be represented as follows: Vab+=
• The ‘=’ operator thus:
• pops the top two elements from the stack,
• assumes the earlier (lower) one is a memory address,
• and stores the other value in that location.

Example: a=a*(9+d) → aa9d+*=
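One way the ‘=’ operator could be interpreted is sketched below (assumed details, not prescribed by the slides: names are pushed as addresses, and memory is a simple dictionary):

```python
# Postfix interpreter extended with assignment: a name on the stack is
# an address; '=' stores the top value into the address beneath it.
def run_postfix(tokens, memory):
    stack = []

    def value(x):                     # resolve a name to its stored value
        return memory[x] if isinstance(x, str) else x

    for tok in tokens:
        if tok in ("+", "-", "*", "/"):
            b, a = value(stack.pop()), value(stack.pop())
            stack.append({"+": a + b, "-": a - b,
                          "*": a * b, "/": a / b}[tok])
        elif tok == "=":
            val = value(stack.pop())
            addr = stack.pop()        # the earlier element: an address
            memory[addr] = val
        elif tok.isdigit():
            stack.append(int(tok))    # constant: push its value
        else:
            stack.append(tok)         # name: push its address
    return memory

# a = a*(9+d)  ->  a a 9 d + * =
mem = run_postfix(["a", "a", "9", "d", "+", "*", "="], {"a": 2, "d": 1})
print(mem["a"])  # 20
```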
Flow Control
• Many flow control statements (if-then, while, for, etc.) can
be mapped onto assembler which depends on some sort
of conditional or unconditional jump statement
CMP [A], [B]
JZ L1
• The intermediate code can make use of the same kind of
solution:
label jmp_to
• jmp_to is a unary operator which takes the prior element
on the stack as a memory address.
Similar operators: jump_if_zero and jump_if_false
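A possible interpretation of these jump operators over a linear code array is sketched below (the instruction names jmp_to and jump_if_false follow the text; the code layout and the helper run are assumptions):

```python
# Each jump pops its target address from the stack; jump_if_false also
# pops the condition beneath it.
def run(code, env):
    stack, pc = [], 0
    while pc < len(code):
        tok = code[pc]; pc += 1
        if tok == "jmp_to":
            pc = stack.pop()               # unconditional: pop target
        elif tok == "jump_if_false":
            target = stack.pop()
            if not stack.pop():            # condition is beneath the target
                pc = target
        elif tok == "<":
            b, a = stack.pop(), stack.pop()
            stack.append(a < b)
        elif isinstance(tok, str) and tok in env:
            stack.append(env[tok])         # variable: push its value
        else:
            stack.append(tok)              # literal (value or address)

    return stack

# if (f < 6) push "then" else push "else"
code = ["f", 6, "<", 8, "jump_if_false", "then", 9, "jmp_to", "else"]
print(run(code, {"f": 3}))  # ['then']
```

Running the same code with f = 7 leaves 'else' on the stack instead, since the conditional jump transfers control past the then-part.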
Generating Target Code from Postfix form
• Generating Assembler from Postfix form: stack-based
assembler
• The operators used in postfix notation are similar to
those used in most assembler languages.
• The stack data structure is also a basis for most
assemblers
• It should thus be easy to translate from postfix
representation to a given assembler language.
Postfix form

Source:
if (f < 6)
    z = p/q;
else if (f == 6)
    z = p*q;
else
    z = p-q;

Postfix form: f6< ?z p q/=: f6==?z p q*=: z p q-

Intermediate code:
1: f
2: 6
3: JGE 10
4: z
5: p
6: q
7: /
8: =
9: Jump 25
10: f
11: 6
12: JNE 20
13: z
14: p
16: q
17: *
18: =
19: Jump 25
20: z
21: p
22: q
23: -
24: =
25: (end)
Syntax Trees
• A syntax tree shows the structure of a program by
abstracting away irrelevant details from a parse tree.
• Each node represents a computation to be performed;
• The children of the node represent what that
computation is performed on.
• Syntax trees decouple parsing from subsequent
processing.
Syntax Trees: Structure
• Expressions:
• leaves: identifiers or constants;
• internal nodes are labeled with operators;
• the children of a node are its operands.
• Statements:
• a node’s label indicates what kind of statement
it is;
• the children correspond to the components of
the statement.
Abstract Syntax Tree
• A block { S1; S2; S3; } becomes a "block" node with children S1, S2, and S3.
• A loop while (expr) S1; becomes a "while" node with children expr and S1.
• A conditional if (expr) S1; else S2; becomes an "if-else" node with children expr, S1, and S2.
Abstract Syntax Tree
if (a < b) { p = q; r = s; } else { c = d; e = f; }

The AST is an "if-else" node with three children:
• the condition: a "<" node with children a and b
• the then-branch: a stmt-list with children "=" (p, q) and "=" (r, s)
• the else-branch: a stmt-list with children "=" (c, d) and "=" (e, f)
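A tree like this could be built with a generic node type, for example (Node and leaf are illustrative names, not from the slides):

```python
# Generic AST node: a label plus zero or more child nodes.
class Node:
    def __init__(self, label, *children):
        self.label = label
        self.children = list(children)

    def __repr__(self):
        if not self.children:
            return self.label
        return f"{self.label}({', '.join(map(repr, self.children))})"

def leaf(name):
    return Node(name)

# if (a < b) { p = q; r = s; } else { c = d; e = f; }
tree = Node("if-else",
            Node("<", leaf("a"), leaf("b")),
            Node("stmt-list",
                 Node("=", leaf("p"), leaf("q")),
                 Node("=", leaf("r"), leaf("s"))),
            Node("stmt-list",
                 Node("=", leaf("c"), leaf("d")),
                 Node("=", leaf("e"), leaf("f"))))

print(tree)
```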
Syntax Trees: Example
Grammar :
E -> E + T | T
T -> T * F | F
F -> ( E ) | id
Input: id + id * id

Result: a "+" node whose two children are id and a "*" node (with children id and id), since * binds tighter than +.
Constructing Syntax Trees
• General Idea: construct bottom-up using synthesized attributes.
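The synthesized-attribute idea can be sketched as follows: each reduction builds a node from the already-built nodes of the production body (a sketch with assumed helper names mk_node and mk_leaf):

```python
# Each production's action synthesizes a node from its children's nodes.
def mk_leaf(name):
    return (name,)                     # leaf: identifier or constant

def mk_node(op, left, right):
    return (op, left, right)           # interior node labeled with operator

# Reductions for id + id * id happen bottom-up, so the '*' node is
# built before the '+' node that takes it as a child:
t1 = mk_leaf("id")                                # F -> id
t2 = mk_node("*", mk_leaf("id"), mk_leaf("id"))   # T -> T * F
root = mk_node("+", t1, t2)                       # E -> E + T
print(root)  # ('+', ('id',), ('*', ('id',), ('id',)))
```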
Directed Acyclic Graph (DAG):
• More compact representation
• Gives clues regarding generation of efficient code
Generating DAG from AST
Pros: easy restructuring of code and/or expressions for intermediate code optimization
Cons: memory intensive
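One common way to obtain a DAG rather than a tree is to hash nodes by (operator, children) so identical subexpressions are shared (a sketch with an assumed representation, not from the slides):

```python
# DAG builder: a table keyed by (op, children) ensures each distinct
# subexpression is created only once.
class DagBuilder:
    def __init__(self):
        self.table = {}   # (op, left_id, right_id) -> node id
        self.nodes = []   # node id -> (op, left_id, right_id)

    def node(self, op, left=None, right=None):
        key = (op, left, right)
        if key not in self.table:      # new subexpression: allocate a node
            self.table[key] = len(self.nodes)
            self.nodes.append(key)
        return self.table[key]         # otherwise reuse the existing node

# b * -c + b * -c : both 'b * -c' subtrees collapse into one DAG node
d = DagBuilder()
b, c = d.node("b"), d.node("c")
m1 = d.node("*", b, d.node("neg", c))
m2 = d.node("*", b, d.node("neg", c))
root = d.node("+", m1, m2)
print(m1 == m2, len(d.nodes))  # True 5
```

The shared node is what lets the DAG-based code in the three-address example below compute b * -c only once.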
Three Address Code
• Low-level Intermediate Representation.
• Addresses
• Instruction operands are addresses and come in one of
three flavors:
• name : will ultimately be a key in the symbol table
• constant : holds literal value
• temporary : generated by compiler to hold intermediate result
• Instructions are of the form x = y op z, where x, y, z are
variables, constants, or "temporaries".
• At most one operator is allowed on the RHS, so there are no
"built-up" expressions.
• Instead, expressions are computed using temporaries
(compiler-generated variables).
Three-Address Code - Example
Compile: a : = b * -c + b * -c
Code for syntax tree
t1 := - c
t2 := b * t1
t3 := - c
t4 : = b * t3
t5 := t2 + t4
a := t5
Code for DAG
t1 := - c
t2 := b * t1
t5 := t2 + t2
a := t5
Three-Address Code
Linearized representation of the AST for a+a*(b-c)+(b-c)*d:

t1 = b - c
t2 = a * t1
t3 = a + t2
t4 = b - c
t5 = t4 * d
t6 = t3 + t5
Three address statements
• Assignment statements: x := y op z, x := op y
• Indexed assignments: x := y[i], x[i] := y
• Pointer assignments: x := &y, x := *y, *x := y
• Copy statements: x := y
• Unconditional jumps: goto lab
• Conditional jumps: if x relop y goto lab
• Function calls: param x … call p, n
return y
Creating 3AC
• Assume a bottom-up parser
– Covers a wider range of grammars
– LALR is sufficient for most programming languages
• Creating 3AC via SDD
• Attributes examples:
– code – code generated for a nonterminal
– temp – name of variable that stores result of nonterminal
• newTemp() – helper function that returns the name of a new
variable
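The attributes above can be sketched for expressions as follows (a minimal illustration; the tuple-based AST and the names gen and new_temp are assumptions standing in for the slides' code, temp, and newTemp()):

```python
# Each node synthesizes a 'temp' holding its result and the 'code'
# that computes it; new_temp() supplies fresh temporary names.
counter = 0

def new_temp():
    global counter
    counter += 1
    return f"t{counter}"

def gen(node):
    """Return (temp, code) for a tuple AST: ('+', l, r) or a leaf name."""
    if isinstance(node, str):          # leaf: its 'temp' is the name itself
        return node, []
    op, left, right = node
    lt, lcode = gen(left)
    rt, rcode = gen(right)
    t = new_temp()                     # result of this node
    return t, lcode + rcode + [f"{t} = {lt} {op} {rt}"]

# a + a * (b - c)
temp, code = gen(("+", "a", ("*", "a", ("-", "b", "c"))))
for line in code:
    print(line)
# t1 = b - c
# t2 = a * t1
# t3 = a + t2
```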