0% found this document useful (0 votes)

47 views20 pages

Code Generation 5th Year Computer Science Course

Code generation is the final phase of a compiler that takes an intermediate representation of the source program and symbol table as input and produces target code as output. It aims to produce code that has the exact meaning of the source and is efficient. The key tasks of code generation include instruction selection, register allocation, and instruction ordering. It addresses issues like target code format, memory management, and evaluation order.

Uploaded by

Mekonnen Solomon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views20 pages

Code Generation 5th Year Computer Science Course

Uploaded by

Mekonnen Solomon

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Code Generation

1
Code Generation
It is the final phase of a compiler. It takes as input an intermediate representation
of the source program with supplementary information in symbol table and
produces as output an equivalent target program.

The source code written in a higher-level language is transformed into a lower-

level language that results in a lower-level object code, which should have the
following minimum properties:
 It should carry the exact meaning of the source code.

 It should be efficient in terms of CPU usage and memory management.

It is used to produce the target code for three-address statements. It uses
registers to store the operands of the three address statement.
2
Code Generation
 Code generator main tasks:

 Instruction selection: factors to determining (level of IR, nature of ISA

(instruction set architecture) and desired quality of generated code)
 Register allocation and assignment

 Instruction ordering

Consider the three address statement x:= y + z. It can have the following
sequence of codes:
MOV z, R0
ADD y, R0
3
Code Generation
 presented below can be used whether or not an
optimizing phase occurs before code generation.

.
4
ISSUES IN THE DESIGN OF A
CODE GENERATOR
 The following issues arise during the code generation
phase:
 Input to code generator
 Target program
 Memory management
 Instruction selection
 Register allocation
 Evaluation order
5
Input to code generator
 The input to the code generation consists of the intermediate
representation of the source program produced by front end, together
with information in the symbol table to determine run-time addresses of
the data objects denoted by the names in the intermediate representation.

 Intermediate representation can be :

 Linear representation such as postfix notation
 Three address representation such as quadruples
 Virtual machine representation such as stack machine code
 Graphical representations such as syntax trees and DAGs

6
Target program
 The output of the code generator is the target program.
The output may be :
 Absolute machine language
 Producing an absolute machine language program as output has the advantage that
it can be placed in a fixed location in memory and immediately executed.

 Relocatable machine language

 Producing a relocatable machine language program as output allows subprograms
to be compiled separately.

7
Target program
 A set of relocatable object modules can be linked together and
loaded for execution by a linking loader.
 If the target machine does not handle relocation automatically,
the compiler must provide explicit relocation information
to the loader, to link the separately compiled program
segments.
 Assembly language
 Producing an assembly language program as output makes
the process of code generation some what easier
8
Memory Management
 Names in the source program are mapped to addresses of data
objects in run-time memory by the front end and code generator.

 It makes use of symbol table, that is, a name in a three-address

statement refers to a symbol table entry for the name.

 Labels in three-address statements have to be converted to

addresses of instructions.

9
Instruction selection
 The instructions of target machine should be complete and uniform.

 Instruction speeds and machine idioms are important factors when

efficiency of target program is considered.

 The quality of the generated code is determined by its speed and

size.

 The factors to be considered during instruction selection are:

 The uniformity and completeness of the instruction set.
 Instruction speed.
 Size of the instruction set.
10
Instruction selection
 The former statement can be translated into the latter statement as shown below:

Eg., for the following address code is:

a := b + c
d := a + e

inefficient assembly code is:

MOV b, R0 R0 ← b

ADD c, R0 R0 ← c + R0

MOV R0, a a ← R0

MOV a, R0 R0 ← a

ADD e, R0 R0 ← e + R0

MOV R0 , d d ← R0

Here the fourth statement is redundant, and so is the third statement if

11
Register allocation
 Instructions involving register operands are usually shorter
and faster than those involving operands in memory.
Therefore efficient utilization of registers is particularly
important in generating good code.

 The use of registers is subdivided into two sub problems :

 Register allocation - the set of variables that will reside in

registers at a point in the program is selected.

 Register assignment - the specific register that a value
12
Evaluation order
 It affects the efficiency of the target code.

 Some computation orders require fewer registers to

hold intermediate results than others.

 Picking a best order in the general case is a difficult NP-

complete problem.

 Initially, we shall avoid the problem by generating code

for the three-address statements in the order in which they
13
Basic Blocks and Control Flow Graphs
 A basic block is the longest sequence of three-address codes with the
following properties.
 The control flows to the block only through the first three-address code.
 The flow goes out of the block only through the last three-address code.

 A control-ﬂow graph is a directed graph G = (V,E), where the nodes are

the basic blocks and the edges correspond to the ﬂow of control from
one basic block to another. As an example the edge eij = (vi , vj)
corresponds to the transfer of ﬂow from the basic block vi to the basic
block vj.

14
Directed Acyclic Graph
 It is a tool that depicts the structure of basic blocks, helps
to see the flow of values flowing among the basic blocks,
and offers optimization too. DAG provides easy
transformation on basic blocks. DAG can be understood
here:
 Leaf nodes represent identifiers, names or constants.
 Interior nodes represent operators.
 Interior nodes also represent the results of expressions or the
identifiers/name where the values are to be stored or assigned.
15
Directed Acyclic Graph
 t0 = a + b

 t1 = t0 + c

 d = t0 + t1

16

Descriptors
 The code generator has to track both the registers (for availability) and
addresses (location of values) while generating the code. For both of
them, the following two descriptors are used:

 Register descriptor :
 It is used to inform the code generator about the availability of registers.
 It keeps track of values stored in each register.
 Whenever a new register is required during code generation, this
descriptor is consulted for register availability.
 The register descriptors show that all the registers are initially empty.

17
Descriptors
 Address descriptor :
 An address descriptor is used to store the location where current
value of the name can be found at run time.
 Values of the names (identifiers) used in the program might be
stored at different locations while in execution.
 It used to keep track of memory locations where the values of
identifiers are stored.
 These locations may include CPU registers, heaps, stacks, memory
or a combination of the mentioned locations.
18
getReg Function
 getReg : Code generator uses getReg function to
determine the status of available registers and the location
of name values. getReg works as follows:
 If variable Y is already in register R, it uses that register.
 Else if some register R is available, it uses that register.
 Else if both the above options are not possible, it chooses a
register that requires minimal number of load and store
instructions.
19
A code-generation algorithm
 The algorithm takes a sequence of three-address statements as input. For each three
address statement of the form x : = y op z perform the various actions. These are as
follows:
 Invoke a function getreg to find out the location L where the result of computation b op c should
be stored.
 Consult the address description for y to determine y'. If the value of y currently in memory and
register both then prefer the register y' . If the value of y is not already in L then generate the
instruction MOV y' , L to place a copy of y in L.
 Generate the instruction OP z' , L where z' is used to show the current location of z. if z is in both
then prefer a register to a memory location. Update the address descriptor of x to indicate that x
is in location L. If x is in L then update its descriptor and remove x from all other descriptor.
 If the current value of y or z have no next uses or not live on exit from the block or in register
then alter the register descriptor to indicate that after execution of x : = y op z those register will
no longer contain y or z. 20

Combinatorics and Algos - ETH (2018)
100% (1)
Combinatorics and Algos - ETH (2018)
261 pages
Analysis and Design of Algorithm Lecture Notes
No ratings yet
Analysis and Design of Algorithm Lecture Notes
337 pages
Apache Airflow
No ratings yet
Apache Airflow
24 pages
Code Generation and Instruction Selection Unit-8
No ratings yet
Code Generation and Instruction Selection Unit-8
6 pages
Graph (Graph DS, BFS, DFS, Prim's, Krushkal's) PDF
No ratings yet
Graph (Graph DS, BFS, DFS, Prim's, Krushkal's) PDF
60 pages
Chapter 8 Code Optimization and Code Generation
No ratings yet
Chapter 8 Code Optimization and Code Generation
58 pages
BCS 324 Topic 6
No ratings yet
BCS 324 Topic 6
56 pages
Cs - 502 Final Term Solve by Vu - Toper
No ratings yet
Cs - 502 Final Term Solve by Vu - Toper
54 pages
Codegeneration Final
No ratings yet
Codegeneration Final
31 pages
Algorithms Simplified - A Minimalist Approach To Problem-Solving by Rohith B. V.
No ratings yet
Algorithms Simplified - A Minimalist Approach To Problem-Solving by Rohith B. V.
146 pages
UNIT-5 Notes
No ratings yet
UNIT-5 Notes
14 pages
Chapter 6 Code Generation and Optimization
No ratings yet
Chapter 6 Code Generation and Optimization
34 pages
CS6109 Module 11
No ratings yet
CS6109 Module 11
41 pages
2018 - Lecture - 14 - Code Generation - 2 PDF
No ratings yet
2018 - Lecture - 14 - Code Generation - 2 PDF
96 pages
Code Generation (Autosaved)
No ratings yet
Code Generation (Autosaved)
48 pages
CODE Generation CD
No ratings yet
CODE Generation CD
57 pages
Directory Structure
No ratings yet
Directory Structure
27 pages
Compiler Design Lec-8Code Generation and Optimization
No ratings yet
Compiler Design Lec-8Code Generation and Optimization
46 pages
1-CodeGeneration Unit5 Chap8 Lecture44
No ratings yet
1-CodeGeneration Unit5 Chap8 Lecture44
17 pages
Dependency Parsing: Pawan Goyal
No ratings yet
Dependency Parsing: Pawan Goyal
38 pages
Compiler Design - Code Generation
No ratings yet
Compiler Design - Code Generation
62 pages
Unit 5 1 Basicblocks
No ratings yet
Unit 5 1 Basicblocks
39 pages
CH5 2
No ratings yet
CH5 2
23 pages
Unit Iv Non Linear Data Structures - Graphs
No ratings yet
Unit Iv Non Linear Data Structures - Graphs
29 pages
Chapter 7
No ratings yet
Chapter 7
16 pages
Unit V
No ratings yet
Unit V
21 pages
18 Code Gen
No ratings yet
18 Code Gen
24 pages
CH5 2
No ratings yet
CH5 2
24 pages
Code Generation
No ratings yet
Code Generation
21 pages
Unit-5-Code Gen
No ratings yet
Unit-5-Code Gen
13 pages
13-Issues in The Design of A Code Generator - 22!10!2024
No ratings yet
13-Issues in The Design of A Code Generator - 22!10!2024
54 pages
UNIT 4 - Chapter 1 in Compiler Design
No ratings yet
UNIT 4 - Chapter 1 in Compiler Design
51 pages
Unit-4-5
No ratings yet
Unit-4-5
36 pages
Code Generation and Optimization
No ratings yet
Code Generation and Optimization
42 pages
15Cs314J - Compiler Design: Unit 4
No ratings yet
15Cs314J - Compiler Design: Unit 4
71 pages
34-Issues in The Design of A Code Generator - Target Machine-25-10-2024
No ratings yet
34-Issues in The Design of A Code Generator - Target Machine-25-10-2024
29 pages
Unit V
No ratings yet
Unit V
42 pages
CD Unit-6 LM
No ratings yet
CD Unit-6 LM
17 pages
Lecture 8 - Code Generation
No ratings yet
Lecture 8 - Code Generation
19 pages
CD Unit 6.1
No ratings yet
CD Unit 6.1
20 pages
CC 7
No ratings yet
CC 7
20 pages
Darshan Sem7 170701 CD 2014
No ratings yet
Darshan Sem7 170701 CD 2014
81 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
Compiler Design (Unit-5)
No ratings yet
Compiler Design (Unit-5)
22 pages
Code Generation I: Compiler Construction
No ratings yet
Code Generation I: Compiler Construction
28 pages
Unit Viii
No ratings yet
Unit Viii
16 pages
Code Generation F
No ratings yet
Code Generation F
7 pages
Code Geneartion
No ratings yet
Code Geneartion
13 pages
Code Generation
No ratings yet
Code Generation
25 pages
Lec 12 - Code Generator
No ratings yet
Lec 12 - Code Generator
23 pages
Unit 5
No ratings yet
Unit 5
10 pages
Chapter 10 - Code Generation
No ratings yet
Chapter 10 - Code Generation
31 pages
Issues in The Design of A Code Generator
No ratings yet
Issues in The Design of A Code Generator
4 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
Compiler Design and Construction Lecture Notes
No ratings yet
Compiler Design and Construction Lecture Notes
28 pages
Unit 5
No ratings yet
Unit 5
13 pages
Experiment No 6 - DONE
No ratings yet
Experiment No 6 - DONE
8 pages
Introduction To Compilers: Jun.-Prof. Dr. Christian Plessl Custom Computing University of Paderborn
No ratings yet
Introduction To Compilers: Jun.-Prof. Dr. Christian Plessl Custom Computing University of Paderborn
51 pages
Principles of Compiler Design (Seng 3043) : Chapter - 8 Code Generation
No ratings yet
Principles of Compiler Design (Seng 3043) : Chapter - 8 Code Generation
25 pages
Code Generation I
No ratings yet
Code Generation I
32 pages
Acd 5
No ratings yet
Acd 5
9 pages
Code Generation: Issues in The Design of A Code Generator
No ratings yet
Code Generation: Issues in The Design of A Code Generator
33 pages
Code Generation
No ratings yet
Code Generation
49 pages
Unit 1 Topic 2 Bayesian Modeling
No ratings yet
Unit 1 Topic 2 Bayesian Modeling
76 pages
Unit-Iv: Intermediate Code Generation
No ratings yet
Unit-Iv: Intermediate Code Generation
19 pages
Issues in Code Generator-Pages-2
No ratings yet
Issues in Code Generator-Pages-2
3 pages
Code Opti
No ratings yet
Code Opti
26 pages
Sub Theme 5 - Full Paper
No ratings yet
Sub Theme 5 - Full Paper
179 pages
170 Dis
No ratings yet
170 Dis
5 pages
Code Generation
No ratings yet
Code Generation
43 pages
Target Code Generation: Utkarsh Jaiswal 11CS30038
No ratings yet
Target Code Generation: Utkarsh Jaiswal 11CS30038
15 pages
Unit 4 PCD
No ratings yet
Unit 4 PCD
15 pages
Compiler Design Code Generation
No ratings yet
Compiler Design Code Generation
4 pages
Graph Theory Representation of Graphs
No ratings yet
Graph Theory Representation of Graphs
11 pages
Mathematical Notations and Set Theory
No ratings yet
Mathematical Notations and Set Theory
63 pages
Dag For Basic Block
No ratings yet
Dag For Basic Block
9 pages
Chapter 6 - Mining Social Network Graphs PDF
No ratings yet
Chapter 6 - Mining Social Network Graphs PDF
74 pages
Biconnected !! Graph Theory
No ratings yet
Biconnected !! Graph Theory
52 pages
Satya Ans 10
No ratings yet
Satya Ans 10
16 pages
Graphs Topological Sort Single Source Shortest Path: Manoj Kumar DTU, Delhi
No ratings yet
Graphs Topological Sort Single Source Shortest Path: Manoj Kumar DTU, Delhi
25 pages
03 Graphs
No ratings yet
03 Graphs
51 pages
Teach Yourself TCP-IP in 14 Days (Unix)
No ratings yet
Teach Yourself TCP-IP in 14 Days (Unix)
20 pages
Design and Implementation of The PULSAR Programming System For Large Scale Computing
No ratings yet
Design and Implementation of The PULSAR Programming System For Large Scale Computing
23 pages
Spark 101 - Overview and Efficient Use
No ratings yet
Spark 101 - Overview and Efficient Use
9 pages
Finding Minimal Dseparator - Jin Tian, Azaria PAz, Judea Pearl
No ratings yet
Finding Minimal Dseparator - Jin Tian, Azaria PAz, Judea Pearl
15 pages
Compiler Design Assignment
No ratings yet
Compiler Design Assignment
19 pages
Detecting Bottlenecks in Parallel DAG-based Data Flow Programs
No ratings yet
Detecting Bottlenecks in Parallel DAG-based Data Flow Programs
10 pages
Fundamental Algorithms Final Exam
No ratings yet
Fundamental Algorithms Final Exam
2 pages
Compiler Question Bank
No ratings yet
Compiler Question Bank
3 pages
Assembly Programming:Simple, Short, And Straightforward Way Of Learning Assembly Language
From Everand
Assembly Programming:Simple, Short, And Straightforward Way Of Learning Assembly Language
Sherwyn Allibang
5/5 (2)