AUTOMATA THEORY & COMPILER DESIGN
UNIT V
CODE GENERATION
Code generator converts the intermediate representation of source code into a form
that can be readily executed by the machine. A code generator is expected to generate the
correct code. Designing of the code generator should be done in such a way that it can be
easily implemented, tested, and maintained.
2. Target program:
The target program is the output of the code generator. The output may be absolute
machine language, relocatable machine language, or assembly language.
Absolute machine language as output has the advantage that it can be placed in a
fixed memory location and immediately executed. For example, WATFIV is a compiler
that produces absolute machine code as output.
Assembly language as output makes code generation easier. We can generate
symbolic instructions and use the macro facilities of the assembler in generating code.
However, an additional assembly step is then needed after code generation.
3. Memory management
Mapping the names in the source program to the addresses of data objects is done by
the front end and the code generator. A name in the three address statements refers to the
symbol table entry for the name. Then from the symbol table entry, a relative address can be
determined for the name.
4. Instruction selection
Selecting the best instructions improves the efficiency of the program. The
instruction set should be complete and uniform. Instruction speeds and machine idioms
also play a major role when efficiency is considered. If we do not care about the efficiency
of the target program, however, instruction selection is straightforward.
For example, the three-address statements below would each be translated, statement by
statement, into the code sequence that follows:
P:=Q+R
S:=P+T
MOV Q, R0
ADD R, R0
MOV R0, P
MOV P, R0
ADD T, R0
MOV R0, S
Here the fourth statement is redundant, since it reloads the value of P that the
previous statement has just stored, leading to an inefficient code sequence. A given
intermediate representation can be translated into many code sequences, with significant
cost differences between the different implementations. Prior knowledge of instruction
cost is needed in order to design good sequences, but accurate cost information is
difficult to predict.
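Such a redundant reload can be removed by a simple peephole pass over the generated code. The sketch below is illustrative: the tuple instruction format and the helper name are assumptions, not part of the notes.

```python
# Minimal peephole pass: drop "MOV x, R" when it immediately follows "MOV R, x",
# since the value of x is still in register R. Instructions are assumed to be
# (opcode, src, dst) tuples for illustration.

def remove_redundant_loads(code):
    out = []
    for instr in code:
        if (out
                and instr[0] == "MOV"
                and out[-1][0] == "MOV"
                and out[-1][1] == instr[2]      # previous instruction stored from this register
                and out[-1][2] == instr[1]):    # into the location we would now reload
            continue  # the value is already in the register; skip the reload
        out.append(instr)
    return out

code = [
    ("MOV", "Q", "R0"),
    ("ADD", "R", "R0"),
    ("MOV", "R0", "P"),
    ("MOV", "P", "R0"),   # redundant: P was just stored from R0
    ("ADD", "T", "R0"),
    ("MOV", "R0", "S"),
]
print(remove_redundant_loads(code))
```

Applied to the six-instruction sequence above, the pass removes exactly the fourth statement.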
5. Register allocation
Registers can be accessed faster than memory, and instructions involving register
operands are shorter and faster than those involving memory operands. The following
sub-problems arise when we use registers:
During register allocation, we select the set of variables that will reside
in registers at each point in the program.
During a subsequent register assignment phase, the specific register in which each
variable resides is picked.
To understand the concept, consider the following three-address code sequence:
t:=a+b
t:=t*c
t:=t/d
Their efficient machine code sequence is as follows:
MOV a,R0
ADD b,R0
MUL c,R0
DIV d,R0
MOV R0,t
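The sequence above can be produced by a very simple generator that keeps the running value in one register. The sketch below is an assumption about how such a generator might look; the tuple statement format is invented for illustration.

```python
# Naive code generation for straight-line three-address code that accumulates
# a value in one register (R0). Statements are assumed to be tuples of the
# form (dst, lhs, op, rhs), meaning dst := lhs op rhs.

OPS = {"+": "ADD", "*": "MUL", "/": "DIV"}

def gen(stmts):
    code = []
    acc = None  # name whose value currently lives in R0
    for dst, lhs, op, rhs in stmts:
        if lhs != acc:                       # load only when the value is not already in R0
            code.append(f"MOV {lhs}, R0")
        code.append(f"{OPS[op]} {rhs}, R0")
        acc = dst
    code.append(f"MOV R0, {acc}")            # store the final result
    return code

print(gen([("t", "a", "+", "b"), ("t", "t", "*", "c"), ("t", "t", "/", "d")]))
```

On the three statements above this reproduces the five-instruction machine code sequence shown, with no intermediate stores of t.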
6. Evaluation order
The code generator decides the order in which the instruction will be executed. The
order of computations affects the efficiency of the target code. Among many computational
orders, some will require only fewer registers to hold the intermediate results. However,
picking the best order in the general case is a difficult NP-complete problem.
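For expression trees, the register demand of the best evaluation order can be computed by Sethi-Ullman labeling, which also underlies the gencode algorithm discussed later in this unit. A minimal sketch, where the tuple tree encoding ("op", left, right) is an illustrative assumption:

```python
# Sethi-Ullman labeling for a binary expression tree: the label of the root is
# the minimum number of registers needed to evaluate the tree without stores,
# i.e. the cost of the best evaluation order.

def label(node, leftmost=True):
    if isinstance(node, str):        # leaf operand
        return 1 if leftmost else 0  # only a leftmost leaf needs its own load
    _, left, right = node
    l = label(left, True)
    r = label(right, False)
    return max(l, r) if l != r else l + 1

# a + b fits in one register; (a - b) + (c * d) needs two whatever the order.
print(label(("+", "a", "b")))                          # → 1
print(label(("+", ("-", "a", "b"), ("*", "c", "d"))))  # → 2
```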
Approaches to code generation issues: Code generator must always generate the correct
code. It is essential because of the number of special cases that a code generator might face.
Some of the design goals of code generator are:
Correct
Easily maintainable
Testable
Efficient
Disadvantages in the design of a code generator:
Limited flexibility: Code generators are typically designed to produce a specific type
of code, and as a result, they may not be flexible enough to handle a wide range of inputs or
generate code for different target platforms. This can limit the usefulness of the code generator
in certain situations.
Maintenance overhead: Code generators can add a significant maintenance overhead
to a project, as they need to be maintained and updated alongside the code they generate. This
can lead to additional complexity and potential errors.
Debugging difficulties: Debugging generated code can be more difficult than
debugging hand-written code, as the generated code may not always be easy to read or
understand. This can make it harder to identify and fix issues that arise during development.
Performance issues: Depending on the complexity of the code being generated, a code
generator may not be able to generate optimal code that is as performant as hand-written code.
This can be a concern in applications where performance is critical.
Learning curve: Code generators can have a steep learning curve, as they typically
require a deep understanding of the underlying code generation framework and the
programming languages being used. This can make it more difficult to onboard new developers
onto a project that uses a code generator.
Over-reliance: It's important to ensure that the use of a code generator doesn't lead to
over-reliance on generated code, to the point where developers are no longer able to write code
manually when necessary. This can limit the flexibility and creativity of a development team,
and may also result in lower quality code overall.
Machine dependent code generation involves transformations that take into account the
target machine's properties, such as its registers and special machine instruction sequences. It's
closely tied to the architecture and instruction set of the target processor.
Machine dependent code generation is performed after the intermediate code has been
generated, when the code is transformed according to the target machine architecture. It involves
CPU registers and may use absolute memory references rather than relative references.
Machine dependent code generation can provide significant performance gains because
the code is specifically designed to take advantage of the specific features of the hardware.
Object code refers to the output of the compilation process after the source code has been
translated into machine code by a compiler or an assembler. Object code comes in different
forms depending on the stage of compilation, the programming language, and the target
platform. Here are some common forms of object code:
Machine Code: This is the binary representation of instructions that can be executed
directly by the CPU. Machine code is specific to the target architecture and is usually represented
in hexadecimal or binary format.
Object Files: These are binary files containing machine code along with additional
information such as symbol tables, relocation information, and metadata. Object files are
produced by the compilation process and can be linked together to create executable programs or
shared libraries.
Executable Files: These are fully linked object files that are ready to be executed by the
operating system. Executable files contain machine code, as well as headers and metadata
required by the operating system to load and execute the program.
Shared Libraries: Also known as dynamic link libraries (DLLs) on Windows or shared
objects (SO) on Unix-like systems, shared libraries contain reusable code and data that can be
dynamically linked into multiple executable programs at runtime. Shared libraries are similar to
executable files but are designed to be loaded into memory and shared among multiple
processes.
Each form of object code serves a specific purpose in the software development process,
from compilation and linking to program execution. The choice of object code form depends on
factors such as performance requirements, portability, and development workflow.
One approach to register allocation and assignment is to assign specific values in the
target program to certain registers. For example, we could decide to assign base addresses to one
group of registers, arithmetic computations to another, the top of the stack to fixed register, and
so on.
This approach has the advantage that it simplifies the design of a code generator. Its
disadvantage is that, applied too strictly, it uses registers inefficiently; certain registers may go
unused over substantial portions of code, while unnecessary loads and stores are generated into
the other registers. Nevertheless, it is reasonable in most computing environments to reserve a
few registers for base registers, stack pointers, and the like, and to allow the remaining registers
to be used by the code generator as it sees fit.
A primary task of the compiler is register allocation for the variables. The number of
registers available in any hardware architecture is very small compared to the number of
variables defined in a particular piece of a program. The getreg algorithm is simple but not
optimal, as it keeps all live variables in registers until the end of a block. The register
allocation problem is NP-complete. An alternative is global register allocation, which
assigns variables to the limited number of available registers and attempts to keep these
assignments consistent across basic block boundaries.
A key problem in code generation is deciding what values to hold in what registers.
Registers are the fastest computational unit on the target machine, but we usually do not have
enough of them to hold all values. Values not held in registers need to reside in memory.
Instructions involving register operands are invariably shorter and faster than those involving
operands in memory, so efficient utilization of registers is particularly important.
The use of registers is often subdivided into two sub problems
1. Register allocation, during which we select the set of variables that will reside in
registers at each point in the program.
2. Register assignment, during which we pick the specific register that a variable will
reside in.
Finding an optimal assignment of registers to variables is difficult, even with single-
register machines. Mathematically, the problem is NP-complete. The problem is further
complicated because the hardware and/or the operating system of the target machine may require
that certain register-usage conventions be observed.
a = b + c
d = a
c = a + d
Variables stored in Main Memory (offsets from the frame pointer fp):
a : 2(fp)    b : 4(fp)    c : 6(fp)    d : 8(fp)
Machine Level Instructions:
a=b+c
d=e+f
d=d+e
IFZ a goto L0
b=a+d
goto L1
L0 : b = a - d
L1 : i = b
Control Flow Graph:
At any point in time, the maximum number of live variables is 4 in this example. Thus we
require at most 4 registers for register allocation. If we draw a horizontal line at any
point in the live-range diagram, we can see that we need exactly 4 registers to perform
the operations in the program.
Spilling:
Sometimes the required number of registers may not be available. In such cases we may
need to move some variables to and from RAM; this is known as spilling.
Disadvantages:
The linear scan algorithm does not take into account the "lifetime holes" of a variable.
Variables are not live throughout the program, and this algorithm fails to record the holes
in the live range of a variable.
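This weakness comes from linear scan treating each live range as one contiguous interval. A minimal sketch, with illustrative interval values and a simplified spill policy (a variable that finds no free register is simply spilled):

```python
# Linear scan register allocation over live intervals {var: (start, end)}.
# Each variable occupies one contiguous interval, which is exactly why
# lifetime holes are missed: a variable dead in the middle of its range
# still holds its register. Interval values below are illustrative.

def linear_scan(intervals, k):
    order = sorted(intervals, key=lambda v: intervals[v][0])  # by start point
    active = []                       # (end, var, reg) currently holding a register
    free = [f"R{i}" for i in range(k)]
    assignment, spills = {}, []
    for v in order:
        start, end = intervals[v]
        for e, u, reg in list(active):       # expire intervals that already ended
            if e < start:
                active.remove((e, u, reg))
                free.append(reg)
        if free:
            reg = free.pop()
            assignment[v] = reg
            active.append((end, v, reg))
        else:
            spills.append(v)                 # no register free: spill (simplified)
    return assignment, spills

intervals = {"a": (0, 3), "b": (0, 4), "d": (1, 3), "e": (1, 2)}
print(linear_scan(intervals, 3))
```

With three registers one of the four overlapping intervals must be spilled; with four, none.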
Graph Coloring (Chaitin's Algorithm):
Register allocation is interpreted as a graph coloring problem.
Nodes represent live ranges of variables.
Edges represent interference between two live ranges.
Colors are assigned to the nodes such that no two adjacent nodes have the same color.
The number of colors used represents the minimum number of registers required.
A k-coloring of the graph is mapped to k registers.
Steps:
1. Choose an arbitrary node of degree less than k.
2. Push that node onto the stack and remove all of its edges.
3. Check whether every remaining node has degree less than k; if YES go to step 4, else go to #.
4. Repeat steps 1 and 2 until no nodes remain in the graph.
5. When the graph is empty, POP each node from the stack and color it such that no two
adjacent nodes have the same color.
6. The number of colors assigned to the nodes is the minimum number of registers needed.
# Spill some nodes based on their live ranges and then try again with the same k value. If the
problem persists, the assumed k value cannot be the minimum number of registers; try
increasing k by 1 and repeat the whole procedure.
For the same instructions mentioned above, the graph coloring will be as follows,
assuming k = 4.
Note: any color (register) can be assigned to 'i', as it has no edge to any other node.
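The steps above can be sketched as follows. The interference graph used here is an illustrative assumption modeled on the a/b/d/i variables of the earlier example, not the exact graph drawn in the notes.

```python
# Chaitin-style graph coloring sketch: repeatedly remove a node of degree < k
# and push it onto a stack; then pop nodes and assign each a color unused by
# its neighbours. Returns None when a spill would be required.

def color_graph(graph, k):
    g = {n: set(adj) for n, adj in graph.items()}   # working copy
    stack = []
    while g:
        node = next((n for n in g if len(g[n]) < k), None)
        if node is None:
            return None                 # every node has degree >= k: spill and retry
        stack.append(node)
        for m in g.pop(node):           # remove the node and its edges
            g[m].discard(node)
    colors = {}
    while stack:
        n = stack.pop()
        used = {colors[m] for m in graph[n] if m in colors}
        colors[n] = next(c for c in range(k) if c not in used)
    return colors

interference = {
    "a": {"b", "d"}, "b": {"a", "d"}, "d": {"a", "b"},
    "i": set(),                          # 'i' interferes with nothing
}
print(color_graph(interference, 3))
```

Since 'i' has no edges, any of the k colors works for it, matching the note above.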
One of the ideas behind DAG construction is code generation. The steps involved
include reordering the instructions, labeling the nodes with the number of registers required,
and using this information to generate target assembly language code.
The code generation algorithm uses a recursive procedure on a labeled DAG and
generates code based on the labels assigned to the nodes. It uses two stacks: a register stack,
"rstack", and a memory stack, "mstack". The stack rstack is used to allocate registers;
initially it contains all available registers. The algorithm retains the registers on rstack in the
same order it found them. The typical stack functions push() and pop() are used to
rearrange rstack, and in addition the algorithm uses a swap(rstack) function to interchange
the top two registers on rstack.
The algorithm, considers five different cases to generate code. They are discussed as
follows:
Case 0: This is the simple, terminating case of the recursive procedure. If n is a leaf
and the leftmost child of its parent, we generate just a load instruction.
Case 1: This is the situation when the right child is a leaf and the left child may be a
sub-tree. In this case, we generate code to evaluate n1 into register R = top(rstack),
followed by the instruction "op name, R".
Case 2: The right sub-tree requires more registers than the left sub-tree: n1 can be
evaluated without stores, but n2 is harder to evaluate than n1 because it requires more
registers. For this case, we swap the top two registers on rstack, then evaluate n2 into
R = top(rstack). We remove R from rstack and evaluate n1 into S = top(rstack).
Then we generate the instruction "op R, S", which produces the value of n in register S.
A final call to swap restores rstack to its original order.
Case 3: It is similar to case 2 except that here the left sub-tree is harder and is evaluated
first. There is no need to swap registers here.
Case 4: It occurs when both sub-trees require r or more registers to evaluate without
stores. Since we must use a temporary memory location, we first evaluate the right sub-
tree into the temporary T, then the left sub-tree, and finally the root. All these cases are
discussed in Algorithm 32.1, which generates code from the DAG; the algorithm is named
gencode(n), where n is the root of the DAG passed as argument.
Procedure gencode(n);
Begin
/* case 0 */
if n is a left leaf representing operand name and n is the leftmost child of its
parent then print 'MOV' || name || ',' || top(rstack)
else if n is an interior node with operator op, left child n1, and right child n2
then /* case 1 */
if label(n2) = 0 then begin
let name be the operand represented by n2;
gencode(n1);
print op || name || ',' || top(rstack)
end
/* case 2 */
else if 1 ≤ label(n1) < label(n2) and label(n1) < r then begin
swap(rstack);
gencode(n2);
R := pop(rstack); /* n2 was evaluated into register R */
gencode(n1);
print op || R || ',' || top(rstack);
push(rstack,R);
swap(rstack)
end
/* case 3 */
else if 1 ≤ label(n2) ≤ label(n1) and label(n2) < r then begin
gencode(n1);
R := pop(rstack); /* n1 was evaluated into register R */
gencode(n2);
print op || top(rstack) || ',' || R;
push(rstack,R)
end
/* case 4: both sub-trees need r or more registers */
else begin
gencode(n2);
let T be a new temporary memory location;
print 'MOV' || top(rstack) || ',' || T;
gencode(n1);
print op || T || ',' || top(rstack)
end
End
Case 1 checks whether the right child of an interior node is a leaf. If it is, we call the
function recursively with the left child as the root, and conclude this case by generating
an "op" instruction.
Case 2 applies when the right sub-tree is the heavier one. If it is, we swap the register
stack so that the right sub-tree is evaluated into the register beneath the top register. We
then recursively call gencode() with the right sub-tree's node as root and remove that
register from rstack. We then call gencode() to evaluate the left sub-tree using the
top-of-stack register. After that we swap the rstack contents again to ensure the initial
rstack order is restored.
Case 3 is just the opposite of Case 2: since the left sub-tree is evaluated first here, the
rstack contents are used as they are, not swapped.
Case 4 is the situation when neither sub-tree can be evaluated in the registers available
on rstack. We fall back on a memory-based operation, evaluating one operand into a
memory temporary.
The nodes of the DAG are labeled with the number of registers each requires for
computation. Assume that there are two registers, R0 and R1, with R0 on top of the stack.
The algorithm gencode() is called with the root node t4, whose children are t1 and t3.
Since the label of t3 is greater, case 2 applies: gencode() is called recursively with t3
after swapping the register stack. t3's left child is a leaf node and hence falls under
case 0.
Case 0 generates a load instruction, and the value 'e' is loaded into R1. After this call
returns, control goes to the next step of the previous call, which removes register R1 from
rstack using pop(), and gencode() is called with the right sub-tree node, t2. This falls
under case 1, as the label of its right leaf is 0, so gencode() is again called with node 'c'.
This again falls under case 0, emits a load into R0, and returns. The next instruction is
the next step of case 1, where an operator instruction is issued, followed by the next step
of case 2, where a SUB instruction is issued and the register is pushed and then
swapped.
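The labeling and gencode procedure can be sketched in Python as follows. The expression tree, (a+b) - (e - (c+d)), is an assumed reconstruction of the t1..t4 example consistent with the walkthrough above, since the DAG figure itself is not reproduced in the notes; the register names and "OP src, dst" instruction syntax follow the earlier examples.

```python
# Sketch of the labeled-tree code generator (cases 0-4 above). Trees are
# ("OP", left, right) tuples or operand-name strings; an instruction
# "OP src, dst" computes dst := dst OP src, as in the notes.

def label(n, leftmost=True):
    """Sethi-Ullman label: registers needed to evaluate n without stores."""
    if isinstance(n, str):
        return 1 if leftmost else 0
    l, r = label(n[1], True), label(n[2], False)
    return max(l, r) if l != r else l + 1

def gencode(n, rstack, code):
    """Emit code for n, leaving its value in the top register rstack[-1]."""
    if isinstance(n, str):                        # case 0: leftmost leaf
        code.append(f"MOV {n}, {rstack[-1]}")
        return
    op, n1, n2 = n
    l1, l2 = label(n1, True), label(n2, False)
    if l2 == 0:                                   # case 1: right child is a leaf
        gencode(n1, rstack, code)
        code.append(f"{op} {n2}, {rstack[-1]}")
    elif 1 <= l1 < l2 and l1 < len(rstack):       # case 2: right sub-tree harder
        rstack[-1], rstack[-2] = rstack[-2], rstack[-1]  # swap top two registers
        gencode(n2, rstack, code)
        R = rstack.pop()                          # n2 was evaluated into R
        gencode(n1, rstack, code)
        code.append(f"{op} {R}, {rstack[-1]}")    # value of n now in rstack[-1]
        rstack.append(R)
        rstack[-1], rstack[-2] = rstack[-2], rstack[-1]  # swap back
    elif 1 <= l2 <= l1 and l2 < len(rstack):      # case 3: left sub-tree harder
        gencode(n1, rstack, code)
        R = rstack.pop()                          # n1 was evaluated into R
        gencode(n2, rstack, code)
        code.append(f"{op} {rstack[-1]}, {R}")    # value of n now in R
        rstack.append(R)
    else:                                         # case 4: spill to memory temp
        gencode(n2, rstack, code)
        code.append(f"MOV {rstack[-1]}, T")       # single temporary T; a real
        gencode(n1, rstack, code)                 # implementation needs fresh names
        code.append(f"{op} T, {rstack[-1]}")

# t1 = a + b, t2 = c + d, t3 = e - t2, t4 = t1 - t3; R0 on top of the stack.
t4 = ("SUB", ("ADD", "a", "b"), ("SUB", "e", ("ADD", "c", "d")))
code = []
gencode(t4, ["R1", "R0"], code)
print("\n".join(code))
```

Running this on the assumed tree reproduces the order described in the walkthrough: 'e' is loaded into R1 first (case 0 inside case 2), then 'c' into R0, followed by the ADD and the two SUB instructions, with the final result in R0.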