0% found this document useful (0 votes)

169 views15 pages

UNIT-5 Compiler Design

The document discusses runtime storage organization and code generation techniques. It describes how runtime memory is typically subdivided into sections for code, static data, stack, and heap. The stack is used to allocate memory for procedure activations using activation records. Code generation involves laying out data and allocating memory statically at compile time, dynamically on the stack for procedure locals, or from the heap as needed. Parameters and return values are placed in the activation record, divided between the caller and callee's responsibilities.

Uploaded by

Purushottam Rohidas Patil Purushottam Rohidas Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

169 views15 pages

UNIT-5 Compiler Design

Uploaded by

Purushottam Rohidas Patil Purushottam Rohidas Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

CS8602-Compiler Design Department of CSE

UNIT IV RUN-TIME ENVIRONMENT AND CODE GENERATION 8

Storage Organization, Stack Allocation Space, Access to Non-local Data on the Stack, Heap
Management - Issues in Code Generation - Design of a simple Code Generator.

The final phase in compiler model is the code generator. It takes as input an intermediate
representation of the source program and produces as output an equivalent target program. The
code generation techniques presented below can be used whether or not an optimizing phase
occurs before code generation.

Position of code generator

source intermediate intermediate target

front end code code
program code optimizer code generator progr

2020 – 2021 Jeppiaar Institute of Technology

STORAGE ORGANISATION

 The executing target program runs in its own logical address space in which each program
 value has a location. 
 The management and organization of this logical address space is shared between the complier,
operating system and target machine. The operating system maps the logical address into
physical addresses, which are usually spread throughout memory. 

Typical subdivision of run-time memory:

Code

Static Data

Stack

free memory

Heap

 Run-time storage comes in blocks, where a byte is the smallest unit of addressable memory.
Four bytes form a machine word. Multibyte objects are stored in consecutive bytes and given the
address of first byte. 
This run-time storage might be subdivided to hold:
1. The generated target code,

2. Data objects, and

3. A counterpart of the control stack to keep track of procedure activations.

 The storage layout for data objects is strongly influenced by the addressing constraints of the target
 machine. 
 A character array of length 10 needs only enough bytes to hold 10 characters, a compiler may
 allocate 12 bytes to get alignment, leaving 2 bytes unused. 
  This unused space due to alignment considerations is referred to as padding. 
 The size of some program objects may be known at run time and may be placed in an area
 called static. 
 The dynamic areas used to maximize the utilization of space at run time are stack and heap. 

Activation records:
2
  Procedure calls and returns are usually managed by a run time stack called the control stack. 
 Each live activation has an activation record on the control stack, with the root of the activation
 tree at the bottom, the latter activation has its record at the top of the stack. 
 The contents of the activation record vary with the language being implemented. The diagram
below shows the contents of activation record. 

Temporaries

Local Data

Machine Status

Control Link

Access Link

Actual Parameters

Return Value

  Temporary values such as those arising from the evaluation of expressions. 

  Local data belonging to the procedure whose activation record this is. 
 A saved machine status, with information about the state of the machine just before the call to
 procedures. 
 An access link may be needed to locate data needed by the called procedure but found
 elsewhere. 
 A control link pointing to the activation record of the caller. 
 Space for the return value of the called functions, if any. Again, not all called procedures return a
value, and if one does, we may prefer to place that value in a register for efficiency. 
 The actual parameters used by the calling procedure. These are not placed in activation record but
rather in registers, when possible, for greater efficiency. 

STORAGE ALLOCATION STRATEGIES

The different storage allocation strategies are :
1. Static allocation – lays out storage for all data objects at compile time
3
2. Stack allocation – manages the run-time storage as a stack.
3. Heap allocation – allocates and deallocates storage as needed at run time from a data area known as
heap.

Static allocation
 In static allocation, names are bound to storage as the program is compiled, so there is no need for
 a run-time support package. 
 Since the bindings do not change at run-time, everytime a procedure is activated, its names
are bound to the same storage locations. 
 Therefore values of local names are retained across activations of a procedure. That is, when
control returns to a procedure the values of the locals are the same as they were when control left
the last time. 
 From the type of a name, the compiler decides the amount of storage for the name and decides
where the activation records go. At compile time, we can fill in the addresses at which the target
code can find the data it operates on. 
Some limitations of using static allocation:
1. The size of a data object and constraints on its position in memory must be known at
compile time.

2. Recursive procedures are restricted, because all activations of a procedure use the same
bindings for local names.

3. Data structures cannot be created dynamically, since there is no mechanism for storage
allocation at run time.


FORTRAN use static storage allocation

4
 

Stack allocation

 All compilers for languages that use procedures, functions or methods as units of user-defined
 actions manage at least part of their run-time memory as a stack. 
 Each time a procedure is called , space for its local variables is pushed onto a stack, and when the
procedure terminates, that space is popped off the stack. 

Calling sequences:

Procedures called are implemented in what is called as calling sequence, which consists of code
that allocates an activation record on the stack and enters information into its fields. 
 A return sequence is similar to code to restore the state of machine so the calling procedure
 can continue its execution after the call. 
 The code in calling sequence is often divided between the calling procedure (caller) and the
 procedure it calls (callee). 
 When designing calling sequences and the layout of activation records, the following principles
 are helpful: 
 Values communicated between caller and callee are generally placed at the beginning of
the callee’s activation record, so they are as close as possible to the caller’s activation
record. 
 Fixed length items are generally placed in the middle. Such items typically include the control
 link, the access link, and the machine status fields. 
 Items whose size may not be known early enough are placed at the end of the activation record.
The most common example is dynamically sized array, where the value of one of the callee’s
parameters determines the length of the array. 
 We must locate the top-of-stack pointer judiciously. A common approach is to have it point to the
end of fixed-length fields in the activation record. Fixed-length data can then be accessed by fixed
offsets, known to the intermediate-code generator, relative to the top-of-stack pointer. 

5
Parameters and returned values

caller’s
control link
activation
links and saved status
record
caller’s temporaries and local data
responsibility Parameters and returned values
callee’s
activation control link
record links and saved status
top_sp
callee’s
responsibility temporaries and local data

Division of tasks between caller and callee

 The calling sequence and its division between caller and callee are as follows. 

  The caller evaluates the actual parameters. 
 The caller stores a return address and the old value of top_sp into the callee’s activation
 record. The caller then increments the top_sp to the respective positions. 
  The callee saves the register values and other status information. 
  The callee initializes its local data and begins execution. 
 A suitable, corresponding return sequence is: 

  The callee places the return value next to the parameters. 
 Using the information in the machine-status field, the callee restores top_sp and other
 registers, and then branches to the return address that the caller placed in the status field. 
 Although top_sp has been decremented, the caller knows where the return value is, relative to the
current value of top_sp; the caller therefore may use that value. 

Variable length data on stack:

 The run-time memory management system must deal frequently with the allocation of space for
objects, the sizes of which are not known at the compile time, but which are local to a procedure
and thus may be allocated on the stack. 
 The reason to prefer placing objects on the stack is that we avoid the expense of garbage collecting

6
 their space. 
 The same scheme works for objects of any type if they are local to the procedure called and have a
size that depends on the parameters of the call. 

Heap allocation
Stack allocation strategy cannot be used if either of the following is possible :
1. The values of local names must be retained when an activation ends.
2. A called activation outlives the caller.

 Heap allocation parcels out pieces of contiguous storage, as needed for activation records or
 other objects. 
 Pieces may be deallocated in any order, so over the time the heap will consist of alternate
areas that are free and in use. 

7
Position in the Activation records in the heap Remarks
activation tree

s Retained activation
s record for r

r q ( 1 , 9) control link

control link

q(1,9)

control link

 The record for an activation of procedure r is retained when the activation ends. 

 Therefore, the record for the new activation q(1 , 9) cannot follow that for s physically. 

 If the retained activation record for r is deallocated, there will be free space in the heap
between the activation records for s and q. 
For large blocks of storage use the heap manager.This approach results in fast allocation
and deallocation of small amounts of storage, since taking and returning a block from
linked list are efficient operations.

ISSUES IN THE DESIGN OF A CODE GENERATOR

The following issues arise during the code generation phase :

1. Input to code generator

2. Target program
3. Memory management
4. Instruction selection
5. Register allocation
8
6. Evaluation order

1. Input to code generator:

 The input to the code generation consists of the intermediate representation of the source program
produced by front end , together with information in the symbol table to determine run-time
addresses of the data objects denoted by the names in the intermediate representation. 

 Intermediate representation can be : 
a. Linear representation such as postfix notation
b. Three address representation such as quadruples
c. Virtual machine representation such as stack machine code
d. Graphical representations such as syntax trees and dags.

 Prior to code generation, the front end must be scanned, parsed and translated into intermediate
representation along with necessary type checking. Therefore, input to code generation is assumed
to be error-free. 

2. Target program:
 The output of the code generator is the target program. The output may be : 
a. Absolute machine language
- It can be placed in a fixed memory location and can be executed immediately.

b. Relocatable machine language

- It allows subprograms to be compiled separately.

c. Assembly language
- Code generation is made easier.

3. Memory management:
 Names in the source program are mapped to addresses of data objects in run-time memory by
the front end and code generator. 

 It makes use of symbol table, that is, a name in a three-address statement refers to a symbol-
table entry for the name. 

 Labels in three-address statements have to be converted to addresses of instructions. For
example, 
j :gotoigenerates jump instruction as follows :
 ifi<j, a backward jump instruction with target address equal to location of code for
quadruple i is generated. 
 ifi>j, the jump is forward. We must store on a list for quadruplei the location of the
first machine instruction generated for quadruplej. When iis processed, the machine
locations for all instructions that forward jumps to i are filled. 

4. Instruction selection:
 The instructions of target machine should be complete and uniform. 

 Instruction speeds and machine idioms are important factors when efficiency of target program
9
is considered. 

 The quality of the generated code is determined by its speed and size. 

 The former statement can be translated into the latter statement as shown below: 

5. Register allocation
 Instructions involving register operands are shorter and faster than those involving operands in
memory. 

 The use of registers is subdivided into two subproblems : 
Register allocation – the set of variables that will reside in registers at a point inthe program is selected.

 Register assignment – the specific register that a variable will reside in ispicked. 

 Certain machine requires even-odd register pairs for some operands and results. For
example , consider the division instruction of the form : 
D x, y

where, x – dividend even register in even/odd register pair y –

divisor
even register holds the remainder odd
register holds the quotient

6. Evaluation order
 The order in which the computations are performed can affect the efficiency of the target code.
Some computation orders require fewer registers to hold intermediate results than others. 

A SIMPLE CODE GENERATOR

 A code generator generates target code for a sequence of three- address statements and effectively
uses registers to store operands of the statements.
For example: consider the three-address statement a := b+c

It can have the following sequence of codes:

10
ADD Rj, Ri Cost = 1 // if Ri contains b and R j contains c

(or)

ADD c, Ri Cost = 2 // if c is in a memory location

(or)

MOV c, Rj Cost = 3 // move c from memory to Rj and add

ADD Rj, Ri
Register and Address Descriptors:

 A register descriptor is used to keep track of what is currently in each registers. The register
 descriptors show that initially all the registers are empty. 
 An address descriptor stores the location where the current value of the name can be found at run
time.

A code-generation algorithm:

The algorithm takes as input a sequence of three -address statements constituting a basic block. For each
three-address statement of the form x : = y op z, perform the following actions:

1. Invoke a function getreg to determine the location L where the result of the computation y op z should
be stored.

2. Consult the address descriptor for y to determine y’, the current location of y. Prefer the register for
y’ if the value of y is currently both in memory and a register. If the value of y is not already in L,
generate the instruction MOV y’ , L to place a copy of y in L.

3. Generate the instruction OP z’ , L where z’ is a current location of z. Prefer a register to a

memory location if z is in both. Update the address descriptor of x to indicate that x is in location
L. If x is in L, update its descriptor and remove x from all other descriptors.

4. If the current values of y or z have no next uses, are not live on exit from the block, and are in
registers, alter the register descriptor to indicate that, after execution of x : = y op z , those registers
will no longer contain y or z.

The algorithmic sequence of getreg function can be,

1. if x value is in register that register is returned.

2. If (1) fails, new register is returned.
3. If (2) fails, and the operation needs a special register, that register value is temporarily moved to
the memory and the register is returned.
4. If (3) fails, finally memory location is returned.

11
Generating Code for Assignment Statements:

 The assignment d : = (a-b) + (a-c) + (a-c) might be translated into the following three-address code
sequence: 
t:=a–b u:=
a–c v:=t+u
d:=v+u
with d live at the end.

12
Code sequence for the example is:

Statements Code Generated Register descriptor Address descriptor

t:=a-b MOV a, R0 R0 contains t t in R0

SUB b, R0

u:=a-c MOV a , R1 R0 contains t t in R0

SUB c , R1 R1 contains u u in R1

v : =t + u ADD R1, R0 R0 contains v u in R1

R1 contains u v in R0

d:=v+u ADD R1, R0 R0 contains d d in R0

d in R0 and memory
MOV R0, d

Generating Code for Indexed Assignments

The table shows the code sequences generated for the indexed assignment statements a : = b [ i ]
and a [ i ] : = b

Statements Code Generated

a : = b[i] MOV b(Ri), R

a[i] : = b MOV b, a(Ri)

Generating Code for Pointer Assignments

The table shows the code sequences generated for the pointer assignments a : = *p and *p : = a

Statements Code Generated

a : = *p MOV *Rp, a

*p : = a MOV a, *Rp

13
Generating Code for Conditional Statements

Statement Code

CMP x, y
if x < y goto z CJ<z
/* jump to z if condition code
is negative */

x : = y +z if x <0 goto z MOV y, R0

ADD z, R0
MOV R0,x
CJ<Z

14
15

Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
AACVPR Guidelines For AACVPR Guidelines For Pulmonary Rehabilitation Programs (4 Edition)
No ratings yet
AACVPR Guidelines For AACVPR Guidelines For Pulmonary Rehabilitation Programs (4 Edition)
37 pages
Acd Unit V
No ratings yet
Acd Unit V
44 pages
Module 5
No ratings yet
Module 5
22 pages
Unit 5 Contd Final
No ratings yet
Unit 5 Contd Final
12 pages
Unit 4 Compiler
No ratings yet
Unit 4 Compiler
19 pages
Unit 5 Code Generation
No ratings yet
Unit 5 Code Generation
19 pages
11 RunTimeAdministration1
No ratings yet
11 RunTimeAdministration1
68 pages
Unit 5 - Runtime Environment
No ratings yet
Unit 5 - Runtime Environment
69 pages
Code Generation PDF
No ratings yet
Code Generation PDF
19 pages
Unit 4
No ratings yet
Unit 4
22 pages
CD Unit 4
No ratings yet
CD Unit 4
6 pages
Unit 4 Part 1
No ratings yet
Unit 4 Part 1
18 pages
UNIT V CD Print
No ratings yet
UNIT V CD Print
9 pages
UNIT - IV - TUV - Compiler Design - NOTES - PG
No ratings yet
UNIT - IV - TUV - Compiler Design - NOTES - PG
38 pages
CS 346: Code Generation: Resource
No ratings yet
CS 346: Code Generation: Resource
52 pages
Unit Iv QB
No ratings yet
Unit Iv QB
17 pages
Cse Unit 5
No ratings yet
Cse Unit 5
54 pages
Compiler Design Unit 5
100% (1)
Compiler Design Unit 5
28 pages
Unit 5
No ratings yet
Unit 5
32 pages
Run-Time Environments
No ratings yet
Run-Time Environments
51 pages
Run Time Environments
No ratings yet
Run Time Environments
32 pages
Compiler Design Unit 4 Question and Answers
No ratings yet
Compiler Design Unit 4 Question and Answers
25 pages
5CS4-CD-Unit-4 - PPT @zammers
No ratings yet
5CS4-CD-Unit-4 - PPT @zammers
56 pages
Unit-5-Issues in Code Generation
No ratings yet
Unit-5-Issues in Code Generation
20 pages
BCS 324 Topic 6
No ratings yet
BCS 324 Topic 6
56 pages
CD 5
No ratings yet
CD 5
14 pages
Run-Time Environments
No ratings yet
Run-Time Environments
24 pages
Input To The Code Generator
No ratings yet
Input To The Code Generator
62 pages
Lec37 39
No ratings yet
Lec37 39
19 pages
CD GTU Study Material Presentations Unit-6 09092020043010PM
No ratings yet
CD GTU Study Material Presentations Unit-6 09092020043010PM
41 pages
Compiler Design - Unit-4
No ratings yet
Compiler Design - Unit-4
42 pages
CD Notes 4,5
No ratings yet
CD Notes 4,5
40 pages
Compiler Design UNIT IV
No ratings yet
Compiler Design UNIT IV
18 pages
CDU4
No ratings yet
CDU4
26 pages
Lecture 16
No ratings yet
Lecture 16
19 pages
BCS 324 Notes - Unit4
No ratings yet
BCS 324 Notes - Unit4
38 pages
Storage Allocation and Parameter Passing
100% (1)
Storage Allocation and Parameter Passing
9 pages
CS6109 Module 9
No ratings yet
CS6109 Module 9
45 pages
CD Unit 4 (Rte, CG)
No ratings yet
CD Unit 4 (Rte, CG)
27 pages
7 CD-PPT-5 Unit
No ratings yet
7 CD-PPT-5 Unit
26 pages
Unit 4
No ratings yet
Unit 4
43 pages
CS3501 CD Qb-Unit 4
No ratings yet
CS3501 CD Qb-Unit 4
5 pages
Module 5
No ratings yet
Module 5
30 pages
CH4 1
No ratings yet
CH4 1
38 pages
PCD - Unit Iv
No ratings yet
PCD - Unit Iv
16 pages
Source Language Issues Procedures:: CS1601 Compiler Design
No ratings yet
Source Language Issues Procedures:: CS1601 Compiler Design
11 pages
13 Runtime Systems
No ratings yet
13 Runtime Systems
65 pages
Compiler Design Unit-4
No ratings yet
Compiler Design Unit-4
27 pages
Unit IV
No ratings yet
Unit IV
57 pages
CD Unit 6.1
No ratings yet
CD Unit 6.1
20 pages
CD Unit 5 Part 1
No ratings yet
CD Unit 5 Part 1
14 pages
CH4 1
No ratings yet
CH4 1
37 pages
2000 by Antony L. Hosking. Permission To Make Digital or Hard Copies of
No ratings yet
2000 by Antony L. Hosking. Permission To Make Digital or Hard Copies of
26 pages
5th Unit ACD
No ratings yet
5th Unit ACD
27 pages
CD Unit 5-1
No ratings yet
CD Unit 5-1
15 pages
Compiler Construction: Runtime Environment
No ratings yet
Compiler Construction: Runtime Environment
35 pages
Compiler Construction: Runtime Environment
No ratings yet
Compiler Construction: Runtime Environment
35 pages
Runtime System
No ratings yet
Runtime System
25 pages
Run-Time Environment and Program Organization
No ratings yet
Run-Time Environment and Program Organization
35 pages
Runtime Environment
No ratings yet
Runtime Environment
7 pages
Cesc 12 - Q1 - M5 PDF
No ratings yet
Cesc 12 - Q1 - M5 PDF
14 pages
Zindgi Ki Dastan - Merged
No ratings yet
Zindgi Ki Dastan - Merged
149 pages
Afm-Kiosk Deign Criteria 2021-R2
No ratings yet
Afm-Kiosk Deign Criteria 2021-R2
27 pages
TAN, MEA S. Unlocking Writing Potential The Impact of Multisensory Activities On The Writing Skills of Struggling Kindergarten Learners of Leon Consumo Memorial Elementary School
No ratings yet
TAN, MEA S. Unlocking Writing Potential The Impact of Multisensory Activities On The Writing Skills of Struggling Kindergarten Learners of Leon Consumo Memorial Elementary School
11 pages
NM FCL 301 Exam Essay
No ratings yet
NM FCL 301 Exam Essay
14 pages
3) Unemployment and Types of Unemployment
No ratings yet
3) Unemployment and Types of Unemployment
4 pages
B 8145 C 694
No ratings yet
B 8145 C 694
42 pages
General Ledger of Journal 1
No ratings yet
General Ledger of Journal 1
8 pages
GoWork Event Space & Price Details (2024)
No ratings yet
GoWork Event Space & Price Details (2024)
29 pages
Grade 4 Rationalised Creative Arts Schemes of Work Term 2
100% (1)
Grade 4 Rationalised Creative Arts Schemes of Work Term 2
29 pages
Reliability: Supplement Outline
No ratings yet
Reliability: Supplement Outline
19 pages
Cylinder Liner - Production Recommendation 0742048 3
No ratings yet
Cylinder Liner - Production Recommendation 0742048 3
17 pages
Clifford E. Clark - House Furnishings Cultural
No ratings yet
Clifford E. Clark - House Furnishings Cultural
10 pages
Big-4 India Stat Audit Interview Questions
No ratings yet
Big-4 India Stat Audit Interview Questions
3 pages
CI How To Read Literature Like A Professor Quiz Questions 2
100% (1)
CI How To Read Literature Like A Professor Quiz Questions 2
5 pages
Customers To Be Linkedfinal
No ratings yet
Customers To Be Linkedfinal
8 pages
Renal Diseases Pathophysiology
100% (1)
Renal Diseases Pathophysiology
6 pages
Solutions On Quiz 1
No ratings yet
Solutions On Quiz 1
6 pages
StartNow Overview
No ratings yet
StartNow Overview
22 pages
Berger Paint Project
100% (2)
Berger Paint Project
144 pages
Meeting Script
No ratings yet
Meeting Script
1 page
Hybrid Organizations:: O, S, I, I
No ratings yet
Hybrid Organizations:: O, S, I, I
8 pages
Hep & GIT Final MCQ 21 B
100% (4)
Hep & GIT Final MCQ 21 B
23 pages
Intermediate Relay: Wiring Diagram
No ratings yet
Intermediate Relay: Wiring Diagram
1 page
Noun Rules
No ratings yet
Noun Rules
12 pages
Andculture Brand Guide
No ratings yet
Andculture Brand Guide
35 pages
True Experimental Design
75% (4)
True Experimental Design
2 pages
Overland Journal Arctic
100% (3)
Overland Journal Arctic
20 pages
AS CRJ Vol5 Aircraft Operating Manual Part 2
No ratings yet
AS CRJ Vol5 Aircraft Operating Manual Part 2
136 pages

UNIT-5 Compiler Design

Uploaded by

UNIT-5 Compiler Design

Uploaded by

CS8602-Compiler Design Department of CSE

UNIT IV RUN-TIME ENVIRONMENT AND CODE GENERATION 8

Position of code generator

source intermediate intermediate target

2020 – 2021 Jeppiaar Institute of Technology

Typical subdivision of run-time memory:

2. Data objects, and

3. A counterpart of the control stack to keep track of procedure activations.

  Temporary values such as those arising from the evaluation of expressions. 

STORAGE ALLOCATION STRATEGIES

Division of tasks between caller and callee

Variable length data on stack:

ISSUES IN THE DESIGN OF A CODE GENERATOR

The following issues arise during the code generation phase :

1. Input to code generator

1. Input to code generator:

b. Relocatable machine language

where, x – dividend even register in even/odd register pair y –

A SIMPLE CODE GENERATOR

It can have the following sequence of codes:

ADD c, Ri Cost = 2 // if c is in a memory location

MOV c, Rj Cost = 3 // move c from memory to Rj and add

3. Generate the instruction OP z’ , L where z’ is a current location of z. Prefer a register to a

The algorithmic sequence of getreg function can be,

1. if x value is in register that register is returned.

Statements Code Generated Register descriptor Address descriptor

t:=a-b MOV a, R0 R0 contains t t in R0

u:=a-c MOV a , R1 R0 contains t t in R0

v : =t + u ADD R1, R0 R0 contains v u in R1

d:=v+u ADD R1, R0 R0 contains d d in R0

Generating Code for Indexed Assignments

Statements Code Generated

a : = b[i] MOV b(Ri), R

a[i] : = b MOV b, a(Ri)

Generating Code for Pointer Assignments

Statements Code Generated

x : = y +z if x <0 goto z MOV y, R0

You might also like