Advanced Compiler Design and Implementation: Run-Time Support

The document discusses various aspects of run-time support for compiled code, including: 1. Data type representations and register usage for different integer sizes and long arithmetic. 2. Organization of the run-time stack, activation records, and how different links are used to support procedure calls and nested procedures. 3. Parameter passing modes and the division of responsibilities between caller and callee procedures in setting up the stack frame and argument passing.

Uploaded by

vadriangmail

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

109 views27 pages

Advanced Compiler Design and Implementation: Run-Time Support

Uploaded by

vadriangmail

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

1 of 27

Advanced Compiler
Design and
Implementation:
Run-Time Support
Juhana Helovuo
Data type representations and instruction set support
Register set and register usage
Activation records and run-time stack
Parameter passing modes
Code for subroutine calls
2 of 27
Shared object code
Dynamic typing, heap management, function polymorphism
3 of 27
Data type representations
Fixed-size integers: word, halfword, byte
How to treat integers with size < register size
Example: Add 5 to signed byte @ sp+72
Different sizes of loads, stores and arithmetic (M68k)
addi.b (72,a7), 5 ; add immediate byte
Sign/zero-extend on load instructions (Sparc)
ldsb [%sp+72],%l2 ; load signed byte (and extend)
add %l2, 5, %l2 ; add (32-bit)
stb %l2, [%sp+72] ; store byte
4 of 27
Sign/zero-extend and align with separate instructions
(Alpha)
ldq_u r2, 72(sp) ; load quadword unaligned -> r2
lda r1, 72(sp) ; load address -> r1
extbl r2, r1, r3 ; extract 1 byte from r2 -> r3
mskbl r2, r1, r2 ; mask (clear) byte from r2
addq r3, 5, r3 ; add quadword (64-bit)
insbl r3, r1, r3 ; shift byte back in position
or r2, r3, r2 ; combine result & rest of qword
stq_u r2, (r1) ; write quadword back to memory
The general case seems very complex, but case-specic
optimizations often simplify this (register allocation,
alignment, BWX)
Very simple memory unit: Only aligned 64-bit loads & stores
For integer size > register size: Use two or four registers
Architecture may provide double load for two consecutive
registers (Sparc) or multiple load (ARM)
5 of 27
Long arithmetic
Use carry ag for addition & subtraction (Sparc)
addcc %i1, %i3, %l0 ; add low words, generate carry
addx %i2, %i4, %l1 ; add high words + carry
Or use unsigned less than-comparison (Alpha)
addq a0, a2, t0 ; add low words
addq a1, a3, t1 ; add high words
cmpult t0, a0, t2 ; generate carry: t2 = (t0<a0 ? 1 : 0)
addq t1, t2, t1 ; add carry to result high word
6 of 27
Character strings
C-style strings: Array of characters, end of string marked by
character code 0
Pascal-style strings: Character count (integer) followed by an
array of characters
Instruction set support
x86: store string or move string instructions + repeat prex,
byte-sized operations
Sparc: byte loads and stores
Alpha: insert, extract, mask, zap, cmpbge
PowerPC: load/store string (and compare)
7 of 27
Pointers
Usually 32/64-bit words (same as register size)
Naturally aligned: pointer mod sizeof(pointed data) = 0
Array access often requires pointer arithmetic
base pointer + (index * element size)
Element size is often 4 or 8
Special support for address computation
ARM: Data path for second operand contains a shifter unit
Alpha: s4add, s8add, s4sub, s8sub
PowerPC, ARM, Sparc: Indexed addressing mode
lwzx r0,r9,r2 ; r0 := M[r9+r2] (PowerPC)
ld [%i2+%i3], %l1 ; l1 := M[i2+i3] (Sparc)
8 of 27
Register Usage
Typical RISC has 32 integer registers
(ARM: 16, Itanium: 128, Sparc: register windows, x86: ~8)
Compiler typically has several uses for registers
stack pointer and frame pointer
global offset table pointer (global pointer)
dynamic link and static link
call arguments and return values
local variables
frequently used global variables
temporary values
9 of 27
...Register Usage
The compiler should maximize the use of the register set in
order to avoid memory accesses
The partitioning of the register set may be partially
determined by
ISA (Instruction Set Architecture = hardware platform) and
ABI (Application Binary Interface = system software)
ISA usually denes or recommends a stack pointer, possibly
also frame pointer and link register
ABI may dene argument and return value registers
ABI must be followed to maintain interoperability with other
compilers and libraries
10 of 27
Register partitioning example (Alpha)
v0 = return value, a0..a5 = call arguments, ra=return address
s0..s5 = local/global variables, preserved across calls
t0..t11 = local variables and temporaries, not preserved
pv = call address, gp = global pointer, AT = assembler temp.
v0 t7
s0
s1
fp
t8
t9
t10
t11
pv
AT
gp
sp
zero
s2
s3
s4
s5
a0
a1
a2
a3
a4
a5
ra
t0
t1
t2
t3
t4
t5
t6
r0
r7 r31
r24
r1
r2
...
...
11 of 27
The Run-Time Stack
The run-time stack is used to store activation records (stack
frames)
Activation records represent
procedure activations and they may
contain
dynamic link and static link
call arguments and return values
local variables
saved registers (by caller and callee)
procedure call return address
The stack is maintained and accessed through the stack
pointer register, often also by the frame pointer
sp
fp
current
frame
previous
frame
sp+N
fp-M
12 of 27
The activation record is used to communicate between the
caller (main program) and callee (subroutine)
These procedures may be compiled separately
The compiler must adhere to a call convention, or a
procedure call protocol
Parts of the activation record are constructed by the caller
and some parts by the callee
Only the caller may know the size of argument list (C)
Only the callee knows the storage required for local
variables
Both have to be able to access arguments, return value and
links (dynamic, static, return address)
13 of 27
Links in Stack Frame
Dynamic Link
Used to nd the calling stack frame on return
If the frame size is xed and static, then there is no need for
this. Just use a constant offset in the code
Static Link
Used to nd the last activation of the static parent of the
current frame
Required only in languages allowing nested, local
procedures (e.g. Pascal, Ada, not in C)
Return Address
Used to nd the code of the caller on procedure exit
RISCs store return address into a link register on call (jump-
and-link) instruction
14 of 27
Parameter passing modes
Call by value: Argument value is copied into the callee. The
original variable of the caller is not modied during the call.
Default in most languages (except Fortran and Perl)
Call by result: Argument is copied from the callee to the
caller. Used to return values.
Call by value-result: Argument is copied both ways.
Call by reference: Callee gets a reference (pointer) to a
memory location holding the argument. Callee can modify
the argument.
Call by name: Like call by reference, but the argument pointer
expression is recomuputed at each access.
The callee is passed a small anonymous function to
compute the address of the argument.
15 of 27
Procedure Call and Return
Callers view of a subroutine call
Call
1. Evaluate each argument and place them in argument
registers or stack frame
2. Determine the address of the subroutine (mostly done by the
linker)
3. Store caller-save -registers in stack frame
4. Compute a static link for the subroutine, if necessary
5. Save the return address and jump to the subroutine
Return
1. Restore saved registers from stack
2. Use the return value
16 of 27
Epilogue and Prologue
Callees view of the call
Prologue
1. Save frame pointer, copy stack pointer to frame pointer,
compute new stack pointer, i.e. allocate new stack frame
2. Save callee-save registers, if necessary
3. Construct a display (cache of static links), if necessary
Procedure body is executed between the prologue and the
epilogue
Epilogue
1. Restore saved callee-save registers
2. Restore SP from frame pointer and FP from dynamic link
3. Place return value in appropriate register or stack location
4. Jump to return address
17 of 27
Call Example
Sample C code
int test_proc(int a1, int a2)
{
int lv1, lv2;
...
return ...;
}
...
r = test_proc(r,4);
Subroutine with two int parameters and two int locals
18 of 27
PowerPC calling convention (MacOS X)
stack frames are of
static and xed size
no frame pointer
callee saves as
many registers as it
uses
frame contains
outgoing arguments
(incoming
arguments in
previous frame)
callee may store
incoming args in
callers frame if it
needs them in memory
r0
r1
r2
r3
r10
r11
r12
r13
r31
link
count
cond
exception
zero/temp
stack ptr
temp
arg0/ret.v.
arg7
temp
indir. branch target
local
variables
Register partitioning
Stack frame structure
old SP
SP
saved cond
saved link
???
SP+24
arg0
argN
local
variables
saved
registers
prev. frame
old SP
and temps
in memory
outgoing
args
arg1/ret.v.
arg2
...
19 of 27
PowerPC assembly for example call
Prologue and epilogue
_test_proc:
mflr r0 ; r0 <- link
stmw r26,-24(r1) ; store r26..r31 below SP (24 bytes)
stw r0,8(r1) ; M[SP+8] <- r0 (to callers frame)
stwu r1,-96(r1) ; M[SP-96] <- SP ; SP <- SP-96
...body...
lwz r0,104(r1) ; r0 <- M[SP+96+8]
addi r1,r1,96 ; SP <- SP + 96
mtlr r0 ; link <- r0
lmw r26,-24(r1) ; restore r26..r31
blr ; branch to link register
Call site
li r3,3 ; load arg0
li r4,4 ; load arg1
bl _test_proc ; branch and link
mr r4,r3 ; r4 <- r3 (use return value)
20 of 27
Procedure-valued variables
Rare in imperative languages, routine in functional languages
C provides function pointers
Simple to implement as plain code pointers
This is sufcient, since there are no local procedures
Nested procedures require prodecure values to contain both
code pointer and static link (=closure)
Static link is required to nd the local variables of enclosing
scope
Now activation records may have to live even after the
function execution has ended. Stack allocation is not
sufcient for all procedures
21 of 27
Position-Independent Code
Required for shared libraries - and more generally - for any
dynamically loadable code, e.g. plugin modules
Only one copy of shared code in memory code cannot be
modied at load time
PIC must be loadable to an arbitrary memory location
Code and data references must work regardless of code
location
Local data references are SP-based ok
Jumps within the same object module can use relative
addressing ok
Global data references and jumps from object module to
another cannot be absolute use indirect addressing
22 of 27
Global Offset Table
Global Offset Table (GOT) is a pointer table used to point to
global symbols, whose addresses are not known until
program load time.
Data References
The compiler generates indirect references though the GOT
The link-editor relocates the reference as a GOT offset
The run-time linker lls the GOT with actual symbol
addresses, when it knows where the object will be loaded
Code References
Calls to shared code jump to an element of Procedure
Linkage Table (PLT)
PLT element contains code to load an address fromGOT and
a jump to that address
23 of 27
GOT Example (Sparc)
Procedure prologue
.LLGETPC0: ; helper function
retl ; to read program counter
add %o7, %l7, %l7 ; %l7 += return address
so_func: ; actual procedure start
save %sp, -112, %sp ; allocate stack frame
sethi %hi(_GLOBAL_OFFSET_TABLE_-4), %l7
call .LLGETPC0
add %l7, %lo(_GLOBAL_OFFSET_TABLE_+4), %l7
; now %l7 contains the address of GOT
; _GLOBAL_OFFSET_TABLE_ is a PC-relative symbol
Loading from global data symbol si
; symbol si has relocation type GOT, i.e. it is treated as
; an offset into GOT, not actual memory address
sethi %hi(si), %g1
or %g1, %lo(si), %g1 ; %g1 = GOT offset for si
ld [%l7+%g1], %g1 ; load address of si from GOT
ld [%g1], %i0 ; load value of si
24 of 27
Calling via PLT
call so_aux, 0 ; looks normal, but symbol so_aux
; has relocation type PLT
; linker relocates this to .PLT2
The call is to object module-local PLT, not actual subroutine
in another object
.PLT2
sethi (. - .PLT0), %g1
sethi %hi(so_aux), %g1
jmp %g1+%lo(so_aux)
.PLT3
sethi (. - .PLT0), %g1
ba,a .PLT0
nop
.PLT0
save %sp, -64, %sp
call dyn_linker
so_aux:
save ...
...
PLT
Code from shared object
0: rst entry
2: run-time
3: not yet
linked entry
linked
25 of 27
Dynamic typing and polymorphism
Dynamic typing: The programming language does not
associate types to variables, but rather to data values
Variable name can refer to value of any type
Dynamic typing is usually implemented by tagging data
values. Each value carries a type tag with it.
The compiler should generate efcient code for resolving the
types of data values and selecting the corresponding
(polymorphic) operation on them, e.g. a+b on integers, oats,
stings or bignums.
Modern architectures have very little built-in hardware
support for this
Sparc provides tagged add and subtract instructions
26 of 27
Storage management
Fully manual: malloc and free in C, or new and delete
in C++
Automatic deallocation: new but no delete in Java
Fully automatic: All memory management operations
implicit, in e.g. Lisp or Haskell
Automatic deallocation is usually based on reference
counting, garbage collection, or combination of both
Manual allocation is usually implemented as a library call
e.g. Doug Leas dlmalloc library has been shown to
outperform custom memory allocation routines
27 of 27
Summary
Language semantics and run-time services (dynamic
loading, code sharing, memory management) may require
complicated run-time support
It should be possible to optimize away costly parts of the
procedure call mechanism to obtain good call performance.
The amount of required run-time support code depends on
the language and hardware.
Modern RISCs do not have much explicit architectural
support for specic high-level languages, but this can be
compensated in software

Internal Audit Checklist Questions - IsMS Controls
88% (8)
Internal Audit Checklist Questions - IsMS Controls
22 pages
EE209 - 23 16 AssemblyFunctions
No ratings yet
EE209 - 23 16 AssemblyFunctions
64 pages
Fa24 Week 9
No ratings yet
Fa24 Week 9
38 pages
9 - Subroutines
No ratings yet
9 - Subroutines
56 pages
EE209A - 24 16 AssemblyFunctions
No ratings yet
EE209A - 24 16 AssemblyFunctions
64 pages
Chapter 08 ARM Subroutines 1 Parameters Registers Edited
No ratings yet
Chapter 08 ARM Subroutines 1 Parameters Registers Edited
35 pages
Computer Architecture: Vnu - University Engineering Technology
No ratings yet
Computer Architecture: Vnu - University Engineering Technology
29 pages
Cse Unit 5
No ratings yet
Cse Unit 5
54 pages
Lecture 35 Runtime Environments
No ratings yet
Lecture 35 Runtime Environments
44 pages
Lecture 6
No ratings yet
Lecture 6
32 pages
MIPS Assembly - Stack
No ratings yet
MIPS Assembly - Stack
24 pages
2018fa CS61C L09 BN Procedures
No ratings yet
2018fa CS61C L09 BN Procedures
23 pages
Lecture6 RISC V Assembly IV
No ratings yet
Lecture6 RISC V Assembly IV
21 pages
06 Procedures
No ratings yet
06 Procedures
76 pages
ch2 2
No ratings yet
ch2 2
36 pages
Chapter 5
No ratings yet
Chapter 5
38 pages
15 AssemblyFunctions
No ratings yet
15 AssemblyFunctions
43 pages
CCTV Band Width Calculation
No ratings yet
CCTV Band Width Calculation
5 pages
M2 - Instruction Set Architecture
No ratings yet
M2 - Instruction Set Architecture
61 pages
03 Instructions II
No ratings yet
03 Instructions II
52 pages
Lecture 04 Assembly II
No ratings yet
Lecture 04 Assembly II
49 pages
L07 Riscviii
No ratings yet
L07 Riscviii
74 pages
Ch09 Subroutines and Control Abstraction 4e
No ratings yet
Ch09 Subroutines and Control Abstraction 4e
24 pages
CS104: Computer Organization: Lecture 07, 12 February, 2020
No ratings yet
CS104: Computer Organization: Lecture 07, 12 February, 2020
43 pages
Slides 12 x86 Procedures
No ratings yet
Slides 12 x86 Procedures
43 pages
Lab3 Stack and Function Arguments
No ratings yet
Lab3 Stack and Function Arguments
12 pages
Lecture 7 - ISA - J-Type Function Calls EECS 388
No ratings yet
Lecture 7 - ISA - J-Type Function Calls EECS 388
22 pages
Slide 4
No ratings yet
Slide 4
35 pages
05 Software Security 2
No ratings yet
05 Software Security 2
33 pages
Chapter 10
No ratings yet
Chapter 10
31 pages
Unit-5-Issues in Code Generation
No ratings yet
Unit-5-Issues in Code Generation
20 pages
Chapter 8
No ratings yet
Chapter 8
15 pages
Lecture 4
No ratings yet
Lecture 4
52 pages
Micro MCQS
No ratings yet
Micro MCQS
16 pages
Lecture 09
No ratings yet
Lecture 09
19 pages
Proc Emb Ch4
No ratings yet
Proc Emb Ch4
46 pages
NET3001 4 AdvAsm
No ratings yet
NET3001 4 AdvAsm
43 pages
Subroutines: Subroutines The Stack Recursion
No ratings yet
Subroutines: Subroutines The Stack Recursion
44 pages
Lecture # 6
No ratings yet
Lecture # 6
34 pages
Procedure Calls
No ratings yet
Procedure Calls
19 pages
Lecture5 Extra Runtime Stack
No ratings yet
Lecture5 Extra Runtime Stack
28 pages
Slide Set 10 - Stacks and Register Use
No ratings yet
Slide Set 10 - Stacks and Register Use
15 pages
04 ABIs
No ratings yet
04 ABIs
60 pages
Misp Procedure Calls
No ratings yet
Misp Procedure Calls
34 pages
06ABIs 1
No ratings yet
06ABIs 1
41 pages
L2 - Process Abstraction
No ratings yet
L2 - Process Abstraction
70 pages
Register Usage in MIPS ABI: Inf3 Computer Architecture - 2007-2008
No ratings yet
Register Usage in MIPS ABI: Inf3 Computer Architecture - 2007-2008
24 pages
Assembly Language: Function Calls: Goals of This Lecture
No ratings yet
Assembly Language: Function Calls: Goals of This Lecture
21 pages
Procedure Calls and Stacks Procedures
No ratings yet
Procedure Calls and Stacks Procedures
6 pages
Calling Convention
No ratings yet
Calling Convention
11 pages
CSCI-365 Computer Organization
No ratings yet
CSCI-365 Computer Organization
31 pages
Novell Netware 5 - Advanced Admin - Instructor Guide PDF
No ratings yet
Novell Netware 5 - Advanced Admin - Instructor Guide PDF
1,128 pages
Subroutines and Control Abstraction
No ratings yet
Subroutines and Control Abstraction
44 pages
Redshift Vs Snowflake - An In-Depth Comparison PDF
100% (2)
Redshift Vs Snowflake - An In-Depth Comparison PDF
19 pages
Chapter2 2
No ratings yet
Chapter2 2
25 pages
Aula Ch2 2
No ratings yet
Aula Ch2 2
27 pages
GFSK Intro
100% (1)
GFSK Intro
5 pages
2000 by Antony L. Hosking. Permission To Make Digital or Hard Copies of
No ratings yet
2000 by Antony L. Hosking. Permission To Make Digital or Hard Copies of
26 pages
Disc05 Sols
No ratings yet
Disc05 Sols
8 pages
Disc 4 Sol
No ratings yet
Disc 4 Sol
3 pages
G00pgui v1 (Undetectable) .Lua
No ratings yet
G00pgui v1 (Undetectable) .Lua
25 pages
Constant Propagation: CS 701 Fall 2007
No ratings yet
Constant Propagation: CS 701 Fall 2007
18 pages
Lecture 16
No ratings yet
Lecture 16
19 pages
Classification of Parallel Computers
No ratings yet
Classification of Parallel Computers
16 pages
Microprocessor Lab Manual
No ratings yet
Microprocessor Lab Manual
23 pages
101 EBay Auction Secrets
No ratings yet
101 EBay Auction Secrets
77 pages
ARM Assembly Language Guide: Common ARM Instructions (And Psuedo-Instructions)
No ratings yet
ARM Assembly Language Guide: Common ARM Instructions (And Psuedo-Instructions)
7 pages
Data Modeling Vs Database Design
100% (1)
Data Modeling Vs Database Design
12 pages
Create A PDF File: Exercise 1 and Exercise 2 Produce The Same Result. Choose The One That Works Best For You
No ratings yet
Create A PDF File: Exercise 1 and Exercise 2 Produce The Same Result. Choose The One That Works Best For You
6 pages
TC p42x1x TC p50x1x
100% (1)
TC p42x1x TC p50x1x
124 pages
Chapter 7: Runtime Environment: - Run Time Memory Organization
No ratings yet
Chapter 7: Runtime Environment: - Run Time Memory Organization
18 pages
Get Started
No ratings yet
Get Started
54 pages
Inside Careers Guide To Management Consultancy 201415
No ratings yet
Inside Careers Guide To Management Consultancy 201415
96 pages
1.1) NAND Gates As OR Gate: Virtual Labs (Vlabs - Ac.in)
No ratings yet
1.1) NAND Gates As OR Gate: Virtual Labs (Vlabs - Ac.in)
9 pages
Modeling Registers With Uvm Tom Fitzpatrick
No ratings yet
Modeling Registers With Uvm Tom Fitzpatrick
45 pages
Cots Casadesus Marimon - 2014 - Benefits of Implementing Service Management Standard ISO 20000 Arrossegat 1-Libre
No ratings yet
Cots Casadesus Marimon - 2014 - Benefits of Implementing Service Management Standard ISO 20000 Arrossegat 1-Libre
12 pages
DB Normalization and Design
No ratings yet
DB Normalization and Design
11 pages
ReadME NullDC
No ratings yet
ReadME NullDC
10 pages
DS-2DY9236I-CWX (W/316L) 2MP 36× IR Explosion-Proof Positioning System
No ratings yet
DS-2DY9236I-CWX (W/316L) 2MP 36× IR Explosion-Proof Positioning System
5 pages
Adc, Dac, and Sensor Interfacing
No ratings yet
Adc, Dac, and Sensor Interfacing
25 pages
Dallas 2501
No ratings yet
Dallas 2501
1 page
Ti Cxd9926atq
0% (1)
Ti Cxd9926atq
2 pages
slxr-18 1 00-L2guide
No ratings yet
slxr-18 1 00-L2guide
286 pages
User Manual: Wireless N 150 Home Router
No ratings yet
User Manual: Wireless N 150 Home Router
75 pages
ViviCam X018 Camera Manual
No ratings yet
ViviCam X018 Camera Manual
58 pages
PDF TNPM Troubleshooting
No ratings yet
PDF TNPM Troubleshooting
186 pages
Manual Web Jetad
No ratings yet
Manual Web Jetad
82 pages
Practical 3linux Practical For B.tech Student
No ratings yet
Practical 3linux Practical For B.tech Student
6 pages
TechLibrary - Juniper Networks
No ratings yet
TechLibrary - Juniper Networks
1 page
LenovoB520 Compal-La-6951p Free Laptop Schematic PDF
No ratings yet
LenovoB520 Compal-La-6951p Free Laptop Schematic PDF
62 pages
Mcafee Endpoint Security: Frequently Asked Questions
No ratings yet
Mcafee Endpoint Security: Frequently Asked Questions
5 pages
Basic Configuration Steps of Active
No ratings yet
Basic Configuration Steps of Active
5 pages
Papaya Social Game Engine SDK Manual
No ratings yet
Papaya Social Game Engine SDK Manual
70 pages
Beaglebone Black Webcam Server For Security: M.Naveenkrishna Dr. S. Jayanthy
No ratings yet
Beaglebone Black Webcam Server For Security: M.Naveenkrishna Dr. S. Jayanthy
5 pages
PUBG Tutorial
No ratings yet
PUBG Tutorial
3 pages
Cisco DX650 Quick Reference
No ratings yet
Cisco DX650 Quick Reference
2 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet

Advanced Compiler Design and Implementation: Run-Time Support

Uploaded by

Advanced Compiler Design and Implementation: Run-Time Support

Uploaded by

1 of 27

You might also like