Compilers & Interpreters

1. Phases of Compilation
[Figure: Phases of a compiler]
Source Program
  → Lexical Analysis
  → Syntax Analysis
  → Semantic Analysis
  → Intermediate Code Generation
  → Code Optimization
  → Code Generation
  → Target Program
(The Symbol Table Manager and the Error Handler interact with every phase.)
Example:
position := initial + rate * 60

Lexical Analysis produces the tokens:
id1 := id2 + id3 * 60

Syntax Analysis builds the tree:
        :=
       /  \
     id1    +
           / \
         id2   *
              / \
           id3   60
Example contd.:

Semantic Analysis inserts an int-to-real conversion for the constant 60:
        :=
       /  \
     id1    +
           / \
         id2   *
              / \
           id3   inttoreal
                     |
                     60
Intermediate Code Generation:
Temp1 := inttoreal(60)
Temp2 := id3 * Temp1
Temp3 := id2 + Temp2
id1   := Temp3
Example contd.:

Code Optimization:
Temp1 := id3 * 60.0
id1   := id2 + Temp1

Code Generation:
MOVE id3, R2
MULR 60.0, R2
MOVE id2, R1
ADDR R2, R1
MOVE R1, id1
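As a rough illustration of the first phase above, a minimal tokenizer for the example statement might look like this (Python is used only for illustration; the token names and patterns are our own assumptions, not the book's):

```python
import re

# Token patterns for a tiny assignment language (illustrative names only).
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("ASSIGN", r":="),
    ("ID",     r"[A-Za-z_]\w*"),
    ("OP",     r"[+\-*/]"),
    ("SKIP",   r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{n}>{p})" for n, p in TOKEN_SPEC))

def tokenize(src):
    """Return (kind, lexeme) pairs -- the output of lexical analysis."""
    tokens = []
    for m in MASTER.finditer(src):
        if m.lastgroup != "SKIP":
            tokens.append((m.lastgroup, m.group()))
    return tokens

print(tokenize("position := initial + rate * 60"))
```

In a real compiler each identifier would also be entered into the symbol table, which is how `position`, `initial`, `rate` become id1, id2, id3 in the later phases.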
• Each phase transforms the source program from one representation to another.
• The typical decomposition of a compiler into phases results in the conversion of the source program into the target program.
5. Code Optimization:
• Attempts to improve the intermediate code.
• Results in faster-running machine code.
• Optimization improves the running time of the target program.
• But it must not unduly slow down compilation.
6. Code Generation:
• Final phase of the compiler, which generates the target code.
• The target code consists of relocatable machine code or assembly code.
• Here memory locations are selected for each variable used in the program.
Aspects of Compilation

[Figure: a compiler/interpreter bridges the semantic gap between the PL domain and the execution domain.]
• Two aspects:
  – Generate code.
  – Provide diagnostics.
• To understand the implementation issues, we should know the PL features contributing to the semantic gap between the PL domain and the execution domain.
• PL Features:
  1. Data Types
  2. Data Structures
  3. Scope Rules
  4. Control Structures
1. Data Types
• Definition: A data type is the specification of
  (i) legal values for variables of the type, and
  (ii) legal operations on the legal values of the type.
• Tasks:
  1. Check the legality of an operation for the types of its operands.
  2. Use type conversion operations wherever necessary & permissible.
var
  x, y : real;
  i, j : integer;
begin
  y := 10;
  x := y + i;
Here type conversion of i is needed.

For i : integer; a, b : real; the statement a := b + i; is compiled into code such as:
  CONV_R AREG, I
  ADD_R  AREG, B
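The two tasks above — legality checking and implicit conversion — can be sketched as follows; the rules table and function name are illustrative assumptions, not the book's notation:

```python
# Illustrative type checker for a binary '+': decides the result type
# and records an int-to-real conversion where permissible.
CONVERTIBLE = {("integer", "real"), ("real", "integer")}

def check_add(t_left, t_right):
    """Return (result_type, conversions) or raise on an illegal mix."""
    if t_left == t_right:
        return t_left, []
    if (t_left, t_right) in CONVERTIBLE:
        conv = (["convert left to real"] if t_left == "integer"
                else ["convert right to real"])
        return "real", conv
    raise TypeError(f"cannot add {t_left} and {t_right}")

print(check_add("real", "integer"))   # the x := y + i case above
```

For `x := y + i` the checker reports result type real and an inserted conversion of `i`, which is exactly what the `CONV_R` instruction implements.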
2. Data Structures
• A PL permits the declaration of data structures (DS) and their use.
• To compile a reference to an element of a DS, the compiler must develop a memory mapping to access the allocated area.
• A record, a heterogeneous DS, leads to complex memory mapping.
• User-defined DSs require mappings of different kinds.
• A proper combination of mappings is required to manage such structural complexity.
• Two kinds of mapping are involved.
Example:
Program example (input, output);
type
  employee = record
    name : array [1..10] of character;
    sex  : character;
    id   : integer
  end;
  weekday = (mon, tue, wed, thur, fri, sat, sun);
var
  info  : array [1..500] of employee;
  today : weekday;
  i, j  : integer;
begin {main program}
  today := mon;
  info[i].id := j;
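The memory mapping the compiler must develop for `info[i].id` amounts to an offset computation; the sizes below are illustrative assumptions (1 byte per character, 2 bytes per integer, no padding), not fixed by the book:

```python
# Illustrative address mapping for info[i].id.
NAME_SIZE = 10 * 1                 # name : array [1..10] of character
SEX_SIZE  = 1                      # sex  : character
ID_OFFSET = NAME_SIZE + SEX_SIZE   # offset of the id field inside a record
RECORD_SIZE = ID_OFFSET + 2        # total size of one employee record

def address_of_id(base, i):
    """Address of info[i].id for a 1-based array starting at `base`."""
    return base + (i - 1) * RECORD_SIZE + ID_OFFSET

print(address_of_id(1000, 1))
print(address_of_id(1000, 3))
```

The two mappings involved are visible here: the array mapping `(i - 1) * RECORD_SIZE` and the record mapping `ID_OFFSET`.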
3. Scope Rules
• Determine the accessibility of variables declared in different blocks of a program.
• Eg: an outer block A declares x, y : real; a nested block B declares y, z : integer;
• The statement x := y in block B uses the y of block B and the x of block A.
• To determine the accessibility of a variable, the compiler performs:
  – Scope Analysis
  – Name Resolution
4. Control Structures
• Def: A control structure is the collection of language features for altering the flow of control during execution.
• This includes:
  – Conditional transfer of control
  – Conditional execution
  – Iterative control
  – Procedure calls
• The compiler must ensure non-violation of program semantics.
• Eg:
    for i := 1 to 100 do
    begin
      lab1 : if i = 10 then
      ……….
      ……….
    end
• Forbidden: transferring control to lab1 from outside the loop.
• Assignments to the loop control variable i are also not allowed.
Memory Allocation
• Three important tasks:
  – Determine memory requirement: to represent the values of data items.
  – Determine memory allocation: to implement the lifetime & scope of data items.
  – Determine memory mapping: to access values in non-scalar data items.
• Binding: A memory binding is an association between the memory address attribute of a data item & the address of a memory area.
• Topic list:
  A. Static and dynamic memory allocation
  B. Memory allocation in block-structured languages
  C. Array allocation & access
[A] Static & Dynamic Allocation

Static Memory Allocation:
1. Memory is allocated to a variable before execution of the program begins.
2. Performed during compilation.
3. At execution no allocation & de-allocation is performed.
4. A variable remains permanently allocated until execution ends.
5. Eg: FORTRAN.
6. No flavors or types.
7. Advantages: simplicity and faster access.
8. More memory wastage compared to dynamic allocation.

Dynamic Memory Allocation:
1. Memory allocation for a variable is done at the time of execution of the program.
2. Performed during execution.
3. Allocation & de-allocation actions occur during execution.
4. Variables swap between the allocated state & the free state.
5. Eg: PL/I, ADA, Pascal.
6. Two types/flavors: 1. automatic allocation, 2. program-controlled allocation.
7. Advantages: supports recursion and dynamically sized DS.
8. Less memory wastage compared to static allocation.

[Figure: Under static allocation, Code(A), Code(B), Code(C) and Data(A), Data(B), Data(C) are all resident throughout. Under dynamic allocation only the data of active units is allocated — Data(A) when only A is active; Data(A) and Data(B) when A & B are active.]
Automatic v/s Program-Controlled Dynamic Allocation

Automatic Dynamic Allocation:
1. Implies memory binding performed at the execution initiation time of a program unit.
2. Memory is allocated to declared variables when execution of the unit starts, purely based on scope.
3. De-allocated when the program unit is exited.
4. Different memory areas may be allocated to the same variable in different activations of the program unit.
5. Implemented using a stack, since entry and exit of program units follow a last-in, first-out order.

Program-Controlled Dynamic Allocation:
1. Implies memory binding performed during the execution of a program unit.
2. Memory is allocated not at execution initiation but when the variable is used for the very first time during execution.
3. De-allocated at arbitrary points during execution.
[B] Memory Allocation in Block-Structured Languages
• A block contains data declarations.
• There may be a nested structure of blocks.
• Block-structured languages use dynamic memory allocation.
• Eg: PL/I, Pascal, ADA
• Sub-topic list:
  1. Scope rules
  2. Memory allocation & access
  3. Accessing non-local variables
  4. Symbol table requirements
  5. Recursion
  6. Limitations of stack-based memory allocation
1. Scope Rules:
• Suppose a variable vari is created with name namei in block B.
• Rule 1: vari can be accessed in any statement situated in block B.
• Rule 2: vari can also be accessed in any statement situated in a block B' enclosed in B, unless B' contains a declaration using the same name namei.
• For a block, variables satisfying Rule 1 are local; variables satisfying Rule 2 are non-local.
• Eg: block A declares x, y, z : integer; nested block B declares g : real; block C (nested in B) declares h, z : real; block D declares i, j.

Block | Local variables | Non-local variables
A     | xA, yA, zA      | —
B     | gB              | xA, yA, zA
C     | hC, zC          | xA, yA, gB
D     | iD, jD          | xA, yA, zA
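The two rules can be sketched as a name lookup that walks outward through enclosing blocks (a hypothetical helper, not the book's algorithm):

```python
# Illustrative name resolution: each block has a name->variable dict
# and a link to its enclosing block (None for the outermost block).
class Block:
    def __init__(self, name, enclosing=None):
        self.name = name
        self.enclosing = enclosing
        self.decls = {}

def resolve(block, name):
    """Rule 1: look in the block itself; Rule 2: walk the enclosing blocks."""
    b = block
    while b is not None:
        if name in b.decls:
            return b.decls[name]
        b = b.enclosing
    raise NameError(name)

A = Block("A"); A.decls = {"x": "xA", "y": "yA", "z": "zA"}
B = Block("B", A); B.decls = {"g": "gB"}
C = Block("C", B); C.decls = {"h": "hC", "z": "zC"}

print(resolve(C, "z"))   # zC -- C's own z hides A's z
print(resolve(C, "x"))   # xA -- found by walking out to A
```

This reproduces the table above: from C, `z` resolves to zC (the redeclaration hides zA), while `x` resolves to the non-local xA.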
2. Memory Allocation & Access
• Automatic dynamic allocation is implemented using the extended stack model.
• Minor variation: each activation record has two reserved pointers, determining its scope.
• AR: Activation Record
• ARB: Activation Record Base
• See the following figure:
[Figure: Nested blocks A (x : real), B (y : char), C (z, w : integer); C contains statements such as z := 10; x := z. Each AR on the stack begins with two reserved pointers, 0(ARB) and 1(ARB); the stack holds AR of A (x), AR of B (y), AR of C (z, w), with TOS pointing at the stack top.]
• Allocation:
  1. TOS := TOS + 1;
  2. TOS* := ARB;
  3. ARB := TOS;
  4. TOS := TOS + 1;
  5. TOS* := ……….. (set the second reserved pointer)
  6. TOS := TOS + n;
• So the address of ‘z’ is <ARB> + dz.
• De-allocation:
  1. TOS := ARB – 1;
  2. ARB := ARB* ;
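Under stated assumptions (a flat Python list as the stack, list indices as addresses), the allocation and de-allocation steps above can be sketched as:

```python
# Illustrative extended stack model: ARB and TOS are indices into `stack`.
stack = [None] * 100
ARB, TOS = 0, 0

def enter_block(n_vars, static_ptr=None):
    """Allocation steps 1-6: push dynamic pointer, static pointer, n variables."""
    global ARB, TOS
    TOS += 1
    stack[TOS] = ARB          # first reserved pointer 0(ARB): old ARB
    ARB = TOS
    TOS += 1
    stack[TOS] = static_ptr   # second reserved pointer 1(ARB)
    TOS += n_vars             # reserve space for the block's variables

def exit_block():
    """De-allocation: pop the whole AR, restore the enclosing ARB."""
    global ARB, TOS
    TOS = ARB - 1
    ARB = stack[ARB]

enter_block(1)        # block A: x
enter_block(1)        # block B: y
enter_block(2)        # block C: z, w  -> address of z is ARB + 2
print(ARB, TOS)
exit_block()
print(ARB, TOS)
```

After entering C, the first variable z sits at ARB + 2 (just past the two reserved pointers), matching the <ARB> + dz addressing above.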
3. Accessing Non-Local Variables
• Notation:
  – nl_var : a non-local variable
  – b_def : the block in which nl_var is defined
  – b_use : the block using nl_var
• A textual ancestor of block b_use is a block which encloses b_use; here b_def is a textual ancestor of b_use.
• A level-m ancestor is the block which immediately encloses the level-(m–1) ancestor.
• s_nest_b_use : static nesting level of block b_use.
• Rule: when b_use is in execution, b_def must be active, i.e. AR(b_def) exists in the stack while AR(b_use) is being executed.
• nl_var is accessed via the start address of AR(b_def) + d(nl_var).
(i) Static Pointer:
• Access to non-local variables is implemented using the second reserved pointer in the AR.
• 0(ARB) is the dynamic pointer; 1(ARB) is the static pointer.
• At the time of creation of the AR for block B, its static pointer is set to point to the AR of the static ancestor of B.
• Access of a non-local variable at level difference m:
  1. r := ARB;
  2. Repeat step 3 m times:
  3.   r := 1(r);
  4. Access nl_var using the address <r> + d(nl_var).
• Example (status after execution):
  – When the AR is created: TOS := TOS + 1; TOS* := address of the AR of the level-1 ancestor.
  – To access x at the statement x := z (level difference 2):
      r := ARB;
      r := 1(r);
      r := 1(r);
[Figure: Stack of ARs for blocks A (x), B (y), C (z, w); each AR's dynamic pointer links to the previous AR and its static pointer links to the AR of its static ancestor; TOS marks the stack top.]
(ii) Displays:
• For a large level difference, it is expensive to access non-local variables using static pointers.
• A display is an array used to improve the efficiency of non-local variable access.
• For a block B at static nesting level s_nestB:
  – Display[1] = address of the AR of the level-(s_nestB – 1) ancestor of B.
  – Display[2] = address of the AR of the level-(s_nestB – 2) ancestor of B.
  – …
  – Display[s_nestB – 1] = address of the AR of the level-1 ancestor of B.
  – Display[s_nestB] = address of the AR of B itself.
[Figure: Stack of ARs for blocks A (x), B (y), C (z, w), with display entries #1, #2, #3 pointing at the three ARs; TOS marks the stack top.]
4. Symbol Table Requirements:
• To support dynamic allocation & access, the compiler should perform the following tasks:
  – Determine the static nesting level of b_current.
  – Determine the variable designated by the scope rules.
  – Determine the static nesting level of its defining block & its displacement dv.
  – Generate code.
• The extended stack model is used because it has:
  – The nesting level of b_current.
  – The symbol table for b_current.
• The symbol table records the nesting level and (symbol, displacement) pairs.

[Figure: ARs for blocks A, B, C at nesting levels 1, 2, 3; entries such as X|2, Y|2, Z|2, W|3 record each symbol with its displacement; TOS marks the stack top.]
5. Recursion:
• The extended stack model is well suited to recursion.
• See the program and figure in the book, pg. nos. 175 and 176.
[A] Optimizing Transformations:
• An optimizing transformation is a rule for ‘re-writing’ a segment of a program (its IR).
• It improves execution efficiency without affecting the program's meaning.
• Types:
  – Local (small segments)
  – Global (entire program segments)
• What is the need for this classification?
  – The reason is the difference in cost and benefits of the transformations.
  – What is needed is provided; not less, not more.
• Let’s see a few optimizing transformations:
a) Compile-Time Evaluation
• Certain actions are performed at compile time, certain at execution time.
• This redistribution improves execution efficiency, as certain actions are eliminated from execution.
• This is called ‘constant folding’.
• Constant folding:
  – When all operands of an operation are constants,
  – the operation is performed at compilation time;
  – the result is also a constant,
  – which can replace the original evaluation.
• Thus we eliminate the division operation at execution time.
• Eg: a = 3.14157/2 is replaced by a = 1.570785
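A minimal constant folder over a small expression tree might look like this (the tuple-based tree shape is our own illustration):

```python
import operator

# Expressions are nested tuples (op, left, right), numbers, or variable names.
OPS = {"+": operator.add, "-": operator.sub,
       "*": operator.mul, "/": operator.truediv}

def fold(expr):
    """Replace any operation whose operands are all constants by its value."""
    if not isinstance(expr, tuple):
        return expr                      # a constant or a variable name
    op, left, right = expr
    left, right = fold(left), fold(right)
    if isinstance(left, (int, float)) and isinstance(right, (int, float)):
        return OPS[op](left, right)      # evaluate at 'compile time'
    return (op, left, right)

print(fold(("/", 3.14157, 2)))           # the a = 3.14157/2 case
print(fold(("+", "a", ("*", 2, 30))))    # only the constant part folds
```

Note how the second call folds the inner `2 * 30` while leaving the variable `a` alone — folding applies only where all operands are constants.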
b) Elimination of Common Sub-Expressions
• See ex. 6.31.
• Common sub-expressions are occurrences of expressions yielding the same value, called ‘equivalent expressions’.
• Eg: the second occurrence of b*c can be eliminated.
• They are identified by using triples & quadruples.
• Some compilers use rules of ‘algebraic equivalence’ in common sub-expression elimination.
c) Dead Code
• Code which can be omitted from a program without affecting its results is called dead code.
• Now the question is how to detect whether code is dead or not:
• By checking whether the value assigned in an assignment statement is used anywhere in the program.
• If not, it is dead code.
• Eg: x := <exp> constitutes dead code if the value assigned to x is not used anywhere in the program.
• i.e. the expression constitutes dead code only if it does not produce any side effects.
d) Frequency Reduction
• Eg: 6.33.
• Computations that are independent of the ‘for loop’ are called loop invariant.
• Here, x is a loop invariant, which is moved out of the loop to perform frequency reduction.
• y is indirectly dependent on the loop (through z, i), so frequency reduction is not possible for it.
• Thus, the loop-optimization transformation moves loop-invariant code out of the loop.
e) Strength Reduction
• Strength reduction replaces the occurrence of a time-consuming (‘high strength’) operation by an occurrence of a faster (‘low strength’) operation.
• Example 6.34.
• Here, we are replacing a multiplication operation with addition.
• Beneficial in array references.
• Disadvantage: not recommended for floating-point operands, since it does not guarantee equivalence of results.
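The classic case — computing array-element addresses in a loop — replaces `i * size` by a running sum. A before/after sketch (loop bounds and names are illustrative):

```python
# Before: a multiplication on every iteration (high-strength operation).
def addresses_mul(base, size, n):
    return [base + i * size for i in range(n)]

# After strength reduction: repeated addition (low-strength operation).
def addresses_add(base, size, n):
    out, addr = [], base
    for _ in range(n):
        out.append(addr)
        addr += size          # '+' replaces '*'
    return out

print(addresses_mul(1000, 4, 5))
print(addresses_add(1000, 4, 5))   # same result for integer operands
```

With integers the two versions agree exactly; with floating-point operands the accumulated additions may round differently from the multiplications, which is why the transformation is not recommended there.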
f) Local & Global Optimization
• Two phases:
  – Local
  – Global
• Local optimization: applied over small segments consisting of a few statements.
• Global optimization: applied over a program unit, i.e. a function or procedure.
• Local optimization is a preparatory phase for global optimization, and simplifies certain aspects of it.
• Eg: global optimization need eliminate only the first occurrence of a+b; all other occurrences are eliminated automatically by local optimization.
[B] Local Optimization
• Provides limited benefits at low cost.
• Scope? A basic block, which is an essentially sequential segment of a program.
• Why is the cost low?
  – Sequential nature of a basic block
  – Simplified analysis
  – Applied to one basic block at a time
• Limitation? Loop optimization is beyond the scope of local optimization.
• See the definition of a basic block in the book, pg. no. 203: it has a single entry point and is essentially sequential.
Value Numbers:
• Provide a simple means to determine whether two occurrences of an expression in a basic block are equivalent.
• This technique is applied while identifying basic blocks.
• Condition for equivalence: two expressions ei and ej are equivalent if they are congruent and their operands have the same value numbers.
• See eg. 6.35 and 6.36, pg. 204.
• The value number of a variable starts at 0 and changes on each assignment to it.
• Value numbers are considered only when an operation is to be performed on variables.
• A flag checks whether a value needs to be stored in a temporary location.
• This scheme can be extended to implement ‘constant propagation’.
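A sketch of value numbering over a basic block (the quadruple format and helper names are our own, not the book's):

```python
# Illustrative value numbering over quadruples (op, arg1, arg2, result).
# Two expressions are equivalent if they are congruent (same op) and
# their operands carry the same value numbers.
def value_number(quads):
    var_vn = {}          # current value number of each variable
    next_vn = [0]
    expr_table = {}      # (op, vn1, vn2) -> variable holding that value
    redundant = []

    def vn(operand):
        if isinstance(operand, str):
            if operand not in var_vn:
                var_vn[operand] = next_vn[0]; next_vn[0] += 1
            return ("var", var_vn[operand])
        return ("const", operand)

    for op, a1, a2, res in quads:
        key = (op, vn(a1), vn(a2))
        if key in expr_table:
            redundant.append((res, expr_table[key]))  # reuse earlier value
        else:
            expr_table[key] = res
        var_vn[res] = next_vn[0]; next_vn[0] += 1     # res gets a fresh value number
    return redundant

quads = [("*", "b", "c", "t1"),
         ("*", "b", "c", "t2")]      # second b*c is redundant
print(value_number(quads))
```

Because every assignment gives the target a fresh value number, an intervening assignment to `b` or `c` automatically prevents the false reuse of an old `b*c`.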
[C] Global Optimization
• Requires more analytical effort to establish the feasibility of an optimization.
• Global common sub-expression elimination is done here.
• An occurrence of x*y in block bj can be eliminated if it satisfies two conditions:
  1. Basic block bj is executed only after some block bk ϵ SB has been executed one or more times (ensures x*y is evaluated before bj).
  2. No assignment to x or y has been executed after the last (or only) evaluation of x*y preceding block bj.
• x*y is saved to a temporary location in all blocks bk satisfying condition 1.
• Requirement? Ensure that every possible execution of the program satisfies both conditions.
• How would we do this? By analysing the program using two techniques:
  – Control Flow Analysis
  – Data Flow Analysis
PFG: Program Flow Graph
• Def: A PFG for a program P is a directed graph GP = (N, E, n0), where
  – N : the set of basic blocks
  – E : the set of directed edges (bi, bj) indicating the possibility of control flow from the last statement of bi (source node) to the first statement of bj (destination node)
  – n0 : the start node of the program.
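A PFG can be represented directly as an adjacency mapping; the blocks and edges below are an illustrative example, not one taken from the book:

```python
# Illustrative program flow graph G = (N, E, n0) as an adjacency dict.
pfg = {
    "b1": ["b2"],          # n0 = b1
    "b2": ["b3", "b4"],    # a conditional branch
    "b3": ["b2"],          # loop back edge
    "b4": [],              # exit block
}

def successors(b):
    return pfg[b]

def predecessors(b):
    return [src for src, dests in pfg.items() if b in dests]

print(successors("b2"))
print(predecessors("b2"))
```

Successor and predecessor queries like these are the building blocks of the control flow analysis discussed next.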
Control & Data Flow Analysis
• Control & data flow analysis are used to determine whether the conditions governing an optimizing transformation are satisfied.
1. Control Flow Analysis: collects information concerning the program's structure, i.e. nesting of loops.
• A few concepts:
  – Predecessors & successors: if (bi, bj) ϵ E, bi is a predecessor of bj & bj is a successor of bi.
  – Paths: a path is a sequence of edges such that the destination node of one edge is the source node of the following edge.
  – Ancestors & descendants: if a path exists from bi to bj, bi is an ancestor of bj and bj is a descendant of bi.
  – Dominators & post-dominators: block bi is a dominator of block bj if every path from n0 to bj passes through bi; bi is a post-dominator of bj if every path from bj to an exit node passes through bi.
2. Data Flow Analysis:
• Analyses the use of data in the program.
• Data flow information is computed, for the purpose of optimization, at the entry & exit of each basic block.
• Determines whether an optimizing transformation can be applied.
• Concepts:
  – Available expressions
  – Live variables
  – Reaching definitions
• Uses:
  – Common sub-expression elimination
  – Dead code elimination
  – Constant/variable propagation
Available Expressions:
• An occurrence of a global common sub-expression in bi can be eliminated only if:
  1. Conditions 1 & 2 are satisfied at the entry of bi, and
  2. No assignment to x or y precedes the occurrence of x*y in bi.
• How do we determine the availability of an expression at the entry or exit of basic block bi?
• Rules:
  1. Expression e is available at the exit of bi if
     (i) bi contains an evaluation of e not followed by an assignment to any operand of e, or
     (ii) the value of e is available at the entry to bi & bi does not contain an assignment to any operand of e.
  2. Expression e is available at the entry to bi if it is available at the exit of each predecessor of bi in GP.
• It is a forward data flow concept: availability at the exit of a node determines availability at the entry of its successors.
• We associate two Boolean properties with block bi, called the ‘local properties’ of bi, to capture the effect of its computations:
  – Evali : ‘true’ if e is evaluated in bi and the operands of e are not modified afterwards.
  – Modifyi : ‘true’ if an operand of e is modified in bi.
• See the equations on page 209 and the PFG of Example 6.38 on page 210.
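Using the rules above — entry availability is the AND over predecessors' exit availability, and exit availability is Eval or (entry availability and not Modify) — a round-robin fixed-point solver can be sketched as follows; the four-block graph is an illustrative assumption:

```python
# Illustrative available-expression analysis for a single expression e.
# avail_out[b] = eval_[b] or (avail_in[b] and not modify[b])
# avail_in[b]  = AND of avail_out over all predecessors (False for the entry node).
def available(preds, eval_, modify):
    blocks = list(preds)
    avail_in = {b: False for b in blocks}
    avail_out = {b: eval_[b] for b in blocks}
    changed = True
    while changed:                       # iterate to a fixed point
        changed = False
        for b in blocks:
            ain = all(avail_out[p] for p in preds[b]) if preds[b] else False
            aout = eval_[b] or (ain and not modify[b])
            if (ain, aout) != (avail_in[b], avail_out[b]):
                avail_in[b], avail_out[b] = ain, aout
                changed = True
    return avail_in

preds  = {"b1": [], "b2": ["b1"], "b3": ["b1"], "b4": ["b2", "b3"]}
eval_  = {"b1": True,  "b2": False, "b3": True,  "b4": False}
modify = {"b1": False, "b2": True,  "b3": False, "b4": False}
print(available(preds, eval_, modify))
```

Here e is available at the entry of b2 and b3 (both follow b1, which evaluates it), but not at b4: the b2 path modifies an operand, and availability demands "all paths".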
Live Variables:
• A variable var is said to be live at a program point pi in basic block bi if the value contained in it at pi is likely to be used during subsequent execution of the program.
• Otherwise, it is dead, and the code assigning to it can be eliminated as dead code.
• How to determine liveness? By the two properties specified on pg. no. 211.
• See the data flow information again on pg. no. 211.
• Liveness at the entry of a block determines liveness at the exit of its predecessors.
• Hence it is called a ‘backward data flow’ concept.
• It is also called an ‘any path’ concept. Why? Liveness at the entry of any one successor is sufficient to ensure liveness at the exit of a block.
• Eg:
  – a is live at the entry of each block except the fourth block.
  – b is live at all blocks.
  – x & y are live at blocks 1, 5, 6, 7, 8, 9.
INTERPRETERS

Interpreters
• Topic list:
  – Interpreters – use
  – Overview of interpreters
  – Toy interpreter
  – Pure & impure interpreters
Interpreters: Introduction
• Avoid the overhead of compilation.
• A program that is modified before every execution is handled well by interpreters.
• Disadvantage: expensive in terms of CPU time.
• Why? Each statement is subjected to the interpretation cycle:
  – Fetch the statement.
  – Analyse the statement & determine its meaning.
  – Execute the meaning of the statement.
• What is the difference between a compiler and an interpreter?
Compiler v/s Interpreter

Compiler:
• Next phase: during compilation, analysis of a statement is followed by code generation.
• .exe: the compiler converts the program into an executable only once.
• Development: one-time infrastructure development.
• Rate of development: slow.
• Access rate: faster access at a later stage.

Interpreter:
• During interpretation, analysis is followed by actions which implement the statement.
• We can never run the program without interpretation.
• Repetitive development.
• Rate of development: faster.
Compiler:
• Best for: static languages.
• Required: the compiler is required only once.
• Cost: proves cheaper in the longer run.
• Loading: the compiler is loaded only the first time.

Interpreter:
• Best for: dynamic languages.
• The interpreter is required each time the program is executed.
• Proves costly in the longer run.
• The interpreter is needed at each load.
Interpreter: Introduction
• Notation:
  – Tc : average compilation time per statement
  – Te : average execution time per statement
  – Ti : average interpretation time per statement
• Here we assume Ti ≈ Tc, and Tc = 20 · Te (so Te = Tc / 20).
• sizeP = number of statements in program P.
Example:
• Let
  – sizeP = 200,
  – 20 statements are executed for initialization,
  – a loop of 8 statements iterates 10 times,
  – the loop is followed by 20 statements for printing the result.
• Then stmts_executedP = 20 + (10 * 8) + 20 = 120.
• Total execution time:
  – For the compilation model:
    200 · Tc + 120 · Te = 200 · Tc + 6 · Tc = 206 · Tc
  – For the interpretation model:
    120 · Ti = 120 · Tc
• Conclusion: here interpretation is better than compilation as far as total time is concerned.
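The comparison above can be checked numerically; taking Tc = Ti = 1 time unit and Te = Tc/20 as assumed:

```python
# Illustrative cost comparison of the two execution models.
tc = ti = 1.0          # compilation time ~ interpretation time per statement
te = tc / 20           # compiled code executes much faster per statement

size_p = 200
stmts_executed = 20 + 10 * 8 + 20      # = 120

compile_model   = size_p * tc + stmts_executed * te   # compile all, then run
interpret_model = stmts_executed * ti                 # pay only per executed stmt

print(compile_model, interpret_model)
```

The compilation model pays Tc for every one of the 200 statements even though only 120 statement-executions occur, which is why interpretation wins in this scenario.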
Why Use an Interpreter?
• Simplicity.
• Efficiency & certain environmental benefits.
• Recommended when the program requires modification between runs, and when stmts_executedP is less than sizeP.
• Preferred during program development.
• Also when programs are not executed frequently / repeatedly.
Components:
• 1. Symbol Table: holds information concerning the entities present in the program.
• 2. Data Store: contains the values of the data items declared.
• 3. Data Manipulation Routines: a set containing a routine for every legal data manipulation action in the source language.
• Advantages:
  – The meaning of a source statement is implemented through an interpretation routine, which results in a simplified implementation.
  – Avoids generation of machine language instructions.
  – Helps make interpretation portable, since the interpreter itself can be coded in a high-level language.
A Toy Interpreter:
• Steps (for a := b + c, with b real and c integer):
  – ivar[5] = 7;
  – rvar[13] = 1.2;
  – add is called (for a = b + c);
  – addrealint is called;
  – rvar[r_tos] = rvar[13] + ivar[5];
  – the type conversion is made by the interpreter,
  – i.e. rvar[addr1] + ivar[addr2];
  – end.
• See the program of the interpreter on pg. no. 125.
• See the example for the above steps on pg. no. 126.
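A toy interpreter with the three components above — symbol table, data stores ivar/rvar, and data-manipulation routines — could be sketched as follows; the routine mirrors the steps above, but the code itself is our own illustration, not the book's program:

```python
# Illustrative toy interpreter: symbol table + data stores + a routine.
ivar = [0] * 20           # data store for integer variables
rvar = [0.0] * 20         # data store for real variables
symtab = {                # symbol table: name -> (type, address)
    "a": ("real", 14), "b": ("real", 13), "c": ("int", 5),
}

def add(dest, op1, op2):
    """Data manipulation routine for '+', with implicit int-to-real conversion."""
    t1, a1 = symtab[op1]
    t2, a2 = symtab[op2]
    v1 = rvar[a1] if t1 == "real" else ivar[a1]
    v2 = rvar[a2] if t2 == "real" else ivar[a2]
    td, ad = symtab[dest]
    if td == "real":
        rvar[ad] = float(v1 + v2)     # type conversion made by the interpreter
    else:
        ivar[ad] = int(v1 + v2)

ivar[5] = 7               # c
rvar[13] = 1.2            # b
add("a", "b", "c")        # interpret a := b + c
print(rvar[14])           # the interpreted result of a := b + c
```

Note that no machine code is generated anywhere: the meaning of `a := b + c` is carried out directly by the `add` routine, which is the simplification the slides describe.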
Pure & Impure Interpreters:
• Pure Interpreters:
  – Here the source program is retained in source form all through interpretation.
  – See fig. 6.34 (a), pg. no. 217.
  – Disadvantage: this arrangement incurs substantial analysis overheads while interpreting a statement.
• Impure Interpreters:
  – See fig. 6.34 (b), pg. no. 217.
  – Perform some preliminary processing of the source program to reduce the analysis overhead during interpretation.
  – A pre-processor converts the program to an IR (intermediate code, IC) which is used during interpretation.
  – See Ex. 6.42 and Ex. 6.43: IC in postfix notation.
  – This eliminates most of the analysis during interpretation except type analysis; for type analysis the pre-processor is needed.
  – The IC can be analysed more efficiently than the source program; this speeds up interpretation.
  – Disadvantage: the use of an IR implies that the entire program has to be pre-processed after any modification.
  – Thus it incurs a fixed overhead at the start of interpretation.
6th Chapter Ends Here.
• Please start preparing.
• You will not get 1 month of study leave.
• Start preparation now.
• Every day complete 1 chapter.
• Chapters 1, 3, 4 from class work.
• Chapters 5, 6, 7, 8 and Unit 6 from slides.
• From tomorrow we will start chapter 7.