Compiler Design Unit-5
UNIT 5
MACHINE INDEPENDENT OPTIMIZATION
The elimination of unnecessary instructions in object code, or the replacement of one sequence of
instructions by a faster sequence that does the same thing, is usually called "code improvement" or
"code optimization."
Optimizations are classified into two categories:
Machine-independent optimizations are program transformations that improve the target
code without taking into consideration any properties of the target machine.
Machine-dependent optimizations are based on register allocation and the utilization of special
machine-instruction sequences.
Function-Preserving Transformations: There are a number of ways in which a compiler can improve
a program without changing the function it computes:
Common sub-expression elimination
Copy propagation
Dead-code elimination
Constant folding
Common Sub-expression Elimination:
An occurrence of an expression E is called a common sub-expression if E was previously
computed, and the values of variables in E have not changed since the previous computation. We can
avoid recomputing the expression if we can use the previously computed value.
• For example
t1: = 4*i
t2: = a [t1]
t3: = 4*j
t4: = 4*i
t5: = n
t6: = b [t4] +t5
The above code can be optimized using the common sub-expression elimination as
t1: = 4*i
t2: = a [t1]
t3: = 4*j
t5: = n
t6: = b [t1] +t5
The common sub-expression t4 := 4*i is eliminated, as its value is already computed in t1 and the
value of i has not changed between that definition and this use.
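To make the bookkeeping concrete, the following is a minimal sketch (not from the text) of local common sub-expression elimination over a basic block of quadruples of the form (dest, op, arg1, arg2); the tuple layout and function name are assumptions of the example.

def local_cse(block):
    # available maps (op, arg1, arg2) -> a name already holding that value
    available = {}
    out = []
    for dest, op, arg1, arg2 in block:
        key = (op, arg1, arg2)
        if key in available:
            # The expression was computed earlier: reuse it as a copy.
            out.append((dest, '=', available[key], None))
        else:
            out.append((dest, op, arg1, arg2))
        # Redefining dest kills any expression that uses dest, and any
        # saved value that lives in dest.
        available = {k: v for k, v in available.items()
                     if dest not in (k[1], k[2]) and v != dest}
        if dest not in (arg1, arg2):
            available[key] = dest
    return out

print(local_cse([('t1', '*', '4', 'i'),
                 ('t3', '*', '4', 'j'),
                 ('t4', '*', '4', 'i')]))
# t4 := 4*i becomes the copy t4 := t1, as in the example above.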
Copy Propagation:
Assignments of the form f := g are called copy statements, or copies for short. The idea behind the
copy-propagation transformation is to use g for f wherever possible after the copy statement f := g.
In short, copy propagation means using one variable in place of another.
• For example:
x = Pi;
A = x * r * r;
After propagating the copy, the second statement becomes A = Pi * r * r; the assignment to x may
then become dead code and be eliminated.
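The transformation can be sketched the same way. A minimal illustration (not from the text), reusing the quadruple layout assumed in the earlier sketch:

def propagate_copies(block):
    copies = {}                      # f -> g for each active copy f := g
    out = []
    for dest, op, a, b in block:
        # Substitute g for f in the operands wherever a copy is active.
        a = copies.get(a, a)
        b = copies.get(b, b)
        out.append((dest, op, a, b))
        # A new value in dest invalidates any copy involving dest.
        copies = {f: g for f, g in copies.items() if dest not in (f, g)}
        if op == '=':
            copies[dest] = a
    return out

print(propagate_copies([('x', '=', 'Pi', None),
                        ('A', '*', 'x', 'r')]))
# The use of x becomes a use of Pi: [('x','=','Pi',None), ('A','*','Pi','r')]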
Dead-Code Eliminations:
A variable is live at a point in a program if its value can be used subsequently; otherwise, it is
dead at that point.
Example:
i=0;
if(i==1)
{
a=b+5;
}
Here, the ‘if’ statement is dead code because the condition i == 1 can never be satisfied, so the
entire statement can be eliminated.
Constant folding:
Deducing at compile time that the value of an expression is a constant and using the constant
instead is known as constant folding. One advantage of copy propagation is that it often turns the copy
statement into dead code.
For example,
a = 3.14157/2 can be replaced by
a = 1.570785
with the division performed once at compile time.
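A minimal sketch of the idea (the function name and operator table are illustrative): when both operands of an operation are compile-time constants, the compiler evaluates the operation once and emits the result instead.

def fold(op, left, right):
    # Evaluate op at compile time if both operands are constants;
    # return None to signal "no folding possible".
    ops = {'+': lambda a, b: a + b, '-': lambda a, b: a - b,
           '*': lambda a, b: a * b, '/': lambda a, b: a / b}
    if isinstance(left, (int, float)) and isinstance(right, (int, float)):
        return ops[op](left, right)
    return None

print(fold('/', 3.14157, 2))   # 1.570785: no division remains in the object code
print(fold('/', 'x', 2))       # None: x is not a constant, leave the code alone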
Loop Optimizations:
In loops, especially in the inner loops, programs tend to spend the bulk of their time. The
running time of a program may be improved if the number of instructions in an inner loop is decreased,
even if we increase the amount of code outside that loop.
Three techniques are important for loop optimization:
1. Code motion, which moves loop-invariant code outside a loop;
2. Induction-variable elimination, which removes redundant induction variables from inner loops;
3. Reduction in strength, which replaces an expensive operation by a cheaper one, such as a
multiplication by an addition.
Code Motion:
This transformation takes an expression that yields the same result independent of the number of
times a loop is executed (a loop-invariant computation) and places the expression before the loop. Note
that the notion “before the loop” assumes the existence of an entry for the loop. For example, evaluation
of limit-2 is a loop-invariant computation in the following while-statement:
while (i <= limit-2) /* statement does not change limit */
Code motion moves the computation out of the loop:
t = limit-2;
while (i <= t) /* statement does not change limit or t */
Induction Variables:
Loops are usually processed inside out. For example, consider the loop around B3 in Fig. 5.3. Note
that the values of j and t4 remain in lock-step: every time the value of j decreases by 1, that of t4
decreases by 4, because 4*j is assigned to t4. Such identifiers are called induction variables.
When there are two or more induction variables in a loop, it may be possible to get rid of all but
one, by the process of induction-variable elimination. For the inner loop around B3 in Fig.5.3 we cannot
get rid of either j or t4 completely; t4 is used in B3 and j in B4.
However, we can illustrate reduction in strength and a part of the process of induction-variable
elimination here. Eventually, j will be eliminated when the outer loop B2-B5 is considered.
Example:
As the relationship t4 := 4*j surely holds after such an assignment to t4 in Fig. 5.3, and t4 is not
changed elsewhere in the inner loop around B3, it follows that just after the statement j := j-1 the
relationship t4 = 4*j - 4 must hold. We may therefore replace the assignment t4 := 4*j by t4 := t4-4. The
only problem is that t4 does not have a value when we enter block B3 for the first time. Since we must
maintain the relationship t4 = 4*j on entry to block B3, we place an initialization of t4 at the end of
the block where j itself is initialized, shown by the dashed addition to block B1 in Fig. 5.3.
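The effect of the transformation can be pictured at source level. A minimal sketch, assuming a loop that repeatedly needs the value 4*j:

n = 10

# Before: the multiplication 4*j is performed on every iteration.
j, total = n, 0
while j > 0:
    t4 = 4 * j
    total += t4
    j = j - 1

# After: t4 is initialized once, where j is initialized (the dashed
# addition to B1), and t4 := 4*j is replaced by the cheaper t4 := t4 - 4.
j, total2 = n, 0
t4 = 4 * j
while j > 0:
    total2 += t4
    j = j - 1
    t4 = t4 - 4

assert total == total2   # the relationship t4 = 4*j held at every use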
Reduction In Strength:
Reduction in strength replaces expensive operations by equivalent cheaper ones on the target
machine. Certain machine instructions are considerably cheaper than others and can often be used as
special cases of more expensive operators. For example, x² is invariably cheaper to implement as x*x
than as a call to an exponentiation routine. Fixed-point multiplication or division by a power of two is
cheaper to implement as a shift. Floating-point division by a constant can be implemented as
multiplication by a constant, which may be cheaper.
PEEPHOLE OPTIMIZATION
A statement-by-statement code-generation strategy often produces target code that contains
redundant instructions and suboptimal constructs. The quality of such target code can be improved by
applying “optimizing” transformations to the target program.
A simple but effective technique for improving the target code is peephole optimization, a
method for trying to improve the performance of the target program by examining a short sequence of
target instructions (called the peephole) and replacing these instructions by a shorter or faster sequence,
whenever possible.
The peephole is a small, moving window on the target program.
Characteristics of peephole optimizations:
Redundant-instructions elimination
Flow-of-control optimizations
Algebraic simplifications
Use of machine idioms
Unreachable code
Redundant-instructions elimination
Consider the instruction sequence
(1) MOV R0,a
(2) MOV a,R0
We can delete instruction (2), because whenever (2) is executed, (1) will have ensured that the value of
a is already in register R0. If (2) had a label, we could not be sure that (1) was always executed
immediately before (2), and so we could not remove (2).
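A minimal sketch of this peephole check over textual instructions (the instruction format and the labelled parameter, listing instructions that carry labels, are assumptions of the example):

def remove_redundant_moves(code, labelled=()):
    # Delete "MOV a,R0" when it immediately follows "MOV R0,a" and has
    # no label, so control cannot enter between the two instructions.
    out = []
    for instr in code:
        if out and instr not in labelled:
            prev, cur = out[-1].replace(' ', ''), instr.replace(' ', '')
            if prev.startswith('MOV') and cur.startswith('MOV'):
                if prev[3:].split(',') == cur[3:].split(',')[::-1]:
                    continue            # instruction (2) is redundant
        out.append(instr)
    return out

print(remove_redundant_moves(['MOV R0,a', 'MOV a,R0']))
# ['MOV R0,a']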
Unreachable Code:
#define debug 0
….
if ( debug ) {
print debugging information
}
In the intermediate representation, the if-statement may be translated as:
if debug = 1 goto L1
goto L2
L1: print debugging information
L2:..............................................(a)
One obvious peephole optimization is to eliminate jumps over jumps. Thus, no matter what the
value of debug, (a) can be replaced by:
if debug ≠ 1 goto L2
print debugging information
L2:..............................................(b)
If debug is set to 0 at the beginning of the program, constant propagation replaces (b) by:
if 0 ≠ 1 goto L2
print debugging information
L2:..............................................(c)
As the argument of the first statement of (c) evaluates to a constant true, it can be replaced
by goto L2. Then all the statements that print debugging information are manifestly unreachable and
can be eliminated one at a time.
Flow-Of-Control Optimizations:
Unnecessary jumps can be eliminated in either the intermediate code or the target code by
the following kinds of peephole optimizations. We can replace the jump sequence
goto L1
….
L1: goto L2
by the sequence
goto L2
….
L1: goto L2
If there are now no jumps to L1, then it may be possible to eliminate the statement L1: goto L2,
provided it is preceded by an unconditional jump. Similarly, the sequence
if a < b goto L1
….
L1: goto L2
can be replaced by
if a < b goto L2
….
L1: goto L2
Finally, suppose there is only one jump to L1 and L1 is preceded by an unconditional goto.
Then the sequence
goto L1
…….
L1: if a < b goto L2
L3: ..............................................(e)
may be replaced by
if a < b goto L2
goto L3
…….
L3: ..............................................(f)
While the number of instructions in (e) and (f) is the same, we sometimes skip the unconditional jump
in (f), but never in (e). Thus (f) is superior to (e) in execution time.
Algebraic Simplification:
There is no end to the amount of algebraic simplification that can be attempted through peephole
optimization. Only a few algebraic identities occur frequently enough that it is worth considering
implementing them. For example, statements such as
x := x+0 or
x := x * 1
are often produced by straightforward intermediate code-generation algorithms, and they can be
eliminated easily through peephole optimization.
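As a minimal sketch (quadruple layout as in the earlier examples), such a pass can delete the identity outright, or downgrade it to a copy when the destination differs:

def simplify(dest, op, a, b):
    # x := x + 0 and x := x * 1 are no-ops: return None to delete them.
    if a == dest and ((op == '+' and b == 0) or (op == '*' and b == 1)):
        return None
    # x := y + 0 and x := y * 1 reduce to the plain copy x := y.
    if (op == '+' and b == 0) or (op == '*' and b == 1):
        return (dest, '=', a, None)
    return (dest, op, a, b)

print(simplify('x', '+', 'x', 0))   # None: the statement disappears
print(simplify('x', '*', 'y', 1))   # ('x', '=', 'y', None)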
Reduction in Strength:
Reduction in strength replaces expensive operations by equivalent cheaper ones on the target
machine. Certain machine instructions are considerably cheaper than others and can often be used as
special cases of more expensive operators.
x² → x*x
The target machine may have hardware instructions to implement certain specific operations
efficiently. For example, some machines have auto-increment and auto-decrement addressing modes.
These add or subtract one from an operand before or after using its value. The use of these modes
greatly improves the quality of code when pushing or popping a stack, as in parameter passing. These
modes can also be used in code for statements like i : =i+1.
i := i+1 → i++
i := i-1 → i--
DATA-FLOW ANALYSIS
4. Reaching Definitions
5. Live-Variable Analysis
6. Available Expressions
"Data-flow analysis" refers to a body of techniques that derive information about the flow of data along
program execution paths.
When we analyze the behavior of a program, we must consider all the possible sequences of
program points ("paths") through a flow graph that the program execution can take. We then extract,
from the possible program states at each point, the information we need for the particular data-flow
analysis problem we want to solve. In more complex analyses, we must consider paths that jump among
the flow graphs for various procedures, as calls and returns are executed.
Within one basic block, the program point after a statement is the same as the program point
before the next statement.
If there is an edge from block B1 to block B2, then the program point after the last statement
of B1 may be followed immediately by the program point before the first statement of B2.
Thus, we may define an execution path (or just path) from point p1 to point pn to be a sequence of
points p1, p2, ..., pn such that for each i = 1, 2, ..., n-1, either
1. pi is the point immediately preceding a statement and pi+1 is the point immediately following
that same statement, or
2. pi is the end of some block and pi+1 is the beginning of a successor block.
In data-flow analysis, we do not distinguish among the paths taken to reach a program point.
Moreover, we do not keep track of entire states; rather, we abstract out certain details, keeping only the
data we need for the purpose of the analysis. Two examples will illustrate how the same program states
may lead to different information abstracted at a point.
1. To help users debug their programs, we may wish to find out what are all the values a variable may
have at a program point, and where these values may be defined. For instance, we may summarize all
the program states at point (5) by saying that the value of a is one of {1, 243}, and that it may be defined
by one of {d1, d3}. The definitions that may reach a program point along some path are known
as reaching definitions.
2. Suppose, instead, we are interested in implementing constant folding. If a use of the variable x is
reached by only one definition, and that definition assigns a constant to x, then we can simply
replace x by the constant. If, on the other hand, several definitions of x may reach a single program
point, then we cannot perform constant folding on x. Thus, for constant folding we wish to find those
definitions that are the unique definition of their variable to reach a given program point, no matter
which execution path is taken. For point (5) of Fig. 9.12, there is no definition that must be the
definition of a at that point, so this set is empty for a at point (5). Even if a variable has a unique
definition at a point, constant folding applies only if that definition assigns a constant to the variable.
Thus, we may simply describe certain variables as "not a constant," instead of collecting all their
possible values or all their possible definitions.
In each application of data-flow analysis, we associate with every program point a data-flow value that represents an abstraction of the set of all
possible program states that can be observed for that point. The set of possible data-flow values is the
domain for this application. For example, the domain of data-flow values for reaching definitions is the
set of all subsets of definitions in the program.
A particular data-flow value is a set of definitions, and we want to associate with each point in the
program the exact set of definitions that can reach that point. As discussed above, the choice of
abstraction depends on the goal of the analysis; to be efficient, we only keep track of information that is
relevant.
Denote the data-flow values before and after each statement s by IN[s] and OUT[s], respectively.
The data-flow problem is to find a solution to a set of constraints on the IN[s]'s and OUT[s]'s, for all
statements s. There are two sets of constraints: those based on the semantics of the statements ("transfer
functions") and those based on the flow of control.
Transfer Functions
The data-flow values before and after a statement are constrained by the semantics of the statement. For
example, suppose our data-flow analysis involves determining the constant value of variables at points.
If variable a has value v before executing statement b = a, then both a and b will have the value v after
the statement. This relationship between the data-flow values before and after the assignment
statement is known as a transfer function.
Transfer functions come in two flavors: information may propagate forward along execution paths, or it
may flow backwards up the execution paths. In a forward-flow problem, the transfer function of a
statement s, which we shall usually denote fs, takes the data-flow value before the statement and
produces a new data-flow value after the statement. That is,
OUT[s] = fs(IN[s])
Conversely, in a backward-flow problem, the transfer function fs for statement s converts a data-flow
value after the statement to a new data-flow value before the statement. That is,
IN[s] = fs(OUT[s])
Control-Flow Constraints
The second set of constraints on data-flow values is derived from the flow of control. Within a basic
block, control flow is simple. If a block B consists of statements s1, s2, ..., sn in that order, then the
data-flow value out of si is the same as the data-flow value into si+1. That is,
IN[si+1] = OUT[si], for all i = 1, 2, ..., n-1.
However, control-flow edges between basic blocks create more complex constraints between the last
statement of one basic block and the first statement of the following block. For example, if we are
interested in collecting all the definitions that may reach a program point, then the set of definitions
reaching the leader statement of a basic block is the union of the definitions after the last statements of
each of the predecessor blocks. The next section gives the details of how data flows among the blocks.
Suppose block B consists of statements s1, ..., sn, in that order. If s1 is the first statement of basic
block B, then IN[B] = IN[s1]. Similarly, if sn is the last statement of basic block B, then OUT[B] =
OUT[sn]. The transfer function of a basic block B, which we denote fB, can be derived by composing
the transfer functions of the statements in the block. That is, let fsi be the transfer function of
statement si. Then fB = fsn ∘ ... ∘ fs2 ∘ fs1. The relationship between the
beginning and end of the block is
OUT[B] = fB(IN[B])
The constraints due to control flow between basic blocks can easily be rewritten by
substituting IN[B] and OUT[B] for IN[s1] and OUT[sn], respectively. For instance, if data-flow values
are information about the sets of constants that may be assigned to a variable, then we have a forward-
flow problem in which
IN[B] = ∪ OUT[P], over the predecessors P of B.
When the data-flow is backwards, as we shall soon see in live-variable analysis, the equations are
similar, but with the roles of the IN's and OUT's reversed. That is,
OUT[B] = ∪ IN[S], over the successors S of B.
Unlike linear arithmetic equations, the data-flow equations usually do not have a unique solution. Our
goal is to find the most "precise" solution that satisfies the two sets of constraints: control-flow and
transfer constraints. That is, we need a solution that encourages valid code improvements, but does not
justify unsafe transformations — those that change what the program computes.
4. Reaching Definitions
"Reaching definitions" is one of the most common and useful data-flow schemas. By knowing where in
a program each variable x may have been defined when control reaches each point p, we can determine
many things about x. For just two examples, a compiler then knows whether x is a constant at
point p, and a debugger can tell whether it is possible for x to be an undefined variable, should x be used
at p.
We say a definition d reaches a point p if there is a path from the point immediately
following d to p, such that d is not "killed" along that path. We kill a definition of a variable x if there is
any other definition of x anywhere along the path. If a definition d of some variable x reaches point p,
then d might be the place at which the value of x used at p was last defined.
Consider a definition d: u = v + w. Here, and frequently in what follows, + is used as a generic binary
operator. This statement "generates" a definition d of variable u and "kills" all the
other definitions in the program that define variable u, while leaving the remaining incoming
definitions unaffected. The transfer function of definition d thus can be expressed as
fd(x) = gend ∪ (x - killd) ..............(9.1)
where gend = {d}, the set of definitions generated by the statement, and killd is the set of all other
definitions of u in the program.
The transfer function of a basic block can be found by composing the transfer functions of the
statements contained therein. The composition of functions of the form (9.1), which we shall refer to as
"gen-kill form," is also of that form, as we can see as follows. Suppose there are two functions f1(x) =
gen1 ∪ (x - kill1) and f2(x) = gen2 ∪ (x - kill2). Then
f2(f1(x)) = gen2 ∪ ((gen1 ∪ (x - kill1)) - kill2)
         = (gen2 ∪ (gen1 - kill2)) ∪ (x - (kill1 ∪ kill2))
That is, the composition is again in gen-kill form, with gen = gen2 ∪ (gen1 - kill2) and
kill = kill1 ∪ kill2.
This rule extends to a block consisting of any number of statements. Suppose block B has n statements,
with transfer functions fi(x) = geni ∪ (x - killi) for i = 1, 2, ..., n. Then the transfer function for
block B may be written as:
fB(x) = genB ∪ (x - killB)
where
killB = kill1 ∪ kill2 ∪ ... ∪ killn
genB = genn ∪ (genn-1 - killn) ∪ (genn-2 - killn-1 - killn) ∪ ... ∪ (gen1 - kill2 - kill3 - ... - killn)
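The gen-kill algebra is easy to check concretely. A minimal sketch with Python sets (the definition names d1, d2, d3 are illustrative):

def gen_kill(gen, kill):
    # Build a transfer function f(x) = gen ∪ (x - kill).
    return lambda x: gen | (x - kill)

def compose(gen2, kill2, gen1, kill1):
    # f2 ∘ f1 is again in gen-kill form:
    # gen = gen2 ∪ (gen1 - kill2), kill = kill1 ∪ kill2.
    return gen2 | (gen1 - kill2), kill1 | kill2

gen1, kill1 = {'d1'}, {'d2'}       # f1 for a statement generating d1
gen2, kill2 = {'d2'}, {'d1'}       # f2 for a statement generating d2
g, k = compose(gen2, kill2, gen1, kill1)
x = {'d3'}                          # definitions reaching the block
assert gen_kill(g, k)(x) == gen_kill(gen2, kill2)(gen_kill(gen1, kill1)(x))
print(g, k)                         # gen {'d2'}, kill {'d1','d2'} (set order may vary)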
Thus, like a statement, a basic block also generates a set of definitions and kills a set of definitions. The
gen set contains all the definitions inside the block that are "visible" immediately after the block; we
refer to them as downwards exposed. A definition is downwards exposed in a basic block only if it is
not "killed" by a subsequent definition of the same variable inside the same basic block. A basic block's
kill set is simply the union of all the definitions killed by the individual statements. Notice that a
definition may appear in both the gen and kill set of a basic block. If so, the fact that it is in gen takes
precedence, because in gen-kill form, the kill set is applied before the gen set.
For example, the gen set for a basic block consisting of the two statements
d1: a = 3
d2: a = 4
is {d2}, since d1 is not downwards exposed. The kill set contains both d1 and d2, since d1 kills d2
and vice versa. Nonetheless, since the subtraction of the kill set precedes the union operation with the
gen set, the result of the transfer function for this block always includes definition d2.
We refer to union as the meet operator for reaching definitions. In any data-flow schema, the meet
operator is the one we use to create a summary of the contributions from different paths at the
confluence of those paths.
Algorithm 9.11: Reaching definitions.
INPUT: A flow graph for which killB and genB have been computed for each block B.
OUTPUT: IN[B] and OUT[B], the set of definitions reaching the entry and exit of each
block B of the flow graph.
METHOD: We use an iterative approach, in which we start with the "estimate" OUT[B] = ∅ for
all B and converge to the desired values of IN and OUT. As we must iterate until the IN's (and hence
the OUT's) converge, we could use a boolean variable change to record, on each pass through the
blocks, whether any OUT has changed. However, in this and in similar algorithms described later, we
assume that the exact mechanism for keeping track of changes is understood, and we elide those details.
The algorithm (Fig. 9.14) is sketched below. The first two lines initialize certain data-flow values. Line (3)
starts the loop in which we iterate until convergence, and the inner loop of lines (4) through (6) applies
the data-flow equations to every block other than the entry:
1) OUT[ENTRY] = ∅;
2) for (each basic block B other than ENTRY) OUT[B] = ∅;
3) while (changes to any OUT occur)
4)     for (each basic block B other than ENTRY) {
5)         IN[B] = ∪ OUT[P], over the predecessors P of B;
6)         OUT[B] = genB ∪ (IN[B] - killB);
       }
Algorithm 9.11 propagates definitions as far as they will go without being killed, thus simulating all
possible executions of the program. Algorithm 9.11 will eventually halt, because for every B, OUT[B]
never shrinks; once a definition is added, it stays there forever. (See Exercise 9.2.6.) Since the set of all
definitions is finite, eventually there must be a pass of the while-loop during which nothing is added to
any OUT, and the algorithm then terminates. We are safe terminating then because if the OUT's have
not changed, the IN's will not change on the next pass. And, if the IN's do not change, the OUT's
cannot, so on all subsequent passes there can be no changes.
The number of nodes in the flow graph is an upper bound on the number of times around the while-
loop. The reason is that if a definition reaches a point, it can do so along a cycle-free path, and the
number of nodes in a flow graph is an upper bound on the number of nodes in a cycle-free path. Each
time around the while-loop, each definition progresses by at least one node along the path in question,
and it often progresses by more than one node, depending on the order in which the nodes are visited.
In fact, if we properly order the blocks in the for-loop of line (5), there is empirical evidence that the
average number of iterations of the while-loop is under 5 (see Section 9.6.7). Since sets of definitions
can be represented by bit vectors, and the operations on these sets can be implemented by logical
operations on the bit vectors, Algorithm 9.11 is surprisingly efficient in practice.
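A sketch of Algorithm 9.11 in this style, using Python integers as the bit vectors (bit i stands for definition di); the two-block loop in the driver is illustrative, not Fig. 9.13:

def reaching_definitions(preds, gen, kill):
    # preds maps each block to its list of predecessors.
    IN = {b: 0 for b in preds}
    OUT = {b: 0 for b in preds}          # start from OUT[B] = empty set
    changed = True
    while changed:                        # iterate until the OUT's converge
        changed = False
        for b in preds:
            IN[b] = 0
            for p in preds[b]:            # union over all predecessors
                IN[b] |= OUT[p]
            new = gen[b] | (IN[b] & ~kill[b])
            if new != OUT[b]:
                OUT[b], changed = new, True
    return IN, OUT

# B1 defines d1 (bit 0); B2, in a loop, defines d2 (bit 1); each kills
# the other because both define the same variable.
preds = {'B1': [], 'B2': ['B1', 'B2']}
gen   = {'B1': 0b01, 'B2': 0b10}
kill  = {'B1': 0b10, 'B2': 0b01}
print(reaching_definitions(preds, gen, kill))
# IN['B2'] = 0b11: both definitions reach the top of the loop body.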
Example 9.12: We shall represent the seven definitions d1, d2, ..., d7 in the flow graph of Fig.
9.13 by bit vectors, where bit i from the left represents definition di. The union of sets is computed by
taking the logical OR of the corresponding bit vectors. The difference of two sets S - T is computed by
complementing the bit vector of T, and then taking the logical AND of that complement with the bit
vector for S.
Shown in the table of Fig. 9.15 are the values taken on by the IN and OUT sets in Algorithm 9.11. The
initial values, indicated by a superscript 0, as in OUT[B]^0, are assigned by the loop of line (2) of Fig.
9.14. They are each the empty set, represented by bit vector 000 0000. The values of subsequent passes
of the algorithm are also indicated by superscripts, and labeled IN[B]^1 and OUT[B]^1 for the first pass
and IN[B]^2 and OUT[B]^2 for the second.
Suppose the for-loop of lines (4) through (6) is executed with B taking on the values B1, B2, B3, B4,
EXIT, in that order. With B = B1, since OUT[ENTRY] = ∅, IN[B1]^1 is the empty set, and OUT[B1]^1
is genB1. This value differs from the previous value OUT[B1]^0, so
we now know there is a change on the first round (and will proceed to a second round).
Notice that after the second round, OUT[B2] has changed to reflect the fact that d6 also reaches the
beginning of B2 and is not killed by B2. We did not learn that fact on the first pass, because the path
from d6 to the end of B2, which is B3 → B4 → B2, is not traversed in that order by a single pass. That
is, by the time we learn that d6 reaches the end of B4, we have already computed IN[B2] and OUT[B2]
on the first pass.
There are no changes in any of the OUT sets after the second pass. Thus, after a third pass, the
algorithm terminates, with the IN's and OUT's as in the final two columns of Fig. 9.15.
5. Live-Variable Analysis
Some code-improving transformations depend on information computed in the direction opposite to the
flow of control in a program; we shall examine one such example now. In live-variable analysis we
wish to know for variable x and point p whether the value of x at p could be used along some path in the
flow graph starting at p. If so, we say x is live at p; otherwise, x is dead at p.
An important use for live-variable information is register allocation for basic blocks. Aspects of this
issue were introduced in Sections 8.6 and 8.8. After a value is computed in a register, and presumably
used within a block, it is not necessary to store that value if it is dead at the end of the block. Also, if all
registers are full and we need another register, we should favor using a register with a dead value, since
that value does not have to be stored.
Here, we define the data-flow equations directly in terms of IN[B] and OUT[B], which represent
the set of variables live at the points immediately before and after block B, respectively. These
equations can also be derived by first defining the transfer functions of individual statements and
composing them to create the transfer function of a basic block. Define:
1. defB as the set of variables defined (i.e., definitely assigned values) in B prior to any use of that
variable in B, and
2. useB as the set of variables whose values may be used in B prior to any definition of the variable.
Example 9.13: For instance, block B2 in Fig. 9.13 definitely uses i. It also uses j before any
redefinition of j, unless it is possible that i and j are aliases of one another. Assuming there are no
aliases among the variables in Fig. 9.13, then useB2 = {i, j}. Also, B2 clearly defines i and j.
Assuming there are no aliases, defB2 = {i, j} as well.
As a consequence of the definitions, any variable in useB must be considered live on entrance to block
B, while definitions of variables in defB definitely are dead at the beginning of B. In effect,
membership in defB "kills" any opportunity for a variable to be live because of paths that begin at B.
Thus, the equations relating def and use to the unknowns IN and OUT are defined as follows:
IN[EXIT] = ∅
IN[B] = useB ∪ (OUT[B] - defB)
OUT[B] = ∪ IN[S], over the successors S of B
The first equation specifies the boundary condition, which is that no variables are live on exit from the
program. The second equation says that a variable is live coming into a block if either it is used before
redefinition in the block or it is live coming out of the block and is not redefined in the block. The third
equation says that a variable is live coming out of a block if and only if it is live coming into one of its
successors.
The relationship between the equations for liveness and the reaching-definitions equations should be
noticed:
• Both sets of equations have union as the meet operator. The reason is that in each data-flow
schema we propagate information along paths, and we care only about whether any path with the
desired properties exists, rather than whether something is true along all paths.
• However, information flow for liveness travels "backward," opposite to the direction of control flow,
because in this problem we want to make sure that the use of a variable x at a point p is transmitted to
all points prior to p in an execution path, so that we may know at the prior point that x will have its
value used.
Algorithm 9.14: Live-variable analysis.
INPUT: A flow graph with def and use computed for each block.
OUTPUT: IN[B] and OUT[B], the set of variables live on entry and exit of each block B of the flow
graph.
METHOD: As for reaching definitions, we iterate until convergence, but the analysis runs backwards:
IN[EXIT] = ∅;
for (each basic block B other than EXIT) IN[B] = ∅;
while (changes to any IN occur)
    for (each basic block B other than EXIT) {
        OUT[B] = ∪ IN[S], over the successors S of B;
        IN[B] = useB ∪ (OUT[B] - defB);
    }
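A minimal backward-iteration sketch of this algorithm (the block names and def/use sets are illustrative):

def live_variables(succs, use, defs):
    # succs maps each block to its list of successors; values are sets.
    IN = {b: set() for b in succs}
    OUT = {b: set() for b in succs}
    changed = True
    while changed:
        changed = False
        for b in succs:
            # OUT[B] is the union of IN over the successors of B.
            OUT[b] = set().union(*[IN[s] for s in succs[b]])
            new = use[b] | (OUT[b] - defs[b])
            if new != IN[b]:
                IN[b], changed = new, True
    return IN, OUT

# A loop body that reads and rewrites i and j, as in Example 9.13.
succs = {'B2': ['B2', 'EXIT'], 'EXIT': []}
use   = {'B2': {'i', 'j'}, 'EXIT': set()}
defs  = {'B2': {'i', 'j'}, 'EXIT': set()}
print(live_variables(succs, use, defs))
# IN['B2'] = {'i', 'j'}: both variables are live on entry to the block.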
6. Available Expressions
An expression x + y is available at a point p if every path from the entry node to p evaluates x + y, and
after the last such evaluation prior to reaching p, there are no subsequent assignments to x or y. (Note
that, as usual in this chapter, we use the operator + as a generic operator, not necessarily standing for
addition.) For the available-expressions data-flow schema we say that a block kills expression x + y if it
assigns (or may assign) x or y and does not subsequently recompute x + y. A block generates
expression x + y if it definitely evaluates x + y and does not subsequently define x or y.
Note that the notion of "killing" or "generating" an available expression is not exactly the same as that
for reaching definitions. Nevertheless, these notions of "kill" and "generate" behave essentially as they
do for reaching definitions.
The primary use of available-expression information is for detecting global common subexpressions.
For example, in Fig. 9.17(a), the expression 4 * i in block B3 will be a common subexpression if 4 * i is
available at the entry point of block B3. It will be available if i is not assigned a new value in block B2,
or if, as in Fig. 9.17(b), 4 * i is recomputed after i is assigned in B2.
We can compute the set of generated expressions for each point in a block, working from beginning to
end of the block. At the point prior to the block, no expressions are generated. If at point p set S of
expressions is available, and q is the point after p, with statement x = y+z between them, then we form
the set of expressions available at q by the following two steps:
1. Add to S the expression y + z.
2. Delete from S any expression involving variable x.
Note the steps must be done in the correct order, as x could be the same as y or z. After we reach the
end of the block, S is the set of generated expressions for the block. The set of killed expressions is all
expressions, say y + z, such that either y or z is defined in the block, and y + z is not generated by the
block.
Example 9.15: Consider the four statements of Fig. 9.18:
a = b + c
b = a - d
c = b + c
d = a - d
After the first, b + c is available. After the second statement, a - d becomes available, but b + c is no
longer available, because b has been redefined. The third statement does not make b + c available
again, because the value of c is immediately changed.
After the last statement, a - d is no longer available, because d has changed. Thus no expressions are
generated, and all expressions involving a, b, c, or d are killed.
We can find available expressions in a manner reminiscent of the way reaching definitions are
computed. Suppose U is the "universal" set of all expressions appearing on the right of one or more
statements of the program. For each block B, let IN[B] be the set of expressions in U that are available
at the point just before the beginning of B. Let OUT[B] be the same for the point following the end
of B. Define e_genB to be the expressions generated by B and e_killB to be the set of expressions
in U killed in B. Note that IN, OUT, e_gen, and e_kill can all be represented by bit vectors. The
following equations relate the unknowns:
OUT[ENTRY] = ∅
OUT[B] = e_genB ∪ (IN[B] - e_killB)
IN[B] = ∩ OUT[P], over the predecessors P of B, for B other than ENTRY
The above equations look almost identical to the equations for reaching definitions. Like reaching
definitions, the boundary condition is OUT[ENTRY] = ∅, because at the exit of the ENTRY node,
there are no available expressions.
The most important difference is that the meet operator is intersection rather than union. This operator is
the proper one because an expression is available at the beginning of a block only if it is available at the
end of all its predecessors. In contrast, a definition reaches the beginning of a block whenever it reaches
the end of any one or more of its predecessors.
The use of ∩ rather than ∪ makes the available-expression equations behave differently from those of
reaching definitions. While neither set has a unique solution, for reaching definitions, it is the solution
with the smallest sets that corresponds to the definition of "reaching," and we obtained that solution by
starting with the assumption that nothing reached anywhere, and building up to the solution. In that
way, we never assumed that a definition d could reach a point p unless an actual path
propagating d to p could be found. In contrast, for available-expression equations we want the solution
with the largest sets of available expressions, so we start with an approximation that is too large and
work down.
It may not be obvious that by starting with the assumption "everything (i.e., the set U) is available
everywhere except at the end of the entry block" and eliminating only those expressions for which we
can discover a path along which they are not available, we do reach a set of truly available expressions. In
the case of available expressions, it is conservative to produce a subset of the exact set of available
expressions. The argument for subsets being conservative is that our intended use of the information is
to replace the computation of an available expression by a previously computed value. Not knowing an
expression is available only inhibits us from improving the code, while believing an expression is
available when it is not could cause us to change what the program computes.
Example 9.16: We shall concentrate on a single block, B2 in Fig. 9.19, to illustrate the effect of
the initial approximation of OUT[B2] on IN[B2]. Let G and K abbreviate e_genB2 and e_killB2,
respectively. The data-flow equations for block B2 are:
Algorithm 9.17: Available expressions.
INPUT: A flow graph with e_killB and e_genB computed for each block B. The initial block is B1.
OUTPUT: IN[B] and OUT[B], the set of expressions available at the entry and exit of each block B
of the flow graph.
METHOD: The iteration mirrors Algorithm 9.11, except that OUT[B] is initialized to U rather than
to ∅, and the meet is intersection:
OUT[ENTRY] = ∅;
for (each basic block B other than ENTRY) OUT[B] = U;
while (changes to any OUT occur)
    for (each basic block B other than ENTRY) {
        IN[B] = ∩ OUT[P], over the predecessors P of B;
        OUT[B] = e_genB ∪ (IN[B] - e_killB);
    }
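A sketch of the algorithm in the same style as the earlier reaching-definitions code; the only changes are the initialization of OUT to U and intersection as the meet. The flow graph below models Fig. 9.17(a) and is otherwise illustrative:

def available_expressions(preds, e_gen, e_kill, U, entry='ENTRY'):
    OUT = {b: set(U) for b in preds}     # too-large initial approximation
    OUT[entry] = set()                   # boundary condition
    IN = {b: set() for b in preds}
    changed = True
    while changed:
        changed = False
        for b in preds:
            if b == entry:
                continue
            sets = [OUT[p] for p in preds[b]]
            # Intersection: available only if available on every path in.
            IN[b] = set.intersection(*sets) if sets else set()
            new = e_gen[b] | (IN[b] - e_kill[b])
            if new != OUT[b]:
                OUT[b], changed = new, True
    return IN, OUT

U = {'4*i'}
preds  = {'ENTRY': [], 'B1': ['ENTRY'], 'B2': ['B1'], 'B3': ['B1', 'B2']}
e_gen  = {'ENTRY': set(), 'B1': {'4*i'}, 'B2': set(), 'B3': set()}
e_kill = {'ENTRY': set(), 'B1': set(), 'B2': {'4*i'}, 'B3': set()}
IN, OUT = available_expressions(preds, e_gen, e_kill, U)
print(IN['B3'])   # set(): 4*i is killed in B2, so it is not available at B3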