0% found this document useful (0 votes)

10 views9 pages

18 Code Optimization 07-02-2025

The document discusses various compiler optimization techniques that improve code efficiency, including function inlining, outlining, loop transformations, dead code elimination, and register allocation. It also covers software performance optimization strategies, such as basic loop optimizations, cache-oriented loop optimizations, and techniques like data realignment, array padding, and loop tiling to enhance cache behavior. These methods aim to reduce execution time and memory usage in compiled programs.

Uploaded by

lanoxof509

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views9 pages

18 Code Optimization 07-02-2025

Uploaded by

lanoxof509

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Optimization Techniques

Compilation involve Translation and Optimization

I. Compiler Optimization Technique

Basic compilation techniques can generate inefficient code. Compilers use a wide
range of algorithms to optimize the code they generate.

Techniques available are :

1. Function inlining
2. Outlining
3. Loop transformations
- Loop unrolling
- Loop fusion and distribution
- Dead code elimination
- Register allocation
4. Compile time evaluation (Constant folding, Constant Propagation)

Function inlining replaces a subroutine call to a function with equivalent code to the function
body. By substituting the function call’s parameters into the body, the compiler can generate a
copy of the code that performs the same operations but without the subroutine overhead. C++
provides an inline qualifier that allows the compiler to substitute an inline version of the
function. In C, programmers can perform inlining manually or by using a preprocessor macro to
define the code body.
Although inlining eliminates function call overhead, it also increases program size. Inlining also
inhibits sharing of the function code in the cache, because the inlined copies are distinct pieces of
code, they cannot be represented by the same code in the cache. Outlining is sometimes useful to
improve the cache behavior of common functions.

Outlining is the opposite operation to inlining - a set of similar sections of code replaced with
calls to an equivalent function.

Loop Transformations:

Loops are important program structures - although they are compactly described in the source
code, they often use a large fraction of the computation time. Many techniques have been
designed to optimize loops.
A simple but useful transformation is known as loop unrolling. Loop unrolling is important
because it helps expose parallelism that can be used by later stages of the compiler.
Loop fusion
It combines two or more loops into a single loop. For this transformation to be legal, two
conditions must be satisfied. First, the loops must iterate over the same values. Second, the loop
bodies must not have dependencies that would be violated if they are executed together.
for example, if the second loop’s ith iteration depends on the results of the (i+1)th iteration of the
first loop, the two loops cannot be combined.

Loop distribution is the opposite of loop fusion, that is, decomposing a single loop into multiple
loops.

Dead Code Elimination

Dead code is code that can never be executed. Dead code can be generated by programmers,
either inadvertently or purposefully. Dead code can also be generated by compilers. Dead code
can be identified by reachability analysis, finding the other statements or instructions from
which it can be reached. If a given piece of code cannot be reached, or it can be reached only by
a piece of code that is unreachable from the main program, then it can be eliminated.

Register allocation is a very important compilation phase. Given a block of code,

we want to choose assignments of variables (both declared and temporary) to registers to
minimize the total number of required registers.
If a section of code requires more registers than are available, we must spill some
of the values out to memory temporarily. After computing some values, we write the values to
temporary memory locations, reuse those registers in other computations,

using graph coloring :

Example : Perform register allocation using life time graph

II. Software performance optimization
In this section we will look at several techniques for optimizing software
performance, including basic loop and cache-oriented loop optimizations as well as more generic
strategies.

Basic loop optimizations

3 techniques in optimizing loops:
- code motion
- induction variable elimination,
- strength reduction.

Code motion lets us move unnecessary code out of a loop. If a computation’s result
does not depend on operations performed in the loop body, then we can safely move it out of the
loop. Code motion opportunities can arise because programmers may find some computations
clearer and more concise when put in the loop body, even though they are not strictly dependent
on the loop iterations.
(Explain with example)

An induction variable is a variable whose value is derived from the loop iteration
variable’s value. The compiler often introduces induction variables to help it implement the loop.
Properly transformed, we may be able to eliminate some variables and apply strength reduction
to others.
A nested loop is a good example of the use of induction variables. Here is a simple
nested loop:

The compiler uses induction variables to help it address the arrays. Let us rewrite
the loop in C using induction variables and pointers.

Cache-oriented loop optimizations

A loop nest is a set of loops, one inside the other. Loop nests occur when we process arrays. A
large body of techniques has been developed for optimizing loop nests.

Data realignment and Array padding

It is required to optimize the cache behavior of the below code

Assume that a and b arrays are sized with M at 265 and N at 4 and a 256-line, four-way set
associative cache with four words per line. The starting location for a[ ] is 1024 and the starting
location for b[ ] is 4099.

Although a[0][0] and b[0][0] do not map to the same word in the cache, they do map to the same
block.
Once the a[0][1] access brings that line into the cache, it remains there for the a[0][2] and a[0][3]
accesses because the b[] accesses are now on the next line. However, the scenario repeats itself at
a[1][0] and every four iterations of the cache. One way to eliminate the cache conflicts is to
move one of the arrays. We do not have to move it far. If we move b’s start to 4100, we
eliminate the cache conflicts.
However, that fix will not work in more complex situations. Moving one array may only
introduce cache conflicts with another array. In such cases, we can use another technique called
padding. If we extend each of the rows of the arrays to have four elements rather than three, with
the padding word placed at the beginning of the row, we eliminate the cache conflicts.
In this case, b[0][0] is located at 4100 by the padding. Although padding wastes memory, it
substantially improves memory performance. In complex situations with multiple arrays and
sophisticated access patterns, we have to use a combination of techniques, relocating arrays and
padding them to be able to minimize cache conflicts.

Loop tiling breaks up a loop into a set of nested loops, with each inner loop performing the
operations on a subset of the data. Loop tiling changes the order in which array elements are
accessed, thereby allowing us to better control the behavior of the cache during loop execution.
The next example illustrates the use of loop tiling.

OpenMPCoursework2
100% (1)
OpenMPCoursework2
5 pages
OpenWells 5000.1.13.0 Release RlsNotes
75% (4)
OpenWells 5000.1.13.0 Release RlsNotes
283 pages
LLVM Essentials - Sample Chapter
No ratings yet
LLVM Essentials - Sample Chapter
16 pages
Chapter 10 - Code Optimization
No ratings yet
Chapter 10 - Code Optimization
11 pages
Unit 8 Code Optimization and Generation
No ratings yet
Unit 8 Code Optimization and Generation
10 pages
Unit 5
No ratings yet
Unit 5
12 pages
19 Code Optimization 17-02-2025
No ratings yet
19 Code Optimization 17-02-2025
32 pages
Code Efficiency: 1 Simplifying Expressions
No ratings yet
Code Efficiency: 1 Simplifying Expressions
3 pages
Lec02 2 Compiler Optimizations
No ratings yet
Lec02 2 Compiler Optimizations
32 pages
1.unit 5 - Compiler Design (PH)
No ratings yet
1.unit 5 - Compiler Design (PH)
30 pages
Code Optimization PDF
No ratings yet
Code Optimization PDF
25 pages
Optimal Code Compiling in C: Nitika Gupta Nistha Seth Prabhat Verma
No ratings yet
Optimal Code Compiling in C: Nitika Gupta Nistha Seth Prabhat Verma
8 pages
280425
No ratings yet
280425
11 pages
Unit - Iv Run Time Storage Organization
No ratings yet
Unit - Iv Run Time Storage Organization
15 pages
The Compilation Process: The Compilation Process Combines Both Translation and Optimisation of High Level Language Code
No ratings yet
The Compilation Process: The Compilation Process Combines Both Translation and Optimisation of High Level Language Code
20 pages
Of The Text Book: Code Optimization
No ratings yet
Of The Text Book: Code Optimization
19 pages
Presentation 1
No ratings yet
Presentation 1
18 pages
Clase de Progrea 555
No ratings yet
Clase de Progrea 555
35 pages
@@code Optim
No ratings yet
@@code Optim
20 pages
New Trends and Challenges in Source Code Optimization
No ratings yet
New Trends and Challenges in Source Code Optimization
6 pages
Unit 4
No ratings yet
Unit 4
19 pages
REDO - 2 CD - PDF 3
No ratings yet
REDO - 2 CD - PDF 3
1 page
Unit 4
No ratings yet
Unit 4
16 pages
Compilation Techniques
No ratings yet
Compilation Techniques
15 pages
Chapter 8 - Code Optimization Part 2
No ratings yet
Chapter 8 - Code Optimization Part 2
3 pages
Vision 2024 CD Chapter 5 Compiler Code Optimization 731689660928542
No ratings yet
Vision 2024 CD Chapter 5 Compiler Code Optimization 731689660928542
24 pages
CD Unit 5
No ratings yet
CD Unit 5
26 pages
CD Unit-Iv
No ratings yet
CD Unit-Iv
15 pages
Op Tim Ization
No ratings yet
Op Tim Ization
22 pages
CD Module 5 Answers
No ratings yet
CD Module 5 Answers
44 pages
Code Optimization
No ratings yet
Code Optimization
58 pages
Untitled 3
No ratings yet
Untitled 3
12 pages
REDO - 2 CD - PDF 2
No ratings yet
REDO - 2 CD - PDF 2
2 pages
C Optimization Techniques
No ratings yet
C Optimization Techniques
79 pages
A Practical Approach To Optimize Code Implementation
No ratings yet
A Practical Approach To Optimize Code Implementation
11 pages
18 Unit-6
No ratings yet
18 Unit-6
21 pages
CD Unit-5
No ratings yet
CD Unit-5
45 pages
Optimization
No ratings yet
Optimization
67 pages
PCC Unit 5
No ratings yet
PCC Unit 5
15 pages
UNIT 5 Notes CD
No ratings yet
UNIT 5 Notes CD
6 pages
Unit-5 Toc
No ratings yet
Unit-5 Toc
41 pages
Code Optimization
No ratings yet
Code Optimization
36 pages
Unit V
No ratings yet
Unit V
11 pages
Unit 5 Cd.
No ratings yet
Unit 5 Cd.
27 pages
Unit V - Code Optimization and Code Generation: Course Material
No ratings yet
Unit V - Code Optimization and Code Generation: Course Material
41 pages
Compiler Design Unit 5
No ratings yet
Compiler Design Unit 5
39 pages
Unit-5 F&CD
No ratings yet
Unit-5 F&CD
27 pages
Cd-Unit 5 Part-2
No ratings yet
Cd-Unit 5 Part-2
23 pages
Code Optimization - Compiler Design
No ratings yet
Code Optimization - Compiler Design
33 pages
Compiler Construction: A Compulsory Module For Students in
No ratings yet
Compiler Construction: A Compulsory Module For Students in
34 pages
Optimization Techniques Code Optimizations
No ratings yet
Optimization Techniques Code Optimizations
10 pages
Code Optimization
No ratings yet
Code Optimization
7 pages
Cdunit 5
No ratings yet
Cdunit 5
41 pages
33 Code Optimization
No ratings yet
33 Code Optimization
16 pages
Unit 4
No ratings yet
Unit 4
15 pages
Unit V QB
No ratings yet
Unit V QB
15 pages
Optimization PDF
No ratings yet
Optimization PDF
40 pages
Ar20 Aus CD Unit V
No ratings yet
Ar20 Aus CD Unit V
17 pages
Unit5 0CodeOptimization
No ratings yet
Unit5 0CodeOptimization
90 pages
CD Unit 5
No ratings yet
CD Unit 5
41 pages
HCI-case Study
No ratings yet
HCI-case Study
7 pages
Poly Scientist
No ratings yet
Poly Scientist
14 pages
The Deep Learning Compiler: A Comprehensive Survey
No ratings yet
The Deep Learning Compiler: A Comprehensive Survey
20 pages
SAP BW - Performance Optimization
No ratings yet
SAP BW - Performance Optimization
2 pages
Physical Design
No ratings yet
Physical Design
96 pages
GPS - Prod: Production Control For IGU, Laminated and Toughened Glass
No ratings yet
GPS - Prod: Production Control For IGU, Laminated and Toughened Glass
2 pages
Database SOP
No ratings yet
Database SOP
5 pages
Talent Acq
No ratings yet
Talent Acq
29 pages
CU5092-Real Time Embedded Systems
No ratings yet
CU5092-Real Time Embedded Systems
9 pages
Tooth Interior Fatigue Fracture Robustness of Gear
No ratings yet
Tooth Interior Fatigue Fracture Robustness of Gear
61 pages
Aexio Success Story-DiGi
No ratings yet
Aexio Success Story-DiGi
2 pages
CS3501 Set4
No ratings yet
CS3501 Set4
2 pages
Data Parallel Patterns
No ratings yet
Data Parallel Patterns
9 pages
Using Your C Compiler To Exploit NEON™ Advanced SIMD: Op Op Op Op
No ratings yet
Using Your C Compiler To Exploit NEON™ Advanced SIMD: Op Op Op Op
13 pages
Plume Color Stack
No ratings yet
Plume Color Stack
10 pages
Ace Your Job Interview: Salesforce Sales Cloud
No ratings yet
Ace Your Job Interview: Salesforce Sales Cloud
56 pages
Cpu Tutorial 2
No ratings yet
Cpu Tutorial 2
53 pages
CD Mid Ii Imp Questions
No ratings yet
CD Mid Ii Imp Questions
2 pages
BrandMaker Focus Paper: Marketing Process Optimization
No ratings yet
BrandMaker Focus Paper: Marketing Process Optimization
8 pages
Code Optimization in Compiler Design (18100BTCSAII02853)
No ratings yet
Code Optimization in Compiler Design (18100BTCSAII02853)
13 pages
Peephole Optimization
No ratings yet
Peephole Optimization
4 pages
REN - Basics of The Renesas Synergy Platform 2020 4 CH8 - GDE - 20200507
No ratings yet
REN - Basics of The Renesas Synergy Platform 2020 4 CH8 - GDE - 20200507
19 pages
Mentor Product Description (Including Insight)
No ratings yet
Mentor Product Description (Including Insight)
39 pages
Common Path Pessimism Removal An Industry Perspective
No ratings yet
Common Path Pessimism Removal An Industry Perspective
4 pages
En USTER SENTINEL Flyer Tablet PC Version 2015 11
No ratings yet
En USTER SENTINEL Flyer Tablet PC Version 2015 11
4 pages
Talus Design Datasheet
No ratings yet
Talus Design Datasheet
4 pages
5 and 6
No ratings yet
5 and 6
5 pages

18 Code Optimization 07-02-2025

Uploaded by

18 Code Optimization 07-02-2025

Uploaded by

Optimization Techniques

Compilation involve Translation and Optimization

I. Compiler Optimization Technique

Techniques available are :

Dead Code Elimination

Register allocation is a very important compilation phase. Given a block of code,

using graph coloring :

Example : Perform register allocation using life time graph

Basic loop optimizations

Cache-oriented loop optimizations

Data realignment and Array padding

You might also like