
A Brief Introduction

to LLVM
Nick Sumner
[email protected]
What is LLVM?
● A compiler? (clang)
● A set of formats, libraries, and tools.
– A simple, typed IR (bitcode)
– Program analysis / optimization libraries
– Machine code generation libraries
– Tools that compose the libraries to perform tasks
● Easy to add / remove / change functionality
How will you be using it?
● Compiling programs to bitcode:
clang -g -c -emit-llvm <sourcefile> -o <bitcode>.bc
● Analyzing the bitcode:
opt -load <plugin>.so --<plugin> -analyze <bitcode>.bc
● Writing your own tools:
./callcounter -static test.bc
● Reporting properties of the program:
Function Counts
===============
b : 2
a : 1
printf : 3
What is LLVM Bitcode?
● A (relatively) simple intermediate representation (IR)
– It captures the program dependence graph

Code:

    #include <stdio.h>

    void
    foo(unsigned e) {
      for (unsigned i = 0; i < e; ++i) {
        printf("Hello\n");
      }
    }

    int
    main(int argc, char **argv) {
      foo(argc);
      return 0;
    }

Compiled with: clang -c -S -emit-llvm -O1 -g0

IR:

    @str = private constant [6 x i8] c"Hello\00"

    define void @foo(i32) {
      %2 = icmp eq i32 %0, 0
      br i1 %2, label %3, label %4

    ; <label>:3:                                      ; preds = %4, %1
      ret void

    ; <label>:4:                                      ; preds = %1, %4
      %5 = phi i32 [ %7, %4 ], [ 0, %1 ]
      %6 = tail call i32 @puts(i8* getelementptr
                ([6 x i8], [6 x i8]* @str, i64 0, i64 0))
      %7 = add nuw i32 %5, 1
      %8 = icmp eq i32 %7, %0
      br i1 %8, label %3, label %4
    }

    define i32 @main(i32, i8** nocapture readnone) {
      tail call void @foo(i32 %0)
      ret i32 0
    }

The IR is organized into:
● Functions
● Basic Blocks – introduced by labels & predecessors, terminated by branches & successors
● Instructions
Inspecting Bitcode
● LLVM libraries help examine the bitcode
– Easy to examine and/or manipulate
– Many helpers (e.g. CallBase, outs(), dyn_cast)

    Module& module = ...;
    for (Function& fun : module) {
      for (BasicBlock& bb : fun) {
        for (Instruction& i : bb) {
          CallBase* cb = dyn_cast<CallBase>(&i);
          if (!cb) {
            continue;
          }
          outs() << "Found a function call: " << i << "\n";
          Value* called = cb->getCalledOperand()->stripPointerCasts();
          if (Function* f = dyn_cast<Function>(called)) {
            outs() << "Direct call to function: " << f->getName() << "\n";
          ...

● The nested loops iterate over the Functions in a Module, the BasicBlocks in a Function, and the Instructions in a BasicBlock.
● dyn_cast() efficiently checks the runtime types of LLVM IR components.
● CallBase provides a common interface for different types of function calls.
● outs() and other printing functions make inspecting components easy.
● Working within the API allows you to ask questions about code.
Static Single Assignment (SSA)
● Program dependence graphs help answer questions like:
– Where was a variable defined?
– Where is a particular value used?
● Compilers today help provide this using SSA form
– Each variable has a single definition, so resolving dependencies is easier

    void foo() {
      unsigned i = 0;
      while (i < 10) {
        i = i + 1;   // What is the single definition of i at this point?
      }
    }

● Phi instructions select which incoming value to use among options
– Phi nodes must occur at the beginning of a basic block

    define void @foo() {
      br label %1

    ; <label>:1                                       ; preds = %1, %0
      %i.phi = phi i32 [ 0, %0 ], [ %2, %1 ]
      %2 = add i32 %i.phi, 1
      %exitcond = icmp eq i32 %2, 10
      br i1 %exitcond, label %3, label %1

    ; <label>:3                                       ; preds = %1
      ret void
    }
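The phi's semantics can be mimicked in ordinary C++: the value selected depends on which predecessor block control arrived from. The following is an illustrative model of the IR above, not LLVM API:

```cpp
#include <cassert>

// Models %i.phi = phi i32 [ 0, %0 ], [ %2, %1 ]: the phi yields 0 when
// entered from the entry block %0, and the incremented value %2 when
// re-entered from the loop block %1.
int fooTripCount() {
  int trips = 0;
  bool fromEntry = true; // which predecessor did control arrive from?
  int next = 0;          // models %2
  for (;;) {
    int iPhi = fromEntry ? 0 : next; // the phi's selection
    fromEntry = false;
    next = iPhi + 1;                 // %2 = add i32 %i.phi, 1
    ++trips;
    if (next == 10) {                // %exitcond = icmp eq i32 %2, 10
      break;                        // br to %3, which returns
    }
  }
  return trips;
}
```

As in the IR, there is no mutable variable i: each iteration's value is a fresh definition, and the phi merges the two incoming definitions.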
Dependencies in General
● You can loop over the values an instruction uses
– Given %a = %b + %c, the operands of %a are [%b, %c]

    for (Use& u : inst->operands()) {
      // inst uses the Value* u
    }

● You can loop over the instructions that use a particular value

    Instruction* inst = ...;
    for (User* user : inst->users()) {
      if (auto* i = dyn_cast<Instruction>(user)) {
        // inst is used by Instruction i
      }
    }
Dealing with Types
● LLVM IR is strongly typed
– Every value has a type → getType()
● A value must be explicitly cast to a new type
define i64 @trunc(i16 zeroext %a) {
%1 = zext i16 %a to i64
ret i64 %1
}
● Also types for pointers, arrays, structs, etc.
– Strong typing means they take a bit more work
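The zext in the example corresponds to C++'s value-preserving unsigned widening; the IR just makes explicit a conversion that C performs implicitly. A plain C++ mirror (illustrative, not LLVM API):

```cpp
#include <cassert>
#include <cstdint>

// Mirrors: %1 = zext i16 %a to i64 — zero-extension preserves the
// unsigned value by filling the high 48 bits with zeros.
uint64_t zext16to64(uint16_t a) {
  return static_cast<uint64_t>(a); // explicit in IR, implicit in C
}
```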
Dealing with Types: GEP
● We sometimes need to extract elements/fields from arrays/structs
– Pointer arithmetic
– Done using GetElementPtr (GEP)

    struct rec {
      int x;
      int y;
    };

    struct rec *buf;

    void foo() {
      buf[5].y = 7;
    }

    %struct.rec = type { i32, i32 }
    @buf = global %struct.rec* null

    define void @foo() {
      %1 = load %struct.rec*, %struct.rec** @buf
      %2 = getelementptr %struct.rec, %struct.rec* %1, i64 5, i32 1
      store i32 7, i32* %2
      ret void
    }
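The GEP itself does no memory access; it only computes an address, which the following store then writes through. Assuming the usual 4-byte int layout, the same offset arithmetic can be sketched in plain C++ (an illustrative mirror, not LLVM API):

```cpp
#include <cassert>
#include <cstddef>

// Mirrors what the getelementptr above computes: the address of
// buf[5].y is base + 5 * sizeof(struct rec) + offset of field y.
// The i64 5 index scales by the struct size; i32 1 selects field y.
struct rec {
  int x;
  int y;
};

std::size_t gepOffset(std::size_t index, std::size_t fieldOffset) {
  return index * sizeof(rec) + fieldOffset;
}
```

With 4-byte ints, buf[5].y lives 5 * 8 + 4 = 44 bytes past buf.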
Where can you get more information?
● The online documentation is extensive:
– LLVM Programmer’s Manual
– LLVM Language Reference Manual
● The header files!
– All in llvm-12.x.src/include/llvm/

BasicBlock.h
DerivedTypes.h
Function.h
Instructions.h
InstrTypes.h
IRBuilder.h
Support/InstVisitor.h
Type.h
Creating a
Static Analysis
Making a new analysis
● Analyses are organized into individual passes
– ModulePass
– FunctionPass
– LoopPass
– …
● Derive from the appropriate base class to make a Pass
3 Steps
1) Declare your pass
2) Register your pass
3) Define your pass
Let's count the number of static direct calls to each function.
Making a ModulePass (1)
● Declare your ModulePass

    struct StaticCallCounter : public llvm::ModulePass {
      static char ID;
      DenseMap<Function*, uint64_t> counts;

      StaticCallCounter()
        : ModulePass(ID)
      { }

      bool runOnModule(Module& m) override;
      void print(raw_ostream& out, const Module* m) const override;
      void handleInstruction(CallBase& cb);
    };
Making a ModulePass (2)
● Register your ModulePass
– This allows it to even be dynamically loaded as a plugin
– Depending on your use cases, it may not be necessary

    char StaticCallCounter::ID = 0;

    RegisterPass<StaticCallCounter> SCCReg("callcounter",
        "Print the static count of direct calls");
Making a ModulePass (3)
● Define your ModulePass
– Need to override runOnModule() and print()

    bool
    StaticCallCounter::runOnModule(Module& m) {
      for (auto& f : m)
        for (auto& bb : f)
          for (auto& i : bb)
            if (CallBase* cb = dyn_cast<CallBase>(&i)) {
              handleInstruction(*cb);
            }
      return false; // False because we didn't change the Module
    }
Making a ModulePass (3)
● Analysis continued...

    void
    StaticCallCounter::handleInstruction(CallBase& cb) {
      // Check whether the called function is directly invoked
      auto called = cb.getCalledOperand()->stripPointerCasts();
      auto fun = dyn_cast<Function>(called);
      if (!fun) { return; }

      // Update the count for the particular call
      auto count = counts.find(fun);
      if (counts.end() == count) {
        count = counts.insert(std::make_pair(fun, 0)).first;
      }
      ++count->second;
    }
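The find/insert/increment idiom above works the same way with any map type. A standalone sketch using std::map in place of llvm::DenseMap, keyed by name instead of Function* (the callee names are made up for illustration):

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Counts occurrences with the same find/insert/increment pattern used
// in StaticCallCounter::handleInstruction.
std::map<std::string, uint64_t>
countCallees(const std::vector<std::string>& callees) {
  std::map<std::string, uint64_t> counts;
  for (const auto& fun : callees) {
    auto count = counts.find(fun);
    if (counts.end() == count) {
      count = counts.insert(std::make_pair(fun, 0)).first;
    }
    ++count->second;
  }
  return counts;
}
```

DenseMap offers the same interface here, so the pass code reads almost identically.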
Making a ModulePass (3)
● Printing out the results

    void
    StaticCallCounter::print(raw_ostream& out, const Module* m) const {
      out << "Function Counts\n"
          << "===============\n";
      for (auto& kvPair : counts) {
        auto* function = kvPair.first;
        uint64_t count = kvPair.second;
        out << function->getName() << " : " << count << "\n";
      }
    }
Creating a
Dynamic Analysis
Making a Dynamic Analysis
● We have counted the static direct calls to each function.
● How might we count all dynamic calls to each function?
● Need to modify the original program!
● Steps:
1) Modify the program using passes
2) Compile the modified version
3) Run the new program
Modifying the Original Program
● Goal: Count the dynamic calls to each function in an execution.
– So how do we want to modify the program?

    void foo() {
      bar();
    }

● Keep a counter for each function!
● 2 Choices:
1) increment the count for each function as it starts
2) increment the count for each function at its call site
Does that even matter? Are there trade-offs?

● We'll increment at the function entry. (the demo code shows both options)

    void foo() {              void
      countCall(1);           countCall(id) {
      bar();                    count[id]++;
    }                         }

Function IDs: 0 ↦ main, 1 ↦ foo, 2 ↦ bar, …

– Using numeric IDs for functions is sometimes easier
– Inserting function calls is easier than adding raw instructions
● Add new definitions to the original code
● Link against an instrumentation library
Modifying the Original Program
● What might adding this call look like?

    void
    DynamicCallCounter::handleInstruction(CallBase& cb, Value* counter) {
      // Check whether the called function is directly invoked
      auto calledValue = cb.getCalledOperand()->stripPointerCasts();
      auto calledFunction = dyn_cast<Function>(calledValue);
      if (!calledFunction) {
        return;
      }

      // Insert a call to the counting function.
      IRBuilder<> builder(&cb);
      builder.CreateCall(counter, builder.getInt64(ids[calledFunction]));
    }

In practice, it is more complex.
You can find details in the demo code.
Using a Runtime Library
● Recall that the definition of countCall() needs to live somewhere
1) Add directly to the modified code
2) Implement separately & link in via a library
What trade-offs do you see?
● In practice, linking against a library is common, easy, & powerful
– Regardless of the language being analyzed

    void
    countCall(uint64_t id) {
      ++functionInfo[id];
    }
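Putting the pieces together, here is a self-contained sketch of how the instrumented program plus runtime library behave after the pass runs. The function names and ID mapping mirror the earlier slides and are purely illustrative:

```cpp
#include <cassert>
#include <cstdint>

// The "runtime library": one counter slot per instrumented function.
// IDs follow the earlier mapping: 0 -> main, 1 -> foo, 2 -> bar.
uint64_t functionInfo[3] = {0, 0, 0};

void countCall(uint64_t id) { ++functionInfo[id]; }

// The "instrumented program": the pass has inserted a countCall()
// at each function entry.
void bar() { countCall(2); }

void foo() {
  countCall(1);
  bar();
}

int runProgram() { // stands in for the instrumented main()
  countCall(0);
  foo();
  foo();
  return 0;
}
```

Running runProgram() leaves one count for main, two for foo, and two for bar, exactly the dynamic call counts the analysis reports.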
Revisiting the Big Picture of Dynamic Analysis

Program/Module → Analysis Tool (Instrumentation Pass) → Compilation (with the Runtime Library) → Modified Program
Input → Modified Program → Results!

Step 1: Insert useful calls to a runtime library.
Step 2: Compile & link against the runtime library.
Step 3: Run the new program to produce your results.
Summary
● LLVM organizes groups of passes and tools into projects
● Easiest way to start is by using the demo on the course page
● For the most part, you can follow the directions online
& in the project description