0% found this document useful (0 votes)

32 views49 pages

Lecture 02 2022

https://web.stanford.edu/class/cs110l/slides/lecture-02-2022.pdf

Uploaded by

shiziwen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views49 pages

Lecture 02 2022

https://web.stanford.edu/class/cs110l/slides/lecture-02-2022.pdf

Uploaded by

shiziwen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

Program Analysis

Thea Rossman
Jan 6, 2022
Logistics

● Please make sure you’re on Slack (Canvas -> Slack tab)

● Please fill out intro survey if you haven’t yet
● Week 1 exercises coming out today; due next Monday. (Write your answers
directly into Gradescope!)
● Follow-up on course content:
○ Range of CS backgrounds in this class
○ Other options: material Ryan has posted on his website; Rust website;
CS242 (programming languages); CS295 (program analysis); etc.
Content

● Today (+ exercises): what are some tools that we can use to find mistakes in
C/C++ code? What are their limitations?
● Next week: How do other languages address some shortcomings of C?
How can we find bugs in a program?
Dynamic Analysis
Dynamic analysis: high-level

● Run the program, watch what it does, and look for problematic behavior
● Can find problems, but only if the program exhibits problematic behavior on
the inputs you use to test. (Separately, some tools only check for certain
types of issues.)
● Commonly combined with techniques to run the program with lots of different
test inputs (e.g. fuzzing), yet this still can’t give us any assurances that code
is bug-free
● Dynamic analysis is great! Test your code! *and* understand the limitations!
Dynamic analysis tool: Valgrind

● Instruments binaries on the fly

int main() {
char *buf = (char*)malloc(8);
buf[16] = 'a';
}
(compiler)
mov edi, 8
call valgrind_malloc
mov edi, 8
mov QWORD PTR [rbp-8], rax
call malloc
record memory write ^
mov QWORD PTR [rbp-8], rax (valgrind)
mov rax, QWORD PTR [rbp-8]
mov rax, QWORD PTR [rbp-8]
record memory read ^
add rax, 16
add rax, 16
mov BYTE PTR [rax], 97 Invalid write of size 4
mov BYTE PTR [rax], 97 (writing to the heap, but it’s not
record memory write ^ inside any heap allocation that was
previously made)
Valgrind (summary)

● Takes in compiled binary (executable)

● Disassembles binary into intermediate, assembly/assembly-like
representation
● Turns this ^ back into machine code (re-compiles); one small block at a time
during execution
● …while “instrumenting”: modifying instructions or inserting “analysis” code
○ For example, Valgrind’s `memcheck` stores some “shadow” metadata
about heap memory accessed by your code — for example, bits indicating
“this has been freed” or “this has been initialized”. Using these, it can
detect issues like double-freed heap blocks or uninitialized values.
Valgrind

● Works with any binary compiled by any compiler (even if you don’t have
source code available!)
● Downside: not a lot of information is available in binaries…

For more on how valgrind works: https://valgrind.org/docs/valgrind2007.pdf

From last lecture: anatomy of a stack frame
High addresses
● Stack grows DOWN (from higher to lower … previous stuff …
addresses) as functions are called (new stack
frames created) or require local variables.
Function parameters `rbp` stores
● Stack is just a chunk of memory — no
pointer to
information in binary about how it’s split into base of the
Return address
variables. curr. stack
● Remember: Writing to local stack buffers goes Saved base pointer frame;
points here
UP (from lower to higher addresses)

Local variables stack pointer

“write to `buf`” becomes “write
mov edi, 8 to the memory location `8` bytes tracks “top” of
call malloc below the base of the stack” stack (lowest
mov QWORD PTR [rbp-8], rax allocated
… address)
Low addresses
Valgrind

● Works with any binary compiled by any compiler (even if you don’t have
source code available!)
● Downside: not a lot of information is available in binaries
○ E.g. the stack is just a chunk of memory. You might be able to observe
that the stack pointer grows up/down, but no information about how it’s
divided into variables.
■ => cannot detect stack-based buffer overflows!

For more on how valgrind works: https://valgrind.org/docs/valgrind2007.pdf

LLVM Sanitizers

● Same idea, but instrument source code

● Implemented as part of the LLVM compiler suite (e.g. clang)
● Because more information is available pre-compilation, there is a lot more
analysis that sanitizers can do (and they’re also easier to implement)

int main() {
char buf[8];
Record stack buffer “buf” with size 8
buf[16] = 'a';
Record write to “buf” with offset 16
}
LLVM Sanitizers

● AddressSanitizer
○ Finds use of improper memory addresses: out of bounds memory accesses, double
free, use after free
● LeakSanitizer
○ Finds memory leaks
● MemorySanitizer
○ Finds use of uninitialized memory
● UndefinedBehaviorSanitizer
○ Finds usage of null pointers, integer/float overflow, etc
● ThreadSanitizer
○ Finds improper usage of threads (second half of CS 110)
● More…
Cool! Let’s sanitize all the code!! 🏎🔥💯

(screw)
Fundamental limitation of dynamic analysis

● Dynamic analysis can only report bad behavior that actually happened
● If your program worked fine with the input you provided, but it might do bad
things in certain edge cases, dynamic analysis cannot tell you anything about
that
#include <stdio.h>
#include <string.h>
int main() {
char s[100];
int i;
printf("\nEnter a string : ");
gets(s);
for (i = 0; s[i]!='\0'; i++) {
if(s[i] >= 'a' && s[i] <= 'z') {
s[i] = s[i] -32;
}
}
printf("\nString in Upper Case = %s", s);
return 0;
}
How can we find weird edge cases?
Fuzzing

Input seed
Fuzzing

Input seed

Control flow graph

Fuzzing

run the program again

and observe behavior

(semi-random)
mutation

These inputs
made the
Input seed program do
new things!

Control flow graph

Fuzzing

run the
run the program again program again
and observe behavior more mutation

(semi-random)
mutation
More new
behavior!

Input seed

Control flow graph

Continue this process forever…

Fuzzing

● Very simple but extremely effective

● Most common fuzzers: AFL and libfuzzer
● Still, cannot provide any guarantees that a program is bug-free (if the fuzzer
didn’t find anything in 24 hours, maybe we just didn’t run it long enough)
● Google OSS-Fuzz is a large cluster that fuzzes open-source software 24/7
Static Analysis
You Be the Static Analyzer: Round 1

You want to write a tool to help people writing code like this. What do you do?
#include <stdio.h>
#include <string.h>
int main() {
char s[100];
int i;
printf("\nEnter a string : ");
gets(s);
for (i = 0; s[i]!='\0'; i++) {
if(s[i] >= 'a' && s[i] <= 'z') {
s[i] = s[i] -32;
}
}
printf("\nString in Upper Case = %s", s);
return 0;
}
Basic static analysis (“linting”)

Stephen C. Johnson, a computer scientist at Bell Labs, came up with lint in 1978… The term
"lint" was derived from the name of the tiny bits of fiber and fluff shed by clothing, as the
command should act like a dryer machine lint trap, detecting small errors with big effects.
https://en.wikipedia.org/wiki/Lint_(software)

● Linters employ very simple techniques (e.g. ctrl+f) to find obvious mistakes
● The person running the linter can configure a set of rules to enforce
○ Rules are intended to improve the style of the codebase
○ Just because there is a linter error doesn’t mean the code is broken (e.g. it’s possible
to call strcpy() without introducing bugs, but many linters will complain if you call it)
● Common C/C++ linter: clang-tidy
○ Can even auto-fix many of the issues!
You Be the Static Analyzer: Round 2

You want to write a tool to help people writing code like this. What do you do?
void printToUpper(const char *str) { int main(int argc, char *argv[]) {
char *upper = strdup(str); printf("Enter a string to make uppercase,
for (int i = 0; str[i] != '\0'; i++) { or type \"quit\" to quit:\n");
if(str[i] >= 'a' && str[i] <= 'z') { char input[512];
upper[i] = str[i] - ('a' - 'A'); // safely read input string
} fgets(input, sizeof(input), stdin);
} char *toMakeUppercase;
printf("%s\n", upper); if (strcmp(input, "quit") != 0) {
free(upper); toMakeUppercase = input;
} }
printToUpper(toMakeUppercase);
}
Dataflow analysis

We can trace through how the program might execute, keeping track of possible variable values

int main(int argc, char *argv[]) {

printf("Enter a string to make uppercase,
or type \"quit\" to quit:\n");
char input[512];
// safely read input string
fgets(input, sizeof(input), stdin);
char *toMakeUppercase; toMakeUppercase = {uninitialized}
if (strcmp(input, "quit") != 0) {
toMakeUppercase = input;
}
printToUpper(toMakeUppercase);
}

clang-tidy will do dataflow analysis too!

Dataflow analysis

We can trace through how the program might execute, keeping track of possible variable values

int main(int argc, char *argv[]) {

printf("Enter a string to make uppercase,
or type \"quit\" to quit:\n");
char input[512];
// safely read input string
fgets(input, sizeof(input), stdin);
char *toMakeUppercase;
if (strcmp(input, "quit") != 0) {
toMakeUppercase = input;
}
printToUpper(toMakeUppercase);
toMakeUppercase = {uninitialized}
}
Dataflow analysis

We can trace through how the program might execute, keeping track of possible variable values

int main(int argc, char *argv[]) {

printf("Enter a string to make uppercase,
or type \"quit\" to quit:\n");
char input[512];
// safely read input string
fgets(input, sizeof(input), stdin);
char *toMakeUppercase;
if (strcmp(input, "quit") != 0) {
toMakeUppercase = input;
}
printToUpper(toMakeUppercase); toMakeUppercase = {uninitialized, input}
}
printToUpper called with a possibly uninitialized argument!
Dataflow analysis: very powerful!

You want to write a tool to help people writing code like this. What do you do?
int main(int argc, char *argv[]) { // Find the close bracket
// Goal: parse out a string between brackets char *close_bracket = strchr(parsed, ']');
// (e.g. " [target string]" -> "target string") if (close_bracket == NULL) {
printf("Malformed input!\n");
char *parsed = strdup(argv[1]); return 1;
}
// Find open bracket
char *open_bracket = strchr(parsed, '['); // Replace the close bracket with a null
if (open_bracket == NULL) { // terminator to end the parsed string there
printf("Malformed input!\n");
Common mistake: early *close_bracket = '\0';
return 1; return fails to clean up
} resources printf("Parsed string: %s\n", parsed);
free(parsed);
// Make the output string start after the open bracket return 0;
parsed = open_bracket + 1; }
Dataflow analysis: very powerful!

Liveness analysis: observe when variables go away, and make sure they’re cleaned up appropriately

int main(int argc, char *argv[]) { // Find the close bracket

// Goal: parse out a string between brackets char *close_bracket = strchr(parsed, ']');
// (e.g. " [target string]" -> "target string") if (close_bracket == NULL) {
printf("Malformed input!\n");
char *parsed = strdup(argv[1]); return 1;
parsed = {heap allocation} }
// Find open bracket
char *open_bracket = strchr(parsed, '['); // Replace the close bracket with a null
if (open_bracket == NULL) { // terminator to end the parsed string there
printf("Malformed input!\n"); *close_bracket = '\0';
return 1;
} printf("Parsed string: %s\n", parsed);
free(parsed);
// Make the output string start after the open bracket return 0;
parsed = open_bracket + 1; }
Dataflow analysis: very powerful!

Liveness analysis: observe when variables go away, and make sure they’re cleaned up appropriately

int main(int argc, char *argv[]) { // Find the close bracket

// Goal: parse out a string between brackets char *close_bracket = strchr(parsed, ']');
// (e.g. " [target string]" -> "target string") if (close_bracket == NULL) {
printf("Malformed input!\n");
char *parsed = strdup(argv[1]); return 1;
}
// Find open bracket
char *open_bracket = strchr(parsed, '['); // Replace the close bracket with a null
if (open_bracket == NULL) { // terminator to end the parsed string there
printf("Malformed input!\n"); *close_bracket = '\0';
return 1; parsed = {heap allocation}
} parsed is no longer live, but is still printf("Parsed string: %s\n", parsed);
a heap allocation! free(parsed);
// Make the output string start after the open bracket return 0;
parsed = open_bracket + 1; }
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

void freeSometimes(void *buf) {
if (rand() == 1) {
return;
}
free(buf);
}

int main() {
void *buf = malloc(8);
freeSometimes(buf); buf = {heap allocation}
return 0;
}
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

void freeSometimes(void *buf) {
if (rand() == 1) { buf = {heap allocation}
return;
}
free(buf);
}

int main() {
void *buf = malloc(8);
freeSometimes(buf);
return 0;
}
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

void freeSometimes(void *buf) {
if (rand() == 1) {
return; buf = {heap allocation}
}
free(buf); buf = {heap allocation}
}

int main() {
void *buf = malloc(8);
freeSometimes(buf);
return 0;
}
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

void freeSometimes(void *buf) {
if (rand() == 1) {
return;
}
buf = {heap allocation}
free(buf);
} buf = {freed allocation}

int main() {
void *buf = malloc(8);
freeSometimes(buf);
return 0;
}
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

void freeSometimes(void *buf) {
if (rand() == 1) {
return;
}
free(buf);
}

int main() {
void *buf = malloc(8);
freeSometimes(buf);
return 0; buf = {heap allocation, freed allocation}
}
Dataflow analysis: works across functions

Tracking calls to functions is no different from tracing paths through if statements

broken.c:13:5: warning: Potential leak of memory pointed to by 'buf' [clang-analyzer-
unix.Malloc]
return 0;
^
broken.c:11:17: note: Memory is allocated
void *buf = malloc(8);
^
broken.c:13:5: note: Potential leak of memory pointed to by 'buf'
return 0;
^
Limitations

● False positives
○ Dataflow analysis will follow each branch, even if it’s impossible for some
condition to be true in real life
○ False positives are the Achille’s heel of static analysis. Need a good
signal/noise ratio or else no one will use your analyzer
● Need to limit scope to get reasonable performance
○ Many static analyzers only analyze a single file at a time: they don’t do
dataflow analysis into/out of functions elsewhere in the codebase
○ If you have a huge codebase, loops, tons of conditions, etc., dataflow
analysis can get unwieldy.
Take CS 243 for more info!
static analysis to the moon 🚀 🚀🌙

Cool! Let’s tidy all the code!! 🏎🔥💯

(screw)
Low-hanging fruit #1

int main(int argc, char *argv[]) {

char *message = strchr(argv[1], 'a');
printf("%s\n", message);
}
Low-hanging fruit #1

🍓 clang-tidy easy.c
🍓 cppcheck easy.c no output here means no issues found
Checking easy.c ...
🍓 scan-build clang-11 -Wall easy.c
scan-build: Using '/usr/local/Cellar/llvm/11.0.0_1/bin/clang-11' for static analysis
scan-build: Analysis run complete.
scan-build: Removing directory '/var/folders/6_/jdc6ljyd5n795x1xl8drptm80000gn/T/scan-
build-2021-04-01-002241-43549-1' because it contains no reports.
scan-build: No bugs found.
How do we fix this?

● Okay, I’ll just make sure programs can handle receiving NULL from strchr
● But what if the program is calling strchr on a string that is guaranteed to have
the character they’re looking for? (i.e. strchr will for sure not return NULL)
● And what about all the other functions that can potentially return NULL for
one reason or another?
● And what about…
Low-hanging fruit #2

int main(int argc, char *argv[]) {

char buf[16];
strncpy(buf, argv[1], sizeof(buf));
printf("%s\n", buf);
}

https://linux.die.net/man/3/strncpy

Similar real-world example: https://en.wikipedia.org/wiki/Heartbleed

Low-hanging fruit #2

🍓 clang-tidy easy.c
🍓 cppcheck easy.c
Checking easy.c ...
🍓 scan-build clang-11 -Wall easy.c
scan-build: Using '/usr/local/Cellar/llvm/11.0.0_1/bin/clang-11' for static analysis
scan-build: Analysis run complete.
scan-build: Removing directory '/var/folders/6_/jdc6ljyd5n795x1xl8drptm80000gn/T/scan-
build-2021-04-01-002241-43549-1' because it contains no reports.
scan-build: No bugs found.
How do we fix this?

● Okay, I’ll just make sure programs add a null terminator after calling strncpy
● But what if the program actually uses the copied “string” as a character array
instead of a null-terminated string (i.e. the code is actually fine)?
● And how are you going to track down every function that depends on the
string having a null terminator?
● Note: outright banning strncpy() might be a better idea, but there are still
other ways we could end up with a char* that is not a null-terminated string
Fundamental limitations of static analysis

● If you can only look at a few lines of code, it’s hard to tell (without broader context)
whether that code is safe
● Getting broader context is impossible in the general case (see: “the halting problem”)
○ We can guesstimate what values get passed around in a program using dataflow
analysis, and we can guesstimate how they get used, but it breaks down when
code gets complicated
● You can always add more specific things to check for, but there will always be other
ways to mess up
● Begin to think about: is there some way we can make it easier to verify small
snippets of code in isolation, without broader context?
○ More next week: This general idea is a key motivation for Rust!
Takeaways

● If you are writing C/C++, you should absolutely be running sanitizers, fuzzers,
and static analyzers
○ You should understand the limitations of these tools, but…
○ Just because they are limited does not mean they aren’t helpful
● If you are in a position to use languages with more robust protections, you
should!
For next week

● Take 10 minutes to look through this buggy vector implementation: https://

web.stanford.edu/class/cs110l/lecture-notes/lecture-03/
○ Try to find as many bugs as you can
● Week 1 exercises due Monday (released today)

Valgrind Manual
No ratings yet
Valgrind Manual
409 pages
2 MemoryCorruption
No ratings yet
2 MemoryCorruption
77 pages
Valgrind Manual
No ratings yet
Valgrind Manual
396 pages
Hydraulic Pump, Standby Pressure and Working Pressure, Checking and Adjusting
No ratings yet
Hydraulic Pump, Standby Pressure and Working Pressure, Checking and Adjusting
5 pages
Isuzu TF Series Gasoline Engine Workshop Manual
100% (45)
Isuzu TF Series Gasoline Engine Workshop Manual
20 pages
Valgrind Manual
No ratings yet
Valgrind Manual
349 pages
Valgrind Manual
No ratings yet
Valgrind Manual
388 pages
Intro To C - Module 8
No ratings yet
Intro To C - Module 8
17 pages
2 MemoryCorruption
No ratings yet
2 MemoryCorruption
146 pages
Ad46 Revit Mep Training Essentials
No ratings yet
Ad46 Revit Mep Training Essentials
3 pages
Valgrind Manual
No ratings yet
Valgrind Manual
274 pages
XP Display: Installation & User Guide
No ratings yet
XP Display: Installation & User Guide
30 pages
Valgrind Documentation
No ratings yet
Valgrind Documentation
319 pages
Programming Split
No ratings yet
Programming Split
102 pages
Buffer
No ratings yet
Buffer
78 pages
16 Debugging
No ratings yet
16 Debugging
94 pages
Valgrind Manual
No ratings yet
Valgrind Manual
299 pages
Functional Specification: Project SAP Support
No ratings yet
Functional Specification: Project SAP Support
7 pages
3 Software Security
No ratings yet
3 Software Security
78 pages
Instant Ebooks Textbook Handbook of Data Science Approaches For Biomedical Engineering 1st Edition Valentina Emilia Balas Download All Chapters
100% (4)
Instant Ebooks Textbook Handbook of Data Science Approaches For Biomedical Engineering 1st Edition Valentina Emilia Balas Download All Chapters
53 pages
Fa271153 1636353531708
No ratings yet
Fa271153 1636353531708
88 pages
4 MemoryCorruption
No ratings yet
4 MemoryCorruption
55 pages
Buffer Overflows: Erik Poll
No ratings yet
Buffer Overflows: Erik Poll
61 pages
ch10 2025
No ratings yet
ch10 2025
42 pages
Essence of The Problem
No ratings yet
Essence of The Problem
58 pages
Unit IV-storage Virtualization
No ratings yet
Unit IV-storage Virtualization
26 pages
09 FindingBugs
No ratings yet
09 FindingBugs
41 pages
Lecture 01 2022
No ratings yet
Lecture 01 2022
38 pages
Find Vulnerabilities
No ratings yet
Find Vulnerabilities
28 pages
3 StaticAnalysisPREfast
No ratings yet
3 StaticAnalysisPREfast
36 pages
Fuzzing
No ratings yet
Fuzzing
28 pages
Catalog Man 1
No ratings yet
Catalog Man 1
116 pages
2-A Case of Dynamic Program Analysis - CS510 Software Engineering
No ratings yet
2-A Case of Dynamic Program Analysis - CS510 Software Engineering
42 pages
C Profiling
No ratings yet
C Profiling
17 pages
Rec 06
No ratings yet
Rec 06
30 pages
Fuzzing and Patch Analysis - SAGEly Advice
No ratings yet
Fuzzing and Patch Analysis - SAGEly Advice
61 pages
BsidesDelhi 2020 Hardik
No ratings yet
BsidesDelhi 2020 Hardik
45 pages
An Introduction To Dynamic Analysis For R.E. (2020) PDF
No ratings yet
An Introduction To Dynamic Analysis For R.E. (2020) PDF
30 pages
Writing Solid Code: Zubin Huang
No ratings yet
Writing Solid Code: Zubin Huang
65 pages
Non - Authoritative Applications - 1
No ratings yet
Non - Authoritative Applications - 1
33 pages
Recitation11 Malloc2
No ratings yet
Recitation11 Malloc2
17 pages
EE485 DebuggingTechniques
No ratings yet
EE485 DebuggingTechniques
24 pages
LC60 LC70 Le732u (Excludpwb) Fin PDF
No ratings yet
LC60 LC70 Le732u (Excludpwb) Fin PDF
130 pages
C Language Topics For Interview
No ratings yet
C Language Topics For Interview
24 pages
Facebook Instagram Acquisition Closes
100% (1)
Facebook Instagram Acquisition Closes
10 pages
Porta Punch Manual
No ratings yet
Porta Punch Manual
16 pages
Operator's Manual: MDKBK MDKBL MDKBM MDKBN MDKBP MDKBR Mdkbs MDKBT Mdkbu
No ratings yet
Operator's Manual: MDKBK MDKBL MDKBM MDKBN MDKBP MDKBR Mdkbs MDKBT Mdkbu
50 pages
Lect 19
No ratings yet
Lect 19
20 pages
C Language Topics For Interview
No ratings yet
C Language Topics For Interview
24 pages
Chapter
No ratings yet
Chapter
22 pages
r04 Debugging
No ratings yet
r04 Debugging
18 pages
Intro To C - Module 4
No ratings yet
Intro To C - Module 4
15 pages
Using Valgrind:: Detecting Memory Errors
No ratings yet
Using Valgrind:: Detecting Memory Errors
11 pages
Tutorial 3: Valgrind: By: Vajih Montaghami
No ratings yet
Tutorial 3: Valgrind: By: Vajih Montaghami
10 pages
Cada Manual
No ratings yet
Cada Manual
6 pages
Chapter 3 - Software Security
No ratings yet
Chapter 3 - Software Security
31 pages
1 SourceCodeSecurity
No ratings yet
1 SourceCodeSecurity
57 pages
Experiment 1.3: 1. Aim/Overview of The Practical: To Create An Application To Calculate Interest For FDS
No ratings yet
Experiment 1.3: 1. Aim/Overview of The Practical: To Create An Application To Calculate Interest For FDS
12 pages
Dynamic Memory Allocation in C: (Mca Sem 1 Imscdr)
No ratings yet
Dynamic Memory Allocation in C: (Mca Sem 1 Imscdr)
21 pages
Source Code Security: I. II. I. II. Iii. IV. V. VI. Vii. Viii. IX. X
No ratings yet
Source Code Security: I. II. I. II. Iii. IV. V. VI. Vii. Viii. IX. X
51 pages
Debugging Tools: Towards Better Use of System Tools To Weed The Nasty Critters Out of Your Programs
No ratings yet
Debugging Tools: Towards Better Use of System Tools To Weed The Nasty Critters Out of Your Programs
58 pages
Common Syllabus For Bachelor in Computer Applications (Bca) Preamble
No ratings yet
Common Syllabus For Bachelor in Computer Applications (Bca) Preamble
42 pages
20bce7466 Secure Coding Assignment-7
No ratings yet
20bce7466 Secure Coding Assignment-7
5 pages
CS29206 Systems Programming Laboratory, Spring 2022-2023
No ratings yet
CS29206 Systems Programming Laboratory, Spring 2022-2023
4 pages
Book Sample Buffer
No ratings yet
Book Sample Buffer
70 pages
Useful Tools For Develpment
No ratings yet
Useful Tools For Develpment
18 pages
QO Loadcentres - QO8L100S
No ratings yet
QO Loadcentres - QO8L100S
2 pages
Proposal Defense in Practical Research - 1: Submitted by
No ratings yet
Proposal Defense in Practical Research - 1: Submitted by
6 pages
1 Introduction To Servlets
No ratings yet
1 Introduction To Servlets
32 pages
Generator Protection Relay Panels: Protection and Instrumentation Section Daily Check Sheet Power House
No ratings yet
Generator Protection Relay Panels: Protection and Instrumentation Section Daily Check Sheet Power House
7 pages
Memory Allocation
No ratings yet
Memory Allocation
21 pages
Crash Course in C and Assembly: Zeljko Vrba
No ratings yet
Crash Course in C and Assembly: Zeljko Vrba
10 pages
Secure Programming With Static Analysis
No ratings yet
Secure Programming With Static Analysis
56 pages
81131H Regolat-Program ENG
No ratings yet
81131H Regolat-Program ENG
13 pages
Guion Catalina Project
No ratings yet
Guion Catalina Project
2 pages
Static Analysis of String Manipulations in Critical Embedded C Programs
No ratings yet
Static Analysis of String Manipulations in Critical Embedded C Programs
17 pages
Buffer Overflow Part 1
No ratings yet
Buffer Overflow Part 1
30 pages
IT 103 Module 2
No ratings yet
IT 103 Module 2
13 pages
Notes C
No ratings yet
Notes C
10 pages
WSN Brochure
No ratings yet
WSN Brochure
2 pages
WWW - Worldcolleges.info: Predict The Output or Error(s) For The Following
No ratings yet
WWW - Worldcolleges.info: Predict The Output or Error(s) For The Following
3 pages
LAPD Link Congestion Alarm
No ratings yet
LAPD Link Congestion Alarm
3 pages
NA XX Mobil Delvac Extended Life 5050 Prediluted CoolantAntifreeze C
No ratings yet
NA XX Mobil Delvac Extended Life 5050 Prediluted CoolantAntifreeze C
2 pages
Poster
No ratings yet
Poster
1 page
3.2.4.4 H83DSDMM: Functions and Specifications
No ratings yet
3.2.4.4 H83DSDMM: Functions and Specifications
4 pages
Q1 W6 MIL Types of Media
No ratings yet
Q1 W6 MIL Types of Media
5 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Basic Information About C language PDF
From Everand
Basic Information About C language PDF
Suraj Das
No ratings yet
50 Recipes for Programming Node.js
From Everand
50 Recipes for Programming Node.js
Jamie Munro
3/5 (4)
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet

Lecture 02 2022

Uploaded by

Lecture 02 2022

Uploaded by

Program Analysis

● Please make sure you’re on Slack (Canvas -> Slack tab)

● Instruments binaries on the fly

● Takes in compiled binary (executable)

For more on how valgrind works: https://valgrind.org/docs/valgrind2007.pdf

Local variables stack pointer

For more on how valgrind works: https://valgrind.org/docs/valgrind2007.pdf

● Same idea, but instrument source code

Control flow graph

run the program again

Control flow graph

Control flow graph

Continue this process forever…

● Very simple but extremely effective

int main(int argc, char *argv[]) {

clang-tidy will do dataflow analysis too!

int main(int argc, char *argv[]) {

int main(int argc, char *argv[]) {

int main(int argc, char *argv[]) { // Find the close bracket

int main(int argc, char *argv[]) { // Find the close bracket

Tracking calls to functions is no different from tracing paths through if statements

Tracking calls to functions is no different from tracing paths through if statements

Tracking calls to functions is no different from tracing paths through if statements

Tracking calls to functions is no different from tracing paths through if statements

Tracking calls to functions is no different from tracing paths through if statements

Tracking calls to functions is no different from tracing paths through if statements

Cool! Let’s tidy all the code!! 🏎🔥💯

int main(int argc, char *argv[]) {

int main(int argc, char *argv[]) {

Similar real-world example: https://en.wikipedia.org/wiki/Heartbleed

● Take 10 minutes to look through this buggy vector implementation: https://

You might also like