Week 05 Testing

Finding vulnerabilities

by fuzzing, dynamic and static analysis


CENG452
Conceptualizing vulnerabilities and exploits
Common categories of software bugs
Design issue: The conceptual state machine does not meet the intended goals

e.g. The firewall’s remote interface is designed with a hardcoded admin password

Functionality bug: The code has bad transitions, but only between validly represented states

e.g. The save button code is broken: there is no transition to the “saving the file” state

Implementation bug: The code introduces new states not represented in the conceptual state machine

e.g. A lack of length checks introduces a new “stack corruption” state

Other ways to reach unintended states
Hardware fault: The hardware suffers a glitch that causes a transition to an unintended state even if the code is perfect

e.g. A cosmic ray causes a bit flip in a voting machine’s memory, producing a state where one candidate has an impossible number of votes

Transmission error: The code is correct but is corrupted in flight

e.g. A program downloaded from the internet suffers packet corruption, so the program that is run has a different state machine from the one that was sent

This list is not intended to be exhaustive; it merely illustrates the myriad ways that unintended states can enter a system. Deciding which ones to defend against is one step of proper threat modeling.
For any interesting program, it is essentially impossible to manually explore the full state space to find the unintended states.
Fuzzing
Fuzzing
Find bugs in a program by feeding it random, corrupted, or unexpected data

Idea: Random inputs will explore a large part of the state space
Some unintended states are observable as crashes (SIGSEGV, abort())

Any crash is a bug, but only some bugs are exploitable

Works best on programs that parse files or process complex input data
Fuzzing example
Fuzzing can be as simple as:
cat /dev/random | head -c 512 > rand.jpeg; open rand.jpeg

How could we do better?

Randomly corrupt real JPEG files (see the sketch below)

Reference the JPEG spec so that we generate only “JPEG-looking” data

Measure the JPEG parser to see how deep we’re getting in the code
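As a flavor of the first improvement, here is a minimal Python sketch that corrupts a few bytes of a real JPEG before handing it to a viewer; the seed file and viewer command are placeholders for whatever is available on the test machine.

import random
import subprocess

SEED = "sample.jpeg"       # hypothetical seed file
VIEWER = ["xdg-open"]      # hypothetical viewer command

# Flip a handful of random bytes in a real JPEG, then open the result.
data = bytearray(open(SEED, "rb").read())
for _ in range(8):
    data[random.randrange(len(data))] = random.randrange(256)

with open("rand.jpeg", "wb") as f:
    f.write(data)
subprocess.run(VIEWER + ["rand.jpeg"])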
Common fuzzing strategies
Mutation-based fuzzing

Randomly mutate test cases from some corpus of input files

Generation-based (smart) fuzzing

Generate test cases based on a specification for the input format

Coverage-guided fuzzing

Measure code coverage of test cases to guide fuzzing towards new (unexplored) program states

This is not a rigid taxonomy: fuzzers often employ multiple strategies.


Mutation-based fuzzing
Randomly mutate test cases from some corpus of input files

1. Collect a corpus of inputs that explores as many states as possible


2. Perturb inputs randomly, possibly guided by heuristics

Modify: bit flips, integer increments
Substitute: small integers, large integers, negative integers

3. Run the program on the inputs and check for crashes

4. Go back to step 2
Can mutation-based “dumb” fuzzing be successful?
In 2010, Charlie Miller fuzzed PDF viewers using the following mutation program:
# Excerpt of the mutation step (Python 2): buf holds the bytes of a seed file
# and FuzzFactor controls roughly how many of them get changed.
numwrites = random.randrange(math.ceil((float(len(buf)) / FuzzFactor))) + 1
for j in range(numwrites):
    rbyte = random.randrange(256)
    rn = random.randrange(len(buf))
    buf[rn] = "%c" % (rbyte)

Found 64 exploitable-looking crashes

Dumb fuzzing is often way more successful than it has any right to be
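To connect this to steps 3 and 4 of the procedure above, a minimal driver around a Miller-style mutator might look like the following sketch; the seed file, target command, and fuzz factor are placeholders, and a negative return code is taken to mean "killed by a signal" (e.g. SIGSEGV).

import random
import shutil
import subprocess

SEED = "seed.pdf"            # hypothetical seed file
TARGET = ["./pdf_viewer"]    # hypothetical target command
FUZZ_FACTOR = 250

seed = open(SEED, "rb").read()
for i in range(10000):
    # Step 2: mutate a copy of the seed.
    buf = bytearray(seed)
    for _ in range(random.randrange(max(1, len(buf) // FUZZ_FACTOR)) + 1):
        buf[random.randrange(len(buf))] = random.randrange(256)
    with open("testcase.pdf", "wb") as f:
        f.write(buf)

    # Step 3: run the target and check for crashes.
    try:
        proc = subprocess.run(TARGET + ["testcase.pdf"], timeout=5,
                              stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    except subprocess.TimeoutExpired:
        continue
    if proc.returncode < 0:                     # killed by a signal
        shutil.copy("testcase.pdf", f"crash_{i}.pdf")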
Mutation-based fuzzing
Advantages

Simple to set up and run

Can use off-the-shelf software (possibly with a harness) for many programs

Limitations
Results depend strongly on the quality of the initial corpus

Coverage may be shallow for formats with checksums or validation


Generation-based (smart) fuzzing
Generate test cases based on a specification for the input format
1. Convert a specification of the input format (RFC, etc.) into a generative
procedure

2. Generate test cases according to the procedure and introduce random perturbations

3. Run the program on the inputs and check for crashes

4. Go back to step 2
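As an illustration (not tied to any particular spec), the sketch below generates test cases for a hypothetical type-length-value format, with occasional deliberately wrong lengths as the perturbation step.

import random
import struct

# Toy generator for a hypothetical TLV (type-length-value) format: the "spec"
# is encoded as a generative procedure, and length fields are sometimes made
# inconsistent on purpose to probe the parser's error handling.
def gen_record():
    tag = random.choice([0x01, 0x02, 0x7F, 0xFF])     # known and unknown tags
    payload = bytes(random.randrange(256) for _ in range(random.randrange(64)))
    length = len(payload)
    if random.random() < 0.2:                          # random perturbation
        length = random.choice([0, length + 1, 0xFFFF])
    return struct.pack(">BH", tag, length) + payload

def gen_testcase():
    return b"TLV1" + b"".join(gen_record() for _ in range(random.randrange(1, 8)))

with open("testcase.bin", "wb") as f:
    f.write(gen_testcase())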
Syzkaller
A kernel system call fuzzer that uses test case generation and coverage

Test cases are sequences of syscalls generated from syscall descriptions

Runs the test case program in a VM

Kernel crashes in the VM indicate potential Local Privilege Escalation (LPE) vulnerabilities
https://github.com/google/syzkaller/blob/master/docs/syscall_descriptions.md
Generation-based (smart) fuzzing
Advantages
Can get deeper coverage faster by leveraging knowledge of the input format

Input format/protocol complexity is not a limit on coverage depth

Limitations

Requires a lot of effort to set up

Successful fuzzers are often domain-specific

Coverage is limited by the accuracy of the spec; the implementation may diverge from it


Coverage-guided fuzzing
Key insight: code coverage is a useful metric, so why not use it as feedback to guide fuzzing?

Prefer test cases that reach new states

Basic block coverage: Has this basic block in the CFG been run?
Edge coverage: Has this branch been taken?
Path coverage: Has this particular path through the program been taken?

https://googleprojectzero.blogspot.com/2020/07/mms-exploit-part-2-effective-fuzzing-qmage.html
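As a toy illustration of edge coverage, the sketch below records (previous line, current line) pairs while running a Python function under test; production fuzzers such as AFL collect the same kind of signal with compiled-in instrumentation rather than a tracing hook.

import sys

edges = set()   # (function name, previous line, current line) pairs seen so far

def tracer(frame, event, arg):
    # Record an "edge" every time execution moves from one line to the next.
    if event == "line":
        edges.add((frame.f_code.co_name, tracer.prev, frame.f_lineno))
        tracer.prev = frame.f_lineno
    return tracer
tracer.prev = None

def run_with_coverage(func, *args):
    """Run func and report whether it exercised any previously unseen edges."""
    before = len(edges)
    sys.settrace(tracer)
    try:
        func(*args)
    finally:
        sys.settrace(None)
    return len(edges) > before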
american fuzzy lop (AFL)
1. Compile the program with instrumentation to measure coverage
2. Trim the test cases in the queue to the smallest size that doesn’t change the program behavior
3. Create new test cases by mutating the files in the queue using traditional fuzzing strategies
4. If a mutated file produces new coverage, add it to the queue
5. Go back to step 2

https://lcamtuf.coredump.cx/afl/README.txt
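The sketch below shows the same loop in miniature against a hand-instrumented toy parser (a stand-in, since the coverage signal would normally come from compiler instrumentation): inputs that reach new edges are kept in the queue, which makes the second "magic byte" much easier to hit.

import random

def toy_parser(data, hits):
    # Hand-instrumented stand-in for a real target: record branches taken.
    if data[:4] == b"MAGI":
        hits.add("magic")
        if len(data) > 8 and data[4] == 0xFF:
            hits.add("ff-branch")
            if data[5] == 0x42:
                raise RuntimeError("simulated crash")

def mutate(data):
    buf = bytearray(data)
    buf[random.randrange(len(buf))] = random.randrange(256)
    return bytes(buf)

seen_edges = set()
queue = [b"MAGI\x00\x00\x00\x00\x00\x00"]           # initial corpus
for _ in range(200000):
    candidate = mutate(random.choice(queue))
    hits = set()
    try:
        toy_parser(candidate, hits)
    except RuntimeError:
        print("crash found:", candidate)
        break
    if hits - seen_edges:                           # new coverage, keep it
        seen_edges |= hits
        queue.append(candidate)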
Coverage-guided fuzzing
Advantages
Very good at finding new program states, even if the initial corpus is limited

Combines well with other fuzzing strategies

Wildly successful track record

Limitations
Not a panacea to bypass strong checksums or input validation

Still doesn’t find all types of bugs (e.g. race conditions)


Real world example: Fuzzing the Samsung Qmage codec
In 2019, Mateusz Jurczyk discovered the Qmage
image codec included on Samsung smartphones

Reachable via zero-click MMS


The code looks fragile but the library is closed source

Few examples of Qmage files

Mateusz developed a harness to enable large-scale coverage-guided fuzzing of the Qmage codec

https://googleprojectzero.blogspot.com/2020/07/mms-exploit-part-2-effective-fuzzing-qmage.html
Fuzzing the Samsung Qmage image codec: harness
A fuzzing harness was written to call the interesting functions in the library and supply the test case input from the fuzzer

d2s:/data/local/tmp $ ./loader accessibility_light_easy_off.qmg
[+] Detected image characteristics:
[+] Dimensions: 344 x 344
[+] Color type: 4
[+] Alpha type: 3
[+] Bytes per pixel: 4
[+] codec->GetAndroidPixels() completed successfully
d2s:/data/local/tmp $

An emulator (qemu-aarch64) was used to run the harness and Qmage library on a Linux machine

Easier to get 1000 Linux cores than 1000 Samsung Galaxy phones
Fuzzing the Samsung Qmage image codec: coverage
Code coverage was collected by modifying qemu-aarch64 to trace executed PC addresses

Coverage feedback compensated for the small number of initial test cases
Fuzzing the Samsung Qmage image codec: results
4 weeks of fuzzing
87.3% coverage of the Qmage codec

5218 unique crashes


https://www.youtube.com/watch?v=nke8Z3G4jnc
Another cool fuzzer: Fuzzilli
Very successful JavaScript fuzzer
Principle: Translate JavaScript to a dense Intermediate Language (IL), and fuzz the IL

https://github.com/googleprojectzero/fuzzilli
Fuzzing summary
Off-the-shelf fuzzers are excellent at finding bugs
Custom fuzzers are also excellent at finding bugs
Different fuzzers often find different bugs
Relatively easy to get started
Fuzzing doesn’t find all types of bugs

[Decision flowchart: “This code parses untrusted data” → “Should I write a fuzzer?” → “Yes”]


Dynamic analysis
Dynamic analysis
Analyze a program’s behavior by actually running its code

May be combined with compile-time modifications like instrumentation

Can modify the program’s behavior dynamically

Useful for rapid experimentation

Often complements fuzzing very well

https://web.stanford.edu/class/cs107/resources/valgrind.html
AddressSanitizer (ASan)
Fast memory error detector for C/C++ using compiler instrumentation and a runtime library that replaces malloc() to surround allocations with redzones

Detects:
Out-of-bounds accesses
Use-after-free
Use-after-return
Use-after-scope
Double-free, invalid free
Memory leaks

==9901==ERROR: AddressSanitizer: heap-use-after-free on address 0x60700000dfb5 at pc 0x45917b bp 0x7fff4490c700 sp 0x7fff4490c6f8
READ of size 1 at 0x60700000dfb5 thread T0
    #0 0x45917a in main use-after-free.c:5
    #1 0x7fce9f25e76c in __libc_start_main /build/buildd/eglibc-2.15/csu/libc-start.c:226
    #2 0x459074 in _start (a.out+0x459074)
0x60700000dfb5 is located 5 bytes inside of 80-byte region [0x60700000dfb0,0x60700000e000)
freed by thread T0 here:
    #0 0x4441ee in __interceptor_free projects/compiler-rt/lib/asan/asan_malloc_linux.cc:64
    #1 0x45914a in main use-after-free.c:4
    #2 0x7fce9f25e76c in __libc_start_main /build/buildd/eglibc-2.15/csu/libc-start.c:226
previously allocated by thread T0 here:
    #0 0x44436e in __interceptor_malloc projects/compiler-rt/lib/asan/asan_malloc_linux.cc:74
    #1 0x45913f in main use-after-free.c:3
    #2 0x7fce9f25e76c in __libc_start_main /build/buildd/eglibc-2.15/csu/libc-start.c:226
SUMMARY: AddressSanitizer: heap-use-after-free use-after-free.c:5 main

Typically 2x slowdown

https://github.com/google/sanitizers/wiki/AddressSanitizer
ThreadSanitizer (TSan)
Data race detector for C/C++
Similar in principle to AddressSanitizer but for race conditions

High overhead:
5-10x memory
5-15x slowdown

WARNING: ThreadSanitizer: data race (pid=19219)
  Write of size 4 at 0x7fcf47b21bc0 by thread T1:
    #0 Thread1 tiny_race.c:4 (exe+0x00000000a360)
  Previous write of size 4 at 0x7fcf47b21bc0 by main thread:
    #0 main tiny_race.c:10 (exe+0x00000000a3b4)
  Thread T1 (running) created at:
    #0 pthread_create tsan_interceptors.cc:705 (exe+0x00000000c790)
    #1 main tiny_race.c:9 (exe+0x00000000a3a4)

https://clang.llvm.org/docs/ThreadSanitizer.html
Frida
Dynamic instrumentation for closed-source binaries

Execute custom scripts inside the analyzed process

Hook functions, trace execution, modify behavior

Great way to fuzz internal functions without writing a harness

https://frida.re/docs/hacking/
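For flavor, here is a minimal sketch of hooking a function with Frida's Python bindings; the process name "target-app" and the hooked symbol are placeholders, and the exact agent API can differ between Frida versions.

import sys
import frida

JS = """
// Runs inside the analyzed process: log every call to open().
Interceptor.attach(Module.getExportByName(null, 'open'), {
    onEnter(args) {
        send('open(' + args[0].readUtf8String() + ')');
    }
});
"""

session = frida.attach("target-app")                 # attach by name or PID
script = session.create_script(JS)
script.on("message", lambda msg, data: print(msg))   # receive send() output
script.load()
sys.stdin.read()                                      # keep the hooks alive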
Static analysis
Static analysis
Using a tool to analyze a program’s behavior without actually running it

Test whether a certain property holds or find places where it is violated


Static analysis can prove some properties about the program that fuzzing and
dynamic analysis can’t

E.g., can prove that a program is free of NULL pointer dereferences


Despite lots of work in this area, there are countless interesting topics and huge
scope for improvements!
Undecidability of static analysis
Goal: Determine whether a given program satisfies a given property

This is undecidable in general: the halting problem reduces to it!


def solve_halting_problem(P, a):
    # If a static analyzer could always decide "does this program reach
    # bug()?", it would also decide halting: new_P reaches bug() exactly
    # when P(a) terminates.
    def new_P():
        P(a)
        bug()
    return static_analyzer_for_bug(new_P)
Soundness and completeness
The best static analyzer can only satisfy one of the following:*
Soundness: Everything that the static analyzer finds is a bug

But some bugs may be missed!

Completeness: The static analyzer finds every bug

But there may be false positives!

Most static analyzers are neither sound nor complete

* We are assuming termination.


Data flow analysis
Determine the possible values of variables at points in the control flow graph

Approximations are usually needed
Expressing the precise set of possible values may be arbitrarily complex

[Figure: example control flow graph with assignments (X = 0, Y = A, Z = Z + 1, X = X + 1, Z = 1) and branches on X == Y and Z == 2 leading to crash states; the analysis annotates each point with inferred value sets, e.g. X: {0} after X = 0, and X: {A}; Y: {A}; Z: ⊤ on the branch where X == Y holds, where ⊤ denotes “any value”]
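To make the idea concrete, here is a minimal sketch (an illustration, not the lecture's exact formulation) of a value-set domain: each variable maps to a set of possible values or ⊤, and facts arriving from different predecessors are joined.

TOP = None   # stands for ⊤, "could be any value"

def join(a, b):
    # Merge the facts arriving from two predecessor blocks in the CFG.
    out = {}
    for var in set(a) | set(b):
        va, vb = a.get(var, TOP), b.get(var, TOP)
        out[var] = TOP if va is TOP or vb is TOP else va | vb
    return out

def assign_const(facts, var, value):
    # Transfer function for the statement "var = value".
    new = dict(facts)
    new[var] = {value}
    return new

# Facts on the two sides of a branch, then at the merge point:
then_facts = assign_const({"X": {0}, "Z": TOP}, "Z", 1)   # X: {0}, Z: {1}
else_facts = {"X": {0}, "Z": TOP}
print(join(then_facts, else_facts))                       # X stays {0}, Z collapses to ⊤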
Taint analysis
Identify sources of “tainted” data:
User/attacker input
Reads from files/network

Check to see if tainted data flows into a “trusted sink”:
memcpy()
free()
bzero()

Example from a Samsung kernel driver (see the Project Zero report below), where values copied from user space flow into allocation sizes and copy lengths:

static int vipx_ioctl_get_container(struct vs4l_container_list *karg,
        struct vs4l_container_list __user *uarg)
{
    ...
    ret = copy_from_user(karg, uarg, sizeof(*karg));
    ...
    ucon = karg->containers;
    size = karg->count * sizeof(*kcon);
    kcon = kzalloc(size, GFP_KERNEL);
    ...
    karg->containers = kcon;
    ret = copy_from_user(kcon, ucon, size);
    if (ret) {
        vipx_err("Copy failed [CONTAINER] (%d)\n", ret);
        goto p_err_free;
    }
    for (idx = 0; idx < karg->count; ++idx) {
        ubuf = kcon[idx].buffers;
        size = kcon[idx].count * sizeof(*kbuf);
        kbuf = kzalloc(size, GFP_KERNEL);
        ...
        kcon[idx].buffers = kbuf;
        ret = copy_from_user(kbuf, ubuf, size);
        if (ret) {
            vipx_err("Copy failed [CONTAINER] (%d)\n", ret);
            goto p_err_free;
        }
    }
    ...
    return 0;
p_err_free:
    for (idx = 0; idx < karg->count; ++idx)
        kfree(kcon[idx].buffers);
    kfree(kcon);
p_err:
    return ret;
}

https://bugs.chromium.org/p/project-zero/issues/detail?id=1978
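To illustrate the source/propagate/sink idea in a few lines, here is a hedged sketch of dynamic taint tracking in Python; read_from_user() and alloc_buffer() are hypothetical stand-ins for a taint source and a sensitive sink, and static taint analysis applies the same reasoning to code without running it.

class Tainted(int):
    """An int that remembers it came from untrusted input."""
    def __add__(self, other):
        return Tainted(int(self) + int(other))
    __radd__ = __add__
    def __mul__(self, other):
        return Tainted(int(self) * int(other))
    __rmul__ = __mul__

def read_from_user():                 # taint source (placeholder input)
    return Tainted(1024)

def alloc_buffer(size):               # sensitive sink: allocation size
    if isinstance(size, Tainted):
        print(f"WARNING: tainted value {int(size)} reaches allocation size")
    return bytearray(int(size))

count = read_from_user()
alloc_buffer(count * 16)              # tainted count flows into the sink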
Clang static analyzer
Check for common security issues
with a static analysis framework in
the compiler

Built-in checkers:

Buffer overflows (with taint)


Refcount errors
malloc() integer overflows
Insecure API use
Uninitialized value use

https://clang-analyzer.llvm.org/images/analyzer_html.png
CodeQL (Semmle)
Query language for finding patterns
in large codebases

“SQL for searching code”


Works best when you have a
specific bad code pattern in mind

https://msrc-blog.microsoft.com/2018/08/16/vulnerability-hunting-with-semmle-ql-part-1/
Manual analysis
Reverse engineering
Looking at a compiled program in order to figure out what it does and how it works
Usually assisted by tools

Disassembler

Decompiler

Strings
Often aided by dynamic analysis

Tracing
IDA Pro
Disassembly
Decompilation

Binary analysis

Scripting
Ghidra
Similar to IDA

Open source
Written by the
NSA (no, really)

https://en.wikipedia.org/wiki/Ghidra#/media/File:Ghidra-disassembly,March_2019.png
Tips for writing (more) secure software
Software tests
One of the most effective ways to reduce bugs
Unit tests: Check that each piece of code behaves as expected in isolation

Goal: Unit tests should cover all code, including error handling

So many exploitable bugs would be eliminated with basic unit tests


Regression tests: Check that old bugs haven’t been reintroduced (a minimal sketch follows after this list)

If you don’t run regression tests, attackers will run them for you!

Integration tests: Check that modules work together as expected
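As a minimal sketch of the regression-test idea (hypothetical function and bug; unittest is Python's standard test framework), the crashing input from an old bug report stays in the suite forever:

import unittest

def parse_record(data: bytes) -> int:
    # Stand-in for the function that once had the bug: the original code
    # was missing this length check and crashed on short input.
    if len(data) < 4:
        raise ValueError("record too short")
    return int.from_bytes(data[:4], "little")

class RegressionTests(unittest.TestCase):
    def test_short_record_is_rejected_not_crashed(self):
        # The exact input from the original bug report.
        with self.assertRaises(ValueError):
            parse_record(b"\x01")

    def test_normal_record_still_parses(self):
        self.assertEqual(parse_record(b"\x2a\x00\x00\x00"), 42)

if __name__ == "__main__":
    unittest.main()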


General tips
Use a modern, memory safe language where possible: Go, Rust, etc.

Understand and document your threat model early in the design process
Treat all input from outside your process adversarially, even if you trust the sender

Use a clean, consistent style throughout the codebase


Additional material
Where does fuzzing excel?
Fuzzing works great on things that process input data: e.g. file parsers, image
formats, network protocols, etc.

Information density matters: complex structured binary formats (e.g. JPEG) are
generally more “fuzzable” than verbose or textual ones (e.g. Python source)

Even if the code being analyzed isn’t a good fit for fuzzing, it may be possible to
transform it into a fuzzable data-processing program
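One common way to do that transformation is to wrap the internal code in a small harness that consumes bytes from stdin, so any file-oriented fuzzer can drive it; the parser below is a hypothetical stand-in.

import sys

def parse_config(data: bytes) -> dict:
    # Hypothetical internal function we want to fuzz.
    out = {}
    for line in data.decode("utf-8", errors="replace").splitlines():
        if "=" in line:
            key, _, value = line.partition("=")
            out[key.strip()] = value.strip()
    return out

if __name__ == "__main__":
    # Harness entry point: feed whatever arrives on stdin to the parser.
    parse_config(sys.stdin.buffer.read())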
Developing exploits
“Exploits are the closest thing to ‘magic
spells’ we experience in the real world:
Construct the right incantation, gain
remote control over device.”

— Thomas Dullien

https://docs.google.com/presentation/d/1YcBqgccBcdn5-v80OX8NTYdu_-qRmrwfejlEx6eq-4E/edit
