Radare 2 Book
Radare 2 Book
of Contents
introduction 1.1
Introduction 1.2
History 1.2.1
Overview 1.2.2
Getting radare2 1.2.3
Compilation and Portability 1.2.4
Compilation on Windows 1.2.5
Command-line Flags 1.2.6
Basic Usage 1.2.7
Command Format 1.2.8
Expressions 1.2.9
Rax2 1.2.10
Basic Debugger Session 1.2.11
Contributing to radare2 1.2.12
Configuration 1.3
Colors 1.3.1
Common Configuration Variables 1.3.2
Basic Commands 1.4
Seeking 1.4.1
Block Size 1.4.2
Sections 1.4.3
Mapping Files 1.4.4
Print Modes 1.4.5
Flags 1.4.6
Write 1.4.7
Zoom 1.4.8
Yank/Paste 1.4.9
Comparing Bytes 1.4.10
SDB 1.4.11
Visual mode 1.5
2
Visual Disassembly 1.5.1
Visual Assembler 1.5.2
Visual Configuration Editor 1.5.3
Searching bytes 1.6
Basic Searches 1.6.1
Configurating the Search 1.6.2
Pattern Search 1.6.3
Automation 1.6.4
Backward Search 1.6.5
Search in Assembly 1.6.6
Searching for AES Keys 1.6.7
Signatures 1.7
Signatures 1.7.1
Disassembling 1.8
Adding Metadata 1.8.1
ESIL 1.8.2
Scripting 1.9
Loops 1.9.1
Macros 1.9.2
R2pipe 1.9.3
Rabin2 1.10
File Identification 1.10.1
Entrypoint 1.10.2
Imports 1.10.3
Exports 1.10.4
Symbols (exports) 1.10.5
Libraries 1.10.6
Strings 1.10.7
Program Sections 1.10.8
Radiff2 1.11
Binary Diffing 1.11.1
Rasm2 1.12
Assemble 1.12.1
Disassemble 1.12.2
3
Ragg2 1.13
Analysis 1.14
Code Analysis 1.14.1
Symbols information 1.14.2
Types 1.14.3
Rahash2 1.15
Rahash Tool 1.15.1
Debugger 1.16
Getting Started 1.16.1
Migration from ida, GDB or WinDBG 1.16.2
Registers 1.16.3
Reverse Debugging 1.16.4
Remote Access Capabilities 1.17
Remoting Capabilities 1.17.1
Remote GDB 1.17.2
Remote WinDbg 1.17.3
Plugins 1.18
Plugins 1.18.1
Crackmes 1.19
IOLI 1.19.1
IOLI 0x00 1.19.1.1
IOLI 0x01 1.19.1.2
Avatao 1.19.2
R3v3rs3 4 1.19.2.1
.intro 1.19.2.1.1
.radare2 1.19.2.1.2
.first_steps 1.19.2.1.3
.main 1.19.2.1.4
.vmloop 1.19.2.1.5
.instructionset 1.19.2.1.6
.bytecode 1.19.2.1.7
.outro 1.19.2.1.8
Reference Card 1.20
4
5
introduction
R2 "Book"
Online: http://radare.gitbooks.io/radare2book/content/
PDF: https://www.gitbook.com/download/pdf/book/radare/radare2book
Epub: https://www.gitbook.com/download/epub/book/radare/radare2book
Mobi: https://www.gitbook.com/download/mobi/book/radare/radare2book
Contributors:
Anton Kochkov
aoighost
Austin Hartzheim
David Tomaschik
DZ_ruyk
Grigory Rechistov
hdznrrd
Jeffrey Crowell
John
Judge Dredd
jvoisin
Kevin Grandemange
muzlightbeer
Peter C
sghctoma
SkUaTeR
TDKPS
Thanat0s
6
introduction
7
Introduction
Introduction
This book aims to cover most usage aspects of radare2. A framework for reverse
engineering and analyzing binaries.
--pancake
8
History
History
The radare project began in February of 2006 to provide a free and simple command-line
hexadecimal editor with support for 64-bit offsets. The intention was to use the tool to
perform searches and help recover data from hard-disks.
Since then, the project has evolved to provide a complete framework for analyzing binaries
while making use of basic *NIX concepts. Those concepts include the famous "everything is
a file," "small programs that interact using stdin/stdout," and "keep it simple" paradigms.
It is mostly a single-person project. However, ideas and source code contributions are
greatly appreciated.
The central focus of this project is the hexadecimal editor. Additionally, this project contains
an assembler/disassembler, code/data analysis and graphing tools, scripting features, easy
Unix integration, and more.
9
Overview
Overview
The Radare2 project is a set of small command-line utilities that can be used together or
independently.
radare2
The core of the hexadecimal editor and debugger. radare2 allows you to open a number of
input/output sources as if they were simple, plain files, including disks, network connections,
kernel drivers, processes under debugging, and so on.
It implements an advanced command line interface for moving around a file, analyzing data,
disassembling, binary patching, data comparison, searching, replacing, visualizing. It can be
scripted with a variety of languages, including Ruby, Python, Lua, and Perl.
rabin2
A program to extract information from executable binaries, such as ELF, PE, Java CLASS,
and Mach-O. rabin2 is used by the core to get exported symbols, imports, file information,
cross references (xrefs), library dependencies, sections, etc.
rasm2
A command line assembler and disassembler for multiple architectures (including Intel x86
and x86-64, MIPS, ARM, PowerPC and Java).
Examples
rahash2
10
Overview
An implementation of a block-based hash tool. From small text strings to large disks,
rahash2 supports multiple algorithms, including MD4, MD5, CRC16, CRC32, SHA1,
SHA256, SHA384, SHA512, par, xor, xorpair, mod255, hamdist, or entropy. rahash2 can be
used to check the integrity of, or track changes to, big files, memory dumps, and disks.
Examples
$ rahash2 file
file: 0x00000000-0x00000007 sha256: 887cfbd0d44aaff69f7bdbedebd282ec96191cce9d7fa73362
98a18efc3c7a5a
radiff2
A binary diffing utility that implements multiple algorithms. It supports byte-level or delta
diffing for binary files, and code-analysis diffing to find changes in basic code blocks
obtained from the radare code analysis, or from the IDA analysis using the rsc idc2rdb script.
rafind2
A program to find byte patterns in files.
ragg2
A frontend for r_egg. ragg2 compiles programs written in a simple high-level language into
tiny binaries for x86, x86-64, and ARM.
Examples
11
Overview
$ cat hi.r
/* hello world in r_egg */
write@syscall(4); //x64 write@syscall(1);
exit@syscall(1); //x64 exit@syscall(60);
main@global(128) {
.var0 = "hi!\n";
write(1,.var0, 4);
exit(0);
}
$ ragg2 -O -F hi.r
$ ./hi
hi!
$ cat hi.c
main@global(0,6) {
write(1, "Hello0", 6);
exit(0);
}
$ ragg2 hi.c
$ ./hi.c.bin
Hello
rarun2
A launcher for running programs within different environments, with different arguments,
permissions, directories, and overridden default file descriptors. rarun2 is useful for:
Crackmes
Fuzzing
Test suites
$ cat foo.rr2
#!/usr/bin/rarun2
program=./pp400
arg0=10
stdin=foo.txt
chdir=/tmp
#chroot=.
./foo.rr2
12
Overview
$ nc -l 9999
$ rarun2 program=/bin/ls connect=localhost:9999
2 - Create a new file containing the following rarun2 profile named foo.rr2:
#!/usr/bin/rarun2
program=/bin/ls
stdio=/dev/ttys010
rax2
A minimalistic mathematical expression evaluator for the shell that is useful for making base
conversions between floating point values, hexadecimal representations, hexpair strings to
ASCII, octal to integer, etc. It also supports endianness settings and can be used as an
interactive shell if no arguments are given.
Examples
$ rax2 1337
0x539
$ rax2 0x400000
4194304
$ rax2 -b 01111001
y
$ rax2 -S radare2
72616461726532
$ rax2 -s 617765736f6d65
awesome
13
Overview
14
Getting radare2
Getting radare2
You can get radare from the website, http://radare.org/, or the GitHub repository,
https://github.com/radare/radare2.
Binary packages are available for a number of operating systems (Ubuntu, Maemo, Gentoo,
Windows, iPhone, and so on). Yet, you are highly encouraged to get the source and compile
it yourself to better understand the dependencies, to make examples more accessible and of
course to have the most recent version.
A new stable release is typically published every month. Nightly tarballs are sometimes
available at http://bin.rada.re/.
The radare development repository is often more stable than the 'stable' releases. To obtain
the latest version:
This will probably take a while, so take a coffee break and continue reading this book.
To update your local copy of the repository, use git pull anywhere in the radare2 source
code tree:
$ git pull
If you have local modifications of the source, you can revert them (and loose them!) with:
Or send me a patch:
The most common way to get r2 updated and installed system wide is by using:
$ sys/install.sh
Helper Scripts
15
Getting radare2
Take a look at the sys/* scripts, those are used to automate stuff related to syncing, building
and installing r2 and its bindings.
The most important one is sys/install.sh. It will pull, clean, build and symstall r2 system wide.
Symstalling is the process of installing all the programs, libraries, documentation and data
files using symlinks instead of copying the files.
By default it will be installed in /usr, but you can define a new prefix as argument.
This is useful for developers, because it permits them to just run 'make' and try changes
without having to run make install again.
Cleaning Up
Cleaning up the source tree is important to avoid problems like linking to old objects files or
not updating objects after an ABI change.
The following commands may help you to get your git clone up to date:
If you want to remove previous installations from your system, you must run the following
commands:
$ ./configure --prefix=/usr/local
$ make purge
16
Compilation and Portability
People often want to use radare as a debugger for reverse engineering. Currently, the
debugger layer can be used on Windows, GNU/Linux (Intel x86 and x86_64, MIPS, and
ARM), FreeBSD, NetBSD, and OpenBSD (Intel x86 and x86_64). There are plans to support
Solaris and MacOS X.
Compared to core, the debugger feature is more restrictive portability-wise. If the debugger
has not been ported to your favorite platform, you can disable the debugger layer with the --
without-debugger configure script option when compiling radare2.
Note that there are I/O plugins that use GDB, GDB Remote, or Wine as back-ends, and
therefore rely on presence of corresponding third-party tools.
$ ./configure --prefix=/usr
$ gmake
$ sudo gmake install
$ sys/install.sh
Static Build
You can build statically radare2 and all the tools with the command:
$ sys/static.sh
Docker
Radare2 repository ships a Dockerfile that you can use with Docker.
This dockerfile is also used by Remnux distribution from SANS, and is available on the
docker registryhub.
17
Compilation and Portability
18
Compilation on Windows
Compilation on Windows
Meson (MSVC)
WARNING: This build is not being tested for the moment. Note that it does not
compile every plugins available in radare2 yet. Refer to the Mingw32 build below for a
more stable and complete build.
The most native way to compile radare2 under Windows is to use meson + msvc. First you
need python3 to be installed on your computer. Once this is done, you can install the meson
build system using pip3 install meson (with Administrator privileges). Now navigate to your
Python installation folder, and copy the meson.py from .\Scripts subfolder into your
radare2 folder.
Meson also requires Ninja. You can download it from here. Copy ninja.exe binary into your
radare2 folder. Then run meson.bat and wait until compilation is done.
Compiling
At first you will need to run vsvarsall.bat from the corresponding directory of your
Visual Studio or Visual C++ Build Tools. In case of VS2015 it is located usually in
C:\\Program Files (x86)\\Microsoft Visual Studio 14.0\\VC\\ or in C:\\Program Files
(x86)\\Microsoft Visual Studio\2017\Community\VC\Auxiliary\Build\vcvarsall.bat
Using Ninja build system: This is the easiest, simply run meson.bat and wait until the
compilation is complete.
Using Visual Studio: If you want to work on radare2 in Visual Studio, you can simply
run the command meson.bat -p which will generate a Visual Studio 2015 project. Then
you can load it from build\radare2.sln .
In any case, everything is generated in the build folder by default, except for the sdb files
generation process.
Installing
Now that you successfully compiled radare2, you might want to gather every executables
and requirements into the same place. The command below will collect everything and put it
into the dist folder.
19
Compilation on Windows
E:\radare2>sys\meson_install.bat dist
You can now test your newly crafted radare2 binary easily:
E:\radare2>cd dist
E:\radare2\dist>radare2.exe -v
Mingw32
The easy way to compile things for Windows is using Mingw32. The w32 builds distributed
from the radare homepage are generated from a GNU/Linux box using Mingw32 and they
are tested with Wine. Also keep in mind, that Mingw-w64 isn't tested, so no guarantees here.
Be sure to setup your Mingw32 to compile with thread model: win32, not posix, and target
should be mingw32. Before the starting of compilation you need to setup git first, for a
proper automatic fetching of capstone:
The following is an example of compiling with MinGW32 (you need to have installed zip for
Windows):
CC=i486-mingw32-gcc ./configure
make
make w32dist
zip -r w32-build.zip w32-build
This generates a native, 32-bit console application for Windows. The 'i486-mingw32-gcc'
compiler is the one I have in my box, you will probably need to change this.
you have Mingw32 installed in C:\Mingw and Git in C:\Program Files (x86)\Git . If you want
to use another installations, just set MINGW_PATH and GIT_PATH variables correspondingly:
set MINGW_PATH=D:\Mingw32
set "GIT_PATH=E:\Program and Stuff\Git"
sys\mingw32.bat
Please, note, that this script should be run from radare2 directory.
20
Compilation on Windows
sys/mingw32.sh
Cygwin
Cygwin is another possibility; however, issues related to Cygwin libraries can make
debugging difficult. But using binary compiled for Cygwin will allow you to use Unicode in the
Windows console, and to have 256 colors.
Note, Cygwin build require exactly the opposite git configuration, so setup git first, for a
proper automatic fetching of capstone:
Please, be sure to build radare2 from the same environment you're going to use r2 in. If you
are going to use r2 in MinGW32 shell or cmd.exe — you should build r2 in the MinGW32
environment. And if you are going to use r2 in Cygwin — you have to build r2 from the
Cygwin shell. Since Cygwin is more UNIX-compatible than MinGW, the radare2 supports
more colors and Unicode symbols if build using the former one.
Mingw-W64
Download the MSYS2 distribution from the official site: http://msys2.github.io/
Setup the proxy (if needed):
export http_proxy=<myusername>:<mypassword>@zz-wwwproxy-90-v:8080
export https_proxy=$http_proxy
export ftp_proxy=$http_proxy
export rsync_proxy=$http_proxy
export rsync_proxy=$http_proxy
export no_proxy="localhost,127.0.0.1,localaddress,.localdomain.com"
Update packages:
Close MSYS2, run it again from Start menu and update the rest with
pacman -Su
21
Compilation on Windows
Bindings
To build radare2 bindings, you will need to install Vala (valac) for Windows
After you installed valabind, you can build radare2-bindings, for example for Python and
Perl:
22
Command-line Flags
Command-line Options
The radare core accepts many flags from command line.
$ radare2 -h
Usage: r2 [-dDwntLqv] [-P patch] [-p prj] [-a arch] [-b bits] [-i file] [-s addr] [-B
blocksize] [-c cmd] [-e k=v] file|-
Open a file in write mode without parsing the file format headers.
$ r2 -nw file
$ r2 -
Specify which sub-binary you want to select when opening a fatbin file:
23
Command-line Flags
$ r2 -a ppc -b 32 ls.fat
$ r2 -i patch.r2 target.bin
$ r2 -e scr.color=false blah.bin
Debug a program:
$ r2 -d ls
$ r2 -p test
24
Basic Usage
Navigation, inspection and modification of a loaded binary file is performed using three
simple actions: seek (to position), print (buffer), and alternate (write, append).
The 'seek' command is abbreviated as s and accepts an expression as its argument. The
expression can be something like 10 , +0x25 , or [0x100+ptr_table] . If you are working
with block-based files, you may prefer to set the block size to a required value with b
command, and seek forward or backwards with positions aligned to it. Use s++ and s--
commands to navigate this way.
If r2 opens an executable file, by default it will open the file in VA mode and the sections will
be mapped to their virtual addresses. In VA mode, seeking is based on the virtual address
and the starting position is set to the entry point of the executable. Using -n option you can
suppress this default behavior and ask r2 to open the file in non-VA mode for you. In non-VA
mode, seeking is based on the offset from the beginning of the file.
The 'print' command is abbreviated as p and has a number of submodes — the second
letter specifying a desired print mode. Frequent variants include px to print in hexadecimal,
and pd for disassembling.
To be allowed to write files, specify the -w option to radare when opening a file. The w
command can be used to write strings, hexpairs ( x subcommand), or even assembly
opcodes ( a subcommand). Examples:
To enter visual mode, press V<enter> . Use q to quit visual mode and return to the prompt.
In visual mode you can use HJKL keys to navigate (left, down, up, and right, respectively).
You can use these keys in cursor mode toggled by c key. To select a byte range in cursor
mode, hold down SHIFT key, and press navigation keys HJKL to mark your selection. While
25
Basic Usage
in visual mode, you can also overwrite bytes by pressing i . You can press TAB to switch
between the hex (middle) and string (right) columns. Pressing q inside the hex panel
returns you to visual mode.
26
Command Format
Command Format
A general format for radare commands is as follows:
[.][times][cmd][~grep][@[@iter]addr!size][|>pipe] ;
px # run px
3px # run px 3 times
The ! prefix is used to execute a command in shell context. If a single exclamation mark is
used, commands will be sent to the system() hook defined in currently loaded I/O plugin.
This is used, for example, by the ptrace I/O plugin, which accepts debugger commands from
radare interface.
A few examples:
The @ character is used to specify a temporary offset at which the command to its left will
be executed. The original seek position in a file is then restored. For example, pd 5 @
0x100000fce to disassemble 5 instructions at address 0x100000fce.
The ~ character enables internal grep-like function used to filter output of any command.
For example:
27
Command Format
pd 20~call:0[0] ; grep the first column of the first row matching 'call'
This internal grep function is a key feature for scripting radare, because it can be used to
iterate over a list of offsets or data generated by disassembler, ranges, or any other
command. Refer to the macros section (iterators) for more information.
Most of the commands offer autocompletion support using <TAB> key, for example s eek
or f lags commands. It offers autocompletion using the all possible values, taking
flagnames in this case. Note, that it's possible to see the history of the commands using
!~... command - it offers a visual mode to scroll through the radare2 command history.
28
Expressions
Expressions
Expressions are mathematical representations of 64-bit numerical values. They can be
displayed in different formats, be compared or used with all commands accepting numeric
arguments. Expressions can use traditional arithmetic operations, as well as binary and
boolean ones. To evaluate mathematical expressions prepend them with command ? . For
example:
[0xB7F9D810]> ? 0x8048000
134512640 0x8048000 01001100000 128.0M 804000:0000 134512640 00000000 134512640.0 0.00
0000
[0xB7F9D810]> ? 0x8048000+34
134512674 0x8048022 01001100042 128.0M 804000:0022 134512674 00100010 134512674.0 0.00
0000
[0xB7F9D810]> ? 0x8048000+0x34
134512692 0x8048034 01001100064 128.0M 804000:0034 134512692 00110100 134512692.0 0.00
0000
[0xB7F9D810]> ? 1+2+3-4*3
-6 0xfffffffffffffffa 01777777777777777777772 17179869183.0G fffff000:0ffa -6
+ : addition
- : subtraction
* : multiplication
/ : division
% : modulus
> : shift right
< : shift left
Use of logical OR should be escaped using quotes, or it will be mistaken for a pipe opeator:
29
Expressions
You can also use variables and seek positions to build complex expressions. Available
values include:
?@? or type @@? ; misc help for '@' (seek), '~' (grep) (see ~??)
?$? ; show available '$' variables
$$ ; here (the current virtual seek)
$l ; opcode length
$s ; file size
$j ; jump address (e.g. jmp 0x10, jz 0x10 => 0x10)
$f ; jump fail address (e.g. jz 0x10 => next instruction)
$m ; opcode memory reference (e.g. mov eax,[0x10] => 0x10)
[0x4A13B8C0]> ? $m + $l
140293837812900 0x7f98b45df4a4 03771426427372244 130658.0G 8b45d000:04a4 1402938378129
00 10100100 140293837812900.0 -0.000000
[0x4A13B8C0]> pd 1 @ +$l
0x4A13B8C2 call 0x4a13c000
30
Rax2
Rax2
The rax2 utility comes with the radare framework and aims to be a minimalistic expression
evaluator for the shell. It is useful for making base conversions between floating point
values, hexadecimal representations, hexpair strings to ascii, octal to integer. It supports
endianness and can be used as a shell if no arguments are given.
$ rax2 -h
31
Rax2
Some examples:
$ rax2 3+0x80
0x83
$ rax2 0x80+3
131
$ rax2 -s 4142
AB
$ rax2 -S AB
4142
$ rax2 -e 33
0x21000000
$ rax2 -e 0x21000000
33
$ rax2 -K 90203010
+--[0x10302090]---+
|Eo. . |
| . . . . |
| o |
| . |
| S |
| |
| |
| |
| |
+-----------------+
32
Basic Debugger Session
$ pidof mc
32220
$ r2 -d 32220
$ r2 -d /bin/ls
In the second case, the debugger will fork and load the debuggee ls program in memory.
It will pause its execution early in ld.so dynamic linker. Therefore, do not expect to see an
entrypoint or shared libraries at this point. You can override this behavior by setting another
name for and entry breakpoint. To do this, add a radare command e dbg.bep=entry or e
dbg.bep=main to your startup script, usually it is ~/.radare2rc . Be warned though that
certain malware or other tricky programs can actually execute code before main() and thus
you'll be unable to control them.
Maybe a simpler method to use debugger in radare is to switch it to visual mode. That way
you will not have to remember many commands nor to keep program state in your mind. To
enter visual mode use V :
[0xB7F0C8C0]> V
The initial view after entering visual mode is a hexdump view of current target program
counter (e.g., EIP for x86). Pressing p will allow you to cycle through the rest of visual
mode views. You can press p and P to rotate through the most commonly used print
33
Basic Debugger Session
modes. Use F7 or s to step into and F8 or S to step over current instruction. With the c
key you can toggle the cursor mode to mark a byte range selection (for example, to later
overwrite them with nop). You can set breakpoints with F2 key.
In visual mode you can enter regular radare commands by prepending them with : . For
example, to dump a one block of memory contents at ESI: x @ esi
To get help on visual mode, press ? . To scroll help screen, use arrows. To exit help view,
press q .
34
Contributing to radare2
Contributing
Radare2 Book
If you want to contribute to the Radare2 book, you can do it at the Github repository.
Suggested contributions include:
Crackme writeups
CTF writeups
Documentation on how to use Radare2
Documentation on developing for Radare2
Conference presentations/workshops using Radare2
Missing content from the Radare1 book updated to Radare2
Please get permission to port any content you do not own/did not create before you put it in
the Radare2 book.
35
Configuration
Configuration
The core reads ~/.radare2rc while starting. You can add e commands to this file to tune
radare configuration to your taste.
All the configuration of radare is done with the eval commands. A typical startup
configuration file looks like this:
$ cat ~/.radare2rc
e scr.color = true
e dbg.bep = loader
Configuration can also be changed with -e command-line option. This way you can adjust
configuration from the command line, keeping the .radare2rc file intact. For example, to start
with empty configuration and then adjust scr.color and asm.syntax the following line may
be used:
Internally, the configuration is stored in a hash table. The variables are grouped in
namespaces: cfg. , file. , dbg. , scr. and so on.
To get a list of all configuration variables just type e in the command line prompt. To limit
output to a selected namespace, pass it with an ending dot to e . For example, e file.
will display all variables defined inside "file" namespace.
36
Configuration
A simpler alternative to e command is accessible from the visual mode. Type Ve to enter
it, use arrows (up, down, left, right) to navigate the configuration, and q to exit it. The start
screen for the visual configuration edit looks like this:
Eval spaces:
> anal
asm
scr
asm
bin
cfg
diff
dir
dbg
cmd
fs
hex
http
graph
hud
scr
search
io
For configuration values that can take one of several values, you can use the =? operator
to get a list of valid values:
[0x00000000]> e scr.nkey =?
scr.nkey = fun, hit, flag
37
Colors
Colors
Console access is wrapped in API that permits to show output of any command as ANSI,
w32 console or HTML formats (more to come: ncurses, Pango etc.) This allows radare's
core to run inside environments with limited displaying capabilities, like kernels or embedded
devices. It is still possible to receive data from it in your favorite format. To enable colors
support by default, add a corresponding configuration option to the .radare2 configuration
file:
It is possible to configure color of almost any element of disassembly output. For *NIX
terminals, r2 accepts color specification in RGB format. To change the console color palette
use ec command. Type ec to get a list of all currently used colors. Type ecs to show a
color palette to pick colors from:
38
Colors
xvilka theme
39
Colors
ec fname rgb:0cf
ec label rgb:0f3
ec math rgb:660
ec bin rgb:f90
ec call rgb:f00
ec jmp rgb:03f
ec cjmp rgb:33c
ec offset rgb:366
ec comment rgb:0cf
ec push rgb:0c0
ec pop rgb:0c0
ec cmp rgb:060
ec nop rgb:000
ec b0x00 rgb:444
ec b0x7f rgb:555
ec b0xff rgb:666
ec btext rgb:777
ec other rgb:bbb
ec num rgb:f03
ec reg rgb:6f0
ec fline rgb:fc0
ec flow rgb:0f0
40
Common Configuration Variables
asm.arch
Defines target CPU architecture used for disassembling ( pd , pD commands) and code
analysis ( a command). You can find the list of possible value by looking at the result of e
asm.arch=? or rasm2 -L . It is quite simple to add new architectures for disassembling and
analyzing code. There is an interface for that. For x86, it is used to attach a number of third-
party disassembler engines, including GNU binutils, Udis86 and a few of handmade ones.
asm.bits
Determines width in bits of registers for current architecture. Supported values: 8, 16, 32, 64.
Note that not all target architectures support all combinations for asm.bits.
asm.syntax
Changes syntax flavor for disassembler between Intel and AT&T. At the moment, this setting
affects Udis86 disassembler for Intel 32/Intel 64 targets only. Supported values are intel
and att .
asm.pseudo
A boolean value to choose a string disassembly engine. "False" indicates a native one,
defined by current architecture, "true" activates a pseudocode strings format; for example, it
will show eax=ebx instead of a mov eax, ebx .
asm.os
41
Common Configuration Variables
asm.flags
asm.lines.call
asm.linesout
When defined as "true", the disassembly view will also draw control flow lines that go ouside
of the block.
asm.linestyle
A boolean value which changes the direction of control flow analysis. If set to "false", it is
done from top to bottom of a block; otherwise, it goes from bottom to top. The "false" setting
seems to be a better choice for improved readability, and is the default one.
asm.offset
Boolean value which controls visibility of offsets for individual disassembled instructions.
asm.trace
A boolean value that controls displaying of tracing information (sequence number and
counter) at the left of each opcode. It is used to assist programs trace analysis.
asm.bytes
cfg.bigendian
Change endianness. "true" means big-endian, "false" is for little-endian. "file.id" and
"file.flag" both to be true.
42
Common Configuration Variables
scr.color
scr.seek
This variable accepts an expression, a pointer (eg. eip), etc. If set, radare will set seek
position to its value on startup.
cfg.fortunes
43
Basic Commands
Basic Commands
Most command names in radare are derived from action names. They should be easy to
remember, as they are short. Actually, all commands are single letters. Subcommands or
related commands are specified using the second character of command name. For
example, / foo is a command to search plain string, while /x 90 90 is used to look for
hexadecimal pairs.
The general format for a valid command (as explained in the 'Command Format' chapter)
looks like this:
[[.][times][cmd][~grep][@[@iter]addr!size][|>pipe] ; ...
For example,
> 3s +1024 ; seeks three times 1024 from the current seek
If a command starts with =! , the rest of the string is passed to currently loaded IO plugin (a
debugger, for example). Most plugins provide help messages with =!? or =!help .
$ r2 -d /bin/ls
> =!help ; handled by the IO plugin
If a commands starts with ! , posix_system() is called to pass the command to your shell.
Check !? for more options and usage examples.
The meaning of arguments (iter, addr, size) depends on the specific command. As a rule of
thumb, most commands take a number as an argument to specify number of bytes to work
with, instead of currently defined block size. Some commands accept math expressions, or
strings.
44
Basic Commands
The @ sign is used to specify a temporary offset location or seek position at which the
command is executed, instead of current seek position. This is quite useful as you don't
have to seek around all the time.
Using @@ you can execute a single command on a list of flags matching the glob. You can
think of this as a foreach operation:
> s 0
> / lib ; search 'lib' string
> p8 20 @@ hit0_* ; show 20 hexpairs at each search hit
The > operation is used to redirect output of a command into a file (overwriting it if it
already exists).
> pr > dump.bin ; dump 'raw' bytes of current block to file named 'dump.bin'
> f > flags.txt ; dump flag list to 'flags.txt'
The | operation (pipe) is similar to what you are used to expect from it in a *NIX shell: us
output of one command as input to another.
You can pass several commands in a single line by separating them with semicolon ; :
> px ; dr
45
Seeking
Seeking
The current seek position is changed with s command. It accepts a math expression as
argument. The expression can be composed of shift operations, basic math operations, or
memory access operations.
[0x00000000]> s?
Usage: s[+-] [addr]
s print current address
s 0x320 seek to this address
s- undo seek
s+ redo seek
s* list undo seek history
s++ seek blocksize bytes forward
s-- seek blocksize bytes backward
s+ 512 seek 512 bytes forward
s- 512 seek 512 bytes backward
sg/sG seek begin (sg) or end (sG) of section or file
s.hexoff Seek honoring a base from core->offset
sa [[+-]a] [asz] seek asz (or bsize) aligned to addr
sn/sp seek next/prev scr.nkey
s/ DATA search for next occurrence of 'DATA'
s/x 9091 search for next occurrence of \x90\x91
sb seek aligned to bb start
so [num] seek to N next opcode(s)
sf seek to next function (f->addr+f->size)
sC str seek to comment matching given string
sr pc seek to register
If you want to inspect the result of a math expression, you can evaluate it using the ?
command. Simply pass the expression as an argument. The result can be displayed in
hexadecimal, decimal, octal or binary formats.
> ? 0x100+200
0x1C8 ; 456d ; 710o ; 1100 1000
In the visual mode you can press u (undo) or U (redo) inside the seek history to return
back to previous or forward to the next location.
Open file
46
Seeking
As test file lets use some simple hello_world.c compiled in Linux ELF format. After we
compiled it lets open it with radare2
r2 hello_world
[0x00400410]>
[0x00400410]> s 0x0
[0x00000000]>
[0x00000000]> s
0x0
[0x00000000]>
[0x00000000]> s+ 128
[0x00000080]>
[0x00000080]> s-
[0x00000000]> s-
[0x00400410]>
47
Seeking
[0x00400410]> s*
f undo_3 @ 0x400410
f undo_2 @ 0x40041a
f undo_1 @ 0x400410
f undo_0 @ 0x400411
# Current undo/redo position.
f redo_0 @ 0x4005b4
48
Block Size
Block Size
The block size determines how many bytes Radare commands will process when not given
an explicit size argument. You can temporally change the block size by specifying a numeric
argument to the print commands. For example px 20 .
[0xB7F9D810]> b?
|Usage: b[f] [arg]
Get/Set block size
| b display current block size
| b 33 set block size to 33
| b+3 increase blocksize by 3
| b-16 decrease blocksize by 16
| b eip+4 numeric argument can be an expression
| bf foo set block size to flag size
| bm 1M set max block size
The bf command is used to change the block size to value specified by a flag. For
example, in symbols, the block size of the flag represents the size of the function.
49
Sections
Sections
Firmware images, bootloaders and binary files usually place various sections of a binary at
different addresses in memory. To represent this behavior, radare offers the S command.
[0xB7EE8810]> S?
|Usage: S[?-.*=adlr] [...]
| S list sections
| S. show current section name
| S* list sections (in radare commands)
| S= list sections (ascii-art bars) (io.va to display paddr or
vaddr)
| Sa[-] [A] [B] [[off]] Specify arch and bits for given section
| Sd[a] [file] dump current (all) section to a file (see dmd)
| Sl [file] load contents of file into current section (see dml)
| Sf [baddr] Alias for S 0 0 $s $s foo mrwx
| Sj list sections in JSON (alias for iSj)
| Sr [name] rename section on current seek
| S off va sz vsz name mrwx add new section (if(!vsz)vsz=sz)
| S-[id] remove section identified by id
| S-. remove section at core->offset (can be changed with @)
| S.-* remove all sections in current offset
For example:
50
Sections
# List sections
[0x00404888]> S
...
25 0x0001a600 |-----------------------------#| 0x0001a608 ---- .gnu_debuglink
26 0x0001a608 |-----------------------------#| 0x0001a706 ---- .shstrtab
27* 0x00000000 |##############################| 0x0001ae08 -rwx ehdr
=> 0x00004888 |-----^------------------------| 0x00004988
The first three lines are sections and the last one (prefixed by => ) is the current seek
location.
[0xB7EE8810]> S-4
51
Mapping Files
Mapping Files
Radare IO system allows you to map contents of files into the same IO space used to
contain loaded binary. New contents can be placed at random offsets. This lets you create a
static environment which emulate the view you would have when using a debugger, where
the program and all its libraries are loaded in memory and can be accessed.
Using the S (sections) command you can define base address for each library to be
loaded.
Mapping files is done using the o (open) command. Let's read the help:
[0x00000000]> o?
|Usage: o [com- ] [file] ([offset])
| o list opened files
| oq list all open files
| o* list opened files in r2 commands
| o= list opened files (ascii-art bars)
| ob[?] [lbdos] [...] list opened binary files backed by fd
| oc [file] open core file, like relaunching r2
| oi[-|idx] alias for o, but using index instead of fd
| oj[?] list opened files in JSON format
| oL list all IO plugins registered
| om[?] create, list, remove IO maps
| on [file] 0x4000 map raw file at 0x4000 (no r_bin involved)
| oo[?] reopen current file (kill+fork in debugger)
| oo+ reopen current file in read-write
| ood [args] reopen in debugger mode (with args)
| oo[bnm] [...] see oo? for help
| op [fd] priorize given fd (see also ob)
| o 4 Switch to open file on fd 4
| o-1 close file descriptor 1
| o-* close all opened files
| o-- close all files, analysis, binfiles, flags, same as !r2 --
| o [file] open [file] file in read-only
| o+ [file] open file in read-write mode
| o [file] 0x4000 map file at 0x4000
| ox fd fdx exchange the descs of fd and fdx and keep the mapping
52
Mapping Files
$ rabin2 -l /bin/ls
[Linked libraries]
libselinux.so.1
librt.so.1
libacl.so.1
libc.so.6
4 libraries
Map a file:
[0x00000000]> o
- 6 /bin/ls @ 0x0 ; r
- 10 /lib/ld-linux.so.2 @ 0x100000000 ; r
- 14 /bin/zsh @ 0x499999 ; r
[0x00000000]> px @ 0x499999
Unmap files using the o- command. Pass required file descriptor to it as an argument:
[0x00000000]> o-14
53
Print Modes
Print Modes
One of the key features of radare is displaying information in many formats. The goal is to
offer a selection of displaying choices to best interpret binary data.
Binary data can be represented as integers, shorts, longs, floats, timestamps, hexpair
strings, or more complex formats like C structures, disassembly listings, decompilations, be
a result of an external processing...
[0x08049AD0]> p?
|Usage: p[=68abcdDfiImrstuxz] [arg|len] [@addr]
| p=[?][bep] [blks] [len] [blk] show entropy/printable chars/chars bars
| p2 [len] 8x8 2bpp-tiles
| p3 [file] print stereogram (3D)
| p6[de] [len] base64 decode/encode
| p8[?][j] [len] 8bit hexpair list of bytes
| pa[edD] [arg] pa:assemble pa[dD]:disasm or pae: esil from hexpairs
| pA[n_ops] show n_ops address and type
| p[b|B|xb] [len] ([skip]) bindump N bits skipping M
| pb[?] [n] bitstream of N bits
| pB[?] [n] bitstream of N bytes
| pc[?][p] [len] output C (or python) format
| pC[d] [rows] print disassembly in columns (see hex.cols and pdi)
| pd[?] [sz] [a] [b] disassemble N opcodes (pd) or N bytes (pD)
| pf[?][.nam] [fmt] print formatted data (pf.name, pf.name $<expr>)
| ph[?][=|hash] ([len]) calculate hash for a block
| p[iI][df] [len] print N ops/bytes (f=func) (see pi? and pdi)
| pm[?] [magic] print libmagic data (see pm? and /m?)
| pr[?][glx] [len] print N raw bytes (in lines or hexblocks, 'g'unzip)
| p[kK] [len] print key in randomart (K is for mosaic)
| ps[?][pwz] [len] print pascal/wide/zero-terminated strings
| pt[?][dn] [len] print different timestamps
| pu[?][w] [len] print N url encoded bytes (w=wide)
| pv[?][jh] [mode] show variable/pointer/value in memory
| p-[?][jh] [mode] bar|json|histogram blocks (mode: e?search.in)
| px[?][owq] [len] hexdump of N bytes (o=octal, w=32bit, q=64bit)
| pz[?] [len] print zoom view (see pz? for help)
| pwd display current working directory
Tip: when using json output, you can append the ~{} to the command to get a pretty-
printed version of the output:
54
Print Modes
[0x00000000]> oj
[{"raised":false,"fd":563280,"uri":"malloc://512","from":0,"writable":true,"size":512,
"overlaps":false}]
[0x00000000]> oj~{}
[
{
"raised": false,
"fd": 563280,
"uri": "malloc://512",
"from": 0,
"writable": true,
"size": 512,
"overlaps": false
}
]
For more on the magical powers of ~ see the help in ?@? , and the "Command Format"
chapter earlier in the book.
Hexadecimal View
px gives a user-friendly output showing 16 pairs of numbers per row with offsets and raw
representations:
[0x00404888]> px
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x00404888 31ed 4989 d15e 4889 e248 83e4 f050 5449 1.I..^H..H...PTI
0x00404898 c7c0 4024 4100 48c7 c1b0 2341 0048 c7c7 ..@$A.H...#A.H..
0x004048a8 d028 4000 e83f dcff fff4 6690 662e 0f1f .(@..?....f.f...
[0x00404888]> pxw
0x00404888 0x8949ed31 0x89485ed1 0xe48348e2 0x495450f0 1.I..^H..H...PTI
0x00404898 0x2440c0c7 0xc7480041 0x4123b0c1 0xc7c74800 ..@$A.H...#A.H..
0x004048a8 0x004028d0 0xffdc3fe8 0x9066f4ff 0x1f0f2e66 .(@..?....f.f...
[0x00404888]> e cfg.bigendian
false
[0x00404888]> pxw
0x00404888 0x31ed4989 0xd15e4889 0xe24883e4 0xf0505449 1.I..^H..H...PTI
0x00404898 0xc7c04024 0x410048c7 0xc1b02341 0x0048c7c7 ..@$A.H...#A.H..
0x004048a8 0xd0284000 0xe83fdcff 0xfff46690 0x662e0f1f .(@..?....f.f...
55
Print Modes
[0x00404888]> p8 16
31ed4989d15e4889e24883e4f0505449
[0x08049A80]> pxq
0x00001390 0x65625f6b63617473 0x646e6962006e6967 stack_begin.bind
0x000013a0 0x616d6f6474786574 0x7469727766006e69 textdomain.fwrit
0x000013b0 0x6b636f6c6e755f65 0x6d63727473006465 e_unlocked.strcm
...
Date/Time Formats
Currently supported timestamp output modes are:
[0x00404888]> pt?
|Usage: pt[dn?]
| pt print unix time (32 bit cfg.big_endian)
| ptd print dos time (32 bit cfg.big_endian)
| ptn print ntfs time (64 bit !cfg.big_endian)
| pt? show help message
For example, you can 'view' the current buffer as timestamps in the ntfs time:
As you can see, the endianness affects the result. Once you have printed a timestamp, you
can grep output, for example, by year value:
The default date format can be configured using the cfg.datefmt variable. Formatting rules
for it follow the well known strftime(3) format. An excerpt from the strftime(3) manpage:
56
Print Modes
%a The abbreviated name of the day of the week according to the current locale.
%A The full name of the day of the week according to the current locale.
%b The abbreviated month name according to the current locale.
%B The full month name according to the current locale.
%c The preferred date and time representation for the current locale.
%C The century number (year/100) as a 2-digit integer. (SU)
%d The day of the month as a decimal number (range 01 to 31).
%D Equivalent to %m/%d/%y. (Yecch—for Americans only. Americans should note that in
other countries %d/%m/%y is rather common. This means that in international context t
his format is ambiguous and should not be used.) (SU)
%e Like %d, the day of the month as a decimal number, but a leading zero is replaced
by a space. (SU)
%E Modifier: use alternative format, see below. (SU)
%F Equivalent to %Y-%m-%d (the ISO 8601 date format). (C99)
%G The ISO 8601 week-based year (see NOTES) with century as a decimal number. The 4-
digit year corresponding to the ISO week number (see %V). This has the same format an
d value as %Y, except that if the ISO week number belongs to the previous or next year
, that year is used instead. (TZ)
%g Like %G, but without century, that is, with a 2-digit year (00-99). (TZ)
%h Equivalent to %b. (SU)
%H The hour as a decimal number using a 24-hour clock (range 00 to 23).
%I The hour as a decimal number using a 12-hour clock (range 01 to 12).
%j The day of the year as a decimal number (range 001 to 366).
%k The hour (24-hour clock) as a decimal number (range 0 to 23); single digits are pr
eceded by a blank. (See also %H.) (TZ)
%l The hour (12-hour clock) as a decimal number (range 1 to 12); single digits are pr
eceded by a blank. (See also %I.) (TZ)
%m The month as a decimal number (range 01 to 12).
%M The minute as a decimal number (range 00 to 59).
%n A newline character. (SU)
%O Modifier: use alternative format, see below. (SU)
%p Either "AM" or "PM" according to the given time value, or the corresponding string
s for the current locale. Noon is treated as "PM" and midnight as "AM".
%P Like %p but in lowercase: "am" or "pm" or a corresponding string for the current l
ocale. (GNU)
%r The time in a.m. or p.m. notation. In the POSIX locale this is equivalent to %I:%
M:%S %p. (SU)
%R The time in 24-hour notation (%H:%M). (SU) For a version including the seconds, s
ee %T below.
%s The number of seconds since the Epoch, 1970-01-01 00:00:00 +0000 (UTC). (TZ)
%S The second as a decimal number (range 00 to 60). (The range is up to 60 to allow
for occasional leap seconds.)
%t A tab character. (SU)
%T The time in 24-hour notation (%H:%M:%S). (SU)
%u The day of the week as a decimal, range 1 to 7, Monday being 1. See also %w. (SU
)
%U The week number of the current year as a decimal number, range 00 to 53, starting
with the first Sunday as the first day of week 01. See also %V and %W.
%V The ISO 8601 week number (see NOTES) of the current year as a decimal number, rang
e 01 to 53, where week 1 is the first week that has at least 4 days in the new year.
See also %U and %W.(U)
%w The day of the week as a decimal, range 0 to 6, Sunday being 0. See also %u.
57
Print Modes
%W The week number of the current year as a decimal number, range 00 to 53, starting
with the first Monday as the first day of week 01.
%x The preferred date representation for the current locale without the time.
%X The preferred time representation for the current locale without the date.
%y The year as a decimal number without a century (range 00 to 99).
%Y The year as a decimal number including the century.
%z The +hhmm or -hhmm numeric timezone (that is, the hour and minute offset from UTC)
. (SU)
%Z The timezone name or abbreviation.
%+ The date and time in date(1) format. (TZ) (Not supported in glibc2.)
%% A literal '%' character.
Basic Types
There are print modes available for all basic types. If you are interested in a more complex
structure, type pf?? for format characters and pf??? for examples:
[0x00499999]> pf??
|pf: pf[.k[.f[=v]]|[v]]|[n]|[0|cnt][fmt] [a0 a1 ...]
| Format:
| b byte (unsigned)
| B resolve enum bitfield (see t?)
| c char (signed byte)
| d 0x%%08x hexadecimal value (4 bytes) (see %%i and %%x)
| D disassemble one opcode
| e temporally swap endian
| E resolve enum name (see t?)
| f float value (4 bytes)
| i %%i signed integer value (4 bytes) (see %%d and %%x)
| n next char specifies size of signed value (1, 2, 4 or 8 byte(s))
| N next char specifies size of unsigned value (1, 2, 4 or 8 byte(s))
| o 0x%%08o octal value (4 byte)
| p pointer reference (2, 4 or 8 bytes)
| q quadword (8 bytes)
| r CPU register `pf r (eax)plop`
| s 32bit pointer to string (4 bytes)
| S 64bit pointer to string (8 bytes)
| t UNIX timestamp (4 bytes)
| T show Ten first bytes of buffer
| u uleb128 (variable length)
| w word (2 bytes unsigned short in hex)
| x 0x%%08x hex value and flag (fd @ addr) (see %%d and %%i)
| X show formatted hexpairs
| z \0 terminated string
| Z \0 terminated wide string
| ? data structure `pf ? (struct_name)example_name`
| * next char is pointer (honors asm.bits)
| + toggle show flags for each offset
| : skip 4 bytes
| . skip 1 byte
58
Print Modes
[0x00499999]> pf???
|pf: pf[.k[.f[=v]]|[v]]|[n]|[0|cnt][fmt] [a0 a1 ...]
| Examples:
| pf 3xi foo bar 3-array of struct, each with named fiel
ds: 'foo' as hex, and 'bar' as int
| pf B (BitFldType)arg_name` bitfield type
| pf E (EnumType)arg_name` enum type
| pf.obj xxdz prev next size name Define the obj format as xxdz
| pf obj=xxdz prev next size name Same as above
| pf iwq foo bar troll Print the iwq format with foo, bar, tro
ll as the respective names for the fields
| pf 0iwq foo bar troll Same as above, but considered as a unio
n (all fields at offset 0)
| pf.plop ? (troll)mystruct Use structure troll previously defined
| pf 10xiz pointer length string Print a size 10 array of the xiz struct
with its field names
| pf {integer}? (bifc) Print integer times the following forma
t (bifc)
| pf [4]w[7]i Print an array of 4 words and then an a
rray of 7 integers
| pf ic...?i foo bar "(pf xw yo foo)troll" yo Print nested anonymous structres
| pfn2 print signed short (2 bytes) value. Use
N insted of n for printing unsigned values
[0x4A13B8C0]> pf i
0x00404888 = 837634441
[0x4A13B8C0]> pf
0x00404888 = 837634432.000000
59
Print Modes
pc C
pc* print 'wx' r2 commands
pch C half-words (2 byte)
pcw C words (4 byte)
pcd C dwords (8 byte)
pca GAS .byte blob
pcA .bytes with instructions in comments
pcs string
pcS shellscript that reconstructs the bin
pcj json
pcJ javascript
pcp python
[0xB7F8E810]> pc 32
#define _BUFFER_SIZE 32
unsigned char buffer[_BUFFER_SIZE] = {
0x89, 0xe0, 0xe8, 0x49, 0x02, 0x00, 0x00, 0x89, 0xc7, 0xe8, 0xe2, 0xff, 0xff, 0xff, 0x
81, 0xc3, 0xd6, 0xa7, 0x01, 0x00, 0x8b, 0x83, 0x00, 0xff, 0xff, 0xff, 0x5a, 0x8d, 0x24
, 0x84, 0x29, 0xc2 };
[0x7fcd6a891630]> pcs
"\x48\x89\xe7\xe8\x68\x39\x00\x00\x49\x89\xc4\x8b\x05\xef\x16\x22\x00\x5a\x48\x8d\x24\
xc4\x29\xc2\x52\x48\x89\xd6\x49\x89\xe5\x48\x83\xe4\xf0\x48\x8b\x3d\x06\x1a
Strings
Strings are probably one of the most important entrypoints when starting to reverse engineer
a program, because they usually reference information about functions' actions (asserts,
debug or info messages...) Therefore radare supports various string formats:
[0x00000000]> ps?
|Usage: ps[zpw] [N]Print String
| ps print string
| pss print string in screen (wrap width)
| psi print string inside curseek
| psb print strings in current block
| psx show string with escaped chars
| psz print zero terminated string
| psp print pascal string
| psu print utf16 unicode (json)
| psw print 16bit wide string
| psW print 32bit wide string
| psj print string in JSON format
Most strings are zero-terminated. Here is an example by using the debugger to continue the
execution of a program until it executes the 'open' syscall. When we recover the control over
the process, we get the arguments passed to the syscall, pointed by %ebx. In the case of
60
Print Modes
the 'open' call, it is a zero terminated string which we can inspect using psz .
This can be used to look at the arguments passed to a function. To achieve this, simply pass
a 'format memory string' as an argument to pf , and temporally change current seek
position / offset using @ . It is also possible to define arrays of structures with pf . To do
this, prefix the format string with a numeric value. You can also define a name for each field
of the structure by appending them as a space-separated arguments list.
61
Print Modes
$ radare ~/.gstreamer-0.10/plugins/libgstflumms.so
[0x000028A0]> seek sym.gst_plugin_desc
[0x000185E0]> pf iissxsssss major minor name desc _init version \
license source package origin
major : 0x000185e0 = 0
minor : 0x000185e4 = 10
name : 0x000185e8 = 0x000185e8 flumms
desc : 0x000185ec = 0x000185ec Fluendo MMS source
_init : 0x000185f0 = 0x00002940
version : 0x000185f4 = 0x000185f4 0.10.15.1
license : 0x000185f8 = 0x000185f8 unknown
source : 0x000185fc = 0x000185fc gst-fluendo-mms
package : 0x00018600 = 0x00018600 Fluendo MMS source
origin : 0x00018604 = 0x00018604 http://www.fluendo.com
Disassembly
The pd command is used to disassemble code. It accepts a numeric value to specify how
many instructions should be disassembled. The pD command is similar but instead of a
number of instructions, it decompiles a given number of bytes.
[0x00404888]> pd 1
;-- entry0:
0x00404888 31ed xor ebp, ebp
62
Print Modes
[0xB7F08810]> e asm.arch = ?
63
Print Modes
Disassembly Syntax
The asm.syntax variable is used to change flavor of assembly syntax used by a
disassembler engine. To switch between Intel and AT&T representations:
e asm.syntax = intel
e asm.syntax = att
64
Print Modes
You can also check asm.pseudo , which is an experimental pseudocode view, and asm.esil
which outputs ESIL ('Evaluable Strings Intermedate Language'). ESIL's goal is to have a
human-readable representation of every opcode semantics. Such representations can be
evaluated (interpreted) to emulate effects of individual instructions.
65
Flags
Flags
Flags are conceptually similar to bookmarks. They associate a name with a given offset in a
file. Flags can be grouped into 'flag spaces'. A flag space is a namespace for flags, grouping
together flags of similar characteristics or type. Examples for flag spaces: sections, registers,
symbols.
You can remove a flag by appending the - character to command. Most commands accept
- as argument-prefix as an indication to delete something.
[0x4A13B8C0]> f-flag_name
00 symbols
01 imports
02 sections
03 strings
04 regs
05 maps
66
Write
Writing Data
Radare can manipulate a loaded binary file in many ways. You can resize the file, move and
copy/paste bytes, insert new bytes (shifting data to the end of the block or file), or simply
overwrite bytes. New data may be give as a widestring, as assembler instructions, or the
data may be read in from another file.
Resize the file using the r command. It accepts a numeric argument. A positive value sets
a new size for the file. A negative one will truncate the file to the current seek position minus
N bytes.
Write bytes using the w command. It accepts multiple input formats like inline assembly,
endian-friendly dwords, files, hexpair files, wide strings:
67
Write
[0x00404888]> w?
|Usage: w[x] [str] [<file] [<<EOF] [@addr]
| w[1248][+-][n] increment/decrement byte,word..
| w foobar write string 'foobar'
| w0 [len] write 'len' bytes with value 0x00
| w6[de] base64/hex write base64 [d]ecoded or [e]ncoded string
| wa[?] push ebp write opcode, separated by ';' (use '"' around the command)
| waf file assemble file and write bytes
| wao[?] op modify opcode (change conditional of jump. nop, etc)
| wA[?] r 0 alter/modify opcode at current seek (see wA?)
| wb 010203 fill current block with cyclic hexpairs
| wB[-]0xVALUE set or unset bits with given value
| wc list all write changes
| wc[?][ir*?] write cache undo/commit/reset/list (io.cache)
| wd [off] [n] duplicate N bytes from offset at current seek (memcpy) (see y?)
| we[?] [nNsxX] [arg] extend write operations (insert instead of replace)
| wf -|file write contents of file at current offset
| wh r2 whereis/which shell command
| wm f0ff set binary mask hexpair to be used as cyclic write mask
| wo[?] hex write in block with operation. 'wo?' fmi
| wp[?] -|file apply radare patch file. See wp? fmi
| wr 10 write 10 random bytes
| ws pstring write 1 byte for length and then the string
| wt[f][?] file [sz] write to file (from current seek, blocksize or sz bytes)
| wts host:port [sz] send data to remote host:port via tcp://
| ww foobar write wide string 'f\x00o\x00o\x00b\x00a\x00r\x00'
| wx[?][fs] 9090 write two intel nops (from wxfile or wxseek)
| wv[?] eip+34 write 32-64 bit value
| wz string write zero terminated string (like w + \x00)
Some examples:
Write Over
The wo command (write over) has many subcommands, each combines the existing data
with the new data using an operator. The command is applied to the current block.
Supported operators include: XOR, ADD, SUB...
68
Write
[0x4A13B8C0]> wo?
|Usage: wo[asmdxoArl24] [hexpairs] @ addr[:bsize]
|Example:
| wox 0x90 ; xor cur block with 0x90
| wox 90 ; xor cur block with 0x90
| wox 0x0203 ; xor cur block with 0203
| woa 02 03 ; add [0203][0203][...] to curblk
| woe 02 03
|Supported operations:
| wow == write looped value (alias for 'wb')
| woa += addition
| wos -= substraction
| wom *= multiply
| wod /= divide
| wox ^= xor
| woo |= or
| woA &= and
| woR random bytes (alias for 'wr $b'
| wor >>= shift right
| wol <<= shift left
| wo2 2= 2 byte endian swap
| wo4 4= 4 byte endian swap
[0x7fcd6a891630]> px
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x7fcd6a891630 4889 e7e8 6839 0000 4989 c48b 05ef 1622 H...h9..I......"
0x7fcd6a891640 005a 488d 24c4 29c2 5248 89d6 4989 e548 .ZH.$.).RH..I..H
0x7fcd6a891650 83e4 f048 8b3d 061a 2200 498d 4cd5 1049 ...H.=..".I.L..I
0x7fcd6a891660 8d55 0831 ede8 06e2 0000 488d 15cf e600 .U.1......H.....
[0x7fcd6a891630]> wox 90
[0x7fcd6a891630]> px
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x7fcd6a891630 d819 7778 d919 541b 90ca d81d c2d8 1946 ..wx..T........F
0x7fcd6a891640 1374 60d8 b290 d91d 1dc5 98a1 9090 d81d .t`.............
0x7fcd6a891650 90dc 197c 9f8f 1490 d81d 95d9 9f8f 1490 ...|............
0x7fcd6a891660 13d7 9491 9f8f 1490 13ff 9491 9f8f 1490 ................
[0x7fcd6a891630]> woa 01 02
[0x7fcd6a891630]> px
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x7fcd6a891630 d91b 787a 91cc d91f 1476 61da 1ec7 99a3 ..xz.....va.....
0x7fcd6a891640 91de 1a7e d91f 96db 14d9 9593 1401 9593 ...~............
0x7fcd6a891650 c4da 1a6d e89a d959 9192 9159 1cb1 d959 ...m...Y...Y...Y
0x7fcd6a891660 9192 79cb 81da 1652 81da 1456 a252 7c77 ..y....R...V.R|w
69
Write
70
Zoom
Zoom
The zoom is a print mode that allows you to get a global view of the whole file or a memory
map on a single screen. In this mode, each byte represents file_size/block_size bytes of
the file. Use the pz command, or just use Z in the visual mode to toggle the zoom mode.
The cursor can be used to scroll faster through the zoom out view. Pressing z again will
zoom-in where at new cursor position.
[0x004048c5]> pz?
|Usage: pz [len] print zoomed blocks (filesize/N)
| e zoom.maxsz max size of block
| e zoom.from start address
| e zoom.to end address
| e zoom.byte specify how to calculate each byte
| pzp number of printable chars
| pzf count of flags in block
| pzs strings in range
| pz0 number of bytes with value '0'
| pzF number of bytes with value 0xFF
| pze calculate entropy and expand to 0-255 range
| pzh head (first byte value); This is the default mode
71
Zoom
0x00000000 7f00 0000 e200 0000 146e 6f74 0300 0000 .........not....
0x00000010 0000 0000 0068 2102 00ff 2024 e8f0 007a .....h!... $...z
0x00000020 8c00 18c2 ffff 0080 4421 41c4 1500 5dff ........D!A...].
0x00000030 ff10 0018 0fc8 031a 000c 8484 e970 8648 .............p.H
0x00000040 d68b 3148 348b 03a0 8b0f c200 5d25 7074 ..1H4.......]%pt
0x00000050 7500 00e1 ffe8 58fe 4dc4 00e0 dbc8 b885 u.....X.M.......
[0x08049790]> e zoom.byte=p
[0x08049790]> pz // or pzp
0x00000000 2f47 0609 070a 0917 1e9e a4bd 2a1b 2c27 /G..........*.,'
0x00000010 322d 5671 8788 8182 5679 7568 82a2 7d89 2-Vq....Vyuh..}.
0x00000020 8173 7f7b 727a 9588 a07b 5c7d 8daf 836d .s.{rz...{\}...m
0x00000030 b167 6192 a67d 8aa2 6246 856e 8c9b 999f .ga..}..bF.n....
0x00000040 a774 96c3 b1a4 6c8e a07c 6a8f 8983 6a62 .t....l..|j...jb
0x00000050 7d66 625f 7ea4 7ea6 b4b6 8b57 a19f 71a2 }fb_~.~....W..q.
0x00406e65 48d0 80f9 360f 8745 ffff ffeb ae66 0f1f H...6..E.....f..
0x00406e75 4400 0083 f801 0f85 3fff ffff 410f b600 D.......?...A...
0x00406e85 3c78 0f87 6301 0000 0fb6 c8ff 24cd 0026 <x..c.......$..&
0x00406e95 4100 660f 1f84 0000 0000 0084 c074 043c A.f..........t.<
0x00406ea5 3a75 18b8 0500 0000 83f8 060f 95c0 e9cd :u..............
0x00406eb5 feff ff0f 1f84 0000 0000 0041 8801 4983 ...........A..I.
0x00406ec5 c001 4983 c201 4983 c101 e9ec feff ff0f ..I...I.........
[0x08049790]> e zoom.byte=F
[0x08049790]> pO // or pzF
0x00000000 0000 0000 0000 0000 0000 0000 0000 0000 ................
0x00000010 0000 2b5c 5757 3a14 331f 1b23 0315 1d18 ..+\WW:.3..#....
0x00000020 222a 2330 2b31 2e2a 1714 200d 1512 383d "*#0+1.*.. ...8=
0x00000030 1e1a 181b 0a10 1a21 2a36 281e 1d1c 0e11 .......!*6(.....
0x00000040 1b2a 2f22 2229 181e 231e 181c 1913 262b .*/"")..#.....&+
0x00000050 2b30 4741 422f 382a 1e22 0f17 0f10 3913 +0GAB/8*."....9.
You can limit zooming to a range of bytes instead of the whole bytespace. Change
zoom.from and zoom.to eval variables:
[0x465D8810]> e zoom.
zoom.byte = f
zoom.from = 0
zoom.maxsz = 512
zoom.to = 118368???
72
Zoom
73
Yank/Paste
Yank/Paste
You can yank/paste bytes in visual mode using the y and Y key bindings which are
aliases for y and yy commands of command-line interface. These commands operate on
an internal buffer which stores copies of bytes taken starting from the current seek position.
You can write this buffer back to different seek position using yy command:
[0x00000000]> y?
|Usage: y[ptxy] [len] [[@]addr]
| y show yank buffer information (srcoff len bytes)
| y 16 copy 16 bytes into clipboard
| y 16 0x200 copy 16 bytes into clipboard from 0x200
| y 16 @ 0x200 copy 16 bytes into clipboard from 0x200
| yp print contents of clipboard
| yx print contents of clipboard in hexadecimal
| yt 64 0x200 copy 64 bytes from current seek to 0x200
| yf 64 0x200 file copy 64 bytes from 0x200 from file (opens w/ io), use -1 for all by
tes
| yfa file copy copy all bytes from from file (opens w/ io)
| yy 0x3344 paste clipboard
Sample session:
You can perform a yank and paste in a single line by just using the yt command (yank-to).
The syntax is as follows:
[0x4A13B8C0]> x
offset 0 1 2 3 4 5 6 7 8 9 A B 0123456789AB
0x4A13B8C0, 89e0 e839 0700 0089 c7e8 e2ff ...9........
0x4A13B8CC, ffff 81c3 eea6 0100 8b83 08ff ............
0x4A13B8D8, ffff 5a8d 2484 29c2 ..Z.$.).
[0x4A13B8C0]> x
offset 0 1 2 3 4 5 6 7 8 9 A B 0123456789AB
0x4A13B8C0, 89e0 e839 0700 0089 c7e8 e2ff ...9........
0x4A13B8CC, 89e0 e839 0700 0089 8b83 08ff ...9........
0x4A13B8D8, ffff 5a8d 2484 29c2 ..Z.$.).
74
Yank/Paste
75
Comparing Bytes
Comparing Bytes
c (short for "compare") allows you to compare arrays of bytes from different sources. The
command accepts input in a number of formats, and then compares it against values found
at current seek position.
[0x00404888]> c?
|Usage: c[?dfx] [argument] # Compare
| c [string] Compare a plain with escaped chars string
| c* [string] Compare a plain with escaped chars string (output r2 commands)
| c4 [value] Compare a doubleword from a math expression
| c8 [value] Compare a quadword from a math expression
| cat [file] Show contents of file (see pwd, ls)
| cc [at] [(at)] Compares in two hexdump columns of block size
| ccc [at] [(at)] Same as above, but only showing different lines
| ccd [at] [(at)] Compares in two disasm columns of block size
| cf [file] Compare contents of file at current seek
| cg[?] [o] [file] Graphdiff current file and [file]
| cl|cls|clear Clear screen, (clear0 to goto 0, 0 only)
| cu[?] [addr] @at Compare memory hexdumps of $$ and dst in unified diff
| cud [addr] @at Unified diff disasm from $$ and given address
| cv[1248] [addr] @at Compare 1,2,4,8-byte value
| cw[?] [us?] [...] Compare memory watchers
| cx [hexpair] Compare hexpair string (use '.' as nibble wildcard)
| cx* [hexpair] Compare hexpair string (output r2 commands)
| cX [addr] Like 'cc' but using hexdiff output
To compare memory contents at current seek position against given string of values, use
cx :
[0x08048000]> p8 4
7f 45 4c 46
[0x08048000]> cx 7f 45 90 46
Compare 3/4 equal bytes
0x00000002 (byte=03) 90 ' ' -> 4c 'L'
[0x08048000]>
76
Comparing Bytes
c8 compares a quadword from the current seek (in the example below, 0x00000000)
[0x00000000]> c8 4
The number parameter can of course also be a math expressions which uses flag names
etc:
[0x00000000]> cx 7f469046
You can use the compare command to find differences between a current block and a file
previously dumped to a disk:
r2 /bin/true
[0x08049A80]> s 0
[0x08048000]> cf /bin/true
Compare 512/512 equal bytes
77
SDB
SDB
List namespaces
k **
List sub-namespaces
k anal/**
List keys
k *
k anal/*
Set a key
k foo=bar
k foo
Useful queries
List all syscalls
k syscall/*~^0x
k anal/meta/*~.C.
k %anal/meta/[1]meta.C.0x100005000
78
SDB
79
Visual mode
Visual Mode
The visual mode is a more user-friendly interface alternative to radare2's command-line
prompt. It uses HJKL or arrow keys to move around data and code, has a cursor mode for
selecting bytes, and offers numerous key bindings to simplify debugger use. To enter visual
mode, use V command. To exit from it back to command line, press q .
Notice that the top of the panel contains the command which is used, for example for the
disassembly panel:
80
Visual mode
Getting Help
To see help on all key bindings defined for visual mode, press ? :
81
Visual mode
82
Visual Disassembly
Visual Disassembly
Navigation
Move within the Disassembly using arrows or hjkl . Use o to seek directly to a flag or an
offset, type it when requested by the prompt: [offset]> . Follow a jump or a call using the
number of your keyboard [0-9] and the number on the right in disassembly to follow a call
or a jump. In this example typing 1 on the keyboard would follow the call to
sym.imp.__libc_start_main and therefore, seek at the offset of this symbol.
Seek back to the previous location using u , U will allow you to redo the seek.
d as define
d can be used to change the type of data of the current block, several basic
d → ...
0x004048f7 48c1e83f shr rax, 0x3f
d → b
0x004048f7 .byte 0x48
d → B
0x004048f7 .word 0xc148
d → d
0x004048f7 hex length=165 delta=0
0x004048f7 48c1 e83f 4801 c648 d1fe 7415 b800 0000 H..?H..H..t.....
...
To improve code readability you can change how radare2 presents numerical values in
disassembly, by default most of disassembly display numerical value as hexadecimal.
Sometimes you would like to view it as a decimal, binary or even custom defined constant.
To change value format you can use d following by i then choose what base to work in,
this is the equivalent to ahi :
83
Visual Disassembly
d → i → ...
0x004048f7 48c1e83f shr rax, 0x3f
d → i → 10
0x004048f7 48c1e83f shr rax, 63
d → i → 2
0x004048f7 48c1e83f shr rax, '?'
Pressing lowercase c toggles the cursor mode. When this mode is active, the currently
selected byte (or byte range) is highlighted.
The cursor is used to select a range of bytes or simply to point to a byte. You can use the
cursor to create a named flag at specifc location. To do so, seek to the required position,
then press f and enter a name for a flag. If the file was opened in write mode using the -
w flag or the o+ command, you can also use the cursor to overwrite a selected range with
new values. To do so, select a range of bytes (with HJKL and SHIFT key pressed), then
press i and enter the hexpair values for the new data. The data will be repeated as
needed to fill the range selected. For example:
The Visual Assembler is a feature that provides a live-preview while you type in new
instructions to patch into the disassembly. To use it, seek or place the cursor at the wanted
location and hit the 'A' key. To provide multiple instructions, seperate them with semicolons,
; .
84
Visual Disassembly
XREF
When radare2 has discovered a XREF during the analysis, it will show you the information in
the Visual Disassembly using XREF tag:
To see where this string is called, press x , if you want to jump to the location where the
data is used then press the corresponding number [0-9] on your keyboard. (This functionality
is similar to axt )
Add a comment
To add a comment press ; .
Search
/ : allows highlighting of strings in the current display. :cmd allows you to use one of the
The HUDS
The "UserFriendly HUD"
The "UserFriendly HUD" can be accessed using the ?? key-combination. This HUD acts as
an interactive Cheat Sheet that one can use to more easily find and execute commands.
This HUD is particularly useful for new-comers. For experienced users, the other HUDS
which are more activity-specific may be more useful.
85
Visual Disassembly
This HUD can be displayed using the _ key, it shows a list of all the flags defined and lets
you jump to them. Using the keyboard you can quickly filter the list down to a flag that
contains a specific pattern.
86
Visual Disassembly
87
Visual Disassembly
88
Visual Disassembly
Examples
asm.arch: Change Architecture && asm.bits: Word size in
bits at assembler
You can view the list of all arch using e asm.arch=?
89
Visual Disassembly
e asm.arch = dalvik
0x00404870 31ed4989 cmp-long v237, v73, v137
0x00404874 d15e4889 rsub-int v14, v5, 0x8948
0x00404878 e24883e4 ushr-int/lit8 v72, v131, 0xe4
0x0040487c f0505449c7c0 +invoke-object-init-range {}, method+18772
;[0]
0x00404882 90244100 add-int v36, v65, v0
e asm.bits = 16
0000:4870 31ed xor bp, bp
0000:4872 49 dec cx
0000:4873 89d1 mov cx, dx
0000:4875 5e pop si
0000:4876 48 dec ax
0000:4877 89e2 mov dx, sp
This latest operation can also be done using & in Visual mode.
e asm.pseudo = true
0x00404870 31ed ebp = 0
0x00404872 4989d1 r9 = rdx
0x00404875 5e pop rsi
0x00404876 4889e2 rdx = rsp
0x00404879 4883e4f0 rsp &= 0xfffffffffffffff0
e asm.syntax = att
0x00404870 31ed xor %ebp, %ebp
0x00404872 4989d1 mov %rdx, %r9
0x00404875 5e pop %rsi
0x00404876 4889e2 mov %rsp, %rdx
0x00404879 4883e4f0 and $0xfffffffffffffff0, %rsp
90
Visual Disassembly
e asm.describe = true
0x00404870 31ed xor ebp, ebp ; logical exclu
sive or
0x00404872 4989d1 mov r9, rdx ; moves data fr
om src to dst
0x00404875 5e pop rsi ; pops last ele
ment of stack and stores the result in argument
0x00404876 4889e2 mov rdx, rsp ; moves data fr
om src to dst
0x00404879 4883e4f0 and rsp, 0xfffffffffffffff0 ; binary and op
eration between src and dst, stores result on dst
91
Visual Assembler
Visual Assembler
You can use Visual Mode to assemble code using A . For example let's replace the push
by a jmp :
92
Visual Assembler
You need to open the file in writing mode ( r2 -w or oo+ ) in order to patch the file. You can
also use the cache mode: e io.cache = true and wc? .
Remember that patching files in debug mode only patch the memory not the file.
93
Visual Configuration Editor
you want to change the assembly display just select asm in the list and choose your
assembly display flavor.
94
Visual Configuration Editor
95
Visual Configuration Editor
96
Searching bytes
[0x00000000]> /?
|Usage: /[amx/] [arg]Search stuff (see 'e??search' for options)
| / foo\x00 search for string 'foo\0'
| /j foo\x00 search for string 'foo\0' (json output)
| /! ff search for first occurrence not matching
| /+ /bin/sh construct the string with chunks
| /!x 00 inverse hexa search (find first byte != 0x00)
| // repeat last search
| /h[t] [hash] [len] find block matching this hash. See /#?
| /a jmp eax assemble opcode and search its bytes
| /A jmp find analyzed instructions of this type (/A? for help)
| /b search backwards
| /B search recognized RBin headers
| /c jmp [esp] search for asm code
| /C[ar] search for crypto materials
| /d 101112 search for a deltified sequence of bytes
| /e /E.F/i match regular expression
| /E esil-expr offset matching given esil expressions %%= here
| /f file [off] [sz] search contents of file with offset and size
| /i foo search for string 'foo' ignoring case
| /m magicfile search for matching magic file (use blocksize)
| /o show offset of previous instruction
| /p patternsize search for pattern of given size
| /P patternsize search similar blocks
| /r[e] sym.printf analyze opcode reference an offset (/re for esil)
| /R [grepopcode] search for matching ROP gadgets, semicolon-separated
| /v[1248] value look for an `cfg.bigendian` 32bit value
| /V[1248] min max look for an `cfg.bigendian` 32bit value in range
| /w foo search for wide string 'f\0o\0o\0'
| /wi foo search for wide string ignoring case 'f\0o\0o\0'
| /x ff..33 search for hex string ignoring some nibbles
| /x ff0033 search for hex string
| /x ff43 ffd0 search for hexpair with mask
| /z min max search for strings of given size
Because everything is treated as a file in radare2, it does not matter whether you search in a
socket, a remote device, in process memory, or a file.
97
Searching bytes
98
Basic Searches
Basic Search
A basic search for a plain text string in a file would be something like:
As can be seen from the output above, radare2 generates a "hit" flag for every entry found.
You can then use the ps command to see the strings stored at the offsets marked by the
flags in this group, athey ll haves names of the form hit0_<index> :
[0x00404888]> / ls
...
[0x00404888]> ps @ hit0_0
lseek
You can search for wide-char strings (e.g., unicode letters) using the /w command:
[0x00000000]> /w Hello
0 results found.
[0x0040488f]> /i Stallman
Searching 8 bytes from 0x00400238 to 0x0040488f: 53 74 61 6c 6c 6d 61 6e
[# ]hits: 004138 < 0x0040488f hits = 0
[0x00000000]> / \x7FELF
99
Basic Searches
But, if you are searching for a string of hexadecimal values, you're probably better of using
the /x command:
[0x00000000]> /x 7F454C46
Once the search is done, the results are stored in the searches flag space.
[0x00000000]> fs
0 0 . strings
1 0 . symbols
2 6 . searches
[0x00000000]> f
0x00000135 512 hit0_0
0x00000b71 512 hit0_1
0x00000bad 512 hit0_2
0x00000bdd 512 hit0_3
0x00000bfb 512 hit0_4
0x00000f2a 512 hit0_5
To remove "hit" flags after you do not need them anymore, use the f- hit* command.
Often, during long search sessions, you will need to launch the latest search more than
once. You can use the // command to repeat the last search.
100
Configurating the Search
The search.align variable is used to limit valid search hits to certain alignment. For
example, with e search.align=4 you will see only hits found at 4-bytes aligned offsets.
The search.flags boolean variable instructs the search engine to flag hits so that they can
be referenced later. If a currently running search is interrupted with Ctrl-C keyboard
sequence, current search position is flagged with search_stop .
101
Pattern Search
[0x00000000]> /p 10
This command output will show different patterns found and how many times each of them is
encountered.
102
Automation
Search Automation
The cmd.hit eval variable is used to define a radare2 command to be executed when a
matching entry is found by the search engine. If you want to run several commands,
separate them with ; . Alternatively, you can arrange them in a separate script, and then
invoke it as a whole with . script-file-name command. For example:
[0x00404888]> e cmd.hit = p8 8
[0x00404888]> / lib
Searching 3 bytes from 0x00400000 to 0x0041ae08: 6c 69 62
hits: 9
0x00400239 hit4_0 "lib64/ld-linux-x86-64.so.2"
31ed4989d15e4889
0x00400f19 hit4_1 "libselinux.so.1"
31ed4989d15e4889
0x00400fae hit4_2 "librt.so.1"
31ed4989d15e4889
0x00400fc7 hit4_3 "libacl.so.1"
31ed4989d15e4889
0x00401004 hit4_4 "libc.so.6"
31ed4989d15e4889
0x004013ce hit4_5 "libc_start_main"
31ed4989d15e4889
0x00416542 hit4_6 "libs/"
31ed4989d15e4889
0x00417160 hit4_7 "lib/xstrtol.c"
31ed4989d15e4889
0x00417578 hit4_8 "lib"
31ed4989d15e4889
103
Backward Search
Searching Backwards
To search backwards, use the /b command.
104
Search in Assembly
Assembler Search
If you want to search for a certain assembler opcodes, you can either use /c or /a
commands.
The command /c jmp [esp] searches for the specified asm mnemonic:
The command /a jmp eax assembles a string to machine code, and then searches for the
resulting bytes:
105
Searching for AES Keys
$ sudo r2 /dev/mem
[0x00000000]> /Ca
0 AES keys found
106
Signatures
Radare2 has its own format of the signatures, allowing to both load/apply and create them
on the fly. They are available under the z command namespace:
[0x000100b0]> z?
|Usage: z[*j-aof/cs] [args] # Manage zignatures
| z show zignatures
| z* show zignatures in radare format
| zj show zignatures in json format
| z-zignature delete zignature
| z-* delete all zignatures
| za[?] add zignature
| zo[?] manage zignature files
| zf[?] manage FLIRT signatures
| z/[?] search zignatures
| zc check zignatures at address
| zs[?] manage zignspaces
To load the created signature file you need to load it from SDB file using zo command or
from the compressed SDB file using zoz command.
To create signature you need to make function first, then you can create it from the function:
r2 /bin/ls
[0x000051c0]> aaa # this creates functions, including 'entry0'
[0x000051c0]> zaf entry0 entry
[0x000051c0]> z
entry:
bytes: 31ed4989d15e4889e24883e4f050544c............48............48............ff...
.......f4
graph: cc=1 nbbs=1 edges=0 ebbs=1
offset: 0x000051c0
[0x000051c0]>
As you can see it made a new signature with a name entry from a function entry0 . You
can show it in JSON format too, which can be useful for scripting:
107
Signatures
[0x000051c0]> zj~{}
[
{
"name": "entry",
"bytes": "31ed4989d15e4889e24883e4f050544c............48............48............
ff..........f4",
"graph": {
"cc": "1",
"nbbs": "1",
"edges": "0",
"ebbs": "1"
},
"offset": 20928,
"refs": [
]
}
]
[0x000051c0]>
To remove it just run z-entry But if you want to save all created signatures, you need to
save it into the SDB file using command zos myentry . Then we can apply them. Lets open
a file again:
r2 /bin/ls
-- Log On. Hack In. Go Anywhere. Get Everything.
[0x000051c0]> zo myentry
[0x000051c0]> z
entry:
bytes: 31ed4989d15e4889e24883e4f050544c............48............48............ff...
.......f4
graph: cc=1 nbbs=1 edges=0 ebbs=1
offset: 0x000051c0
[0x000051c0]>
This means that the signatures were successfully loaded from the file myentry and now we
can search matching functions:
[0x000051c0]> zc
[+] searching 0x000051c0 - 0x000052c0
[+] searching function metrics
hits: 1
[0x000051c0]>
Note that zc command just checks the signatures against the current address. To search
signatures across the all file we need to do a bit different thing. There is an important
moment though, if we just run it "as is" - it wont find anything:
108
Signatures
[0x000051c0]> z/
[+] searching 0x0021dfd0 - 0x002203e8
[+] searching function metrics
hits: 0
[0x000051c0]>
Note the searching address - this is because we need to adjust the searching range first:
[0x000051c0]> e search.in=io.section
[0x000051c0]> z/
[+] searching 0x000038b0 - 0x00015898
[+] searching function metrics
hits: 1
[0x000051c0]>
We are setting the search mode to io.section (it was file by default) to search in the
current section (assuming we are currently in the .text section of course). Now we can
check, what radare2 found for us:
[0x000051c0]> pd 5
;-- entry0:
;-- sign.bytes.entry_0:
0x000051c0 31ed xor ebp, ebp
0x000051c2 4989d1 mov r9, rdx
0x000051c5 5e pop rsi
0x000051c6 4889e2 mov rdx, rsp
0x000051c9 4883e4f0 and rsp, 0xfffffffffffffff0
[0x000051c0]>
Here we can see the comment of entry0 , which is taken from the ELF parsing, but also the
sign.bytes.entry_0 , which is exactly the result of matching signature.
[0x000051c0]> e zign.
zign.bytes = true
zign.graph = true
zign.maxsz = 500
zign.mincc = 10
zign.minsz = 16
zign.offset = true
zign.prefix = sign
zign.refs = true
[0x000051c0]>
109
Signatures
110
Disassembling
Disassembling
Disassembling in radare is just a way to represent an array of bytes. It is handled as a
special print mode within p command.
In the old times, when the radare core was smaller, the disassembler was handled by an
external rsc file. That is, radare first dumped current block into a file, and then simply called
objdump configured to disassemble for Intel, ARM etc... It was a working solution, but it was
inefficient as it repeated the same actions over and over, because there were no caches. As
a result, scrolling was terribly slow. Nowadays, the disassembler support is one of the basic
features of radare. It now has many options, including target architecture flavor and
disassembler variants, among other things.
To see the disassembly, use the pd command. It accepts a numeric argument to specify
how many opcodes of current block you want to see. Most of the commands in radare
consider the current block size as the default limit for data input. If you want to disassemble
more bytes, set a new block size using the b command.
The pD command works like pd but accepts the number of input bytes as its argument,
instead of the number of opcodes.
The "pseudo" syntax may be somewhat easier for a human to understand than the default
assembler notations. But it can become annoying if you read lots of code. To play with it:
111
Disassembling
[0x00405e1c]> e asm.syntax=att
[0x00405e1c]> pd 3
; JMP XREF from 0x00405dfa (fcn.00404531)
0x00405e1c 488b9424a80. mov 0x2a8(%rsp), %rdx
0x00405e24 64483314252. xor %fs:0x28, %rdx
0x00405e2d 4889d8 mov %rbx, %rax
112
Adding Metadata
By following common basic *NIX principles, it is easy to write a small utility in a scripting
language which uses objdump , otool , etc. to obtain information from a binary and to
import it into radare. For example, take a look at idc2r.py shipped with radare2ida. To use
it, invoke it as idc2r.py file.idc > file.r2 . It reads an IDC file exported from an IDA Pro
database and produces an r2 script containing the same comments, names of functions etc.
You can import the resulting 'file.r2' by using the dot . command of radare:
[0x00000000]> . file.r2
The . command is used to interpret Radare commands from external sources, including
files and program output. For example, to omit generation of an intermediate file and import
the script directly you can use this combination:
The C command is used to manage comments and data conversions. You can define a
range of program's bytes to be interpreted as either code, binary data or string. It is also
possible to execute external code at every specified flag location in order to fetch some
metadata, such as a comment, from an external file or database.
113
Adding Metadata
[0x00404cc0]> C?
|Usage: C[-LCvsdfm*?][*?] [...] # Metadata management
| C* list meta info in r2 commands
| C- [len] [[@]addr] delete metadata at given address rang
e
| CL[-][*] [file:line] [addr] show or add 'code line' information (
bininfo)
| CS[-][space] manage meta-spaces to filter comments
, etc..
| CC[?] [-] [comment-text] [@addr] add/remove comment
| CC.[addr] show comment in current address
| CC! [@addr] edit comment with $EDITOR
| CCa[-at]|[at] [text] [@addr] add/remove comment at given address
| CCu [comment-text] [@addr] add unique comment
| Cv[bsr][?] add comments to args
| Cs[?] [-] [size] [@addr] add string
| Cz[@addr] add zero-terminated string
| Ch[-] [size] [@addr] hide data
| Cd[-] [size] [repeat] [@addr] hexdump data array (Cd 4 10 == dword
[10])
| Cf[?][-] [sz] [0|cnt][fmt] [a0 a1...] [@addr] format memory (see pf?)
| CF[sz] [fcn-sign..] [@addr] function signature
| Cm[-] [sz] [fmt..] [@addr] magic parse (see pm?)
[0x00404cc0]>
[0x00000000]> pd 2
0x00000000 0000 add [rax], al
; this guy seems legit
0x00000002 0000 add [rax], al
The C? family of commands lets you mark a range as one of several kinds of types. Three
basic types are: code (disassembly is done using asm.arch), data (an array of data
elements) or string. Use the Cs comand to define a string, use the Cd command for
defining an array of data elements, and use the Cf command to define more complex data
structures like structs.
Annotating data types is most easily done in visual mode, using the "d" key, short for "data
type change". First, use the cursor to select a range of bytes (press c key to toggle cursor
mode and use HJKL keys to expand selection), then press 'd' to get a menu of possible
actions/types. For example, to mark the range as a string, use the 's' option from the menu.
You can achieve the same result from the shell using the Cs command:
114
Adding Metadata
The Cf command is used to define a memory format string (the same syntax used by the
pf command). Here's an example:
The [sz] argument to Cf is used to define how many bytes the struct should take up in
the disassembly, and is completely independent from the size of the dat structure define by
the format string. This may seem confusing, but has several uses. For example, you may
want to see the formatted structue displayed in the disassembly, but still have those
locations be visible as offsets and with raw bytes. Sometimes, you find large structures, but
only identified a few fields, or only interested in specific fields. Then, you can tell r2 to
display only those fields, using the format string and using 'skip' fields, and also have the
disassembly continue after the entire structure, by giving it full size using the sz argument.
Using Cf , it's easy to define to define complex structures with simple oneliners. See pf?
for more information. Remember that all these C commands can also be accessed from
the visual mode by pressing the d (data conversion) key.
115
ESIL
ESIL
ESIL stands for 'Evaluable Strings Intermediate Language'. It aims to describe a Forth)-like
representation for every target CPU opcode semantics. ESIL representations can be
evaluated (interpreted) in order to emulate individual instructions. Each command of an ESIL
expression is separated by a comma. Its virtual machine can be described as this:
while ((word=haveCommand())) {
if (word.isOperator()) {
esilOperators[word](esil);
} else {
esil.push (word);
}
nextCommand();
}
As we can see ESIL uses a stack-based interpreter similar to what is commonly used for
calculators. You have two categories of inputs: values and operators. A value simply gets
pushed on the stack, an operator then pops values (its arguments if you will) off the stack,
performs its operation and pushes its results (if any) back on. We can think of ESIL as a
post-fix notation of the operations we want to do.
4,esp,-=,ebp,esp,=[4]
Can you guess what this is? If we take this post-fix notation and transform it back to in-fix we
get
esp -= 4
4bytes(dword) [esp] = ebp
We can see that this corresponds to the x86 instruction push ebp ! Isn't that cool? The aim is
to be able to express most of the common operations performed by CPUs, like binary
arithmetic operations, memory loads and stores, processing syscalls etc. This way if we can
transform the instructions to ESIL we can see what a program does while it is running even
for the most cryptic architectures you definitely don't have a device to debug on for.
Use ESIL
116
ESIL
There are 2 environment variables that are important for watching what a program does:
"asm.emu" tells r2 if you want ESIL information to be displayed. If it is set to true you will see
comments appear to the right of your disassembly that tell you how the contents of registers
and memory addresses are changed by the current instruction. For example if you have an
instruction that subtracts a value from a register it tells you what the value was before and
what it becomes after. This is super useful so you don't have to sit there yourself and track
which value goes where.
One problem with this is that it is a lot of information to take in at once and sometimes you
simply don't need it. r2 has a nice compromise for this. That is what the "asm.emustr"
variable is for. Instead of this super verbose output with every register value, this only adds
really useful information to the output, e.g., strings that are found at addresses a program
uses or whether a jump is likely to be taken or not.
The third important variable is "asm.esil". This switches your disassembly to no longer show
you the actual disassembled instructions, but instead now shows you corresponding ESIL
expressions that describe what the instruction does. So if you want to take a look at how
instructions are expressed in ESIL simply set "asm.esil" to true.
ESIL Commands
"ae" : Evaluate ESIL expression.
[0x00000000]> aes
[0x00000000]>10aes
117
ESIL
[0x00000000]> aeso
[0x00000000]>10aeso
ESIL
Operands Name Operation example
Opcode
TRAP src Trap Trap signal
$ src Syscall syscall
Get address of
Instruction current instruction
$$ src
address stack=instruction
address
stack = (dst ==
src) ;
== src,dst Compare
update_eflags(dst
- src)
[0x0000000]> "ae 1,5,
stack = (dst < src) <"
Smaller
; 0x0
< src,dst (signed
update_eflags(dst [0x00000000]> "ae
comparison)
- src) 5,5"
0x0"
118
ESIL
[0x00000000]> "ae
stack = (dst > src) 1,5,>"
Bigger
; 0x1
> src,dst (signed
update_eflags(dst [0x00000000]> "ae
comparison)
- src) 5,5,>"
0x0
[0x00000000]> "ae
Bigger or stack = (dst >= 1,5,>="
Equal src) ; 0x1
>= src,dst
(signed update_eflags(dst [0x00000000]> "ae
comparison) - src) 5,5,>="
0x1
[0x00000000]> "ae
1,1,<<"
0x2
<< src,dst Shift Left stack = dst << src
[0x00000000]> "ae
2,1,<<"
0x4
[0x00000000]> "ae
1,4,>>"
0x2
>> src,dst Shift Right stack = dst >> src
[0x00000000]> "ae
2,4,>>"
0x1
[0x00000000]> "ae
31,1,<<<"
stack=dst ROL 0x80000000
<<< src,dst Rotate Left
src [0x00000000]> "ae
32,1,<<<"
0x1
[0x00000000]> "ae
1,1,>>>"
stack=dst ROR 0x80000000
>>> src,dst Rotate Right
src [0x00000000]> "ae
32,1,>>>"
0x1
[0x00000000]> "ae
1,1,&"
0x1
[0x00000000]> "ae
1,0,&"
0x0
& src,dst AND stack = dst & src [0x00000000]> "ae
0,1,&"
119
ESIL
[0x00000000]> "ae
0,0,&"
0x0
[0x00000000]> "ae
1,1,|"
0x1
[0x00000000]> "ae
1,0,|"
0x1
| src,dst OR stack = dst | src
[0x00000000]> "ae
0,1,|"
0x1
[0x00000000]> "ae
0,0,|"
0x0
[0x00000000]> "ae
1,1,^"
0x0
[0x00000000]> "ae
1,0,^"
0x1
^ src,dst XOR stack = dst ^src
[0x00000000]> "ae
0,1,^"
0x1
[0x00000000]> "ae
0,0,^"
0x0
[0x00000000]> "ae
3,4,+"
0x7
+ src,dst ADD stack = dst + src
[0x00000000]> "ae
5,5,+"
0xa
[0x00000000]> "ae
3,4,-"
0x1
[0x00000000]> "ae
- src,dst SUB stack = dst - src 5,5,-"
0x0
[0x00000000]> "ae
4,3,-"
0xffffffffffffffff
[0x00000000]> "ae
3,4,*"
0xc
* src,dst MUL stack = dst * src
[0x00000000]> "ae
5,5,*"
0x19
[0x00000000]> "ae
120
ESIL
[0x00000000]> "ae
2,4,/"
0x2
[0x00000000]> "ae
/ src,dst DIV stack = dst / src 5,5,/"
0x1
[0x00000000]> "ae
5,9,/"
0x1
[0x00000000]> "ae
2,4,%"
0x0
[0x00000000]> "ae
% src,dst MOD stack = dst % src 5,5,%"
0x0
[0x00000000]> "ae
5,9,%"
0x4
[0x00000000]> "ae 1,!"
0x0
[0x00000000]> "ae 4,!"
! src NEG stack = !!!src
0x0
[0x00000000]> "ae 0,!"
0x1
[0x00000000]> ar
r_00=0;ar r_00
0x00000000
[0x00000000]> "ae
r_00,++"
++ src INC stack = src++ 0x1
[0x00000000]> ar r_00
0x00000000
[0x00000000]> "ae
1,++"
0x2
[0x00000000]> ar
r_00=5;ar r_00
0x00000005
[0x00000000]> "ae
r_00,--"
-- src DEC stack = src-- 0x4
[0x00000000]> ar r_00
0x00000005
[0x00000000]> "ae 5,--
"
0x4
[0x00000000]> ar
r_01=5;ar r_00=0;ar
r_00
0x00000000
121
ESIL
[0x00000000]> "ae
+= src,reg ADD eq reg = reg + src r_01,r_00,+="
[0x00000000]> ar r_00
0x00000005
[0x00000000]> "ae
5,r_00,+="
[0x00000000]> ar r_00
0x0000000a
[0x00000000]> "ae
r_01,r_00,-="
[0x00000000]> ar r_00
0x00000004
-= src,reg SUB eq reg = reg - src
[0x00000000]> "ae
3,r_00,-="
[0x00000000]> ar r_00
0x00000001
[0x00000000]> ar
r_01=3;ar r_00=5;ar
r_00
0x00000005
[0x00000000]> "ae
r_01,r_00,*="
*= src,reg MUL eq reg = reg * src
[0x00000000]> ar r_00
0x0000000f
[0x00000000]> "ae
2,r_00,*="
[0x00000000]> ar r_00
0x0000001e
[0x00000000]> ar
r_01=3;ar r_00=6;ar
r_00
0x00000006
[0x00000000]> "ae
r_01,r_00,/="
/= src,reg DIV eq reg = reg / src
[0x00000000]> ar r_00
0x00000002
[0x00000000]> "ae
1,r_00,/="
[0x00000000]> ar r_00
0x00000002
[0x00000000]> ar
r_01=3;ar r_00=7;ar
r_00
0x00000007
[0x00000000]> "ae
r_01,r_00,%="
[0x00000000]> ar r_00
0x00000001
%= src,reg MOD eq reg = reg % src
[0x00000000]> ar
r_00=9;ar r_00
122
ESIL
r_00=9;ar r_00
0x00000009
[0x00000000]> "ae
5,r_00,%="
[0x00000000]> ar r_00
0x00000004
[0x00000000]> ar
r_00=1;ar r_01=1;ar
r_01
0x00000001
[0x00000000]> "ae
r_00,r_01,<<="
<<= src,reg Shift Left eq reg = reg << src
[0x00000000]> ar r_01
0x00000002
[0x00000000]> "ae
2,r_01,<<="
[0x00000000]> ar r_01
0x00000008
[0x00000000]> ar
r_00=1;ar r_01=8;ar
r_01
0x00000008
[0x00000000]> "ae
Shift Right r_00,r_01,>>="
>>= src,reg reg = reg << src
eq [0x00000000]> ar r_01
0x00000004
[0x00000000]> "ae
2,r_01,>>="
[0x00000000]> ar r_01
0x00000001
[0x00000000]> ar
r_00=2;ar r_01=6;ar
r_01
0x00000006
[0x00000000]> "ae
r_00,r_01,&="
[0x00000000]> ar r_01
0x00000002
&= src,reg AND eq reg = reg & src
[0x00000000]> "ae
2,r_01,&="
[0x00000000]> ar r_01
0x00000002
[0x00000000]> "ae
1,r_01,&="
[0x00000000]> ar r_01
0x00000000
[0x00000000]> ar
r_00=2;ar r_01=1;ar
r_01
0x00000001
123
ESIL
[0x00000000]> ar
r_00=2;ar
r_01=0xab;ar r_01
0x000000ab
[0x00000000]> "ae
r_00,r_01,^="
^= src,reg XOR eq reg = reg ^ src
[0x00000000]> ar r_01
0x000000a9
[0x00000000]> "ae
2,r_01,^="
[0x00000000]> ar r_01
0x000000ab
[0x00000000]> ar
r_00=4;ar r_00
0x00000004
++= reg INC eq reg = reg + 1 [0x00000000]> "ae
r_00,++="
[0x00000000]> ar r_00
0x00000005
[0x00000000]> ar
r_00=4;ar r_00
0x00000004
--= reg DEC eq reg = reg - 1 [0x00000000]> "ae
r_00,--="
[0x00000000]> ar r_00
0x00000003
[0x00000000]> ar
r_00=4;ar r_00
0x00000004
[0x00000000]> "ae
r_00,!="
!= reg NOT eq reg = !reg [0x00000000]> ar r_00
0x00000000
[0x00000000]> "ae
r_00,!="
[0x00000000]> ar r_00
0x00000001
------------------------------
--- --- --- ---
----------------
[0x00010000]> "ae
0xdeadbeef,0x10000,=
[4],"
124
ESIL
[4],"
=[] [0x00010000]> pxw
=[*] 4@0x10000
=[1] 0x00010000
src,dst poke *dst=src 0xdeadbeef ....
=[2]
=[4] [0x00010000]> "ae
=[8] 0x0,0x10000,=[4],"
[0x00010000]> pxw
4@0x10000
0x00010000
0x00000000
[0x00010000]> w
test@0x10000
[] [0x00010000]> "ae
[*] 0x10000,[4],"
[1] 0x74736574
src peek stack=*src
[2] [0x00010000]> ar
[4] r_00=0x10000
[8] [0x00010000]> "ae
r_00,[4],"
0x74736574
|=[]
|=[1]
[0x00000000]>
|=[2] reg nombre code
[0x00000000]>
|=[4]
|=[8]
Swap two top
SWAP Swap SWAP
elements
Pick nth element
PICK n Pick from the top of 2,PICK
the stack
If top element is a
reference
(register name,
NUM Numeric label, etc), NUM
dereference it
and push its real
value
CLEAR Clear Clear stack CLEAR
Stops ESIL
BREAK Break BREAK
emulation
125
ESIL
Stops execution
(reason: ESIL
TODO To Do TODO
expression not
completed)
ESIL Flags
ESIL VM has an internal state flags that are read only and can be used to export those
values to the underlying target CPU flags. It is because the ESIL VM always calculates all
flag changes, while target CPUs only update flags under certain conditions or at specific
instructions.
126
ESIL
The ? operator uses the value of its argument to decide whether to evaluate the
expression in curly braces.
If you want to run several expressions under a conditional, put them in curly braces:
zf,?{,eip,esp,=[],eax,eip,=,$r,esp,-=,}
Whitespaces, newlines and other chars are ignored. So the first thing when processing a
ESIL program is to remove spaces:
Syscalls need special treatment. They are indicated by '$' at the beginning of an expression.
You can pass an optional numeric value to specify a number of syscall. An ESIL emulator
must handle syscalls. See (r_esil_syscall).
a,b,- b - a
a,b,/= b /= a
Special Instructions
NOPs are represented as empty strings. As it was said previously, syscalls are marked by '$'
command. For example, '0x80,$'. It delegates emulation from the ESIL machine to a
callback which implements syscalls for a specific OS/kernel.
127
ESIL
Traps are implemented with the <code>,TRAP command. They are used to throw exceptions
for invalid instructions, division by zero, memory read error, etc.
Quick Analysis
Here is a list of some quick checks to retrieve information from an ESIL string. Relevant
information will be probably found in the first expression of the list.
Common operations:
Check dstreg
Check srcreg
Get destinaion
Is jump
Is conditional
Evaluate
Is syscall
CPU Flags
CPU flags are usually defined as single bit registers in the RReg profile. They and
sometimes found under the 'flg' register type.
Variables
Properties of the VM variables:
1. They have no predefined bit width. This way it should be easy to extend them to 128,
256 and 512 bits later, e.g. for MMX, SSE, AVX, Neon SIMD.
128
ESIL
4. Numbers can be specified in any base supported by RNum (dec, hex, oct, binary ...)
5. Each ESIL backend should have an associated RReg profile to describe the ESIL
register specs.
Bit Arrays
What to do with them? What about bit arithmetics if use variables instead of registers?
Arithmetics
1. ADD ("+")
2. MUL ("*")
3. SUB ("-")
4. DIV ("/")
5. MOD ("%")
Bit Arithmetics
1. AND "&"
2. OR "|"
3. XOR "^"
4. SHL "<<"
5. SHR ">>"
6. ROL "<<<"
7. ROR ">>>"
8. NEG "!"
129
ESIL
Usage example:
rep cmpsb
cx,!,?{,BREAK,},esi,[1],edi,[1],==,?{,BREAK,},esi,++,edi,++,cx,--,0,GOTO
Unimplemented/unhandled Instructions
Those are expressed with the 'TODO' command. which acts as a 'BREAK', but displays a
warning message describing that an instruction is not implemented and will not be emulated.
For example:
130
ESIL
[0x1000010f8]> e asm.esil=true
[0x1000010f8]> pd $r @ entry0
; [0] va=0x1000010f8 pa=0x000010f8 sz=13299 vsz=13299 rwx=-r-x 0.__text
;-- section.0.__text:
0x1000010f8 55 8,rsp,-=,rbp,rsp,=[8]
0x1000010f9 4889e5 rsp,rbp,=
0x1000010fc 4883c768 104,rdi,+=
0x100001100 4883c668 104,rsi,+=
0x100001104 5d rsp,[8],rbp,=,8,rsp,+=
┌─< 0x100001105 e950350000 0x465a,rip,= ;[1]
│ 0x10000110a 55 8,rsp,-=,rbp,rsp,=[8]
│ 0x10000110b 4889e5 rsp,rbp,=
│ 0x10000110e 488d4668 rsi,104,+,rax,=
│ 0x100001112 488d7768 rdi,104,+,rsi,=
│ 0x100001116 4889c7 rax,rdi,=
│ 0x100001119 5d rsp,[8],rbp,=,8,rsp,+=
┌──< 0x10000111a e93b350000 0x465a,rip,= ;[1]
││ 0x10000111f 55 8,rsp,-=,rbp,rsp,=[8]
││ 0x100001120 4889e5 rsp,rbp,=
││ 0x100001123 488b4f60 rdi,96,+,[8],rcx,=
││ 0x100001127 4c8b4130 rcx,48,+,[8],r8,=
││ 0x10000112b 488b5660 rsi,96,+,[8],rdx,=
││ 0x10000112f b801000000 1,eax,= ; 0x00000001
││ 0x100001134 4c394230 rdx,48,+,[8],r8,==,cz,?=
┌───< 0x100001138 7f1a sf,of,!,^,zf,!,&,?{,0x1154,rip,=,} ;[2]
┌────< 0x10000113a 7d07 of,!,sf,^,?{,0x1143,rip,} ;[3]
││││ 0x10000113c b8ffffffff 0xffffffff,eax,= ; 0xffffffff
┌─────< 0x100001141 eb11 0x1154,rip,= ;[2]
│└────> 0x100001143 488b4938 rcx,56,+,[8],rcx,=
│ │││ 0x100001147 48394a38 rdx,56,+,[8],rcx,==,cz,?=
Introspection
To ease ESIL parsing we should have a way to express introspection expressions to extract
data we want. For example, we may want to get the target address of a jump. The parser for
ESIL expressions should offer API to make it possible to extract information by analyzing the
expressions easily.
> ao~esil,opcode
opcode: jmp 0x10000465a
esil: 0x10000465a,rip,=
We need a way to retrieve the numeric value of 'rip'. This is a very simple example, but there
are more complex, like conditional ones. We need expressions to be able to get:
opcode type
destination of jump
131
ESIL
condition depends on
all regs modified (write)
all regs accessed (read)
API HOOKS
It is important for emulation to be able to setup hooks in parser, so we can extend it to
implement analysis without having to change parser again and again. That is, every time an
operation is about to be executed, a user hook is called. It can be used to determine if RIP
is going to change, or if the instruction updates stack, etc. Later, we can split that callback
into several ones to have an event-based analysis API that may be extended in js like this:
esil.on('regset', function(){..
esil.on('syscall', function(){esil.regset('rip'
Other operations that require bindings to external functionalities to work. In this case, r_ref
and r_io . This must be defined when initializing the esil vm.
Io Get/Set
Out ax, 44
44,ax,:ou
Selectors (cs,ds,gs...)
132
Scripting
Radare2 provides a wide set of a features to automate boring work. It ranges from the
simple sequencing of the commands to the calling scripts/another programs via IPC (Inter-
Process Communication), called r2pipe.
[0x00404800]> pd 1 ; ao 1
0x00404800 b827e66100 mov eax, 0x61e627 ; "tab"
address: 0x404800
opcode: mov eax, 0x61e627
prefix: 0
bytes: b827e66100
ptr: 0x0061e627
refptr: 0
size: 5
type: mov
esil: 6415911,rax,=
stack: null
family: cpu
[0x00404800]>
It simply runs the second command after finishing the first one, like in a shell.
The second important way to sequence the commands is with a simple pipe |
ao|grep address
Note, the | pipe only can pipe output of r2 commands to external (shell) commands, like
system programs or builtin shell commands. There is a similar way to sequence r2
commands, using the backtick operator `command` . The quoted part will undergo command
substitution and the output will be used as an argument of the command line.
For example, we want to see a few bytes of the memory at the address referred to by the
'mov eax, addr' instruction. We can do that without jumping to it, using a sequence of
commands:
133
Scripting
[0x00404800]> pd 1
0x00404800 b827e66100 mov eax, 0x61e627 ; "tab"
[0x00404800]> ao
address: 0x404800
opcode: mov eax, 0x61e627
prefix: 0
bytes: b827e66100
ptr: 0x0061e627
refptr: 0
size: 5
type: mov
esil: 6415911,rax,=
stack: null
family: cpu
[0x00404800]> ao~ptr[1]
0x0061e627
0
[0x00404800]> px 10 @ `ao~ptr[1]`
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x0061e627 7461 6200 2e69 6e74 6572 tab..inter
[0x00404800]>
And of course it's possible to redirect the output of an r2 command into a file, using the >
and >> commands
The ?$? command describes several helpful variables you can use to do similar actions
even more easily, like the $v "immediate value" variable, or the $m opcode memory
reference variable.
134
Loops
One of the most common task in automation is looping through something, there are multiple
ways to do this in radare2.
@@ flagname-regex
[0x004047d6]> afi
#
offset: 0x004047d0
name: entry0
size: 42
realsz: 42
stackframe: 0
call-convention: amd64
cyclomatic-complexity: 1
bits: 64
type: fcn [NEW]
num-bbs: 1
edges: 0
end-bbs: 1
call-refs: 0x00402450 C
data-refs: 0x004136c0 0x00413660 0x004027e0
code-xrefs:
data-xrefs:
locals:0
args: 0
diff: type: new
[0x004047d6]>
Now let's say, for example, that we'd like see a particular field from this output for all
functions found by analysis. We can do that with a loop over all function flags (whose names
begin with fcn. ):
This command will extract the name field from the afi output of every flag with a name
matching the regexp fcn.* .
We can also loop over a list of offsets, using the following syntax:
@@=1 2 3 ... N
135
Loops
For example, say we want to see the opcode information for 2 offsets: the current one, and
at current + 2:
Note we're using the $$ variable which evaluates to the current offset. Also note that $$+2
is evaluated before looping, so we can use the simple arithmetic expressions.
A third way to loop is by having the offsets be loaded from a file. This file should contain one
offset per line.
radare2 also offers various foreach constructs for looping. One of the most useful is for
looping through all the instructions of a function:
136
Loops
[0x004047d0]> pdf
╒ (fcn) entry0 42
│ ; UNKNOWN XREF from 0x00400018 (unk)
│ ; DATA XREF from 0x004064bf (sub.strlen_460)
│ ; DATA XREF from 0x00406511 (sub.strlen_460)
│ ; DATA XREF from 0x0040b080 (unk)
│ ; DATA XREF from 0x0040b0ef (unk)
│ 0x004047d0 31ed xor ebp, ebp
│ 0x004047d2 4989d1 mov r9, rdx
│ 0x004047d5 5e pop rsi
│ 0x004047d6 4889e2 mov rdx, rsp
│ 0x004047d9 4883e4f0 and rsp, 0xfffffffffffffff0
│ 0x004047dd 50 push rax
│ 0x004047de 54 push rsp
│ 0x004047df 49c7c0c03641. mov r8, 0x4136c0
│ 0x004047e6 48c7c1603641. mov rcx, 0x413660 ; "AWA..AVI..AU
I..ATL.%.. "
0x00413660 ; "AWA..AVI..AUI..ATL.%.. "
│ 0x004047ed 48c7c7e02740. mov rdi, main ; "AWAVAUATUH..
S..H...." @
0x4027e0
│ 0x004047f4 e857dcffff call sym.imp.__libc_start_main
╘ 0x004047f9 f4 hlt
[0x004047d0]> pi 1 @@i
mov r9, rdx
pop rsi
mov rdx, rsp
and rsp, 0xfffffffffffffff0
push rax
push rsp
mov r8, 0x4136c0
mov rcx, 0x413660
mov rdi, main
call sym.imp.__libc_start_main
hlt
In this example the command pi 1 runs over all the instructions in the current function
(entry0). There are other options too (not complete list, check @@? for more information):
@@b - iterate over all basic blocks of current function (see afb)
The last kind of looping lets you loop through predefined iterator types:
symbols
imports
registers
137
Loops
threads
comments
functions
flags
This is done using the @@@ command. The previous example of listing information about
functions can also be done using the @@@ command:
This will extract name field from afi output and will output a huge list of function names.
We can choose only the second column, to remove the redundant name: on every line:
138
Macros
Apart from simple sequencing and looping, radare2 allows to write simple macros, using this
construction:
This will define a macro called 'qwe' which runs sequentially first 'pd 4' then 'ao'. Calling the
macro using syntax .(macro) is simple:
address: 0x404800
opcode: mov eax, 0x61e627
prefix: 0
bytes: b827e66100
ptr: 0x0061e627
refptr: 0
size: 5
type: mov
esil: 6415911,rax,=
stack: null
family: cpu
[0x00404800]>
[0x00404800]> (*
(qwe , pd 4, ao)
And if want to remove some macro, just add '-' before the name:
[0x00404800]> (-qwe)
Macro 'qwe' removed.
[0x00404800]>
Moreover, it's possible to create a macro that takes arguments, which comes in handy in
some simple scripting situations. To create a macro that takes arguments you simply add
them to macro definition. Be sure, if you're using characters like ';', to quote the whole
command for proper parsing.
139
Macros
[0x00404800]
[0x004047d0]> "(foo x y,pd $0; s +$1)"
[0x004047d0]> .(foo 5 6)
;-- entry0:
0x004047d0 31ed xor ebp, ebp
0x004047d2 4989d1 mov r9, rdx
0x004047d5 5e pop rsi
0x004047d6 4889e2 mov rdx, rsp
0x004047d9 4883e4f0 and rsp, 0xfffffffffffffff0
[0x004047d6]>
As you can see, the arguments are named by index, starting from 0: $0, $1, ...
140
R2pipe
R2pipe
The r2pipe api was initially designed for NodeJS in order to support reusing the web's r2.js
API from the commandline. The r2pipe module permits interacting with r2 instances in
different methods:
Examples
Python
$ pip install r2pipe
import r2pipe
r2 = r2pipe.open("/bin/ls")
r2.cmd('aa')
print(r2.cmd("afl"))
print(r2.cmdj("aflj")) # evaluates JSONs and returns an object
141
R2pipe
NodeJS
$ npm install r2pipe
function doSomeStuff(r) {
var r2 = require (r2jsapi)(r);
r2.analOp ("entry0", function(op) {
console.log (op.size, op.opcode, op.esil);
});
r2.cmd ('af @ entry0', function (o) {
r2.cmd ("pdf @ entry0", function (o) {
console.log (o);
r.quit ()
});
});
}
Go
$ r2pm -i r2pipe-go
-- go-r2pipe repo
142
R2pipe
package main
import (
"fmt"
"github.com/radare/r2pipe-go"
)
func main() {
r2p, err := r2pipe.NewPipe("/bin/ls")
if err != nil {
panic(err)
}
defer r2p.Close()
_, err = r2p.Cmd("aaaa")
if err != nil {
panic(err)
}
buf0, err := r2p.Cmd("S*")
if err != nil {
panic(err)
}
fmt.Println(buf0)
Rust
$ cat Cargo.toml
...
[dependencies]
r2pipe = "*"
143
R2pipe
#[macro_use]
extern crate r2pipe;
use r2pipe::R2Pipe;
fn main() {
// Initialize a new pipe
let mut r2p = open_pipe!(Some("/bin/ls")).unwrap();
Ruby
$ gem install r2pipe
require 'r2pipe'
puts 'r2pipe ruby api demo'
puts '===================='
r2p = R2Pipe.new '/bin/ls'
puts r2p.cmd 'pi 5'
puts r2p.cmd 'pij 1'
puts r2p.json(r2p.cmd 'pij 1')
puts r2p.cmd 'px 64'
r2p.quit
Perl
144
R2pipe
#!/usr/bin/perl
use R2::Pipe;
use strict;
my $r = R2::Pipe->new ("/bin/ls");
print $r->cmd ("pd 5")."\n";
print $r->cmd ("px 64")."\n";
$r->quit ();
Erlang
#!/usr/bin/env escript
%% -*- erlang -*-
%%! -smp enable
%% -sname hr
-mode(compile).
-export([main/1]).
main(_Args) ->
%% adding r2pipe to modulepath, set it to your r2pipe_erl location
R2pipePATH = filename:dirname(escript:script_name()) ++ "/ebin",
true = code:add_pathz(R2pipePATH),
Haskell
145
R2pipe
import R2pipe
import qualified Data.ByteString.Lazy as L
showMainFunction ctx = do
cmd ctx "s main"
L.putStr =<< cmd ctx "pD `fl $$`"
main = do
-- Run r2 locally
open "/bin/ls" >>= showMainFunction
-- Connect to r2 via HTTP (e.g. if "r2 -qc=h /bin/ls" is running)
open "http://127.0.0.1:9090" >>= showMainFunction
Dotnet
146
R2pipe
using System;
using System.Collections.Generic;
using System.Diagnostics;
using System.Linq;
using System.Text;
using System.Threading.Tasks;
using r2pipe;
namespace LocalExample
{
class Program
{
static void Main(string[] args)
{
#if __MonoCS__
using(IR2Pipe pipe = new R2Pipe("/bin/ls"))
#else
using (IR2Pipe pipe = new R2Pipe(@"C:\Windows\notepad.exe",
@"C:\radare2\radare2.exe"))
#endif
{
Console.WriteLine("Hello r2! " + pipe.RunCommand("?V"));
Task<string> async = pipe.RunCommandAsync("?V");
Console.WriteLine("Hello async r2!" + async.Result);
QueuedR2Pipe qr2 = new QueuedR2Pipe(pipe);
qr2.ExecuteCommands();
}
}
}
}
Java
147
R2pipe
import org.radare.r2pipe.R2Pipe;
Swift
if let r2p = R2Pipe(url:nil) {
r2p.cmd ("?V", closure:{
(str:String?) in
if let s = str {
print ("Version: \(s)");
exit (0);
} else {
debugPrint ("R2PIPE. Error");
exit (1);
}
});
NSRunLoop.currentRunLoop().run();
} else {
print ("Needs to run from r2")
}
Vala
148
R2pipe
NewLisp
(load "r2pipe.lsp")
(println "pd 3:\n" (r2pipe:cmd "pd 3"))
(exit)
Dlang
import std.stdio;
import r2pipe;
void main() {
auto r2 = r2pipe.open ();
writeln ("Hello "~ r2.cmd("?e World"));
writeln ("Hello "~ r2.cmd("?e Works"));
149
Rabin2
$ rabin2 -h
Usage: rabin2 [-AcdeEghHiIjlLMqrRsSvVxzZ] [-@ at] [-a arch] [-b bits] [-B addr]
[-C F:C:D] [-f str] [-m addr] [-n str] [-N m:M] [-P[-P] pdb]
[-o str] [-O str] [-k query] [-D lang symname] | file
-@ [addr] show section, symbol or import at addr
-A list sub-binaries and their arch-bits pairs
-a [arch] set arch (x86, arm, .. or <arch>_<bits>)
-b [bits] set bits (32, 64 ...)
-B [addr] override base address (pie bins)
-c list classes
-C [fmt:C:D] create [elf,mach0,pe] with Code and Data hexpairs (see -a)
-d show debug/dwarf information
-D lang name demangle symbol name (-D all for bin.demangle=true)
-e entrypoint
-E globally exportable symbols
-f [str] select sub-bin named str
-F [binfmt] force to use that bin plugin (ignore header check)
-g same as -SMZIHVResizcld (show all info)
-G [addr] load address . offset to header
-h this help message
-H header fields
-i imports (symbols imported from libraries)
-I binary info
-j output in json
-k [sdb-query] run sdb query. for example: '*'
-K [algo] calculate checksums (md5, sha1, ..)
-l linked libraries
-L [plugin] list supported bin plugins or plugin details
-m [addr] show source line at addr
-M main (show address of main symbol)
-n [str] show section, symbol or import named str
-N [min:max] force min:max number of chars per string (see -z and -zz)
-o [str] output file/folder for write operations (out by default)
-O [str] write/extract operations (-O help)
-p show physical addresses
-P show debug/pdb information
-PP download pdb file for binary
-q be quiet, just show fewer data
-qq show less info (no offset/size for -z for ex.)
-Q show load address used by dlopen (non-aslr libs)
150
Rabin2
-r radare output
-R relocations
-s symbols
-S sections
-u unfiltered (no rename duplicated symbols/sections)
-v display version and quit
-V Show binary version information
-x extract bins contained in file
-X [fmt] [f] .. package in fat or zip the given files and bins contained in file
-z strings (from data section)
-zz strings (from raw bins [e bin.rawstr=1])
-zzz dump raw strings to stdout (for huge files)
-Z guess size of binary program
......
151
File Identification
$ rabin2 -I /bin/ls
file /bin/ls
type EXEC (Executable file)
pic false
has_va true
root elf
class ELF64
lang c
arch x86
bits 64
machine AMD x86-64 architecture
os linux
subsys linux
endian little
strip true
static false
linenum false
lsyms false
relocs false
rpath NONE
To make rabin2 output information in format that the main program, radare2, can
understand, pass -Ir option to it:
152
Entrypoint
Code Entrypoints
The -e option passed to rabin2 will show entrypoints for given binary. Two examples:
$ rabin2 -e /bin/ls
[Entrypoints]
addr=0x00004888 off=0x00004888 baddr=0x00000000
1 entrypoints
153
Imports
Imports
Rabin2 is able to find imported objects by an executable, as well as their offsets in its PLT.
This information is useful, for example, to understand what external function is invoked by
call instruction. Pass -i flag to rabin to get a list of imports. An example:
154
Exports
Exports
Rabin2 is able to find Exports. An example:
155
Symbols (exports)
Symbols (Exports)
With rabin2, the generated symbols list format is similar to the imports list. Use the -s
option to get it:
With the -sr option rabin2 produces a radare2 script instead. It can later be passed to the
core to automatically flag all symbols and to define corresponding byte ranges as functions
and data blocks.
fs symbols
Cd 8 @ 0x0021a610
f sym.stdout 8 0x0021a610
f sym._edata 0 0x0021a600
f sym._end 0 0x0021b388
Cd 8 @ 0x0021a600
f sym.__progname 8 0x0021a600
Cd 8 @ 0x0021a630
f sym.program_invocation_name 8 0x0021a630
f sym.__bss_start 0 0x0021a600
156
Libraries
List Libraries
Rabin2 can list libraries used by a binary with the -l option:
$ rabin2 -l /bin/ls
[Linked libraries]
libselinux.so.1
librt.so.1
libacl.so.1
libc.so.6
4 libraries
If you compare the outputs of rabin2 -l and ldd , you will notice that rabin2 lists fewer
libraries than ldd . The reason is that rabin2 does not follow and does not show
dependencies of libraries. Only direct binary dependencies are shown.
157
Strings
Strings
The -z option is used to list readable strings found in the .rodata section of ELF binaries,
or the .text section of PE files. Example:
With the -zr option, this information is represented as a radare2 commands list. It can be
used in a radare2 session to automatically create a flag space called "strings" pre-populated
with flags for all strings found by rabin2. Furthermore, this script will mark corresponding
byte ranges as strings instead of code.
158
Strings
159
Program Sections
Program Sections
Rabin2 called with the -S option gives complete information about the sections of an
executable. For each section the index, offset, size, alignment, type and permissions, are
shown. The next example demonstrates this:
$ rabin2 -S /bin/ls
[Sections]
idx=00 addr=0x00000238 off=0x00000238 sz=28 vsz=28 perm=-r-- name=.interp
idx=01 addr=0x00000254 off=0x00000254 sz=32 vsz=32 perm=-r-- name=.note.ABI_tag
idx=02 addr=0x00000274 off=0x00000274 sz=36 vsz=36 perm=-r-- name=.note.gnu.build_id
idx=03 addr=0x00000298 off=0x00000298 sz=104 vsz=104 perm=-r-- name=.gnu.hash
idx=04 addr=0x00000300 off=0x00000300 sz=3096 vsz=3096 perm=-r-- name=.dynsym
idx=05 addr=0x00000f18 off=0x00000f18 sz=1427 vsz=1427 perm=-r-- name=.dynstr
idx=06 addr=0x000014ac off=0x000014ac sz=258 vsz=258 perm=-r-- name=.gnu.version
idx=07 addr=0x000015b0 off=0x000015b0 sz=160 vsz=160 perm=-r-- name=.gnu.version_r
idx=08 addr=0x00001650 off=0x00001650 sz=168 vsz=168 perm=-r-- name=.rela.dyn
idx=09 addr=0x000016f8 off=0x000016f8 sz=2688 vsz=2688 perm=-r-- name=.rela.plt
idx=10 addr=0x00002178 off=0x00002178 sz=26 vsz=26 perm=-r-x name=.init
idx=11 addr=0x000021a0 off=0x000021a0 sz=1808 vsz=1808 perm=-r-x name=.plt
idx=12 addr=0x000028b0 off=0x000028b0 sz=64444 vsz=64444 perm=-r-x name=.text
idx=13 addr=0x0001246c off=0x0001246c sz=9 vsz=9 perm=-r-x name=.fini
idx=14 addr=0x00012480 off=0x00012480 sz=20764 vsz=20764 perm=-r-- name=.rodata
idx=15 addr=0x0001759c off=0x0001759c sz=1820 vsz=1820 perm=-r-- name=.eh_frame_hdr
idx=16 addr=0x00017cb8 off=0x00017cb8 sz=8460 vsz=8460 perm=-r-- name=.eh_frame
idx=17 addr=0x00019dd8 off=0x00019dd8 sz=8 vsz=8 perm=-rw- name=.init_array
idx=18 addr=0x00019de0 off=0x00019de0 sz=8 vsz=8 perm=-rw- name=.fini_array
idx=19 addr=0x00019de8 off=0x00019de8 sz=8 vsz=8 perm=-rw- name=.jcr
idx=20 addr=0x00019df0 off=0x00019df0 sz=512 vsz=512 perm=-rw- name=.dynamic
idx=21 addr=0x00019ff0 off=0x00019ff0 sz=16 vsz=16 perm=-rw- name=.got
idx=22 addr=0x0001a000 off=0x0001a000 sz=920 vsz=920 perm=-rw- name=.got.plt
idx=23 addr=0x0001a3a0 off=0x0001a3a0 sz=608 vsz=608 perm=-rw- name=.data
idx=24 addr=0x0001a600 off=0x0001a600 sz=3464 vsz=3464 perm=-rw- name=.bss
idx=25 addr=0x0001a600 off=0x0001a600 sz=8 vsz=8 perm=---- name=.gnu_debuglink
idx=26 addr=0x0001a608 off=0x0001a608 sz=254 vsz=254 perm=---- name=.shstrtab
27 sections
With the -Sr option, rabin2 will flag the start/end of every section, and will pass the rest of
information as a comment.
160
Program Sections
161
Binary Diffing
Binary Diffing
This section is based on the http://radare.today article "binary diffing"
Without any parameters, radiff2 by default shows what bytes are changed and their
corresponding offsets:
$ rasm2 -d 85c00f94c0
test eax, eax
sete al
For bulk processing, you may want to have a higher-level overview of differences. This is
why radare2 is able to compute the distance and the percentage of similarity between two
files with the -s option:
If you want more concrete data, it's also possible to count the differences, with the -c
option:
If you are unsure whether you are dealing with similar binaries, with -C flag you can check
there are matching functions. It this mode, it will give you three columns for all functions:
"First file offset", "Percentage of matching" and "Second file offset".
162
Binary Diffing
Moreover, we can ask radiff2 to perform analysis first - adding -A option will run aaa on
the binaries. And we can specify binaries architecture for this analysis too using
And now a cool feature : radare2 supports graph-diffing, à la DarunGrim, with the -g
option. You can either give it a symbol name, of specify two offsets, if the function you want
to diff is named differently in compared files. For example, radiff2 -g main /bin/true
/bin/false | xdot - will show differences in main() function of Unix true and false
programs. You can compare it to radiff2 -g main /bin/false /bin/true (Notice the order of
the arguments) to get the two versions. This is the result:
163
Binary Diffing
Parts in yellow indicate that some offsets do not match. The grey piece means a perfect
match. The red one highlights a strong difference. If you look closely, you will see that the
left part of the picture has mov edi, 0x1; call sym.imp.exit , while the right one has xor
edi, edi; call sym.imp.exit .
Binary diffing is an important feature for reverse engineering. It can be used to analyze
security updates, infected binaries, firmware changes and more...
We have only shown the code analysis diffing functionality, but radare2 supports additional
types of diffing between two binaries: at byte level, deltified similarities, and more to come.
We have plans to implement more kinds of bindiffing algorithms into r2, and why not, add
support for ascii art graph diffing and better integration with the rest of the toolkit.
164
Rasm2
Rasm2
rasm2 is an inline assembler/disassembler. Initially, rasm tool was designed to be used for
binary patching. It is main function is get bytes corresponding to given machine instruction
opcode.
$ rasm2 -h
Usage: rasm2 [-CdDehLBvw] [-a arch] [-b bits] [-o addr] [-s syntax]
[-f file] [-F fil:ter] [-i skip] [-l len] 'code'|hex|-
-a [arch] Set architecture to assemble/disassemble (see -L)
-b [bits] Set cpu register size (8, 16, 32, 64) (RASM2_BITS)
-c [cpu] Select specific CPU (depends on arch)
-C Output in C format
-d, -D Disassemble from hexpair bytes (-D show hexpairs)
-e Use big endian instead of little endian
-f [file] Read data from file
-F [in:out] Specify input and/or output filters (att2intel, x86.pseudo, ...)
-h Show this help
-i [len] ignore/skip N bytes of the input buffer
-k [kernel] Select operating system (linux, windows, darwin, ..)
-l [len] Input/Output length
-L List supported asm plugins
-o [offset] Set start address for code (default 0)
-O [file] Output file name (rasm2 -Bf a.asm -O a)
-s [syntax] Select syntax (intel, att)
-B Binary input/output (-l is mandatory for binary input)
-v Show version information
-w What's this instruction for? describe opcode
If '-l' value is greater than output length, output is padded with nops
If the last argument is '-' reads from stdin
Plugins for supported target architectures can be listed with the -L option. Knowing a
plugin name, you can use it by specifying its name to the -a option
165
Rasm2
$ rasm2 -L
_d 16 8051 PD 8051 Intel CPU
_d 16 32 arc GPL3 Argonaut RISC Core
ad 16 32 64 arm GPL3 Acorn RISC Machine CPU
_d 16 32 64 arm.cs BSD Capstone ARM disassembler
_d 16 32 arm.winedbg LGPL2 WineDBG's ARM disassembler
_d 16 32 avr GPL AVR Atmel
ad 32 bf LGPL3 Brainfuck
_d 16 cr16 LGPL3 cr16 disassembly plugin
_d 16 csr PD Cambridge Silicon Radio (CSR)
ad 32 64 dalvik LGPL3 AndroidVM Dalvik
ad 16 dcpu16 PD Mojang's DCPU-16
_d 32 64 ebc LGPL3 EFI Bytecode
_d 8 gb LGPL3 GameBoy(TM) (z80-like)
_d 16 h8300 LGPL3 H8/300 disassembly plugin
_d 8 i8080 BSD Intel 8080 CPU
ad 32 java Apache Java bytecode
_d 32 m68k BSD Motorola 68000
_d 32 malbolge LGPL3 Malbolge Ternary VM
ad 32 64 mips GPL3 MIPS CPU
_d 16 32 64 mips.cs BSD Capstone MIPS disassembler
_d 32 nios2 GPL3 NIOS II Embedded Processor
_d 32 64 ppc GPL3 PowerPC
_d 32 64 ppc.cs BSD Capstone PowerPC disassembler
ad rar LGPL3 RAR VM
_d 32 sh GPL3 SuperH-4 CPU
_d 32 64 sparc GPL3 Scalable Processor Architecture
_d 32 tms320 LGPLv3 TMS320 DSP family
_d 32 ws LGPL3 Whitespace esotheric VM
_d 16 32 64 x86 BSD udis86 x86-16,32,64
_d 16 32 64 x86.cs BSD Capstone X86 disassembler
a_ 32 64 x86.nz LGPL3 x86 handmade assembler
ad 32 x86.olly GPL2 OllyDBG X86 disassembler
ad 8 z80 NC-GPL2 Zilog Z80
Note that "ad" in the first column means both assembler and disassembler are offered by a
corresponding plugin. "d" indicates disassembler, "a" means only assembler is available.
166
Assemble
Assembler
rasm2 can be used from the command-line to quickly copy-paste hexpairs that represent a
The assembler understands the following input languages and their flavors: x86 (Intel and
AT&T variants), olly (OllyDBG syntax), powerpc (PowerPC), arm and java. For Intel syntax,
rasm2 tries to mimic NASM or GAS.
There are several examples in the rasm2 source code directory. Consult them to understand
how you can assemble a raw binary file from a rasm2 description.
167
Assemble
$ cat selfstop.rasm
;
; Self-Stop shellcode written in rasm for x86
;
; --pancake
;
.arch x86
.equ base 0x8048000
.org 0x8048000 ; the offset where we inject the 5 byte jmp
selfstop:
push 0x8048000
pusha
mov eax, 20
int 0x80
ret
[0x00000000]> e asm.bits = 32
[0x00000000]> wx `!rasm2 -f a.rasm`
[0x00000000]> pd 20
0x00000000 6800800408 push 0x8048000 ; 0x08048000
0x00000005 60 pushad
0x00000006 b814000000 mov eax, 0x14 ; 0x00000014
0x0000000b cd80 int 0x80
syscall[0x80][0]=?
0x0000000d 89c3 mov ebx, eax
0x0000000f b913000000 mov ecx, 0x13 ; 0x00000013
0x00000014 b825000000 mov eax, 0x25 ; 0x00000025
0x00000019 cd80 int 0x80
syscall[0x80][0]=?
0x0000001b 61 popad
0x0000001c c3 ret
0x0000001d c3 ret
168
Disassemble
Disassembler
Passing the -d option to rasm2 allows you to disassemble a hexpair string:
169
Ragg2
Purpose
ragg2 compiles programs written in a simple high-level language into tiny binaries for x86,
x86-64, and ARM.
this means that you have to define the proper stackframe size at the
The compiler generates assembly code for x86-{32,64} and arm. But it aims
to support more platforms. This code is the compiled with r_asm and
on-disk binaries.
The generated code is not yet optimized, but it's safe to be executed
Preprocessor
Aliases
Sometimes you just need to replace at compile time a single entity on
AF_INET@alias(2);
printf@alias(0x8053940);
170
Ragg2
Includes
Use cat(1) or the preprocessor to concatenate multiple files to be compiled.
INCDIR@alias("/usr/include/ragg2");
sys-osx.r@include(INCDIR);
Hashbang
eggs can use a hashbang to make them executable.
$ head -n1 hello.r
#!/usr/bin/ragg2 -X
$ ./hello.r
Hello World!
Main
The execution of the code is done as in a flow. The first function to be
defined will be the first one to be executed. If you want to run main()
main();
...
main@global(128,64) {
...
Function definition
You may like to split up your code into several code blocks. Those blocks
Function signatures
name@type(stackframesize,staticframesize) { body }
171
Ragg2
Function types
alias Used to create aliases
Syscalls
r_egg offers a syntax sugar for defining syscalls. The syntax is like this:
exit@syscall(1);
@syscall() {
: int 0x80
main@global() {
exit (0);
Libraries
At the moment there is no support for linking r_egg programs to system
libraries. but if you inject the code into a program (disk/memory) you
can define the address of each function using the @alias syntax.
Core library
There's a work-in-progress libc-like library written completely in r_egg
172
Ragg2
Variables
.arg
.arg0
.arg1
.arg2
.var0
.var2
.fix
.bp
.pc
.sp
Attention: All the numbers after .var and .arg mean the offset with the
Arrays
Supported as raw pointers. TODO: enhance this feature
Tracing
Sometimes r_egg programs will break or just not work as expected. Use the
Pointers
TODO: Theorically '*' is used to get contents of a memory pointer.
Virtual registers
TODO: a0, a1, a2, a3, sp, fp, bp, pc
Math operations
Ragg2 supports local variables assignment by math operating, including
173
Ragg2
+ - * / & | ^
Return values
The return value is stored in the a0 register, this register is set when
$ cat test.r
add@global(4) {
.var0;
main@global() {
add (3,4);
$ ./test
$ echo $?
Traps
Each architecture have a different instruction to break the execution of
the program. REgg language captures calls to 'break()' to run the emit_trap
Inline assembly
Lines prefixed with ':' char are just inlined in the output assembly.
: jmp 0x8048400
174
Ragg2
: .byte 33,44
Labels
You can define labels using the : keyword like this:
:label_name:
/* loop forever */
goto(label_name )
Control flow
goto (addr) -- branch execution
while (cond)
if (cond)
Comments
Supported syntax for comments are:
/* multiline comment */'
175
Analysis
176
Code Analysis
Code Analysis
Code analysis is a common technique used to extract information from assembly code.
Radare uses internal data structures to identify basic blocks, function trees, to extract
opcode-level information etc. The most common radare2 analysis command sequence is:
[0x08048440]> aa
[0x08048440]> pdf @ main
177
Code Analysis
| fcn.080485a5() ; fcn.080484c6+223
| 0x080486c0 b800000000 mov eax, 0x0
| 0x080486c5 8b4df4 mov ecx, [ebp-0xc]
| 0x080486c8 8b5df8 mov ebx, [ebp-0x8]
| 0x080486cb 8b75fc mov esi, [ebp-0x4]
| 0x080486ce 89ec mov esp, ebp
| 0x080486d0 5d pop ebp
| 0x080486d1 8d61fc lea esp, [ecx-0x4]
\ 0x080486d4 c3 ret
In this example, we analyze the whole file ( aa ) and then print disassembly of the main()
function ( pdf ).
178
Symbols information
Radare2 automatically parsing available imports and exports sections in the binary, but
moreover it can load additional debugging information, if presented. Two main formats are
supported: DWARF and PDB (for Windows binaries). Note, that unlike many tools radare2
doesn't rely on Windows API to parse PDB files, thus they can be loaded on any other
supported platform - e.g. Linux or OS X.
DWARF debuginfo loads automatically by default, because usually it's stored right in the
executable file. PDB is a bit of different beast - it is always stored as a separate binary, thus
the different logic of handling it.
At first, one of the common scenarios is to analyse the file from Windows distribution. In this
case all PDB files are available in the Microsoft server, which is by default is in options. See
all pdb options in radare2:
pdb.autoload = 0
pdb.extract = 1
pdb.server = https://msdl.microsoft.com/download/symbols
pdb.useragent = Microsoft-Symbol-Server/6.11.0001.402
Using the variable pdb.server you can change the address where radare2 will try to
download the PDB file by the GUID stored in executable header. Usually there is no reason
to change default pdb.useragent , but who knows where it could be handy?
Because those PDB files are stored as "cab" archives on the server, pdb.extract=1 says to
automatically extract them.
Note, that for automatical downloading to work you need "cabextract" tool, and wget/curl
installed.
Sometimes you don't need to do that from the radare2 itself, thus - two handy rabin2
options:
where -PP automatically downloads the pdb for the selected binary, using those pdb.*
config options. -P will dump the contents of the PDB file, which is useful sometimes for a
quick understanding of the symbols stored in it.
Apart from the basic scenario of just opening file, PDB information can be additionally
manipulated by the id commands:
179
Symbols information
[0x000051c0]> id?
|Usage: id Debug information
| Output mode:
| '*' Output in radare commands
| id Source lines
| idp [file.pdb] Load pdb file information
| idpi [file.pdb] Show pdb file information
| idpd Download pdb file on remote server
Where idpi is basically the same as rabin2 -P . Note, that idp can be also used not
only in the static analysis mode, but also in the debugging mode, even if connected via
WinDbg.
For simplifying the loading PDBs, especially for the processes with many linked DLLs,
radare2 can autoload all required PDBs automatically - you need just set the e
pdb.autoload=true option. Then if you load some file in debugging mode in Windows, using
r2 -d file.exe or r2 -d 2345 (attach to pid 2345), all related PDB files will be loaded
automatically.
DWARF information loading on the other hand is completely automated, you don't need to
run any commands/change any options:
r2 `which rabin2`
[0x00002437 8% 300 /usr/local/bin/rabin2]> pd $r
┌──< 0x00002437 752f jne 0x2468 ;[1]
│⁝ 0x00002439 48833d9f5b20. cmp qword reloc.__cxa_finalize_224, 0
│⁝ 0x00002441 55 push rbp
│⁝ 0x00002442 4889e5 mov rbp, rsp
┌───< 0x00002445 740c je 0x2453 ;[2]
││⁝ 0x00002447 488d3df25720. lea rdi, obj.__dso_handle ; 0x207c40 ; "@|
"
││⁝ 0x0000244e e80dffffff call 0x2360 ;[3]
└───> 0x00002453 e848ffffff call sym.deregister_tm_clones ;[4]
│⁝ 0x00002458 c605915e2000. mov byte [obj.completed.6991], 1 ; obj.__TMC
_END__ ; [0x2082f0:1]=0
│⁝ 0x0000245f 5d pop rbp
│⁝ 0x00002460 c3 ret
│⁝ 0x00002461 0f1f80000000. nop dword [rax]
└──> 0x00002468 f3c3 ret
⁝ 0x0000246a 660f1f440000 nop word [rax + rax]
⁝ ;-- entry1.init:
⁝ ;-- frame_dummy:
⁝ 0x00002470 55 push rbp
⁝ 0x00002471 4889e5 mov rbp, rsp
⁝ 0x00002474 5d pop rbp
└─< 0x00002475 e966ffffff jmp sym.register_tm_clones ;[5]
;-- blob_version:
0x0000247a 55 push rbp ; ../blob/versi
on.c:18
180
Symbols information
As you can see it loads function names and source line information.
181
Types
Radare2 supports the C-syntax data types description. Those types are parsed by C11-
compatible parser and stored in the internal SDB, thus introspectable with k command.
[0x000051c0]> t?
|Usage: t # cparse types commands
| t List all loaded types
| t <type> Show type in 'pf' syntax
| t* List types info in r2 commands
| t- <name> Delete types by its name
| t-* Remove all types
| ta <type> Mark immediate as a type offset
| tb <enum> <value> Show matching enum bitfield for given number
| tc ([cctype]) calling conventions listing and manipulations
| te[?] List all loaded enums
| te <enum> <value> Show name for given enum number
| td[?] <string> Load types from string
| tf List all loaded functions signatures
| tk <sdb-query> Perform sdb query
| tl[?] Show/Link type to an address
| tn[?] [-][addr] manage noreturn function attributes and marks
| to - Open cfg.editor to load types
| to <path> Load types from C header file
| tos <path> Load types from parsed Sdb database
| tp <type> = <address> cast data at <address> to <type> and print it
| ts[?] print loaded struct types
| tu[?] print loaded union types
[0x000051c0]>
Note, that the basic (atomic) types are not those from C standard - not char , _Bool , or
short . Because those types can be different from one platform to another, radare2 uses
definite types like as int8_t or uint64_t and will convert int to int32_t or int64_t
Basic types can be listed using t command, for the structured types you need to use ts ,
tu or te for enums:
182
Types
[0x000051c0]> t
char
char *
int
int16_t
int32_t
int64_t
int8_t
long
long long
...
[0x000051c0]> ts
S1
[0x000051c0]>
There are three easy ways to define a new types: directly from the string using td
command, from the file using to command or open an $EDITOR to type the definitions in
place.
Notice here we used tp command, which is basically converts the C type description (or to
be precise it's SDB representation) into the the sequence of pf commands. See more
about print format.
This command performs just temporary "cast". But if we want to link some address or
variable with the chosen type, we can use tl command to store the relationship in SDB.
183
Types
[0x000051c0]> tl S1 = 0x51cf
[0x000051c0]> tl
(S1)
x : 0x000051cf = [ 2315619660, 1207959810, 34803085 ]
y : 0x000051db = [ 2370306049, 4293315645, 3860201471, 4093649307 ]
z : 0x000051eb = 4464399
[0x000051c0]>
Moreover, the link will be shown in the disassembly output or visual mode:
Note, that if you define one of the fields as a char , you will get UTF8/ASCII string printed:
[0x00000000]> px 32
- offset - 0 1 2 3 4 5 6 7 8 9 A B C D E F 0123456789ABCDEF
0x00000000 2261 7364 6173 6422 1010 2349 8473 5733 "asdasd"..#I.sW3
0x00000010 9420 3492 3400 0000 0000 0000 0000 0000 . 4.4...........
[0x00000000]> "td struct mystruct { char str[6]; int b; void *p; uint64_t l;};"
[0x00000000]> ts
mystruct
[0x00000000]> tp mystruct
str : 0x00000000 = "asdas
b : 0x00000006 = 269492836
p : 0x0000000a = (qword)0x2094335773844923
l : 0x00000012 = (qword)0x0000000000349234
[0x00000000]>
To see the internal representation of the structure you can use tk command:
184
Types
[0x000051c0]> tk~S1
S1=struct
struct.S1=x,y,z
struct.S1.x=int32_t,0,3
struct.S1.x.meta=4
struct.S1.y=int32_t,12,4
struct.S1.y.meta=4
struct.S1.z=int32_t,28,0
struct.S1.z.meta=0
[0x000051c0]>
There is one more important aspect of using types in radare2 - it can change the immediate
in the opcode to the structure offset. Lets see a simple example of [R]SI-relative addressing
[0x000052f0]> pd 1
0x000052f0 488b4608 mov rax, qword [rsi + 8] ; [0x8:8]=0
[0x000052f0]>
Here 8 - is some offset in the memory, where rsi probably holds some structure pointer.
Imagine that we have the following structures
[0x000052f0]> "td struct ms { char b[8]; int member1; int member2; };"
[0x000052f0]> "td struct ms1 { uint64_t a; int member1; };"
[0x000052f0]> "td struct ms2 { uint16_t a; int64_t b; int member1; };"
Now we need to set the proper structure member offset instead of 8 in this instruction. At
first we need to list available types matching this offset:
[0x000052f0]> tas 8
ms.member1
ms1.member1
[0x000052f0]>
Note, that ms2 is not listed, because it has no members with offset 8 . After listing
available options we can link it to the chosen offset at the current address:
[0x000052f0]> ta ms1.member1
[0x000052f0]> pd 1
0x000052f0 488b4608 mov rax, qword [rsi + ms1.member1] ; [0x
8:8]=0
[0x000052f0]>
Internal representation
185
Types
Defining primitive types requires understanding of basic pf formats, you can find the whole
list of format specifier in pf?? :
-----------------------------------------------------------------
| format specifier | explanation |
|---------------------------------------------------------------|
| b | byte (unsigned) |
| c | char (signed byte) |
| d | 0x%%08x hexadecimal value (4 bytes) |
| f | float value (4 bytes) |
| i | %%i integer value (4 bytes) |
| o | 0x%%08o octal value (4 byte) |
| p | pointer reference (2, 4 or 8 bytes) |
| q | quadword (8 bytes) |
| s | 32bit pointer to string (4 bytes) |
| S | 64bit pointer to string (8 bytes) |
| t | UNIX timestamp (4 bytes) |
| T | show Ten first bytes of buffer |
| u | uleb128 (variable length) |
| w | word (2 bytes unsigned short in hex) |
| x | 0x%%08x hex value and flag (fd @ addr) |
| X | show formatted hexpairs |
| z | \0 terminated string |
| Z | \0 terminated wide string |
-----------------------------------------------------------------
there are basically 3 mandatory keys for defining basic data types: X=type
type.X=format_specifier type.X.size=size_in_bits For example, lets define UNIT ,
UINT=type
type.UINT=d
type.UINT.size=32
X.type.pointto=Y
This one may only be used in case of pointer type.X=p , one good example is LPFILETIME
definition, it is pointer to _FILETIME which happens to be a structure. Assuming that we are
targeting only 32 bit windows machine, it will be defined as the following:
LPFILETIME=type
type.LPFILETIME=p
type.LPFILETIME.size=32
type.LPFILETIME.pointto=_FILETIME
186
Types
that last field is not mandatory because some times the data structure internals will be
property, and we will not have a clean representation for it.
type.UINT.meta=4
This entry is for integration with C parser and carry the type class information: integer size,
signed/unsigned, etc.
Structures
Those are the basic keys for structs (with just two elements):
X=struct
struct.X=a,b
struct.X.a=a_type,a_offset,a_number_of_elements
struct.X.b=b_type,b_offset,b_number_of_elements
The first line is used to define a structure called X , second line defines the elements of X
as comma separated values. After that we just define each element info.
struct _FILETIME {
DWORD dwLowDateTime;
DWORD dwHighDateTime;
}
assuming we have DWORD defined, the struct will look like this
_FILETIME=struct
struct._FILETIME=dwLowDateTime,dwHighDateTime
struct._FILETIME.dwLowDateTime=DWORD,0,0
struct._FILETIME.dwHighDateTime=DWORD,4,0
Note that the number of elements filed is used in case of arrays only to identify how many
elements are in arrays, other than that it is zero by default.
Unions
Unions are defined exactly like structs the only difference is that you will replace the word
struct with the word union .
187
Types
Function prototypes
Function prototypes representation is the most detail oriented and the most important one
one of them all. Actually this is the one used directly for type matching
X=func
func.X.args=NumberOfArgs
func.x.arg0=Arg_type,arg_name
.
.
.
func.X.ret=Return_type
func.X.cc=calling_convention
It should be self explanatory lets do strncasecmp as an example for x86 arch for linux
machines According to man pages, strncasecmp is defined as the following:
When converting it into its sdb representation it will looks like the following:
strcasecmp=func
func.strcasecmp.args=3
func.strcasecmp.arg0=char *,s1
func.strcasecmp.arg1=char *,s2
func.strcasecmp.arg2=size_t,n
func.strcasecmp.ret=int
func.strcasecmp.cc=cdecl
Note that the .cc part is optional and if it didn't exist the default calling convention for your
target architecture will be used instead. Their is one extra optional key
func.x.noreturn=true/false
This key is used to mark functions that will not return once called like exit and _exit .
188
Rahash2
$ radare2 /bin/ls
[0x08049790]> bf entry0
[0x08049790]> ph md5
d2994c75adaa58392f953a448de5fba7
[0x00404890]> ph?
md5
sha1
sha256
sha384
sha512
crc16
crc32
md4
xor
xorpair
parity
entropy
hamdist
pcprint
mod255
xxhash
adler32
luhn
The ph command accepts an optional numeric argument to specify length of byte range to
be hashed, instead of default block size. For example:
[0x08049A80]> ph md5 32
9b9012b00ef7a94b5824105b7aaad83b
[0x08049A80]> ph md5 64
a71b087d8166c99869c9781e2edcf183
[0x08049A80]> ph md5 1024
a933cc94cd705f09a41ecc80c0041def
[0x08049A80]>
189
Rahash2
190
Rahash Tool
Rahash2
The rahash2 tool can be used to calculate checksums and has functions of byte streams,
files, text strings.
$ rahash2 -h
Usage: rahash2 [-rBhLkv] [-b sz] [-a algo] [-s str] [-f from] [-t to] [file] ...
-a algo comma separated list of algorithms (default is 'sha256')
-b bsize specify the size of the block (instead of full file)
-B show per-block hash
-e swap endian (use little endian)
-f from start hashing at given address
-i num repeat hash N iterations
-S seed use given seed (hexa or s:string) use ^ to prefix
-k show hash using the openssh's randomkey algorithm
-q run in quiet mode (only show results)
-L list all available algorithms (see -a)
-r output radare commands
-s string hash this string instead of files
-t to stop hashing at given address
-v show version information
It is possible to calculate hash values for contents of files. But do not attempt to do it for very
large files because rahash2 buffers the whole input in memory before computing the hash.
191
Rahash Tool
192
Debugger
Debugger
Debuggers are implemented as IO plugins. Therefore, radare can handle different URI types
for spawning, attaching and controlling processes. The complete list of IO plugins can be
viewed with r2 -L . Those that have "d" in the first column ("rwd") support debugging. For
example:
There are different backends for many target architectures and operating systems, e.g.,
GNU/Linux, Windows, MacOS X, (Net,Free,Open)BSD and Solaris.
Process memory is treated as a plain file. All mapped memory pages of a debugged
program and its libraries can be read and interpreted as code, data structures etc.
Communication between radare and the debugger IO layer is wrapped into system() calls,
which accept a string as an argument, and executes it as a command. An answer is then
buffered in the output console, its contents can be additionally processed by a script. Access
to the IO system is achieved with =! . Most IO plugins provide help with =!? or =!help .
For example:
$ r2 -d /bin/ls
...
[0x7fc15afa3cc0]> =!help
Usage: =!cmd args
=!ptrace - use ptrace io
=!mem - use /proc/pid/mem io if possible
=!pid - show targeted pid
=!pid <#> - select new pid
In general, debugger commands are portable between architectures and operating systems.
Still, as radare tries to support the same functionality for all target architectures and
operating systems, certain things have to be handled separately. They include injecting
shellcodes and handling exceptions. For example, in MIPS targets there is no hardware-
supported single-stepping feature. In this case, radare2 provides its own implementation for
single-step by using a mix of code analysis and software breakpoints.
193
Debugger
To restart your debugging session, you can type oo or oo+ , depending on desired
behavior.
194
Getting Started
Getting Started
native debugger, but does not run the program. You’ll see a prompt (radare2) - all
examples are from this prompt.
db flag : place a breakpoint at flag, where flag can be either an address or a function
name
db - flag : remove the breakpoint at flag, where flag can be either an address or a
function name
ood arg1 arg2 : reopen in debug mode with arg1 and arg2
195
Migration from ida, GDB or WinDBG
r2 ptrace://pid - same as above, but only for io (not debugger backend hooked)
r2 -c $@ | awk $@ - run thru awk get asm from function => [link]
[0x80480423]> #!c - enter C repl (see #! to list all available RLang plugins) => [video],
196
Migration from ida, GDB or WinDBG
shortcuts
Command IDA Pro radare2 r2 (visual mode) GDB
Analysis
Automatically aaa or -A (aaaa
Analysis of launched when or -AA for even
opening a experimental
N/A N/A
everything
binary analysis)
Navigation
xref to x axt x N/A
xref from ctrl + j axf X N/A
xref to graph ? agt [offset] ? N/A
xref from graph ? agf [offset] ? N/A
list functions alt + 1 afl;is t N/A
listing alt + 2 pdf p N/A
hex mode alt + 3 pxa P N/A
197
Migration from ida, GDB or WinDBG
Debugger
Detach ? o- ? detach
step into F7 ds s n
step into 4
? ds 4 F7 n 4
instructions
disassembly
N/A pd Vp disas
forward
disassembly N
N/A pd X Vp x/i
instructions
disas <a-o>
disassembly N
N/A pd -X Vp <a>
(backward)
198
Migration from ida, GDB or WinDBG
Information on the
bin
iS or S
Menu maint info
Sections/regions sections (append j for N/A
sections
json)
Load symbol file
asm.dwarf.file , add-
Sections/regions pdb menu N/A
pdb.XX ) symbol-file
BackTrace
dbt
Partial Backtrace ( dbg.btdepth
N/A N/A bt -
(outermost) dbg.btalgo )
thread
Stacktrace for all
N/A dbt@t N/A apply all
threads bt
Breakpoints
info
Breakpoint list Ctrl+Alt+B db ? breakpoints
Threads
Switch to thread Thread menu dp N/A thread <N>
Frames
any bt
Frame Numbers N/A ? N/A command
Parameters/Locals
info
Display parameters N/A afv N/A locals
Display
info
parameters/locals N/A afvj N/A locals
in json
list addresses
where vars are N/A afvR/afvW N/A ?
accessed(R/W)
199
Migration from ida, GDB or WinDBG
Project Related
Common features
r2 accepts FLIRT signatures
r2 can connect to GDB, LLVM and WinDbg
r2 can write/patch in place
r2 have fortunes and [s]easter eggs[/s]balls of steel
r2 can do basic loading of ELF core files from the box and MDMP (Windows
minidumps)
200
Registers
Registers
The registers are part of a user area stored in the context structure used by the scheduler.
This structure can be manipulated to get and set the values of those registers, and, for
example, on Intel hosts, it is possible to directly manipulate DR0-DR7 hardware registers to
set hardware breakpoints.
There are different commands to get values of registers. For the General Purpose ones use:
[0x4A13B8C0]> dr
r15 = 0x00000000
r14 = 0x00000000
r13 = 0x00000000
r12 = 0x00000000
rbp = 0x00000000
rbx = 0x00000000
r11 = 0x00000000
r10 = 0x00000000
r9 = 0x00000000
r8 = 0x00000000
rax = 0x00000000
rcx = 0x00000000
rdx = 0x00000000
rsi = 0x00000000
rdi = 0x00000000
oeax = 0x0000003b
rip = 0x7f20bf5df630
rsp = 0x7fff515923c0
Interaction between a plugin and the core is done by commands returning radare
instructions. This is used, for example, to set flags in the core to set values of registers.
201
Registers
An old copy of registers is stored all the time to keep track of the changes done during
execution of a program being analyzed. This old copy can be accessed with oregs .
202
Registers
[0x7f1fab84c630]> dro
r15 = 0x00000000
r14 = 0x00000000
r13 = 0x00000000
r12 = 0x00000000
rbp = 0x00000000
rbx = 0x00000000
r11 = 0x00000000
r10 = 0x00000000
r9 = 0x00000000
r8 = 0x00000000
rax = 0x00000000
rcx = 0x00000000
rdx = 0x00000000
rsi = 0x00000000
rdi = 0x00000000
oeax = 0x0000003b
rip = 0x7f1fab84c630
rflags = 0x00000200
rsp = 0x7fff386b5080
[0x7f1fab84c630]> dr
r15 = 0x00000000
r14 = 0x00000000
r13 = 0x00000000
r12 = 0x00000000
rbp = 0x00000000
rbx = 0x00000000
r11 = 0x00000000
r10 = 0x00000000
r9 = 0x00000000
r8 = 0x00000000
rax = 0x00000000
rcx = 0x00000000
rdx = 0x00000000
rsi = 0x00000000
rdi = 0x7fff386b5080
oeax = 0xffffffffffffffff
rip = 0x7f1fab84c633
rflags = 0x00000202
rsp = 0x7fff386b5080
To store and restore register values you can just dump the output of 'dr*' command to disk
and then re-interpret it again:
203
Registers
You can get a string which represents latest changes of registers using drd command (diff
registers):
[0x4A13B8C0]> drd
oeax = 0x0000003b was 0x00000000 delta 59
rip = 0x7f00e71282d0 was 0x00000000 delta -418217264
rflags = 0x00000200 was 0x00000000 delta 512
rsp = 0x7fffe85a09c0 was 0x00000000 delta -396752448
204
Reverse Debugging
Reverse Debugging
Radare2 has reverse debugger, that can seek program counter backward. (e.g. reverse-
next, reverse-continue in gdb) Firstly you need to save program state at the point that you
want to start recording. The syntax for recording is:
[0x004028a0]> dts+
You can use dts commands for recording and managing program states. After recording
the states, you can seek pc back and forth to any points after saved address. So after
recording, you can try single step back:
[0x004028a0]> 2dso
[0x004028a0]> dr rip
0x004028ae
[0x004028a0]> dsb
continue until 0x004028a2
hit breakpoint at: 4028a2
[0x004028a0]> dr rip
0x004028a2
When you run dsb , reverse debugger restore previous recorded state and execute
program from it until desired point.
[0x004028a0]> db 0x004028a2
[0x004028a0]> 10dso
[0x004028a0]> dr rip
0x004028b9
[0x004028a0]> dcb
[0x004028a0]> dr rip
0x004028a2
dcb seeks program counter until hit the latest breakpoint. So once set a breakpoint, you
[0x004028a0]> dts
session: 0 at:0x004028a0 ""
session: 1 at:0x004028c2 ""
205
Reverse Debugging
NOTE: Program records can be saved at any moments. These are diff style format that save
only different memory area from previous. It saves memory space rather than entire dump.
You can leave notes for each records to keep in your mind. dsb and dcb commands
restore the program state from latest record if there are many records.
Program records can exported to file and of course import it. Export/Import records to/from
file:
Moreover, you can do reverse debugging in ESIL mode. In ESIL mode, program state can
be managed by aets commands.
[0x00404870]> aets+
206
Remoting Capabilities
[0x00405a04]> =?
|Usage: =[:!+-=hH] [...] # radare remote command execution protocol
|
rap commands:
| = list all open connections
| =<[fd] cmd send output of local command to remote fd
| =[fd] cmd exec cmd at remote 'fd' (last open is default one)
| =! cmd run command via r_io_system
| =+ [proto://]host add host (default=rap://, tcp://, udp://)
| =-[fd] remove all hosts or host 'fd'
| ==[fd] open remote session with host 'fd', 'q' to quit
| =!= disable remote cmd mode
| !=! enable remote cmd mode
|
rap server:
| =:port listen on given port using rap protocol (o rap://9999)
| =&:port start rap server in background
| =:host:port cmd run 'cmd' command on remote server
|
http server:
| =h port listen for http connections (r2 -qc=H /bin/ls)
| =h- stop background webserver
| =h* restart current webserver
| =h& port start http server in background)
| =H port launch browser and listen for http
| =H& port launch browser and listen for http in background
|
gdbserver:
| =g port file listen on 'port' debugging 'file' using gdbserver
| =g! port file same as above, but debug protocol messages (like gdbserver --remo
te-debug)
You can learn radare2 remote capabilities by displaying the list of supported IO plugins:
radare2 -L .
A little example should make this clearer. A typical remote session might look like this:
207
Remoting Capabilities
$ radare2 rap://:1234
$ radare2 rap://:1234
At localhost:
$ radare2 -
; Add hosts
[0x004048c5]> =+ rap://<host1>:1234//bin/ls
Connected to: <host1> at port 1234
waiting... ok
[0x004048c5]> =
0 - rap://<host1>:1234//bin/ls
You can open remote files in debug mode (or using any IO plugin) specifying URI when
adding hosts:
[0x004048c5]> =+ =+ rap://<host2>:1234/dbg:///bin/ls
Connected to: <host2> at port 1234
waiting... ok
0 - rap://<host1>:1234//bin/ls
1 - rap://<host2>:1234/dbg:///bin/ls
[0x004048c5]> =0 px
[0x004048c5]> = s 0x666
[0x004048c5]> ==1
fd:6> pi 1
...
fd:6> q
208
Remoting Capabilities
[0x004048c5]> =-
You can also redirect radare output to a TCP or UDP server (such as nc -l ). First, Add the
server with '=+ tcp://' or '=+ udp://', then you can redirect the output of a command to be sent
to the server:
[0x004048c5]> =+ tcp://<host>:<port>/
Connected to: <host> at port <port>
5 - tcp://<host>:<port>/
[0x004048c5]> =<5 cmd...
The =<' command will send the output from the execution of cmd` to the remote connection
number N (or the last one used if no id specified).
209
Remote GDB
$ r2 -D gdb gdb://<host>:<port>
Or if the gdbserver is running in extended mode, you can attach to a process on the host
with:
$ r2 -D gdb gdb://<host>:<port>/<pid>
radare2 does not yet load symbols from gdbserver, so it needs the binary to be locally
present to load symbols from it. In case symbols are not loaded even if the binary is present,
you can try specifying the path with e dbg.exe.path :
If symbols are loaded at an incorrect base address, you can try specifying the base address
too with e bin.baddr :
Usually the gdbserver reports the maximum packet size it supports. Otherwise, radare2
resorts to sensible defaults. But you can specify the maximum packet size with the
environment variable R2_GDB_PKTSZ . You can also check and set the max packet size during
a session with the IO system, =! .
$ export R2_GDB_PKTSZ=512
$ r2 -D gdb gdb://<host>:<port>
= attach <pid> <tid>
Assuming filepath <path/to/exe>
[0x7ff659d9fcc0]> =!pktsz
packet size: 512 bytes
[0x7ff659d9fcc0]> =!pktsz 64
[0x7ff659d9fcc0]> =!pktsz
packet size: 64 bytes
210
Remote GDB
The gdb IO system provides useful commands which might not fit into any standard radare2
commands. You can get a list of these commands with =!? . (Remember, =! accesses the
underlying IO plugin's system() ).
[0x7ff659d9fcc0]> =!?
Usage: =!cmd args
=!pid - show targeted pid
=!pkt s - send packet 's'
=!monitor cmd - hex-encode monitor command and pass to target interpreter
=!detach [pid] - detach from remote/detach specific pid
=!inv.reg - invalidate reg cache
=!pktsz - get max packet size used
=!pktsz bytes - set max. packet size as 'bytes' bytes
=!exec_file [pid] - get file which was executed for current/specified pid
$ r2 -
[0x00000000]> =?
...
gdbserver:
| =g port file [args] listen on 'port' debugging 'file' using gdbserver
| =g! port file [args] same as above, but debug protocol messages (like gdbserver --re
mote-debug)
$ r2 -
[0x00000000]> =g 8000 /bin/radare2 -
And then connect to it like you would to any gdbserver. For example, with radare2:
$ r2 -D gdb gdb://localhost:8000
211
Remote WinDbg
WinDBG
The WinDBG support for r2 allows you to attach to VM running Windows using a named
socket file (will support more IOs in the future) to debug a windows box using the KD
interface over serial port.
Bear in mind that WinDBG support is still work-in-progress, and this is just an initial
implementation which will get better in time.
It is also possible to use the remote GDB interface to connect and debug Windows kernels
without depending on Windows capabilities.
bcdedit /debug on
bcdedit /dbgsettings serial debugport:1 baudrate:115200
Starting from Windows 8 there is no way to enforce debugging for every boot, but it is
possible to always show the advanced boot options, which allows to enable kernel
debugging:
Or like this for Windows XP: Open boot.ini and add /debug /debugport=COM1
/baudrate=115200:
[boot loader]
timeout=30
default=multi(0)disk(0)rdisk(0)partition(1)\WINDOWS
[operating systems]
multi(0)disk(0)rdisk(0)partition(1)\WINDOWS="Debugging with Cable" /fastdetect /debug
/debugport=COM1 /baudrate=57600
In case of VMWare
212
Remote WinDbg
Radare2 will use the 'windbg' io plugin to connect to a socket file created by virtualbox or
qemu. Also, the 'windbg' debugger plugin and we should specify the x86-32 too. (32 and 64
bit debugging is supported)
[0x828997b8]> pd 20
;-- eip:
0x828997b8 cc int3
0x828997b9 c20400 ret 4
0x828997bc cc int3
0x828997bd 90 nop
0x828997be c3 ret
0x828997bf 90 nop
In order to skip that trap we will need to change eip and run 'dc' twice:
dr eip=eip+1
dc
dr eip=eip+1
dc
213
Remote WinDbg
Now the Windows VM will be interactive again. We will need to kill r2 and attach again to get
back to control the kernel.
In addition, the dp command can be used to list all processes, and dpa or dp= to attach
to the process. This will display the base address of the process in the physical memory
layout.
214
Plugins
Plugins
It is common to have an issues when you write a plugin, especially if you do this for the first
time. This is why debugging them is very important. The first step for debugging is to set an
environment variable when running radare2 instance:
R_DEBUG=yes r2 /bin/ls
Loading /usr/local/lib/radare2/2.2.0-git//bin_xtr_dyldcache.so
Cannot find symbol 'radare_plugin' in library '/usr/local/lib/radare2/2.2.0-git//bin_x
tr_dyldcache.so'
Cannot open /usr/local/lib/radare2/2.2.0-git//2.2.0-git
Loading /home/user/.config/radare2/plugins/asm_mips_ks.so
PLUGIN OK 0x55b205ea6070 fcn 0x7f298de08762
Loading /home/user/.config/radare2/plugins/asm_sparc_ks.so
PLUGIN OK 0x55b205ea6070 fcn 0x7f298de08762
Cannot open /home/user/.config/radare2/plugins/pimp
Cannot open /home/user/.config/radare2/plugins/yara
Loading /home/user/.config/radare2/plugins/asm_arm_ks.so
PLUGIN OK 0x55b205ea6070 fcn 0x7f298de08762
Loading /home/user/.config/radare2/plugins/core_yara.so
Module version mismatch /home/user/.config/radare2/plugins/core_yara.so (2.1.0) vs (2.
2.0-git)
Loading /home/user/.config/radare2/plugins/asm_ppc_ks.so
PLUGIN OK 0x55b205ea6070 fcn 0x7f298de08762
Loading /home/user/.config/radare2/plugins/lang_python3.so
PLUGIN OK 0x55b205ea5ed0 fcn 0x7f298de08692
Loading /usr/local/lib/radare2/2.2.0-git/bin_xtr_dyldcache.so
Cannot find symbol 'radare_plugin' in library '/usr/local/lib/radare2/2.2.0-git/bin_xt
r_dyldcache.so'
Cannot open /usr/local/lib/radare2/2.2.0-git/2.2.0-git
Cannot open directory '/usr/local/lib/radare2-extras/2.2.0-git'
Cannot open directory '/usr/local/lib/radare2-bindings/2.2.0-git'
USER CONFIG loaded from /home/user/.config/radare2/radare2rc
-- In visual mode press 'c' to toggle the cursor mode. Use tab to navigate
[0x00005520]>
IO plugins
All access to files, network, debugger, etc. is wrapped by an IO abstraction layer that allows
radare to treat all data as if it were just a file.
IO plugins are the ones used to wrap the open, read, write and 'system' on virtual file
systems. You can make radare understand anything as a plain file. E.g., a socket
connection, a remote radare session, a file, a process, a device, a gdb session, etc..
215
Plugins
So, when radare reads a block of bytes, it is the task of an IO plugin to get these bytes from
any place and put them into internal buffer. An IO plugin is chosen by a file's URI to be
opened. Some examples:
Debugging URIs
$ r2 dbg:///bin/ls $ r2 pid://1927
Remote sessions
$ r2 rap://:1234 $ r2 rap://:1234//bin/ls
Virtual buffers
$ r2 -L
rw_ zip Open zip files apk://foo.apk//MANIFEST or zip://foo.apk//theclass/fun
.class, show files with: zip://foo.apk/, open all files with zipall:// (BSD)
rwd windbg Attach to a KD debugger (LGPL3)
rw_ sparse sparse buffer allocation (sparse://1024 sparse://) (LGPL3)
rw_ shm shared memory resources (shm://key) (LGPL3)
rw_ self read memory from myself using 'self://' (LGPL3)
rw_ rap radare network protocol (rap://:port rap://host:port/file) (LGPL3)
rwd ptrace ptrace and /proc/pid/mem (if available) io (LGPL3)
rw_ procpid /proc/pid/mem io (LGPL3)
rw_ mmap open file using mmap:// (LGPL3)
rw_ malloc memory allocation (malloc://1024 hex://cd8090) (LGPL3)
r__ mach mach debug io (unsupported in this platform) (LGPL)
rw_ ihex Intel HEX file (ihex://eeproms.hex) (LGPL)
rw_ http http get (http://radare.org/) (LGPL3)
rw_ gzip read/write gzipped files (LGPL3)
rwd gdb Attach to gdbserver, 'qemu -s', gdb://localhost:1234 (LGPL3)
r_d debug Debug a program or pid. dbg:///bin/ls, dbg://1388 (LGPL3)
rw_ bfdbg BrainFuck Debugger (bfdbg://path/to/file) (LGPL3)
216
Plugins
r_debug : debugger
The most basic feature you usually want to support from a specific architecture is the
disassembler. You first need to read into a human readable form the bytes in there.
Bear in mind that plugins can be compiled static or dynamically, this means that the arch will
be embedded inside the core libraries or it will distributed as a separated shared library.
To configure which plugins you want to compile use the ./configure-plugins script which
accepts the flags --shared and --static to specify them. You can also add it manually inside
the plugins.def.cfg and then remove the plugins.cfg and run ./configure-plugins
again to update the libr/config.mk and libr/config.h .
$ cd my-cpu
$ cat Makefile
NAME=mycpu
R2_PLUGIN_PATH=$(shell r2 -hh|grep LIBR_PLUGINS|awk '{print $$2}')
CFLAGS=-g -fPIC $(shell pkg-config --cflags r_asm)
LDFLAGS=-shared $(shell pkg-config --libs r_asm)
OBJS=$(NAME).o
SO_EXT=$(shell uname|grep -q Darwin && echo dylib || echo so)
LIB=$(NAME).$(SO_EXT)
all: $(LIB)
clean:
rm -f $(LIB) $(OBJS)
$(LIB): $(OBJS)
$(CC) $(CFLAGS) $(LDFLAGS) $(OBJS) -o $(LIB)
install:
cp -f $(NAME).$(SO_EXT) $(R2_PLUGIN_PATH)
uninstall:
rm -f $(R2_PLUGIN_PATH)/$(NAME).$(SO_EXT)
$ cat mycpu.c
/* example r_asm plugin by pancake at 2014 */
217
Plugins
#include <r_asm.h>
#include <r_lib.h>
#define OPS 17
218
Plugins
#ifndef CORELIB
struct r_lib_struct_t radare_plugin = {
.type = R_LIB_TYPE_ASM,
.data = &r_asm_plugin_mycpu
};
#endif
$ make
$ sudo make install
Let's open an empty file using the 'mycpu' arch and write some random code there.
219
Plugins
$ r2 -
-- I endians swap
[0x00000000]> e asm.arch=mycpu
[0x00000000]> woR
[0x00000000]> pd 10
0x00000000 888e mov r8, 14
0x00000002 b2a5 ifnot r10, r5
0x00000004 3f67 ret
0x00000006 7ef6 bl r15, r6
0x00000008 2701 xor r0, 1
0x0000000a 9826 mov r2, 6
0x0000000c 478d xor r8, 13
0x0000000e 6b6b store r6, 11
0x00000010 1382 add r8, r2
0x00000012 7f15 ret
libr/asm/p/asm_mycpu.c : implementation
Check out how the NIOS II cpu was implemented by reading those commits:
220
Plugins
2. Make a function with 2 subfunctions - assemble and disassemble and returning plugin
structure - for RAsm plugin
def mycpu(a):
def assemble(s):
return [1, 2, 3, 4]
def disassemble(buf):
try:
opcode = get_opcode(buf)
opstr = optbl[opcode][1]
return [4, opstr]
except:
return [4, "unknown"]
return {
"name" : "mycpu",
"arch" : "mycpu",
"bits" : 32,
"endian" : "little",
"license" : "GPL",
"desc" : "MYCPU disasm",
"assemble" : assemble,
"disassemble" : disassemble,
}
221
Plugins
def mycpu_anal(a):
def set_reg_profile():
profile = "=PC pc\n" + \
"=SP sp\n" + \
"gpr r0 .32 0 0\n" + \
"gpr r1 .32 4 0\n" + \
"gpr r2 .32 8 0\n" + \
"gpr r3 .32 12 0\n" + \
"gpr r4 .32 16 0\n" + \
"gpr r5 .32 20 0\n" + \
"gpr sp .32 24 0\n" + \
"gpr pc .32 28 0\n"
return profile
except:
result = analop
# Don't forget to return proper instruction size!
return [4, result]
222
Plugins
return {
"name" : "mycpu",
"arch" : "mycpu",
"bits" : 32,
"license" : "GPL",
"desc" : "MYCPU anal",
"esil" : 1,
"set_reg_profile" : set_reg_profile,
"op" : op,
}
You can combine everything in one file and load it using -i option:
r2 -I mycpu.py some_file.bin
See also:
Python
Javascript
If you want to add support for the gdb, you can see the register profile in the active gdb
session using command maint print registers .
223
Plugins
More to come..
Related article: http://radare.today/posts/extending-r2-with-new-plugins/
Extensa:
https://github.com/radare/radare2/commit/6f1655c49160fe9a287020537afe0fb8049085
d7
Malbolge: https://github.com/radare/radare2/pull/579
6502: https://github.com/radare/radare2/pull/656
h8300: https://github.com/radare/radare2/pull/664
GBA: https://github.com/radare/radare2/pull/702
CR16: https://github.com/radare/radare2/pull/721/ &&
https://github.com/radare/radare2/pull/726
XCore:
https://github.com/radare/radare2/commit/bb16d1737ca5a471142f16ccfa7d444d2713a5
4d
Sharp LH5801:
https://github.com/neuschaefer/radare2/commit/f4993cca634161ce6f82a64596fce45fe6
b818e7
MSP430: https://github.com/radare/radare2/pull/1426
HP PA-RISC:
https://github.com/radare/radare2/commit/f8384feb6ba019b91229adb8fd6e0314b0656f
7b
V810: https://github.com/radare/radare2/pull/2899
TMS320: https://github.com/radare/radare2/pull/596
Z80:
https://github.com/radare/radare2/commit/8ff6a92f65331cf8ad74cd0f44a60c258b137a0
6
224
Plugins
NAME=anal_snes
R2_PLUGIN_PATH=$(shell r2 -hh|grep LIBR_PLUGINS|awk '{print $$2}')
CFLAGS=-g -fPIC $(shell pkg-config --cflags r_anal)
LDFLAGS=-shared $(shell pkg-config --libs r_anal)
OBJS=$(NAME).o
SO_EXT=$(shell uname|grep -q Darwin && echo dylib || echo so)
LIB=$(NAME).$(SO_EXT)
all: $(LIB)
clean:
rm -f $(LIB) $(OBJS)
$(LIB): $(OBJS)
$(CC) $(CFLAGS) $(LDFLAGS) $(OBJS) -o $(LIB)
install:
cp -f anal_snes.$(SO_EXT) $(R2_PLUGIN_PATH)
uninstall:
rm -f $(R2_PLUGIN_PATH)/anal_snes.$(SO_EXT)
anal_snes.c:
225
Plugins
#include <string.h>
#include <r_types.h>
#include <r_lib.h>
#include <r_asm.h>
#include <r_anal.h>
#include "snes_op_table.h"
static int snes_anop(RAnal *anal, RAnalOp *op, ut64 addr, const ut8 *data, int len) {
memset (op, '\0', sizeof (RAnalOp));
op->size = snes_op[data[0]].len;
op->addr = addr;
op->type = R_ANAL_OP_TYPE_UNK;
switch (data[0]) {
case 0xea:
op->type = R_ANAL_OP_TYPE_NOP;
break;
}
return op->size;
}
#ifndef CORELIB
struct r_lib_struct_t radare_plugin = {
.type = R_LIB_TYPE_ANAL,
.data = &r_anal_plugin_snes,
.version = R2_VERSION
};
#endif
226
Plugins
snes_op_table.h:
https://github.com/radare/radare2/blob/master/libr/asm/arch/snes/snes_op_table.h
Example:
6502:
https://github.com/radare/radare2/commit/64636e9505f9ca8b408958d3c01ac8e3ce254
a9b
SNES:
https://github.com/radare/radare2/commit/60d6e5a1b9d244c7085b22ae8985d0002762
4b49
NAME=bin_nes
R2_PLUGIN_PATH=$(shell r2 -hh|grep LIBR_PLUGINS|awk '{print $$2}')
CFLAGS=-g -fPIC $(shell pkg-config --cflags r_bin)
LDFLAGS=-shared $(shell pkg-config --libs r_bin)
OBJS=$(NAME).o
SO_EXT=$(shell uname|grep -q Darwin && echo dylib || echo so)
LIB=$(NAME).$(SO_EXT)
all: $(LIB)
clean:
rm -f $(LIB) $(OBJS)
$(LIB): $(OBJS)
$(CC) $(CFLAGS) $(LDFLAGS) $(OBJS) -o $(LIB)
install:
cp -f $(NAME).$(SO_EXT) $(R2_PLUGIN_PATH)
uninstall:
rm -f $(R2_PLUGIN_PATH)/$(NAME).$(SO_EXT)
bin_nes.c:
227
Plugins
#include <r_bin.h>
static void * load_bytes(RBinFile *arch, const ut8 *buf, ut64 sz, ut64 loadaddr, Sdb *
sdb){
check_bytes (buf, sz);
return R_NOTNULL;
}
if (!arch || !arch->buf) {
free (ret);
return NULL;
}
ret->file = strdup (arch->file);
ret->type = strdup ("ROM");
ret->machine = strdup ("Nintendo NES");
ret->os = strdup ("nes");
ret->arch = strdup ("6502");
ret->bits = 8;
return ret;
}
228
Plugins
.check = &check,
.baddr = NULL,
.check_bytes = &check_bytes,
.entries = NULL,
.sections = NULL,
.info = &info,
};
#ifndef CORELIB
struct r_lib_struct_t radare_plugin = {
.type = R_LIB_TYPE_BIN,
.data = &r_bin_plugin_nes,
.version = R2_VERSION
};
#endif
Some Examples
XBE - https://github.com/radare/radare2/pull/972
COFF - https://github.com/radare/radare2/pull/645
TE - https://github.com/radare/radare2/pull/61
Zimgz -
https://github.com/radare/radare2/commit/d1351cf836df3e2e63043a6dc728e880316f00
eb
OMF -
https://github.com/radare/radare2/commit/44fd8b2555a0446ea759901a94c06f20566bb
c40
1. import r2lang
load
load_bytes
destroy
check_bytes
baddr
entries
sections
229
Plugins
imports
relocs
binsym
info
def le_format(a):
def load(binf):
return [0]
def check_bytes(buf):
try:
if buf[0] == 77 and buf[1] == 90:
lx_off, = struct.unpack("<I", buf[0x3c:0x40])
if buf[lx_off] == 76 and buf[lx_off+1] == 88:
return [1]
return [0]
except:
return [0]
and so on. Please be sure of the parameters for each function and format of returns.
Note, that functions entries , sections , imports , relocs returns a list of special
formed dictionaries - each with a different type. Other functions return just a list of
numerical values, even if single element one. There is a special function, which returns
information about the file - info :
def info(binf):
return [{
"type" : "le",
"bclass" : "le",
"rclass" : "le",
"os" : "OS/2",
"subsystem" : "CLI",
"machine" : "IBM",
"arch" : "x86",
"has_va" : 0,
"bits" : 32,
"big_endian" : 0,
"dbg_info" : 0,
}]
3. This structure should contain a pointers to the most important functions like
check_bytes , load and load_bytes , entries , relocs , imports , etc.
230
Plugins
return {
"name" : "le",
"desc" : "OS/2 LE/LX format",
"license" : "GPL",
"load" : load,
"load_bytes" : load_bytes,
"destroy" : destroy,
"check_bytes" : check_bytes,
"baddr" : baddr,
"entries" : entries,
"sections" : sections,
"imports" : imports,
"symbols" : symbols,
"relocs" : relocs,
"binsym" : binsym,
"info" : info,
}
231
Crackmes
Crackmes
Crackmes (from "crack me" challenge) are the training ground for reverse engineering
people. This section will go over tutorials on how to defeat various crackmes using r2.
232
IOLI
IOLI CrackMes
The IOLI crackme is a good starting point for learning r2. This is a set of tutorials based on
the tutorial at dustri
233
IOLI 0x00
IOLI 0x00
This is the first IOLI crackme, and the easiest one.
$ ./crackme0x00
IOLI Crackme Level 0x00
Password: 1234
Invalid Password!
The first thing to check is if the password is just plaintext inside the file. In this case, we don't
need to do any disassembly, and we can just use rabin2 with the -z flag to search for strings
in the binary.
$ rabin2 -z ./crackme0x00
vaddr=0x08048568 paddr=0x00000568 ordinal=000 sz=25 len=24 section=.rodata type=a stri
ng=IOLI Crackme Level 0x00\n
vaddr=0x08048581 paddr=0x00000581 ordinal=001 sz=11 len=10 section=.rodata type=a stri
ng=Password:
vaddr=0x0804858f paddr=0x0000058f ordinal=002 sz=7 len=6 section=.rodata type=a string
=250382
vaddr=0x08048596 paddr=0x00000596 ordinal=003 sz=19 len=18 section=.rodata type=a stri
ng=Invalid Password!\n
vaddr=0x080485a9 paddr=0x000005a9 ordinal=004 sz=16 len=15 section=.rodata type=a stri
ng=Password OK :)\n
So we know what the following section is, this section is the header shown when the
application is run.
234
IOLI 0x00
But what is this? It's a string, but we haven't seen it in running the application yet.
$ ./crackme0x00
IOLI Crackme Level 0x00
Password: 250382
Password OK :)
So we now know that 250382 is the password, and have completed this crackme.
235
IOLI 0x01
IOLI 0x01
This is the second IOLI crackme.
$ ./crackme0x01
IOLI Crackme Level 0x01
Password: test
Invalid Password!
$ rabin2 -z ./crackme0x01
vaddr=0x08048528 paddr=0x00000528 ordinal=000 sz=25 len=24 section=.rodata type=a stri
ng=IOLI Crackme Level 0x01\n
vaddr=0x08048541 paddr=0x00000541 ordinal=001 sz=11 len=10 section=.rodata type=a stri
ng=Password:
vaddr=0x0804854f paddr=0x0000054f ordinal=002 sz=19 len=18 section=.rodata type=a stri
ng=Invalid Password!\n
vaddr=0x08048562 paddr=0x00000562 ordinal=003 sz=16 len=15 section=.rodata type=a stri
ng=Password OK :)\n
This isn't going to be as easy as 0x00. Let's try disassembly with r2.
236
IOLI 0x01
"aa" tells r2 to analyze the whole binary, which gets you symbol names, among things.
237
IOLI 0x01
Disassemble
Function
This will print the disassembly of the main function, or the main() that everyone knows. You
can see several things as well: weird names, arrows, etc.
"imp." stands for imports. Those are imported symbols, like printf()
If you look carefully, you'll see a cmp instruction, with a constant, 0x149a. cmp is an x86
compare instruction, and the 0x in front of it specifies it is in base 16, or hex (hexadecimal).
[0x08048330]> ? 0x149a
5274 0x149a 012232 5.2K 0000:049a 5274 10011010 5274.0 0.000000
So now we know that 0x149a is 5274 in decimal. Let's try this as a password.
$ ./crackme0x01
IOLI Crackme Level 0x01
Password: 5274
Password OK :)
Bingo, the password was 5274. In this case, the password function at 0x0804842b was
comparing the input against the value, 0x149a in hex. Since user input is usually decimal, it
was a safe bet that the input was intended to be in decimal, or 5274. Now, since we're
hackers, and curiosity drives us, let's see what happens when we input in hex.
$ ./crackme0x01
IOLI Crackme Level 0x01
Password: 0x149a
Invalid Password!
It was worth a shot, but it doesn't work. That's because scanf() will take the 0 in 0x149a to
be a zero, rather than accepting the input as actually being the hex value.
238
IOLI 0x01
239
IOLI 0x01
.intro
After a few years of missing out on wargames at Hacktivity, this year I've finally found the
time to begin, and almost finish (yeah, I'm quite embarrassed about that unfinished webhack
:) ) one of them. There were 3 different games at the conf, and I've chosen the one that was
provided by avatao. It consisted of 8 challenges, most of them being basic web hacking
stuff, one sandbox escape, one simple buffer overflow exploitation, and there were two
reverse engineering exercises too. You can find these challenges on
https://platform.avatao.com.
240
IOLI 0x01
.radare2
I've decided to solve the reversing challenges using radare2, a free and open source
reverse engineering framework. I have first learned about r2 back in 2011. during a huge
project, where I had to reverse a massive, 11MB statically linked ELF. I simply needed
something that I could easily patch Linux ELFs with. Granted, back then I've used r2
alongside IDA, and only for smaller tasks, but I loved the whole concept at first sight. Since
then, radare2 evolved a lot, and I was planning for some time now to solve some crackmes
with the framework, and write writeups about them. Well, this CTF gave me the perfect
opportunity :)
Because this writeup aims to show some of r2's features besides how the crackmes can be
solved, I will explain every r2 command I use in blockquote paragraphs like this one:
If you know r2, and just interested in the crackme, feel free to skip those parts! Also keep in
mind please, that because of this tutorial style I'm going to do a lot of stuff that you just don't
do during a CTF, because there is no time for proper bookkeeping (e.g. flag every memory
area according to its purpose), and with such small executables you can succeed without
doing these stuff.
A few advice if you are interested in learning radare2 (and frankly, if you are into RE, you
should be interested in learning r2 :) ):
The framework has a lot of supplementary executables and a vast amount of functionality -
and they are very well documented. I encourage you to read the available docs, and use the
built-in help (by appending a ? to any command) extensively! E.g.:
241
IOLI 0x01
[0x00000000]> ?
Usage: [.][times][cmd][~grep][@[@iter]addr!size][|>pipe] ; ...
Append '?' to any char command to get detailed help
Prefix with number to repeat command N times (f.ex: 3x)
|%var =valueAlias for 'env' command
| *off[=[0x]value] Pointer read/write data/values (see ?v, wx, wv)
| (macro arg0 arg1) Manage scripting macros
| .[-|(m)|f|!sh|cmd] Define macro or load r2, cparse or rlang file
| = [cmd] Run this command via rap://
| / Search for bytes, regexps, patterns, ..
| ! [cmd] Run given command as in system(3)
| # [algo] [len] Calculate hash checksum of current block
| #!lang [..] Hashbang to run an rlang script
| a Perform analysis of code
| b Get or change block size
...
[0x00000000]> a?
|Usage: a[abdefFghoprxstc] [...]
| ab [hexpairs] analyze bytes
| aa analyze all (fcns + bbs) (aa0 to avoid sub renaming)
| ac [cycles] analyze which op could be executed in [cycles]
| ad analyze data trampoline (wip)
| ad [from] [to] analyze data pointers to (from-to)
| ae [expr] analyze opcode eval expression (see ao)
| af[rnbcsl?+-*] analyze Functions
| aF same as above, but using anal.depth=1
...
Also, the project is under heavy development, there is no day without commits to the GitHub
repo. So, as the readme says, you should always use the git version!
Cheatsheet by pwntester
Radare2 Book
Radare2 Blog
Radare2 Wiki
242
IOLI 0x01
.first_steps
OK, enough of praising r2, lets start reversing this stuff. First, you have to know your enemy:
r2 tip: rabin2 is one of the handy tools that comes with radare2. It can be used to
extract information (imports, symbols, libraries, etc.) about binary executables. As
always, check the help (rabin2 -h)!
So, its a dynamically linked, stripped, 64bit Linux executable - nothing fancy here. Let's try to
run it:
243
IOLI 0x01
OK, so it reads a number as a size from the standard input first, than reads further, probably
"size" bytes/characters, processes this input, and outputs either "Wrong!", nothing or
something else, presumably our flag. But do not waste any more time monkeyfuzzing the
executable, let's fire up r2, because in asm we trust!
r2 tip: The -A switch runs aaa command at start to analyze all referenced code, so we
will have functions, strings, XREFS, etc. right at the beginning. As usual, you can get
help with ?.
It is a good practice to create a project, so we can save our progress, and we can come
back at a later time:
[0x00400720]> Ps avatao_reverse4
avatao_reverse4
[0x00400720]>
r2 tip: You can save a project using Ps [file], and load one using Po [file]. With the -p
option, you can load a project when starting r2.
[0x00400720]> fs strings
[0x00400720]> f
0x00400e98 7 str.Wrong_
0x00400e9f 27 str.We_are_in_the_outer_space_
0x00400f80 18 str.Size_of_data:__u_n
0x00400f92 23 str.Such_VM__MuCH_reV3rse_
0x00400fa9 16 str.Use_everything_
0x00400fbb 9 str.flag.txt
0x00400fc7 26 str.You_won__The_flag_is:__s_n
0x00400fe1 21 str.Your_getting_closer_
[0x00400720]>
r2 tip: r2 puts so called flags on important/interesting offsets, and organizes these flags
into flagspaces (strings, functions, symbols, etc.) You can list all flagspaces using fs,
and switch the current one using fs [flagspace] (the default is *, which means all the
flagspaces). The command f prints all flags from the currently selected flagspace(s).
OK, the strings looks interesting, especially the one at 0x00400f92. It seems to hint that this
crackme is based on a virtual machine. Keep that in mind!
244
IOLI 0x01
These strings could be a good starting point if we were talking about a real-life application
with many-many features. But we are talking about a crackme, and they tend to be small
and simple, and focused around the problem to be solved. So I usually just take a look at the
entry point(s) and see if I can figure out something from there. Nevertheless, I'll show you
how to find where these strings are used:
r2 tip: We can list crossreferences to addresses using the axt [addr] command
(similarly, we can use axf to list references from the address). The @@ is an iterator, it
just runs the command once for every arguments listed.
The argument list in this case comes from the command f~[0]. It lists the strings from
the executable with f, and uses the internal grep command ~ to select only the first
column ([0]) that contains the strings' addresses.
245
IOLI 0x01
.main
As I was saying, I usually take a look at the entry point, so let's just do that:
[0x00400720]> s main
[0x00400c63]>
r2 tip: You can go to any offset, flag, expression, etc. in the executable using the s
command (seek). You can use references, like $$ (current offset), you can undo (s-) or
redo (s+) seeks, search strings (s/ [string]) or hex values (s/x 4142), and a lot of other
useful stuff. Make sure to check out s?!
Now that we are at the beginning of the main function, we could use p to show a
disassembly (pd, pdf), but r2 can do something much cooler: it has a visual mode, and it can
display graphs similar to IDA, but way cooler, since they are ASCII-art graphs :)
r2 tip: The command family p is used to print stuff. For example it can show
disassembly (pd), disassembly of the current function (pdf), print strings (ps), hexdump
(px), base64 encode/decode data (p6e, p6d), or print raw bytes (pr) so you can for
example dump parts of the binary to other files. There are many more functionalities,
check ?!
R2 also has a minimap view which is incredibly useful for getting an overall look at a
function:
246
IOLI 0x01
247
IOLI 0x01
r2 tip: With command V you can enter the so-called visual mode, which has several
views. You can switch between them using p and P. The graph view can be displayed
by hitting V in visual mode (or using VV at the prompt).
Hitting p in graph view will bring up the minimap. It displays the basic blocks and the
connections between them in the current function, and it also shows the disassembly of
the currently selected block (marked with @@@@@ on the minimap). You can select
the next or the previous block using the *\* and the *\\* keys respectively. You can also
select the true or the false branches using the t and the f keys.
It is possible to bring up the prompt in visual mode using the : key, and you can use o to
seek.
Lets read main node-by-node! The first block looks like this:
We can see that the program reads a word (2 bytes) into the local variable named
local_10_6, and than compares it to 0xbb8. Thats 3000 in decimal, btw:
[0x00400c63]> ? 0xbb8
3000 0xbb8 05670 2.9K 0000:0bb8 3000 10111000 3000.0 0.000000f 0.000000
r2 tip: yep, ? will evaluate expressions, and print the result in various formats.
248
IOLI 0x01
First, the "Size of data: " message we saw when we run the program is printed. So now we
know that the local variable local_10_6 is the size of the input data - so lets name it
accordingly (remember, you can open the r2 shell from visual mode using the : key!):
r2 tip: The af command family is used to analyze functions. This includes manipulating
arguments and local variables too, which is accessible via the afv commands. You can
list function arguments (afa), local variables (afv), or you can even rename them (afan,
afvn). Of course there are lots of other features too - as usual: use the "?", Luke!
249
IOLI 0x01
After this an input_size bytes long memory chunk is allocated, and filled with data from the
standard input. The address of this memory chunk is stored in local_10 - time to use afvn
again:
We've almost finished with this block, there are only two things remained. First, an 512
(0x200) bytes memory chunk is zeroed out at offset 0x00602120. A quick glance at XREFS
to this address reveals that this memory is indeed used somewhere in the application:
r2 tip: Flags can be managed using the f command family. We've just added the flag
sym.memory to a 0x200 bytes long memory area at 0x602120. It is also possible to
remove (f-name), rename (fr [old] [new]), add comment (fC [name] [cmt]) or even color
(fc [name] [color]) flags.
While we are here, we should also declare that memory chunk as data, so it will show up as
a hexdump in disassembly view:
r2 tip: The command family C is used to manage metadata. You can set (CC) or edit
(CC) comments, declare memory areas as data (Cd), strings (Cs), etc. These
commands can also be issued via a menu in visual mode invoked by pressing d.
The only remaining thing in this block is a function call to 0x400a45 with the input data as an
argument. The function's return value is compared to "*", and a conditional jump is executed
depending on the result.
Earlier I told you that this crackme is probably based on a virtual machine. Well, with that
information in mind, one can guess that this function will be the VM's main loop, and the
input data is the instructions the VM will execute. Based on this hunch, I've named this
250
IOLI 0x01
r2 tip: The af command is used to analyze a function with a given name at the given
address. The other two commands should be familiar from earlier.
After renaming local variables, flagging that memory area, and renaming the VM loop
function the disassembly looks like this:
So, back to that conditional jump. If vmloop returns anything else than "*", the program just
exits without giving us our flag. Obviously we don't want that, so we follow the false branch.
251
IOLI 0x01
Now we see that a string in that 512 bytes memory area (sym.memory) gets compared to
"Such VM! MuCH reV3rse!". If they are not equal, the program prints the bytecode, and
exits:
OK, so now we know that we have to supply a bytecode that will generate that string when
executed. As we can see on the minimap, there are still a few more branches ahead, which
probably means more conditions to meet. Lets investigate them before we delve into
vmloop!
If you take a look at the minimap of the whole function, you can probably recognize that
there is some kind of loop starting at block [0d34], and it involves the following nodes:
[0d34]
[0d65]
[0d3d]
[0d61]
Here are the assembly listings for those blocks. The first one puts 0 into local variable
local_10_4:
And this one compares local_10_4 to 8, and executing a conditional jump based on the
result:
252
IOLI 0x01
It's pretty obvious that local_10_4 is the loop counter, so lets name it accordingly:
The memory area at 0x6020e0 is treated as an array of dwords (4 byte values), and
checked if the ith value of it is zero. If it is not, the loop simply continues:
253
IOLI 0x01
If the value is zero, the loop breaks and this block is executed before exiting:
It prints the following message: Use everything!" As we've established earlier, we are dealing
with a virtual machine. In that context, this message probably means that we have to use
every available instructions. Whether we executed an instruction or not is stored at
0x6020e0 - so lets flag that memory area:
Assuming we don't break out and the loop completes, we are moving on to some more
checks:
This piece of code may look a bit strange if you are not familiar with x86_64 specific stuff. In
particular, we are talking about RIP-relative addressing, where offsets are described as
displacements from the current instruction pointer, which makes implementing PIE easier.
Anyways, r2 is nice enough to display the actual address (0x602104). Got the address, flag
it!
254
IOLI 0x01
Keep in mind though, that if RIP-relative addressing is used, flags won't appear directly in
the disassembly, but r2 displays them as comments:
If sym.good_if_ne_zero is zero, we get a message ("Your getting closer!"), and then the
program exits. If it is non-zero, we move to the last check:
Here the program compares a dword at 0x6020f0 (again, RIP-relative addressing) to 9. If its
greater than 9, we get the same "Your getting closer!" message, but if it's lesser, or equal to
9, we finally reach our destination, and get the flag:
255
IOLI 0x01
Well, it seems that we have fully reversed the main function. To summarize it: the program
reads a bytecode from the standard input, and feeds it to a virtual machine. After VM
execution, the program's state have to satisfy these conditions in order to reach the goodboy
code:
This concludes our analysis of the main function, we can now move on to the VM itself.
256
IOLI 0x01
.vmloop
[offset]> fcn.vmloop
Well, that seems disappointingly short, but no worries, we have plenty to reverse yet. The
thing is that this function uses a jump table at 0x00400a74,
and r2 can't yet recognize jump tables (Issue 3201), so the analysis of this function is a bit
incomplete. This means that we can't really use the graph view now, so either we just use
visual mode, or fix those basic blocks. The entire function is just 542 bytes long, so we
certainly could reverse it without the aid of the graph mode, but since this writeup aims to
include as much r2 wisdom as possible, I'm going to show you how to define basic blocks.
But first, lets analyze what we already have! First, rdi is put into local_3. Since the
application is a 64bit Linux executable, we know that rdi is the first function argument (as
you may have recognized, the automatic analysis of arguments and local variables was not
257
IOLI 0x01
entirely correct), and we also know that vmloop's first argument is the bytecode. So lets
rename local_3:
Next, sym.memory is put into another local variable at rbp-8 that r2 did not recognize. So
let's define it!
r2 tip: The afv [idx] [name] [type] command is used to define local variable at [frame
pointer - idx] with the name [name] and type [type]. You can also remove local variables
using the afv- [idx] command.
In the next block, the program checks one byte of bytecode, and if it is 0, the function returns
with 1.
If that byte is not zero, the program subtracts 0x41 from it, and compares the result to 0x17.
If it is above 0x17, we get the dreaded "Wrong!" message, and the function returns with 0.
This basically means that valid bytecodes are ASCII characters in the range of "A" (0x41)
through "X" (0x41 + 0x17). If the bytecode is valid, we arrive at the code piece that uses the
jump table:
The jump table's base is at 0x400ec0, so lets define that memory area as a series of
qwords:
258
IOLI 0x01
[0x00400a74]> s 0x00400ec0
[0x00400ec0]> Cd 8 @@=`?s $$ $$+8*0x17 8`
r2 tip: Except for the ?s, all parts of this command should be familiar now, but lets
recap it! Cd defines a memory area as data, and 8 is the size of that memory area. @@
is an iterator that make the preceding command run for every element that @@ holds.
In this example it holds a series generated using the ?s command. ?s simply generates
a series from the current seek ($$) to current seek + 80x17 ($$+80x17) with a step of 8.
This is how the disassembly looks like after we add this metadata:
[0x00400ec0]> pd 0x18
; DATA XREF from 0x00400a76 (unk)
0x00400ec0 .qword 0x0000000000400a80
0x00400ec8 .qword 0x0000000000400c04
0x00400ed0 .qword 0x0000000000400b6d
0x00400ed8 .qword 0x0000000000400b17
0x00400ee0 .qword 0x0000000000400c04
0x00400ee8 .qword 0x0000000000400c04
0x00400ef0 .qword 0x0000000000400c04
0x00400ef8 .qword 0x0000000000400c04
0x00400f00 .qword 0x0000000000400aec
0x00400f08 .qword 0x0000000000400bc1
0x00400f10 .qword 0x0000000000400c04
0x00400f18 .qword 0x0000000000400c04
0x00400f20 .qword 0x0000000000400c04
0x00400f28 .qword 0x0000000000400c04
0x00400f30 .qword 0x0000000000400c04
0x00400f38 .qword 0x0000000000400b42
0x00400f40 .qword 0x0000000000400c04
0x00400f48 .qword 0x0000000000400be5
0x00400f50 .qword 0x0000000000400ab6
0x00400f58 .qword 0x0000000000400c04
0x00400f60 .qword 0x0000000000400c04
0x00400f68 .qword 0x0000000000400c04
0x00400f70 .qword 0x0000000000400c04
0x00400f78 .qword 0x0000000000400b99
As we can see, the address 0x400c04 is used a lot, and besides that there are 9 different
addresses. Lets see that 0x400c04 first!
259
IOLI 0x01
We get the message "Wrong!", and the function just returns 0. This means that those are not
valid instructions (they are valid bytecode though, they can be e.g. parameters!) We should
flag 0x400c04 accordingly:
As for the other offsets, they all seem to be doing something meaningful, so we can assume
they belong to valid instructions. I'm going to flag them using the instructions' ASCII values:
Ok, so these offsets were not on the graph, so it is time to define basic blocks for them!
r2 tip: You can define basic blocks using the afb+ command. You have to supply what
function the block belongs to, where does it start, and what is its size. If the block ends
in a jump, you have to specify where does it jump too. If the jump is a conditional jump,
the false branch's destination address should be specified too.
We can get the start and end addresses of these basic blocks from the full disasm of
vmloop.
260
IOLI 0x01
261
IOLI 0x01
As I've mentioned previously, the function itself is pretty short, and easy to read, especially
with our annotations. But a promise is a promise, so here is how we can create the missing
bacic blocks for the instructions:
It is also apparent from the disassembly that besides the instructions there are three more
basic blocks. Lets create them too!
Note that the basic blocks starting at 0x00400c15 and 0x00400c2d ending in a conditional
jump, so we had to set the false branch's destination too!
And here is the graph in its full glory after a bit of manual restructuring:
262
IOLI 0x01
I think it worth it, don't you? :) (Well, the restructuring did not really worth it, because it is
apparently not stored when you save the project.)
r2 tip: You can move the selected node around in graph view using the HJKL keys.
BTW, here is how IDA's graph of this same function looks like for comparison:
263
IOLI 0x01
As we browse through the disassembly of the instr_LETTER basic blocks, we should realize
a few things. The first: all of the instructions starts with a sequence like these:
It became clear now that the 9 dwords at sym.instr_dirty are not simply indicators that an
instruction got executed, but they are used to count how many times an instruction got
called. Also I should have realized earlier that sym.good_if_le_9 (0x6020f0) is part of this 9
dword array, but yeah, well, I didn't, I have to live with it... Anyways, what the condition
"sym.good_if_le_9 have to be lesser or equal 9" really means is that instr_P can not be
executed more than 9 times:
Another similarity of the instructions is that 7 of them calls a function with either one or two
parameters, where the parameters are the next, or the next two bytecodes. One parameter
example:
264
IOLI 0x01
We should also realize that these blocks put the number of bytes they eat up of the bytecode
(1 byte instruction + 1 or 2 bytes arguments = 2 or 3) into a local variable at 0xc. r2 did not
recognize this local var, so lets do it manually!
If we look at instr_J we can see that this is an exception to the above rule, since it puts the
return value of the called function into instr_ptr_step instead of a constant 2 or 3:
And speaking of exceptions, here are the two instructions that do not call functions:
265
IOLI 0x01
This one simply puts the next bytecode (the first the argument) into eax, and jumps to the
end of vmloop. So this is the VM's ret instruction, and we know that vmloop has to return "*",
so "R*" should be the last two bytes of our bytecode.
This is a one argument instruction, and it puts its argument to 0x6020c0. Flag that address!
Oh, and by the way, I do have a hunch that instr_C also had a function call in the original
code, but it got inlined by the compiler. Anyways, so far we have these two instructions:
instr_I
instr_D
instr_P
instr_X
instr_J
instr_A
instr_S
What remains is the reversing of the seven functions that are called by the instructions, and
finally the construction of a valid bytecode that gives us the flag.
instr_A
The function this instruction calls is at offset 0x40080d, so lets seek there!
[offset]> 0x40080d
266
IOLI 0x01
r2 tip: In visual mode you can just hit \ when the current line is a jump or a call, and r2
will seek to the destination address.
If we seek to that address from the graph mode, we are presented with a message that says
"Not in a function. Type 'df' to define it here. This is because the function is called from a
basic block r2 did not recognize, so r2 could not find the function either. Lets obey, and type
df! A function is indeed created, but we want some meaningful name for it. So press dr while
still in visual mode, and name this function instr_A!
r2 tip: You should realize that these commands are all part of the same menu system in
visual mode I was talking about when we first used Cd to declare sym.memory as data.
Ok, now we have our shiny new fcn.instr_A, lets reverse it! We can see from the shape of
the minimap that probably there is some kind cascading if-then-elif, or a switch-case
statement involved in this function. This is one of the reasons the minimap is so useful: you
can recognize some patterns at a glance, which can help you in your analysis (remember
the easily recognizable for loop from a few paragraphs before?) So, the minimap is cool and
useful, but I've just realized that I did not yet show you the full graph mode, so I'm going to
do this using full graph. The first basic blocks:
267
IOLI 0x01
The two function arguments (rdi and rsi) are stored in local variables, and the first is
compared to 0. If it is, the function returns (you can see it on the minimap), otherwise the
same check is executed on the second argument. The function returns from here too, if the
argument is zero. Although this function is really tiny, I am going to stick with my
methodology, and rename the local vars:
And we have arrived to the predicted switch-case statement, and we can see that arg1's
value is checked against "M", "P", and "C".
268
IOLI 0x01
It basically loads an address from offset 0x602088 and adds arg2 to the byte at that
address. As r2 kindly shows us in a comment, 0x602088 initially holds the address of
sym.memory, the area where we have to construct the "Such VM! MuCH reV3rse!" string. It
is safe to assume that somehow we will be able to modify the value stored at 0x602088, so
this "M" branch will be able to modify bytes other than the first. Based on this assumption, I'll
flag 0x602088 as sym.current_memory_ptr:
269
IOLI 0x01
Yes, this is the piece of code that allows us to modify sym.current_memory_ptr: it adds arg2
to it.
Well, it turned out that instr_C is not the only instruction that modifies
sym.written_by_instr_C: this piece of code adds arg2 to it.
And that was instr_A, lets summarize it! Depending on the first argument, this instruction
does the following:
instr_S
This function is not recognized either, so we have to manually define it like we did with
instr_A. After we do, and take a look at the minimap, scroll through the basic blocks, it is
pretty obvious that these two functions are very-very similar. We can use radiff2 to see the
difference.
r2 tip: radiff2 is used to compare binary files. There's a few options we can control the
type of binary diffing the tool does, and to what kind of output format we want. One of
the cool features is that it can generate DarumGrim-style bindiff graphs using the -g
option.
270
IOLI 0x01
Since now we want to diff two functions from the same binary, we specify the offsets with -g,
and use reverse4 for both binaries. Also, we create the graphs for comparing instr_A to
instr_S and for comparing instr_S to instr_A.
271
IOLI 0x01
A sad truth reveals itself after a quick glance at these graphs: radiff2 is a liar! In theory, grey
boxes should be identical, yellow ones should differ only at some offsets, and red ones
should differ seriously. Well this is obviously not the case here - e.g. the larger grey boxes
are clearly not identical. This is something I'm definitely going to take a deeper look at after
I've finished this writeup.
Anyways, after we get over the shock of being lied to, we can easily recognize that instr_S is
basically a reverse-instr_A: where the latter does addition, the former does subtraction. To
summarize this:
instr_I
272
IOLI 0x01
This one is simple, it just calls instr_A(arg1, 1). As you may have noticed the function call
looks like call fcn.0040080d instead of call fcn.instr_A . This is because when you save
and open a project, function names get lost - another thing to examine and patch in r2!
instr_D
instr_P
It's local var rename time again!
273
IOLI 0x01
This function is pretty straightforward also, but there is one oddity: const_M is never used. I
don't know why it is there - maybe it is supposed to be some kind of distraction? Anyways,
this function simply writes arg1 to sym.current_memory_ptr, and than calls instr_I("P"). This
basically means that instr_P is used to write one byte, and put the pointer to the next byte.
So far this would seem the ideal instruction to construct most of the "Such VM! MuCH
reV3rse!" string, but remember, this is also the one that can be used only 9 times!
instr_X
Another simple one, rename local vars anyways!
instr_J
274
IOLI 0x01
This one is not as simple as the previous ones, but it's not that complicated either. Since I'm
obviously obsessed with variable renaming:
After the result of arg1 & 0x3f is put into a local variable, arg1 & 0x40 is checked against 0. If
it isn't zero, arg1_and_0x3f is negated:
The next branching: if arg1 >= 0, then the function returns arg1_and_0x3f,
275
IOLI 0x01
276
IOLI 0x01
277
IOLI 0x01
.instructionset
We've now reversed all the VM instructions, and have a full understanding about how it
works. Here is the VM's instruction set:
278
IOLI 0x01
.bytecode
Well, we did the reverse engineering part, now we have to write a program for the VM with
the instruction set described in the previous paragraph. Here is the program's functional
specification:
Since this document is about reversing, I'll leave the programming part to the fellow reader :)
But I'm not going to leave you empty-handed, I'll give you one advice: Except for "J", all of
the instructions are simple, easy to use, and it should not be a problem to construct the
"Such VM! MuCH reV3rse!" using them. "J" however is a bit complicated compared to the
others. One should realize that its sole purpose is to make sym.good_if_ne_zero bigger than
zero, which is a requirement to access the flag. In order to increment sym.good_if_ne_zero,
three conditions should be met:
I think this is enough information, you can go now and write that program. Or, you could just
reverse engineer the quick'n'dirty one I've used during the CTF:
Keep in mind though, that it was written on-the-fly, parallel to the reversing phase - for
example there are parts that was written without the knowledge of all possible instructions.
This means that the code is ugly and unefficient.
279
IOLI 0x01
280
IOLI 0x01
.outro
Well, what can I say? Such VM, much reverse! :)
What started out as a simple writeup for a simple crackme, became a rather lengthy
writeup/r2 tutorial, so kudos if you've read through it. I hope you enjoyed it (I know I did), and
maybe even learnt something from it. I've surely learnt a lot about r2 during the process, and
I've even contributed some small patches, and got a few ideas of more possible
improvements.
281
Reference Card
Survival Guide
Command Description
aa Auto analyze
pdf@fcn(Tab) Disassemble function
f fcn(Tab) List functions
f str(Tab) List strings
fr [flagname] [newname] Rename flag
psz [offset] Print string
Flagspaces
Command Description
fs Display flagspaces
fs * Select all flagspaces
Flags
Command Description
f List flags
282
Reference Card
Information
Command Description
ii Information on imports
iI Info on binary
ie Display entrypoint
iS Display sections
ir Display relocations
Print string
Command Description
psz [offset] Print zero terminated string
psb [offset] Print strings in current block
psx [offset] Show string with scaped chars
psp [offset] Print pascal string
Visual mode
Command Description
c Toggle (c)ursor
q Back to Radare shell
sS Step/step over
283
Reference Card
C Toggle (C)olors
R Randomize color palette (ecr)
t Track flags (browse symbols, functions..)
T Browse anal info and comments
v Visual code analysis menu
V/W (V)iew graph (agv?), open (W)ebUI
uU Undo/redo seek
x Show xrefs to seek between them
Searching
284
Reference Card
Command Description
Saving
Command Description
Po [file] Open project
285
Reference Card
$S Section offset
$SS Section size
$j Jump address (jmp 0x10, jz 0x10 => 0x10)
$f Jump fail address (jz 0x10 => next instruction)
$I Number of instructions of current function
$F Current function size
$Jn Get nth jump of function
$Cn Get nth call of function
License
This chapter is based on the Radare 2 reference card by Thanat0s, which is under the GNU
GPL. Original license is as follows:
This card may be freely distributed under the terms of the GNU
general public licence — Copyright c by Thanat0s - v0.1 -
286
Reference Card
287