Instrumenting Cpython With Dtrace and Systemtap: Guido Van Rossum and The Python Development Team
Instrumenting Cpython With Dtrace and Systemtap: Guido Van Rossum and The Python Development Team
SystemTap
Release 3.7.1rc1
Contents
5 SystemTap Tapsets 6
6 Examples 7
Index 8
or:
On macOS, you can list available DTrace probes by running a Python process in the background and listing
all probes made available by the Python provider:
$ python3.6 -q &
$ sudo dtrace -l -P python$! # or: dtrace -l -m python3.6
On Linux, you can verify if the SystemTap static markers are present in the built binary by seeing if it
contains a “.note.stapsdt” section.
If you’ve built Python as a shared library (with –enable-shared), you need to look instead within the shared
library. For example:
$ readelf -n ./python
The above metadata contains information for SystemTap describing how it can patch strategically-placed
machine code instructions to enable the tracing hooks used by a SystemTap script.
The following example DTrace script can be used to show the call/return hierarchy of a Python script, only
tracing within the invocation of a function called “start”. In other words, import-time function invocations
are not going to be listed:
python$target:::function-entry
/copyinstr(arg1) == "start"/
{
self->trace = 1;
}
python$target:::function-entry
/self->trace/
{
printf("%d\t%*s:", timestamp, 15, probename);
printf("%*s", self->indent, "");
printf("%s:%s:%d\n", basename(copyinstr(arg0)), copyinstr(arg1), arg2);
self->indent++;
(continues on next page)
(continued from previous page)
}
python$target:::function-return
/self->trace/
{
self->indent--;
printf("%d\t%*s:", timestamp, 15, probename);
printf("%*s", self->indent, "");
printf("%s:%s:%d\n", basename(copyinstr(arg0)), copyinstr(arg1), arg2);
}
python$target:::function-return
/copyinstr(arg1) == "start"/
{
self->trace = 0;
}
156641360502280 function-entry:call_stack.py:start:23
156641360518804 function-entry: call_stack.py:function_1:1
156641360532797 function-entry: call_stack.py:function_3:9
156641360546807 function-return: call_stack.py:function_3:10
156641360563367 function-return: call_stack.py:function_1:2
156641360578365 function-entry: call_stack.py:function_2:5
156641360591757 function-entry: call_stack.py:function_1:1
156641360605556 function-entry: call_stack.py:function_3:9
156641360617482 function-return: call_stack.py:function_3:10
156641360629814 function-return: call_stack.py:function_1:2
156641360642285 function-return: call_stack.py:function_2:6
156641360656770 function-entry: call_stack.py:function_3:9
156641360669707 function-return: call_stack.py:function_3:10
156641360687853 function-entry: call_stack.py:function_4:13
156641360700719 function-return: call_stack.py:function_4:14
156641360719640 function-entry: call_stack.py:function_5:18
156641360732567 function-return: call_stack.py:function_5:21
156641360747370 function-return:call_stack.py:start:28
probe process("python").mark("function__entry") {
filename = user_string($arg1);
funcname = user_string($arg2);
lineno = $arg3;
probe process("python").mark("function__return") {
filename = user_string($arg1);
funcname = user_string($arg2);
lineno = $arg3;
$ stap \
show-call-hierarchy.stp \
-c "./python test.py"
probe process("python").mark("function__entry") {
probe process("python").library("libpython3.6dm.so.1.0").mark("function__entry") {
5 SystemTap Tapsets
The higher-level way to use the SystemTap integration is to use a “tapset”: SystemTap’s equivalent of a
library, which hides some of the lower-level details of the static markers.
Here is a tapset file, based on a non-shared build of CPython:
/*
Provide a higher-level wrapping around the function__entry and
function__return markers:
\*/
probe python.function.entry = process("python").mark("function__entry")
{
filename = user_string($arg1);
funcname = user_string($arg2);
lineno = $arg3;
frameptr = $arg4
}
probe python.function.return = process("python").mark("function__return")
{
filename = user_string($arg1);
funcname = user_string($arg2);
lineno = $arg3;
(continues on next page)
(continued from previous page)
frameptr = $arg4
}
If this file is installed in SystemTap’s tapset directory (e.g. /usr/share/systemtap/tapset), then these
additional probepoints become available:
python.function.entry(str filename, str funcname, int lineno, frameptr)
This probe point indicates that execution of a Python function has begun. It is only triggered for
pure-Python (bytecode) functions.
python.function.return(str filename, str funcname, int lineno, frameptr)
This probe point is the converse of python.function.return(), and indicates that execution of a
Python function has ended (either via return, or via an exception). It is only triggered for pure-
Python (bytecode) functions.
6 Examples
This SystemTap script uses the tapset above to more cleanly implement the example given above of tracing
the Python function-call hierarchy, without needing to directly name the static markers:
probe python.function.entry
{
printf("%s => %s in %s:%d\n",
thread_indent(1), funcname, filename, lineno);
}
probe python.function.return
{
printf("%s <= %s in %s:%d\n",
thread_indent(-1), funcname, filename, lineno);
}
The following script uses the tapset above to provide a top-like view of all running CPython code, showing
the top 20 most frequently-entered bytecode frames, each second, across the whole system:
global fn_calls;
probe python.function.entry
{
fn_calls[pid(), filename, funcname, lineno] += 1;
}
probe timer.ms(1000) {
printf("\033[2J\033[1;1H") /* clear screen \*/
printf("%6s %80s %6s %30s %6s\n",
"PID", "FILENAME", "LINE", "FUNCTION", "CALLS")
foreach ([pid, filename, funcname, lineno] in fn_calls- limit 20) {
printf("%6d %80s %6d %30s %6d\n",
pid, filename, lineno, funcname,
fn_calls[pid, filename, funcname, lineno]);
}
delete fn_calls;
}
Index
F
function__entry (C function), 5
function__return (C function), 6
G
gc__done (C function), 6
gc__start (C function), 6
I
import__find__load__done (C function), 6
import__find__load__start (C function), 6
L
line (C function), 6
P
python.function.entry (C function), 7
python.function.return (C function), 7