Running Untrusted Application Code: Sandboxing
Running Untrusted Application Code: Sandboxing
ghostview, outlook
! Goal:
Approach: confinement
! Confinement: ensure application does not deviate from
pre-approved behavior
Hardware: run application on isolated hw (air gap) w difficult to manage Virtual machines: isolate OS s on single hardware System call interposition: w Isolates a process in a single operating system Isolating threads sharing same address space: w Software Fault Isolation (SFI) Application specific: e.g. browser-based confinement
n n
Implementing confinement
! Key component:
n
reference monitor
Mediates requests from applications w Implements protection policy w Enforces isolation and confinement Must always be invoked: w Every application request must be mediated Tamperproof: w Reference monitor cannot be killed w or if killed, then monitored process is killed too Small enough to be analyzed and validated
A simple example:
! To use do: (must be root)
chroot /tmp/guest su guest
chroot
! Now
open( /etc/passwd ,
r )
Jailkit
Problem: all utility progs (ls, ps, vi) must live inside jail jailkit project: auto builds files, libs, and dirs needed in jail environment jk_init: creates jail environment jk_check: checks jail env for security problems checks for any modified programs, checks for world writable directories, etc. jk_lsh: restricted shell to be used inside jail note: simple chroot jail does not limit network access
open( /tmp/guest/../../etc/passwd ,
otherwise jailed app can do: w create dummy file /aaa/etc/passwd w run chroot /aaa w run su root to become root (bug in Ultrix 4.0)
Freebsd jail
! Stronger mechanism than simple chroot ! To run:
jail jail-path hostname IP-addr cmd
n n
(no
../../
escape)
can only bind to sockets with specified IP address and authorized ports can only communicate with process inside jail root is limited, e.g. cannot load kernel modules
n n
All or nothing access to file system Inappropriate for apps like web browser w Needs read access to files outside jail (e.g. for sending attachments in gmail)
Accessing network and messing with other machines Trying to crash host OS
! Idea:
n
! Implementation options:
n n n
Completely kernel space (e.g. GSWTK) Completely user space (e.g. program shepherding) Hybrid (e.g. Systrace)
Initial implementation
! Linux ptrace:
(Janus)
process tracing tracing process calls: ptrace ( , pid_t pid , ) and wakes up when pid makes sys call.
user space
monitor
OS Kernel
Complications
! If app forks, monitor must also fork
n
! If monitor crashes, app must be killed ! Monitor must maintain all OS state associated with app
n n
current-working-dir (CWD), UID, EUID, GID Whenever app does cd path monitor must also update its CWD w otherwise: relative path requests interpreted incorrectly
Trace all system calls or none w e.g. no need to trace close system call Monitor cannot abort sys-call without killing app
Example: symlink: me -> mydata.dat proc 1: open( me ) monitor checks and authorizes not atomic proc 2: me -> /etc/passwd OS executes open( me ) Classic TOCTOU bug: time-of-check / time-of-use
time
n
systrace permit/deny
OS Kernel
(saves context switches) ! systrace resolves sym-links and replaces sys-call path arguments by full path to target ! When app calls execve, monitor loads new policy file
Policy
! Sample policy file:
path allow /tmp/* path deny /etc/passwd network deny all
Systrace can auto-gen policy by learning how app behaves on good inputs If policy does not cover a specific sys-call, ask user but user has no way to decide
outer sandbox: restricts capabilities using sys call interposition Inner sandbox: uses x86 memory segmentation to isolate application memory from one another
Virtual Machines
VM2
Apps
Apps
VM1
Guest OS 2
Host OS
Hardware
Example: NSA NetTop
Guest OS 1
Few computers, lots of users VMs allow many users to shares a single computer non-existent
Too many computers, too few users w Print server, Mail server, Web server, File server, Database server, Wasteful to run each service on a different computer w VMs save hardware while isolating services More generally: VMs heavily used in cloud computing
Malware can infect guest OS and guest apps But malware cannot escape from the infected VM
VMM is much simpler than full OS but device drivers run in Host OS
malware
VMM
interrupts,
VMM Introspection:
[GR 03]
Kernel root kit can shutdown protection system Common practice for modern malware run IDS system in the network Problem: insufficient visibility into user s machine
(protected from malware)
! Standard solution:
n
VMM can monitor virtual hardware for anomalies VMI: Virtual Machine Introspection
w Allows VMM to check Guest OS internals
Sample checks
Stealth malware: n Creates processes that are invisible to ps n Opens sockets that are invisible to netstat 1. Lie detector check n Goal: detect stealth malware that hides processes and network activity n Method:
w VMM lists processes running in GuestOS w VMM requests GuestOS to list processes (e.g. ps) w If mismatch, kill VM
Sample checks
2. Application code integrity detector n VMM computes hash of user app-code running in VM n Compare to whitelist of hashes w Kills VM if unknown program appears 3. Ensure GuestOS kernel integrity n example: detect changes to sys_call_table 4. Virus signature detector n Run virus signature detector on GuestOS memory 5. Detect if GuestOS puts NIC in promiscuous mode
Subvirt
! Virus idea:
n n n
Once on the victim machine, install a malicious VMM Virus hides in VMM Invisible to virus detector running inside VM Anti-virus
HW
Anti-virus
OS HW
OS
VMM and virus
The MATRIX
VMM Detection
! Can an OS detect it is running on top of a VMM? ! Applications:
n n
Virus detector can detect VMBR Normal virus (non-VMBR) can detect VMM w refuse to run to avoid reverse engineering Software that binds to hardware refuse to run on top of VMM
(e.g. MS Windows)
can
VMM detection
n
2. VMM introduces time latency variances n Memory cache behavior differs in presence of VMM n Results in relative latency in time variations for any two operations 3. VMM shares the TLB with GuestOS n GuestOS can detect reduced TLB size and many more methods
[GAWF 07]
VMM Detection
Bottom line: The perfect VMM does not exist
confine apps running in same address space Codec code should not interfere with media player Device drivers should not corrupt kernel
Problem: slow if apps communicate frequently w requires context switch per message
code segment
app #1
app #2
Locate unsafe instructions: jmp, load, store w At compile time, add guards before unsafe instructions w When loading code, ensure all guard are present
Compiler pretends these registers don t exist not Guard ensures code does dr2 contains segment ID R12 [addr]
dr1 addr scratch-reg (dr1 >> 20) compare scratch-reg and dr2 trap if not equal R12 [addr]
callee domain
draw: return
stub
! Only stubs allowed to make cross-domain jumps ! Jump table contains allowed exit points from callee
n
! Performance
n
Usually good:
mpeg_play, 4% slowdown
variable length instructions: unclear where to put guards few registers: can t dedicate three to SFI many instructions affect memory: more guards needed
Summary
! Many sandboxing techniques:
n n n n n
Physical air gap, Virtual air gap (VMMs), System call interposition Software Fault isolation Application specific (e.g. Javascript in browser)
Specifying policy:
THE END