0% found this document useful (0 votes)

91 views

Identifying Performance Issues Beyond Oracle Wait

Uploaded by

fqchina

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

91 views

Identifying Performance Issues Beyond Oracle Wait

Uploaded by

fqchina

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Identifying performance issues beyond the

Oracle wait interface 

 
 
 
 
 
Stefan Koehler!

11.11.15! Page 1!
About me!
Stefan Koehler!
•  Independent Oracle performance consultant and researcher!
•  12+ years using Oracle RDBMS!
•  Oracle performance and internals geek!
•  Main interests: Cost based optimizer and Oracle RDBMS internals!
!

Focus & Services: “It is all about performance” !

•  Oracle performance tuning (e.g. Application, CBO, Database, Design, SQL)!
•  Oracle core internals researching (e.g. DTrace, GDB, Perf, etc.)!
•  Troubleshooting nontrivial Oracle RDBMS issues (e.g. Heap dumps, System State
dumps, etc.)!
•  Services are mainly based on short-term contracting ! !!
! !!
! !www.soocs.de ! [email protected] ! !@OracleSK !
11.11.15! Page 2!
Agenda!
•  Systematic troubleshooting - What are we talking about?!
•  “System call trace” vs. “Stack trace”!
•  Capturing and interpreting “Stack traces” with focus on Linux!
•  Safety warning - Are “Stack traces” safe to use in production?!
•  Combine Oracle wait interface and “Stack traces”!
•  Real life root cause identified + fixed with help of “Stack traces”!

11.11.15! Page 3!
Systematic troubleshooting - What are  
we talking about? (1)!

11.11.15! Page 4!
Systematic troubleshooting - What are  
we talking about? (2)!
1.  Identify performance bottleneck based on response time
Method R by Cary Millsap!
Business process is affected by
single SQL running on CPU only

2.  Interpret execution plan with help of additional SQL execution

statistics (or Real-Time SQL Monitoring) and wait interface!
•  PL/SQL package DBMS_XPLAN or DBMS_SQLTUNE!
No execution plan issue found
!

3.  Capture and interpret session statistics and performance

counters!
•  Tools like Snapper by Tanel Poder!
Still no obvious root cause
! for the high CPU load

4.  Capture and interpret system call or stack traces!

•  This is what this session is about. Disassembling Oracle code.!
11.11.15! Page 5!
“System call trace” vs. “Stack trace”!
•  System call trace!
•  A system call is the fundamental interface between an application and the (Linux)
kernel and is generally not invoked directly, but rather via wrapper functions in
glibc (or some other library). For example: truncate() à truncate() or truncate64()!
•  Example of Oracle using system calls: gettimeofday(), pread(), etc.!
•  Tools: Strace (Linux), Truss (AIX / Solaris), Tusc (HP-UX)!
•  Be aware of vDSO / vSyscall64 feature when tracing system calls on Linux!
!

•  Stack trace / Stack backtrace!

•  A call stack is the list of names of methods called at run time from the beginning of
a program until the execution of the current statement!
•  Tools: Oradebug (Oracle), GDB + wrappers or Perf or SystemTap (Linux),
! ! DTrace (Solaris), Procstack (AIX)!
! The stack trace includes the called methods / functions of an Oracle process
and the system call trace includes only the (function) requests to the OS kernel
11.11.15! Page 6!
Capturing “Stack traces” with focus on  
Linux (1)!
•  Tool “Oradebug” (Oracle tool and platform independent)!
!SQL>oradebug SETMYPID / SETOSPID <PID>
SQL> oradebug SHORT_STACK Code path of oradebug request - SIGUSR2 signal
+<NUM> = Offset in bytes from beginning of symbol
! (function) where child function call happened

11.11.15! Page 7!
Capturing “Stack traces” with focus on  
Linux (2)!
•  Tool “GDB” (GNU debugger) and its wrapper script pstack!
shell> gdb shell> /usr/bin/pstack <PID>
(gdb) attach <PID>
(gdb) backtrace

GDB is based on ptrace() system calls

11.11.15! Page 8!
Capturing “Stack traces” with focus on  
Linux (3)!
•  Performance counters for Linux (Linux kernel-based subsystem)!
•  Framework for collecting and analyzing performance data, e.g. hardware events,
including retired instructions and processor clock cycles and many more!
•  Based on sampling (default avg. 1000 Hz respectively 1000 samples/sec)!
•  Caution in virtualized environments when capturing cpu-cycles events (VMware
KB #2030221)!
•  Tool ”Perf” is based on perf_events interface exported by Linux kernel (>= 2.6.31)!
shell> perf record -e cpu-cycles -o /tmp/perf.out -g -p <PID>
Hardware event (cpu-cycles) = Usage of kernel’s performance registers
! Software event (cpu-clock) = Depends on timer interrupt

•  Poor man’s stack profiling!

•  When no other tool is available and you need a quick insight into sampled stacks!
shell> export LC_ALL=C ; for i in {1..20} ; do pstack <PID>
| ./os_explain -a ; done | sort -r | uniq -c!
Script by Tanel Poder to translate C function names into known functionality
11.11.15! Page 9!
Capturing “Stack traces” with focus on  
Linux (4)!
•  Listing other capturing tools for completeness!
•  OStackProf by Tanel Poder (needs to be run from Windows SQL*Plus client as
based on oradebug short_stack and VBS script for post processing)!

•  DTrace on Solaris (e.g. DTrace toolkit script “hotuser” by Brendan Gregg or

analysis with PID provider)!

•  DTrace on Linux lacks in case of userspace integration / probing!

•  SystemTap (with Linux kernel >= 3.5 for userspace probing) otherwise “utrace
patch” needs to be applied!

!
11.11.15! Page 10!
Interpreting “Stack traces” with focus on  
Linux!
•  Performance counters for Linux (Linux kernel-based subsystem)!
•  Tool “Perf”!
shell> perf report -i /tmp/perf.out -g none -n --stdio
shell> perf report -i /tmp/perf.out -g graph -n --stdio !
!Problem: Depending on the stack trace content there may be too much data to
!interpret in this format. Main question: Where is the bulk of CPU time spent?!
!
•  Tool “Flame Graph” by Brendan Gregg (works with DTrace & SystemTap too)!
!shell> perf script -i /tmp/perf.out | ./stackcollapse-
perf.pl > out.perf.folded
shell>./flamegraph.pl out.perf.folded > perf-out.svg

11.11.15! Page 11!

Safety warning - Are “Stack traces” safe  
to use in production?!
•  If your database is already in such a state …! ! ! ! !
… then don’t worry about the possible ! ! !
consequences and issues by capturing ! ! ! ! !
stack traces!
!
•  Be aware of different behavior by capturing stack traces, if only
some specific business processes are affected !
•  Tool “Oradebug” - “Unsafe” as it alters code path / SIGUSR2 (e.g bug #15677306)!
•  Tool “GDB” (and its wrappers) - “Unsafe” as it suspends the process (ptrace
syscall) with possible impact on communication to kernel or other processes!
•  Tool “Perf” based on Linux performance counters - Safe by design, but fallback to
the other tools is still needed, if the process is not running on CPU and stuck
somewhere else!
•  DTrace (Solaris) - Safe by design!

11.11.15! Page 12!

Combine Oracle wait interface and  
“Stack traces”!
•  Fulltime.sh by Craig Shallahamer and Frits Hoogland!
•  Based on V$SESSION_EVENT and Linux performance counters!
shell> fulltime.sh <PID> <SAMPLE_DURATION> <SAMPLE_COUNT>

•  Oracle 12c enhancement - Diagnostic event “wait_event[]” in

"new" kernel diagnostics & tracing infrastructure!
SQL> oradebug doc event name wait_event
wait_event: event to control wait event post-wakeup actions
SQL> alter session set events 'wait_event["<wait event name>"]
trace("%s\n", shortstack())';
Combine extended SQL trace & event wait_event[]
Function kslwtectx marks end of wait event

11.11.15! Page 13!

Real life root cause identified + fixed  
with help of “Stack traces” (1)!
•  Environment and issue!
•  Large SAP system with Oracle 11.2.0.2 running on AIX 6.1 !
•  Most of the SAP work processes are stuck in a simple INSERT statement and
burning up all CPUs on database server!
•  Index key compression and OLTP compression is enabled!
•  SQL statement:!
SQL> INSERT INTO "BSIS” VALUES(:A0 , ... ,:A81);

•  Applying systematic troubleshooting!

•  Identify performance bottleneck based on response time with Method R!
!Performance bottleneck is clearly caused by the INSERT statement as 100%
!of the end user response time is spent on it and all application processes are
!affected by this !
!No further response time analysis needed here!
11.11.15! ! Page 14!
Real life root cause identified + fixed  
with help of “Stack traces” (2)!
•  Applying systematic troubleshooting!
•  Interpret execution plan with help of additional SQL execution statistics (or Real-
Time SQL Monitoring) and wait interface!
!

11.11.15! Page 15!

Real life root cause identified + fixed  
with help of “Stack traces” (3)!
•  Applying systematic troubleshooting!
•  Capture and interpret session statistics and performance counters!
!

11.11.15! Page 16!

Real life root cause identified + fixed  
with help of “Stack traces” (4)!
•  Applying systematic troubleshooting!
•  Capture and interpret session statistics and performance counters!

11.11.15! Page 17!

Real life root cause identified + fixed  
with help of “Stack traces” (5)!
•  Applying systematic troubleshooting!
•  Capture and interpret system call or stack traces!
!!
!
!
!

•  Process is stuck in main call stack “ktspscan_bmb” + on-top functions. The

high CPU usage (“session logical reads”) is the consequence of it!
•  Table “BSIS” is stored in an ASSM tablespace and the call stack “ktspfsrch <-
ktspscan_bmb” is related to “first level bitmap block search”!
•  MOS search results in bug #13641076 – “HIGH AMOUNT OF BUFFER GETS
FOR INSERT STATEMENT REJECTIONLIST DOES NOT FIRE”!
•  Root cause found and can be fixed by applying corresponding patch!

11.11.15! ! Page 18!

!
!
Questions and answers!
!
!
!
!
!
!
!
!!
Download links and further information to all mentioned tools and procedures
! are in the reference section of the manuscript
! ! !!
! !!
! !www.soocs.de ! [email protected] ! !@OracleSK !
11.11.15! Page 19!

Always (Morris Gleitzman) (Z-Library)
No ratings yet
Always (Morris Gleitzman) (Z-Library)
223 pages
2PX4 Table Staad Report Documents
No ratings yet
2PX4 Table Staad Report Documents
23 pages
Rootkits Subverting The Windows Kernel
50% (2)
Rootkits Subverting The Windows Kernel
524 pages
Innovations in Portland Cement Manufacturing
80% (5)
Innovations in Portland Cement Manufacturing
1,283 pages
Extreme Replication - Performance Tuning Oracle GoldenGate by Bobby Curtis (UTOUG 2015 Fall Conference)
No ratings yet
Extreme Replication - Performance Tuning Oracle GoldenGate by Bobby Curtis (UTOUG 2015 Fall Conference)
60 pages
Byffer Cache Deep Dive - V2 PDF
No ratings yet
Byffer Cache Deep Dive - V2 PDF
55 pages
Oracle 19c Install & Upgrade
No ratings yet
Oracle 19c Install & Upgrade
5 pages
DB Monitoring & Performance Script
No ratings yet
DB Monitoring & Performance Script
14 pages
Oracle Upgrade
No ratings yet
Oracle Upgrade
23 pages
Oracle Tuning AWR Code Depot
No ratings yet
Oracle Tuning AWR Code Depot
146 pages
05 HCP Network Configuration v4-0
No ratings yet
05 HCP Network Configuration v4-0
28 pages
John Arthur Religion Morality and Conscience
No ratings yet
John Arthur Religion Morality and Conscience
3 pages
Advanced Research Techniques
No ratings yet
Advanced Research Techniques
35 pages
Tanel Poder Advanced Oracle Troubleshooting
No ratings yet
Tanel Poder Advanced Oracle Troubleshooting
31 pages
Collecting Oracle Extended Trace
No ratings yet
Collecting Oracle Extended Trace
3 pages
Diagnosis For DB Hung
No ratings yet
Diagnosis For DB Hung
5 pages
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
ORACLE Performance Tuning Exerpt
No ratings yet
ORACLE Performance Tuning Exerpt
10 pages
Systematic Oracle Tuning
No ratings yet
Systematic Oracle Tuning
12 pages
Tuning
No ratings yet
Tuning
12 pages
Troubleshooting CM
No ratings yet
Troubleshooting CM
6 pages
Awr Recreate
No ratings yet
Awr Recreate
2 pages
Oracle Wait Event - Common Issues and Solutions
100% (1)
Oracle Wait Event - Common Issues and Solutions
7 pages
Install Statspack
No ratings yet
Install Statspack
12 pages
Performance Tuning Oracle Rac On Linux
No ratings yet
Performance Tuning Oracle Rac On Linux
12 pages
Memory Management and Latching
No ratings yet
Memory Management and Latching
34 pages
DATA GUARD FSFO Reference Configuration
No ratings yet
DATA GUARD FSFO Reference Configuration
58 pages
2014-Db-Franck Pachot-Interpreting Awr Reports Straight To The Goal-Manuskript
No ratings yet
2014-Db-Franck Pachot-Interpreting Awr Reports Straight To The Goal-Manuskript
11 pages
SRVCTL Commands in Oracle RAC
No ratings yet
SRVCTL Commands in Oracle RAC
20 pages
Learning RMAN On Windows PDF
No ratings yet
Learning RMAN On Windows PDF
5 pages
Enkitec RealWorldExadata
No ratings yet
Enkitec RealWorldExadata
38 pages
Table Name Acronym Expanded
No ratings yet
Table Name Acronym Expanded
14 pages
Snapper SQL
No ratings yet
Snapper SQL
49 pages
SQL Controls & SQL Profiles
No ratings yet
SQL Controls & SQL Profiles
9 pages
Oracle Database Performance Tuning FAQ
100% (1)
Oracle Database Performance Tuning FAQ
8 pages
2009 06 02 Library-Cache-Lock
No ratings yet
2009 06 02 Library-Cache-Lock
9 pages
Tuning The Redolog Buffer Cache and Resolving Redo Latch Contention
No ratings yet
Tuning The Redolog Buffer Cache and Resolving Redo Latch Contention
5 pages
Wait Event Enhancements in Oracle 10g
No ratings yet
Wait Event Enhancements in Oracle 10g
32 pages
Tuning PGA Memory: Area. Due To The Memory-Intensive Nature of These Operations, Tuning The
No ratings yet
Tuning PGA Memory: Area. Due To The Memory-Intensive Nature of These Operations, Tuning The
12 pages
Resolving Common Oracle Wait Events Using The Wait Interface
No ratings yet
Resolving Common Oracle Wait Events Using The Wait Interface
14 pages
Migrating and Upgrading To Oracle Database 12c Quickly With Near-Zero Downtime
No ratings yet
Migrating and Upgrading To Oracle Database 12c Quickly With Near-Zero Downtime
31 pages
Oracle Goldengate Fundamentals Troubleshooting and Tuning
No ratings yet
Oracle Goldengate Fundamentals Troubleshooting and Tuning
5 pages
Raccheck - Rac Configuration Audit Tool (Id 1268927.1) : 30-May-2013 Script Published 1
No ratings yet
Raccheck - Rac Configuration Audit Tool (Id 1268927.1) : 30-May-2013 Script Published 1
8 pages
Contention - Perf - Tuning - OraPub PHLOUG CBC Analysis 1d
No ratings yet
Contention - Perf - Tuning - OraPub PHLOUG CBC Analysis 1d
23 pages
Hacktivity LT 2011 en
No ratings yet
Hacktivity LT 2011 en
46 pages
Enq TX - Index Contention
100% (1)
Enq TX - Index Contention
4 pages
Purging Statistics From The SYSAUX Tablespace
No ratings yet
Purging Statistics From The SYSAUX Tablespace
5 pages
Oracle Rman Duplicate Database Feature
No ratings yet
Oracle Rman Duplicate Database Feature
3 pages
OPDG For Resolving Slow Database Performance Issue
No ratings yet
OPDG For Resolving Slow Database Performance Issue
359 pages
Oracle SQL Tuning - File IO Performance
No ratings yet
Oracle SQL Tuning - File IO Performance
6 pages
Advanced RAC Troubleshooting
No ratings yet
Advanced RAC Troubleshooting
121 pages
Oracle Streams Step by Step
No ratings yet
Oracle Streams Step by Step
16 pages
Exadata
No ratings yet
Exadata
43 pages
11 Advanced Oracle Troubleshooting Guide When The Wait Interface Is Not Enough
No ratings yet
11 Advanced Oracle Troubleshooting Guide When The Wait Interface Is Not Enough
5 pages
Installing Oracle Database 12c R1 On Linux 6 With ASM
100% (1)
Installing Oracle Database 12c R1 On Linux 6 With ASM
47 pages
Upgrade Oracle Database Manually From 12c To 19c
No ratings yet
Upgrade Oracle Database Manually From 12c To 19c
14 pages
SGA and Background Process - Architecture
100% (2)
SGA and Background Process - Architecture
68 pages
D105019GC10 Oracle Database Performance Management and Tuning Ed 1
No ratings yet
D105019GC10 Oracle Database Performance Management and Tuning Ed 1
2 pages
Performing Database Backups
No ratings yet
Performing Database Backups
20 pages
Rman Tutorial
100% (1)
Rman Tutorial
20 pages
High-Performance Oracle: Proven Methods for Achieving Optimum Performance and Availability
From Everand
High-Performance Oracle: Proven Methods for Achieving Optimum Performance and Availability
Geoff Ingram
No ratings yet
Oracle Data Guard A Clear and Concise Reference
From Everand
Oracle Data Guard A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
From Everand
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
Adam Jones
No ratings yet
ORACLE 12C Complete Self-Assessment Guide
From Everand
ORACLE 12C Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Oracle Solaris 11 Advanced Administration Cookbook
From Everand
Oracle Solaris 11 Advanced Administration Cookbook
Alexandre Borges
No ratings yet
002 Choosing An Application Version Based On Observed Performance Study Guide
No ratings yet
002 Choosing An Application Version Based On Observed Performance Study Guide
4 pages
001 Selecting The Proper IO Scheduling Algorithm Study Guide
No ratings yet
001 Selecting The Proper IO Scheduling Algorithm Study Guide
5 pages
HCP Support Activities v4-0
No ratings yet
HCP Support Activities v4-0
26 pages
HDI Initial Configuration v1-0
No ratings yet
HDI Initial Configuration v1-0
67 pages
011 Using EBPF Tools To Diagnose System and Application Behavior Part 2 Study Guide
No ratings yet
011 Using EBPF Tools To Diagnose System and Application Behavior Part 2 Study Guide
6 pages
Veeam Backup Datasheet
No ratings yet
Veeam Backup Datasheet
2 pages
HFSM and HDI Cluster Initial Configuration - v1-0
No ratings yet
HFSM and HDI Cluster Initial Configuration - v1-0
79 pages
HDI Software Installation and HCP Preparation - v1-0
No ratings yet
HDI Software Installation and HCP Preparation - v1-0
34 pages
HDI Hardware Components v1-0
No ratings yet
HDI Hardware Components v1-0
39 pages
Installing and Configuring HCP Anywhere and Hitachi Data Ingestor
No ratings yet
Installing and Configuring HCP Anywhere and Hitachi Data Ingestor
14 pages
Operating and Managing Hitachi Content Platform v8.2: Management API
No ratings yet
Operating and Managing Hitachi Content Platform v8.2: Management API
32 pages
Operating and Managing Hitachi Content Platform v8.x: Hardware Components
100% (1)
Operating and Managing Hitachi Content Platform v8.x: Hardware Components
21 pages
HCP Replication Activities v4-0
No ratings yet
HCP Replication Activities v4-0
40 pages
Communicating in A Virtual Classroom
No ratings yet
Communicating in A Virtual Classroom
10 pages
Get More of Your Existing Storage: IBM System Storage SAN Volume Controller
No ratings yet
Get More of Your Existing Storage: IBM System Storage SAN Volume Controller
79 pages
Virtualize More, Manage Less: IBM System Storage SAN Volume Controller
No ratings yet
Virtualize More, Manage Less: IBM System Storage SAN Volume Controller
66 pages
MySQL Index
No ratings yet
MySQL Index
79 pages
Virtualize More, Manage Less: IBM System Storage SAN Volume Controller
No ratings yet
Virtualize More, Manage Less: IBM System Storage SAN Volume Controller
67 pages
Dell Compellent AIX Best Practices
No ratings yet
Dell Compellent AIX Best Practices
22 pages
Dell Compellent With IBM SAN Volume Controller SVC Best Practices
No ratings yet
Dell Compellent With IBM SAN Volume Controller SVC Best Practices
19 pages
Syllabus: Cambridge IGCSE (9-1) First Language English 0990
No ratings yet
Syllabus: Cambridge IGCSE (9-1) First Language English 0990
35 pages
S5 Ch.5 Permutation and Combination
No ratings yet
S5 Ch.5 Permutation and Combination
15 pages
Ied Product Disassembly Chart 1 2
No ratings yet
Ied Product Disassembly Chart 1 2
2 pages
Osai Controller Manual
100% (1)
Osai Controller Manual
98 pages
Cognitive Learning Theory Module
No ratings yet
Cognitive Learning Theory Module
18 pages
Project Camelot Aaron McCollum Transcript
No ratings yet
Project Camelot Aaron McCollum Transcript
31 pages
Claret College of Isabela: Senior High School
0% (1)
Claret College of Isabela: Senior High School
5 pages
Lesson 3
No ratings yet
Lesson 3
8 pages
Pe and Health 12 q4 Module 4a
100% (2)
Pe and Health 12 q4 Module 4a
20 pages
Topics Entrance Tests Maths Physics
No ratings yet
Topics Entrance Tests Maths Physics
1 page
Bài tập Anh 6 Smart World Unit 4
No ratings yet
Bài tập Anh 6 Smart World Unit 4
10 pages
Full Ironclad Captains of The Civil War 1st Edition Myron J. Smith Ebook All Chapters
100% (3)
Full Ironclad Captains of The Civil War 1st Edition Myron J. Smith Ebook All Chapters
76 pages
233185P VR28-013 Voltage Regulator
No ratings yet
233185P VR28-013 Voltage Regulator
16 pages
0.1 Differential Operator: D DX D DX 2
No ratings yet
0.1 Differential Operator: D DX D DX 2
16 pages
ICSE Classical Language2
No ratings yet
ICSE Classical Language2
4 pages
Spanish Grammar Manual
No ratings yet
Spanish Grammar Manual
380 pages
Literary Genre On Creative Multimedia Presentation
No ratings yet
Literary Genre On Creative Multimedia Presentation
21 pages
Midterm Exam 1: This Page Is Scratch Paper
No ratings yet
Midterm Exam 1: This Page Is Scratch Paper
12 pages
NRF Proposal Writing Guide
No ratings yet
NRF Proposal Writing Guide
42 pages
CED Assignment 2&3
No ratings yet
CED Assignment 2&3
4 pages
Tandem-EnFORM Reference Manual
No ratings yet
Tandem-EnFORM Reference Manual
242 pages
Version Control Systems
No ratings yet
Version Control Systems
6 pages
8779303-STANDARD
No ratings yet
8779303-STANDARD
4 pages
O2c Cycle
No ratings yet
O2c Cycle
14 pages
Financial Analysis of Household Photovoltaic Self-Consumption in The Context of The Vehicle-to-Home (V2H) in Portugal
No ratings yet
Financial Analysis of Household Photovoltaic Self-Consumption in The Context of The Vehicle-to-Home (V2H) in Portugal
21 pages