Exadata Performance

This document provides an overview of Exadata performance debugging. It discusses checking if Exadata cells are IO bound and examines flash cache and smart scan functionality. It also covers monitoring Exadata storage servers and cells using metrics to analyze I/O performance and identify potential issues.

Uploaded by

sdranga123

Exadata Performance Debugging

Biswaroop Biswal / Ranga Sarvabhouman

Agenda

What basic information one must know about Exadata I/O related performance:

- Check if cells are IO bound
- Flash Cache
- Smart Scan

Check if cells are IO bound
Check if db nodes are CPU or memory bound
Check if smart scan works as expected

Check if HCC/Partitioning/DBFS are used
If none is true, go back to database performance tuning and planning

Check if cells are IO bound?
- Differentiate between slowness and full utilization.
- Use OSW iostat and/or CellDisk metrics to compute total HDD and FLASH throughput (MBPS) and IOPS.
- Refer to the Exadata DBM data sheet for peak numbers.
- Watch out for high latency if IOs ever approach peak numbers.
- High latency does NOT mean slow disks:
  - Each IO takes long primarily because of the time it spends waiting in the disk queue.
  - IO latency can be >100 ms (note that the disks are not slow!).
  - IO latency depends on disk queue length, so it varies with the workload.
- Be aware that max MBPS and max IOPS cannot be reached simultaneously.
- How to evaluate a mixed workload?
  - Examine disk utilization: is it close to 100%?
  - Run calibrate if needed (requires the cells to be quiesced).
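
The point that high latency comes from queueing rather than from slow disks can be illustrated with Little's law (average time in the system = average number of requests in the system / throughput). This is a minimal sketch assuming a steady state; the function names and the sample figures (150 IOPS, 20 outstanding IOs) are illustrative assumptions, not data sheet values.

```python
def avg_io_latency_ms(outstanding_ios: float, iops: float) -> float:
    """Little's law: average time per IO = requests in system / throughput."""
    if iops <= 0:
        raise ValueError("iops must be positive")
    return outstanding_ios / iops * 1000.0


def utilization_pct(observed: float, peak: float) -> float:
    """Observed MBPS or IOPS as a percentage of the data sheet peak."""
    if peak <= 0:
        raise ValueError("peak must be positive")
    return observed / peak * 100.0


# Hypothetical disk: it sustains 150 IOPS while 20 requests are outstanding.
# The disk is not slow; each IO simply waits behind the others in the queue.
latency = avg_io_latency_ms(outstanding_ios=20, iops=150)
print(f"{latency:.0f} ms per IO")
```

With these assumed numbers the average latency is already above 100 ms, even though the disk is delivering its normal IOPS.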

What happens when cells are IO bound
- Consider SAME (Stripe And Mirror Everywhere): when any disk or cell is maxed out, performance is throttled by that disk or cell, even with workload parallelization.
- Be aware of the potential performance disparity between system and data disks:
  - System disks hold not only user data but also the cell's own file systems.
  - System disks may run slower than data disks.
  - The disparity is more pronounced on High Capacity 2TB drives because of their lower IOPS capacity compared with High Performance 600GB drives.

Exadata Flash Cache - overview

Know your Flash:

Flash storage entities and relationships:

When/How and what to measure in Flash cache:

When to measure:
- Missing SLA
- Poor performance across the environment.
- AWR reports
- Users screaming
How to measure:
- V$SYSSTAT, V$SQL, V$SESSION_WAIT, V$SESSION_EVENT and V$SYSTEM_EVENT
- Exadata Storage Servers using views
- Exadata Storage Servers using metrics (cellcli and dcli commands)
- Exadata Smart Flash Log with metrics
What to measure:
- Effective use of flash
- Percentage of I/O requests satisfied by flash
- Number of objects kept on flash
- Size of objects kept on flash
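
The "percentage of I/O requests satisfied by flash" can be estimated from two V$SYSSTAT snapshots. This is a rough sketch, assuming the statistics `cell flash cache read hits` and `physical read total IO requests` are the ones of interest; the helper name and the sample values are hypothetical. V$SYSSTAT values are cumulative, so the calculation uses deltas between snapshots.

```python
HITS = "cell flash cache read hits"
READS = "physical read total IO requests"


def flash_read_hit_pct(before: dict, after: dict) -> float:
    """Percentage of read requests served from flash between two snapshots."""
    hits = after[HITS] - before[HITS]
    reads = after[READS] - before[READS]
    if reads <= 0:
        return 0.0  # no read activity in the interval
    return 100.0 * hits / reads


# Hypothetical snapshot values, e.g. taken a few minutes apart.
snap1 = {HITS: 10_000, READS: 20_000}
snap2 = {HITS: 10_900, READS: 21_200}
print(f"{flash_read_hit_pct(snap1, snap2):.1f}% of reads served from flash")
```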

Smart Flash Log: what to look for?

- Smart Flash Logging affects log file parallel write time, not log file sync time.
- Check the AWR report for high log file parallel write times; there should be very few log file parallel write waits > 32 ms, and no waits > 0.5 seconds. This can be verified in the following sections of an AWR report:
  - Wait Event Histogram
  - Wait Event Histogram Detail (64 msec to 2 sec)
- If a cell has a non-zero value for FL_IO_W_SKIP_BUSY, the hard disks that contain the database log files (or their mirrored copies) are not performing well. This can be due to throughput limits or heavy load on the database. It can be resolved by expanding your system to more cells.
- If a cell has a non-zero value for FL_ACTUAL_OUTLIERS and FL_EFFICIENCY_PERCENTAGE is not 100%, the flash disks are not performing well. This can be due to load or to a performance issue. If load is the cause, try to reduce it; otherwise consider replacing the flash disk.
- Besides hard disk and flash disk performance, other factors can affect redo log write latencies:
  - Examine the database nodes to make sure that LGWR is not experiencing scheduling hiccups due to factors such as swapping.
  - Check whether the network is impacting performance.
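
The two metric checks above can be written as a small triage routine. This is only a sketch: the metric names come from the slide, while the function name, the input dictionary and the sample values are hypothetical. Collecting the values (for example with cellcli or dcli) is outside its scope.

```python
def triage_flash_log(metrics: dict) -> list:
    """Return findings for one cell based on its Smart Flash Log metrics."""
    findings = []
    # Non-zero FL_IO_W_SKIP_BUSY: hard disks with the redo logs are struggling.
    if metrics.get("FL_IO_W_SKIP_BUSY", 0) > 0:
        findings.append("hard disks holding redo logs are not performing well")
    # Outliers combined with efficiency below 100%: flash disks are struggling.
    outliers = metrics.get("FL_ACTUAL_OUTLIERS", 0)
    efficiency = metrics.get("FL_EFFICIENCY_PERCENTAGE", 100)
    if outliers > 0 and efficiency < 100:
        findings.append("flash disks are not performing well")
    return findings


# Hypothetical values for a single cell.
sample = {"FL_IO_W_SKIP_BUSY": 0, "FL_ACTUAL_OUTLIERS": 12,
          "FL_EFFICIENCY_PERCENTAGE": 97}
for finding in triage_flash_log(sample):
    print(finding)
```

An empty result does not rule out redo latency problems; LGWR scheduling and the network still need to be checked as described above.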

Smart Scan works as expected & how to debug
Symptoms
- In the AWR report you would see these two wait events:
  - cell smart table scan
  - cell smart index scan
- In v$sysstat and v$cell_state, look for the following statistics (also found in the system statistics section of the AWR report):
  - cell physical IO bytes eligible for predicate offload
  - cell physical IO bytes saved by storage index
  - cell physical IO bytes sent directly to DB node to balance CPU usage
  - cell physical IO interconnect bytes returned by smart scan
  - %cell num smart%
  - PREDIO (v$cell_state)
- OSWatcher logs
Reasons
- Smart scan does less filtering or no filtering:
  - Suboptimal storage index usage
  - Less/no filtering due to quarantine or time zone upgrade
  - Less filtering due to CPU passthrough
- Other reasons:
  - Cell is CPU bound or IO bound
  - Suboptimal flash usage
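
One way to judge how much filtering smart scan achieves is to compare the bytes eligible for predicate offload with the bytes returned by smart scan. The ratio below is a commonly used rule of thumb, not something defined in the slides; the function name and sample values are assumptions.

```python
def offload_saving_pct(eligible_bytes: int, returned_bytes: int):
    """Share of offload-eligible bytes that never had to reach the DB node.

    Returns None when nothing was eligible for offload. The value can be
    negative, for example when compressed data is decompressed on the cell
    and more bytes are returned than were read.
    """
    if eligible_bytes <= 0:
        return None
    return 100.0 * (eligible_bytes - returned_bytes) / eligible_bytes


# Hypothetical statistic values taken from v$sysstat deltas.
eligible = 1_000_000_000  # cell physical IO bytes eligible for predicate offload
returned = 100_000_000    # cell physical IO interconnect bytes returned by smart scan
print(f"{offload_saving_pct(eligible, returned):.1f}% filtered on the cells")
```

A value close to zero suggests little or no filtering and points to the reasons listed above.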
How to identify a long-running transaction that causes suboptimal storage index usage?
Purpose:
- A long-running transaction prevents the min active SCN from progressing, thereby causing scans not to use the storage index.
Steps:
- Get the global min active SCN by setting system event 55703 at level 1; the min active SCN is printed in the alert log once every 3 minutes. Unset the event after you get the min active SCN. Convert the min SCN from hexadecimal to decimal.
- Use scn_to_timestamp to compare the SCN from the alert log with current_scn; if they differ by a significant amount, continue.
- Query X$KTUXE to get the slot number and undo segment number where the column KTUXESTA is not INACTIVE.
- Query v$process to obtain the instance ID and process ID.
- You can then either kill the process or use pstack to obtain more information.
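
The hex-to-decimal conversion in the first step can be done with a few lines of code. This is a sketch: the exact format printed in the alert log is not shown in the slides, so the helper accepts either a plain hex value or the traditional wrap.base notation (SCN = wrap * 2^32 + base), and the sample values are made up.

```python
def scn_hex_to_decimal(scn_hex: str) -> int:
    """Convert a hex SCN ("0x3e8", "3e8" or "0x0001.0000000a") to decimal."""
    text = scn_hex.strip()
    if "." in text:
        # wrap.base notation: the wrap counts how often the 32-bit base overflowed
        wrap_part, base_part = text.split(".", 1)
        return int(wrap_part, 16) * 2**32 + int(base_part, 16)
    return int(text, 16)


# Hypothetical value copied from the alert log.
print(scn_hex_to_decimal("0x3e8"))
```

The decimal result can then be passed to scn_to_timestamp for the comparison in the second step.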
Less/no filtering due to quarantines
- The presence of SQL or DB quarantines results in smart scan not being used.
- Look at the following v$cell_state statistics to see whether filtering is not happening due to quarantines:
  - Smart IO not used to SQL Step or DB Quarantine.
  - Smart IO not used to Disk Region Quarantine.
- Use dcli or cellcli on the cells to check for quarantines; for example, run list quarantine where QuarantineType=Database at the cellcli prompt.
- Quarantines are removed automatically when the cell software is upgraded; use the cellcli drop quarantine command to remove one manually.
Less/no filtering due to timezone upgrade; CPU passthrough
- Look at the v$cell_state statistics to see whether smart scan is not happening due to a timezone upgrade:
  - Smart IO uses passthru as timezone file is unavailable.
  - Or: select value from v$sysstat where name = 'cell num smart IO sessions using passthru mode due to timezone';
- Smart scans will take place after the timezone file is made available.
- Reasons for CPU passthrough (storage is CPU bound):
  - More scan queries are running on the storage server, resulting in high CPU usage.
  - Suboptimal storage index usage results in more physical IO being performed, which means more filtering on the storage server and therefore higher CPU usage.
  - Scans on encrypted tables increase storage CPU usage.
I/O Resource Management Plans
I/O Resource Management Plans : Example
Setting the IORM Objective
Available IORM objective settings:
basic
IORM does not enforce user-defined plans.
IORM protects against extreme latencies for small I/O requests.
Maximum throughput is maintained.
low_latency
Minimizes latency by limiting the number of concurrent I/O requests
Useful for critical OLTP workloads
Performance of high-throughput workloads may suffer
high_throughput
Maximizes throughput by not limiting concurrent I/O requests
Useful for DSS and data warehouse workloads
Performance of latency-critical workloads may suffer
balanced
Balances low disk latency and high throughput
Useful for mixed workloads
auto
IORM decides the best objective setting based on active plans and workloads
Intradatabase Plan : Example
Interdatabase Plan : Example
Using Share-Based Allocation in the Interdatabase Plan
Commencing with Exadata Storage Server software release 11.2.3.1.0, I/O allocations in the interdatabase plan can be expressed as shares rather than with the level and allocation attributes. Each share is a value between 1 and 32, with 1 being the lowest share and 32 being the highest. The share value represents the relative importance of each database rather than an I/O allocation percentage.
Share-based allocation is a simplified approach designed to support large numbers of databases. Using share-based allocations, an interdatabase plan can support up to 1024 directives.
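
To see what a set of shares means in relative terms, each share can be divided by the sum of all shares. This is a simplified illustration that assumes every database is actively competing for I/O; it does not describe how IORM schedules requests internally, and the database names and function name are hypothetical.

```python
def share_percentages(shares: dict) -> dict:
    """Relative weight of each database, as a percentage of all shares."""
    for name, share in shares.items():
        if not 1 <= share <= 32:
            raise ValueError(f"share for {name} must be between 1 and 32")
    total = sum(shares.values())
    if total == 0:
        return {}  # no directives given
    return {name: 100.0 * share / total for name, share in shares.items()}


# Hypothetical plan: "sales" is twice as important as each other database.
plan = {"sales": 8, "finance": 4, "dev": 4}
for name, pct in share_percentages(plan).items():
    print(f"{name}: {pct:.0f}%")
```

Adding a database with its own share changes everyone's relative weight without editing the existing directives, which is why the approach suits large numbers of databases.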
Setting Database I/O Utilization Limits
Database Roles
Category Plan: Example
IORM and Exadata Storage Server Flash Memory
- IORM manages only I/O queues for physical disks.
- IORM does not arbitrate requests serviced by Exadata Smart Flash Cache.
- IORM can control whether a database can use Exadata Smart Flash Cache.
- IORM can control whether a database can use Exadata Smart Flash Log.
Starting with Exadata Storage Server software release 11.2.2.3, IORM can be used to specify whether a database is allowed to use Exadata Smart Flash Cache. This allows flash cache to be reserved for the most important databases, which is especially useful in environments that are used to consolidate multiple databases.
Complete Example
Exadata Cells
Exadata Metrics and alerts
Monitoring Exadata Storage Server with Metrics
Monitoring Exadata Cell Metrics
Metric abbreviation :

CL_ (cell)
CD_ (cell disk)
GD_ (grid disk)
FC_ (flash cache)
DB_ (database)
CG_ (consumer group)
CT_ (category)
N_ (interconnect network)
_R for read or _W for write
_SM or _LG to identify small or large I/Os
_SEC at the end of the name signifies per second; _RQ at the end signifies per request

Examples:
CD_IO_RQ_R_SM is the number of requests to read small blocks on a cell disk.
GD_IO_BY_W_LG_SEC is the number of MB of large block I/O written per second on a grid disk.
I/O-related metric name :

IO_RQ (number of requests)
IO_BY (number of MB)
IO_TM (I/O latency)
IO_WT (I/O wait time)
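
The naming convention above is regular enough to decode mechanically. The sketch below covers only the name fragments listed on this slide; real cells expose metrics with other fragments, which this hypothetical helper simply ignores.

```python
OBJECTS = {"CL": "cell", "CD": "cell disk", "GD": "grid disk",
           "FC": "flash cache", "DB": "database", "CG": "consumer group",
           "CT": "category", "N": "interconnect network"}
MEASURES = {"RQ": "number of requests", "BY": "number of MB",
            "TM": "I/O latency", "WT": "I/O wait time"}
DIRECTIONS = {"R": "read", "W": "write"}
SIZES = {"SM": "small", "LG": "large"}
RATES = {"SEC": "per second", "RQ": "per request"}


def decode_metric(name: str) -> dict:
    """Split a metric name such as CD_IO_RQ_R_SM into its documented parts."""
    tokens = name.upper().split("_")
    result = {"object": OBJECTS.get(tokens[0], "unknown")}
    rest = tokens[1:]
    # IO_RQ / IO_BY / IO_TM / IO_WT directly follows the object prefix.
    if len(rest) >= 2 and rest[0] == "IO" and rest[1] in MEASURES:
        result["measure"] = MEASURES[rest[1]]
        rest = rest[2:]
    # A trailing _SEC or _RQ is a rate suffix, not a measure.
    if rest and rest[-1] in RATES:
        result["rate"] = RATES[rest[-1]]
        rest = rest[:-1]
    for token in rest:
        if token in DIRECTIONS:
            result["direction"] = DIRECTIONS[token]
        elif token in SIZES:
            result["size"] = SIZES[token]
    return result


print(decode_metric("CD_IO_RQ_R_SM"))
print(decode_metric("GD_IO_BY_W_LG_SEC"))
```

Because RQ can mean either "number of requests" (after IO_) or "per request" (at the end), the position of the token decides how it is read.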
Monitoring Exadata Storage Server with Alerts
Isolating Faults with Exadata Storage Server Quarantine
In addition to metrics and alerts, a quarantine object is created automatically when prescribed faults are detected in Exadata Storage Server. The action that caused the fault is quarantined so that the fault can be avoided in the future. Quarantine reduces the chance of storage server software crashes and improves storage availability.
Exadata Storage Server Quarantines
Types of automatic quarantine:
- SQL Plan: Created when the cell crashes while performing Smart Scan for a SQL statement. The SQL plan for the SQL statement is quarantined, and Smart Scan is disabled for the SQL plan.
- Disk Region: Created when the cell crashes while performing Smart Scan of a disk region. The 1 MB disk region being scanned is quarantined, and Smart Scan is disabled for the disk region.
- Database: Created when the cell detects that a particular database causes instability. Instability detection is based on the number of SQL Plan quarantines for a database. Smart Scan is disabled for the database.
- Cell Offload: Created when the cell detects that some offload feature has caused instability. Instability detection is based on the number of Database quarantines for a cell. Smart Scan is disabled for all databases.
CellCLI commands to manually manipulate quarantines:

- LIST QUARANTINE: To show the quarantines currently on the cell.
- ALTER QUARANTINE: To set the comment attribute. The comment attribute is the only quarantine attribute that can be modified.
- DROP QUARANTINE: To manually remove a quarantine.
- CREATE QUARANTINE: To manually create a quarantine object. Manual quarantines are created to proactively isolate SQL statements that are known to cause problems. Example:
CELLCLI> CREATE QUARANTINE quarantineType="SQLID", sqlid="5xnjp4cutc1s8"
Choosing the Flash Cache Mode
Setting the Flash Cache Mode
Enabling write-back mode
Enabling write-through mode
Exadata specific system statistics
Gather Exadata specific system statistics:

- Enables the optimizer to cost operations more accurately using actual performance information:
  - CPU speed
  - IO performance
- Sets multi block read count (MBRC) correctly for Exadata
- Requires at least Oracle Database version 11.2.0.2 BP 18 or 11.2.0.3 BP 8
- Recommended for all new databases
- Test thoroughly before changing existing databases.
- Databases with stable good plans do not require a change.
