Predictive Analytics - Share - V5
Predictive Analytics - Share - V5
Ed Woods
Consulting IT Specialist
IBM Corporation
2012 IBM Corporation
Agenda
What is Predictive Analytics?
Examples
How is predictive analytics relevant to IT Service
Management?
Typical monitoring and management paradigms
Real time information versus historical data
collection
Univariate versus multivariate analysis
Examples of relevant metrics
Where to begin
2012 IBM Corporation
Data
sources
Data Mining
Analysis of data to identify underlying trends,
patterns, or relationships
Identify data to be used to develop the
predictive model
Data
cleansing and
organizing
Data
mining
Model development
Predictive Model
2012 IBM Corporation
Predictive Analytics
About Regression Models And Types Of Models
Regression models are the core of predictive analytics
A wide variety of models can be applied
Linear regression model
Analyzes the relationship between the response or dependent variable
and a set of independent or predictor variables
Partial or Stepwise regression
Modeler does not specify all the explanatory variables
Variables are added iteratively
Logit or Probit regressions
Allow one to predict a discrete outcome (yes/no) from a set of variables
Time series models
Used for predicting or forecasting the future behavior of variables
Data points taken over time may have an inherent time relation
Developed to decompose the trend, seasonal and cyclical component of
the data
Many more models
2012 IBM Corporation
Data
CICS Response time
Alerts
DB2 network
CICS network
Alerts
IMS Response time
IMS network
Commands
2012 IBM Corporation
Real time
10
Today
Example compare
todays CPU utilization
trend to yesterday
11
The Problem:
Traditional Monitoring Approaches Have Limitations
Many tools, data sources and metrics available
Many are Resource/Single Metric Focused (Univariate)
12
13
14
Predict: Eruption
forecasting using
seismic energy..
15
16
Alert!
Memory
(%)
Normal
Range
Memory
(%)
HTTP
Requests
Multivariate = Alerts earlier on Deviation
17
IMS example
High IMS message region occupancy time >>
IMS transactions queued >> longer IMS transaction scheduling
time >> longer IMS response time >> lower IMS transaction
processing rate
MQ example
Lower MQ message input rate >>
Higher MQ message queue depth >> lower transaction
processing rate >> longer CICS/IMS transaction response time
2012 IBM Corporation
18
Problem transaction
count by status
IMS Bottlenecks
Enqueue/dequeue rates
2012 IBM Corporation
19
20
Connection
bottlenecks
Network
delays
IMSConnect
Connect
IMS
APPC
APPC
Message In
OTMA
OTMA
Telnet
Telnet
IMS
Control
Region
Queues &
Scheduling
IMS
Message
& BMP
Regions
Network
delays
IMS
Control
Region
Message Out
Threads
DB, BP
I/O delays
Connection
bottlenecks
Threads
DB2 Subsystem
DB, BP
I/O delays
IMS
DLI
Lock
Conflicts
IRLM
Lock
Conflicts
2012 IBM Corporation
21
DB2 App
DB2 App
DB2 Connect
Network
delays
Network
delays
z/OS
IMS
MSTR
CICS
Batch
Connection
bottlenecks
Connections
Threads
Logging
TSO
WebSphere
IRLM
LockLocking
Conflicts
Connection
bottlenecks
DBM1
DDF
SQL
SQL
EDM
PTs, SKPTs, CTs, SKCTs
DSC Dynamic SQL
Sort Pool
RID Pool
DB, BP
Buffer
Mgr - VPs
I/O delays
Distributed
threads
SP Addr Space(s)
Stored Procedures
UDFs
SP Sched
Delays
22
CICS
Queue depth
Message send/receive rate
DLQ depth
Channel status and performance
z/OS
System CPU rate
Paging rate
WLM Performance Index
DASD I/O MSR time and rate
Critical console messages
WebSphere
Method call count and elapsed time
Heap size
Garbage collection
Connection pool utilization
Network
Network Connection status and performance
Network interface utilization
2012 IBM Corporation
23
24
Predictive Analytics
Defining A Strategy Whats Required
A predictive analytics approach starts with a comprehensive historical
data collection strategy
Gather history to a common point, if technically feasible
25
Diagnostic
Value
Some historical data will be more relevant and useful than other data
Consider the context, nature, and meaningfulness of the data
2012 IBM Corporation
26
Summary data
Data that summarizes underlying detail data
Either an aggregation or an averaging of underlying detail records
May be useful for longer term trending and analysis
Reduces quantity of data and reduces cost of retention, post processing
Less detail may mean less diagnostic value
27
Snapshot data
Typically a point in time snapshot of activity
Snapshots are usually based on a specified time interval
Snapshots may be taken of types of history (detail, summary, or interval)
Snapshots will show activity at time of the snapshot, but may/may not reflect
activity between snapshots
Useful for problem analysis and trending analysis
Useful as an aid in setting alert thresholds
Examples snapshot history captured by performance monitoring,
2012 IBM Corporation
28
Predictive Analytics
Defining A Strategy Whats Required - continued
Pursue a multivariate approach where feasible
Univariate is useful for certain trending and modeling uses
Multiple variables provide a more meaningful indication of potential
issues
Multivariate is more relevant to todays composite applications
29
Define metrics
What are the most critical metrics to track?
History
Define a collection strategy that allows for the aggregation of data
Correlation methodology
How to correlate metrics in real time and in history
Is there a way to automate the correlation process?
Prediction
How predictive are the chosen metrics?
2012 IBM Corporation
30
Thank You!!
31
32