
What I Learned This Month: IBM DB2 Analytics Accelerator

Scott Chapman
American Electric Power

We're currently working on getting our IBM DB2 Analytics Accelerator (IDAA [1]) up
and running. We're still figuring many things out, so "What I'm Learning This
Month" would be a more accurate title, but I thought I'd write up what I've learned
so far.
The IDAA is a hardware appliance (based on the Netezza technology that IBM
acquired in 2010) that is managed and exploited by DB2 on z/OS to greatly
accelerate certain queries. The primary targets are analytical queries that might
traditionally be handled with a data warehouse solution because they would
perform poorly in the traditional operational database. While data warehouse
solutions have become ubiquitous, there are some issues with them. In
particular, they represent a second copy of corporate data to secure and
manage. And you need some sort of mechanism to keep that second copy up to
date with the changes in the operational data store. Typically there is also a fair
bit of work to design the data warehouse to summarize and transform data such
that particularly interesting business queries can be answered quickly. Of course
over time, the business requirements may change, necessitating changes to the
data warehouse.
In contrast, the idea behind the IDAA is to allow DB2 to manage all of that
complexity and allow all queries to run against the operational database. DB2 will
decide whether to satisfy the queries using traditional z/OS resources or using
the IDAA resources. This could be a tremendous simplification of the
environment relative to the traditional data warehousing solution.
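To make the routing concrete: eligible dynamic queries can be steered with the
CURRENT QUERY ACCELERATION special register introduced in DB2 10 (the
QUERY_ACCELERATION zparm supplies the system default). A minimal sketch, with
hypothetical table and column names:

    -- Route eligible dynamic queries to the accelerator, falling back
    -- to native z/OS execution if the accelerator can't run them.
    SET CURRENT QUERY ACCELERATION = ENABLE WITH FAILBACK;

    -- A typical analytical query that DB2 may now choose to offload.
    SELECT ACCT_ID, SUM(USAGE_KWH) AS TOTAL_KWH
      FROM MYSCHEMA.METER_READINGS
     GROUP BY ACCT_ID;

Setting the register to NONE forces everything back to native z/OS execution,
which is handy for before-and-after comparisons.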
Additionally, because of the highly parallel nature of the IDAA, typical
IDAA-eligible queries may execute an order of magnitude (or more) faster than
they would if they used z/OS resources. Queries that run on the IDAA also
consume no z/OS resources, which may be a potential cost savings and/or may
allow other z/OS workloads to perform better.
All in all, it sounds great. So far it does perform well in our initial, limited tests.
I've done scans of 100-million row tables in just a couple of seconds. But there
are some caveats and some things we're still working out.
One of the first issues we ran into was the hardware requirements: the
connections between the mainframe CECs and the IDAA appliance must be
10GbE network connections. There were numerous discussions about whether
these could be direct connections or whether we were required to install switches
and whether or not those switches needed to be dedicated to this purpose. Both
the 10GbE cards and the switches are fairly expensive. This caused us a lot of
consternation. It would have been a lot easier if the IDAA were simply orderable
with an embedded switch, but that's not an orderable feature. In the end we did
use non-dedicated switches, although they did need a configuration change
because they initially didn't have jumbo frame support enabled.

[1] I will refer to it herein as the IDAA as a handy abbreviation, even though
IBM is no longer branding it that way due to a request from International
Doctors in Alcoholics Anonymous.
On the software side, DB2 v10 New Function Mode is required, at least if you're
going to be using the incremental update feature to keep the IDAA tables in sync
with the DB2 tables. We're still rolling out NFM across our environments, but
we've been able to start experimenting with the IDAA in one of our first NFM
environments. In some simple tests, queries that would take over a minute to do
a tablespace scan in DB2 on z/OS have run in less than 10 seconds on the IDAA.
Those are encouraging results, even if they are for simplistic cases and
relatively small tables.
One unexpected issue that we ran into is that the IDAA oddly can't handle all of
the data values that are valid in DB2. In particular, DB2 allows 24:00:00 as a
valid TIME value, but the IDAA does not. IBM has some patches that they can
apply to the IDAA to make it more tolerant of 24:00:00 in the DB2 tables, but
those fixes involve changing the value to 23:59:59.999999. Whether that's
acceptable will be application-dependent: if the application is using 24:00:00
to indicate "end of day", changing the value may not be acceptable. For now we
changed the data in the DB2 tables and the application changed their code to
avoid storing 24:00:00. We're still debating whether to apply the IDAA patches,
but I think we probably will, because without them in place the incremental
update data replication will break if a value of 24:00:00 shows up in the future.
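A quick way to see whether you have the same exposure is to count the offending
rows before replication trips over them. A minimal sketch, with hypothetical
table and column names:

    -- Count rows containing the end-of-day time that the IDAA rejects.
    SELECT COUNT(*) AS BAD_ROWS
      FROM MYSCHEMA.SHIFT_SCHEDULE
     WHERE END_TIME = TIME('24:00:00');

Any rows found would need the application-level decision described above:
change the data, or accept IBM's 23:59:59.999999 substitution.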
Breaking replication could be a significant problem because the use of the IDAA
is truly transparent: once a table has been initially copied to the IDAA and
enabled for acceleration, any dynamic SQL query for that table may be executed
in the IDAA. But since the data in the IDAA is a copy of the data in DB2 on z/OS,
the IDAA copy is by necessity somewhat out of date. If replication is enabled, the
IDAA data may be only seconds behind DB2. Even a data latency [2] of a few
minutes is probably adequate for the vast majority of business queries. However,
latencies of hours could be problematic for many of them.
My concern is that we've seen instances where we've been able to break
replication for a table (more on that in a moment), but the table remains enabled
for acceleration. If the replication failure isn't noticed and handled by somebody
in a timely fashion, it's entirely conceivable that we could have queries running
against data in the IDAA that's hours old. In some cases that could be very bad.
Fast answers are nice, but correct answers are an absolute requirement!
IBM provides some sample code [3] to monitor latency in the Redbook "Hybrid
Analytics Solution using IBM DB2 Analytics Accelerator for z/OS 3.1". It seems
likely we'll build some sort of automated checking to make sure that the
replication process is always running correctly, and disable IDAA acceleration
if it's not. You can enable or disable acceleration for individual tables; it
would be nice if there were another setting along the lines of "enable only if
the data is no more than x seconds old".

[2] The time between updates being committed in DB2 z/OS and being applied to
the IDAA's copy of the data.
[3] See Appendix E in http://www.redbooks.ibm.com/redbooks/pdfs/sg248151.pdf
As for why we were able to break replication, that has to do with the amount of
storage that we gave the CDC (Change Data Capture) task. The CDC task uses
a staging area in memory to hold uncommitted changes. If you have SQL
statements that update or delete millions of rows of data in a single unit of work,
you may find that you need to make that area multiple GBs. The amount of
storage required is a function of the row length and the number of rows affected.
Updates require twice as much memory, because both the before and after images
are staged.
Determining the total number of potentially uncommitted inserted, updated, or
deleted rows that might be in flight at any given time is a challenging task.
However my friendly DBA pointed out that for our largest application we don't
allow lock escalation to occur automatically, so the total number of rows or pages
that have been updated but not committed is limited to the maximum number of
locks that a thread can have.
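For illustration, that worst-case arithmetic can be sketched directly against
the catalog. The per-thread limit of 500,000 row locks below is purely
hypothetical (check your own NUMLKUS setting), and AVGROWLEN is only populated
by RUNSTATS:

    -- Worst case: max locked rows x average row length x 2 (update staging
    -- keeps both the before and after image of each row).
    SELECT CREATOR, NAME, AVGROWLEN,
           (BIGINT(AVGROWLEN) * 500000 * 2) / (1024 * 1024) AS WORST_CASE_MB
      FROM SYSIBM.SYSTABLES
     WHERE CREATOR = 'MYSCHEMA'        -- hypothetical schema
       AND NAME    = 'MY_BIG_TABLE';   -- hypothetical table

At 1.2KB per row, for example, 500,000 locked rows works out to roughly 1.1GB
of staging for a single massive update.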
With that idea in mind, I did some rough worst-case calculations which suggested
that a staging area of about 1.2GB would be sufficient for our normal workload.
There are a lot of assumptions built into those calculations and we do have
certain occasional processes that lock a whole table to do a massive update,
which can drive the need for a much larger staging area. In fact during testing we
found we needed a 1.7GB staging area when we were running a process that
deleted or inserted 3 million rows with a single commit. We decided to make the
staging area 3GB and monitor and measure it to make sure that we're not
running close to that limit. I expect that we would have been fine with the 2GB
default, at least until something came along and impacted several million rows at
once. I'm glad that we found this during testing instead of finding it by surprise in
production some weeks or months after implementation.
So that's what I've learned so far about the IDAA. I'm looking forward to working
through the questions that I still have and doing some tests with larger volumes
of data and more sophisticated queries that better represent some of the
problems we expect that it will solve for us. Performance will also likely get more
"interesting" once we start sending multiple queries to it simultaneously.
As always, if you have questions or comments, you can reach me via email at
[email protected].
