DataStage 8 Overview
Richard Hedges
Program Director, Product Management
IBM Information Server
Agenda
IBM Information Server Overview & Architecture
WebSphere DataStage Usability Improvements
Best in class Data Transformation
Focus on Connectivity
Performance, Performance, and Performance
Installation, Configuration, Administration, Reporting
Upgrade to WebSphere DataStage v8.0
[Architecture diagram: Analysis Interface and Development Interface; Metadata Services and Security Services; Unified Metadata (Design, Operational); Understand, Cleanse, Transform, Deliver; Common Connectivity]
WebSphere DataStage Usability Improvements
Creation: date/time, by user
Last Modification: date/time, by user
Where Used: What other objects use this object?
Dependencies of: What does this object use?
Options
Case Match on name & description or name or description
Tables
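The "Where Used" and "Dependencies of" questions above are two directions of the same traversal over object relationships. A minimal sketch, assuming a hypothetical in-memory map of which object uses which (the object names and both functions are illustrative, not the DataStage repository API):

```python
from collections import defaultdict

# Hypothetical repository contents: each object maps to the objects it uses directly.
uses = {
    "LoadCustomers": ["CustomerTable", "CleanNames"],
    "LoadOrders": ["OrdersTable", "CleanNames"],
    "CleanNames": ["NameRules"],
}


def dependencies_of(obj, uses):
    """Return everything `obj` uses, directly or indirectly."""
    seen = set()
    stack = list(uses.get(obj, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(uses.get(current, []))
    return seen


def where_used(obj, uses):
    """Return every object that uses `obj`, directly or indirectly."""
    # Invert the "uses" edges, then reuse the same traversal.
    used_by = defaultdict(list)
    for user, targets in uses.items():
        for target in targets:
            used_by[target].append(user)
    return dependencies_of(obj, used_by)


print(sorted(dependencies_of("LoadCustomers", uses)))
print(sorted(where_used("NameRules", uses)))
```

Inverting the edges keeps one traversal routine for both questions, which is why the two searches can share the same options.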
Export Improvements
The new GUI allows modification of the originally populated export list. Items can be added, removed, or filtered out.
Available from
Best in class Data Transformation
How it works
Uses built-in state files or DBMS sequences (DB2 & Oracle)
Supports large integer (uint64) surrogate key values
Can be used to discover surrogate key values which are already being used so that use of duplicate key values will be avoided
Customizable block size to manage key gaps vs. performance
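The block-size trade-off can be illustrated with a small sketch. It assumes a hypothetical JSON state file and a hypothetical `SurrogateKeySource` class; it is not the product's state-file format, and it omits the file locking a real multi-process setup would need.

```python
import json
import os
import tempfile


class SurrogateKeySource:
    """Hands out unique integer keys, reserving them from a state file in blocks."""

    def __init__(self, state_path, block_size=100):
        self.state_path = state_path
        self.block_size = block_size
        self._next = 0   # next key to hand out from the current block
        self._limit = 0  # first key past the end of the current block

    def _reserve_block(self):
        # Read the highest key reserved so far (0 if no state file exists yet).
        highest = 0
        if os.path.exists(self.state_path):
            with open(self.state_path) as f:
                highest = json.load(f)["highest_reserved"]
        self._next = highest + 1
        self._limit = self._next + self.block_size
        # Record the new high-water mark before any key from the block is used.
        with open(self.state_path, "w") as f:
            json.dump({"highest_reserved": self._limit - 1}, f)

    def next_key(self):
        # Only touch the state file when the current block is used up.
        if self._next >= self._limit:
            self._reserve_block()
        key = self._next
        self._next += 1
        return key


state_file = os.path.join(tempfile.mkdtemp(), "customer_keys.json")
job_a = SurrogateKeySource(state_file, block_size=3)
job_b = SurrogateKeySource(state_file, block_size=3)
print(job_a.next_key(), job_b.next_key(), job_a.next_key())  # prints: 1 4 2
```

A larger block means fewer trips to the state file, but any keys left unused in a reserved block become gaps, which is the trade-off the block size setting manages.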
Focus on Connectivity
Connectivity Updates
New functionality and more DB supported in SQL builders
SQL Server, Teradata, ODBC
New Connectivity
Stages for WebSphere Federation and Classic Federation
Server and Enterprise stages
DRS support
Native integration with Federation and Classic Federation
Connection Objects
New top-level repository object
Allows saving of a re-usable connection path to a specific source or target
Username, password, db name etc.
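The idea of a saved, re-usable connection can be sketched as a small record whose fields are copied into a stage's properties. The `ConnectionObject` class, its field names, and `apply_connection` are hypothetical illustrations, not the DataStage object model:

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class ConnectionObject:
    """A saved connection path to one source or target (illustrative fields)."""
    name: str
    database: str
    username: str
    password: str


def apply_connection(stage_properties, connection):
    """Return a copy of the stage properties with the connection details filled in."""
    merged = dict(stage_properties)
    # The connection's own name is a label, not a stage property.
    details = {k: v for k, v in asdict(connection).items() if k != "name"}
    merged.update(details)
    return merged


warehouse = ConnectionObject("WarehouseDB2", "DWH", "etl_user", "secret")
stage = {"table": "CUSTOMER", "write_mode": "append"}
print(apply_connection(stage, warehouse))
```

Defining the details once and applying them to many stages means a changed password or database name is edited in a single place.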
DB2 Q107
For DPF and non-DPF
Teradata Q107
New support for Teradata Parallel Transport (TPT)
Oracle Q107
New support for 10gR2
WebSphere MQ Q107
Adding support for client only configuration
Connection objects allow properties to be dropped onto a stage.
The diagram lets you select the link to edit as though you're on the canvas.
New Functionality
Enhanced support for Siebel EIM and Business Components
New Metadata browser and importer for Oracle Applications
Greater support for large enterprise class deployments
Performance, Performance, and Performance
Performance Improvements
Improved Job Startup Time
Allow efficient use of DS EE against smaller data sets
Buffer Optimization
Improved buffer placement algorithm
E.g., removed unnecessary buffer before parallel sort in some instances
Combinability Optimizations
More combinable stages
Intelligent combining
Resource Estimation
Difficult to estimate resources required for job execution
Scratch space, CPU, etc.
What happens if data volume increases? How do I prevent job aborting due to lack of system resources?
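One simple way to reason about these questions is to scale the resources observed on a sample run up to the expected volume. The sketch below assumes scratch usage grows linearly with row count, which is a simplification, and both function names are hypothetical; it is not how the product's resource estimation is implemented.

```python
def estimate_scratch_bytes(sample_rows, sample_scratch_bytes, target_rows):
    """Scale scratch space seen on a sample run to a larger row count (linear assumption)."""
    if sample_rows <= 0:
        raise ValueError("sample_rows must be positive")
    # Integer ceiling division, so the estimate never rounds down.
    return (sample_scratch_bytes * target_rows + sample_rows - 1) // sample_rows


def fits(estimated_bytes, available_bytes, headroom=0.2):
    """True if the estimate plus a safety margin fits in the available space."""
    return estimated_bytes * (1 + headroom) <= available_bytes


# A sample of 1 million rows used 200 MB of scratch; production has 50 million rows.
needed = estimate_scratch_bytes(1_000_000, 200_000_000, 50_000_000)
print(needed)
print(fits(needed, available_bytes=8_000_000_000))
```

Running such a check before the job starts turns an abort in mid-run into a planning decision about disk or data volume.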
Installation, Configuration, Administration, Reporting
Security Services
Internal Directory
Defines users, groups, roles
Supports browsing/creation/deletion/update operations
External Directories
LDAP, Active Directory, Unix
External directory passwords are not stored
Supports browsing/partial update operations
Roles
Suite roles: Suite User, Suite Administrator
Product roles: e.g. DataStage user
Project roles: e.g. Information Analyzer User
Logging
A new common logging facility
Used by all the products of the Suite
Logs go into the operational repository
DataStage Client log viewer does not change
Logging administration done from the administration console
Logging Views are saved queries
Opening a view displays the log events corresponding to the saved query
Example:
Severity level: Error
Category: DataStage
Timestamp: past 12 hours
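A logging view of this kind can be thought of as a stored filter that is re-evaluated each time it is opened. The sketch below mirrors the example criteria; the `LogEvent` and `LogView` classes are hypothetical and do not reflect the actual logging service API.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class LogEvent:
    timestamp: datetime
    severity: str
    category: str
    message: str


@dataclass
class LogView:
    """A saved query: the criteria are stored, the results are not."""
    severity: str
    category: str
    within: timedelta

    def open(self, events, now):
        # Re-evaluate the saved criteria against the current log contents.
        cutoff = now - self.within
        return [
            e for e in events
            if e.severity == self.severity
            and e.category == self.category
            and e.timestamp >= cutoff
        ]


now = datetime(2007, 3, 1, 12, 0)
events = [
    LogEvent(now - timedelta(hours=1), "Error", "DataStage", "Job aborted"),
    LogEvent(now - timedelta(hours=2), "Info", "DataStage", "Job started"),
    LogEvent(now - timedelta(hours=20), "Error", "DataStage", "Old failure"),
]
recent_errors = LogView("Error", "DataStage", timedelta(hours=12))
for event in recent_errors.open(events, now):
    print(event.timestamp, event.message)
```

Because only the criteria are saved, opening the same view later shows whichever events match at that moment.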
A user can now be allowed to view logs in a Production environment via a browser without being able to do anything else in that environment
Reporting Console
Can publish reports from DataStage to the IBM Information Server Reporting Console
Job Reports, Advanced Find, Impact Analysis, etc.
Upgrade to WebSphere DataStage v8.0
Upgrade
All objects from DataStage v7 projects upgrade into DataStage v8.0
Export projects and import them into DataStage v8.0
All jobs (Server, Parallel, Mainframe, and Sequencer) along with all other objects will migrate
Unix users can install IBM Information Server and previous versions on the same server
Note: DataStage Version Control is not included in v8.0.
Platforms
At GA
DS & QS Client:
Windows XP
Windows Server 2003
AIX 5.2, 5.3
Red Hat Enterprise Linux AS 3.0
Red Hat Enterprise Linux AS 4.0
SuSE Enterprise Linux 9, 10
HP-UX 11i1 (11.11), 11i2 (11.23) PA-RISC
Solaris 2.9, 2.10
Thank You!