DataStage Training Outline

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

PR3 Systems

2909 Colton Ct,


Lisle, IL 60532.
T 630-452-9883
F 630-364-5719
[email protected]

IBM DataStage Advanced Enterprise Edition Outline


4 day course (32 hrs)

1. Introduction to the Enterprise Edition Architecture


• Parallel processing architecture
• Partition parallelism
• Pipeline parallelism
• Enterprise Edition development environment
• Enterprise Edition framework
• Enterprise Edition job execution model

2. Compilation and Execution


• Code generation
• Viewing and understanding the generated OSH
• Stage to operator mappings
• EE runtime architecture
• Viewing and understanding the Score

3. Enterprise Edition Data Types


• Datasets
• Schemas
• Datatype mappings and conversions
• Processing external data
• Null handling
• Runtime Column Propagation (RCP)

4. Partitioning Data
• Understand how Partitioning works in the Framework
• Viewing partitioners in the Score
• Selecting partitioning algorithms
• Generate sequences of numbers (surrogate keys) in a partitioned, parallel
environment

5. Collecting and Sorting Data


• Collecting data
• Collector algorithms
• Sorting data in parallel framework
• Inserted sorts
• Optimizing Fork-Join jobs
• Using Sort stages to determine the last row

6. Buffering in Enterprise Edition


• Buffering in EE
• Buffer tuning
• Avoiding buffer contentions

7. Extending Enterprise Edition


• Create BuildOp stages
• Create new External Function Routines
• Introduce CustomOp stages

8. Best Practices and Job Design Guidelines


• Overall job design tips
• Stage usage tips

9. Database Usage
• Overall Database Guidelines
• DB2 Guidelines
• Oracle Guidelines
• Teradata Guidelines

10. Environment Variables


• Runtime environment
• Exporting environment variables
• Job monitoring variables
• Job debugging variables
• Job design variables
• Sequential file variables
• Database variables

11. Performance Tuning


• Performance tuning methodology
• Selectively Disabling Operator Combination
• Configuration file guidelines
• Impact of partitioning
• Impact of sorting
• Impact of Transformers
• Impact of buffering
IBM DataStage Server Training Outline
5 day course (40 hrs)
1. Concepts of Data Warehousing:
• Why Data Warehousing?
• Approaches to Data Warehousing.
2. Introduction to WebSphere DataStage
3. Unique Features of DataStage
4. DataStage Client-Server Architecture
5. DataStage Components
6. DataStage Administration and Configuration
• Introduction to DataStage Administrator
• Installing Plugins
• Defining User roles
• Defining Run-time default parameters
7. Designing DataStage Jobs
• Introduction to DataStage Designer
• Different types of jobs
• Stages and its usage
• Further discussion on the various types of stages
• Designing a simple job
8. Running DataStage Jobs
• Introduction to DataStage Director
• Monitoring jobs
• Job Run Logs
9. Working with Metadata
10. Constraints and Derivation
11. Creating BASIC Expressions
12. Troubleshooting
13. Defining Lookups
14. Aggregating Data
15. Job Control
16. Scheduling and Reporting
17. Optimizing Job Performance

IBM DataStage Enterprise Edition (Parallel Extender)


Training Outline
4 day course (32 hrs)
1. Introduction to Parallel Extender
2. Concepts in Parallel Processing
3. Partitioning and Collecting Data
4. Importing/Exporting Data
5. Overview of Some Parallel Extender Stages
6. Using RDBMS with Parallel Extender
7. Wrapping Unix Executables
8. Building Native Stages

IBM DataStage Design Best Practices


1 day course (8 hrs)

1. Guidelines to Effective Design


2. DataStage Execution Architecture
3. Introduction to WebServices
4. The interaction with other Scheduling Tools
5. Interfacing with Messaging Framework
6. Interfacing with SAP R/3

Please call or email to attend the FREE Introductory DataStage Overview. You
enroll only when you LIKE the course contents and the value addition.

You might also like