DataStage Detailed

This 4-day course teaches students how to build data warehousing applications using IBM DataStage. Students will learn to design, run, monitor, and optimize DataStage jobs to load data into a star schema. Topics include DataStage components, modeling the data warehouse, extracting and transforming data, working with relational databases, debugging jobs, and optimizing performance. The goal is for students to gain hands-on experience building a simple data warehouse using DataStage.

Uploaded by

Kam Pan
Copyright
© Attribution Non-Commercial (BY-NC)

DataStage Server Edition

This document is for learning purposes. Anyone who is learning DataStage can refer to it.

Duration
4 Days

The DataStage course teaches students how to create data warehousing applications that require not only standard stages but also custom-built objects. Students will gain hands-on experience by building a network of jobs to load a simple star schema data warehouse. Course topics also include DataStage standards, adding performance-enhancing stages to jobs, using the new Intelligent Assistants, and tips and tricks.

Prerequisites
None

Course Objectives
At the completion of this course you will be able to:
Design jobs
Compile and test jobs
Run jobs
Monitor a job's activities
Create and control networks of jobs
Schedule jobs
Create job reports
Optimize job performance

Course Topics

Business Intelligence and Data Warehousing
The road map to Business Intelligence (BI)
Data warehouses compared with Online Transaction Processing (OLTP)
Management information systems and decision support systems (DSS)
Business drivers for data warehouses
Typical uses of a data warehouse

Defining Data Warehouse Concepts and Terminology
Common data warehouse definitions
Data warehouse properties and characteristics
Warehouse development approaches
Components of data warehouse design and implementation
Components of a data warehouse
Data warehouse compared with data mart
Dependent and independent data marts

Modeling the Data Warehouse
Data warehouse database design phases
Defining the business model
Choosing the architecture
The dimensional model
Using time in the data warehouse
Using summary data
The physical model
Extracting, Transforming, and Loading data

Leaving a Metadata Trail
Defining warehouse metadata
Developing a metadata strategy
Metadata management tools

Introduction to DataStage
Describe typical target data systems
Describe DataStage
Identify the DataStage server and client components
Describe DataStage projects
Describe DataStage jobs
Identify the steps involved in building a DataStage job

Installing DataStage
Install the DataStage Server on Windows NT
Create a DataStage project
Install the DataStage clients

Configuring Projects
Set project properties in Administrator
Set global properties in Administrator

Working with Metadata
Describe the DataStage Manager components and functionality
Import and export DataStage objects
Import metadata for a sequential file
Load metadata into a Sequential stage

Designing and Running Jobs
Describe a DataStage job
List the steps involved in creating a job
Describe links and stages
Identify the different types of stages
Design a simple extraction and load job
Compile, validate, and run your job
Monitor the execution of your job

Working with Relational Data
Set up an ODBC connection to a relational database
Import relational metadata
Extract data from a relational table
Load data into a relational table

Constraints and Derivations
Define constraints
Define a reject link
Define derivations
Add constants, operands, and operators to a derivation
Create and use a stage variable

Creating Basic Expressions
Use DataStage BASIC operators and functions
Use system variables
Use DataStage functions and routines
Define a DataStage Transform
Manipulate dates

Troubleshooting
Troubleshoot a job using the job log file
Modify optional settings in Director
Troubleshoot a job using the DataStage Debugger

Defining Lookups
Create and access hashed files
Define hashed file lookups
Define ODBC lookups

Aggregating Data
Aggregate data using the Aggregator stage

Job Control
Define job parameters
Specify Before and After Routines
Use job control routines
Create a job that controls other jobs
Use the DataStage Job Sequencer
Build and use DataStage Job Containers

Working with Plug-Ins
Install a plug-in
Use the sort plug-in in a job

Scheduling and Reporting
Schedule jobs to run at specific dates and times
Create a project report

Optimizing Job Performance
Use performance statistics to determine reasons for job performance limitations
Build DataStage server jobs that utilize parallel processing techniques to maximize performance
