Data Stage Course Content

This document provides an overview of the course content for a Data Stage course. The course covers data warehousing concepts, ETL processes, Data Stage architecture, job design in Data Stage, working with file and database stages, data transformation techniques, performance tuning, and additional Data Stage tools. It introduces concepts like partitioning, parallel processing, slowly changing dimensions, and scheduling and monitoring jobs. The course aims to teach students how to design, develop and administer ETL jobs and processes in Data Stage.

Uploaded by

AnonymousHP

Data Stage Course content

Data warehousing concepts
_ Data warehouse & Data warehousing
_ Data warehousing Architectures
_ ETL project phases
_ ETL Process
_ ETL Architecture
_ Overview of project Architecture
_ ER, Dimensional Model & IE Notation
_ Data warehousing Teams & Roles
_ Understanding Mapping Documents
Introduction to Data Stage
_ History & Features
_ Difference between 7.5x2 & 8.x
_ IBM DS & QS 8.0.1
_ DS InfoSphere 8.5 Enhancements
_ View on Web console
_ IBM Data Stage 7.5x2 & 8.x Architecture
_ Traditional Batch Processing
_ Partition & Pipeline parallelism
_ Partitioning & repartitioning techniques
_ Combinability, Combining & Collecting Techniques
_ Configuration File & Node Components
About IBM WebSphere DS & QS
_ 7.5x2 & 8.x Enhancements and New Features
_ Parallel Processing environments (SMP, MPP etc.)
_ DS Client Components Overview
_ DS Architecture Components Overview
Data Stage Jobs & Stages
_ Difference between server & parallel jobs
_ Overview of parallel jobs, server jobs, mainframe jobs, migration jobs and sequence jobs
_ Repository, DS Designer Toolbar & Palette
_ Active & Passive stages
_ Palette Customization
_ About Link Markers
_ Framework Operators
Design, Compile & Run DS jobs
_ DS Job Design Process
_ Designer Canvas customization
_ Compile, Force Compile & multiple job compile
_ DTD & OSH code
Working with file stages
_ Sequential file & stage rules
_ Data set & types
_ File set
_ Differences between data set, file set & sequential file stages
_ Lookup file set stage
Development/Debug & STUB Stages
_ Row generator & column generator
_ Head, tail & sample stages
_ Peek stage
_ Copy stage
Working with relational databases
_ Enterprise and plug-in stages overview
_ Oracle, DB2, SQL Server stages
_ Teradata
_ ODBC (Oracle, SQL Server, Excel & Access)
_ Dynamic RDBMS stage
_ Orchestrate schema import

Processing stages

Horizontal combining
_ Join stage
_ Lookup stage (normal, sparse, range, caseless)
_ Merge stage
_ Funnel stage & funneling
Sorting & vertical combining
_ In-stage sorts (traditional sort)
_ Sort stage (complex & simple sorts)
_ Aggregator stage
_ Remove Duplicates stage
Data transformation with transformer
_ Basic transformer vs. parallel transformer
_ External functions & macros
_ Stage variables & system variables
_ Transformer constraints
_ Transformer execution order
Filtering methods
_ Constraints & source level
_ Filter stage
_ Switch stage
_ External filter stage
Other stages
_ Modify stage
_ Compress & Expand stage
_ Decode & Encode stage
_ Pivot stage
Slowly changing dimensions
_ Type 0, 1 & 2 Dimensions
_ Implementing SCD Type 1 & 2 in 8.0.1 & 7.5x2
_ SCD stage
_ Change capture & change apply stages
_ Surrogate key stage (state file & sequence object)
Performance tuning tips
_ Performance tuning with best practices
_ Performance tuning with partitioning techniques
_ Performance analysis & resource estimation
Advanced topics
_ Parameter set & Parameter file creation
_ Data connection
Job control
_ Job sequencing (run stages, error handling stages, flow control stages, etc.)
_ Batch jobs
Data Stage Designer additional features
_ Import & export of repository components
_ Node configuration
_ Generate exports
Data Stage Director
_ Schedule/run/validate a compiled job
_ Message handling (job & project level)
_ Monitor jobs, filter/purge logs
_ Unlocking jobs & customization
Web console & DS Administrator
_ Security services, session management
_ Domain management, reporting, scheduling
_ Add, delete and organize projects
Information Analyzer
_ Column, primary key, foreign key, cross-domain & baseline analysis
