0% found this document useful (0 votes)
185 views2 pages

DataStage Parallel Extender (DataStage PX)

DataStage Parallel Extender (DataStage PX) is an IBM data integration tool that can collect information from various sources, transform the data as needed, and load it into data warehouses. It has a parallel processing architecture that speeds up data integration. DataStage PX uses stages to process source data and load it into target databases, and containers and sequences to reuse jobs and run multiple jobs simultaneously.

Uploaded by

rachit
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
185 views2 pages

DataStage Parallel Extender (DataStage PX)

DataStage Parallel Extender (DataStage PX) is an IBM data integration tool that can collect information from various sources, transform the data as needed, and load it into data warehouses. It has a parallel processing architecture that speeds up data integration. DataStage PX uses stages to process source data and load it into target databases, and containers and sequences to reuse jobs and run multiple jobs simultaneously.

Uploaded by

rachit
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

DataStage Parallel Extender (DataStage PX)

Definition - What does DataStage Parallel Extender (DataStage PX) mean?


DataStage Parallel Extender (DataStage PX) is an IBM data integration tool. It is one among the many widely used extraction, transformation and loading (ETL) tools in the data warehousing industry. This tool can collect information from heterogeneous sources, perform transformations as per a business's needs and load the data into respective data warehouses. DataStage PX may also be called DataStage Enterprise Edition.

Techopedia explains DataStage Parallel Extender (DataStage PX)


DataStage Parallel Extender has a parallel architecture to process data. The two main types of parallelism implemented in DataStage PX are pipeline and partition parallelism. The ability to process data in a parallel fashion speeds up data processing to a large extent. DataStage Parallel Extender incorporates a variety of stages through which source data is processed and reinforced into target databases. These are defined in terms of terabytes. Besides stages, DataStage PX uses containers to reuse the job components and sequences to run and schedule multiple jobs at the same time. The commonly used stages in DataStage Parallel Extender include:

Transformer Aggregator Data set Copy Change apply Modify Filter Join Merge Look up

Orchestrate itself is an ETL tool with extensive parallel processing capabilities and running on UNIX platform. Datastage used Orchestrate with Datastage XE (Beta version of 6.0) to incorporate the parallel processing capabilities. Now Datastage has purchased Orchestrate and integrated it with Datastage XE and released a new version Datastage 6.0 i.e Parallel Extender.

You might also like