Day 2 (1) .1.2 DataStage Projects Life Cycle
Introduction
Requirements
Design
Build
Testing
Implementation
Support
Requirements
User views need to be modeled and aligned closely with business needs.
Identify the dependencies between all aspects of the project, such as ETL feeds and user views, to allow better control over project execution.
Performance-related requirements need to be identified and documented.
Source data analysis needs to be done to understand the type of data to be processed.
A detailed analysis/high-level design phase is required to drill down into the requirements.
Design
A fluid data model results in a lot of rework. Individual changes may be small, but they may be required in multiple places, which increases the volume of rework.
A changing data model makes metadata management difficult, and metadata management is critical for an enterprise data warehouse. Metadata needs to be extracted and loaded into DataStage every time there is a change, and this process needs significant lead time.
The design should be robust and accommodate process health features such as auditing, ACR balancing, error processing and reprocessing, restartability, and recovery.
Perform a POC on critical requirements and identify performance bottlenecks up front.
ACR checkpoints in the data flow help identify data problems early in the process, before data is loaded into the warehouse.
Design patterns should be reusable across projects to reduce development time.
Brainstorm and consider the various aspects of the framework, then finalize it and bring clarity.
A flexible framework design that takes care of recovery after downtime is critical from an application support perspective.
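The idea of a balancing checkpoint in the data flow can be sketched in a few lines. This is only an illustration, not the ACR application itself: the balancing rule (source = loaded + rejected), the `amount` field, and all function names are assumptions made for the example.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class ControlTotals:
    """Row count and amount total captured at one point in the data flow."""
    row_count: int
    amount_total: Decimal


def summarize(rows, amount_field="amount"):
    """Compute control totals for an iterable of dict records.

    The field name "amount" is hypothetical; use whatever measure the feed carries.
    """
    count = 0
    total = Decimal("0")
    for row in rows:
        count += 1
        total += Decimal(str(row[amount_field]))
    return ControlTotals(count, total)


def balance(source, target, rejected):
    """Return a list of discrepancies; an empty list means the checkpoint balances.

    Assumed rule: source = target + rejected, for both counts and amounts.
    """
    problems = []
    if source.row_count != target.row_count + rejected.row_count:
        problems.append(
            f"row count mismatch: source={source.row_count}, "
            f"target+rejected={target.row_count + rejected.row_count}"
        )
    if source.amount_total != target.amount_total + rejected.amount_total:
        problems.append(
            f"amount mismatch: source={source.amount_total}, "
            f"target+rejected={target.amount_total + rejected.amount_total}"
        )
    return problems


if __name__ == "__main__":
    extracted = [{"amount": "10.50"}, {"amount": "4.25"}, {"amount": "1.00"}]
    loaded = extracted[:2]
    rejects = extracted[2:]
    issues = balance(summarize(extracted), summarize(loaded), summarize(rejects))
    # Stop the flow before the warehouse load if the checkpoint does not balance.
    print("balanced" if not issues else issues)
```

Running a check like this after each major step, rather than only at the end, is what lets a data problem surface before the warehouse load.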
Build
Multiple stages can be used to achieve the same or a similar function. Selecting the right stage and configuration is key to developing a quality solution.
Implementation of encryption routines using the OpenSSL library (AES encryption/decryption, SHA-1 hashing, etc.) should be taken care of at the start of the phase.
Metadata is a key aspect of a successful data warehouse implementation. Standards need to be clearly defined and followed.
Accessing DataStage over a Citrix server has improved productivity to a large extent. It has also given the flexibility to try out multiple options and provide the best solution. Hence, a Citrix server should be used for accessing DataStage.
Knowledge management practices capture and disseminate information. A repository of knowledge articles, learnings, and checklists should be built from experience.
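As a rough illustration of the hashing part of such routines, the sketch below applies SHA-1 to selected columns of a record using Python's standard `hashlib` module (which is commonly built on OpenSSL). It is not the project's routine: the function names, the record layout, and the `ssn` column are hypothetical, and AES is left out because the Python standard library does not provide it.

```python
import hashlib


def sha1_hex(value, encoding="utf-8"):
    """Return the SHA-1 digest of a string as 40 hexadecimal characters."""
    return hashlib.sha1(value.encode(encoding)).hexdigest()


def hash_columns(record, columns):
    """Return a copy of the record with the named columns replaced by SHA-1 hashes.

    None values are left untouched so that nulls stay distinguishable from data.
    """
    hashed = dict(record)
    for column in columns:
        if hashed.get(column) is not None:
            hashed[column] = sha1_hex(str(hashed[column]))
    return hashed


if __name__ == "__main__":
    # Hypothetical record; "ssn" stands in for any sensitive column.
    row = {"id": 1, "ssn": "123-45-6789", "city": "Minneapolis"}
    print(hash_columns(row, ["ssn"]))
```

Settling details such as the character encoding and the handling of nulls early matters because every job that joins on a hashed column has to produce identical digests.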
Define framework for each process (like error processing, record processing,
Testing
Experience in handling large volumes of data in multiple projects, including the huge
CSPAM volumes from Target Stores
Broader understanding and good experience from innumerable challenges that we
have overcome across projects and environments, old as well as new.
Understanding the role of the various teams involved. Ability to
partner/coordinate/collaborate with multiple teams.
Testing of DataStage jobs requires a considerable amount of time. Adequate testing time should be planned.
Preparing a good test data bed is often complex and difficult. Plan well in advance.
Plan to have enough database capacity and test schemas for a smooth testing phase.
Learnings from the Target DataStage/UDB environment are critical to a successful testing phase.
Implementation
Plan in advance for the implementation phase. Collaborate with the different stakeholders to successfully implement the various aspects of the application, such as DataStage jobs, the Control-M schedule, Unix scripts, and the ACR application.
In the case of a new environment such as grmetlprod01, there needs to be a test implementation phase to iron out any environment-related surprises.
Be aware of the new processes in place for DataStage implementation, such as deployment using WBSD. This helps in resolving problems and reducing delays.
Develop a deployment checklist that can be reused across projects.
Support