0% found this document useful (0 votes)

381 views9 pages

21 Ibm Websphere Datastage Interview Questions A Answers

The document contains questions and answers about IBM Websphere DataStage. It discusses topics like renaming jobs, ODBC stages, solving past project problems, connecting to databases, using sequencers, performance tuning, and other DataStage concepts.

Uploaded by

sanjaysahu160

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

381 views9 pages

21 Ibm Websphere Datastage Interview Questions A Answers

Uploaded by

sanjaysahu160

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

IBM Websphere DataStage Interview questions & Answers

1. How do you rename all of the jobs to support your new File-naming conventions?
A: Create a Excel spreadsheet with new and old names. Export the whole project as a dsx. Write a Perl program, which can do a simple rename of the strings looking up the Excel file. Then import the new dsx file probably into a new project for testing. Recompile all jobs. Be cautious that the name of the jobs has also been changed in your job control jobs or Sequencer jobs. So you have to make the necessary changes to these Sequencers.

2. Does the selection of 'Clear the table and Insert rows' in the ODBC stage send a Truncate statement to the DB or does it do some kind of Delete logic.
A: There is no TRUNCATE on ODBC stages. It is Clear table blah blah and that is a delete from statement. On an OCI stage such as Oracle, you do have both Clear and Truncate options. They are radically different in permissions (Truncate requires you to have alter table permissions where Delete doesn't).

3. Tell me one situation from your last project, where you had faced problem and How did u solve it?
A: The jobs in which data is read directly from OCI stages are running extremely slow. I had to stage the data before sending to the transformer to make the jobs run faster. B. The job aborts in the middle of loading some 500,000 rows. Have an option either cleaning/deleting the loaded data and then run the fixed job or run the job again from the row the job has aborted. To make sure the load is proper we opted the former.

4. The above might rise another question: Why do we

1/9

IBM Websphere DataStage Interview questions & Answers

have to load the dimensional tables first, then fact tables:

A: As we load the dimensional tables the keys (primary) are generated and these keys (primary) are Foreign keys in Fact tables.

5. How will you determine the sequence of jobs to load into data warehouse?
A: First we execute the jobs that load the data into Dimension tables, then Fact tables, then load the Aggregator tables (if any).

6. What are the command line functions that import and export the DS jobs?
A: A. dsimport.exe- imports the DataStage components. B. dsexport.exe- exports the DataStage components.

7. What is the utility you use to schedule the jobs on a UNIX server other than using Ascential Director?
A: Use crontab utility along with dsexecute() function along with proper parameters passed.

8. How would call an external Java function which are not

2/9

IBM Websphere DataStage Interview questions & Answers

supported by DataStage?
A: Starting from DS 6.0 we have the ability to call external Java functions using a Java package from Ascential. In this case we can even use the command line to invoke the Java function and write the return values from the Java program (if any) and use that files as a source in DataStage job.

9. What will you in a situation where somebody wants to send you a file and use that file as an input or reference and then run job.
A: A. Under Windows: Use the 'WaitForFileActivity' under the Sequencers and then run the job. May be you can schedule the sequencer around the time the file is expected to arrive. B. Under UNIX: Poll for the file. Once the file has start the job or sequencer depending on the file.

10. Read the String functions in DS

A: Functions like [] -> sub-string function and ':' -> concatenation operator Syntax: string [ [ start, ] length ] string [ delimiter, instance, repeats ]

11. How did u connect with DB2 in your last project?

A: Most of the times the data was sent to us in the form of flat files. The data is dumped and sent to us. In some cases were we need to connect to DB2 for look-ups as an instance then we used ODBC drivers to connect to DB2 (or) DB2-UDB depending the situation and availability.

3/9

IBM Websphere DataStage Interview questions & Answers

Certainly DB2-UDB is better in terms of performance as you know the native drivers are always better than ODBC drivers. 'iSeries Access ODBC Driver 9.00.02.02' - ODBC drivers to connect to AS400/DB2.

12. What are Sequencers?

A: Sequencers are job control programs that execute other jobs with preset Job parameters.

13. How did you handle an 'Aborted' sequencer?

A: In almost all cases we have to delete the data inserted by this from DB manually and fix the job and then run the job again.

14. What are other Performance tunings you have done in your last project to increase the performance of slowly running jobs?
A: 1. Staged the data coming from ODBC/OCI/DB2UDB stages or any database on the server using Hash/Sequential files for optimum performance also for data recovery in case job aborts. 2. Tuned the OCI stage for 'Array Size' and 'Rows per Transaction' numerical values for faster inserts, updates and selects. 3. Tuned the 'Project Tunables' in Administrator for better performance. 4. Used sorted data for Aggregator. 5. Sorted the data as much as possible in DB and reduced the use of DS-Sort for better performance of jobs 6. Removed the data not used from the source as early as possible in the job. 7. Worked with DB-admin to create appropriate Indexes on tables for better performance of DS

4/9

IBM Websphere DataStage Interview questions & Answers

queries 8. Converted some of the complex joins/business in DS to Stored Procedures on DS for faster execution of the jobs. 9. If an input file has an excessive number of rows and can be split-up then use standard logic to run jobs in parallel. 10. Before writing a routine or a transform, make sure that there is not the functionality required in one of the standard routines supplied in the sdk or ds utilities categories. Constraints are generally CPU intensive and take a significant amount of time to process. This may be the case if the constraint calls routines or external macros but if it is inline code then the overhead will be minimal. 11. Try to have the constraints in the 'Selection' criteria of the jobs itself. This will eliminate the unnecessary records even getting in before joins are made. 12. Tuning should occur on a job-by-job basis. 13. Use the power of DBMS. 14. Try not to use a sort stage when you can use an ORDER BY clause in the database. 15. Using a constraint to filter a record set is much slower than performing a SELECT WHERE. 16. Make every attempt to use the bulk loader for your particular database. Bulk loaders are generally faster than using ODBC or OLE.

15. How did you handle reject data?

A: Typically a Reject-link is defined and the rejected data is loaded back into data warehouse. So Reject link has to be defined every Output link you wish to collect rejected data. Rejected data is typically bad data like duplicates of Primary keys or null-rows where data is expected.

16. If worked with DS6.0 and latest versions what are Link-Partitioner and Link-Collector used for?

5/9

IBM Websphere DataStage Interview questions & Answers

A: Link Partitioner - Used for partitioning the data. Link Collector - Used for collecting the partitioned data.

17. What are Routines and where/how are they written and have you written any routines before?
A: Routines are stored in the Routines branch of the DataStage Repository, where you can create, view or edit. The following are different types of routines: 1) Transform functions 2) Before-after job subroutines 3) Job Control routines

18. What are OConv () and Iconv () functions and where are they used?
A: IConv() - Converts a string to an internal storage format OConv() - Converts an expression to an output format.

19. How did u connect to DB2 in your last project?

A: Using DB2 ODBC drivers.

20. Do u know about METASTAGE?

A: MetaStage is used to handle the Metadata which will be very useful for data lineage and data analysis later on. Meta Data defines the type of data we are handling. This Data Definitions are stored in repository and can be accessed with the use of MetaStage.

6/9

IBM Websphere DataStage Interview questions & Answers

21. Do you know about INTEGRITY/QUALITY stage?

A: Qulaity Stage can be integrated with DataStage, In Quality Stage we have many stages like investigate, match, survivorship like that so that we can do the Quality related works and we can integrate with datastage we need Quality stage plugin to achieve the task.

22. Explain the differences between Oracle8i/9i?

A: Oracle 8i does not support pseudo column sysdate but 9i supports Oracle 8i we can create 256 columns in a table but in 9i we can upto 1000 columns(fields)

23. How do you merge two files in DS?

A: Either use Copy command as a Before-job subroutine if the metadata of the 2 files are same or create a job to concatenate the 2 files into one if the metadata is different.

24. What is DS Designer used for?

A: You use the Designer to build jobs by creating a visual design that models the flow and transformation of data from the data source through to the target warehouse. The Designer graphical interface lets you select stage icons, drop them onto the Designer work area, and add links.

7/9

IBM Websphere DataStage Interview questions & Answers

25. What is DS Administrator used for?

A: The Administrator enables you to set up DataStage users, control the purging of the Repository, and, if National Language Support (NLS) is enabled, install and manage maps and locales.

26. What is DS Director used for?

A: datastage director is used to run the jobs and validate the jobs. we can go to datastage director from datastage designer it self.

27. What is DS Manager used for?

A: The Manager is a graphical tool that enables you to view and manage the contents of the DataStage Repository

28. What are Static Hash files and Dynamic Hash files?
A: As the names itself suggest what they mean. In general we use Type-30 dynamic Hash files. The Data file has a default size of 2Gb and the overflow file is used if the data exceeds the 2GB size.

29. What is Hash file stage and what is it used for?

A: Used for Look-ups. It is like a reference table. It is also used in-place of ODBC, OCI tables for better performance.

30. Have you ever involved in updating the DS versions

8/9

IBM Websphere DataStage Interview questions & Answers

like DS 5.X, if so tell us some the steps you have taken in doing so?
A: Yes. The following are some of the steps; I have taken in doing so: 1) Definitely take a back up of the whole project(s) by exporting the project as a .dsx file 2) See that you are using the same parent folder for the new version also for your old jobs using the hard-coded file path to work. 3) After installing the new version import the old project(s) and you have to compile them all again. You can use 'Compile All' tool for this. 4) Make sure that all your DB DSN's are created with the same name as old one's. This step is for moving DS from one machine to another. 5) In case if you are just upgrading your DB from Oracle 8i to Oracle 9i there is tool on DS CD that can do this for you. 6) Do not stop the 6.0 server before the upgrade, version 7.0 install process collects project information during the upgrade. There is NO rework (recompilation of existing jobs/routines) needed after the upgrade.

31. Did you Parameterize the job or hard-coded the values in the jobs?
A: Always parameterized the job. Either the values are coming from Job Properties or from a Parameter Manager a third part tool. There is no way you will hardcode some parameters in your jobs. The often Parameterized variables in a job are: DB DSN name, username, password, dates W.R.T for the data to be looked against at.

9/9

Installer Preparation and Creating Bootable Devices
100% (5)
Installer Preparation and Creating Bootable Devices
17 pages
Nodejs TCS Fresco Play
80% (5)
Nodejs TCS Fresco Play
1 page
50+ TOP DataStage Interview Questions and Answers PDF
No ratings yet
50+ TOP DataStage Interview Questions and Answers PDF
2 pages
Data Stage Interview Questions
No ratings yet
Data Stage Interview Questions
15 pages
Problem Statement: Inventory Management System
67% (9)
Problem Statement: Inventory Management System
3 pages
What Is Difference Between Server Jobs and Parallel Jobs? Ans:-Server Jobs
No ratings yet
What Is Difference Between Server Jobs and Parallel Jobs? Ans:-Server Jobs
71 pages
Ds Questions
No ratings yet
Ds Questions
11 pages
Pipeline Parallelism 2. Partition Parallelism
No ratings yet
Pipeline Parallelism 2. Partition Parallelism
12 pages
Workload Management Server
No ratings yet
Workload Management Server
40 pages
DataStage Administration
No ratings yet
DataStage Administration
98 pages
DataStage Adminguide
0% (1)
DataStage Adminguide
40 pages
Issues Datastage
No ratings yet
Issues Datastage
4 pages
DataStage How To Kick Start
100% (2)
DataStage How To Kick Start
133 pages
Ibm Infosphere Datastage Performance Tuning: Menu
No ratings yet
Ibm Infosphere Datastage Performance Tuning: Menu
9 pages
Asset Retirement White Paper
100% (1)
Asset Retirement White Paper
18 pages
DataStage Theory Part
No ratings yet
DataStage Theory Part
18 pages
Imp Datastage New
No ratings yet
Imp Datastage New
153 pages
Looping in Datastage
No ratings yet
Looping in Datastage
7 pages
CG Datastage
No ratings yet
CG Datastage
122 pages
E-DS Administrator, Designer, Director - Other Functions
No ratings yet
E-DS Administrator, Designer, Director - Other Functions
20 pages
DataStage XML and Web Services Packs Overview
No ratings yet
DataStage XML and Web Services Packs Overview
71 pages
Data Stage Scenarios: Scenario1. Cummilative Sum
No ratings yet
Data Stage Scenarios: Scenario1. Cummilative Sum
13 pages
Data Stage
No ratings yet
Data Stage
3 pages
Datastage Errors and Resolution
No ratings yet
Datastage Errors and Resolution
10 pages
Datastage
No ratings yet
Datastage
12 pages
000 421
No ratings yet
000 421
46 pages
Datastage Performance Guide
No ratings yet
Datastage Performance Guide
108 pages
Teradata Overview
No ratings yet
Teradata Overview
64 pages
Database Tuning: Definition - What Does Workload Mean?
No ratings yet
Database Tuning: Definition - What Does Workload Mean?
27 pages
Tting Started With SAP S4HANA, On-Premise PDF
No ratings yet
Tting Started With SAP S4HANA, On-Premise PDF
17 pages
Datastage Answers
No ratings yet
Datastage Answers
3 pages
Info Sphere DataStage Parallel Framework Standard Practices
No ratings yet
Info Sphere DataStage Parallel Framework Standard Practices
460 pages
Partitioning in Datastage
No ratings yet
Partitioning in Datastage
27 pages
E-Tender System: A System Development Project On
No ratings yet
E-Tender System: A System Development Project On
92 pages
Dynamic Lookup PDF
No ratings yet
Dynamic Lookup PDF
5 pages
Common Issues in Datastage
100% (2)
Common Issues in Datastage
12 pages
Datastage Slowly Changing Dimensions
No ratings yet
Datastage Slowly Changing Dimensions
10 pages
Datastage Unixcommonds
No ratings yet
Datastage Unixcommonds
9 pages
DataStage Configuration File
No ratings yet
DataStage Configuration File
10 pages
NX Fea Tutorial
0% (1)
NX Fea Tutorial
5 pages
Ionic 5: Create Awesome Apps For iOS, Android, Desktop and Web
100% (1)
Ionic 5: Create Awesome Apps For iOS, Android, Desktop and Web
108 pages
New - Datastage Architecture
No ratings yet
New - Datastage Architecture
5 pages
Symantec DLP 15.0 Admin Guide
100% (1)
Symantec DLP 15.0 Admin Guide
1,975 pages
Tax Reporter Guide - 2009
0% (1)
Tax Reporter Guide - 2009
90 pages
Sandy's DataStage Notes
No ratings yet
Sandy's DataStage Notes
23 pages
Datastage Admin
No ratings yet
Datastage Admin
161 pages
Get Off To A Fast Start With Db2 V9 Purexml, Part 2
No ratings yet
Get Off To A Fast Start With Db2 V9 Purexml, Part 2
16 pages
Aec Novdec18 PDF
No ratings yet
Aec Novdec18 PDF
60 pages
But Why Anyone Will Need A Dynamic Cache?
No ratings yet
But Why Anyone Will Need A Dynamic Cache?
8 pages
Chapter 11
No ratings yet
Chapter 11
23 pages
Rhapsody Workshop1
No ratings yet
Rhapsody Workshop1
64 pages
3 Teradata Interview Questions and Answers
No ratings yet
3 Teradata Interview Questions and Answers
7 pages
Fuzz Testing (Fuzzing) Tutorial - What Is, Types, Tools & Example
No ratings yet
Fuzz Testing (Fuzzing) Tutorial - What Is, Types, Tools & Example
8 pages
1 11L BPM857 BuildingSimpleIntegrations PDF
No ratings yet
1 11L BPM857 BuildingSimpleIntegrations PDF
29 pages
Information Retrieval: Dr. Bassel ALKHATIB
No ratings yet
Information Retrieval: Dr. Bassel ALKHATIB
55 pages
Cisco Unified Wireless Security: Lwapp Lwapp
No ratings yet
Cisco Unified Wireless Security: Lwapp Lwapp
36 pages
Introduction To Sales Analysis Tools
No ratings yet
Introduction To Sales Analysis Tools
15 pages
CP R80.20 Gaia AdminGuide
No ratings yet
CP R80.20 Gaia AdminGuide
343 pages
Memory Application
No ratings yet
Memory Application
26 pages
Xitel Mdportdg2
No ratings yet
Xitel Mdportdg2
11 pages
NDLM Exam Guide by Gait Ictl
No ratings yet
NDLM Exam Guide by Gait Ictl
7 pages
2 PPT Security Audit Vault Reports Tech Screens
No ratings yet
2 PPT Security Audit Vault Reports Tech Screens
9 pages
Inurl
No ratings yet
Inurl
4 pages
Ensayo Ley 30 de 1992
No ratings yet
Ensayo Ley 30 de 1992
11 pages
Centos Installation and Docker
No ratings yet
Centos Installation and Docker
5 pages
Pavan: Professional Summary
No ratings yet
Pavan: Professional Summary
4 pages
Computer Delivered Testing: Network and Bandwidth Requirements
No ratings yet
Computer Delivered Testing: Network and Bandwidth Requirements
3 pages
Mongo DB
No ratings yet
Mongo DB
3 pages
University of Dar Es Salaam Department of Informatics Is 263: Database Concepts Test 1 (15 Marks)
No ratings yet
University of Dar Es Salaam Department of Informatics Is 263: Database Concepts Test 1 (15 Marks)
1 page
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
IBM InfoSphere Replication Server and Data Event Publisher
From Everand
IBM InfoSphere Replication Server and Data Event Publisher
Pav Kumar-Chatterjee
No ratings yet
HBase Administration Cookbook
From Everand
HBase Administration Cookbook
Yifeng Jiang
No ratings yet
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
From Everand
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
Adam Jones
No ratings yet
IBM WEBSPHERE Frequently Asked Questions
From Everand
IBM WEBSPHERE Frequently Asked Questions
equitypress
1/5 (1)
Oracle Essbase 11 Development Cookbook
From Everand
Oracle Essbase 11 Development Cookbook
Jose R. Ruiz
No ratings yet
Beginning Microsoft SQL Server 2012 Programming
From Everand
Beginning Microsoft SQL Server 2012 Programming
Paul Atkinson
1/5 (1)
HDInsight Essentials - Second Edition
From Everand
HDInsight Essentials - Second Edition
Rajesh Nadipalli
No ratings yet
Oracle Essbase 9 Implementation Guide
From Everand
Oracle Essbase 9 Implementation Guide
Joseph Sydney Gomez
No ratings yet
Oracle SOA BPEL Process Manager 11gR1 A Hands-on Tutorial
From Everand
Oracle SOA BPEL Process Manager 11gR1 A Hands-on Tutorial
Ravi Saraswathi
5/5 (1)
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
Instant Pentaho Data Integration Kitchen
From Everand
Instant Pentaho Data Integration Kitchen
Sergio Ramazzina
No ratings yet
Monitoring Hadoop
From Everand
Monitoring Hadoop
Gurmukh Singh
No ratings yet
Oracle OBIEE Interview Q & A
From Everand
Oracle OBIEE Interview Q & A
Mohammed Azizuddin Aamer
3/5 (1)
Unofficial TIBCO® Business Works™ Interview Questions, Answers, and Explanations: TIBCO Certification Review Questions
From Everand
Unofficial TIBCO® Business Works™ Interview Questions, Answers, and Explanations: TIBCO Certification Review Questions
equitypress
3.5/5 (2)
Lotus Domino Interview Questions, Answers, and Explanations: Lotus Domino Certification Review
From Everand
Lotus Domino Interview Questions, Answers, and Explanations: Lotus Domino Certification Review
Equity Press
No ratings yet
JSP-Servlet Interview Questions You'll Most Likely Be Asked
From Everand
JSP-Servlet Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
ORACLE 12C Complete Self-Assessment Guide
From Everand
ORACLE 12C Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Oracle Exadata Complete Self-Assessment Guide
From Everand
Oracle Exadata Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
TIBCO Software The Ultimate Step-By-Step Guide
From Everand
TIBCO Software The Ultimate Step-By-Step Guide
Gerardus Blokdyk
No ratings yet
Oracle Data Guard A Clear and Concise Reference
From Everand
Oracle Data Guard A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet

21 Ibm Websphere Datastage Interview Questions A Answers

Uploaded by

21 Ibm Websphere Datastage Interview Questions A Answers

Uploaded by

IBM Websphere DataStage Interview questions & Answers

4. The above might rise another question: Why do we

IBM Websphere DataStage Interview questions & Answers

have to load the dimensional tables first, then fact tables:

8. How would call an external Java function which are not

IBM Websphere DataStage Interview questions & Answers

10. Read the String functions in DS

11. How did u connect with DB2 in your last project?

IBM Websphere DataStage Interview questions & Answers

12. What are Sequencers?

13. How did you handle an 'Aborted' sequencer?

IBM Websphere DataStage Interview questions & Answers

15. How did you handle reject data?

IBM Websphere DataStage Interview questions & Answers

19. How did u connect to DB2 in your last project?

20. Do u know about METASTAGE?

IBM Websphere DataStage Interview questions & Answers

21. Do you know about INTEGRITY/QUALITY stage?

22. Explain the differences between Oracle8i/9i?

23. How do you merge two files in DS?

24. What is DS Designer used for?

IBM Websphere DataStage Interview questions & Answers

25. What is DS Administrator used for?

26. What is DS Director used for?

27. What is DS Manager used for?

29. What is Hash file stage and what is it used for?

30. Have you ever involved in updating the DS versions

IBM Websphere DataStage Interview questions & Answers

You might also like