75% found this document useful (4 votes)
2K views2 pages

Generic ETL Questionnaire

The document contains a questionnaire for gathering requirements for an ETL (extract, transform, load) project. It includes questions about the existing system, estimated needs for the new system, and operational details. Key points are: - The primary business requirement and users of the current system are identified. - Questions are asked about incremental data handling, projected data growth, and out-of-scope needs. - Details of the current ETL process, architecture, automation, and documentation are requested. - Estimation questions focus on aggregate tables needed, number of ETL routines/mappings, source systems, data sharing agreements, staging area design, extraction/loading methods, scheduling, and performance constraints.

Uploaded by

ronnyb13119549
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
75% found this document useful (4 votes)
2K views2 pages

Generic ETL Questionnaire

The document contains a questionnaire for gathering requirements for an ETL (extract, transform, load) project. It includes questions about the existing system, estimated needs for the new system, and operational details. Key points are: - The primary business requirement and users of the current system are identified. - Questions are asked about incremental data handling, projected data growth, and out-of-scope needs. - Details of the current ETL process, architecture, automation, and documentation are requested. - Estimation questions focus on aggregate tables needed, number of ETL routines/mappings, source systems, data sharing agreements, staging area design, extraction/loading methods, scheduling, and performance constraints.

Uploaded by

ronnyb13119549
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

Questionnaire for ETL Requirements Gathering and Analysis

Review Reference No.: Review Reference Documents: Review Date:

Sl. No. General Questions 1. 2. 3. 4. 5. 6.

Questionaire

Response

What is the Primary Business Requirement of this system? Who are the Business Groups/ Users of the system? Any strategy in place to handle incremental data and SCD? What is the Projected growth of DWH? Pls specify 'out of scope' requirements Need client contacts for any clarifications

Questions on Existing System 7. 8. 9. Please explain the current process / methodology followed in the existing system Please explain the current ETL architecture with the breakup of Development servers, QA servers and Production servers Is the system fully automated or any kind of manual intervention required (Ex: during extraction, data load etc). How about the new system? Any documentation available related to the existing system? Please provide access to the same Any project prototyping done? If yes, give details Any problems with mappings and resolution or any architectural challenges Are there any known data quality issues? Are there any issues/bottlenecks related to the ETL Process? Please indicate the number of existing Informatica mappings Please indicate the complexity distribution of current ETL mappings Please indicate if the current ETL jobs are pulling data from the source applications or it is being pushed into Informatica What would be the approximate volume of the data in the database? What is the batch load window being used today

10.
11. 12. 13. 14. 15. 16. 17. 18. 19.

Questionnaire for ETL Requirements Gathering and Analysis


Sl. No. 20. Questionaire What is the database system used? Response

Questions on Estimation 21. 22. 23. 24. 25. 26. Whether aggregate tables need to be created? If yes, how many and what subject areas? How many ETL routines/mappings required? Pls classify with complexity as simple, medium & complex. Definition of Simple , Medium & Complex. Pls provide the source system details, Name, Platform, Description Any Data Sharing Agreements with Source Data Owners needed? Staging Area : What is the design of the Staging area? How much of data is retained in the Staging area? Are there any, Aggregations Calculations Denormalizations Business Rules to be applied in the ETL transformations? If yes pls provide details. What is the type of extraction Full / Incremental? If incremental, how do you identify data what data has changed? Are you using any specific tool for this? What is the volume of incremental data? What is the loading Mechanism to be used Bulk Load/ Update-Insert. ETL Schedule Daily/Weekly/Monthly and scheduling process Any performance constraints like Time window for data Extraction / Transformations / Loading? What should be the strategy on ETL Monitoring Processes? Error Handling Exception Handling Level of Logging Notification process What is the Security architecture of the application Is the security at the application level, report level or data level?

27. 28. 29. 30. 31. 32.

33. 34.

You might also like