Building Databases
www.ue.edu.ph
BUILDING THE DATABASE
Building the database is one of the most difficult and complex parts of
implementing a data warehouse environment.
ETL (Extract, Transform and Load)
EXTRACT, TRANSFORM AND LOAD (ETL)
[Figure: Source -> ETL -> Data Warehouse]
ETL SYSTEM REQUIREMENTS
● Based on Data Profiling
● Processing Rules
● Guidelines for Compliance with Legal Requirements
● Processing Window
● Audit Trail Specifications
ETL SYSTEM DESIGN
● Specific Rules for Building the Dimension
● Fact Tables
● Data Element Definitions
● Data Mapping Audit Trail
● Backup/Recovery Capabilities
ETL SYSTEM CONSTRUCTION
● Includes writing programs or using technology to perform the work
● Work can be divided up among different people or even different teams
● Using the cohesive design, each team can work on its part
ETL SYSTEM TESTING
● It is important to conduct thorough testing
● Identify any bottlenecks and improve the system's performance
● To prepare for testing, a series of test cases needs to be developed to provide realistic conditions to determine whether the system is working properly
● Test cases must represent actual business situations and need to be defined by representatives from the business community
EXTRACT, TRANSFORM AND LOAD (ETL)
[Figure note: Include Backup and Recovery]
System Functionality: What Exactly Does an ETL System Do?
The system will provide several common functions. This section
describes the functionality and flow of an ETL system that
directly populates dimensional data structures, called
presentation servers or data marts. Additional steps
would be needed to populate a normalized data
warehouse, but most of the functionality would be
similar.
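The extract, transform and load flow described above can be sketched in a few lines of Python. This is only a minimal illustration, not a production design: the source rows, the field names, and the dim_customer table are assumptions made for the example, and an in-memory SQLite database stands in for the data mart.

```python
import sqlite3

# Hypothetical source rows; the field names are illustrative assumptions.
SOURCE_ROWS = [
    {"customer_id": "C001", "name": "  ana cruz ", "country": "ph"},
    {"customer_id": "C002", "name": "Ben Reyes", "country": "PH"},
]


def extract(rows):
    """Extract: read records from the source (here, an in-memory list)."""
    return list(rows)


def transform(rows):
    """Transform: trim and standardize values before loading."""
    return [
        {
            "customer_id": r["customer_id"],
            "name": r["name"].strip().title(),
            "country": r["country"].upper(),
        }
        for r in rows
    ]


def load(conn, rows):
    """Load: write the cleaned rows into a dimension table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dim_customer ("
        "customer_id TEXT PRIMARY KEY, name TEXT, country TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO dim_customer "
        "VALUES (:customer_id, :name, :country)",
        rows,
    )
    conn.commit()


def run_etl(conn, source_rows):
    """Run the three stages in order."""
    load(conn, transform(extract(source_rows)))


# An in-memory database stands in for the data mart in this sketch.
conn = sqlite3.connect(":memory:")
run_etl(conn, SOURCE_ROWS)
loaded = conn.execute(
    "SELECT customer_id, name, country FROM dim_customer ORDER BY customer_id"
).fetchall()
```

Keeping the three stages as separate functions mirrors the way the construction work can be divided among people or teams.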
[Figure: Source -> ETL -> Data Warehouse]
[Figure: ETL with example operations: "First name, Middle Name"; select all "PH"; 10000 to 50000. Surviving caption fragment: "represents a wide variety of [...] performed to [...]"]
[Figure: High-level customer dimension data flow diagram]
Inputs: Marketing Customer Table; Sales Customer Table; Accounting Customer Master; List of Customers with Transactions
Decision and step labels: Do we know the customer ID? (Yes); Check for Duplication; Build Customer; Validate Attributes; Reference Data; Has this customer changed? (Yes); Build Changed
Callout: Transformation represents a wide variety of functions
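The labels in the customer dimension diagram (known customer ID, duplication check, attribute validation against reference data, change detection) suggest a flow along the following lines. This is a hedged sketch only: the arrows of the diagram did not survive, so the order of the steps, the function names, the record fields, and the VALID_COUNTRIES reference set are all assumptions made for illustration.

```python
VALID_COUNTRIES = {"PH", "US"}  # assumed stand-in for the "Reference Data" box


def validate_attributes(record):
    """Return a cleaned copy, or raise if an attribute fails reference checks."""
    country = record["country"].strip().upper()
    if country not in VALID_COUNTRIES:
        raise ValueError(f"unknown country: {country!r}")
    return {
        "customer_id": record["customer_id"],
        "name": record["name"].strip(),
        "country": country,
    }


def process_customers(incoming, dimension):
    """Classify incoming records against the existing dimension.

    dimension maps customer_id -> current attributes and is updated in place.
    Returns (new_ids, changed_ids, duplicate_count).
    """
    new_ids, changed_ids, duplicates = [], [], 0
    seen = set()
    for raw in incoming:
        record = validate_attributes(raw)
        key = record["customer_id"]
        if key in seen:  # check for duplication within the batch
            duplicates += 1
            continue
        seen.add(key)
        if key not in dimension:  # customer ID not known yet: build customer
            dimension[key] = record
            new_ids.append(key)
        elif dimension[key] != record:  # has this customer changed?
            dimension[key] = record  # build changed row (simple overwrite)
            changed_ids.append(key)
    return new_ids, changed_ids, duplicates


dimension = {"C001": {"customer_id": "C001", "name": "Ana Cruz", "country": "PH"}}
incoming = [
    {"customer_id": "C001", "name": "Ana Cruz-Santos", "country": "ph"},
    {"customer_id": "C002", "name": "Ben Reyes", "country": "PH"},
    {"customer_id": "C002", "name": "Ben Reyes", "country": "PH"},
]
result = process_customers(incoming, dimension)
```

The changed row is simply overwritten here. A real dimension might keep history instead, which is a design decision this sketch does not make.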
[Figure: High-level sales fact table data flow diagram [1]. Surviving label: "Convert Cases to [...] data"]
OTHER TASKS DURING TRANSFORMATION
EXTRACT, TRANSFORM AND LOAD (ETL)
[Figure: ETL, Data Warehouse, User Cubes]
Fundamentals of Data Warehouse
ROLE OF BUSINESS IN ETL
● Provide the data, information and requirements necessary for building the data warehouse
● Provide the guidance and critical decisions needed by the business
[Figure: decision example. "Are we going to push the sales?" Yes: "Do this…" No: "Do that…"]
DEFINING THE BUSINESS RULES
Once the roles are clearly stated, the business must also
define and provide the rules for processing the
data, and help pick the data source.
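One way to keep business-defined rules visible is to hold them as data, separate from the code that applies them. The sketch below is only an illustration: the two rules, the row fields, and the apply_rules helper are hypothetical and are not taken from the slides.

```python
# Hypothetical rules a business representative might define; names are illustrative.
BUSINESS_RULES = [
    ("amount must not be negative", lambda row: row["amount"] >= 0),
    ("region must be filled in", lambda row: bool(row["region"].strip())),
]


def apply_rules(rows, rules=BUSINESS_RULES):
    """Split rows into accepted rows and rejected (row, reasons) pairs."""
    accepted, rejected = [], []
    for row in rows:
        # Collect the name of every rule this row fails.
        reasons = [name for name, check in rules if not check(row)]
        if reasons:
            rejected.append((row, reasons))
        else:
            accepted.append(row)
    return accepted, rejected


rows = [
    {"order_id": 1, "amount": 250.0, "region": "North"},
    {"order_id": 2, "amount": -10.0, "region": ""},
]
accepted, rejected = apply_rules(rows)
```

Because each rejected row carries the names of the rules it failed, the business can review the rejections in its own terms.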
DEFINING THE EXPECTED RESULTS: TEST PLAN FOR ETL
Knowing the expected results is necessary to ensure the data is correct.
Questions the test plan should answer:
● Is this tested?
● Is it completed?
● Does it have the right configuration?
● Do we run a stress test?
● What happens when this fails?
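A test plan built on expected results can be written as a table of inputs paired with the output the business expects. The following is a minimal sketch under assumptions: full_name is a hypothetical transformation chosen for the example, and the cases are invented, not real business data.

```python
def full_name(first, middle, last):
    """Hypothetical transformation under test: build a display name."""
    parts = [first.strip(), middle.strip(), last.strip()]
    return " ".join(p for p in parts if p)


# Each case pairs an input with the result the business expects.
TEST_CASES = [
    (("Ana", "M.", "Cruz"), "Ana M. Cruz"),
    (("Ben", "", "Reyes"), "Ben Reyes"),
    ((" Carla ", " ", "Diaz"), "Carla Diaz"),
]


def run_test_plan(func, cases):
    """Return a list of (inputs, expected, actual) for every failing case."""
    failures = []
    for inputs, expected in cases:
        actual = func(*inputs)
        if actual != expected:
            failures.append((inputs, expected, actual))
    return failures


failures = run_test_plan(full_name, TEST_CASES)
```

An empty failures list means every case produced its expected result. Any entry shows the input, what was expected, and what the system actually returned.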
APPLYING NEW BUSINESS RULES
The process and its results must be validated to ensure that the new
rules have been implemented.
[Figure: Old Process and Systems]
Working Toward Long‐Term Solutions
[Figure: Quick fix. Labels: ETL, Error Code 12345, Redeploy ETL, 12345]
Tracking Progress—Are We There Yet?
What Else Can You Do to Help?
• Encourage,
• Acknowledge and
• Support your team!!!
Proactive Communication
The project manager is the spokesperson who shares progress,
concerns, and changes in timeline, deliverables, or costs.
Ensuring Continued Business Participation
After the work has been done, developers are still needed to support the
deployment if errors or bugs occur in production.
Why?
Because they are the only ones who can make the technical fixes and
enhancements.
DEVELOPMENT SUPPORT
Once the design works, development begins.
BAFWARE: Fundamentals of Data Warehouse
TESTING THE ETL
Testing ensures that the data provided by the system is
accurate and correct.
[Figure: ETL between points 1 and 2]
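One common way to check accuracy is to reconcile the data before the ETL with the data after it. The sketch below assumes that points 1 and 2 in the figure stand for the source and the target, which the slide does not state. The reconcile helper and the field names are hypothetical, and amounts are kept as integer cents so the totals compare exactly.

```python
def reconcile(source_rows, target_rows, key, amount_field):
    """Compare source and target on row count, keys, and a summed amount."""
    source_keys = {r[key] for r in source_rows}
    target_keys = {r[key] for r in target_rows}
    source_total = sum(r[amount_field] for r in source_rows)
    target_total = sum(r[amount_field] for r in target_rows)
    return {
        "row_count_match": len(source_rows) == len(target_rows),
        "missing_in_target": sorted(source_keys - target_keys),
        "unexpected_in_target": sorted(target_keys - source_keys),
        "amount_match": source_total == target_total,
    }


# Illustrative data: one source row never reached the target.
source = [
    {"id": 1, "amount_cents": 1000},
    {"id": 2, "amount_cents": 2500},
]
target = [
    {"id": 1, "amount_cents": 1000},
]
report = reconcile(source, target, key="id", amount_field="amount_cents")
```

Here the report flags the row with id 2 as missing in the target, which is the kind of discrepancy testing should surface before users see the data.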
ETL TOOLS
Cognos Accelerator
Copy Manager, Data Migrator for SAP, PeopleSoft (Information Builders)
DataPropagator (IBM)
ETI Extract (Evolutionary Technologies)
Sagent Solution (Sagent Technology)
PowerMart (Informatica)…
REPORT AND QUERY TOOLS