Module - 2 Business Data Warehousing
Module - 2 Business Data Warehousing
Learning Objectives:
To comprehend the purpose of and the nature of a data warehouse
A thorough examination of the development of data warehouses across time
To understand what makes up a data warehouse
To comprehend the distinction between data and information
To comprehend the distinction between OLTP and OLAP
Module - 2: Business Data Warehousing
Two major factors driving data warehousing in most organizations today are:
• Integrated View of Information: A business requires an integrated, company-wide, view of high-quality information
• Information Vs. Operational Systems: The information systems department must separate informational from
operational systems to improve performance in managing company data.
Module - 2: Business Data Warehousing
STUDENT EMPLOYEE
StudentNo Address Dept. Hours ***
124-45-6789 1218 Elk Drive MIS 8
987-65-4123 256 Mesa Drive BUAN 10
STUDENT HEALTH
StudentName Telephone Insurance ID ***
Britney T. DeLuke 883-8881 BCBS 124-45-6789
Eric M. Smith 777-1234 ? 987-65-4123
Module - 2: Business Data Warehousing
Run the business data: Produced by corporate applications such as the one your company uses to fill customer orders for its
products or the one your company uses to manage the financial transaction. The new materials for a data warehouse.
Integrate the business data: Built to improve the quality of and synchronize two or more corporate applications. Such as a
master list of customers. Data leveraged to integrate applications that weren’t designed to work with each other.
Monitor the business data: Presented to end users for reporting and decision support, such as your financial dashboard. The
data is cleaned to enable users to better understand progress and evaluate the cause-and-effect relationship in the data.
Module - 2: Business Data Warehousing
• The team in charge of building the data warehouse assigns a group of business users and other key individuals within the
company to play the role of subject-matter-expert
• The group goes through the list of information (data assets), item by item, and figures out where the data warehouse can
obtain that particular piece of data (raw material)
• After completing the details of where the business can get each piece of information, the data warehousing team creates
extraction programs
Module - 2: Business Data Warehousing
Extraction Programs:
• Extraction programs collect data from various internal databases and files, copy certain data to a staging area (a work
area outside the data warehouse), cleanse the data to ensure that the data has no errors, and then copy the higher-
quality data (data assets) into a data warehouse
• Extraction programs are created either by hand (custom coded) or by using specialized data warehousing products – ETL
(extract, Transform, and load) tools
Module - 2: Business Data Warehousing
Historical Perspective
of Data Warehousing
Module - 2: Business Data Warehousing
• Consolidate data across the organization to get a consistent and agreed view
Benefits:
• Better data: Adding data sources to a data warehouse enables organizations to ensure that they are collecting consistent
and relevant data from that source. This ensures higher data quality and integrity for sound decision-making.
• Faster decisions — Data in a warehouse is consistent and is ready to be analyzed. It provides the analytical power and a
more complete dataset to base decisions on hard facts. Thus, decision-makers no longer need to rely on hunches,
incomplete data, or poor-quality data and risk delivering slow and inaccurate results.
Module - 2: Business Data Warehousing
Progression of Systems:
PERSONAL COMPUTERS AND 4GL TECHNOLOGY - To placate the end user’s cry for accessing data, two technologies
emerged—personal computer technology and 4GL technology.
• Personal computer technology allowed anyone to bring his/her own computer into the corporation and to do
his/her own processing at will. Personal computer software such as spreadsheet software appeared. In addition, the
owner of the personal computer could store his/her own data on the computer.
• At about the same time, along came a technology called “ 4GL ” — fourth-generation technology. The idea behind
4GL technology was to make programming and system development so straightforward that anyone could do it.
Module - 2: Business Data Warehousing
• The frustration of the end user, the IT professional, and the management resulted in a movement to a different
information architecture.
• That information systems architecture was one that centered around a data warehouse
Module - 2: Business Data Warehousing
Data warehouse lite: A relatively straightforward implementation of a modest scope (often, for a small user group or team)
in which you don’t go out on any technological limbs; almost a low-tech implementation.
Data warehouse deluxe: A standard data warehouse implementation that uses advanced technologies to solve complex
business information and analytical needs across a broader user population.
Data warehouse supreme: A data warehouse that has large-scale data distribution and advanced technologies that can
integrate various “ run the business” systems, improving the overall quality of the data assets across business information
analytical needs, and transactional needs.
Module - 2: Business Data Warehousing
SAP BW Evolution:
Module - 2: Business Data Warehousing
SAP BW Architecture:
Module - 2: Business Data Warehousing
e) Data Marts:
• A data mart is an access layer that is used to get data out to the users.
• It is presented as an option for large-size data warehouses as it takes less time and money to build.
Module - 2: Business Data Warehousing
Data vs Information:
Data Information
OLTP vs OLAP:
IT systems can be divided into –
• Transactional (OLTP)
• Analytical (OLAP)
OLTP systems provide source data to data warehouses, whereas OLAP systems help to analyze it.
Module - 2: Business Data Warehousing
Learning Outcomes:
You now understand the purpose and nature of a data warehouse
You are aware of the development of data warehouses across time
You are familiar with a data warehouse's components
You can list the distinctions between data and information
You can list the distinctions between OLTP and OLAP