Data Warehousing and DSS
Data Warehousing and DSS
Decision Support
Systems
UNIT II
What is Data Warehousing?
• Data warehousing is the process of collecting, storing, and managing
large volumes of data from various sources in a centralized repository.
• This repository, known as a data warehouse, is designed to support
business intelligence (BI) activities, such as reporting, analysis, and
decision-making.
• Data warehousing is a critical component of modern business
intelligence strategies, helping organizations to efficiently manage,
analyze, and leverage their data for better decision-making.
Key Features of Data
Warehousing:
• Centralized Repository: Data from different sources (e.g., databases, spreadsheets,
external data) is consolidated into a single location, making it easier to manage and
analyze.
• Historical Data Storage: Data warehouses typically store historical data, allowing
organizations to analyze trends over time.
• Data Integration: Data from various sources is often inconsistent in format or structure.
The data warehousing process involves transforming this data into a consistent format.
• Query and Analysis: Data warehouses are optimized for complex queries and data
analysis, rather than just transaction processing. This enables efficient and fast retrieval
of data for reports and analysis.
• Support for Decision-Making: By providing access to integrated and historical data,
data warehouses help organizations make informed decisions.
Components of a Data
Warehouse:
• ETL Process (Extract, Transform, Load): The process of extracting data
from various sources, transforming it into a consistent format, and
loading it into the data warehouse.
• Data Storage: The actual storage of data in the warehouse, usually in a
format optimized for query performance.
• Metadata: Information about the data stored in the warehouse, such as
data definitions, mappings, and relationships, which helps in
understanding and managing the data.
• Data Access Tools: Tools used by business analysts and decision-makers
to access and analyze the data, such as SQL queries, reporting tools, and
dashboards.
Benefits of Data Warehousing:
• Improved Data Quality and Consistency: By consolidating data from
multiple sources, data warehouses ensure that data is accurate,
complete, and consistent.
• Enhanced Business Intelligence: Data warehouses provide a
foundation for advanced data analysis and reporting, enabling
organizations to gain deeper insights.
• Scalability: Data warehouses are designed to handle large volumes of
data, making them suitable for growing organizations.
Designing and Developing Data
Warehouses
• Designing and developing a data warehouse involves a series of
structured steps that ensure the data warehouse meets the specific
needs of the organization. Below is an overview of the key phases in
the process:
Key Phases of designing and
developing a Data Warehouse
• 1. Requirements Gathering
• Identify Business Requirements: Understand the goals of the data
warehouse by engaging with stakeholders to determine the types of
reports, analyses, and business insights they need.
• Define Data Sources: Identify all the data sources (e.g., databases,
CRM systems, external data feeds) that will feed into the data
warehouse.
2. Data Modeling
• https://www.ibm.com/docs/en/informix-servers/12.10?topic=databases-overview-data
-warehousing
• Overview of data warehousing
• https://www.youtube.com/watch?v=qB0vspslPn4
• How I helped this brand with a simple dashboard
• https://www.youtube.com/watch?v=YSriO71a4Ac
• Case Study: Clinical Decision Support Systems