Assignment 2 DWM
Assignment 2 DWM
facilitate complex data analysis and provide insights through a multidimensional approach. Here are
the key components and principles that define OLAP systems:
1. Multidimensional Data Model: OLAP systems are built on a multidimensional data model,
allowing users to view and analyze data from multiple perspectives (dimensions). Common
dimensions include time, geography, product categories, and customer segments. This model
enables operations like slicing, dicing, rolling up, and drilling down through data.
2. OLAP Cubes: Data in OLAP systems is often organized into cubes, which are multidimensional
arrays that store aggregated data. Each cell in the cube represents a measure (fact) related to
the dimensions. This structure allows for fast retrieval and analysis of data.
3. Data Aggregation: OLAP systems pre-calculate and store aggregated data, which speeds up
query response times. This is particularly useful for complex queries that involve
summarizing large datasets.
4. User-Friendly Interface: OLAP tools typically provide intuitive interfaces that allow users to
manipulate data easily through point-and-click or drag-and-drop functionalities. This
accessibility enables non-technical users to perform analyses without needing extensive
programming knowledge.
5. Support for Complex Queries: OLAP systems are designed to handle complex analytical
queries efficiently. They allow users to perform operations such as trend analysis,
forecasting, and what-if scenarios, which are essential for strategic decision-making.
6. Data Integration: OLAP systems can integrate data from various sources, including relational
databases, data warehouses, and external data sources. This integration provides a
comprehensive view of the organization's data.
7. Performance Optimization: OLAP systems are optimized for fast query performance, even as
the volume of data and the number of dimensions increase. Techniques such as indexing,
caching, and partitioning are often employed to enhance performance.
8. Multiuser Support: OLAP systems are designed to support multiple users accessing the data
concurrently while ensuring data integrity and security. This is crucial for organizations where
various departments need to analyze data simultaneously.
9. Flexibility and Customization: Users can define their dimensions, hierarchies, and
calculations, allowing for tailored analyses that meet specific business needs.
10. Real-Time Analysis: Some OLAP systems support real-time data analysis, enabling users to
make informed decisions based on the most current data available.
The main characteristics of OLAP (Online Analytical Processing) are as follows:
1. Multidimensional Conceptual View: OLAP systems provide a dimensional and logical view of
data, allowing users to perform slice and dice operations effectively 1.
2. Multi-User Support: OLAP operations support multiple users and provide standard database
operations, including retrieval, updates, adequacy control, integrity, and security 1.
3. Accessibility: OLAP acts as a mediator between data warehouses and front-end applications,
facilitating access to data 1.
4. Storing OLAP Results: Results generated by OLAP are stored separately from the original
data sources 1.
6. Handling of Zero and Missing Values: OLAP distinguishes between zero values and missing
values to ensure accurate aggregate computations 1.
7. Interactive Query and Complex Analysis: OLAP facilitates interactive querying and complex
analysis for users, enhancing their ability to derive insights from data 1.
8. Drill Down and Roll Up Capabilities: Users can drill down for more detailed information or
roll up for aggregations along a single business dimension or across multiple dimensions 1.
9. Intricate Calculations and Comparisons: OLAP allows users to perform complex calculations
and comparisons on the data 1.
10. Meaningful Result Presentation: OLAP presents results in various meaningful formats,
including charts and graphs, to aid in data interpretation 1.
1. Relational Online Analytical Processing (ROLAP) :
ROLAP servers are placed between relational backend server and client front-end tools. It uses
relational or extended DBMS to store and manage warehouse data. ROLAP has basically 3 main
components: Database Server, ROLAP server, and Front-end tool.
Advantages of ROLAP –
Disadvantages of ROLAP –
Advantages of MOLAP –
Disadvantages of MOLAP –
Advantages of HOLAP –
Advantages:
Disadvantages:
Advantages:
Disadvantages:
Advantages:
Disadvantages:
More complex architecture due to the integration of both ROLAP
and MOLAP.
Additionally, the document mentions other less common OLAP types, such as Web-Enabled OLAP
(WOLAP), Desktop OLAP (DOLAP), Mobile OLAP (MOLAP), and Spatial OLAP (SOLAP), each catering to
specific user needs and environments.
In summary, the choice of OLAP type depends on the specific requirements of data handling,
performance, and investment capabilities of an organization.
o Data Storage: ROLAP uses relational databases to store data. It relies on the existing
relational database management systems (RDBMS) to handle data storage and
management.
o Data Processing: It generates SQL queries to retrieve data from the relational
database, allowing for dynamic data analysis. This means that ROLAP can handle
large volumes of data without the limitations of pre-aggregated data.
o Data Storage: MOLAP stores data in multidimensional cubes, which are optimized
for fast retrieval and analysis. The data is pre-aggregated and stored in a proprietary
format.
o Data Processing: Since calculations are performed when the cube is created, MOLAP
can provide quick responses to complex queries and calculations.
o Limitations: It can struggle with very large datasets due to the constraints of cube
storage, as it requires significant memory and processing power to manage the
cubes.
o Data Storage: HOLAP combines the features of both ROLAP and MOLAP. It stores
detailed data in relational databases while keeping aggregated data in
multidimensional cubes.
o Data Processing: HOLAP allows users to drill down from aggregated data in the cube
to detailed data in the relational database, providing a comprehensive view of the
data.
o Efficiency: This hybrid approach balances the need for fast access to aggregated data
with the ability to handle large volumes of detailed data without duplication.