Bi Unit 2
Bi Unit 2
They are mainly concerned with entry, storage and retrieval of data.
They support online transactions and query
processing. ex: supermarkets, banking, airlines,
insurance etc
They cover day-to-day operations of an organization like purchasing,
inventory, manufacturing, payroll, accounting etc.
ex: point of sales at a supermarket is a OLTP system.
• OLTP systems include transactions on databases like
1) INSERT( in a supermarket a record of final purchase is added to database)
2) UPDATE ( price of a product is raised from 100 INR to 200 INR)
3) DELETE ( if a product is out of demand then the store removes it from the
shelf and hence the database)
OLTP
• Consider a supermarket database that consists of following tables to manage its
data about products, employees, inventory supplies etc.
• Transactions table
• ProductMaster table
• EmployeeDetails table
• InventorySupplies table
• Suppliers
Considering the sample supermarket database, the Queries that an OLTP system can process
are:
Considering the sample supermarket database, the Queries that OLTP systems
cannot answer:
• Which new product the supermarket should introduce?
• Should the new product be specific to few customer segments?
• How much discount the supermarket should offer at their year-end sale?
• Should different discounts be given to different customer segments?
• How to identify the most consistent salesperson depending on various parameters?
ALL the above questions require some analysis that OLTP is unable to provide.
• OLAP deals with Historical Data or Archival Data, and it is characterized by relatively
low volume of transactions.
• Hence OLAP tools are based on multi-dimensional data models that views data in form
of data cube.
• The Queries needed for these systems are often very complex and involve aggregations
• Ex: in fig 3.4, we are looking at salesAmount data with one perspective called section.
• Similarly in fig 3.5 we are looking at salesAmount data with one perspective
called ProductCategoryName.
• And similarly in fig 3.6 we are looking at salesAmount data with one perspective
called YearQuarter.
2. Two dimensional data: The data that is viewed or plotted using two
• In the previous table 3.7: the salesAmount data is been plotted along two dimensions
called as yearQuarter and productCategoryName.
3. Three dimensional data: the data that is viewed or plotted using two perspectives.
• Consider the fig 3.8: here the data is plotted in three perspectives called as
productCategoryName, section and YearQuarter.
• Hence an analyst can now easily look for the section which recorded
maximum accessories sales in Q2.
• It is also possible to go beyond 3rd dimension depending on what kind of data is stored
and what kind if quesries is required from OLAP systems.
•
Queries that an OLAP system can process:
Considering the fig 3.8 that plots 3-D data by productCategoryName, section and
YearQuarter, the OLAP can answer following questions:
• What will be the future sales trend for accessories in Kid’s section?
• Given customers buying pattern, will it be profitable to launch product XYZ in the
Kids section?
• Consistency of information
• “what if “ analysis
• Single platform for all information and business needs like planning,
budgeting, forecasting, reporting and analysis
1) MOLAP
•
Advantages of MOLAP:
• It can perform complex calculations which are pre-generated when the cube is created.
Disadvantages of MOLAP;
• It can handle limited amount of data and large amount of data cannot be included in
the cube.
2) ROLAP:
• Usually includes adding a “WHERE” clause in SQL statements to implement the slice
and dice operations in OLAP.
Advantages:
Disadvantages
• It makes use of greater scalability feature of ROLAP and faster performance &
summary type information feature of MOLAP.
Disadvantage:
• OLTP helps in execution and storage of day to day transaction in alignment with business
strategy.
• This data is used for analytics , finding patterns & trends and decision making
which brings efficiency in operations of an organization.
DATA MODEL FOR OLTP:
• Dimension:
b) Each dimension will have a table associated with it called as the dimension
table.
• Facts:
1) Star Model:
• This has a central fact table which is connected to dimension tables surrounding it.
• Each dimension is represented by only one table and each table has a set of attributes.
2) Snowflake model:
• This has a central fact table which is connected to dimension tables surrounding it.
• The OLAP system then extracts information from data warehouse and stores it in a
multidimensional database using ROLAP or MOLAP which inturn stores the data in
a cube.
• Now the users can use query & reporting tools, analysis & data mining tools over
this data
• Now the OLAP can produce roll-up reports, drill down reports, drill-through reports,
aggregations, summaries, pivot tables on varied views of data.
SHOULD OLAP BE PERFORMED DIRECTLY ON OPERATIONAL
DATABASES
• OLTP systems are designed to query operational databases and OLAP need a
data warehouse built where data is integrated from various sources..
• OLTP systems support locking and logging. OLAP requires read only access for
aggregation and summarization. Hence applying locking and logging to OLAP
will impact the throughput of OLAP system.
• In the past, business executives used numerical information to support their decisions.
The IT applications that provided such numerical information were called Analytical
applications.
• Later BI provided a set of concepts and processes that allowed business executives
to take informed decisions.
BI made decision making faster, reliable, consistent and highly team oriented
• Informed decisions are based on facts and facts alone and not on gut feeling. Hence
the chances of it being correct are more often.
• When similar facts are presented to large set of decisions makers it is likely that they
arrive at same conclusion.
1) Market research :
• This helps in better understanding of the marketplace in which the business is operating.
• It includes understanding the customers, competitors, products, changing
market dynamics etc.
2) Statistical data
• ex: variance in production rate, correlation of sales with campaigns, cluster analysis
of shopping patterns etc.
• This helps decision makes to see new opportunities or innovate products and services
3) Management reporting:
• The IT teams within the organizations prepare ad hoc reports by using specialized tools.
4) Market survey:
• Definition:
• Features of BI :
• The same piece of data is available at more than one place that agrees wholly and in
every respect.
• BI enables looking at business from various perspectives which will help each person in
the project/program team to look at data from his role to find attributes that help in
decision making.
BI provides same facts to the decision makers/ stakeholders / executives who work on
common project/ business goals/ purpose and who are spread across geographic
locations.
• BI helps making long term decisions that affect the entire organization.
• ex: for goodfood restaurant chain it answers questions like where could be the next 5
restaurants?
2) Tactical level :
• ex: What are the right months to redeem customer loyalty points?
3) Operational level:
• Ex: what menu item needs to be dropped this week to handle bad weather?
• MIS: management information was provided to decision makers by the IT team using
MIS (management information system).
• Too many versions of data cannot serve any new requirement and hence needs to
be discarded
• Until the report reaches the executives, their requirements might change leading to
dissatisfaction of services.
Solutions to above challenges were :
• These reporting tools combine data from multiple sources, store metadata and report
specifications for faster re-runs and deliver reports in multiple formats like pdf, doc
or xls.
ERP BI
• Data warehouse stores the data and BI converts this data into meaningful information
• The volume, amount and velocity of data is growing in leaps and bounds.
Hence managing this data is a challenge.
• Unstructured data like emails, text msgs, memos need to blend in order to support
better decision making.
• ex: adding comments /suggestions from customers into BI applications to help market
segment analysis
• The statistical analysis capabilities of BI allows to dig deep into current and past data.
And answers questions like why this is happening? Why the customers prefer a particular
brand over another?
• BI also helps in forecasting and predictive modeling and answers questions like what if
the trend continues? What is likely to happen next? What will be in demand?
• Before storing the data is also converted into a unified format supported by the
data warehouse. This is called transformation and then the data/information is
delivered.
analytics,