Business Analytics Book
• Helps in forecasting
Features of BI
• Ranking reports
• Executive dashboards
• Open integration
• Operational reports
• Interactive reports
• Geospatial mapping
Framework of BI
BI Components
• Data warehouse
• Business Analytics
• Business Performance Management
• User Interface
Data Warehousing
• A data warehouse (DW) is an organized collection of integrated, subject-oriented databases designed for
decision support functions
• DW is organized at the right level of granularity to provide clean enterprise-wide data in a standardized format for
reports, queries, and analysis
• DW is physically and functionally separate from operational and transactional databases
• It can facilitate distributed access to up-to-date business knowledge for departments and functions, thus improving
business efficiency and customer service
• DW enables a consolidated view of corporate data, all cleaned and organized.
• DW thus provides better and more timely information. It simplifies data access and allows end users to perform extensive
analysis.
Business Analytics
• Business analytics (BA) refers to the skills, technologies, and practices for continuous, iterative
exploration and investigation of past business performance to gain insight and drive business
planning
• Business analytics focuses on developing new insights and understanding of business performance
based on data and statistical methods
• Business analytics makes extensive use of analytical modeling and numerical analysis, including
explanatory and predictive modeling, and fact-based management to drive decision making
• Business analytics can answer questions such as: why is this happening? What if these trends continue?
What will happen next (predict)? What is the best outcome that can happen (optimize)?
• BI components include – Reporting and queries; Advanced Analytics; Data, Text and Web mining
Business Performance Management
• Business Performance Management (BPM), also known as Corporate Performance Management (CPM)
or Enterprise Performance Management, is geared toward optimizing overall business performance and
achieving business goals
• It enables an organization to better manage its business performance with the aid of reports,
analytics, Key Performance Indicators, etc. that help it measure and monitor the efficiency and success of its
business activities
• The main aim of BPM is to optimise the overall performance of an organisation
• BPM includes the following processes – Budgeting, Planning & Forecasting; Business Modeling; Scorecard;
Dashboarding; Financial, statutory & management reporting; Risk management; Predictive analysis; Internal
Controls
User Interface
• Dashboards
• Visualisation Tools
• Improved association
• Ease of Reporting
Disadvantages of BI
• Loss of historical data
• Difficulties in implementation
• Time-consuming
Types & Techniques of BA
• Data Mining
• Data Visualisation
• Predictive Analytics
• Text Mining
• Forecasting
Descriptive Analytics
• Descriptive analytics is perhaps the most basic kind of analytics, yet it remains the most important and widely used
• It uncovers the facts about a business by analyzing historical data, and a great deal of factual information
is revealed in this form of analytics
• This is where the grouping of data, descriptive statistics, and a number of visualization techniques
come in handy
• For example, finding the frequency, mean, median, mode, maximum, and minimum values of a subject in
different scenarios helps uncover a lot of information
• This allows the leadership to understand what has happened until now and gives a brief glimpse of what
could happen next.
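The descriptive measures listed above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library. The `daily_orders` figures and the `describe` helper are hypothetical and are not taken from the source.

```python
import statistics
from collections import Counter

# Hypothetical daily order counts for one product (illustrative data only).
daily_orders = [12, 15, 12, 18, 20, 12, 15, 22, 19, 15, 12]

def describe(values):
    """Return the basic descriptive statistics named in the bullets above."""
    return {
        "frequency": Counter(values),         # how often each value occurs
        "mean": statistics.mean(values),      # arithmetic average
        "median": statistics.median(values),  # middle value when sorted
        "mode": statistics.mode(values),      # most frequent value
        "maximum": max(values),
        "minimum": min(values),
    }

summary = describe(daily_orders)
print(summary)
```

Running the same helper on different groups of data (for example, one list per region) gives the "different scenarios" comparison mentioned above.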
Diagnostic Analytics
• This form of analytics deals with finding the reasons for whatever has happened in the
business so far
• Methodologies such as segmentation come in handy here: patterns are detected in the
data to give better insight into the situation the company is in
• For example, running analytics on a company's customer base can identify the
different types of customers the company has been dealing with, so that it can target the specific kind
of customers that might have been holding back the company's growth.
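As a rough sketch of the customer example above, the snippet below groups customers by segment and looks for the segment with the weakest revenue trend. It assumes the segments have already been assigned. The records, field layout, and the `average_change_by_segment` helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical records: (customer_id, segment, yearly revenue change in %).
customers = [
    ("C1", "retail", 8.0),
    ("C2", "retail", 6.0),
    ("C3", "wholesale", -4.0),
    ("C4", "wholesale", -6.0),
    ("C5", "online", 12.0),
]

def average_change_by_segment(records):
    """Group customers by segment and average their revenue change."""
    changes = defaultdict(list)
    for _customer_id, segment, change in records:
        changes[segment].append(change)
    return {segment: sum(vals) / len(vals) for segment, vals in changes.items()}

segment_change = average_change_by_segment(customers)
# The segment with the lowest average change is a candidate explanation
# for slow growth and a starting point for further diagnosis.
weakest_segment = min(segment_change, key=segment_change.get)
print(segment_change, weakest_segment)
```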
Predictive Analytics
• This is the branch of analytics that deals with the future.
• Here, again based on historical data, a range of sophisticated statistical and machine learning methodologies is used to
understand what can happen in the future given certain conditions or the pace at which the current scenario is moving
• This is done by identifying patterns in the data, figuring out the important drivers and features, and finding their relation to the
objective that we are trying to predict
• Time is not a driver in any of these methods; when time is involved, a particular kind of predictive analytics
known as forecasting is performed
• Forecasting refers to predicting a value over a fixed period of time where time also acts as a driver, i.e. it plays a role in deciding what the
predicted value is going to be in the output.
• Sometimes a very specific type of prediction is also performed, such as Text Mining, where predictions are drawn from text to create products that
can aid business operations and help increase profits
• In Predictive Analytics, advanced Machine Learning and Deep Learning algorithms are developed, and sometimes statistical models are
also created
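To make the idea of relating a driver to an objective concrete, here is a minimal sketch of a one-variable least-squares model in plain Python. The data (advertising spend as the driver, sales as the objective) and the `fit_line` helper are illustrative assumptions. Real projects would normally use a statistics or machine learning library.

```python
def fit_line(xs, ys):
    """Ordinary least squares with a single driver: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cross_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sq_dev = sum((x - mean_x) ** 2 for x in xs)
    slope = cross_dev / sq_dev
    return slope, mean_y - slope * mean_x

# Hypothetical historical data: advertising spend (driver) and sales (objective).
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [12.0, 14.0, 16.0, 18.0, 20.0]

slope, intercept = fit_line(ad_spend, sales)
# Predict sales for a spend level that has not been observed yet.
predicted_sales = slope * 6.0 + intercept
print(slope, intercept, predicted_sales)
```

If the period number were passed as the driver instead of advertising spend, the same fit would act as a very naive trend forecast, which is the case where time itself plays a role in the prediction.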
Prescriptive Analytics
• This is the most advanced form of analytics: here we not only try to predict but also try to find the course of action
that is best suited to reach the objective
• While predictive analytics tells us what will happen, prescriptive analytics tells us
how to avoid the predicted outcome (in case it is not in the interests of the company)
• Different strategies are devised and put to use to check the different outcomes. This is where
optimization and simulation methodologies are used. Compared to the previously mentioned forms
of analytics, this is a new and still developing form
• Advanced Machine Learning and Deep Learning methodologies are often used in this type of analytics, allowing us to
create different scenarios and find the best course of action.
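A small sketch of the "try different strategies and compare outcomes" idea: each candidate price is a scenario, a predictive model estimates its outcome, and the best one is chosen. The linear demand model, the unit cost, and the price grid are all hypothetical assumptions made for illustration.

```python
def predicted_demand(price):
    """Hypothetical predictive model: units sold fall linearly as price rises."""
    return max(0.0, 1000 - 40 * price)

def profit(price, unit_cost=5.0):
    """Outcome of one scenario: margin per unit times predicted units sold."""
    return (price - unit_cost) * predicted_demand(price)

# Candidate strategies: prices from 5.0 to 25.0 in steps of 0.5.
candidate_prices = [p / 2 for p in range(10, 51)]

# Evaluate every scenario and keep the one with the highest predicted profit.
best_price = max(candidate_prices, key=profit)
best_profit = profit(best_price)
print(best_price, best_profit)
```

An exhaustive search over a small grid is the simplest possible optimization method. Larger problems would call for a dedicated optimization or simulation technique.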
Important BA Tools
• SQL
– It is among the most important tools, as SQL queries allow the user to easily filter and create subsets of an otherwise large
dataset
– With only the relevant data in hand, the analyst can quickly start cleaning the data and then building
models from it
• BIRT
– Another useful report-based tool that allows us to create graphs and dashboards; however, it is more complex than Tableau, as the
user needs a decent knowledge of Java to make the most of it.
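The SQL filtering step described above can be tried out with Python's built-in `sqlite3` module. This is a minimal sketch: the `orders` table, its columns, and the rows are hypothetical, and an in-memory database stands in for a real data source.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (region, amount) VALUES (?, ?)",
    [("North", 120.0), ("South", 80.0), ("North", 300.0), ("East", 45.0)],
)

# Filter the full table down to the subset the analyst actually needs:
# orders from one region above a minimum amount.
north_large = conn.execute(
    "SELECT order_id, amount FROM orders "
    "WHERE region = ? AND amount > ? ORDER BY amount",
    ("North", 100.0),
).fetchall()
conn.close()
print(north_large)
```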
• Python
– One of the most advanced tools, Python allows the user to perform a wide range of tasks
– Python can be used for everything from basic steps such as data cleaning to complex aspects of analytics such as the development of various
kinds of models.
– It is particularly effective for developing highly complex machine learning and deep learning models. Python also
allows us to create reports and has libraries for visualization, but it is up to the user whether to use them or dedicated visualization tools
• R
– This statistical tool, created “by the statisticians for the statisticians”, allows a business analyst to perform all kinds of descriptive and inferential
statistics along with the development of statistical models
– Compared to Python it has a somewhat steep learning curve, but this eventually pays off as it has a large community of users and is respected
in the corporate world as well as academia
• MS Excel
– One of the most basic yet most widely used and effective tools
– The importance of MS Excel in the field of Business Analytics can be understood from the difference between a sword and a
needle: a needle can do jobs for which a sword is of no use
• SPSS Modeler(Clementine)
– A data mining tool by SPSS Inc. (IBM)
– Has an intuitive GUI, and its point-and-click modelling capabilities are very comprehensive
• KXEN
– One of the few tools that drive automated analytics
– Can work with very large amounts of data
– Its drawback is that the results are complex to understand
• WEKA
– Waikato Environment for Knowledge Analysis (WEKA) is a popular machine learning software package
– It is written in Java and is open-source software
– It contains a GUI for interacting with data files and produces visual results and graphs
Advantages of BA
• Improving the decision making process
• Improving competitiveness
Data Science
• Data science is the art & science of acquiring knowledge through data
• It involves principles, processes and techniques for understanding phenomena via the analysis of
data
• Components of Data Science – Organising, Packaging & Delivering the data
• Advantages
– Helps management with better & faster decisions
– Empowers the decision makers with solid data and outlines a path to achieve business goals
– Can anticipate new challenges & opportunities through the power of data
Fast Analysis
• The dimension table is joined to the fact table using a foreign key
• The Star schema is easy to understand and provides optimal disk usage.
• OLAP provides the building blocks for business modeling tools, Data mining tools, performance reporting
tools
• Allows users to slice and dice cube data by various dimensions, measures, and filters
• It is good for analyzing time series
• Finding clusters and outliers is easy with OLAP
• It is a powerful online analytical processing system with visualization capabilities and fast response times
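The star schema join and the slicing idea from the bullets above can be imitated with plain SQL through Python's `sqlite3` module. This is only a relational sketch, not a real OLAP engine. The `fact_sales` and `dim_product` tables and their contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE fact_sales (
    sale_id    INTEGER PRIMARY KEY,
    product_id INTEGER REFERENCES dim_product(product_id),  -- foreign key to the dimension
    year       INTEGER,
    revenue    REAL
);
INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Toys');
INSERT INTO fact_sales (product_id, year, revenue) VALUES
    (1, 2022, 100.0), (1, 2023, 150.0), (2, 2022, 70.0), (2, 2023, 90.0);
""")

# Join the fact table to its dimension and aggregate the revenue measure
# by two dimensions (category and year).
by_category_year = conn.execute("""
    SELECT d.category, f.year, SUM(f.revenue)
    FROM fact_sales AS f
    JOIN dim_product AS d ON d.product_id = f.product_id
    GROUP BY d.category, f.year
    ORDER BY d.category, f.year
""").fetchall()

# Slice: fix one dimension (year = 2023) and look at the remaining one.
slice_2023 = conn.execute("""
    SELECT d.category, SUM(f.revenue)
    FROM fact_sales AS f
    JOIN dim_product AS d ON d.product_id = f.product_id
    WHERE f.year = 2023
    GROUP BY d.category
    ORDER BY d.category
""").fetchall()
conn.close()
print(by_category_year, slice_2023)
```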
Disadvantages of OLAP
• OLAP requires organizing data into a star or snowflake schema.
These schemas are complicated to implement and administer
• You cannot have a large number of dimensions in a single OLAP cube
• Transactional data cannot be accessed with an OLAP system.
• Any modification to an OLAP cube requires a full update of the cube.
This is a time-consuming process
Online Transaction Processing (OLTP)
• OLTP, or online transaction processing, enables the real-time execution of large numbers of
database transactions by large numbers of people, typically over the internet
• OLTP focuses on query processing and on maintaining data integrity in multi-access
environments; its effectiveness is measured by the number of transactions per second
• In OLTP, the emphasis is on fast processing, because OLTP databases are read, written, and updated
frequently
• If a transaction fails, built-in system logic ensures data integrity
• Classic examples of OLTP systems are order entry, retail sales, and financial transaction systems
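The point that a failed transaction must not damage data integrity can be illustrated with a small money-transfer sketch using Python's `sqlite3` module. The `accounts` table, the `CHECK` constraint, and the `transfer` helper are hypothetical. Production OLTP systems use the same idea at far larger scale.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts (name TEXT PRIMARY KEY, balance REAL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100.0), ("bob", 50.0)])
conn.commit()

def transfer(conn, sender, receiver, amount):
    """Apply both updates as one transaction; undo both if either fails."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, receiver),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, sender),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint rejected a negative balance, so nothing was applied.
        return False

ok = transfer(conn, "alice", "bob", 30.0)       # valid transfer
failed = transfer(conn, "alice", "bob", 500.0)  # would overdraw alice
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, failed, balances)
```

The second transfer credits the receiver before the debit fails, yet the rollback leaves both balances exactly as they were after the first transfer.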
OLTP Characteristics
• Short response time
• Small transactions
• High concurrency
• High availability
• Every table will have a specific data field (key) that uniquely identifies
each record (or row) in that table
• Each master table or database relation should have programs to create, read,
update, and delete the records
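A minimal sketch of the two points above, a key that uniquely identifies each row and a set of create, read, update, and delete routines, using Python's `sqlite3` module. The `customers` table and the four helper functions are hypothetical names chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# customer_id is the key that uniquely identifies each row.
conn.execute(
    "CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)

def create_customer(conn, name):
    cur = conn.execute("INSERT INTO customers (name) VALUES (?)", (name,))
    conn.commit()
    return cur.lastrowid  # key assigned to the new row

def read_customer(conn, customer_id):
    return conn.execute(
        "SELECT customer_id, name FROM customers WHERE customer_id = ?",
        (customer_id,),
    ).fetchone()

def update_customer(conn, customer_id, name):
    conn.execute(
        "UPDATE customers SET name = ? WHERE customer_id = ?", (name, customer_id)
    )
    conn.commit()

def delete_customer(conn, customer_id):
    conn.execute("DELETE FROM customers WHERE customer_id = ?", (customer_id,))
    conn.commit()

new_id = create_customer(conn, "Asha")
created = read_customer(conn, new_id)
update_customer(conn, new_id, "Asha K")
updated = read_customer(conn, new_id)
delete_customer(conn, new_id)
deleted = read_customer(conn, new_id)  # None once the row is gone
print(created, updated, deleted)
```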
Advantages of OLTP
• OLTP offers accurate forecasts of revenue and expenses.
• It provides a solid foundation for a stable business or organization through the timely updating of all transactions.
• OLTP makes transactions much easier for customers.
• It broadens the client base of an organization by speeding up and simplifying individual processes.
• OLTP provides support for bigger databases.
• OLTP suits tasks that the system performs frequently and that need only a small number of
records
• It is used when consistency and concurrency are needed to perform tasks with high availability.
• It is designed typically for use by clerks, cashiers, etc.
• It allows its users to read, write, and delete data quickly and efficiently
Disadvantages of OLTP
• If the OLTP system faces hardware failures, online transactions are severely affected
• OLTP systems allow multiple users to access and change the same data at the same time, which
can often create unexpected situations
• If the server hangs for even a few seconds, a large number of transactions can be affected
• OLTP requires a lot of staff working in groups to maintain the inventory
• OLTP makes the database much more susceptible to hackers and intruders
• Server failure may wipe out large amounts of data from the database