0% found this document useful (0 votes)

33 views30 pages

Descriptive Analytics in Business Decisions

Business analytics uses math, statistics, and machine learning to find patterns in data. It can be categorized into descriptive, predictive, and prescriptive analytics. Exploratory data analysis (EDA) explores data through visualization and summary statistics to discover patterns and relationships. Correlation analysis measures the strength of relationships between variables. Regression analysis predicts outcomes, while machine learning allows systems to automatically learn from data without being explicitly programmed. The main challenges of machine learning include insufficient or poor quality data.

Uploaded by

Prathamesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views30 pages

Descriptive Analytics in Business Decisions

Uploaded by

Prathamesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Introduction to

Business Analytics
and Machine Learning

TUTORIAL 3
Definition
• Analytics is a field of computer science that uses math, statistics, and machine
learning to find meaningful patterns in data.
• Business Analytics- When analytics is applied to make business decisions
• Analytics may be considered as a three-step process-

1. Descriptive Analytics- (What happened?) It involves looking at the history of business

activities to get a fair idea of how a business performed in the past. (Technique- EDA)

2. Predictive Analytics (forecasting)- (What is likely to happen in the future?) Here the
facts or information from the past are leveraged to understand the future course the
business may assume. (Techniques- Regression, Decision Tree, Machine Learning, etc.)

3. Prescriptive Analytics- (What action should we take?) Based on the findings of

descriptive and predictive analytics, it determines the best course of action in a scenario
(Techniques- Optimization algorithms {linear/non-linear programming, genetic
algorithms, etc.}, simulations, game theory, etc.)
Exploratory Data Analysis (EDA)
• Analysts need first to explore the data for potential research questions
before jumping into confirming the answers with hypothesis testing and
inferential statistics.

• Observations & Variables

We call them variables

because their values may
vary across
observations
Further reference: Sarah Boslaugh’s Statistics in a Nutshell, 2nd edition (O’Reilly)

EDA contd…
Types of variables

It can take only a

It can only take two levels Variable with more It takes more than two It can in theory take an
fixed number of
(e.g. Married? (yes or no) than two levels is a levels, where there is an infinite number of values
countable values
• Made purchase? (yes or no) nominal variable intrinsic ordering between any two values
between any two
• Wine type? (red or white)) (e.g. Favorite color between these levels (e.g. Height (within a
values. (e.g. units
(orange, blue, burnt (e.g. Beverage size range of 59 and 75
sold 10, 12, 13, …)
sienna, and so forth)) (small, medium, large)) inches, 59.3, 60.2, …)
EDA contd…
• Interval vs. Ratio Type?

• Exploratory Data Analysis refers to the critical process of performing

initial investigations on data so as to discover patterns, to spot anomalies, to
test hypothesis and to check assumptions with the help of summary
statistics and graphical representations
EDA contd…
Data Visualization
• Univariate visualization: Only one variable is visualized graphically (e.g. bar
charts, pie charts, histogram, etc.)

• Bivariate visualization: Each point is placed according to its value on two

attributes (e.g. scatterplot)

• Multivariate visualization: More than two variables are visualized

simultaneously
EDA contd…
Data Visualization

• Tools- Tableau, Python, Qlik View, SAS Visual Analytics, Power Bi, R, etc.
EDA contd…
Correlation
• Correlation Analysis is
statistical method that
is used to discover if
there is a relationship
between two
variables/datasets, and
how strong that
relationship may be.
EDA contd…
Methods to find Correlation Coefficient
• Pearson Coefficient (generally, useful for linear relationship between two
continuous variables)

• Spearman's Rank Coefficient (generally, useful for ordinal or non-normally

distributed data)

• Kendall's Rank Coefficient (generally, appropriate for ordinal or non-

normally distributed data)

“Correlation Does Not Imply Causation”

Regression for Predictive Model
Building
• Businesses want to take faster and better decisions compared to their
competitors. So they would like to get a fairly good idea regarding what is
expected to happen in the future.

Simple Linear Regression

• The estimated impact of a unit change of the independent variable X on the
dependent variable Y.
• The equation for linear regression
Y = 𝛽0 + 𝛽1 𝑋 + 𝜖
• H0: There is no linear influence of our independent variable on our
dependent variable.
• Ha: There is a linear influence of our independent variable on our dependent
variable.
What is Machine Learning?
• Machine Learning is the science (and art) of programming
computers so they can learn from data.
• For example, a bank might deploy machine learning to
detect whether a customer will default on a loan. As more
data is fed in, the algorithm may find patterns and
relationships in the data and use them to better predict
the likelihood of a default.
Why Use Machine Learning?
• Consider how you would write a spam filter using traditional programming
techniques?

1. First you would look at what spam typically looks like. You might notice
that some words or phrases (such as “4U,” “credit card,” “free,” and
“amazing”) tend to come up a lot in the subject.

2. You would write a detection algorithm for each of the patterns that you
noticed, and your program would flag emails as spam if a number of these
patterns are detected.

3. You would test your program, and repeat steps 1 and 2 until it is good
enough
Traditional approach
Machine Learning approach
Problem with Traditional approach
• If spammers notice that all their emails containing “4U” are
blocked, they might start writing “For U” instead. A spam
filter using traditional programming techniques would need to
be updated to flag “For U” emails. If spammers keep working
around your spam filter, you will need to keep writing new
rules forever.
Automatically adapting to change
Types of Machine Learning Systems
• Broadly classifying:
1. Supervised learning
 In supervised learning, the training data you feed to the algorithm includes the
desired solutions, called labels
 The spam filter is a good example of this: it is trained with many example emails
along with their class (spam or ham), and it must learn how to classify new emails.
Types of Machine Learning Systems
• Supervised learning deals with two distinct kinds of problems:
 Classification problems
 Classification problems are often resolved using algorithms such as Naïve Bayes,
Support Vector Machines, Random Forest, Logistic Regression (It is used to
calculate or predict the probability of a binary (yes/no) event occurring), etc.

 Regression problems
 linear regression, non-linear regression, Bayesian linear regression, etc.

• Recommender systems are a notable example of supervised learning. E-

commerce companies such as Amazon, streaming sites like Netflix, and
social media platforms such as TikTok, Instagram, and even YouTube
among many others make use of recommender systems to make appropriate
recommendations to their target audience.
Types of Machine Learning Systems
2. Unsupervised Learning
 In unsupervised learning, as you might guess, the training data is unlabeled
 The main task of unsupervised learning is to find patterns in the data.
Types of Machine Learning Systems
• Some of the most important unsupervised learning algorithms:

• Clustering
 k-Means
 Hierarchical Cluster Analysis (HCA)
 Expectation Maximization

• Visualization and dimensionality reduction

 Principal Component Analysis (PCA)
 Kernel PCA
Types of Machine Learning Systems
3. Reinforcement Learning

• The learning system (agent), can observe the environment, select and
perform actions, and get rewards in return (or penalties in the form of
negative rewards

• It does not have a labelled dataset or results associated with data so the only
way to perform a given task is to learn from experience.

• For every correct action or decision of an algorithm, it is rewarded with

positive reinforcement whereas, for every incorrect action, it is rewarded
with negative reinforcement.
• Summary

• [Link]
Main Challenges of Machine
Learning
• Insufficient or poor-quality data

It should be noted,
however, that small- and
medium sized datasets
are still very common,
and it is not always easy
or cheap to get extra
training data, so don’t
abandon algorithms just
yet
Main Challenges of Machine
Learning (Contd.)
• Nonrepresentative Training Data
 In order to generalize well, it is crucial that your training data be representative of
the new cases you want to generalize to

 The set of countries we used earlier for training the linear model was not perfectly
representative; a few countries were missing
 It seems that very rich countries are not happier than moderately rich countries (in
fact they seem unhappier), and conversely some poor countries seem happier than
many rich countries.
Main Challenges of Machine
Learning (Contd.)
• Poor quality data (training data is full of errors, outliers, and noise)

• Overfitting the Training Data

 Say you are visiting a foreign country and the taxi driver rips you off. You might be
tempted to say that all taxi drivers in that country are thieves (overgeneralization)
 In Machine Learning this is called overfitting

Why did it happen?

• Training set is noisy, or if it is too small
(which introduces sampling noise), then
the model is likely to detect patterns in
the noise itself
Main Challenges of Machine
Learning (Contd.)
• Overfitting happens when the model is too complex relative to the amount
and noisiness of the training data. The possible solutions are:

 To simplify the model by selecting one with fewer parameters (e.g., a linear model
rather than a high-degree polynomial model), by reducing the number of attributes
in the training data or by constraining the model
 To gather more training data
 To reduce the noise in the training data (e.g., fix data errors and remove outliers)
Main Challenges of Machine
Learning (Contd.)
• Underfitting the Training Data
 A linear model of life satisfaction is prone to underfit; reality is just more complex
than the model
 Selecting a more powerful model, with more parameters
 Feeding better features to the learning algorithm (feature engineering)
 Reducing the constraints on the model

• Ethical considerations and bias

• Interpretability and explainability of models

• Selection of appropriate algorithms

Aiya Session 4
No ratings yet
Aiya Session 4
42 pages
Machine Learning With R and Python
No ratings yet
Machine Learning With R and Python
290 pages
Module1 Introduction
No ratings yet
Module1 Introduction
35 pages
Introduction to Predictive Analytics
No ratings yet
Introduction to Predictive Analytics
30 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
39 pages
Anintroductiontomachinelearning: Michaelclark Centerforsocialresearch Universityofnotredame
No ratings yet
Anintroductiontomachinelearning: Michaelclark Centerforsocialresearch Universityofnotredame
43 pages
Which ML Algo Should I Use SAS
No ratings yet
Which ML Algo Should I Use SAS
20 pages
Intro To ML
No ratings yet
Intro To ML
26 pages
ML Unit 1
No ratings yet
ML Unit 1
74 pages
Unit 1-1
No ratings yet
Unit 1-1
32 pages
01 - ML - Introduction
No ratings yet
01 - ML - Introduction
65 pages
Unit 1
100% (1)
Unit 1
13 pages
Essentials of Machine Learning Algorithms
No ratings yet
Essentials of Machine Learning Algorithms
15 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
8 pages
A Preliminary Idea On Machine Learning
No ratings yet
A Preliminary Idea On Machine Learning
40 pages
2 Machine Learning Algorithms For Business
No ratings yet
2 Machine Learning Algorithms For Business
33 pages
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
No ratings yet
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
13 pages
Chapter Four
No ratings yet
Chapter Four
75 pages
Chapter 4 - Machine Learning
No ratings yet
Chapter 4 - Machine Learning
81 pages
AIML
No ratings yet
AIML
30 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
INTRODUCTION
No ratings yet
INTRODUCTION
51 pages
MLT Study
No ratings yet
MLT Study
22 pages
Lecture 1
No ratings yet
Lecture 1
19 pages
Data Analyst Interview Questionaries
No ratings yet
Data Analyst Interview Questionaries
16 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
6 pages
Introduction To Basics of Machine Learning Algorithms: Pankaj Oli
100% (1)
Introduction To Basics of Machine Learning Algorithms: Pankaj Oli
13 pages
Types of Machine Learning Algorithms
No ratings yet
Types of Machine Learning Algorithms
14 pages
Data Science Unit-4 B.sc. III Sem. MDC
No ratings yet
Data Science Unit-4 B.sc. III Sem. MDC
6 pages
1 - Intro To Machine Learning
No ratings yet
1 - Intro To Machine Learning
34 pages
Machine Learning
No ratings yet
Machine Learning
48 pages
Machine Learning Overview Guide
No ratings yet
Machine Learning Overview Guide
68 pages
Machine Learning Algorithms Guide
No ratings yet
Machine Learning Algorithms Guide
10 pages
Machine Learning: A Report Submitted in Partial Fulfillment of The Requirement For The Award of The Degree of
No ratings yet
Machine Learning: A Report Submitted in Partial Fulfillment of The Requirement For The Award of The Degree of
39 pages
Data Science & ML Course Guide
No ratings yet
Data Science & ML Course Guide
83 pages
MUST Research - AI For All
No ratings yet
MUST Research - AI For All
32 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
89 pages
Capture D'écran . 2025-02-13 À 17.59.41
No ratings yet
Capture D'écran . 2025-02-13 À 17.59.41
69 pages
Notes On Data Science and Machine Learning
No ratings yet
Notes On Data Science and Machine Learning
53 pages
DAC ML Tutorial Final Deck
No ratings yet
DAC ML Tutorial Final Deck
150 pages
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
50% (2)
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
27 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
15 pages
Classification vs Regression in ML
No ratings yet
Classification vs Regression in ML
15 pages
Machine Learning Tasks and Models Explained
No ratings yet
Machine Learning Tasks and Models Explained
22 pages
Topic 1
No ratings yet
Topic 1
39 pages
Data Analysis Chap 3
No ratings yet
Data Analysis Chap 3
21 pages
Intro To Machine Learning
No ratings yet
Intro To Machine Learning
22 pages
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
100% (4)
Machine Learning Basics: An Illustrated Guide For Non-Technical Readers
27 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
Intro to Machine Learning Algorithms
No ratings yet
Intro to Machine Learning Algorithms
72 pages
4-1 - Machine Learning - Intro-Classification
100% (1)
4-1 - Machine Learning - Intro-Classification
63 pages
Module 3 Data Science Machine Learning
No ratings yet
Module 3 Data Science Machine Learning
53 pages
Ai Word Document Session 2 Detailed Exaple
No ratings yet
Ai Word Document Session 2 Detailed Exaple
15 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
Unit 3 ML
No ratings yet
Unit 3 ML
119 pages
3309ENG Engineering Electromagnetics Assignment 2 Design of Power Transmission Line
No ratings yet
3309ENG Engineering Electromagnetics Assignment 2 Design of Power Transmission Line
5 pages
A Level Maths Practice Test
No ratings yet
A Level Maths Practice Test
8 pages
Properties of Surfaces and Solids
No ratings yet
Properties of Surfaces and Solids
14 pages
ACIO-II Exam Application 2017
No ratings yet
ACIO-II Exam Application 2017
1 page
AAO Maths Solved 2024
No ratings yet
AAO Maths Solved 2024
38 pages
B.Sc Physics Admission at Scottish Church
No ratings yet
B.Sc Physics Admission at Scottish Church
3 pages
Java Data Types MCQ Practice
No ratings yet
Java Data Types MCQ Practice
6 pages
Mechanical Behavior of 316L Stainless Steel After
No ratings yet
Mechanical Behavior of 316L Stainless Steel After
6 pages
SOLIDWORKS XDesign Lesson Sketching, Constraints, and Dimensions
No ratings yet
SOLIDWORKS XDesign Lesson Sketching, Constraints, and Dimensions
16 pages
Math Enthusiasts: Circle Squaring
No ratings yet
Math Enthusiasts: Circle Squaring
2 pages
(Group 4) Informal Activities, Cooperative Learning, Teacher-Directed Activities
No ratings yet
(Group 4) Informal Activities, Cooperative Learning, Teacher-Directed Activities
6 pages
Ar 319B - Development Controls
No ratings yet
Ar 319B - Development Controls
35 pages
Reaosning (Eng) SSC CGL 2024 T-2 RBE Compressed
No ratings yet
Reaosning (Eng) SSC CGL 2024 T-2 RBE Compressed
14 pages
A History of Mathematical Impossibility - JESPER LÜTZEN
100% (2)
A History of Mathematical Impossibility - JESPER LÜTZEN
415 pages
Gauss's Law and Electric Flux
No ratings yet
Gauss's Law and Electric Flux
56 pages
Syllabus High Voltage
No ratings yet
Syllabus High Voltage
37 pages
Making Use of External Corrosion Defect Assessment (ECDA) Data To Predict DCVG %IR Drop and Coating Defect Area
No ratings yet
Making Use of External Corrosion Defect Assessment (ECDA) Data To Predict DCVG %IR Drop and Coating Defect Area
20 pages
Understanding Discrete Cosine Transform
No ratings yet
Understanding Discrete Cosine Transform
8 pages
Module 3
No ratings yet
Module 3
9 pages
Syllabus For Jee Mains
No ratings yet
Syllabus For Jee Mains
11 pages
Chapter 7 Statistics
No ratings yet
Chapter 7 Statistics
14 pages
Data Structures Viva Questions
100% (1)
Data Structures Viva Questions
8 pages
Skip To Main ContentAccessibility Help
No ratings yet
Skip To Main ContentAccessibility Help
8 pages
Math Students' Matrix Quiz
No ratings yet
Math Students' Matrix Quiz
8 pages
DM Unit 6
No ratings yet
DM Unit 6
39 pages
UNIT 13.1 - The Probability Scale
No ratings yet
UNIT 13.1 - The Probability Scale
37 pages
G5 Helical Spring Sample Problem Lecture and Solution
No ratings yet
G5 Helical Spring Sample Problem Lecture and Solution
24 pages
QCD 5
No ratings yet
QCD 5
39 pages
NT II Unit III Notes
No ratings yet
NT II Unit III Notes
57 pages
Form Two Test 1
No ratings yet
Form Two Test 1
2 pages

Descriptive Analytics in Business Decisions

Uploaded by

Descriptive Analytics in Business Decisions

Uploaded by

Introduction to

1. Descriptive Analytics- (What happened?) It involves looking at the history of business

3. Prescriptive Analytics- (What action should we take?) Based on the findings of

• Observations & Variables

We call them variables

It can take only a

• Exploratory Data Analysis refers to the critical process of performing

• Bivariate visualization: Each point is placed according to its value on two

• Multivariate visualization: More than two variables are visualized

• Spearman's Rank Coefficient (generally, useful for ordinal or non-normally

• Kendall's Rank Coefficient (generally, appropriate for ordinal or non-

“Correlation Does Not Imply Causation”

Simple Linear Regression

• Recommender systems are a notable example of supervised learning. E-

• Visualization and dimensionality reduction

• For every correct action or decision of an algorithm, it is rewarded with

• Overfitting the Training Data

Why did it happen?

• Ethical considerations and bias

• Interpretability and explainability of models

• Selection of appropriate algorithms

You might also like