0% found this document useful (0 votes)

6 views7 pages

Business Analytics Notes

The document covers key concepts in business analytics, including mean, standard deviation, skewness, normality, t-tests, chi-square tests, and cluster analysis. It emphasizes the importance of these statistical measures and tests in understanding data distributions, making informed decisions, and improving business operations. Additionally, it highlights how data analytics can enhance customer insights, forecasting, operational efficiency, and competitive advantage.

Uploaded by

priya24laasya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

Business Analytics Notes

Uploaded by

priya24laasya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

BUSINESS ANALYTICS NOTES

3. Standard Deviation, Mean, Skewness

Mean

The mean is a fundamental measure of central tendency that provides a single value
representing the average of a dataset. It is calculated by summing all values and dividing by
the number of observations. The formula for mean is:

Mean=∑xn\text{Mean} = \frac{\sum x}{n}Mean=n∑x

where ∑x\sum x∑x is the sum of all data points, and nnn is the number of data points. The
mean gives an idea of the "typical" value in a dataset and is widely used in business analytics
to understand key metrics like average sales, customer spending, or production output.
However, it is sensitive to extreme values (outliers), which can distort its accuracy. For
example, if a company’s average employee salary is calculated including a very high CEO
salary, the result may not represent the typical employee’s salary.

Standard Deviation (SD)

Standard Deviation is a measure of dispersion or spread in a dataset. It indicates how much

individual data points deviate from the mean. A low standard deviation suggests that the
values are close to the mean, whereas a high standard deviation indicates that the values are
more spread out. The formula for standard deviation (for a sample) is:

PAGE NO. = 53

Standard deviation is critical in business to assess risk, consistency and performance. For
example, in stock market analysis, a high standard deviation means the stock is volatile and
risky, whereas a low SD means it is more stable.

Skewness
Skewness measures the asymmetry of the distribution of data. If the data is perfectly
symmetrical, the distribution is said to have zero skewness and is considered normal. If the
tail is longer on the right side, the distribution is positively skewed, and if the tail is longer
on the left, it is negatively skewed. The formula for sample skewness is:

PAGE NO. = 61, 64 & 65

Skewness helps in identifying whether the mean is a reliable measure of central tendency. For
example, in customer income data, if a few customers earn significantly more than the rest,
the data will show positive skewness. This impacts the choice of statistical techniques and
summarization methods.

4. Normality and Distribution of Data

What is Normality?

Normality refers to a condition where the dataset follows a normal distribution, also known
as the Gaussian distribution. A normal distribution is symmetric, bell-shaped, and centered
around the mean. In this distribution, the mean, median, and mode are all equal. It is widely
used in statistics due to its natural occurrence in many real-life phenomena such as employee
performance, product weight, or exam scores.

Properties of Normal Distribution

The normal distribution is defined by two parameters: mean (μ) and standard deviation (σ).
The shape of the curve is determined by these two. It has key properties:

 It is symmetric around the mean.

 About 68.26% of the data lies within ±1σ, 95.44% within ±2σ, and 99.73% within
±3σ.
 The total area under the curve is 1.
 It extends infinitely in both directions, though practically most data lies within ±3σ.

Why Normality Matters in Analytics

Many parametric tests such as t-tests, regression analysis, and ANOVA assume normality
of data. If the assumption of normality is violated, the results from these tests may not be
valid. For example, if you want to evaluate employee performance based on a training
program using a t-test, the data should ideally be normally distributed for accurate
interpretation.

How to Check for Normality

Normality can be visually assessed using:

 Histograms
 Box plots
 Q-Q (quantile-quantile) plots

Additionally, statistical tests like the Shapiro-Wilk Test and Kolmogorov-Smirnov Test are
used to test normality.

Non-Normal Distributions

If data is not normally distributed, it could be skewed, bimodal, or uniform. In such cases,
non-parametric tests such as the Mann-Whitney U test or Kruskal-Wallis test are more
appropriate. For example, income data often follows a positively skewed distribution, and
using a non-parametric approach would yield more reliable results.

6. T-Test

Definition and Purpose

The t-test is a parametric test used to compare the means of two groups and determine if
the differences are statistically significant. It is especially useful when the sample size is
small and population standard deviation is unknown.

Types of T-Tests

1. One-sample t-test: Compares the mean of a single group with a known or

hypothesized population mean.
Example: Is the average delivery time of a service different from 30 minutes?
2. Independent (two-sample) t-test: Compares the means of two independent groups.
Example: Compare average sales of two different branches.
3. Paired t-test: Used when the same group is measured twice (before and after a
treatment).
Example: Measure productivity of employees before and after training.

Formula for Independent t-test

Assumptions

 Data should be normally distributed

 Samples are independent
 Variance between groups should be equal (homogeneity of variance)

The t-test is widely used in business to compare employee performance, marketing

campaign results, or sales before and after a price change.

7. Chi-Square Test

Overview

The Chi-Square Test (χ²) is a non-parametric test used to examine the association
between categorical variables. It is used when data is in the form of frequencies or counts,
not continuous variables.

Types of Chi-Square Tests

1. Chi-square test of independence:

o Tests whether two categorical variables are independent.
o Example: Is customer satisfaction independent of geographic location?
2. Chi-square goodness-of-fit test:
o Checks if a sample distribution matches an expected distribution.
o Example: Are product sales equally distributed across all weekdays?
Formula

Assumptions

 Data must be in counts

 Categories must be mutually exclusive
 Expected frequency in each cell should be ≥ 5

The Chi-square test is often used in business to assess customer preferences, employee
satisfaction by department, or relationship between product category and return rate.

8. Cluster Analysis

Introduction

Cluster Analysis is a powerful unsupervised learning technique used to group similar data
points into clusters, where the data within each cluster is more similar to each other than to
those in other clusters. It is widely used in market research, customer segmentation, and
pattern recognition.

Purpose and Importance

The main purpose of clustering is to discover hidden structures or patterns in large

datasets. For instance, a company can use clustering to segment its customers into groups
such as price-sensitive, brand-loyal, and occasional buyers, allowing targeted marketing
strategies for each group.

Types of Clustering

1. Hierarchical Clustering:
o Creates a tree-like structure (dendrogram) to group data.
o Useful for small datasets.
2. K-Means Clustering:
o Divides data into K predefined clusters.
o Minimizes the within-cluster variance.

Steps in K-Means Clustering

 Choose number of clusters (K)

 Randomly assign data points to clusters
 Calculate cluster centroids
 Reassign points based on nearest centroid
 Repeat until convergence

Application in Business

 Customer segmentation for personalized offers

 Fraud detection in banking
 Inventory categorization based on turnover and value

Cluster analysis helps businesses maximize marketing ROI, reduce churn, and improve
operational efficiency.

9. Importance of Data Analytics in Business

Decision-Making

Data analytics enables fact-based decision-making by transforming raw data into actionable
insights. It replaces intuition with data-driven strategies. For example, analyzing past sales
can help forecast future demand, aiding in inventory planning.

Customer Insights

Businesses can use analytics to deeply understand customer behavior, preferences, and
feedback. By analyzing customer purchase history, a retailer can personalize product
recommendations and marketing messages, leading to improved customer retention.

Forecasting and Planning

Using predictive analytics, businesses can anticipate future trends, customer behavior, or
risks. For example, banks use historical loan data to predict default risks, allowing better
credit decisions.

Operational Efficiency

Analytics helps in identifying inefficiencies in processes. By tracking KPIs (Key

Performance Indicators), companies can reduce wastage, optimize resource utilization, and
improve service delivery. In manufacturing, analytics is used for quality control and
production optimization.

Competitive Advantage

Data-driven companies gain a competitive edge by reacting faster to market trends and
making smarter strategic decisions. For instance, companies like Amazon and Netflix thrive
because they leverage analytics to recommend products and content, enhancing customer
satisfaction.

OTHERS IN NOTE BOOK FOR REFERAL.

FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
No ratings yet
FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
409 pages
Business Statistics - Prof. Dr. Mukesh Kumar Barua
100% (1)
Business Statistics - Prof. Dr. Mukesh Kumar Barua
991 pages
Ebooks File (Ebook PDF) Business Statistics: A First Course 8th Edition All Chapters
100% (3)
Ebooks File (Ebook PDF) Business Statistics: A First Course 8th Edition All Chapters
50 pages
Data (Prod & Admin) - July 2023 - August
No ratings yet
Data (Prod & Admin) - July 2023 - August
332 pages
Law of Property and Easement-NOTES
No ratings yet
Law of Property and Easement-NOTES
62 pages
Electrical Installation Level 5 Learning Guide
No ratings yet
Electrical Installation Level 5 Learning Guide
76 pages
Final SRB Unit 2
No ratings yet
Final SRB Unit 2
162 pages
Complete SPSS Tests
No ratings yet
Complete SPSS Tests
148 pages
DBB2102 - Quantitative Techniques For Management
No ratings yet
DBB2102 - Quantitative Techniques For Management
11 pages
133838232771020568
No ratings yet
133838232771020568
269 pages
Presentation On Data Analysis: Submitted by
No ratings yet
Presentation On Data Analysis: Submitted by
38 pages
Runehammer OSE Hacked 1.2
100% (1)
Runehammer OSE Hacked 1.2
17 pages
Madrid Protocol TMR
No ratings yet
Madrid Protocol TMR
21 pages
HR Metrics and Analytics
No ratings yet
HR Metrics and Analytics
59 pages
Inderbir Singh Human Embryology 11th Edition by Subhadra Devi ISBN 9789352701155 9352701151 Instant Download
100% (4)
Inderbir Singh Human Embryology 11th Edition by Subhadra Devi ISBN 9789352701155 9352701151 Instant Download
46 pages
Quantitative Methods 3
No ratings yet
Quantitative Methods 3
174 pages
E-Note 33325 Content Document 20250319114322AM
No ratings yet
E-Note 33325 Content Document 20250319114322AM
69 pages
Module 5
No ratings yet
Module 5
51 pages
PaperCrafter - Issue 168, February 2022
100% (4)
PaperCrafter - Issue 168, February 2022
92 pages
Data Mining: Prepared By: Eesha Tur Razia Babar
No ratings yet
Data Mining: Prepared By: Eesha Tur Razia Babar
49 pages
Bs Regyular
No ratings yet
Bs Regyular
22 pages
ST LINES + CIRCLES TOP 200 PYQs of JEE Mains 2022
No ratings yet
ST LINES + CIRCLES TOP 200 PYQs of JEE Mains 2022
60 pages
Statistics Theory Notes
No ratings yet
Statistics Theory Notes
20 pages
Lesson 02 Probability and Statistics
No ratings yet
Lesson 02 Probability and Statistics
127 pages
T-Spot Test Results
No ratings yet
T-Spot Test Results
1 page
DS Unit 1
No ratings yet
DS Unit 1
99 pages
02 - Data Exploration: IS5740: Management Support and Business Intelligence Systems
No ratings yet
02 - Data Exploration: IS5740: Management Support and Business Intelligence Systems
37 pages
Pad Unit 2 Ibm
No ratings yet
Pad Unit 2 Ibm
61 pages
SFM Unit-1
No ratings yet
SFM Unit-1
48 pages
Module-1 Introduction To Statistics Definitions
No ratings yet
Module-1 Introduction To Statistics Definitions
54 pages
QM 1
No ratings yet
QM 1
58 pages
Typical Statistical Testing Procedures
No ratings yet
Typical Statistical Testing Procedures
29 pages
2-17-Descriptive Inferential Statistics - PT 1 - JA Edit
No ratings yet
2-17-Descriptive Inferential Statistics - PT 1 - JA Edit
49 pages
Unit 3
No ratings yet
Unit 3
20 pages
Foundations or Research Analysis
No ratings yet
Foundations or Research Analysis
31 pages
Business Statistics
100% (2)
Business Statistics
123 pages
Antim Prahar Business Statistics and Analysis - 240328 - 180758
No ratings yet
Antim Prahar Business Statistics and Analysis - 240328 - 180758
15 pages
The Data Analyst's Guide To Data Types, Distributions, and Statistical Tests
No ratings yet
The Data Analyst's Guide To Data Types, Distributions, and Statistical Tests
38 pages
Xie 2021
No ratings yet
Xie 2021
8 pages
Spark Streaming Assignment
No ratings yet
Spark Streaming Assignment
2 pages
DBB2102 Quantitative Techniques For Management
No ratings yet
DBB2102 Quantitative Techniques For Management
12 pages
Data Analysis
No ratings yet
Data Analysis
10 pages
Data Science 2
No ratings yet
Data Science 2
8 pages
Rotax 912 Operator's Manual
No ratings yet
Rotax 912 Operator's Manual
85 pages
Analytics - PrepBook 2018 PDF
No ratings yet
Analytics - PrepBook 2018 PDF
34 pages
Quantitative Methods in Management: Term II 4 Credits MGT 408
No ratings yet
Quantitative Methods in Management: Term II 4 Credits MGT 408
106 pages
Business Statistics KMBN-104 - Q - Ans
100% (1)
Business Statistics KMBN-104 - Q - Ans
30 pages
FAC3761 - Exam Prep - Mock Question Paper - Suggested Solution
No ratings yet
FAC3761 - Exam Prep - Mock Question Paper - Suggested Solution
9 pages
Ba Textbook Part2
No ratings yet
Ba Textbook Part2
10 pages
Submitted To Submitted by
No ratings yet
Submitted To Submitted by
44 pages
Quantitative Analysis Paper
No ratings yet
Quantitative Analysis Paper
15 pages
Business Statistics: Shalabh Singh Room No: 231 Shalabhsingh@iim Raipur - Ac.in
No ratings yet
Business Statistics: Shalabh Singh Room No: 231 Shalabhsingh@iim Raipur - Ac.in
58 pages
Lioba CV
No ratings yet
Lioba CV
5 pages
Prefinal-1 Model Paper (2024-25)
No ratings yet
Prefinal-1 Model Paper (2024-25)
4 pages
Section A-19241095-Sinhaj Noor
No ratings yet
Section A-19241095-Sinhaj Noor
33 pages
Sample Certificate of Non-Claim (Car Insurance Claim)
71% (7)
Sample Certificate of Non-Claim (Car Insurance Claim)
1 page
QMM Epgdm 1
No ratings yet
QMM Epgdm 1
113 pages
The NGINX Real-Time API Handbook
No ratings yet
The NGINX Real-Time API Handbook
26 pages
Business Statistics Assignment 2 & 3
No ratings yet
Business Statistics Assignment 2 & 3
6 pages
Korea University Urban Planning and Urban Design Lab
No ratings yet
Korea University Urban Planning and Urban Design Lab
4 pages
Ge8 Statistics
No ratings yet
Ge8 Statistics
2 pages
Syltherm HF Tds
No ratings yet
Syltherm HF Tds
2 pages
STATISTICS Grand Viva
No ratings yet
STATISTICS Grand Viva
28 pages
Submitted To Submitted by
No ratings yet
Submitted To Submitted by
44 pages
Satyam Cnlu Torts Roughdraft
No ratings yet
Satyam Cnlu Torts Roughdraft
4 pages
Quantitative Methods For Management: Term II 4 Credits MGT 408
No ratings yet
Quantitative Methods For Management: Term II 4 Credits MGT 408
49 pages
En10272 PDF
100% (1)
En10272 PDF
42 pages
10 Question Answer
No ratings yet
10 Question Answer
2 pages
Random Details
No ratings yet
Random Details
2 pages
Term Paper Stat
No ratings yet
Term Paper Stat
20 pages
1.1 Identify Ty
No ratings yet
1.1 Identify Ty
7 pages
EG8145V5 Quick Start 01 (R20C00)
No ratings yet
EG8145V5 Quick Start 01 (R20C00)
16 pages
Business Statistics A First Course - 6ed Index
0% (2)
Business Statistics A First Course - 6ed Index
7 pages
SCM 100 Review
No ratings yet
SCM 100 Review
23 pages
Balloon Tutorial
No ratings yet
Balloon Tutorial
19 pages
Lesson-Plan 1
No ratings yet
Lesson-Plan 1
2 pages
Welcome: To All MBA Students
No ratings yet
Welcome: To All MBA Students
60 pages
Chapter 1
No ratings yet
Chapter 1
36 pages
How Statistical Theory and Application Assists Business To Formulate and Design Strategies
No ratings yet
How Statistical Theory and Application Assists Business To Formulate and Design Strategies
8 pages
Instructor'S Manual: Statistical Techniques in Financial Management
No ratings yet
Instructor'S Manual: Statistical Techniques in Financial Management
3 pages
Lesson Planning in Teaching
No ratings yet
Lesson Planning in Teaching
10 pages
Multiple Choice Questions (1-5) 1 Tick For Each Correct Answer PDF
No ratings yet
Multiple Choice Questions (1-5) 1 Tick For Each Correct Answer PDF
2 pages
Business Statistics
No ratings yet
Business Statistics
20 pages
Research Analytics
25% (4)
Research Analytics
2 pages

Business Analytics Notes

Uploaded by

Business Analytics Notes

Uploaded by

BUSINESS ANALYTICS NOTES

3. Standard Deviation, Mean, Skewness

Mean=∑xn\text{Mean} = \frac{\sum x}{n}Mean=n∑x

Standard Deviation (SD)

Standard Deviation is a measure of dispersion or spread in a dataset. It indicates how much

PAGE NO. = 61, 64 & 65

4. Normality and Distribution of Data

Properties of Normal Distribution

 It is symmetric around the mean.

Why Normality Matters in Analytics

How to Check for Normality

Normality can be visually assessed using:

Definition and Purpose

1. One-sample t-test: Compares the mean of a single group with a known or

Formula for Independent t-test

 Data should be normally distributed

The t-test is widely used in business to compare employee performance, marketing

Types of Chi-Square Tests

1. Chi-square test of independence:

 Data must be in counts

Purpose and Importance

The main purpose of clustering is to discover hidden structures or patterns in large

Steps in K-Means Clustering

 Choose number of clusters (K)

 Customer segmentation for personalized offers

9. Importance of Data Analytics in Business

Forecasting and Planning

Analytics helps in identifying inefficiencies in processes. By tracking KPIs (Key

OTHERS IN NOTE BOOK FOR REFERAL.

You might also like