BI-LEc 3
BI-LEc 3
Analysis (EDA)
Lecture 3- Miss Ushna Tasleem
Exploratory Data Analysis (EDA) is a process of
describing the data using statistical and
visualization techniques to bring important
aspects of that data into focus for further analysis.
This involves inspecting the dataset from many
angles, describing & summarizing it without
Introduction making any assumptions about its contents.
Exploratory data that can be tested further using more sophisticated analytical
EDA Example:
EDA features).
EDA • Example:
• T-tests and ANOVA: A BI analyst conducts a t-test to
Techniques compare the average sales before and after a new
marketing campaign, determining if the observed
increase is statistically significant.
• Chi-Square Test: An e-commerce company uses a Chi-
Square test to examine the relationship between
customer gender and product category preference,
finding significant associations that inform targeted
marketing strategies.
8. Time Series Analysis
• Trend Analysis: Identifying trends over time, such as
seasonal patterns or long-term shifts, using line plots or
time series decomposition.
• Autocorrelation: Measures the correlation of a time
series with its own past values, helping to identify
patterns over time.
EDA • Example:
Techniques • Trend Analysis: A BI team analyzes monthly sales
data over several years to identify seasonal trends,
such as increased sales during the holiday season, and
uses this insight for inventory planning.
• Autocorrelation: An autocorrelation function is
applied to daily website traffic data to detect any
repeating patterns or cycles, such as weekly peaks in
traffic.
9. Clustering and Segmentation
K-Means Clustering: Grouping data points into
clusters based on similarity, often used for customer
segmentation in BI.
Hierarchical Clustering: Creating a tree of clusters to
understand data groupings at different levels of
similarity.
EDA Examples: