TECH 4070-Ch02
TECH 4070-Ch02
TECH 4070-Ch02
Intelligence,
Analytics,
and Data Chapter 2
Science: A Descriptive Analytics I:
Nature of Data, Statistical
Managerial Modeling, and Visualization
Perspective
Learning Objectives (1 of 2)
• Data reduction
1. Variables
– Dimensional reduction
– Variable selection
2. Cases/samples
– Sampling
– Balancing / stratification
Data Preprocessing Tasks and Methods (1 of 3)
• Statistics
– A collection of mathematical techniques to
characterize and interpret data
• Descriptive Statistics
– Describing the data (as it is)
• Inferential statistics
– Drawing inferences about the population based on
sample data
• Descriptive statistics for descriptive analytics
Descriptive Statistics Measures of Centrality
Tendency
• Arithmetic mean
x1 + x2 + + xn
n
x
x = x = i =1 i
n n
• Median
– The number in the middle
• Mode
– The most frequent observation
Descriptive Statistics Measures of
Dispersion (1 of 2)
• Dispersion
– Degree of variation in a given
variable
• Range
– Max - Min
Standard Deviation
• Variance
n
n
( xi − x) 2 ( xi − x) 2
s =
2 i =1 s = i =1
n −1 n −1
• Mean Absolute Deviation (MAD)
– Average absolute deviation from the mean
Descriptive Statistics Measures of
Dispersion (2 of 2)
• Quartiles
• Box-and-Whiskers Plot
– a.k.a. box-plot
– Versatile / informative
Descriptive Statistics Shape of a Distribution
i =1 i
n
( x − x ) 3
Skewness = S =
(n − 1) s 3
• Kurtosis
– Peak/tall/skinny nature of the distribution
i =1 i
n
( x − x ) 4
Kurtosis = K = 4
− 3
ns
Relationship Between Dispersion and
Shape Properties
Technology Insights 2.1 (1 of 2)
Descriptive Statistics in Excel
Technology Insights 2.1 (2 of 2)
Descriptive Statistics in Excel Creating box-plot in Microsoft Excel
Business Reporting Definitions and Concepts