Unit 1
Unit 1
UNIT- 1:
Topics to be Covered
• Quantitative and Qualitative Data,
• Attributes and variables,
• Scales of measurement: nominal, ordinal, interval and
ratio,
• Measures of Central Value: Mean, Median, Mode,
• Measures of Dispersion: Absolute and Relative
measures of dispersion – Range, Quartile Deviation,
Mean Deviation, Standard Deviation, Moments,
Skewness, Kurtosis.
• Visualization of Data: Histograms, Stem and Leaf Plots,
Five Number Summary and Box Plots.
• Introduction to Big Data: Characteristics and Stages.
Readings
• Statistics for Business by Stiner & Foster D.,
Pearson India
(Chapter 2,3,4)
• Statistical Methods by S.P.Gupta, Vol. 1
(Chapter 5,6,7,8,9)
• Business Statistics by N.D.Vohra
• Box and Whisker Plot, Moments, Five Point
Summary)
Quantitative & Qualitative Data
• Quantitative Data:
• Quantitative data involves numerical information
and is used to quantify observations or
measurements. It deals with objective,
measurable data.
• Examples: Sales figures, revenue, market share,
number of employees, customer satisfaction
scores.
• Methods of Collection: Surveys, experiments,
structured observations, numerical records.
Qualitative Data:
Attribute Rank
Food quality 1
Prices 3
Menu variety 2
Ambience 5
Service 4
INTERVAL SCALE
• Next higher level of measurement than ordinal scale
• Takes care of limitation of ordinal scale where
difference between scores on ordinal scale does not
have any meaningful interpretation
• In interval scale the difference of the score on the
scale has meaningful interpretation
• Mathematical form:
Y = a + bX where a is not equal to 0
• Interval scale data has an arbitrary origin (non-zero
origin).
Example
• Celcius and Fahrenheit
• C’ = 5/9 (F’ – 32)
• The difference in score has a meaningful
interpretation but the ratio of the score in not
meaningful
• How important is price to you while buying a car?
a. Least important (1)
b. Unimportant (2)
c. Neutral (3)
d. Important (4)
e. Most Important (5)
Freq 80 15 25 90 30
Class width 40 10 10 30 20
200-400 300
400-600 500
600-800 700
800-1000 900= A
1000-1200 1100
1200-1400 1300
1400-1600 1500
total
Question.
Class Interval Mid points F d fd
(X)
200-400 300 500 -3 -1500
400-600 500 300 -2 -600
600-800 700 280 -1 -280
800-1000 900= A 120 0 0
1000-1200 1100 100 1 100
1200-1400 1300 80 2 160
1400-1600 1500 20 3 60
total 1400 -2060
Combined Mean
•
Boys Girls
Number 100 50
Mean Weight 60 kg 45 kg
Answer
•
Median
•
Question
Class interval f cf
18-22 120 120
22-26 125 245
26-30 280 525
30-34 260 785
34-38 155 940
38-42 184 1124
42-46 162 1286
46-50 86 1372
50-54 75 1447
54-58 53 1500
•
Quartile deviation
• Interquartile range = Q3-Q1
• Q.D. = (Q3-Q1) / 2
• It is an absolute measure of dispersion
• Example: obtain the quartile deviation for the
following data:
490, 540, 590, 600, 620, 650, 680, 770, 830, 840, 890,
900
• Here Q1= median of lower half of set (490, 540, 590,
600, 620, 650) = (590+ 600)/ 2 = 595
• And Q3 = median of upper half of set (680, 770, 830,
840, 890, 900) = (830+840/2) = 835
• Hence Q.D. = (835-595)/ 2 = 120
Mean Deviation
•
Example
• Suppose you have the following set of exam
scores for a class:
• ={68,72,75,78,80}X={68,72,75,78,80}
Steps to Calculate Mean Deviation: