Research Methodology - 11
Research Methodology - 11
Research Methodology - 11
MODULE XI: DATA ANALYSIS
Example: Monthly sales (₹ 000s) for 5 months: 12, 15, 18, 20, 25
Sum = 12+15+18+20+25 = 90
Mean = 90 ÷ 5 = 18
Definition: The middle observation when data are ordered. If n is even, it’s the average of the two central values.
Example (Odd n): Sales above → ordered [12, 15, 18, 20, 25], median = 18 (3ᵈ value).
Example (Even n): Add one more month, say 30 → [12, 15, 18, 20, 25, 30], median = (18+20)/2 = 19
Definition: Observation that occurs most often. A distribution may be unimodal, bimodal, or multimodal.
Key Point: Only measure defined for nominal data; may not exist (all unique) or not be unique (multiple modes).
B. MEASURES OF DISPERSION
Quantify the spread or variability in the data.
1. Range
Research Methodology - 11 1
Key Point: Simple but highly sensitive to outliers; ignores intermediate values.
Formula:
s=1n−1∑i=1n(xi−xˉ)2 s = \sqrt{\frac{1}{n-1}\sum_{i=1}^n (x_i - \bar{x})^2}
4. SD = √24.5 ≈ 4.95
Key Point: Uses all data; common basis for further analysis (e.g., z-scores).
1. Correlation
Purpose: Measures the strength and direction of the linear relationship between two variables X and Y.
Pearson’s r formula:
r=∑(xi−xˉ)(yi−yˉ)∑(xi−xˉ)2 ∑(yi−yˉ)2 r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum(x_i - \bar{x})^2
\;\sum(y_i - \bar{y})^2}}
Perfect linear ↑ → r = +1
Interpretation:
r ≈ +1 strong positive
r ≈ 0 no linear relationship
r ≈ –1 strong negative
Example (continued):
Key Points:
a = predicted Y when X = 0.
INTEGRATED EXAMPLE
A small bakery tracks daily advertising spend (₹000s) and daily sales (₹000s) over 7 days:
1 2 20
Research Methodology - 11 2
2 3 30
3 5 45
4 4 40
5 6 60
6 7 70
7 8 80
Mean X = 5; Mean Y = 50
Range sales = 80 – 20 = 60
Regression line:
a = 50 – 10×5 = 0
Research Methodology - 11 3