Introduction To Statistics
Introduction To Statistics
Median
Median is the middle value in a dataset when arranged in ascending or descending order.
•If the number of observations is odd, the median is the middle value.
•If the number of observations is even, the median is the average of the two middle values.
Mode
Mode is the value that appears most frequently in a dataset. Example: Given the dataset {2, 4, 5, 5,
6, 7}, the mode is 5 since it appears twice.
Exercise : BP608T
Exercise : BP608T
Measures of Dispersion
Dispersion refers to the spread or variability of data in a dataset. It helps in understanding how much
the data varies from one another.
Types of Dispersion Measures
1.Absolute Measures of Dispersion: These have the same unit as the original dataset (e.g., Range,
Standard Deviation, Quartile Deviation).
2.Relative Measures of Dispersion: These are expressed as ratios or percentages (e.g., Coefficients
of Range, Variation, Standard Deviation, Quartile Deviation, Mean Deviation).
Range
Range is the difference between the highest and lowest values in a dataset.
Standard Deviation (SD)
Standard Deviation (SD) measures the amount of variation or dispersion in a dataset.
•A low SD indicates that values are close to the mean.
•A high SD suggests that values are spread over a wider range.
Exercise : BP608T
Standard Deviation (SD)
Standard Deviation (SD) measures the amount of variation or dispersion in a dataset.
•A low SD indicates that values are close to the mean.
•A high SD suggests that values are spread over a wider range.
Formula:
Exercise : BP608T
Karl Pearson's Coefficient of Correlation
Karl Pearson's Coefficient of Correlation is a statistical measure used to determine the degree of
linear relationship between two variables. It is also called the product-moment correlation
coefficient and is denoted by 'r'.
Formula
The coefficient of correlation is given by:
Exercise : BP608T
Merits
1.It provides a precise and quantitative measure of correlation with a meaningful interpretation.
2.It indicates both the direction (positive or negative) and the strength of the correlation.
Demerits
1.The calculation is time-consuming for large datasets.
2.The correlation coefficient is always between -1 and +1:
Exercise : BP608T
Multiple Correlation
Multiple correlation measures the relationship between three or more variables.
If z is the dependent variable and x and y are independent variables, the multiple correlation
formula is:
Exercise : BP608T