DS2 1
DS2 1
: 210430116083
Experiment No: 2
Date:
AIM: Study of Measures of Central Tendency, Correlation, Percentile, Decile, Quartile, Measure
of Variation, and Measure of Shape (Skewness and Kurtosis) with Excel Functions
Objective:
The objective of this lab practical is to provide students with hands-on experience in using Excel
functions to explore and analyze a sample data sheet. Students will learn to calculate measures of
central tendency, correlation, percentile, decile, quartile, measure of variation, and measure of
shape using Excel functions. Additionally, students will learn to create visualizations to better
understand the data.
Materials:
- Computer with Microsoft Excel installed
- Sample data sheet (provided below or dataset may be provided by subject teacher)
Test1 Test2
StudentID Score Score Age Gender
1 85 92 19 Male
2 92 87 20 Female
3 78 80 18 Male
4 85 89 19 Male
5 90 95 21 Female
6 75 82 18 Male
7 83 87 20 Female
8 92 90 19 Male
9 80 85 18 Female
10 87 88 20 Female
|
Procedure:
Part 2: Correlation
1. Calculate the correlation between test 1 score and test 2 score using Excel functions.
9
Information Technology
Enrolment No.: 210430116083
2. Create a scatter plot to visualize the relationship between test 1 score and test 2 score.
3. Write a brief interpretation of the results.
Interpretation/Program/Code/Output:
Test 1 score
Mean 84.7
Median 85
Mode 85
3. Calculate the mean, median, and mode for the test 2 score column using Excel functions.
Test 2 score
Mean 87.5
Median 87
Mode 87
10
Information Technology
Enrolment No.: 210430116083
overall sense of the central value of the data. Here, the mean test 1 score is 84.7, it means that, on
average, the students scored 84.7 on this test. The Median represents the middle value when all
the scores are sorted in ascending order. Here, the median test 2 score is 87, it means that half of
the students scored above 87 and half scored below 87. The Mode is the value that occurs most
frequently in the data set. Here, the mode for test 1 scores is 85, it means that 85 is the score that
appeared most often among the students.
Part 2: Correlation
1. Calculate the correlation between test 1 score and test 2 score using Excel functions.
Correlation: - “=CORREL(A2:Ax, B2:By)”
2. Create a scatter plot to visualize the relationship between test 1 score and test 2 score.
88
86
84
82
80
78
0 20 40 60 80 100
test 1 score
11
Information Technology
Enrolment No.: 210430116083
2. Calculate the 30th, 40th, and 70th deciles using Excel functions for both test 1 score and test 2
score columns.
For 30th decile: “=PERCENTILE(A2:Ax, 0.3)”
For 40th decile: “=PERCENTILE(A2:Ax, 0.4) ”
For 70th decile: “=PERCENTILE(A2:Ax, 0.7) ”
3. Calculate the first and third quartiles using Excel functions for both test 1 score and test 2 score
columns.
For first quartile: “=QUARTILE(A2:Ax, 1)”
For third quartile: “=QUARTILE(A2:Ax, 3)”
4. Create a box plot for both test 1 score and test 2 score columns.
12
Information Technology
Enrolment No.: 210430116083
13
Information Technology
Enrolment No.: 210430116083
heavy tails and more extreme values than a normal distribution (leptokurtic), while negative
kurtosis implies lighter tails and fewer extremes (platykurtic). A kurtosis value near zero signifies
a distribution resembling a normal one.
Conclusion:
In this experiment, we explored various statistical measures using Excel functions. We
calculated central tendencies, correlation, percentiles, deciles, quartiles, measures of variation,
skewness, and kurtosis to gain a comprehensive understanding of the data's distribution and
relationships. These analyses provided valuable insights into the data's characteristics and allowed
us to make informed interpretations about its central tendency, spread, shape, and associations.
Quiz:
1) What Excel function can be used to calculate the mean of a dataset?
a) AVERAGE
b) MEDIAN
c) MODE
d) STANDARDIZE
2) What does the correlation coefficient measure in terms of the relationship between two
variables?
a) Strength of the linear relationship
b) Variability of the data
c) Difference between mean and median
d) Skewness of the distribution
Suggested References:
1. "Microsoft Excel Data Analysis and Business Modeling" by Wayne L. Winston
2. "Excel 2021: Data Analysis and Business Modeling" by Wayne L. Winston
14
Information Technology