0% found this document useful (0 votes)
6 views6 pages

DS2 1

The document outlines a lab practical aimed at teaching students how to use Excel functions to analyze data through measures of central tendency, correlation, percentiles, deciles, quartiles, variation, skewness, and kurtosis. It includes detailed procedures for each part of the analysis, interpretations of results, and a quiz to assess understanding. Additionally, it provides suggested references for further reading and a rubric for grading.

Uploaded by

Armankhan Pathan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views6 pages

DS2 1

The document outlines a lab practical aimed at teaching students how to use Excel functions to analyze data through measures of central tendency, correlation, percentiles, deciles, quartiles, variation, skewness, and kurtosis. It includes detailed procedures for each part of the analysis, interpretations of results, and a quiz to assess understanding. Additionally, it provides suggested references for further reading and a rubric for grading.

Uploaded by

Armankhan Pathan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Enrolment No.

: 210430116083

Experiment No: 2

Date:

AIM: Study of Measures of Central Tendency, Correlation, Percentile, Decile, Quartile, Measure
of Variation, and Measure of Shape (Skewness and Kurtosis) with Excel Functions

Relevant CO: CO1, CO2

Objective:
The objective of this lab practical is to provide students with hands-on experience in using Excel
functions to explore and analyze a sample data sheet. Students will learn to calculate measures of
central tendency, correlation, percentile, decile, quartile, measure of variation, and measure of
shape using Excel functions. Additionally, students will learn to create visualizations to better
understand the data.

Materials:
- Computer with Microsoft Excel installed
- Sample data sheet (provided below or dataset may be provided by subject teacher)

Sample Data Sheet:

Test1 Test2
StudentID Score Score Age Gender
1 85 92 19 Male
2 92 87 20 Female
3 78 80 18 Male
4 85 89 19 Male
5 90 95 21 Female
6 75 82 18 Male
7 83 87 20 Female
8 92 90 19 Male
9 80 85 18 Female
10 87 88 20 Female
|
Procedure:

Part 1: Measures of Central Tendency


1. Open the sample data sheet in Excel.
2. Calculate the mean, median, and mode for the test 1 score column using Excel functions.
3. Calculate the mean, median, and mode for the test 2 score column using Excel functions.
4. Write a brief interpretation of the results.

Part 2: Correlation
1. Calculate the correlation between test 1 score and test 2 score using Excel functions.

9
Information Technology
Enrolment No.: 210430116083

2. Create a scatter plot to visualize the relationship between test 1 score and test 2 score.
3. Write a brief interpretation of the results.

Part 3: Percentile, Decile, and Quartile


1. Calculate the 25th, 50th, and 75th percentiles using Excel functions for both test 1 score and test
2 score columns.
2. Calculate the 30th, 40th, and 70th deciles using Excel functions for both test 1 score and test 2
score columns.
3. Calculate the first and third quartiles using Excel functions for both test 1 score and test 2 score
columns.
4. Create a box plot for both test 1 score and test 2 score columns.
5. Write a brief interpretation of the results.

Part 4: Measure of Variation


1. Calculate the range, inter-quartile distance, variance, and standard deviation for both test 1 score
and test 2 score columns using Excel functions.
2. Write a brief interpretation of the results.

Part 5: Measure of Shape


1. Calculate the skewness and kurtosis for both test 1 score and test 2 score columns using Excel
functions.
2. Write a brief interpretation of the results.

Interpretation/Program/Code/Output:

Part 1: Measures of Central Tendency


1. Open the sample data sheet in Excel.
2. Calculate the mean, median, and mode for the test 1 score column using Excel functions.
Mean: - “=AVERAGE(A2:Ax)”
Median: - “=MEDIAN(A2: Ax)”
Mode: - “ =MODE.(A2:Ax)”

Test 1 score
Mean 84.7
Median 85
Mode 85
3. Calculate the mean, median, and mode for the test 2 score column using Excel functions.
Test 2 score
Mean 87.5
Median 87
Mode 87

4. Write a brief interpretation of the results.


The Mean (average) gives you the arithmetic average of the test scores. It provides an

10
Information Technology
Enrolment No.: 210430116083

overall sense of the central value of the data. Here, the mean test 1 score is 84.7, it means that, on
average, the students scored 84.7 on this test. The Median represents the middle value when all
the scores are sorted in ascending order. Here, the median test 2 score is 87, it means that half of
the students scored above 87 and half scored below 87. The Mode is the value that occurs most
frequently in the data set. Here, the mode for test 1 scores is 85, it means that 85 is the score that
appeared most often among the students.

Part 2: Correlation
1. Calculate the correlation between test 1 score and test 2 score using Excel functions.
Correlation: - “=CORREL(A2:Ax, B2:By)”

Correlation of test 1 score and


0.744684233
test 2 score

2. Create a scatter plot to visualize the relationship between test 1 score and test 2 score.

Relationship between test 1 score and test 2 score


96
94
92
90
test 2 score

88
86
84
82
80
78
0 20 40 60 80 100
test 1 score

3. Write a brief interpretation of the results.


The correlation coefficient ranges from -1 to 1. Here the value of correlation coefficient is
0.744684233, that indicates, test 1 score and test 2 score have a positive relationship that means if
test 1 score increases then test 2 score will increase linearly.

Part 3: Percentile, Decile, and Quartile


1. Calculate the 25th, 50th, and 75th percentiles using Excel functions for both test 1 score and test
2 score columns.

For 25th percentile: “=PERCENTILE(A2:Ax, 0.25)”


For 50th percentile: “=PERCENTILE(A2:Ax, 0.5) ”
For 75th percentile: “=PERCENTILE(A2:Ax, 0.75) ”

11
Information Technology
Enrolment No.: 210430116083

Test 1 score Test 2 score


th
25 Percentile 80.75 85.5
50th Percentile 85 87
75th Percentile 87.75 89.25

2. Calculate the 30th, 40th, and 70th deciles using Excel functions for both test 1 score and test 2
score columns.
For 30th decile: “=PERCENTILE(A2:Ax, 0.3)”
For 40th decile: “=PERCENTILE(A2:Ax, 0.4) ”
For 70th decile: “=PERCENTILE(A2:Ax, 0.7) ”

Test 1 score Test 2 score


th
30 Decile 83.17 87.05
40th Decile 84.36 87
th
70 Decile 85.4 87.6

3. Calculate the first and third quartiles using Excel functions for both test 1 score and test 2 score
columns.
For first quartile: “=QUARTILE(A2:Ax, 1)”
For third quartile: “=QUARTILE(A2:Ax, 3)”

Test 1 score Test 2 score


First Quartile 80.75 85.5
Third Quartile 90 89

4. Create a box plot for both test 1 score and test 2 score columns.

12
Information Technology
Enrolment No.: 210430116083

5. Write a brief interpretation of the results.


Percentiles provide insight into the distribution of your data. The 25th percentile (1st
quartile) represents the value below which 25% of the data falls. The 50th percentile (median) is
the middle value of your data. The 75th percentile (3rd quartile) represents the value below which
75% of the data falls. Deciles provide further granularity, showing values that divide your data
into 10 equal parts. Quartiles give you a sense of the spread of your data. The 1st quartile and 3rd
quartile represent the lower and upper quarters of your data, respectively.

Part 4: Measure of Variation


1. Calculate the range, inter-quartile distance, variance, and standard deviation for both test 1 score
and test 2 score columns using Excel functions.
Range: “=MAX(A2:Ax) - MIN(A2:Ax)”
Inter-quartile distance: “=QUARTILE(A2:Ax, 3) - QUARTILE(A2:Ax, 1)”
Variance: “=VAR(A2:Ax)”
Standard deviation: “=STDEV(A2:Ax)”

Test 1 score Test 2 score


Range 17 15
Inter-quartile distance 9.25 3.5
Variance 33.78888889 19.83333333
Standard deviation 5.812821078 4.453463072

2. Write a brief interpretation of the results.


The range indicates the spread or variability in your data. A larger range suggests a wider
spread of scores, while a smaller range suggests a more concentrated distribution. Inter-quartile
Distance (IQR) provides a measure of the spread of the middle 50% of your data, which is less
affected by outliers than the range. Variance and standard deviation measure the average deviation
of data points from the mean. A higher variance or standard deviation indicates greater variability
in scores.

Part 5: Measure of Shape


1. Calculate the skewness and kurtosis for both test 1 score and test 2 score columns using Excel
functions.
Skewness: “=SKEW(A2:Ax)”
Kurtosis: “=KURT(A2:Ax)”

Test 1 score Test 2 score


Skewness -0.270863675 -0.113215642
Kurtosis -0.924830004 -0.046667406

2. Write a brief interpretation of the results.


Skewness assesses how lopsided or asymmetrical a data distribution is. Positive skewness
means the distribution is skewed to the right with a longer tail on that side, while negative skewness
indicates a leftward skew with a longer tail on the left. Near-zero skewness suggests a relatively
symmetric distribution. Kurtosis measures the distribution's tail behavior. Positive kurtosis implies

13
Information Technology
Enrolment No.: 210430116083

heavy tails and more extreme values than a normal distribution (leptokurtic), while negative
kurtosis implies lighter tails and fewer extremes (platykurtic). A kurtosis value near zero signifies
a distribution resembling a normal one.

Conclusion:
In this experiment, we explored various statistical measures using Excel functions. We
calculated central tendencies, correlation, percentiles, deciles, quartiles, measures of variation,
skewness, and kurtosis to gain a comprehensive understanding of the data's distribution and
relationships. These analyses provided valuable insights into the data's characteristics and allowed
us to make informed interpretations about its central tendency, spread, shape, and associations.

Quiz:
1) What Excel function can be used to calculate the mean of a dataset?
a) AVERAGE
b) MEDIAN
c) MODE
d) STANDARDIZE

2) What does the correlation coefficient measure in terms of the relationship between two
variables?
a) Strength of the linear relationship
b) Variability of the data
c) Difference between mean and median
d) Skewness of the distribution

Suggested References:
1. "Microsoft Excel Data Analysis and Business Modeling" by Wayne L. Winston
2. "Excel 2021: Data Analysis and Business Modeling" by Wayne L. Winston

Rubrics wise marks obtained

Understanding of Analysis of Capability of Documentation


Total
Problem the Problem writing program
02 02 05 01 10

14
Information Technology

You might also like