Parc 6

The document outlines a statistical analysis of a dataset using Python, focusing on key measures such as mean, median, mode, variance, and standard deviation. It emphasizes the importance of these measures in understanding data distribution and variability, and provides a Python code example for their computation. By the end of the analysis, users will be equipped to perform these statistical calculations efficiently using Python libraries.

Uploaded by

Vedant Kolte
Copyright
© All Rights Reserved

Title:

Statistical Analysis of a Dataset Using Python

Objective:
To compute and analyze key statistical measures such as mean, median,
mode, variance, and standard deviation for a given dataset using Python
libraries (numpy and statistics).

Problem Statement:
Calculate mean, median, mode, variance and standard deviation of a
dataset.

Outcomes:
By the end of this analysis, you will be able to:
1. Understand and calculate the mean, median, and mode of a dataset.
2. Compute variance and standard deviation to measure data
dispersion.
3. Differentiate between sample and population-based statistical
calculations.
4. Implement statistical computations using Python efficiently.

Theory:
Statistical measures provide insights into data distribution and variability.
1. Mean: The arithmetic average of all values in a dataset. It represents
the central tendency.
2. Median: The middle value when the dataset is sorted. If the number
of elements is even, it is the average of the two middle values.
3. Mode: The most frequently occurring value in a dataset. A dataset
may have one or multiple modes.
4. Variance: The average squared deviation of the data points from the
mean. A higher variance indicates greater variability.
5. Standard Deviation: The square root of the variance. It is expressed in
the same units as the data and indicates how far values typically
deviate from the mean.
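
The definitions above can be written out as formulas. This is a sketch in standard notation, which the original text does not use: n is the number of values, x_i the i-th value, \mu the population mean, and \bar{x} the sample mean. When the dataset is treated as the whole population, \mu is simply the mean of all its values.

```latex
\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i
\qquad \text{(mean)}

\sigma^2 = \frac{1}{n} \sum_{i=1}^{n} (x_i - \mu)^2
\qquad \text{(population variance)}

s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2
\qquad \text{(sample variance)}

\sigma = \sqrt{\sigma^2}, \qquad s = \sqrt{s^2}
\qquad \text{(standard deviations)}
```

The only difference between the population and sample forms is the divisor: n versus n - 1.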
The standard-library statistics module and the third-party numpy library
provide functions to compute these measures efficiently.
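
As a minimal sketch of the central-tendency measures described above, the following uses only the standard-library statistics module. The dataset matches the one in the CODE section; the second list, `bimodal`, is a hypothetical example added here to show a dataset with more than one mode.

```python
import statistics

# Same values as the dataset used in the CODE section of this document.
data = [10, 15, 82, 33, 82, 55, 44, 91, 30, 90]

mean = statistics.mean(data)        # sum of values divided by their count
median = statistics.median(data)    # even count: average of 44 and 55
modes = statistics.multimode(data)  # list of all most frequent values

# Hypothetical dataset (not from the original) with two modes.
bimodal = [1, 2, 2, 3, 3]
bimodal_modes = statistics.multimode(bimodal)

print("Mean:", mean)                    # 53.2
print("Median:", median)                # 49.5
print("Modes:", modes)                  # [82]
print("Bimodal modes:", bimodal_modes)  # [2, 3]
```

statistics.multimode (Python 3.8 and later) returns every value that ties for the highest frequency, whereas statistics.mode returns a single value.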

Conclusion:
This analysis successfully computed fundamental statistical measures using
Python. The results provide insights into the dataset’s distribution and
variability. The mean indicates the central value, the median gives the
midpoint, and the mode highlights the most frequent value. The variance and
standard deviation quantify data dispersion. Understanding these measures is
essential for statistical analysis and data science applications.

CODE:
import statistics

import numpy

data = [10, 15, 82, 33, 82, 55, 44, 91, 30, 90]

# Central tendency
mean = numpy.mean(data)
mode = statistics.mode(data)  # most frequent value
median = numpy.median(data)

# Dispersion: numpy defaults to ddof=0, i.e. population formulas (divide by n)
std = numpy.std(data)
var = numpy.var(data)

print("Mean: ", mean)
print("Mode: ", mode)
print("Median: ", median)
print("Variance: ", var)
print("Standard Deviation: ", std)
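
The code above uses numpy.var and numpy.std with their defaults, which apply the population formulas (ddof=0); passing ddof=1 to either function gives the sample estimate instead. To illustrate the sample versus population distinction named in the Outcomes, here is a small sketch that puts both side by side. It uses the standard-library statistics module rather than numpy, which is a substitution made for this illustration and not the method of the original code.

```python
import math
import statistics

# Same values as the dataset used in the code above.
data = [10, 15, 82, 33, 82, 55, 44, 91, 30, 90]

# Population formulas: divide the sum of squared deviations by n.
pop_var = statistics.pvariance(data)
pop_std = statistics.pstdev(data)

# Sample formulas: divide by n - 1 (Bessel's correction).
samp_var = statistics.variance(data)
samp_std = statistics.stdev(data)

print("Population variance:", pop_var)
print("Sample variance:", samp_var)
print("Population standard deviation:", pop_std)
print("Sample standard deviation:", samp_std)
```

For this dataset the population variance is 880.16 and the sample variance is about 977.96; the sample figure is always the larger of the two because it divides the same sum by a smaller number.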


OUTPUT:
