0% found this document useful (0 votes)

15 views6 pages

Adsexp 1

The document outlines an experiment focused on studying and implementing descriptive and inferential statistics using a dataset. It explains the concepts of descriptive statistics, including measures of central tendency and dispersion, as well as inferential statistics methods like regression analysis and hypothesis testing. The document also includes a Python program for calculating and visualizing statistical measures using the Iris dataset.

Uploaded by

om29khatri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views6 pages

Adsexp 1

Uploaded by

om29khatri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Roll no:

Date:
EXPERIMENT NO.:01

Aim : Study and Implement Descriptive and Inferential Statistics on a given dataset.

Theory:
Statistics is the branch of mathematics that deals with collecting, analyzing, interpreting,
presenting, and organizing data. It involves the study of methods for gathering, summarizing,
and interpreting data to make informed decisions and draw meaningful conclusions.

Descriptive Statistics: Descriptive statistics is a term given to the analysis of data that helps to
describe, show and summarize data in a meaningful way. It is a simple way to describe our data.
Descriptive statistics is very important to present our raw data in an ineffective/meaningful way
using numerical calculations or graphs or tables. This type of statistics is applied to already
known data.

Types of Descriptive Statistics:

1. Measure of Central Tendency
2. Measure of Dispersion

1. Measures of Central Tendency:

● Mean: Calculated as the sum of values divided by the number of values. It is sensitive to
outliers.
In Python, we can calculate data mean with the following code.
round(tips[‘tip’].mean(),3)
● Median: The middle value when the data is sorted, useful when dealing with skewed
data.We can calculate the Median with Python using the following code.
tips[‘tip’].median()
● Mode: The most frequent value in the dataset. Useful for categorical data.We can
calculate the data Mode with the following code.

tips['day'].mode()
2. Measures of Spread:
● Range: The difference between the maximum and minimum values.
tips['tip'].max() - tips['tip'].min()

● Variance: The average of the squared differences from the mean. Sensitive to outliers.
round(tips['tip'].var(),3)

● Standard Deviation: The square root of the variance. Provides a measure of data spread
in the same units as the original data.
round(tips['tip'].std(),3)

3. Skewness and Kurtosis

● Skewness: Measures the asymmetry of the data distribution (whether the data is skewed
to the left or right).
● Kurtosis: Measures the "tailedness" of the data distribution (how heavy or light the tails
are, and how extreme the values are compared to a normal distribution).

Inferential Statistics: In inferential statistics, predictions are made by taking any group of data
in which you are interested. It can be defined as a random sample of data taken from a
population to describe and make inferences about the population. Any group of data that includes
all the data you are interested in is known as population. It basically allows you to make
predictions by taking a small sample instead of working on the whole population.

Types of Inferential Statistics:

1. Regression analysis

2. Hypothesis Testing

1. Regression analysis:Calculates how one variable will change to another. Linear
regression is the most common type of regression used in inferential statistics.
2. Hypothesis Testing: A method to draw conclusions about a population parameter based
on sample data. It involves setting null and alternative hypotheses, determining a
significance level, and using the p-value to decide whether to reject the null hypothesis.
● Z-Test is mainly used when comparing two groups or a sample mean to a
population mean, especially with large sample sizes.
● ANOVA is used when comparing three or more groups to assess if there is any
significant difference between them. It is widely used in experimental designs
involving multiple groups.
Aspect Descriptive Statistics Inferential Statistics

Purpose Summarizes and describes the Makes predictions or generalizations about a

characteristics of a data set. population based on a sample.

Scope Deals with observed data Deals with inferences and predictions about
(specific to the sample or a larger population based on a sample
population).

Outcome Provides summary measures and Draws conclusions, tests hypotheses, and
visual representations (e.g., mean, makes predictions beyond the data (e.g.,
variance). p-values, confidence intervals).

Key Methods Mean, Median, Mode, Standard Hypothesis Testing (t-tests, chi-square tests),
Deviation, Range, Histograms, Confidence Intervals, Regression, ANOVA,
Bar Charts Correlation.

Example Calculating the average age of a Using data from a sample to predict the
group of students, creating a average height of a population of students or
histogram of test scores testing if two groups have different average
heights.

Data Focus Describes the actual data Makes inferences about a population or
collected future events based on the sample.
Program:
# Import necessary libraries
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
from scipy import stats
from sklearn.datasets import load_iris

# Load the Iris dataset

iris = load_iris()
iris_df = pd.DataFrame(data=iris.data, columns=iris.feature_names)

# Calculate mean, median, and mode for each feature

mean_values = iris_df.mean()
median_values = iris_df.median()
mode_values = iris_df.apply(lambda x: stats.mode(x)[0][0])

# Display results
print("Mean values:")
print(mean_values)

print("\nMedian values:")
print(median_values)

print("\nMode values:")
print(mode_values)

# Visualization
# Create subplots for each feature
fig, axes = plt.subplots(2, 2, figsize=(14, 10))

# Visualization - KDE plots with mean, median, and mode

plt.figure(figsize=(14, 10))

# Plot KDE for each feature

for i, feature in enumerate(iris.feature_names):
plt.subplot(2, 2, i + 1)

# Plot KDE using seaborn

sns.kdeplot(iris_df[feature], shade=True, color='gray', label='KDE')

# Plot vertical lines for mean, median, and mode

plt.axvline(mean_values[feature],color='r',linestyle='--',label=f'Mean:
{mean_values[feature]:.2f')
plt.axvline(median_values[feature],color='g',linestyle='-',label=f'Median:
{median_values[feature]:.2f}')
plt.axvline(mode_values[feature],color='b',linestyle='-.',label=f'Mode:
{mode_values[feature]:.2f}')

# Set the title and legend

plt.title(f'KDE of {feature}')
plt.legend()
# Adjust layout for better spacing
plt.tight_layout()
plt.show()

Output:
Conclusion: Thus, we studied Descriptive statistics summarize and visualize data, while
inferential statistics make predictions and generalizations about a population based on sample
data.

ADS LAB Merged
No ratings yet
ADS LAB Merged
86 pages
Data Visualization Notes Ou
No ratings yet
Data Visualization Notes Ou
125 pages
ADS EXP Assignments
No ratings yet
ADS EXP Assignments
38 pages
Lesson 2
No ratings yet
Lesson 2
39 pages
Advanced Statistics1
No ratings yet
Advanced Statistics1
19 pages
1 - Introduction - Jupyter Notebook
No ratings yet
1 - Introduction - Jupyter Notebook
5 pages
Exp2 Me
No ratings yet
Exp2 Me
3 pages
Lecture 3-Basic Statistics
No ratings yet
Lecture 3-Basic Statistics
49 pages
FDS CH 2
No ratings yet
FDS CH 2
2 pages
Inferential Statistics
No ratings yet
Inferential Statistics
29 pages
Ads Exp1
No ratings yet
Ads Exp1
6 pages
Lecture 1
No ratings yet
Lecture 1
72 pages
Statistics
No ratings yet
Statistics
18 pages
Viva Dsa
No ratings yet
Viva Dsa
11 pages
Statistics
100% (6)
Statistics
211 pages
Descriptive & Inferential Statistics
No ratings yet
Descriptive & Inferential Statistics
6 pages
Difference Between Descriptive and Inferential Statistics
No ratings yet
Difference Between Descriptive and Inferential Statistics
8 pages
ML Unit-3
No ratings yet
ML Unit-3
18 pages
Statistics
No ratings yet
Statistics
152 pages
6 DATA Analysis 2
No ratings yet
6 DATA Analysis 2
46 pages
Difference Between Descriptive and Inferential Statistics
100% (1)
Difference Between Descriptive and Inferential Statistics
9 pages
Ads Exp 1
No ratings yet
Ads Exp 1
13 pages
DS Chapter - 2
No ratings yet
DS Chapter - 2
73 pages
MNS3173 - Chapter 8 - Types of Data Analysis Methods
No ratings yet
MNS3173 - Chapter 8 - Types of Data Analysis Methods
19 pages
Unit 1 DS Vs IS
No ratings yet
Unit 1 DS Vs IS
5 pages
DSOST2
No ratings yet
DSOST2
44 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
Descriptive vs. Inrerential
No ratings yet
Descriptive vs. Inrerential
10 pages
Bachu Assignment
No ratings yet
Bachu Assignment
25 pages
DV Unit 1&2 Notes
No ratings yet
DV Unit 1&2 Notes
50 pages
Angilan, Ef
No ratings yet
Angilan, Ef
5 pages
DeMeasure of Central Tendency and Dispersion
No ratings yet
DeMeasure of Central Tendency and Dispersion
15 pages
Statistics
No ratings yet
Statistics
23 pages
Data Analysis and Statistical Treatment
No ratings yet
Data Analysis and Statistical Treatment
99 pages
Statistics
No ratings yet
Statistics
11 pages
Reviewer For Psych Stats
No ratings yet
Reviewer For Psych Stats
36 pages
Statistics
No ratings yet
Statistics
45 pages
3 4 Research 8 2
No ratings yet
3 4 Research 8 2
54 pages
View
No ratings yet
View
4 pages
Statistics SS2020
No ratings yet
Statistics SS2020
12 pages
SPROB Polished
No ratings yet
SPROB Polished
8 pages
Statistics
No ratings yet
Statistics
152 pages
Lecture 4 - Data Science Statistics
No ratings yet
Lecture 4 - Data Science Statistics
21 pages
Statistics - Compendium - DMS IIT DELHI - 2025
No ratings yet
Statistics - Compendium - DMS IIT DELHI - 2025
18 pages
Experiment-1 2
No ratings yet
Experiment-1 2
6 pages
Quants
100% (1)
Quants
18 pages
CE 459 Statistics: Assistant Prof. Muhammet Vefa AKPINAR
No ratings yet
CE 459 Statistics: Assistant Prof. Muhammet Vefa AKPINAR
211 pages
Stats 1 Module Updated
No ratings yet
Stats 1 Module Updated
53 pages
Week 01 Introduction
No ratings yet
Week 01 Introduction
33 pages
DA Practical Lab 02 Statistical Functions
No ratings yet
DA Practical Lab 02 Statistical Functions
6 pages
Descriptive Statistics: Sample
No ratings yet
Descriptive Statistics: Sample
5 pages
Unit 8. Data Analysis
No ratings yet
Unit 8. Data Analysis
69 pages
Raja Daniyal (0000242740) 8614 - Assignment 1
No ratings yet
Raja Daniyal (0000242740) 8614 - Assignment 1
30 pages
CSE 323 (1) Statistics in Education
No ratings yet
CSE 323 (1) Statistics in Education
31 pages
Statistics and Its Types (v1.0)
No ratings yet
Statistics and Its Types (v1.0)
6 pages
Session 1 On Descriptive Statistics
No ratings yet
Session 1 On Descriptive Statistics
24 pages
Student T-Distribution Table
67% (3)
Student T-Distribution Table
1 page
Lecture Notes: (Introduction To Medical Laboratory Science Research)
No ratings yet
Lecture Notes: (Introduction To Medical Laboratory Science Research)
13 pages
OMBC106 Research Methodology
No ratings yet
OMBC106 Research Methodology
13 pages
Chapter Two: Bivariate Regression Mode
100% (1)
Chapter Two: Bivariate Regression Mode
54 pages
Pearson Product-Moment Correlation Coefficient Table of Critical Values
No ratings yet
Pearson Product-Moment Correlation Coefficient Table of Critical Values
2 pages
T Test
No ratings yet
T Test
141 pages
Factor Affecting Gross Domestic Product GDP Growth
No ratings yet
Factor Affecting Gross Domestic Product GDP Growth
13 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
3 pages
Simple Exponential Smoothing
No ratings yet
Simple Exponential Smoothing
32 pages
One Way Anova
No ratings yet
One Way Anova
47 pages
Math For ML
No ratings yet
Math For ML
10 pages
Summary of Surviving Your Dissertation
No ratings yet
Summary of Surviving Your Dissertation
28 pages
FRM一级百题数量分析
No ratings yet
FRM一级百题数量分析
67 pages
Uji Linier Berganda
No ratings yet
Uji Linier Berganda
75 pages
Mid Term Paper
100% (1)
Mid Term Paper
1 page
Correlation Regression
No ratings yet
Correlation Regression
62 pages
STATS Prep Volume 1
No ratings yet
STATS Prep Volume 1
96 pages
BSA Test of Difference May 10 2023
No ratings yet
BSA Test of Difference May 10 2023
9 pages
Chi Squared
No ratings yet
Chi Squared
3 pages
Powelletal 2019 Aquaculture Research
No ratings yet
Powelletal 2019 Aquaculture Research
11 pages
001 (Ayesha Iftikhar) Autocorrelation
No ratings yet
001 (Ayesha Iftikhar) Autocorrelation
6 pages
13 +Desri+Yeni+Sinaga+ (575-585)
No ratings yet
13 +Desri+Yeni+Sinaga+ (575-585)
11 pages
Httpsemas2.Ui - Ac.idpluginfile - Php2375826mod Resourcecontent1kuliah1 2 PDF
No ratings yet
Httpsemas2.Ui - Ac.idpluginfile - Php2375826mod Resourcecontent1kuliah1 2 PDF
31 pages
Anova
No ratings yet
Anova
9 pages
MATH 1281 - Unit 2 Assignment
No ratings yet
MATH 1281 - Unit 2 Assignment
6 pages
T-Test: T-TEST GROUPS Sex (1 2) /missing Analysis /VARIABLES Level - of - Satisfaction /CRITERIA CI (.95)
No ratings yet
T-Test: T-TEST GROUPS Sex (1 2) /missing Analysis /VARIABLES Level - of - Satisfaction /CRITERIA CI (.95)
16 pages
Test Bank Questions Chapter 7
No ratings yet
Test Bank Questions Chapter 7
3 pages
Minitab - Kolesterol: Normal Probability Plot Versus Fits
No ratings yet
Minitab - Kolesterol: Normal Probability Plot Versus Fits
2 pages
Seiko
No ratings yet
Seiko
4 pages
Final Assignment - 4 - MA2201
No ratings yet
Final Assignment - 4 - MA2201
2 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet

Adsexp 1

Uploaded by

Adsexp 1

Uploaded by

Roll no:

Types of Descriptive Statistics:

1.​ Measures of Central Tendency:

3.​ Skewness and Kurtosis

Types of Inferential Statistics:

1.​ Regression analysis

Purpose Summarizes and describes the Makes predictions or generalizations about a

# Load the Iris dataset

# Calculate mean, median, and mode for each feature

# Visualization - KDE plots with mean, median, and mode

# Plot KDE for each feature

# Plot KDE using seaborn

# Plot vertical lines for mean, median, and mode

# Set the title and legend

You might also like

1. Measures of Central Tendency:

3. Skewness and Kurtosis

1. Regression analysis