0% found this document useful (0 votes)

4 views6 pages

SH Assignment

The document contains a Python script that analyzes two datasets, 'z' and 'y', using various statistical methods including histograms, cumulative frequency distributions, and box plots. It calculates key statistics such as mean, variance, skewness, and correlation coefficient, and also evaluates specific fractions of data based on defined criteria. Additionally, it estimates the area of a site cleaned up based on a critical concentration threshold.

Uploaded by

Account

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views6 pages

SH Assignment

Uploaded by

Account

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

SH Assignment

January 20, 2025

[11]: import numpy as np

import matplotlib.pyplot as plt
from scipy.stats import skew
import pandas as pd

data = {
'n': np.arange(1, 21),
'z': [1.7, 6.26, 7.56, 7.92, 0.96, 2.47, 2.55, 0.28, 1.34, 0.71, 1.66, 2.
↪99, 8.71, 0.09, 0.62, 0.99, 10.27, 2.96, 5.54, 3.61],

'y': [1.3, 17.02, 19.74, 12.01, 0.66, 1.8, 15.91, 0.62, 2.15, 2.07, 4.68, 2.
↪74, 11.72, 0.24, 2.3, 0.52, 5.67, 3.17, 5.92, 5.03]

z = np.array(data['z'])
y = np.array(data['y'])

[2]: # Question 1
plt.hist(z, bins=np.arange(0, 15, 5), edgecolor='black', alpha=0.7)
plt.title('Histogram of z')
plt.xlabel('Value Ranges')
plt.ylabel('Frequency')
plt.grid(axis='y')
plt.show()
fraction = np.sum((z >= 5) & (z < 10)) / len(z)
print(f"Fraction of data with z-values between 5 and 10: {fraction:.2f}")

1
Fraction of data with z-values between 5 and 10: 0.25

[12]: # Question 2
z_sorted = np.sort(z)
y_sorted = np.sort(y)
z_cumulative = np.cumsum(np.ones_like(z_sorted)) / len(z_sorted)
y_cumulative = np.cumsum(np.ones_like(y_sorted)) / len(y_sorted)

print("Cumulative Frequency Distribution of z:")

print(pd.DataFrame({'z': z_sorted, 'Cumulative Frequency': z_cumulative}))

print("\nCumulative Frequency Distribution of y:")

print(pd.DataFrame({'y': y_sorted, 'Cumulative Frequency': y_cumulative}))

Cumulative Frequency Distribution of z:

z Cumulative Frequency
0 0.09 0.05
1 0.28 0.10
2 0.62 0.15
3 0.71 0.20
4 0.96 0.25

2
5 0.99 0.30
6 1.34 0.35
7 1.66 0.40
8 1.70 0.45
9 2.47 0.50
10 2.55 0.55
11 2.96 0.60
12 2.99 0.65
13 3.61 0.70
14 5.54 0.75
15 6.26 0.80
16 7.56 0.85
17 7.92 0.90
18 8.71 0.95
19 10.27 1.00

Cumulative Frequency Distribution of y:

y Cumulative Frequency
0 0.24 0.05
1 0.52 0.10
2 0.62 0.15
3 0.66 0.20
4 1.30 0.25
5 1.80 0.30
6 2.07 0.35
7 2.15 0.40
8 2.30 0.45
9 2.74 0.50
10 3.17 0.55
11 4.68 0.60
12 5.03 0.65
13 5.67 0.70
14 5.92 0.75
15 11.72 0.80
16 12.01 0.85
17 15.91 0.90
18 17.02 0.95
19 19.74 1.00

[26]: # Question 3
def calculate_statistics(data):
mean = np.mean(data)
variance = np.var(data, ddof=1)
skewness = skew(data)
quantiles = np.quantile(data, [0.25, 0.5, 0.75])
iqr = quantiles[2] - quantiles[0]
return mean, variance, skewness, quantiles, quantiles[1], iqr

3
z_stats = calculate_statistics(z)
y_stats = calculate_statistics(y)

print("\nStatistics for z:")

print(f"Mean: {z_stats[0]:.2f}, Variance: {z_stats[1]:.2f}, Skewness:␣
↪{z_stats[2]:.2f}")

print(f"Quantiles: {z_stats[3]}, Median: {z_stats[4]}, Interquantile Range:␣

↪{z_stats[5]:.2f}")

print("\nStatistics for y:")

print(f"Mean: {y_stats[0]:.2f}, Variance: {y_stats[1]:.2f}, Skewness:␣
↪{y_stats[2]:.2f}")

print(f"Quantiles: {y_stats[3]}, Median: {y_stats[4]}, Interquantile Range:␣

↪{y_stats[5]:.2f}")

Statistics for z:
Mean: 3.46, Variance: 9.76, Skewness: 0.85
Quantiles: [0.9825 2.51 5.72 ], Median: 2.51, Interquantile Range: 4.74

Statistics for y:
Mean: 5.76, Variance: 36.94, Skewness: 1.14
Quantiles: [1.675 2.955 7.37 ], Median: 2.955, Interquantile Range: 5.70

[15]: # Question 4
plt.boxplot([z, y], labels=['z', 'y'], showmeans=True)
plt.title('Box-and-Whisker Plot of z and y')
plt.ylabel('Values')
plt.grid(axis='y')
plt.show()

C:\Users\satya\AppData\Local\Temp\ipykernel_9812\2455904157.py:2:
MatplotlibDeprecationWarning: The 'labels' parameter of boxplot() has been
renamed 'tick_labels' since Matplotlib 3.9; support for the old name will be
dropped in 3.11.
plt.boxplot([z, y], labels=['z', 'y'], showmeans=True)

4
[22]: # Question 5
z_mean = np.mean(z)
y_mean = np.mean(y)

covariance = np.sum((z - z_mean) * (y - y_mean)) / (len(z) - 1)

std_z = np.std(z,ddof=1)
std_y = np.std(y,ddof=1)
correlation_coefficient = covariance / (std_z * std_y)

print(f"Correlation coefficient between z and y: {correlation_coefficient:.2f}")

Correlation coefficient between z and y: 0.67

[24]: # Question 6
critical_concentration = 5
site_area = 8000
fraction_below_critical = np.sum(z < critical_concentration) / len(z)
cleanup_area = fraction_below_critical * site_area
print(f"Approximate area of the site cleaned up: {cleanup_area:.2f} m²")

Approximate area of the site cleaned up: 5600.00 m²

5
[27]: # Question 7
fraction = np.sum((z < 5) & (y < 10)) / len(z)
print(f"Fraction of data with z < 5 and y < 10: {fraction:.2f}")

Fraction of data with z < 5 and y < 10: 0.65

[10]: # Question 8
fraction_z_less_5_or_y_less_10 = np.sum((z < 5) | (y < 10)) / len(z)
print(f"Fraction of data with z < 5 or y < 10: {fraction_z_less_5_or_y_less_10:.
↪2f}")

Fraction of data with z < 5 or y < 10: 0.80

SAP BDEx Config Guide
100% (2)
SAP BDEx Config Guide
101 pages
Topic IV Hand Sketched Schematic Diagram
No ratings yet
Topic IV Hand Sketched Schematic Diagram
23 pages
Fresco
100% (2)
Fresco
17 pages
R Studio Cheat Sheet For Math1041
No ratings yet
R Studio Cheat Sheet For Math1041
3 pages
CISSP Simplilearn
80% (5)
CISSP Simplilearn
969 pages
SH Assignment 1 21ce01032
No ratings yet
SH Assignment 1 21ce01032
6 pages
Python Code - Summary Statistics
No ratings yet
Python Code - Summary Statistics
6 pages
AD3411
No ratings yet
AD3411
28 pages
ADS Practical Exam Questions
No ratings yet
ADS Practical Exam Questions
14 pages
Stats Lab (4-6)
No ratings yet
Stats Lab (4-6)
7 pages
Distributions
No ratings yet
Distributions
43 pages
4 12
No ratings yet
4 12
17 pages
Numpy and Pandas
No ratings yet
Numpy and Pandas
11 pages
FDS Lab 1 Manuel .1..1new
No ratings yet
FDS Lab 1 Manuel .1..1new
38 pages
Data For Problems 1-11-24 - Solutions
No ratings yet
Data For Problems 1-11-24 - Solutions
9 pages
FDSA Lab Manual
No ratings yet
FDSA Lab Manual
27 pages
Keeratsi HW8
No ratings yet
Keeratsi HW8
17 pages
FDS Lab 1 Manuel .1..1new
No ratings yet
FDS Lab 1 Manuel .1..1new
34 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
Stats Assignment
No ratings yet
Stats Assignment
20 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
Fds Assigns
No ratings yet
Fds Assigns
5 pages
Practical: 11: Aim: - Source Code
No ratings yet
Practical: 11: Aim: - Source Code
3 pages
Unit 5
No ratings yet
Unit 5
10 pages
His To Gramp
No ratings yet
His To Gramp
4 pages
04.05-Histograms-and-Binnings - Ipynb - Colaboratory
No ratings yet
04.05-Histograms-and-Binnings - Ipynb - Colaboratory
7 pages
Ad3411 - Data Science and Analytics Laboratory
No ratings yet
Ad3411 - Data Science and Analytics Laboratory
26 pages
Exp 2 SDK Ok
No ratings yet
Exp 2 SDK Ok
18 pages
Word File For Prob and Stats
No ratings yet
Word File For Prob and Stats
25 pages
Indexml Merged
No ratings yet
Indexml Merged
32 pages
AD3411 DATA SCIENCE AND ANALYTICS LAB (2) - Removed
No ratings yet
AD3411 DATA SCIENCE AND ANALYTICS LAB (2) - Removed
24 pages
Lab 3
No ratings yet
Lab 3
14 pages
Solutions Modernstatistics
No ratings yet
Solutions Modernstatistics
144 pages
Sampling
No ratings yet
Sampling
8 pages
Ad3411-Data Science and Analytics Laboratory
No ratings yet
Ad3411-Data Science and Analytics Laboratory
27 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
No ratings yet
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
10 pages
Dal Programs With Output
No ratings yet
Dal Programs With Output
11 pages
Aashish Yadav Stats Final Practical
No ratings yet
Aashish Yadav Stats Final Practical
41 pages
Stats With Py
No ratings yet
Stats With Py
1 page
Industrial Statistics - A Computer Based Approach With Python
No ratings yet
Industrial Statistics - A Computer Based Approach With Python
140 pages
Statistics
No ratings yet
Statistics
18 pages
Data Science and Analtics Laboratory
No ratings yet
Data Science and Analtics Laboratory
21 pages
ML Lab
No ratings yet
ML Lab
12 pages
Matplotlib Starter: Import As Import As Import As
No ratings yet
Matplotlib Starter: Import As Import As Import As
24 pages
Density - Contour Plot
No ratings yet
Density - Contour Plot
18 pages
Stat 101 Exam 1: Important Formulas and Concepts 1
No ratings yet
Stat 101 Exam 1: Important Formulas and Concepts 1
18 pages
Mine 3
No ratings yet
Mine 3
6 pages
Data Science Algorithmen Master - 02 Data Handling
No ratings yet
Data Science Algorithmen Master - 02 Data Handling
76 pages
Statistical Analysis With Scipy?
No ratings yet
Statistical Analysis With Scipy?
9 pages
Boxplot, Histogram Codes With Explanations
No ratings yet
Boxplot, Histogram Codes With Explanations
2 pages
DAV Practicals
No ratings yet
DAV Practicals
26 pages
PML Ex3
No ratings yet
PML Ex3
20 pages
HW 1
No ratings yet
HW 1
11 pages
Note 02
No ratings yet
Note 02
31 pages
End Semester Answer Key Format-Fods
No ratings yet
End Semester Answer Key Format-Fods
8 pages
Edaunit IV
No ratings yet
Edaunit IV
15 pages
Statistical Analysis: 1 Data Analysis: Mean, Variance, Boxplots
No ratings yet
Statistical Analysis: 1 Data Analysis: Mean, Variance, Boxplots
4 pages
Datascience Lab
No ratings yet
Datascience Lab
24 pages
Fha-Pyhton Program Unit 1-4
No ratings yet
Fha-Pyhton Program Unit 1-4
13 pages
Python Course Cheat Sheet
No ratings yet
Python Course Cheat Sheet
30 pages
Problems
No ratings yet
Problems
22 pages
Develop Snakes & Ladders Game Complete Guide with Code & Design
From Everand
Develop Snakes & Ladders Game Complete Guide with Code & Design
Anurag Pandey
No ratings yet
Wolaita Sodo University Electrical and Computer Engineering Smart Boom Gate
No ratings yet
Wolaita Sodo University Electrical and Computer Engineering Smart Boom Gate
49 pages
Jagan Resume PDF
No ratings yet
Jagan Resume PDF
1 page
Beginner's Guide To Make A Game Controller
No ratings yet
Beginner's Guide To Make A Game Controller
23 pages
IRS Imp
No ratings yet
IRS Imp
76 pages
DBMS Unit-1 PPT 1.2 (Advantages & Disadvantages of DBMS, Components, Overall System Tructure)
100% (1)
DBMS Unit-1 PPT 1.2 (Advantages & Disadvantages of DBMS, Components, Overall System Tructure)
5 pages
Computational Mathematics
No ratings yet
Computational Mathematics
49 pages
Simple-Ostinato: Release 0.0.1
No ratings yet
Simple-Ostinato: Release 0.0.1
41 pages
CNET101 Computer Networks
No ratings yet
CNET101 Computer Networks
3 pages
FIRST SEMESTER 2022-2023: of Programming Languages 10 Edition, Pearson, 2012.
No ratings yet
FIRST SEMESTER 2022-2023: of Programming Languages 10 Edition, Pearson, 2012.
3 pages
Arduino Based Digital Thermometer
67% (3)
Arduino Based Digital Thermometer
3 pages
Aiml Demo
No ratings yet
Aiml Demo
12 pages
Mobile Banking
No ratings yet
Mobile Banking
8 pages
TIM-94N / TIM-94N-B / TIM-94N-BN: Description
No ratings yet
TIM-94N / TIM-94N-B / TIM-94N-BN: Description
5 pages
SAi Color Tester 2019
No ratings yet
SAi Color Tester 2019
1 page
Fashion - Worldwide Statista Market Forecast
No ratings yet
Fashion - Worldwide Statista Market Forecast
1 page
HCPP-03 - Small - and Medium-Sized Campus Network Design Guide-2022.01
No ratings yet
HCPP-03 - Small - and Medium-Sized Campus Network Design Guide-2022.01
77 pages
PR Digital Readouts Linear Encoders ID208864 en
No ratings yet
PR Digital Readouts Linear Encoders ID208864 en
19 pages
Intro To Threads PDF
No ratings yet
Intro To Threads PDF
4 pages
Adspower Script
No ratings yet
Adspower Script
3 pages
Sap-C S4ewm 2023
No ratings yet
Sap-C S4ewm 2023
31 pages
Final Project Report Mobile Phone Jammer
No ratings yet
Final Project Report Mobile Phone Jammer
19 pages
Skill-Lync - Aerospace Offerings - 2024
No ratings yet
Skill-Lync - Aerospace Offerings - 2024
32 pages
Quantum Autoencoders With Enhanced Data Encoding
No ratings yet
Quantum Autoencoders With Enhanced Data Encoding
7 pages
Apihackingin 90 Minutes 1660919248744
No ratings yet
Apihackingin 90 Minutes 1660919248744
51 pages
Sans Emea Curriculum Overview Catalogue 2020
No ratings yet
Sans Emea Curriculum Overview Catalogue 2020
20 pages
Digitax SF - Brochure - EN
No ratings yet
Digitax SF - Brochure - EN
28 pages
Mohammad Kausar Uddin
No ratings yet
Mohammad Kausar Uddin
3 pages

SH Assignment

Uploaded by

SH Assignment

Uploaded by

SH Assignment

January 20, 2025

[11]: import numpy as np

print("Cumulative Frequency Distribution of z:")

print("\nCumulative Frequency Distribution of y:")

Cumulative Frequency Distribution of z:

Cumulative Frequency Distribution of y:

print("\nStatistics for z:")

print(f"Quantiles: {z_stats[3]}, Median: {z_stats[4]}, Interquantile Range:␣

print("\nStatistics for y:")

print(f"Quantiles: {y_stats[3]}, Median: {y_stats[4]}, Interquantile Range:␣

covariance = np.sum((z - z_mean) * (y - y_mean)) / (len(z) - 1)

print(f"Correlation coefficient between z and y: {correlation_coefficient:.2f}")

Correlation coefficient between z and y: 0.67

Approximate area of the site cleaned up: 5600.00 m²

Fraction of data with z < 5 and y < 10: 0.65

Fraction of data with z < 5 or y < 10: 0.80

You might also like