0% found this document useful (0 votes)

10 views56 pages

Assignment Mridul

The document contains solutions to various assignments in Probability and Statistics, including simulations and plotting of empirical densities, confidence intervals, and sums of random variables. It utilizes Python libraries such as NumPy and Matplotlib for data generation and visualization. Each section details the methodology, parameters, and results of the statistical analyses performed.

Uploaded by

Abhilasha Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views56 pages

Assignment Mridul

Uploaded by

Abhilasha Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 56

Assignment

Probability & Statistics

Mridul Sarmah

Indian Institute of Management, Udaipur

Roll No: 2421007

01-10-2024
6.(b) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt

# Step 1: Generate 1000 random samples using the inverse CDF method
n = 1000
u = np.random.uniform(0, 1, n)
x = np.sqrt(u) # Apply the inverse CDF: X = sqrt(U)

# Step 2: Plot the histogram of the generated random variable

plt.figure(figsize=(8, 6))
plt.hist(x, bins=30, density=True, alpha=0.6, color='b', label="Empirical Densit

# Step 3: Plot the theoretical PDF

# The PDF is derived by differentiating the CDF F(x) = x^2 for 0 ≤ x ≤ 1.
# PDF: f(x) = 2x for 0 ≤ x ≤ 1
x_vals = np.linspace(0, 1, 100)
pdf_vals = 2 * x_vals # Theoretical PDF

plt.plot(x_vals, pdf_vals, 'r-', lw=2, label="Theoretical PDF")

# Step 4: Add labels and legends

plt.xlabel('x')
plt.ylabel('Density')
plt.title('Empirical Density vs Theoretical PDF')
plt.legend()

# Show the plot

plt.show()
8.(b) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt

# Parameters
P = 0.48 # True population proportion
n = 1500 # Sample size
num_samples = 100 # Number of samples
confidence_level = 0.98 # 98% confidence level
Z = 2.326 # Z-value for 98% confidence level

# Function to calculate a confidence interval for each sample

def confidence_interval(sample_proportion, Z, n):
se = np.sqrt(sample_proportion * (1 - sample_proportion) / n)
return sample_proportion - Z * se, sample_proportion + Z * se

# Simulating 100 samples and calculating confidence intervals

intervals = []
proportions = []
contains_P = []
for _ in range(num_samples):
# Generate a random sample of size 1500 from a binomial distribution
sample = np.random.binomial(n, P) / n
proportions.append(sample)
# Calculate the confidence interval for this sample
ci_lower, ci_upper = confidence_interval(sample, Z, n)
intervals.append((ci_lower, ci_upper))
# Check if the true population proportion P is within the interval
contains_P.append(ci_lower <= P <= ci_upper)

# Plotting the confidence intervals and sample proportions with flipped axes
plt.figure(figsize=(10, 8))
for i, (ci_lower, ci_upper) in enumerate(intervals):
if contains_P[i]:
plt.plot([ci_lower, ci_upper], [i, i], color='blue') # Interval contain
else:
plt.plot([ci_lower, ci_upper], [i, i], color='red') # Interval doesn't

# Plot the sample proportion estimates as markers (on the y-axis)

plt.scatter(proportions, range(num_samples), color='black', zorder=5, label='Sam

# Plot a vertical line for the true population proportion P

plt.axvline(x=P, color='green', linestyle='--', label='True P (0.48)')

# Add labels and legend

plt.xlabel('Proportion')
plt.ylabel('Sample Index')
plt.title(f'98% Confidence Intervals and Sample Proportions for {num_samples} Sa
plt.legend()

# Show the plot

plt.show()
9.(a) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Parameters
n_samples = 25 # Number of independent samples in each trial
num_trials = 1000 # Number of trials
a, b = 0, 1 # Uniform distribution range [0, 1]

# Step 1: Simulate 1000 sums of 25 independent uniform random variables

sums = [np.sum(np.random.uniform(a, b, n_samples)) for _ in range(num_trials)]

# Step 2: Compute mean and standard deviation of the sums

mean_s25 = np.mean(sums)
std_s25 = np.std(sums)

# Step 3: Plot the histogram of the sums

plt.figure(figsize=(10, 6))
plt.hist(sums, bins=30, density=True, alpha=0.6, color='b', label="Histogram of

# Step 4: Plot the normal distribution with same mean and std
x = np.linspace(min(sums), max(sums), 100)
plt.plot(x, norm.pdf(x, mean_s25, std_s25), 'r', lw=2, label=f'Normal Density\nM

# Step 5: Add labels and legend

plt.title('Sums of 25 Uniform Random Variables repeated 1000 times\nAnd\n Corres
plt.xlabel('Sum of 25 Uniform Variables')
plt.ylabel('Density')
plt.legend()

# Show the plot

plt.show()
9.(b) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Parameters
n_samples = 25 # Number of independent samples in each trial
num_trials = 1000 # Number of trials

# Step 1: Inverse transform sampling for f(x) = 2x

def sample_from_fx_2x(size):
u = np.random.uniform(0, 1, size) # Generate uniform random numbers
return np.sqrt(u) # Apply the inverse CDF transformation

# Step 2: Simulate 1000 sums of 25 independent random variables

sums = [np.sum(sample_from_fx_2x(n_samples)) for _ in range(num_trials)]

# Step 3: Compute mean and standard deviation of the sums

mean_s25 = np.mean(sums)
std_s25 = np.std(sums)

# Step 4: Plot the histogram of the sums

plt.figure(figsize=(10, 6))
plt.hist(sums, bins=30, density=True, alpha=0.6, color='b', label="Histogram of

# Step 5: Plot the normal distribution with the same mean and standard deviation
x = np.linspace(min(sums), max(sums), 100)
plt.plot(x, norm.pdf(x, mean_s25, std_s25), 'r', lw=2, label=f'Normal Density\nM

# Step 6: Add labels and legend

plt.title('Sums of 25 Random Variables (f(x)=2x) repeated 1000 times\n And\n Cor
plt.xlabel('Sum of 25 Variables')
plt.ylabel('Density')
plt.legend()

# Show the plot

plt.show()
9.(c) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Parameters
n_samples = 25 # Number of independent samples in each trial
num_trials = 1000 # Number of trials

# Step 1: Inverse transform sampling for f(x) = 3x^2

def sample_from_fx_3x2(size):
u = np.random.uniform(0, 1, size) # Generate uniform random numbers
return u**(1/3) # Apply the inverse CDF transformation

# Step 2: Simulate 1000 sums of 25 independent random variables

sums = [np.sum(sample_from_fx_3x2(n_samples)) for _ in range(num_trials)]

# Step 3: Compute mean and standard deviation of the sums

mean_s25 = np.mean(sums)
std_s25 = np.std(sums)

# Step 4: Plot the histogram of the sums

plt.figure(figsize=(10, 6))
plt.hist(sums, bins=30, density=True, alpha=0.6, color='b', label="Histogram of

# Step 6: Add labels and legend

plt.title('Sums of 25 Random Variables (f(x)=3x^2) repeated 1000 times\nAnd\n Co
plt.xlabel('Sum of 25 Variables')
plt.ylabel('Density')
plt.legend()

# Show the plot

plt.show()
9.(d) Solution
In [ ]: import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Parameters
n_samples = 25 # Number of independent samples in each trial
num_trials = 1000 # Number of trials

# Step 1: Inverse transform sampling for f(x) = 4|x - 1/2|

def sample_from_fx_4_abs_x_minus_1_2(size):
u = np.random.uniform(0, 1, size) # Generate uniform random numbers
# Apply inverse CDF transformation
samples = np.where(u <= 0.5, 1-np.sqrt(1-u), (1 - np.sqrt(2*u)) / 2)
return samples

# Step 2: Simulate 1000 sums of 25 independent random variables

sums = [np.sum(sample_from_fx_4_abs_x_minus_1_2(n_samples)) for _ in range(num_t

# Step 3: Compute mean and standard deviation of the sums

mean_s25 = np.mean(sums)
std_s25 = np.std(sums)

# Step 4: Plot the histogram of the sums

plt.figure(figsize=(10, 6))
plt.hist(sums, bins=30, density=True, alpha=0.6, color='b', label="Histogram of

# Step 6: Add labels and legend

plt.title('Sums of 25 Random Variables (f(x) = 4|x - 1/2|) repeated 1000 times\n
plt.xlabel('Sum of 25 Variables')
plt.ylabel('Density')
plt.legend()

# Show the plot

plt.show()
6.
The e
ypoesia

Po.8

Ptm (outicod vale)

As the Lineny
Populatiou hottoy

inomia Dici kniou,

he beeple to bollotd
fe
ino i ol henee the ean

and Vauane Wold de

Since Aize
C>30)

(-e./

np(0-P)
M- 100Xo.z 2o.01
NJooxo.1X02

P2< 4
tu 2-Atehe
tale

m- 80 -2.93
4
80-9.32
W
= 70. 63

71-100X0.7

V]o0 x 0.9 2x0.22)

= P(22 -l-69)

=1- (I- (22149))

: P(241-69 )
2- uble)
= 0.9 545
7)- J0oX 0.7C
r (?2 NJoox0.7s025

(?2 -0.92)

0.81

71- )00X 0.72

P(? 160x0.9Lx 0.27

= P(2 -0.1)
P(2< O. 22)

D.527

(Y) P 0.70

Nj00x 0.90o.3
= P(22 0.22)

) 1 -.ET1
0 129
(v) = 0.48
7- J00X 0.61
P(2 2 )

|-r(2 20.64)

P( 22 ?|-)0 x0.45
Nlo0 xo-6sX0.3S

|- 0.39%2

b.103

(i) P= 0.63
-100x o. 63
Noxo.63 20.27

)-1(22|.·36)
| 0.95)S
,04 85
- P(?2 2-24)
- )-P(2can2u)
o.9 B75
o.o125

ot the
Aee thot the pokabi ity
he ml hypo fheas
reases
Cie. Pe tr eoh) dese de
We decheuse
igibi canty dhen
ole e u
to 0.6. So, TyPe I! eoh

we move

menKes

Cen Ahy that fhe test in vnoat ponbul.

Hence, ve

Solutions Manual For Statistical Computing With R - Rizzo
100% (1)
Solutions Manual For Statistical Computing With R - Rizzo
136 pages
BN2102 1-6 Notes
No ratings yet
BN2102 1-6 Notes
38 pages
Exam P Formula Sheet PDF
No ratings yet
Exam P Formula Sheet PDF
9 pages
Distributions Demo
No ratings yet
Distributions Demo
28 pages
Indexml Merged
No ratings yet
Indexml Merged
32 pages
Exp 4 Statistical Data Analysis With Python SDK Ok
No ratings yet
Exp 4 Statistical Data Analysis With Python SDK Ok
18 pages
Distributions
No ratings yet
Distributions
43 pages
Coding Probability and Statistics With Python From Scratch
No ratings yet
Coding Probability and Statistics With Python From Scratch
37 pages
Lec3 Inverse Transformation Rejection
No ratings yet
Lec3 Inverse Transformation Rejection
46 pages
Simulating Continuous and Non-Continuous Distributions
No ratings yet
Simulating Continuous and Non-Continuous Distributions
17 pages
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
No ratings yet
Workshop 5: PDF Sampling and Statistics: Preview: Generating Random Numbers
10 pages
Exp-4 Abhayraj Singh
No ratings yet
Exp-4 Abhayraj Singh
11 pages
Sem 5
No ratings yet
Sem 5
25 pages
Chapter 0 Introduction
No ratings yet
Chapter 0 Introduction
14 pages
Econometrics I - Problem Set 1: Econometricswithr Download R
No ratings yet
Econometrics I - Problem Set 1: Econometricswithr Download R
3 pages
Assignment 2
No ratings yet
Assignment 2
9 pages
Statistical Analysis in Physics Practical File
No ratings yet
Statistical Analysis in Physics Practical File
28 pages
PSet6 Solutions
No ratings yet
PSet6 Solutions
6 pages
Projectpdf
No ratings yet
Projectpdf
12 pages
Review Statistics
No ratings yet
Review Statistics
24 pages
AI Obse-2
No ratings yet
AI Obse-2
32 pages
Normal Distribution Questions 250121 115523
No ratings yet
Normal Distribution Questions 250121 115523
1 page
ML Lab
No ratings yet
ML Lab
12 pages
Sujal 4
No ratings yet
Sujal 4
31 pages
4 12
No ratings yet
4 12
17 pages
06 Assignment1
No ratings yet
06 Assignment1
3 pages
SlidesCourse 14 Oct
No ratings yet
SlidesCourse 14 Oct
10 pages
Pracs
No ratings yet
Pracs
12 pages
R Session - Note3
No ratings yet
R Session - Note3
4 pages
Sa 123
No ratings yet
Sa 123
13 pages
Cosc 416
No ratings yet
Cosc 416
6 pages
Econ306 Excel 4
No ratings yet
Econ306 Excel 4
23 pages
Introduction To Random Processes Project 2: Problem 1
No ratings yet
Introduction To Random Processes Project 2: Problem 1
15 pages
ECE286 Final Exam Aid Sheet
No ratings yet
ECE286 Final Exam Aid Sheet
4 pages
Solutions For Exercises: Reliability of Structures (2nd Edition) by Nowak & Collins
100% (1)
Solutions For Exercises: Reliability of Structures (2nd Edition) by Nowak & Collins
8 pages
STAT 512 Mathematical Statistics: Lecture Notes
No ratings yet
STAT 512 Mathematical Statistics: Lecture Notes
120 pages
Week1 PDF
No ratings yet
Week1 PDF
22 pages
Matlab Code For Random Variable
No ratings yet
Matlab Code For Random Variable
8 pages
Confidence Intervals
No ratings yet
Confidence Intervals
4 pages
A Probability and Statistics Cheatsheet
No ratings yet
A Probability and Statistics Cheatsheet
9 pages
Practical 2
No ratings yet
Practical 2
7 pages
Distribution Quick Reference
No ratings yet
Distribution Quick Reference
25 pages
Math10282 Ex05 - An R Session
No ratings yet
Math10282 Ex05 - An R Session
6 pages
1 Probability: 1.1 Normal Distribution
No ratings yet
1 Probability: 1.1 Normal Distribution
7 pages
Exam P Formula Sheet PDF
No ratings yet
Exam P Formula Sheet PDF
9 pages
Estimating PDF'S, Means, Variances: 1 Exp 1: Estimated PDF/CDF Plots From Data
No ratings yet
Estimating PDF'S, Means, Variances: 1 Exp 1: Estimated PDF/CDF Plots From Data
12 pages
Probability Distributions in R
No ratings yet
Probability Distributions in R
42 pages
Stats
No ratings yet
Stats
33 pages
Exercise 3 Computer Intensive Statistics
No ratings yet
Exercise 3 Computer Intensive Statistics
10 pages
Presentation 3
No ratings yet
Presentation 3
29 pages
Statistical Computing
No ratings yet
Statistical Computing
6 pages
Sample Project Report Structure
No ratings yet
Sample Project Report Structure
5 pages
SlidesCourse 21 Oct
No ratings yet
SlidesCourse 21 Oct
10 pages
BootcampStat Code Sol
No ratings yet
BootcampStat Code Sol
12 pages
Stat 130n Answers To The LAs in Lessons 3.1-3.3
No ratings yet
Stat 130n Answers To The LAs in Lessons 3.1-3.3
18 pages
2.2 Exercises With Matlab: 2.2.1 Standard Normal Distribution
No ratings yet
2.2 Exercises With Matlab: 2.2.1 Standard Normal Distribution
9 pages
Basic Exercises for Competitive Programming: Python
From Everand
Basic Exercises for Competitive Programming: Python
Jan Pol
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Signal Detection Theory
No ratings yet
Signal Detection Theory
13 pages
Performance Prediction Through OEE-Model
No ratings yet
Performance Prediction Through OEE-Model
11 pages
Correlation 1
No ratings yet
Correlation 1
9 pages
LessonPlan5 ParameterAndStatistics TecsonRF
100% (1)
LessonPlan5 ParameterAndStatistics TecsonRF
3 pages
Hassan I 2010
No ratings yet
Hassan I 2010
12 pages
Introduction To ROC Analysis
No ratings yet
Introduction To ROC Analysis
15 pages
CH 3 3502
No ratings yet
CH 3 3502
9 pages
AP Online Quiz KEY Chapter 8: Estimating With Confidence
No ratings yet
AP Online Quiz KEY Chapter 8: Estimating With Confidence
6 pages
Sample Size Calculations For Evaluating Mediation
No ratings yet
Sample Size Calculations For Evaluating Mediation
17 pages
PTU MBA Syllabus
No ratings yet
PTU MBA Syllabus
85 pages
Descriptive, Diagnostic, Predictive and Prescriptive Analytics
No ratings yet
Descriptive, Diagnostic, Predictive and Prescriptive Analytics
26 pages
SAS Chapter 10
No ratings yet
SAS Chapter 10
5 pages
Comment For ISO5725
No ratings yet
Comment For ISO5725
9 pages
Syllabus
No ratings yet
Syllabus
30 pages
Data Analytics
No ratings yet
Data Analytics
30 pages
Social Media
No ratings yet
Social Media
10 pages
Theory and Research Methods in Sociology
No ratings yet
Theory and Research Methods in Sociology
99 pages
Thematic Article
No ratings yet
Thematic Article
24 pages
MSCBA - RTE01.H23 - Elizaveta & Alba
No ratings yet
MSCBA - RTE01.H23 - Elizaveta & Alba
11 pages
Data Spss World Viks
No ratings yet
Data Spss World Viks
7 pages
Model Prediction of Defects in Sheet Metal Forming Processes
No ratings yet
Model Prediction of Defects in Sheet Metal Forming Processes
12 pages
Mathematics IV
No ratings yet
Mathematics IV
2 pages
Chapter 3 - Demand Forecasting: # Het Begint Met Een Idee
No ratings yet
Chapter 3 - Demand Forecasting: # Het Begint Met Een Idee
32 pages
Revision Module 1,2,3
No ratings yet
Revision Module 1,2,3
129 pages
Book Assignment in PR2
No ratings yet
Book Assignment in PR2
4 pages
Module in Practical Research 2: Your Lesson For Today!
No ratings yet
Module in Practical Research 2: Your Lesson For Today!
20 pages
UNIT-1 Regression vs. Classification
No ratings yet
UNIT-1 Regression vs. Classification
25 pages
Chi Square Test
100% (1)
Chi Square Test
52 pages
What Is Simulation?: Simulation With Arena, 4 Ed
No ratings yet
What Is Simulation?: Simulation With Arena, 4 Ed
23 pages