Inference For Numerical Data

This document contains four problems on statistical inference for numerical data. The first problem provides summary statistics and a histogram for the heights of 507 physically active individuals and asks about point estimates, the standard deviation, and whether certain heights are unusually tall or short. The second problem examines gestation length data and asks for a standard error and a confidence interval. The third problem uses a randomization test to examine the difference in price per carat between 0.99 carat and 1 carat diamonds. The fourth problem constructs a bootstrap confidence interval for that difference in price per carat.

Uploaded by

neha

Problem Set 8/9

Inference for Numerical Data

1. Heights of adults. Researchers studying anthropometry collected body measurements, as well as age,
weight, height and gender, for 507 physically active individuals. Summary statistics for the distribution
of heights (measured in centimeters), along with a histogram, are provided below.

Min     Q1      Median  Mean    Q3      Max     SD    IQR
147.2   163.8   170.3   171.1   177.8   198.1   9.4   14

[Histogram of heights. x-axis: Height (centimeters); y-axis: Count.]
a. What is the point estimate for the average height of active individuals? What about the median?
b. What is the point estimate for the standard deviation of the heights of active individuals? What
about the IQR?
c. Is a person who is 1m 80cm (180 cm) tall considered unusually tall? And is a person who is 1m
55cm (155 cm) tall considered unusually short? Explain your reasoning.
d. The researchers take another random sample of physically active individuals. Would you expect
the mean and the standard deviation of this new sample to be the ones given above? Explain your
reasoning.
e. The sample means obtained are point estimates for the mean height of all active individuals, if
the sample of individuals is equivalent to a simple random sample. What measure do we use
to quantify the variability of such an estimate? Compute this quantity using the data from the
original sample under the condition that the data are a simple random sample.
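
As a rough illustration of the quantities in parts (c) and (e), the sketch below computes the standard error of the mean as SD / sqrt(n) and expresses a height as a number of standard deviations from the mean. It uses only the summary statistics given above; the function names are hypothetical, and treating "more than about 2 standard deviations from the mean" as unusual is a common rule of thumb, not something the problem prescribes.

```python
import math

# Summary statistics taken from the table in the problem statement.
n = 507        # sample size
mean = 171.1   # sample mean height (cm)
sd = 9.4       # sample standard deviation (cm)


def standard_error(sd: float, n: int) -> float:
    """Standard error of a sample mean: SD divided by sqrt(n)."""
    return sd / math.sqrt(n)


def z_score(x: float, mean: float, sd: float) -> float:
    """Distance of x from the mean, in units of standard deviations."""
    return (x - mean) / sd


se = standard_error(sd, n)
print(f"SE of the mean: {se:.3f} cm")
for height in (180, 155):
    print(f"z-score for {height} cm: {z_score(height, mean, sd):+.2f}")
```

The standard error is much smaller than the standard deviation because it describes the variability of the sample mean across samples, not the variability of individual heights.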
2. Length of gestation, confidence interval. Every year, the United States Department of Health
and Human Services releases to the public a large dataset containing information on births recorded in
the country. This dataset has been of interest to medical researchers who are studying the relation
between habits and practices of expectant mothers and the birth of their children. In this exercise we
work with a random sample of 1,000 cases from the dataset released in 2014. The length of pregnancy,
measured in weeks, is commonly referred to as gestation. The histograms below show the distribution
of lengths of gestation from the random sample of 1,000 births (on the left) and the distribution of
bootstrapped means of gestation from 1,500 different bootstrap samples (on the right).

[Figure: two histograms. Left: Random sample of 1,000 births, Count vs. Gestation (weeks). Right: 1,500 bootstrap means, Count vs. Bootstrapped mean of gestation (weeks).]

a. Given the bootstrap sampling distribution for the sample mean, find an approximate value for the
standard error of the mean.
b. By looking at the bootstrap sampling distribution (1,500 bootstrap samples were taken), find an
approximate 99% bootstrap percentile confidence interval for the true average gestation length
in the population from which the data were randomly sampled. Provide the interval as well as a
one-sentence interpretation of the interval.
3. Diamonds, randomization test. The prices of diamonds go up as the carat weight increases, but
the increase is not smooth. For example, the difference between the size of a 0.99 carat diamond and a
1 carat diamond is undetectable to the naked human eye, but the price of a 1 carat diamond tends
to be much higher than the price of a 0.99 carat diamond. In this question we use two random samples of
diamonds, 0.99 carats and 1 carat, each sample of size 23, and randomize the carat weight to the price
values in order to compare the average prices of the diamonds to a null distribution. In order to be able to
compare equivalent units, we first divide the price for each diamond by 100 times its weight in carats.
That is, for a 0.99 carat diamond, we divide the price by 99. For a 1 carat diamond, we divide the price
by 100. The randomization distribution (with 1,000 repetitions) below describes the null distribution of
the difference in sample means (of price per carat) if there really was no difference in the population
from which these diamonds came.
[Histogram: 1,000 randomized differences in means. x-axis: Difference in randomized means of price per carat (0.99 carats − 1 carat); y-axis: Count.]
Using the randomization distribution of the difference in average price per carat (1,000 randomizations
were run), conduct a hypothesis test to evaluate if there is a difference between the prices per carat of
diamonds that weigh 0.99 carats and diamonds that weigh 1 carat. Make sure to state your hypotheses
clearly and interpret your results in the context of the data. [@ggplot2]
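
A randomization test of this kind can be sketched as below. The diamond prices themselves are not given in this document, so the two samples are simulated; the numbers used to generate them and the function name are hypothetical. The sketch assumes the usual two-sided setup, with a null hypothesis of no difference in mean price per carat and an alternative of some difference, and estimates the p-value as the share of shuffled differences at least as extreme as the observed one.

```python
import random
import statistics


def randomization_test(group_a, group_b, n_perm, rng):
    """Shuffle group labels n_perm times; return (observed, null_diffs, p)."""
    observed = statistics.fmean(group_a) - statistics.fmean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    null_diffs = []
    for _ in range(n_perm):
        rng.shuffle(pooled)  # reassign the carat labels at random
        diff = statistics.fmean(pooled[:n_a]) - statistics.fmean(pooled[n_a:])
        null_diffs.append(diff)
    # Two-sided p-value: shuffled differences at least as far from 0.
    extreme = sum(abs(d) >= abs(observed) for d in null_diffs)
    return observed, null_diffs, extreme / n_perm


rng = random.Random(2)
# Hypothetical price-per-carat values, 23 diamonds in each sample.
ppc_099 = [rng.gauss(45.0, 12.0) for _ in range(23)]
ppc_100 = [rng.gauss(55.0, 12.0) for _ in range(23)]

observed, null_diffs, p_value = randomization_test(ppc_099, ppc_100, 1000, rng)
print(f"observed difference (0.99 carats - 1 carat): {observed:.2f}")
print(f"two-sided p-value: {p_value:.3f}")
```

With the histogram alone, the equivalent step is to locate the observed difference on the x-axis and judge what fraction of the 1,000 randomized differences lie at least that far from zero.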
4. Diamonds, bootstrap interval. We have data on two random samples of diamonds: one with
diamonds that weigh 0.99 carats and one with diamonds that weigh 1 carat. Each sample has 23
diamonds. Provided below is a histogram of bootstrap differences in means of price per carat of
diamonds that weigh 0.99 carats and diamonds that weigh 1 carat.
[Histogram: 1,000 bootstrapped differences in means. x-axis: Difference in bootstrapped means of price per carat (0.99 carats − 1 carat); y-axis: Count.]
Using the bootstrap distribution, create a (rough) 95% bootstrap percentile confidence interval for the true
population difference in prices per carat of diamonds that weigh 0.99 carats and 1 carat. Interpret the interval
in the context of this problem.
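
One way such a bootstrap distribution could be produced is sketched below: each sample is resampled with replacement on its own, and the difference in resampled means is recorded. The data are simulated because the real prices are not included here, so the generating values and function names are assumptions, and the resulting interval says nothing about the actual diamonds.

```python
import random
import statistics


def bootstrap_diff_means(group_a, group_b, n_boot, rng):
    """Resample each group separately; return the differences in means."""
    diffs = []
    for _ in range(n_boot):
        resample_a = rng.choices(group_a, k=len(group_a))
        resample_b = rng.choices(group_b, k=len(group_b))
        diffs.append(statistics.fmean(resample_a) - statistics.fmean(resample_b))
    return diffs


def percentile_interval(values, level):
    """Approximate percentile interval: trim (1 - level) / 2 from each tail."""
    ordered = sorted(values)
    k = int((1 - level) / 2 * len(ordered))  # number of values cut per tail
    return ordered[k], ordered[len(ordered) - 1 - k]


rng = random.Random(3)
# Hypothetical price-per-carat values, 23 diamonds in each sample.
ppc_099 = [rng.gauss(45.0, 12.0) for _ in range(23)]
ppc_100 = [rng.gauss(55.0, 12.0) for _ in range(23)]

diffs = bootstrap_diff_means(ppc_099, ppc_100, 1000, rng)
low, high = percentile_interval(diffs, 0.95)
print(f"95% percentile interval for the difference: ({low:.2f}, {high:.2f})")
```

If the interval lies entirely below zero, the data are consistent with 0.99 carat diamonds costing less per carat on average than 1 carat diamonds; if it contains zero, no difference is also plausible.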
