Mini Project - Golf: by Vishnu Vinod V.K

The document describes a study conducted by Par Inc., a golf equipment manufacturer, to test a new golf ball coating designed to be more durable and cut-resistant. 40 golf balls of the new model and 40 of the current model were subjected to driving distance tests. The results found no statistically significant difference in the mean driving distances between the two models based on a hypothesis test with a p-value greater than 0.05. While the coating shows promise, larger sample sizes across multiple golf courses are recommended before concluding the coating has no effect on performance.

MINI PROJECT – GOLF

BY VISHNU VINOD V.K

ASSIGNMENT

Par Inc. is a major manufacturer of golf equipment. Management
believes that Par's market share could be increased with the
introduction of a cut-resistant, longer-lasting golf ball. Therefore, the
research group at Par has been investigating a new golf ball coating
designed to resist cuts and provide a more durable ball. The tests with
the coating have been promising. One of the researchers voiced
concern about the effect of the new coating on driving distances. Par
would like the new cut-resistant ball to offer driving distances
comparable to those of the current-model golf ball. To compare the
driving distances for the two balls, 40 balls of both the new and current
models were subjected to distance tests. The testing was performed
with a mechanical hitting machine so that any difference between the
mean distances for the two models could be attributed to a difference
in the design.

The results of the tests, with distances measured to the nearest yard,
are contained in the data set "Golf". Prepare a Managerial Report:

1. Formulate and present the rationale for a hypothesis test that Par
could use to compare the driving distances of the current and new golf
balls.
2. Analyze the data to provide the hypothesis testing conclusion. What
is the p-value for your test? What is your recommendation for Par
Inc.?
3. Provide descriptive statistical summaries of the data for each model.
4. What is the 95% confidence interval for the population mean of
each model, and what is the 95% confidence interval for the difference
between the means of the two populations?
5. Do you see a need for larger sample sizes and more testing with the
golf balls? Discuss.

SOLUTION

SET WORKING DIRECTORY AND LOAD DATA SET

• Setting working directory

setwd("D:/GL/miniproject")

• Loading dataset

golf <- read.csv("Golf.csv")

• Sample size: 40
• Number of samples: 2

EXPLORATORY DATA ANALYSIS

• Checking structure of dataset

str(golf)

'data.frame': 40 obs. of 2 variables:

$ New : int 277 269 263 266 262 251 262 289 286 264 ...
$ Current: int 264 261 267 272 258 283 258 266 259 270 ...

• Checking total number of rows and columns

dim(golf)

[1] 40 2

• Checking the names of the columns

names(golf)

[1] "New" "Current"

• Five-point summary and standard deviation for both samples

summary(golf)

      New            Current
 Min.   :250.0   Min.   :255.0
 1st Qu.:262.0   1st Qu.:263.0
 Median :265.0   Median :270.0
 Mean   :267.5   Mean   :270.3
 3rd Qu.:274.5   3rd Qu.:275.2
 Max.   :289.0   Max.   :289.0

The summary shows that the mean and median are close in each sample
(within about 2.5 yards), which is consistent with roughly symmetric,
approximately normal data.

• Checking standard deviation for current

sd(golf$Current)

[1] 8.752985

• Checking standard deviation for new

sd(golf$New)

[1] 9.896904

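The five-point summaries and standard deviations of the two columns are
broadly similar, which suggests no large change in driving distance
between balls with and without the coating. Whether the difference is
statistically significant is tested in the hypothesis test section.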

• Variance for current

var(golf$Current)

[1] 76.61474

• Variance for new

var(golf$New)

[1] 97.94872
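
The variances and standard deviations printed above can be cross-checked against each other. The following is a minimal sketch that works only from those printed values rather than from Golf.csv; the variable names are illustrative, not part of the original analysis.

```r
# Printed sample variances and standard deviations from the output above
var_current <- 76.61474
var_new     <- 97.94872
sd_current  <- 8.752985
sd_new      <- 9.896904

# The standard deviation is the square root of the variance
sd_check_current <- sqrt(var_current)
sd_check_new     <- sqrt(var_new)

# Ratio of the two variances and the percentage by which New exceeds Current
var_ratio   <- var_new / var_current
pct_higher  <- (var_ratio - 1) * 100

cat("Variance ratio (New / Current):", round(var_ratio, 3), "\n")
cat("New variance is about", round(pct_higher), "% higher than Current\n")
```

The New sample is more spread out than the Current sample, which is one reason an unequal-variance (Welch) test is a reasonable choice for comparing the means.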

HISTOGRAM AND BOXPLOT

From the histograms we can see that both variables are approximately
normally distributed.

Boxplot shows there are no outliers.
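
The "no outliers" reading can be cross-checked with the usual 1.5 × IQR rule, using the quartiles from the five-point summary. This is a rough sketch under two assumptions: the quartiles are the rounded values printed by summary(), and R's boxplot() uses hinges that can differ slightly from these quartiles. The data frame and column names are illustrative.

```r
# Five-point summary values as printed earlier (rounded by summary())
q <- data.frame(
  model = c("New", "Current"),
  min   = c(250, 255),
  q1    = c(262.0, 263.0),
  q3    = c(274.5, 275.2),
  max   = c(289, 289)
)

# Interquartile range and the 1.5 * IQR fences
q$iqr         <- q$q3 - q$q1
q$lower_fence <- q$q1 - 1.5 * q$iqr
q$upper_fence <- q$q3 + 1.5 * q$iqr

# A sample has outliers if its min or max falls outside the fences
q$has_outliers <- q$min < q$lower_fence | q$max > q$upper_fence

print(q[, c("model", "lower_fence", "upper_fence", "has_outliers")])
```

Both samples' minimum and maximum values lie inside the fences, which agrees with the boxplot.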

OBSERVATIONS

• Sample size: 40 per sample
• Number of samples: 2
• The samples are unpaired (independent).
• DOF for a pooled two-sample t-test = 40 + 40 - 2 = 78
• There are no outliers and no missing values in the given data.
• Both samples seem to be approximately normally distributed.
• Mean and median values are not much different within each sample.
• The Current driving distance data looks more normally distributed,
whereas the driving distance data for New balls looks slightly
right-skewed (mean 267.5 above median 265.0).
• The New balls show slightly lower driving distances than the Current
balls: the mean, median, and minimum are all lower, while the maximum
is the same (289 yards).

HYPOTHESIS FORMULATION AND TESTING

• The level of significance (alpha) = 0.05
• The sample size n = 40 per group is sufficiently large for a z-test.
• But since the population standard deviation (sigma) is unknown, we
have to use a t-test.
• Since the two samples are independent, a pooled-variance test has
n + n - 2 = 78 degrees of freedom. R's default Welch test, used below,
adjusts this to about 76.85 because the sample variances differ.
• Since the sole purpose of the test is to check whether the new coating
has any effect on driving distances, in either direction, we use a
two-tailed t-test.
Null Hypothesis:

H0: µold - µnew = 0 (the new coating has no effect on driving
distances)

Alternate Hypothesis:

H1: µold - µnew ≠ 0 (the new coating has a significant effect on
driving distances)

d̅ = Mean difference
µd = Hypothesized difference (usually 0)
sd = Standard deviation of the difference

Welch Two Sample t-test

data: golf$Current and golf$New

t = 1.3284, df = 76.852, p-value = 0.188
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
-1.384937 6.934937
sample estimates:
mean of x mean of y
270.275 267.500
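
The Welch output above can be reproduced from the summary statistics alone, which makes the calculation behind t, df, the p-value, and the confidence interval explicit. This is a sketch that assumes only the means and variances printed in this report; it does not read Golf.csv, and the variable names are illustrative.

```r
# Summary statistics printed earlier in the report
n1 <- 40; n2 <- 40                 # balls per model
m1 <- 270.275; m2 <- 267.5         # sample means: Current, New
v1 <- 76.61474; v2 <- 97.94872     # sample variances: Current, New

# Standard error of the difference in means (unequal variances)
se <- sqrt(v1 / n1 + v2 / n2)

# Welch t statistic
t_stat <- (m1 - m2) / se

# Welch-Satterthwaite degrees of freedom
df <- (v1 / n1 + v2 / n2)^2 /
  ((v1 / n1)^2 / (n1 - 1) + (v2 / n2)^2 / (n2 - 1))

# Two-tailed p-value: both tails are already included, so no halving
p_value <- 2 * pt(-abs(t_stat), df)

# 95% confidence interval for the difference in means
ci <- (m1 - m2) + c(-1, 1) * qt(0.975, df) * se

cat("t =", round(t_stat, 4), " df =", round(df, 3),
    " p-value =", round(p_value, 4), "\n")
cat("95% CI:", round(ci, 4), "\n")
```

The p-value computed this way is already the two-tailed value, and it is the figure to compare against α = 0.05.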

The p-value reported by t.test is already the two-tailed value, so it is
used as is: p-value = 0.188.

The calculated p-value is greater than the level of significance α (0.05).
Therefore, the Null Hypothesis (H0) will not be rejected.

TWO TAILED ONE SAMPLE T TEST FOR CURRENT MEAN

One Sample t-test

data: golf$Current
t = 195.29, df = 39, p-value < 2.2e-16
alternative hypothesis: true mean is not equal to 0
95 percent confidence interval:
267.4757 273.0743
sample estimates:
mean of x
270.275

TWO TAILED ONE SAMPLE T TEST FOR NEW MEAN

One Sample t-test

data: golf$New
t = 170.94, df = 39, p-value < 2.2e-16
alternative hypothesis: true mean is not equal to 0
95 percent confidence interval:
264.3348 270.6652
sample estimates:
mean of x
267.5
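
The two one-sample outputs above are used here for their 95% confidence intervals. A minimal sketch of that interval, computed from the printed means and standard deviations, is shown below; the helper name ci_mean is hypothetical and not part of the original analysis.

```r
# t-based confidence interval for a mean from summary statistics.
# m = sample mean, s = sample standard deviation, n = sample size
ci_mean <- function(m, s, n, level = 0.95) {
  se <- s / sqrt(n)                          # standard error of the mean
  t_crit <- qt(1 - (1 - level) / 2, df = n - 1)
  m + c(-1, 1) * t_crit * se
}

ci_current <- ci_mean(270.275, 8.752985, 40)
ci_new     <- ci_mean(267.5,   9.896904, 40)

cat("Current:", round(ci_current, 4), "\n")
cat("New:    ", round(ci_new, 4), "\n")
```

The two intervals overlap substantially, which is in line with the two-sample test not finding a significant difference.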

T-TEST CONCLUSION

TWO TAILED TWO SAMPLE INDEPENDENT T TEST

In this scenario, the p-value is 0.188, which is greater than 0.05.
Hence, we fail to reject the Null Hypothesis.

Thus, the data show no statistically significant change in driving
distances due to the new coating.

95% confidence interval for the difference in means is [-1.384937 to 6.934937].
Because this interval contains 0, it agrees with the test result.

TWO TAILED ONE SAMPLE T TEST

95% confidence interval for Current balls driving distance mean is
[267.4757 to 273.0743]

95% confidence interval for New balls driving distance mean is
[264.3348 to 270.6652]

• The lower mean for the New balls may partly reflect their higher
variance compared to the Current balls.
• The variance of the New balls' driving distances (97.95) is 28% higher
than the variance of the Current balls' driving distances (76.61).
• We are unsure of the sampling error present in the data.
• Statistically, no effect of the new coating on driving distances was
detected. It is still advisable to check its effect on weight and on other
characteristics such as the size and shape of the new balls.
• Also, the given sample is from only one golf course. It is advisable that
the test be performed on different golf courses.

TYPE 1 AND TYPE 2 ERRORS

Type I error (α): the probability of rejecting the null hypothesis when it is
true. In hypothesis testing it is predetermined by the significance level.
Type II error (β): the probability of failing to reject the null hypothesis when
it is false. Its calculation depends on the population mean, which is unknown.

POWER OF THE TEST AND SAMPLE SIZE

Alternative hypothesis: µNew - µcurrent = µd = 5 yards (our assumption).

Null hypothesis: µNew - µcurrent = µd = 0

The probability of a Type I error is predetermined by the significance
level. If the significance level is 0.05, there is a 5% probability of
making a Type I error, i.e. rejecting the null hypothesis when it is
true.

The Type II error calculation depends on a particular value of µ. In this
case let us assume the difference between the population means is 5
yards and the significance level for the test is 0.05. The calculation
is as below:

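This is a two tailed test.

We fail to reject the null hypothesis (and commit a Type II error if it
is false) when the absolute value of the t statistic is less than the
critical value. For a two-tailed test at α = 0.05 with 40 + 40 - 2 = 78
degrees of freedom, the critical value is

> qt(0.975, 78)

which is approximately 1.99. The value abs(qt(0.05, 38)) = 1.685954 is
the one-tailed cutoff for 38 degrees of freedom and is too small for
this test.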

SD for difference is 13.74397

Difference in mean is -2.775
Two-sample t test power calculation

n = 40
delta = 2.775
sd = 13.74397
sig.level = 0.05
power = 0.14274
alternative = two.sided

NOTE: n is number in each group
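
The power output above can be reproduced with power.t.test from base R's stats package, using the inputs it prints (note that delta here is the observed difference of 2.775 yards). The sketch below also shows, as an assumption that is not part of the original analysis, what the power would be if sd were the pooled within-group standard deviation computed from the two sample variances, since power.t.test treats sd as the common standard deviation within each group.

```r
# Reproduce the reported power from its printed inputs
res_reported <- power.t.test(n = 40, delta = 2.775, sd = 13.74397,
                             sig.level = 0.05, type = "two.sample",
                             alternative = "two.sided")
power_reported <- res_reported$power
beta_reported  <- 1 - power_reported     # Type II error probability

# Assumption: pooled within-group SD from the two sample variances
sd_pooled <- sqrt((76.61474 + 97.94872) / 2)
res_pooled <- power.t.test(n = 40, delta = 2.775, sd = sd_pooled,
                           sig.level = 0.05, type = "two.sample",
                           alternative = "two.sided")
power_pooled <- res_pooled$power

cat("Power with sd = 13.74397:", round(power_reported, 4), "\n")
cat("Pooled within-group sd:  ", round(sd_pooled, 4), "\n")
cat("Power with pooled sd:    ", round(power_pooled, 4), "\n")
```

Either way the power to detect a difference as small as 2.775 yards with 40 balls per model is low, which is why a non-significant result here is weak evidence that the coating has no effect.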

Basically, the power of the test is the probability that we make the right
decision when the null is not correct (i.e. we correctly reject it)

SAMPLE SIZE TO MAKE PROBABILITIES OF TYPE I AND TYPE II ERROR EQUAL

Let us assume that we need the Type I and Type II error probabilities
both equal to 0.05.
Assuming the sample standard deviation is equal to the population
standard deviation, we can calculate the sample size needed as below:

Null hypothesis' mean difference µ0 is 0.
Alternative hypothesis' mean difference µ1 is 5.
Sample standard deviation is 13.74397.
Alpha value (α) is 0.05.
Beta value (β) is 0.05, i.e. the power of the test is 0.95 = 95%.

Two-sample t test power calculation

n = 197.3383
delta = 5
sd = 13.74397
sig.level = 0.05
power = 0.95
alternative = two.sided

NOTE: n is number in each group
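
The sample-size output above comes from leaving n unspecified in power.t.test and supplying the target power instead. A minimal sketch using the same printed inputs follows; the variable names are illustrative.

```r
# Solve for the per-group sample size that gives 95% power (beta = 0.05)
# to detect a 5-yard difference at alpha = 0.05
res_n <- power.t.test(delta = 5, sd = 13.74397, sig.level = 0.05,
                      power = 0.95, type = "two.sample",
                      alternative = "two.sided")

n_exact   <- res_n$n
n_rounded <- ceiling(n_exact)   # round up so power does not fall below 0.95

cat("Exact n per group:  ", round(n_exact, 4), "\n")
cat("Rounded n per group:", n_rounded, "\n")
```

Because n is the number in each group, the total number of balls tested would be twice the rounded value.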

Since n must be a whole number and rounding down would reduce the
power, we round up to the next whole number. Therefore, we need a
sample size of 198 balls per group for the Type I and Type II error
probabilities to both equal 0.05.

CONCLUSION

From the given data, it may be concluded that there is no statistically
significant change in driving distance due to the new coating on golf
balls. However, our recommendation is that the test be carried out
with a larger sample size, covering a number of golf courses (at least
five), to improve the accuracy of the test results and to rule out
any effect of one type of ground. The results should also be interpreted,
and future actions planned, with other characteristics such as size,
shape, and weight in mind.
