0% found this document useful (0 votes)

82 views14 pages

Stats Project

This document summarizes a student's statistics project analyzing candy color distributions in bags of Skittles. The project involved: 1) Students counting Skittle colors in their bags and compiling class data into charts comparing individual results to the class averages. Most bags contained the most green and yellow Skittles, as predicted. 2) The student calculating statistics like the mean, standard deviation, and five-number summary for each color using the class data. Yellow and green Skittles had the highest averages per bag. 3) The student creating graphs like histograms and box plots of the color data, finding the distribution was slightly left-skewed. 4) The student computing confidence intervals for the population proportions, means

Uploaded by

api-302666758

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

82 views14 pages

Stats Project

Uploaded by

api-302666758

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

MATH 1040 Skittles Data Project

In my statistic 1040 math course fall 2019 I was required to complete a gathering data

project. This project required one to work alone and also have group discussions with other

fellow students. For the first part of the assignment each student in the class was required to buy

a bag of skittles and count that number of each color of candy in the bag. The class data was

complied and used for a number of other different exercises involving methods a statistician

might use in their life. The second part of the project I determined the proportion of each color of

candy and created different charts for the total number of each color of candies in the entire

class. These charts compared the data to my own personal data and made it easier for myself and

others to notice any differences and similarities between my bag of candy and the other students

in the class. The third part of the project I used the skittles data from the class to create statistic

summaries of the mean, standard deviation and a five number summary. I was able to make a

frequency histogram of the total number of candies as well as a box plot. Under each chart one

was able to read a description at what they might be looking at. The last part of the project

involved confident interval. I found three different confidence intervals for the population

proportion, mean, and standard deviation. Each of the confidence intervals had an analysis

written describing what each confidence interval meant.

Part #1 For this part of the project I was able to determine the proportion of each color of candy

and created different charts for the total number of each color of candies in the entire class.

These charts compared the data to my own personal data and made it easier for myself and others

to notice any differences and similarities between my bag of candy and the other students in the

class.

Hypothesis: I believe that upon the data collection there will be significantly more green and

yellow skittles than red, orange, or purple.

This graph represents the color of skittles verse the number of each color in a bag of skittles. The

class complied their data together comparing their own bag to everyone else in the class. The x

axis represents the different colors of skittles that were found in each students bag of skittles. The

y axis represents the relative amount of skittles of each color every student in the class had in

their own bag.

Skittles Colors Paragraph:

For my prediction before I counted what was in my skittles bag I thought that green and yellow

would have the majority of the count in the bag, but I was wrong. It turns out that for my

particular bag green and red were the smallest counts that I had. As fore the rest of the class they

seemed to stay true to my prediction of green and yellow having the majority of the count in their

bags. I was surprised that red had an 18% proportion and being the smallest count next to purple.
It seems like for the overall data collection that the data seems very uniform all the proportions

are around 20%.

Group Discussion:

Does the Class data represent a random sample?

Yes, the class data does represent a random sample. Although each student was asked to buy their

own bag of skittles and not every bag of skittles in the region had an equal chance of being

selected, the distribution of skittles from the central plant/warehouse was most likely random.

The skittles company most likely does not count colors as they load the bags and simply loads by

weight, and assuming students did not make any biased decisions about which bag to grab off the

shelf every bag produced had an equal chance of being shipped to any location in the country

and being selected at random by a student in the class.

What would the population be?

In this study, the sample is the class data. Since not everyone in the class is currently living in the

same state, the population would be all 2.17 ounce skittles bags in the United States. There are

currently different manufacturing plants operating overseas, therefore the population can only

reasonably be expanded to include the United States distribution circuit.

Part #2: I used the skittles data from the class to create statistic summaries of the mean, standard

deviation and a five number summary. I was able to make a frequency histogram of the total

number of candies as well as a box plot. Under each chart one was able to read a description at

what they might be looking at.

This table was made from the values of the total skittles in the class using a program called “Stat

Crunch.” This table shows the mean, standard deviation, and 5 number summary of each color or

skittles the class had found in their bags all compiled together with each student in the class.

Var2 shows all the data for the Red skittles. Var3 shows all the data for the Orange skittles. Var4

shows all the data for the Yellow skittles. Var5 shows all the data for the green skittles. Var 5

shows all the data for the Purple skittles. Comparing all the colors of the skittles together one can

see that Yellow and Green have the higher average per each students bag of skittles.
This image represents a Histogram of the Skittles Colors in each students bag of skittles. This

histogram shows the frequency of disruption. One can see that the histogram is slightly bell

shape but does appear to be more left skewed.

This box plot represents the frequency of the color of skittles in each students bag collected. By

analyzing this box plot one is able to distinguish that the distribution is skewed to the left.

4. Number of Candies:
By using the program, “Stat Crunch” one was able to analyze the average, standard deviation, 5

number summary, frequency of the colors of skittle in each students bag and also analyze the

follow box plot. Upon analyzing the individual images one is able to tell that the relative

distribution is lefty skewed. One was also able to determine that yellow and green are the most

common colors in each individual students bag by seeing the average is the highest in those two

colors. I was surprised by yellow and green being the most common colors because I thought

that red would be the most common. I can also see that the numbers in colors don’t differ by a lot

per bag. I would assume that the factory just randomly puts skittles in a bag by machine not

making sure the distribution was evenly through out, but upon analysis it shows that the

distribution is fairly even through out each students bag.

Group Discussion:

Categorical variables are also known as qualitative variables. These variables can be put into

different categories, such as a model of car, color, gender, etc. Quantitative data is data that can

be ordered and measured. The number of candies in a bag of skittles is quantitative, whereas the

color of the candy is categorical.

Graphing quantitative data is best done with histograms, stem leaf plots, dot plots, bar graphs,

and box plots. All of these types of graphs can be used to measure the quantity of a certain

variable. Categorical data is best graphed using a method that lets you compare the groups to one

another. A bar graph can work for both quantitative and categorical data, but a pie chart doesn’t

make sense for quantitative data because it is comparing categories to the whole. A pie chart
would effectively show the percentage of each color of skittles in a bag (categorical data), but

cannot effectively be used to show the number of skittles in a bag (quantitative data).

When it comes to calculations, mean and median only make sense for quantitative data. The

mean is the average quantity of something in an entire sample, therefore it is a more meaningful

calculation when applied to quantitative data. The median represents the middle value of the data

and once again makes the most sense only when applied to quantitative data. The best central

tendency to apply to categorical data is the mode. When looking at the colors of candy in a

skittles bag, you may not able to find the average color or the median color, but you can establish

which color occurs the most often. Likewise, when looking at the number of candies in a skittles

bag, the best values for probability distributions are going to be the average and median number

of skittles.

Part #4: The last part of the project involved confident interval. I found three different

confidence intervals for the population proportion, mean, and standard deviation. Each of the

confidence intervals had an analysis written describing what each confidence interval meant.

99% Confidence Interval estimate for the population proportion of yellow candies

X= 410

n= 1874

Z-value for 99%

p= 410/1874 = 0.2188

99% Confidence Interval Estimate: (0.194, 0.24228)

Confidence Intervals estimated from a population proportion are used to determine, with the

specified degree of confidence, the proportion of a characteristic found within a population. In

relation to the skittles, we are 99% confident that the proportion of yellow skittles in any bag of

skittles falls between 0.194 and 0.24228.

95% Confidence Interval estimate for the population mean number of skittles per bag

Sample mean= 58.56 (32/1874)

Sx= 2.422

Standard deviation= 2.384

n= 32

95% Confidence Interval Estimate: (57.876,59.258)

Confidence Interval estimates of the population mean use sample date to give an interval with

the specified degree of confidence that the mean characteristic of a population should fall within.

In this case, we are 95% confident that the mean number of skittles in any bag is between 57.876

and 59.258.

The purpose of taking sample data and calculating statistics from them is to apply those statistics

to a larger population. Since a population is larger than a sample, how well a sample statistic can

be used to estimate a population parameter is an issue. A confidence interval helps to solve that

issue by allowing us to provide a range of values that the population parameter is likely to fall

within. The intervals are constructed with a certain level of confidence, reflected as a percentage

such as 95%, or 99%. This means that if the same population were to be examined on multiple
occasions and a parameter interval calculated each time, the intervals would contain the true

parameter in X% of cases.

Conclusion:

From taking this course, I was able to not only become a lot more familiar with my calculator but

also with how one is able to collect data and show that data properly. I was able to better

understand promotions variables, the value of knowing how to calculate means and standard

deviations, reading graphs, bot plot and histograms, and also how to better communicate the data

that I was able to find. This project at times was challenging and as was the class. I am grateful

for everything that I have learned and I know it will better help reenforce my learning within the

rest of my college and life learning.

Understandable Statistics Solutions Manual Sosany2
100% (13)
Understandable Statistics Solutions Manual Sosany2
238 pages
Math 1040 Term Project
No ratings yet
Math 1040 Term Project
7 pages
Graphical Representation of Statistical Data
100% (5)
Graphical Representation of Statistical Data
56 pages
Free Access To Test Bank For Statistics For People Who Think They Hate Statistics 6th Edition Salkind 1506333834 9781506333830 Chapter Answers
100% (32)
Free Access To Test Bank For Statistics For People Who Think They Hate Statistics 6th Edition Salkind 1506333834 9781506333830 Chapter Answers
61 pages
Stat 20053 Statistical Analysis With Software Application PDF
100% (1)
Stat 20053 Statistical Analysis With Software Application PDF
141 pages
Manual of Research Methodology
No ratings yet
Manual of Research Methodology
110 pages
As 2542.2.3-2014
100% (1)
As 2542.2.3-2014
25 pages
Quiz File 8604 Merged ASK
100% (1)
Quiz File 8604 Merged ASK
116 pages
MPC-006 e 2024-25 (Mapc) GSPH@9891268050
No ratings yet
MPC-006 e 2024-25 (Mapc) GSPH@9891268050
25 pages
Skittles Report
No ratings yet
Skittles Report
9 pages
Final Skittles
No ratings yet
Final Skittles
6 pages
The Full Skittles Project
100% (1)
The Full Skittles Project
6 pages
Final Math Project
No ratings yet
Final Math Project
7 pages
Math 1040 Skittles Term Project
No ratings yet
Math 1040 Skittles Term Project
9 pages
Trends of Retail Marketing in Bangladesh
No ratings yet
Trends of Retail Marketing in Bangladesh
7 pages
Statistical Treatment
No ratings yet
Statistical Treatment
49 pages
Solution Manual For Statistics For Busin
100% (1)
Solution Manual For Statistics For Busin
8 pages
Skittle Project Final
No ratings yet
Skittle Project Final
6 pages
Croot Part6 Eportfolio
No ratings yet
Croot Part6 Eportfolio
16 pages
Skittle Term Project
No ratings yet
Skittle Term Project
15 pages
Skittles Project Complete
No ratings yet
Skittles Project Complete
9 pages
Group Project Part 6-E Portfolio
No ratings yet
Group Project Part 6-E Portfolio
10 pages
Team Project Part 6 Final Report
No ratings yet
Team Project Part 6 Final Report
8 pages
Project Final
No ratings yet
Project Final
10 pages
The Rainbow Report
No ratings yet
The Rainbow Report
11 pages
Empirical Research
No ratings yet
Empirical Research
22 pages
Eportfolio
No ratings yet
Eportfolio
12 pages
Statsfinalproject
No ratings yet
Statsfinalproject
7 pages
Final Skittles Project
No ratings yet
Final Skittles Project
9 pages
Term Project - Eportfolio
No ratings yet
Term Project - Eportfolio
7 pages
Skittle Pareto Chart
No ratings yet
Skittle Pareto Chart
7 pages
Skittles Research Project Statistics
No ratings yet
Skittles Research Project Statistics
6 pages
Skittles Project Complete
No ratings yet
Skittles Project Complete
8 pages
Math Profile
No ratings yet
Math Profile
9 pages
Math 1040 Statistics Term Project: Blake Freeman
No ratings yet
Math 1040 Statistics Term Project: Blake Freeman
10 pages
Statistics Project
No ratings yet
Statistics Project
10 pages
Skittles
No ratings yet
Skittles
7 pages
Final Project 2017
No ratings yet
Final Project 2017
7 pages
Skittles Project Part 1
No ratings yet
Skittles Project Part 1
6 pages
Term Project Part 5 Compile Term Project Reflection and Eportfolio Posting 1
No ratings yet
Term Project Part 5 Compile Term Project Reflection and Eportfolio Posting 1
8 pages
Skittles Final Project
No ratings yet
Skittles Final Project
5 pages
11th Statistics EM - WWW - Tntextbooks.in
No ratings yet
11th Statistics EM - WWW - Tntextbooks.in
344 pages
1040 Project
No ratings yet
1040 Project
4 pages
Sanela Sakic 4/18/19: Total: 2,584 Candies (Sample Size)
No ratings yet
Sanela Sakic 4/18/19: Total: 2,584 Candies (Sample Size)
7 pages
Skittles 2 Final
No ratings yet
Skittles 2 Final
12 pages
Math Skittle Project
No ratings yet
Math Skittle Project
11 pages
Eportfolio Statistics
No ratings yet
Eportfolio Statistics
8 pages
The Science of Collecting, Organizing, Summarizing, and Analyzing Information To Draw Conclusions or Answer Questions
No ratings yet
The Science of Collecting, Organizing, Summarizing, and Analyzing Information To Draw Conclusions or Answer Questions
7 pages
Question 8: P&G Has Developed A New Toothpaste That Provides Tooth and Gum Protection For
No ratings yet
Question 8: P&G Has Developed A New Toothpaste That Provides Tooth and Gum Protection For
3 pages
Term Project Skittles
No ratings yet
Term Project Skittles
4 pages
Final Project
No ratings yet
Final Project
9 pages
Skittles Project
No ratings yet
Skittles Project
4 pages
The Skittles Project Final
No ratings yet
The Skittles Project Final
8 pages
Skittles Project Stats 1040
No ratings yet
Skittles Project Stats 1040
8 pages
Math 1040
No ratings yet
Math 1040
5 pages
Skittles Proyect Part 5
No ratings yet
Skittles Proyect Part 5
7 pages
Skittles Project 2
No ratings yet
Skittles Project 2
10 pages
Skittles Project 2
No ratings yet
Skittles Project 2
5 pages
Group Project #1 - Skittles
No ratings yet
Group Project #1 - Skittles
9 pages
Math 1040 Skittles Term Project
No ratings yet
Math 1040 Skittles Term Project
9 pages
Math 1030 Skittles Term Project
No ratings yet
Math 1030 Skittles Term Project
8 pages
Skittles Term Project
No ratings yet
Skittles Term Project
10 pages
Skittles Project
No ratings yet
Skittles Project
5 pages
Math 1040 Skittles Term Project Eportfolio
No ratings yet
Math 1040 Skittles Term Project Eportfolio
7 pages
Skittles Project Group Final Word
No ratings yet
Skittles Project Group Final Word
6 pages
Final Project
No ratings yet
Final Project
6 pages
Skittle
No ratings yet
Skittle
6 pages
Batch 2024-B.ComAnalytics11Dec
No ratings yet
Batch 2024-B.ComAnalytics11Dec
42 pages
TP 2 Stats
No ratings yet
TP 2 Stats
5 pages
Skittles Term Project 11 9 14
No ratings yet
Skittles Term Project 11 9 14
5 pages
Skittles Project Final
No ratings yet
Skittles Project Final
5 pages
Skittles Sum
No ratings yet
Skittles Sum
1 page
Math 1040
No ratings yet
Math 1040
3 pages
Rma Midterm Reviewer
No ratings yet
Rma Midterm Reviewer
11 pages
Final Project
No ratings yet
Final Project
3 pages
Statistics Notes 2022
No ratings yet
Statistics Notes 2022
130 pages
Chapter 8 - Levels of Measurement
No ratings yet
Chapter 8 - Levels of Measurement
11 pages
Nonparametric-Test Fazlul
No ratings yet
Nonparametric-Test Fazlul
13 pages
Business Analytics - Unit - I
No ratings yet
Business Analytics - Unit - I
44 pages
Notebook One
No ratings yet
Notebook One
1 page
Descriptive Statistics: Prepared By: Maira Sami
No ratings yet
Descriptive Statistics: Prepared By: Maira Sami
58 pages
Reviewer Stat Midterm
No ratings yet
Reviewer Stat Midterm
4 pages
Blog
No ratings yet
Blog
3 pages
PR2 Collecting and Organizing Data
No ratings yet
PR2 Collecting and Organizing Data
18 pages
Chapter 1 Psych Stats Important Terms
No ratings yet
Chapter 1 Psych Stats Important Terms
42 pages
Quantiative Data Analysis
No ratings yet
Quantiative Data Analysis
30 pages
Attitude Measurement & Scaling With Examples - BRM
No ratings yet
Attitude Measurement & Scaling With Examples - BRM
8 pages
Biostatistics and Research Unit 1
No ratings yet
Biostatistics and Research Unit 1
28 pages
01.statistika Eda - NDK
No ratings yet
01.statistika Eda - NDK
39 pages
BRM 9e PPT CH 13 - Measurement and Scale
No ratings yet
BRM 9e PPT CH 13 - Measurement and Scale
23 pages
Psychology Term Paper
No ratings yet
Psychology Term Paper
16 pages
The Atomic Bomb and Society
No ratings yet
The Atomic Bomb and Society
14 pages
App 5 - PR 1ST Quarter
No ratings yet
App 5 - PR 1ST Quarter
2 pages

Stats Project

Uploaded by

Stats Project

Uploaded by

MATH 1040 Skittles Data Project

written describing what each confidence interval meant.

yellow skittles than red, orange, or purple.

their own bag.

are around 20%.

Does the Class data represent a random sample?

and being selected at random by a student in the class.

What would the population be?

reasonably be expanded to include the United States distribution circuit.

what they might be looking at.

shape but does appear to be more left skewed.

distribution is fairly even through out each students bag.

color of the candy is categorical.

Z-value for 99%

99% Confidence Interval Estimate: (0.194, 0.24228)

specified degree of confidence, the proportion of a characteristic found within a population. In

skittles falls between 0.194 and 0.24228.

Sample mean= 58.56 (32/1874)

Standard deviation= 2.384

95% Confidence Interval Estimate: (57.876,59.258)

rest of my college and life learning.

You might also like