0% found this document useful (0 votes)

18 views43 pages

Quantitative Skills 2 Data Analysis

The document discusses the importance of data analysis in validating observed patterns and distinguishing among hypotheses, focusing on descriptive and inferential statistics. It explains the differences between measurement and count data, the significance of sample size, and the use of various statistical tools for both parametric and nonparametric data. Additionally, it outlines the process of conducting data analysis, including creating histograms, calculating mean, variance, and standard deviation, and interpreting results with confidence intervals.

Uploaded by

gymhb860930

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views43 pages

Quantitative Skills 2 Data Analysis

Uploaded by

gymhb860930

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 43

Quantitative Skills:

Data Analysis
Data analysis is one of the first steps
toward determining whether an
observed pattern has validity. Data
analysis also helps distinguish among
multiple working hypotheses.
Descriptive statistics serves to
summarize the data. It helps show the
variation in the data, standard errors,
best-fit functions, and confidence that
sufficient data have been collected.
Inferential statistics involves inferring
parameters in the natural population
from a sample.
Most of the data you will collect will fit
into two categories: measurements or
counts.

Measurement data Count data

Most measurements are continuous,
meaning there is an infinite number of
potential measurements over a given
range.
Count data are recordings of
qualitative, or discrete, data.

Number of leaf stomata Number of white eyed

individuals
How much is good enough?

• How much data should a researcher collect to

make a claim with confidence? How big
should the size of the sample be?

• Is it possible the results were due to chance

instead of the manipulation of the variable
being tested?
Conducting Data Analysis
When an investigation involves
measurement data, one of the first
steps is to construct a histogram, or
frequency diagram, to represent the
data’s distribution
If the data show an approximate
normal distribution on a histogram,
then they are parametric data.
If the data do not show an approximate
normal distribution on a histogram, then
they are nonparametric data. Different
descriptive statistics and tests need to be
applied to those data.
Sometimes, due to
sampling bias, data
might not fit a
normal distribution
even when the actual
population could be
normally distributed.
In this case, a larger
sample size might be
needed.
For parametric data (a normal
distribution), the appropriate
descriptive statistics include :
• the mean (average)
• sample size
• variance
• standard deviation
• standard error
The mean (x)of the sample is the
average. The mean summarizes the
entire sample and might provide an
estimate of the entire population’s true
mean.
The sample size (n)
refers to how many
members of the
population are
included in the study.
Sample size is
important when
estimating how well
the sample set
represents the entire
Variance (s2) and standard deviation (s)
measure how far a data set is spread out. A
variance of zero indicates that all the values in a
data set are identical.

Variance Distance from the mean

Because the differences from the mean are
squared to calculate variance, the units of
variance are not the same units as in the
original data set. The standard deviation is
the square root of the variance. The
standard deviation is expressed in the same
units as the original data set, which makes
it generally more useful than the variance.
A small standard deviation indicates that the
data tend to be very close to the mean. A
large standard deviation indicates that the
data are very spread out away from the mean.
A little more than two-thirds of the data points
will fall between +1 standard deviation and −1
standard deviation from the sample mean.
More than 95% of the data falls between ±2
standard deviations from the sample mean.
68–95–99.7 Rule

In a normal distribution, 68.27% of all values lie within one

standard deviation of the mean. 95.45% of the values lie
within two standard deviations of the mean. 99.73% of the
values lie within three standard deviations of the mean.
Sample standard error (SE) is a statistic
used to make an inference about how
well the sample mean matches up to
the true population mean.
Standard error should be represented by
including error bars on graphs when
appropriate. Error bars are used on graphs
to indicate the uncertainty of a reported
measurement.
Different statistical tools are used in the
case of data that does not resemble a
normal distribution (nonparametric data,
or data that is skewed or includes large
outliers).

• median
• mode
• quartiles
• box-and-whisker plots
The median is the value separating the
higher half of a data sample from the
lower half. To find the median of a data
set, first arrange the data in order from
lowest to highest value and then select
the value in the middle.

5, 1, 3, 7, 2 1, 2, 3, 5, 7
median
If there are two values in the middle of
an ordered data set, the median is
found by averaging those two values.

5, 1, 3, 7, 4, 2 1, 2, 3, 4, 5, 7
3.5
median
The mode is the value that appears
most frequently in a data set.

3, 5, 1, 3, 7, 2
3 is the mode in this example
because it appears more
frequently than any other
number.
A bimodal distribution
Data Analysis Flowchart:

Type of Data

Measurement Data Count Data

(Continuous)
· Make histogram (Discrete)

Nonparametric
Parametric
(not a normal
(normal distribution)
distribution)

Mean,
Median, mode,
standard deviation,
standard error quartiles
Example of Data Analysis:
Do shady English ivy
leaves have a larger
surface area than sunny
English ivy leaves?
Since the data collected is in centimeters,
it is measurement data, not count data.
So the first step is to make a:

HISTOGRAM
Does the data resemble a normal
curve?

(Close enough, with possible differences due to sampling error)

Next, the appropriate statistical tools are
applied:
A bar graph can then be produced to compare
the means:
Do the error bars for the shady leaf
mean overlap with the error bars for
the sunny leaf mean?

(No.)
A more rigorous statistical test will need to
be performed, but because the error bars
do not overlap there is a high probability
that the two populations are indeed
different from each other.
Example of Data Analysis:
Is 98.6°F actually the average body
temperature for humans?
Since the data collected is in Farenheit,
it is measurement data, not count
data. So the first step is to make a:

HISTOGRAM
Does the data resemble a normal
curve?

(Close Enough)
Next, the appropriate statistical tools are
applied:

*
Note that by convention, descriptive statistics rounds
the calculated results to the same number of decimal
places as the number of data points plus 1.
According to the 68–95–99.7 Rule, 68%
of all samples lie within one standard
deviation from the mean. This means
that around 68% of the temperatures
should be between 97.51 and 98.99.
Including the standard error, we can
say with a 68% confidence that the
mean human body temperature of our
sample is 98.25 ± 0.06°F.

Basic Statistics (3685) PPT - Lecture On 20-01-2019
100% (1)
Basic Statistics (3685) PPT - Lecture On 20-01-2019
64 pages
Employee Job Satisfaction Research
100% (8)
Employee Job Satisfaction Research
40 pages
Prem Mann, Introductory Statistics, 9/E
No ratings yet
Prem Mann, Introductory Statistics, 9/E
64 pages
Descriptive Statistics and Exploratory Data Analysis
No ratings yet
Descriptive Statistics and Exploratory Data Analysis
36 pages
Safety Critical Element Methodology in Oil & Gas
100% (5)
Safety Critical Element Methodology in Oil & Gas
5 pages
Market Research For Microfinance Participant S Manual 1
No ratings yet
Market Research For Microfinance Participant S Manual 1
41 pages
Akash Summer Intern Report
50% (2)
Akash Summer Intern Report
29 pages
Statistics
100% (6)
Statistics
211 pages
Udacity Statistics Notes
No ratings yet
Udacity Statistics Notes
37 pages
Emgt 512 SP 2024
No ratings yet
Emgt 512 SP 2024
156 pages
Basic Statistics
100% (9)
Basic Statistics
73 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
Dredging
No ratings yet
Dredging
324 pages
An Application of Keller's Brand Equity Model in A B2B
100% (1)
An Application of Keller's Brand Equity Model in A B2B
30 pages
A - Al Emran Et Al (2020) Continuous Intention To Use M Learning - Q1
No ratings yet
A - Al Emran Et Al (2020) Continuous Intention To Use M Learning - Q1
20 pages
CE 459 Statistics: Assistant Prof. Muhammet Vefa AKPINAR
No ratings yet
CE 459 Statistics: Assistant Prof. Muhammet Vefa AKPINAR
211 pages
Statistics
No ratings yet
Statistics
12 pages
Summer Internship Goldy
No ratings yet
Summer Internship Goldy
125 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
09 - Data Analysis - Descriptive Statistics
No ratings yet
09 - Data Analysis - Descriptive Statistics
23 pages
SSM & Da All Unit Notes
No ratings yet
SSM & Da All Unit Notes
152 pages
PPD Findings Final 121922
No ratings yet
PPD Findings Final 121922
49 pages
Statistics
No ratings yet
Statistics
68 pages
Basic Concepts of Statistics
No ratings yet
Basic Concepts of Statistics
43 pages
Math 553
No ratings yet
Math 553
271 pages
Summarising and Analysing Data
No ratings yet
Summarising and Analysing Data
36 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
3 - Descriptive Stat
No ratings yet
3 - Descriptive Stat
70 pages
NITKclass 1
No ratings yet
NITKclass 1
50 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Entrep
No ratings yet
Entrep
41 pages
Ai - Ssmda
No ratings yet
Ai - Ssmda
142 pages
Advance Statistics For Data Science and Data Analysis
No ratings yet
Advance Statistics For Data Science and Data Analysis
47 pages
Lec 1
No ratings yet
Lec 1
54 pages
WEEK 8B BUSINESS STRATEGY - Implementing Strategy
No ratings yet
WEEK 8B BUSINESS STRATEGY - Implementing Strategy
18 pages
Lesson2 - Measures of Tendency
No ratings yet
Lesson2 - Measures of Tendency
65 pages
Unit 2 DS PDF
No ratings yet
Unit 2 DS PDF
97 pages
CHAPTER 6 - Basic Statistic Concepts
No ratings yet
CHAPTER 6 - Basic Statistic Concepts
46 pages
Cgiuki Board Evaluation - Full Report
No ratings yet
Cgiuki Board Evaluation - Full Report
53 pages
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
No ratings yet
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
44 pages
Unit 5.2 Advanced Genetics 2223
No ratings yet
Unit 5.2 Advanced Genetics 2223
102 pages
MMW Data Management
No ratings yet
MMW Data Management
35 pages
The Problem
No ratings yet
The Problem
106 pages
Data Analysis and Statistical Treatment
No ratings yet
Data Analysis and Statistical Treatment
99 pages
Statistics, Statistical Modelling & Data Analytics
No ratings yet
Statistics, Statistical Modelling & Data Analytics
68 pages
05 - Statistical Processing and Analysis of Medical Data
No ratings yet
05 - Statistical Processing and Analysis of Medical Data
14 pages
Bakercollegeproject 3
No ratings yet
Bakercollegeproject 3
19 pages
Unit .......
No ratings yet
Unit .......
45 pages
Descriptive Analysis
No ratings yet
Descriptive Analysis
20 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
50 pages
2 Research - 2ND QT - Week 1 - 10 14 2024
No ratings yet
2 Research - 2ND QT - Week 1 - 10 14 2024
13 pages
MR - Factor Analysis
No ratings yet
MR - Factor Analysis
61 pages
Part 3
No ratings yet
Part 3
36 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
17 pages
Some Imoprtant Topics of Statistics With Defination
No ratings yet
Some Imoprtant Topics of Statistics With Defination
46 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
How Do SMEs Decide On International Market Entry - An Empirical Examination in The Middle East
No ratings yet
How Do SMEs Decide On International Market Entry - An Empirical Examination in The Middle East
18 pages
1 - Chapter (1) Analysis of Data and Its Types Exercise
No ratings yet
1 - Chapter (1) Analysis of Data and Its Types Exercise
10 pages
Business and Statistics
No ratings yet
Business and Statistics
29 pages
Quant Descriptive Statistics
No ratings yet
Quant Descriptive Statistics
37 pages
Background of Study
No ratings yet
Background of Study
5 pages
Module 2c - Exploratory Data Analysis
No ratings yet
Module 2c - Exploratory Data Analysis
18 pages
AQA Capacitor Energy
No ratings yet
AQA Capacitor Energy
25 pages
Week 4 Bioscience
No ratings yet
Week 4 Bioscience
37 pages
Reviewer in IE-SAN1
No ratings yet
Reviewer in IE-SAN1
5 pages
OCR A Capacitors Answers
No ratings yet
OCR A Capacitors Answers
6 pages
Lecture 2 Descriptive Statistics
No ratings yet
Lecture 2 Descriptive Statistics
46 pages
Statistics
No ratings yet
Statistics
25 pages
Ijert Ijert: Design of Pile Foundation at GALANDER-KANDIZAL Bridge in J&K
No ratings yet
Ijert Ijert: Design of Pile Foundation at GALANDER-KANDIZAL Bridge in J&K
10 pages
Wa Nko Nalipay PR
No ratings yet
Wa Nko Nalipay PR
12 pages
Quantitative Skills 3 Hypothesis Testing
No ratings yet
Quantitative Skills 3 Hypothesis Testing
19 pages
Wei Et Al-2018-Conservation Letters
No ratings yet
Wei Et Al-2018-Conservation Letters
11 pages
A Comparative Evaluation of Nigeria Television Authority and Arise Television Coverage of ENDSARS Protest in Nigeria
No ratings yet
A Comparative Evaluation of Nigeria Television Authority and Arise Television Coverage of ENDSARS Protest in Nigeria
10 pages
11572-Article Text-33483-1-10-20150410
No ratings yet
11572-Article Text-33483-1-10-20150410
10 pages
Chloride Diffusion Coefficient Calculation
No ratings yet
Chloride Diffusion Coefficient Calculation
1 page
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Appendix B: Introduction To Statistics: Eneral Terminology
No ratings yet
Appendix B: Introduction To Statistics: Eneral Terminology
15 pages
Quantitative Skills 5 Mathematical Modelling
No ratings yet
Quantitative Skills 5 Mathematical Modelling
9 pages
The First Type of Data Analysis Is Descriptive Analysis 1694268813
No ratings yet
The First Type of Data Analysis Is Descriptive Analysis 1694268813
16 pages
Lund 2010
No ratings yet
Lund 2010
12 pages
Statistics 1
No ratings yet
Statistics 1
10 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
San Carlos City, Negros Occidental
No ratings yet
San Carlos City, Negros Occidental
6 pages
Jose'S Mexican Resturant: Q1: How Should Quality Be Defined at This Restaurant ?
No ratings yet
Jose'S Mexican Resturant: Q1: How Should Quality Be Defined at This Restaurant ?
7 pages
DSBDL Asg 3 Write Up
No ratings yet
DSBDL Asg 3 Write Up
6 pages
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
No ratings yet
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
4 pages
Nursing Research Process in A Nutshell
No ratings yet
Nursing Research Process in A Nutshell
11 pages
Research Methodology Questions
No ratings yet
Research Methodology Questions
4 pages
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
No ratings yet
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
4 pages
Subtitle Big Data Coursera 1
No ratings yet
Subtitle Big Data Coursera 1
2 pages
2001 Quick Answers - General Maths 2001 Quick Answers
No ratings yet
2001 Quick Answers - General Maths 2001 Quick Answers
2 pages
Meta Analysis Monitor & Control Project Cost Management
No ratings yet
Meta Analysis Monitor & Control Project Cost Management
1 page
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet

Quantitative Skills 2 Data Analysis

Uploaded by

Quantitative Skills 2 Data Analysis

Uploaded by

Quantitative Skills:

Measurement data Count data

Number of leaf stomata Number of white eyed

• How much data should a researcher collect to

• Is it possible the results were due to chance

Variance Distance from the mean

In a normal distribution, 68.27% of all values lie within one

Measurement Data Count Data

(Close enough, with possible differences due to sampling error)

You might also like