P299 Module 8 Notes

Uploaded by

jbcruz2

MODULE 8: QUANTITATIVE DATA ANALYSIS – INFERENTIAL AND DESCRIPTIVE STATISTICS

Source: Inferential Statistics

Inference - a method of making judgments about an unknown, drawing on what is already known to be true.

Statistical Inference - the science of characterizing or making decisions about a population by using information from a sample drawn from that population.

Descriptive Statistics - used when the cases you are studying represent the entire population of interest and you do not wish to generalize beyond those cases.

Inferential Statistics - used when the cases you are studying do not represent the entire population of interest and you do wish to generalize beyond those cases.

Theoretical Probability Distributions
• defined by a formula that specifies what values can be taken by data points within the distribution and how common each value will be
• often presented in graphical form
• useful in inferential statistics because their properties and characteristics are known

Classifications of Probability Distribution
1. Continuous - the data can take any value within a specified range
2. Discrete - the data can take only certain values

Normal Distribution
• a reasonable description of how many continuous variables are distributed in reality, from industrial process variation to intelligence test scores
• under specified conditions, we may assume that sampling distributions of statistics such as the sample mean are normally distributed, even if the samples are drawn from populations that are not normally distributed
• also referred to as the bell curve due to its characteristic shape
• also referred to as the Gaussian distribution in honor of the mathematician and physicist Carl Friedrich Gauss (1777–1855), who used this distribution to analyze astronomical data

Standard Normal Distribution (Z Distribution)
• a normal distribution with a mean of 0 and a standard deviation of 1
• any normal distribution can be transformed to the standard normal distribution by converting the original values to standardized scores

Characteristics of Normal Distribution
• Symmetry
• Unimodality (a single most common value)
• A continuous range from −∞ to +∞ (from negative infinity to positive infinity)
• A total area under the curve of 1
• A common value for the mean, median, and mode

Empirical Rule for Any Normal Distribution
• About 68% of the data will fall within one standard deviation of the mean.
• About 95% of the data will fall within two standard deviations of the mean.
• About 99.7% of the data will fall within three standard deviations of the mean.

Z-Score
• Comparisons between scores are facilitated by converting
raw scores (scores in their natural metric, for instance, weight measured in pounds or kilograms) into Z-scores, which express the value of the score in terms of units of the standard deviation.
• sometimes referred to as normalized scores
• facilitate comparison of scores from populations with different means and standard deviations
• the distance of a data point from the mean, expressed in units of standard deviation:

Z = (X − µ) / σ

where:
X – observed value
µ – mean
σ – standard deviation

Binomial Distribution
• applies to many types of real-life data with dichotomous outcomes (outcomes that can take only two values), from machine parts that are either defective or acceptable to students who either pass or fail a class
• events in a binomial distribution are generated by a Bernoulli process

Requirements for Data Represented by Binomial Distribution
1. The outcome of each trial is one of two mutually exclusive outcomes.
2. Each trial is independent, so the result of one trial has no influence on the result of any other trial.
3. The probability of success, denoted as p, is constant for every trial.
4. There is a fixed number of trials, denoted as n.

Formula for Binomial Distribution

P(X = k) = C(n, k) · p^k · (1 − p)^(n − k)

where:
k – number of successes in the n trials
C(n, k) – number of ways to choose k successes from n trials, n! / (k! (n − k)!)

Variables
1. Independent - presumed to influence the value of the dependent variable
2. Dependent - represents an outcome of the study
3. Control - might influence the dependent variable but is not the main focus of interest

Population - consists of all the people or other entities that the researchers would like to study if they had infinite resources

Non-Probability Sampling
• there is a high probability that a sample drawn using a nonprobability method will not be representative of the population of interest, and there is no way to correct the sample statistically
• popular because the researcher can bypass the more cumbersome process of drawing a probability sample, but a price is paid for this convenience
• conclusions based on data collected with nonprobability sampling methods are of limited usefulness in generalizing to a larger population because there is no way to know how the sample relates to the population of interest

Types of Non-Probability Sampling
1. Volunteer Sampling
  • Use of volunteer samples is best reserved for circumstances in which it would be difficult to select a sample randomly from a population, for instance in a study about people who use illegal drugs.
  • Even with limited ability to generalize, useful information can be gained from volunteer samples, particularly in the early stages of a project.
  • Results from volunteer samples have limited usefulness if the goal is to generalize beyond the sample.

2. Convenience Sampling
  • can be used to collect information in the early stages of a study but has limited usefulness if the goal is to generalize beyond the sample
  • if, for example, 50 people are surveyed simply because they were easy to reach, they are not a random selection of area residents, so it would not be valid to conclude that their opinions reflect those of the area as a whole
  • you might use the information gained from a survey administered to a convenience sample to construct a questionnaire for a more scientific sample of the area's population

3. Quota Sampling
  • the data collector is instructed to get responses from a certain number or proportion of subjects within broad classifications
  • a slight improvement over convenience sampling because it can ensure representation of different demographic groups within the sample
  • you still have no way of knowing whether the people in the sample are representative of the population of interest
  • the data collector might approach people who seem most like themselves (for instance in age) or who seem the friendliest or most approachable, rendering the sample even less useful as a means to acquire information about a larger population

Probability Sampling
• every member of the population has a known probability of being selected for the sample
• preferred because the researcher can generalize the results obtained from the sample to the population of interest

Types of Probability Sampling
1. Simple Random Sampling
  • all samples of a given size have an equal probability of being selected
  • has the most desirable statistical properties of any kind of sampling
  • can be impossible or prohibitively expensive to execute in some contexts

2. Systematic Sampling
  • you need a list or other enumeration of your population
  • you then choose a start number at random between 1 and n and include in your sample the
object representing the start number and every nth object following
  • particularly useful when the population accrues over time and there is no predetermined list of population members
  • you must ensure that the data is not cyclic in a way that corresponds with your random starting point and value of n

3. Stratified Sampling
  • the population of interest is divided into nonoverlapping groups or strata based on common characteristics
  • if comparing different strata or making estimates of the characteristics of subgroups is a primary goal of the study, stratified sampling is a good choice because it can be designed to ensure adequate sampling from each stratum of interest

4. Cluster Sampling
  • the population is sampled by using preexisting groups
  • often used in national surveys that require in-person interviews or the collection of physical specimens

Central Limit Theorem
• states that the sampling distribution of the sample mean approximates the normal distribution, regardless of the distribution of the population from which the samples are drawn, if the sample size is sufficiently large
• enables us to make statistical inferences based on the properties of the normal distribution, even if the sample is drawn from a population that is not normally distributed

Steps in Hypothesis Testing
1. Develop a research hypothesis that can be tested mathematically.
2. Formally state the null and alternative hypotheses.
3. Decide on an appropriate statistical test, gather data, and do the calculations.
4. Make your decision based on the results.

Types of Hypotheses
1. Null Hypothesis - always predicts no effect or no relationship between variables
2. Alternative Hypothesis - states your research prediction of an effect or relationship

One-Tailed vs Two-Tailed Tests
1. One-Tailed Test - allows for the possibility of an effect in one direction only
2. Two-Tailed Test - allows for the possibility of an effect in two directions, positive and negative

Type I and Type II Errors
1. Type I Error - rejecting the null hypothesis when it is actually true (a false positive)
2. Type II Error - failing to reject the null hypothesis when it is actually false (a false negative)

Point Estimate vs Interval Estimate
1. Point Estimate - a single statistic that represents a single point on the number line
2. Interval Estimate - a range of numbers, rather than a single value, used to estimate the population value
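
Several of the ideas above can be checked numerically. The following is a minimal Python sketch, not part of the original notes: the function names (z_score, binomial_pmf, sample_means, z_test), the exponential population, the seed, and all the numbers are assumptions chosen for illustration. The z test shown is one possible "appropriate statistical test" for step 3 of hypothesis testing, and it assumes the population standard deviation is known.

```python
import math
import random
from statistics import NormalDist, mean, stdev


def z_score(x, mu, sigma):
    """Distance of x from the mean mu, in units of the standard deviation sigma."""
    return (x - mu) / sigma


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def sample_means(draw, sample_size, repetitions):
    """Means of repeated samples; draw() returns one value from the population."""
    return [
        mean([draw() for _ in range(sample_size)])
        for _ in range(repetitions)
    ]


def z_test(sample_mean, mu0, sigma, n, two_tailed=True):
    """One-sample z test with known sigma; returns (z statistic, p-value)."""
    z = (sample_mean - mu0) / (sigma / math.sqrt(n))
    tail = 1 - NormalDist().cdf(abs(z))  # area beyond |z| in one tail
    # The one-tailed value assumes the effect lies in the predicted direction.
    return z, (2 * tail if two_tailed else tail)


# Illustrative, made-up numbers: a score of 130 on a scale with mean 100, SD 15.
print("z-score:", z_score(130, 100, 15))
print("P(2 successes in 5 trials, p = 0.5):", binomial_pmf(2, 5, 0.5))

# Central limit theorem: the population is exponential (strongly skewed),
# yet the means of samples of size 40 cluster around the population mean of 1.
rng = random.Random(42)  # fixed seed so the run is repeatable
means = sample_means(lambda: rng.expovariate(1.0), sample_size=40, repetitions=2000)
center, spread = mean(means), stdev(means)
within_one_sd = sum(abs(m - center) <= spread for m in means) / len(means)
print(f"mean of sample means: {center:.3f}, SD: {spread:.3f}")
print(f"share within one SD: {within_one_sd:.1%}")

# Hypothesis test: is a sample mean of 103 (n = 100) consistent with mu0 = 100?
z, p = z_test(103, 100, 15, 100)
print(f"z = {z:.2f}, two-tailed p = {p:.4f}")
```

The share of sample means within one standard deviation should come out close to the 68% of the empirical rule, even though the underlying population is far from normal. In step 4, the p-value would be compared with a threshold chosen in advance (0.05 is a common convention, not a rule stated in these notes).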

Confidence Interval
• the interval between two values that represent the upper and lower confidence limits (or confidence bounds) for a statistic
• the formula used to calculate the confidence interval depends on the statistic being used
• conveys important information about the precision of a point estimate
• if the statistic is the mean and we are using a 95% confidence interval, then over an infinite number of repetitions of drawing a sample and computing its mean, 95% of the confidence intervals thus constructed would contain the true mean of the population

P-Value
• expresses the probability of obtaining results at least as extreme as those observed in the sample data if chance alone were at work (that is, if the null hypothesis were true)
• commonly reported for most research results involving statistical calculations, in part because intuition is a poor guide to how unusual a particular result is

Z-Statistic
• instead of asking what the probability of a particular score is, we are now interested in the probability of a particular sample mean
• an important example of the application of the central limit theorem, which allows us to compute the probability of a sample result by using the normal distribution, even if we don't know the distribution of the population from which the sample was drawn

Source: Descriptive Statistics

Descriptive Statistics
• the use of statistical and graphic techniques to present information about the data set being studied
• it is common practice to begin an analysis by examining graphical displays of a data set and computing some basic descriptive statistics to get a better sense of the data to be analyzed

Measures of Central Tendency
1. Mean
  • the average of a set of values
  • appropriate for interval and ratio data
  • not an appropriate summary measure for every data set because it is sensitive to extreme values (outliers) and can be misleading for skewed (nonsymmetrical) data
  • Trimmed Mean - calculated by discarding a certain percentage of the extreme values in a distribution and then calculating the mean of the remaining values (the closely related Winsorized mean replaces the extreme values instead of discarding them)

2. Median
  • the middle value when the values are ranked in ascending or descending order
  • a better measure of central tendency than the mean for data that is asymmetrical or contains outliers
  • it does not matter whether the data set contains some
extremely large or small values because they will not affect the median more than less extreme values

3. Mode
  • refers to the most frequently occurring value
  • most often useful in describing ordinal or categorical data

Dispersion - refers to how variable or spread out data values are.

Measures of Dispersion
1. Range
  • the difference between the highest and lowest values
  • if there are one or a few outliers in the data set, the range might not be a useful summary measure

2. Interquartile Range
  • an alternative measure of dispersion that is less influenced than the range by extreme values
  • the range of the middle 50% of the values in a data set, calculated as the difference between the 75th and 25th percentile values

3. Variance
  • the average of the squared deviations from the mean

4. Standard Deviation
  • the square root of the variance

5. Coefficient of Variation
  • a measure of relative variability that makes it possible to compare variability across variables measured in different units

Outliers
• a data point or observation whose value is quite different from the others in the data set being analyzed
• a data point that seems to come from a different population or is outside the typical pattern of the other data points

Graphic Methods
1. Frequency Tables
  • appropriate when the actual values of the numbers in different categories, rather than the general pattern among the categories, are of primary interest
  • an efficient way to present large quantities of data and represent a middle ground between text
(paragraphs describing the data values) and pure graphics (such as a histogram)
  • Absolute Frequency - raw numbers or counts for each category
  • Relative Frequency - displays the percent of the total represented by each category
  • Cumulative Frequency - shows the relative frequency for each category and those below it

2. Bar Chart
  • particularly appropriate for displaying discrete data with only a few categories

3. Pie Chart
  • shows graphically what proportion each part occupies of the whole
  • most useful when there are only a few categories of information and the differences among those categories are fairly large

4. Pareto Chart/Diagram
  • combines the properties of a bar chart and a line chart; the bars display frequency and relative frequency, whereas the line displays cumulative frequency
  • it is easy to see which factors are most important in a situation and, therefore, to which factors most attention should be directed
  • the bars are ordered in descending frequency from left to right (so the most common cause is the furthest to the left and the least common the furthest to the right), and a cumulative frequency line is superimposed over the bars

5. Stem and Leaf Plot
  • divide your data into intervals (using your common sense and the level of detail appropriate to your purpose) and display each data point by using two columns
  • the stem is the leftmost column and contains one value per row, and the leaf is the rightmost column and contains one digit for each case belonging to that row
  • a plot that displays the actual values of the data set but also assumes a shape indicating which ranges of values are most common
  • not only tells us the actual values of the scores and their range but the basic shape of their distribution as well

6. Box Plot
  • also known as the hinge plot or the box-and-whiskers plot
  • a compact way to summarize and display the distribution of a set of continuous data
  • always constructed to highlight five important characteristics of a data set: the median, the first and third quartiles (and hence the interquartile range as well), and the minimum and maximum

7. Histogram
  • the bars (also known as bins, because you can think of them as bins into which values from a continuous distribution are sorted) touch each other, unlike the bars in a bar chart
  • the x-axis (horizontal axis) in a histogram represents a scale rather than simply a series of labels, and the area of each bar represents the proportion of values that are contained in that range

8. Bivariate Charts
  • charts that display information about the relationship between two variables
  • Scatterplot - defines each point in a data set by two values, commonly referred to as x and y, and plots each point on a pair of axes
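
The measures of central tendency and dispersion listed under Descriptive Statistics can be computed with Python's standard library. This is a minimal sketch, not part of the original notes: the function names (trimmed_mean, describe, frequency_table) and the small data set are hypothetical. It uses the population formulas, matching the definition of variance as the average squared deviation; sample versions that divide by n − 1 are available as statistics.variance and statistics.stdev. The quartiles use the "inclusive" interpolation method, which is only one of several conventions.

```python
from collections import Counter
from statistics import mean, median, mode, pstdev, pvariance, quantiles


def trimmed_mean(values, proportion):
    """Mean after discarding `proportion` of the values from each end."""
    ordered = sorted(values)
    cut = int(len(ordered) * proportion)
    return mean(ordered[cut:len(ordered) - cut])


def describe(values):
    """Basic measures of central tendency and dispersion for one variable."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    avg = mean(values)
    sd = pstdev(values)  # square root of the population variance
    return {
        "mean": avg,
        "median": median(values),
        "mode": mode(values),
        "range": max(values) - min(values),
        "iqr": q3 - q1,  # 75th percentile minus 25th percentile
        "variance": pvariance(values),
        "sd": sd,
        "cv_percent": 100 * sd / avg,  # coefficient of variation
    }


def frequency_table(values):
    """Rows of (value, absolute, relative, cumulative relative frequency)."""
    counts = Counter(values)
    total = len(values)
    running = 0
    rows = []
    for value in sorted(counts):
        running += counts[value]
        rows.append((value, counts[value], counts[value] / total, running / total))
    return rows


scores = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data
summary = describe(scores)
for name, value in summary.items():
    print(f"{name}: {value}")

# One extreme value pulls the mean far more than the median or trimmed mean.
with_outlier = scores + [90]
print("mean:", mean(with_outlier))
print("median:", median(with_outlier))
print("trimmed mean (20% each end):", trimmed_mean(with_outlier, 0.2))

for row in frequency_table(scores):
    print(row)
```

Adding the single value 90 moves the mean from 5 to more than 14, while the median only shifts from 4.5 to 5, which is the sensitivity to outliers described in the notes.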
