3 - Introduction To Inferential Statistics

This document discusses key concepts in inferential statistics including descriptive statistics, probability distributions, the normal distribution, the standard normal distribution, sampling distributions, and the central limit theorem. Specifically, it explains that inferential statistics is used to draw conclusions about populations based on sample data, the normal distribution is the most common probability distribution, the central limit theorem states that sampling distributions will be approximately normal for large sample sizes, and sampling distributions and the central limit theorem allow statisticians to make inferences about populations from samples.

Uploaded by

Vishal Shivhare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

111 views32 pages

3 - Introduction To Inferential Statistics

Uploaded by

Vishal Shivhare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Data Science

Inferen-al Sta-s-cs
Descrip-ve Sta-s-cs
• It helps organize data and focuses on main characteristics of data
• It provides summary of data numerically
Inferen-al Sta-s-cs
• Inferential statistics is all about describing the larger picture of the analysis
with a limited set of data and deriving conclusions from it.
• Basically, inferential statistics aims at drawing conclusions on populations
based on the taken data samples.
• It uses a random sample of data taken from a population to describe and make
inferences about the population.
Inferen-al Sta-s-cs
• Inferential Statistics is used to draw inferences beyond the immediate data
available.
• In inferential statistics we use methods that rely on probability theory and
distribution helping us to predict, in particular, the population’s values based
on sample data.
• Inferential statistics helps us answer the following questions:
• Making inferences about a population from a sample
• Concluding whether a sample is significantly different from the population
Inferen-al Sta-s-cs
• There are two main areas of inferential statistics:
• Estimating parameters - taking a statistic from your sample data (for example the
sample mean) and using it to find something about a population parameter (i.e. the
population mean).
• Hypothesis tests - use sample data to answer research questions. For example,
finding if a new cancer drug is effective or not. Or if breakfast helps children
perform better in schools.
Inferen-al Sta-s-cs
• Prerequisites for understanding Inferential Statistics -
• Descriptive Statistics
• Probability
• Probability Distributions
Probability Distribu-on
Type of data
• Discrete
• Can take only specified values
• Continuous
• Can take any value within a given range
Terminologies
• Random Variable
• Whose value is determined by the outcome of a random experiment
• Discrete random variable
• Whose set of assumed values is countable (arises from counting)
• Continuous random variable
• Whose set of assumed values is uncountable (arises from measurement.)
Probability Distribu-on
• In statistics, with distribution we usually mean probability distribution
• A probability distribution is a function that shows the possible values for a
variable and how often they occur.
• A probability distribution is a list of all of the possible outcomes of a random
variable along with their corresponding probability values.
• For eg. rolling a dice, tossing a coin, measuring weight of a student etc
Probability Distribu-on
• Kind of variable determines the type of probability distribution -
• Discrete probability distributions for discrete variables
• Probability density functions for continuous variables
Discrete Probability Distribu-on
• Also known as Probability Mass Functions.
• For example - coin tosses, rolling a dice
• Each possible value has a non-zero likelihood
• The probabilities for all possible values must sum to one
Discrete Probability Distribu-on

Image Source - http://mathworld.wolfram.com/Dice.html

Discrete Probability Distribu-on
• There are a variety of discrete probability distributions that you can use to
model different types of data. The correct discrete distribution depends on the
properties of your data.
• Types of Discrete Distribution
• Binomial distribution
• Poisson distribution
• Uniform distribution
Con-nuous Probability Distribu-on
• Also known as Probability Density Functions
• For example - often measurements on a scale, such as height, weight, and
temperature.
• Specific values in continuous distributions have a zero probability. For
example, the likelihood of measuring a temperature that is exactly 32 degrees
is zero (because an individual value has an infinitesimally small probability
that is equivalent to zero).
Con-nuous Probability Distribu-on
• Probabilities for continuous distributions are measured over ranges of values
rather than single points.
• A probability indicates the likelihood that a value will fall within an interval.
• On a probability plot, the entire area under the distribution curve equals 1. This
fact is equivalent to how the sum of all probabilities must equal one for
discrete distributions.
• The most well-known continuous distribution is the Normal Distribution.
Normal Distribu-on
Normal Distribu-on
• A normal distribution is the most common and widely used distribution in
statistics because it approximates to a wide variety of random variables
• It is also called a "bell curve" and "Gaussian curve"
Characteris-cs of Normal Distribu-on
• Mean = Median = Mode
• It is symmetric, perfectly centred around mean
• The area under the curve is 1.
• The entire family of normal distribution is
differentiated by two parameters -
• Mean and
• Standard Deviation
• It is denoted as -
• N ~ (𝜇, 𝝈2)
Characteris-cs of Normal Distribu-on
Standard Normal Distribu-on
Standard Normal Distribu-on
• The standard normal distribution is a special case of the normal distribution -
• when a normal random variable has a mean of zero and
• a standard deviation of one
• So if we shift the mean by 𝜇 and the standard deviation by 𝞂 for any normal
distribution we will arrive at the standard normal distribution. We use the letter
Z to denote it
• z = (X - µ) / σ
• The normal random variable of a standard normal distribution is called a
standard score or a z score.
Standard Normal Distribu-on
• Why do we need standard Normal Distribution -
• Makes predictions and inferences much easier
• Compare different normally distributed datasets;
• Detect normality
• Detect outliers
• Create confidence intervals
• Test Hypothesis
Sampling Distribu-on
Sampling Distribu-on
• A Sampling Distribution is a probability distribution of a statistic obtained
through a large number of samples drawn from a specific population.
• For example, consider a normal population with mean µ and variance σ.
Assume we repeatedly take samples of a given size from this population and
calculate the arithmetic mean for each sample. This statistic is then called the
sample mean. Each sample has its own average value, and the distribution of
these averages is called the “sampling distribution of the sample mean.
Sampling Distribu-on
• A Sampling Distribution behaves much like a normal curve and has some
interesting properties like :
• The shape of the Sampling Distribution does not reveal anything about the shape of
the population.
• Sampling Distribution helps to estimate the population statistic, using Central
Limit Theorem
Central Limit Theorem (CLT)
Sampling Distribu-on
• Sampling distribution can be very useful in making inferences about the overall
population
• To find - how much sample means differ from each other, we’ll use standard deviation
of the sampling distribution
• This standard deviation is called the standard error.
• Standard error (SE) = s/sqrt(n)
Central Limit Theorem
• The central limit theorem states (given that sample size >= 30) -
• The sampling distribution of the sample mean has an approximately normal distribution.
• The mean of the sampling distribution is equals to the population mean
• The standard deviation of the sampling distribution equals the standard deviation in the
population divided by the square root of the sample size (i.e. standard error)
Central Limit Theorem
• Points to note -
• Central Limit Theorem holds true irrespective of the type of distribution of the population.
• Now, we have a way to estimate the population mean by just making repeated observations of
samples of a fixed size.
• Greater the sample size, lower the standard error and greater accuracy in determining the
population mean from the sample mean.
Central Limit Theorem
• Significance of Central Limit Theorem -
• Analyzing data involves statistical methods like hypothesis testing and constructing confidence
intervals. These methods assume that the population is normally distributed. In case of
unknown or non-normal distributions, we treat the sampling distribution as normal according
to the central limit theorem
• If we increase the samples drawn from the population, the standard deviation of sample means
will decrease. This helps us estimate the population mean much more accurately

4th Semester Model MCQ Fourth Paper
No ratings yet
4th Semester Model MCQ Fourth Paper
47 pages
Formal Experimental Research Design
80% (5)
Formal Experimental Research Design
13 pages
Examination - SPC Total Allowed Time:1.5 Hours
No ratings yet
Examination - SPC Total Allowed Time:1.5 Hours
3 pages
Pervasive Negative Effects of Rewards Intrinsic Motivation: The Myth Continues
No ratings yet
Pervasive Negative Effects of Rewards Intrinsic Motivation: The Myth Continues
44 pages
Formula 1
No ratings yet
Formula 1
8 pages
Data Science & Analytics Paper
No ratings yet
Data Science & Analytics Paper
55 pages
Review: Application of The Normal Distribution
No ratings yet
Review: Application of The Normal Distribution
70 pages
Statistika Minggu 3
No ratings yet
Statistika Minggu 3
9 pages
Worksheet November 21 Solutions - 2
No ratings yet
Worksheet November 21 Solutions - 2
8 pages
Unit 20 - Central Tendency and Dispersion (Student)
No ratings yet
Unit 20 - Central Tendency and Dispersion (Student)
13 pages
Sampling Distribution
No ratings yet
Sampling Distribution
102 pages
Independent and Paired Sample T-Test 2
No ratings yet
Independent and Paired Sample T-Test 2
11 pages
Pengaruh Hutang Dan Ekuitas Terhadap Profitabilitas Pada Perusahaan Aneka Industri Yang Terdaftar Di Bursa Efek Indonesia
No ratings yet
Pengaruh Hutang Dan Ekuitas Terhadap Profitabilitas Pada Perusahaan Aneka Industri Yang Terdaftar Di Bursa Efek Indonesia
11 pages
NJC Sampling Lecture Notes
No ratings yet
NJC Sampling Lecture Notes
24 pages
Assignment .2. STA301 Rimsha Hameed
No ratings yet
Assignment .2. STA301 Rimsha Hameed
5 pages
Nurse Professionalism Scale Development and Psycho
No ratings yet
Nurse Professionalism Scale Development and Psycho
17 pages
Work Immersion Notes Samples and Sample Techniques
No ratings yet
Work Immersion Notes Samples and Sample Techniques
4 pages
File004 Hatfield Sample Final Discussion
No ratings yet
File004 Hatfield Sample Final Discussion
16 pages
Unit 8. Data Analysis
No ratings yet
Unit 8. Data Analysis
69 pages
Ekonometrika
No ratings yet
Ekonometrika
5 pages
1 Intro-Statistics
No ratings yet
1 Intro-Statistics
61 pages
ST4250 23S1 Assignment 2
No ratings yet
ST4250 23S1 Assignment 2
2 pages
M3.Normal Distribution - Final PDF
No ratings yet
M3.Normal Distribution - Final PDF
23 pages
Statistical Foundations: SOST70151 - LECTURE 5
No ratings yet
Statistical Foundations: SOST70151 - LECTURE 5
49 pages
Central Limit Theorm
No ratings yet
Central Limit Theorm
101 pages
Central Limit Theorem
100% (3)
Central Limit Theorem
38 pages
E-Note 14653 Content Document 20231228101402AM
No ratings yet
E-Note 14653 Content Document 20231228101402AM
10 pages
Normal Distribution: X e X F
No ratings yet
Normal Distribution: X e X F
30 pages
M3.Normal Distribution - Final PDF
No ratings yet
M3.Normal Distribution - Final PDF
23 pages
Lecture Slides - Inferential Statistics
100% (1)
Lecture Slides - Inferential Statistics
42 pages
Probability Distribution
No ratings yet
Probability Distribution
15 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Lecture 3: Sampling and Sample Distribution
No ratings yet
Lecture 3: Sampling and Sample Distribution
30 pages
5-Introduction To The Normal Distribution (Bell Curve)
No ratings yet
5-Introduction To The Normal Distribution (Bell Curve)
9 pages
Topic 13 STAT 497 LN13 Cointegration
No ratings yet
Topic 13 STAT 497 LN13 Cointegration
70 pages
What Is Distribution?
No ratings yet
What Is Distribution?
4 pages
The Practice of Statistic For Business and Economics Is An Introductory
No ratings yet
The Practice of Statistic For Business and Economics Is An Introductory
15 pages
Business Statistics
No ratings yet
Business Statistics
25 pages
Decsci Reviewer CHAPTER 1: Statistics and Data
No ratings yet
Decsci Reviewer CHAPTER 1: Statistics and Data
7 pages
Research Method Final Exam For Principal College
No ratings yet
Research Method Final Exam For Principal College
3 pages
Normal Distribution:: - Probability - Characteristics and Application of Normal Probability Curve - Sampling Error
No ratings yet
Normal Distribution:: - Probability - Characteristics and Application of Normal Probability Curve - Sampling Error
21 pages
Probability Distributions-Sarin B
No ratings yet
Probability Distributions-Sarin B
20 pages
Manpower Training and Employee Performance in Mellienium Ltdawka, Anambra State
No ratings yet
Manpower Training and Employee Performance in Mellienium Ltdawka, Anambra State
10 pages
5-Introduction To The Normal Distribution (Bell Curve)
No ratings yet
5-Introduction To The Normal Distribution (Bell Curve)
9 pages
Key of Week1 - Lecture Notes
No ratings yet
Key of Week1 - Lecture Notes
10 pages
Classify Sample Observation
No ratings yet
Classify Sample Observation
2 pages
Week 9+10+11
No ratings yet
Week 9+10+11
82 pages
Statistics Unit 6 Notes
No ratings yet
Statistics Unit 6 Notes
10 pages
Vi. Standard Scores and The Normal Distribution
No ratings yet
Vi. Standard Scores and The Normal Distribution
6 pages
Stats Revieew
No ratings yet
Stats Revieew
9 pages
Biostatistics Unit 5. Measure of Skew
No ratings yet
Biostatistics Unit 5. Measure of Skew
38 pages
Normal Distribution & CLT
No ratings yet
Normal Distribution & CLT
3 pages
Thesis Book
No ratings yet
Thesis Book
73 pages
Probability Distribution
No ratings yet
Probability Distribution
10 pages
LQ1 Notes
No ratings yet
LQ1 Notes
15 pages
Ma Statsv2 3
No ratings yet
Ma Statsv2 3
3 pages
COM 201 - Inferential Statistics - 18032022-1
No ratings yet
COM 201 - Inferential Statistics - 18032022-1
58 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
51 pages
Deep Learning - IIT Ropar - Unit 8 - Week 5
No ratings yet
Deep Learning - IIT Ropar - Unit 8 - Week 5
4 pages
Stats
No ratings yet
Stats
3 pages
Review of Chapters 1-5
No ratings yet
Review of Chapters 1-5
21 pages
Assignment-Regression Analysis
No ratings yet
Assignment-Regression Analysis
6 pages
What Is Probability
No ratings yet
What Is Probability
8 pages
2NUBIONormalCurve2T24 25
No ratings yet
2NUBIONormalCurve2T24 25
50 pages
Module01 ProbabilityAndHypothesisTesting
No ratings yet
Module01 ProbabilityAndHypothesisTesting
62 pages
CS3352 Fds
No ratings yet
CS3352 Fds
1 page
Educational Statistics KCA Past Paper 3
No ratings yet
Educational Statistics KCA Past Paper 3
4 pages
Unit V PS QB Internal III (24.05.24) Questions
No ratings yet
Unit V PS QB Internal III (24.05.24) Questions
3 pages
Stat Notes
No ratings yet
Stat Notes
5 pages
2466939-EDA and STATISTICS NOTES
No ratings yet
2466939-EDA and STATISTICS NOTES
15 pages
Box-Plot Template
No ratings yet
Box-Plot Template
11 pages
Statistics For Management 2
No ratings yet
Statistics For Management 2
14 pages
Statistics and Probability
No ratings yet
Statistics and Probability
2 pages
Csc-Reviewer-Stats and Prob
No ratings yet
Csc-Reviewer-Stats and Prob
13 pages
Central Limit Theorem Grade 11 Group 4
No ratings yet
Central Limit Theorem Grade 11 Group 4
7 pages
Statistics and Probability Reviewer
No ratings yet
Statistics and Probability Reviewer
7 pages
Lecture Note On Biostatistics
No ratings yet
Lecture Note On Biostatistics
74 pages
2nd Year Statistics Chapter Wise Test
No ratings yet
2nd Year Statistics Chapter Wise Test
8 pages
Statistics 1
No ratings yet
Statistics 1
9 pages
Week 9
No ratings yet
Week 9
19 pages
Statisticsppt Copy 170221201132
No ratings yet
Statisticsppt Copy 170221201132
30 pages
T Test
No ratings yet
T Test
50 pages
FBA Module 2
No ratings yet
FBA Module 2
27 pages
UNIT - 4 Complete
No ratings yet
UNIT - 4 Complete
77 pages
Ders 1
No ratings yet
Ders 1
34 pages
DMV - Unit I
No ratings yet
DMV - Unit I
44 pages
Research - Stats Notes
No ratings yet
Research - Stats Notes
44 pages
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)

3 - Introduction To Inferential Statistics

Uploaded by

3 - Introduction To Inferential Statistics

Uploaded by

Data Science

Image Source - http://mathworld.wolfram.com/Dice.html

You might also like