0% found this document useful (0 votes)

20 views30 pages

1.4.1. Estimation and Inference

Uploaded by

havietthang02

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views30 pages

1.4.1. Estimation and Inference

Uploaded by

havietthang02

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 30

Estimation and Inference

1
Learning Goals
In this section, we will cover:
- Statistical estimation and inference
- Parametric and non-parametric approaches to modeling
- Common statistical distributions
- Frequentist vs. Bayesian statistics

2
Estimation vs. Inference
Estimation: is the application of an algorithm, for example taking an average:

Inference: involves putting an accuracy on the estimate

(e.g. standard error of an average):

3
Machine Learning and Statistical Inference
Machine learning and statistical inference are similar
(a case of computer science borrowing from a long history in statistics).

In both cases, we're using data to learn/infer qualities of a distribution that

generated the data (often termed the data-generating process).

We may care either about the whole distribution or just features (e.g. mean).

Machine learning applications that focus on understanding parameters and

individual effects involve more tools from statistical inference (some
applications are focused only on results).

4
Example: Customer Churn
Customer churn occurs when a customer leaves a company

Data related to churn may include a target variable for

whether or not the customer left

Features could include:

-The length of time as a customer
- The type and amount purchased
- Other customer characteristics

Churn prediction is often approached by predicting a score

for individuals that estimates the probability the customer will
leave.
5
Customer Churn: Estimation
Estimation of factors driving customer churn
involves measuring the impact of each factor in
predicting churn

Inference involves determining whether these

measured impacts are statistically significant

6
Customer Churn: Example Dataset
IBM Cognos Customer Churn Dataset:
- Data from fictional telecommunications firm

- Includes account type, customer characteristics,

revenue per customer, satisfaction score, estimate
of customer lifetime value

- Includes information on whether customer

churned (and some categories of churn type)

7
Customer Churn Example: Plotting

8
Customer Churn Example: Plotting

9
Customer Churn Example: Plotting

10
Customer Churn Example: Plotting

11
Parametric vs. Non-parametric

If inference is about trying to find out the Data-Generating Process

(DGP), then we can say that a statistical model (of the data) is a set of
possible distributions or maybe even regressions.
A parametric model is a particular type of statistical model: it's also a set
of distributions or regressions, but they have a finite number of
parameters.

12
Non-parametric Statistics

In non-parametric statistics, we make fewer assumptions.

In particular, we don't assume that the data belong to any particular
distribution (also called distribution-free inference).
This doesn't mean that we know nothing, though!

13
Non-parametric Inference

An example of non-parametric inference is creating a distribution of

the data (CDF or cumulative distribution function) using a histogram.
In this case, we're not specifying parameters.

14
Parametric Models
A parametric model is a particular type of statistical model: it's also a
set of distributions or regressions, but they have a finite number of
parameters.
An example of a parametric model: the Normal Distribution.

15
Example: Customer Lifetime Value
Customer lifetime value is an estimate of the
customer's value to the company

Data related to customer lifetime value might include:

- The expected length of time as a customer
- The expected amount spent over time

To estimate lifetime value, we make assumptions

about the data

These assumptions can be parametric (assuming a

specific distribution), or non- parametric

16
Parametric Models: Maximum Likelihood
The most common way of estimating parameters in a parametric model
is through maximum likelihood estimation (MLE).

The likelihood function is related to probability and is a function of the

parameters of the model:

17
Parametric Models: Maximum Likelihood
We choose the value of 0 (parameters) that maximizes the likelihood function.

18
Commonly Used Distributions

19
Commonly Used Distributions

20
Commonly Used Distributions

21
Commonly Used Distributions

22
Commonly Used Distributions

23
Frequentist vs. Bayesian Statistics
A Frequentist is concerned with repeated observations in the limit.

Processes may have true frequencies, but we're interested in modeling

probabilities as many, many repeats of an experiment.

Frequentist approach:
1. Derive the probabilistic property of a procedure
2. Apply the probability directly to the observed data

24
Frequentist vs. Bayesian: Bayesian
A Bayesian describes parameters by probability distributions.

Before seeing any data, a prior distribution (based on the

experimenters' belief) is formulated.

This prior distribution is then updated after seeing data (a sample

from the distribution).

After updating, the distribution is called the posterior distribution.

25
Frequentist vs. Bayesian: Bayesian
We will consider two examples of probabilistic systems:
● Coin flips - What is the probability of an unfair coin coming up
heads?
● Election of a particular candidate for UK Prime Minister - What
is the probability of seeing an individual candidate winning, who has
not stood before?

26
Frequentist vs. Bayesian: Bayesian

27
Frequentist vs. Bayesian Statistics
We use much of the same math and the same formulas in both
Frequentist and Bayesian statistics.

The element that differs is the interpretation.

We will point out the difference in interpretation, where appropriate.

28
Summary
● Estimation and Inference
○ Inferential Statistics consist in learning characteristics of the population from a
sample. The population characteristics are parameters, while the sample
characteristics are statistics. A parametric model, uses a certain number of
parameters like mean and standard deviation.
○ The most common way of estimating parameters in a parametric model is through
maximum likelihood estimation.
○ Through a hypothesis test, you test for a specific value of the parameter.
○ Estimation represents a process of determining a population parameter based on a
model fitted to the data.
○ The most common distribution functions are: uniform, normal, log normal,
exponential, and poisson.
○ A frequentist approach focuses in observing man repeats of an experiment. A
29
bayesian approach describes parameters through probability distributions.
Learning Recap
In this section, we discussed:
- Statistical estimation and inference
- Parametric and non-parametric approaches to modeling
- Common statistical distributions
- Frequentist vs. Bayesian statistics

Data Fusion With The Linear Kalman Filter Slides
No ratings yet
Data Fusion With The Linear Kalman Filter Slides
325 pages
Unit I Predictive Analytics
No ratings yet
Unit I Predictive Analytics
39 pages
DAV - Technical Book
No ratings yet
DAV - Technical Book
137 pages
Lecture 1
No ratings yet
Lecture 1
54 pages
Week 1 Course Material
No ratings yet
Week 1 Course Material
15 pages
DSOST2
No ratings yet
DSOST2
44 pages
FDS Sem5
No ratings yet
FDS Sem5
15 pages
Ass-3 Ds
No ratings yet
Ass-3 Ds
7 pages
Business Statistics
No ratings yet
Business Statistics
137 pages
Merge
No ratings yet
Merge
240 pages
To Statistical Inference: by Dr. Saddam Hussain
No ratings yet
To Statistical Inference: by Dr. Saddam Hussain
25 pages
Data Analysis: - Describing Data and Datasets
No ratings yet
Data Analysis: - Describing Data and Datasets
15 pages
Introduction - Lecture Slides-1
No ratings yet
Introduction - Lecture Slides-1
12 pages
Statistics
No ratings yet
Statistics
13 pages
Lecture 4 - Data Science Statistics
No ratings yet
Lecture 4 - Data Science Statistics
21 pages
Ch2 Statistical Learning
No ratings yet
Ch2 Statistical Learning
51 pages
Practical Statistical Questions: Session 1
No ratings yet
Practical Statistical Questions: Session 1
40 pages
Unit 2
No ratings yet
Unit 2
20 pages
To Statistics To Statistics: Objectives
No ratings yet
To Statistics To Statistics: Objectives
10 pages
Applications of Inference Statistics
No ratings yet
Applications of Inference Statistics
28 pages
Week2 StatisticalLearning
No ratings yet
Week2 StatisticalLearning
46 pages
Data Mining Techniques
No ratings yet
Data Mining Techniques
33 pages
QM Full Notes
No ratings yet
QM Full Notes
177 pages
Statistics Unit1ppt
No ratings yet
Statistics Unit1ppt
94 pages
Nonparametric Statistics Michaelmas 2024-25
No ratings yet
Nonparametric Statistics Michaelmas 2024-25
71 pages
Part 1 - Basic Statistics
No ratings yet
Part 1 - Basic Statistics
44 pages
Igual-SeguÃ 2017 Chapter StatisticalInference
No ratings yet
Igual-SeguÃ 2017 Chapter StatisticalInference
15 pages
Lecture1 Introduction
No ratings yet
Lecture1 Introduction
49 pages
Data Analytics Chat GPT
No ratings yet
Data Analytics Chat GPT
75 pages
Session 1 BSDM
No ratings yet
Session 1 BSDM
17 pages
Final Correction Basic Statistics Combined Chapter
No ratings yet
Final Correction Basic Statistics Combined Chapter
130 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
54 pages
Chapter1 Introduction
No ratings yet
Chapter1 Introduction
38 pages
SI Chapter-2
No ratings yet
SI Chapter-2
53 pages
Unit - II - Part I - Importance of Statistics in Data Science
No ratings yet
Unit - II - Part I - Importance of Statistics in Data Science
10 pages
Chapter1 K57 S
No ratings yet
Chapter1 K57 S
80 pages
Statistics Lec 1
No ratings yet
Statistics Lec 1
28 pages
Descriptive Analytics I: Nature of Data,: Statistical Modeling, and Visualization
No ratings yet
Descriptive Analytics I: Nature of Data,: Statistical Modeling, and Visualization
76 pages
Input Modeling For Simulation
No ratings yet
Input Modeling For Simulation
48 pages
Stat Lecture 2
No ratings yet
Stat Lecture 2
6 pages
مبادئ الاحصاء
No ratings yet
مبادئ الاحصاء
66 pages
Chapter 1 Slides
No ratings yet
Chapter 1 Slides
40 pages
Simple Regression Analysis
No ratings yet
Simple Regression Analysis
13 pages
Foundations of Business Analytics Vnuk - Fba
No ratings yet
Foundations of Business Analytics Vnuk - Fba
44 pages
7 Input Modeling 2024
No ratings yet
7 Input Modeling 2024
90 pages
CH Ii Business Stat
No ratings yet
CH Ii Business Stat
28 pages
Statistical Foundation For Analytics-Module 1
No ratings yet
Statistical Foundation For Analytics-Module 1
18 pages
Chapter2 BI
No ratings yet
Chapter2 BI
77 pages
Statistics - Unit1 PDF
No ratings yet
Statistics - Unit1 PDF
94 pages
Lec08 2025
No ratings yet
Lec08 2025
43 pages
Introduction To Statistical Modeling With SAS/STAT Software
No ratings yet
Introduction To Statistical Modeling With SAS/STAT Software
60 pages
DS Unit 1
No ratings yet
DS Unit 1
99 pages
Chapter 1 Slides PDF
No ratings yet
Chapter 1 Slides PDF
45 pages
Business Statistics - Prof. Dr. Mukesh Kumar Barua
100% (1)
Business Statistics - Prof. Dr. Mukesh Kumar Barua
991 pages
1 - Introduction To Statistics - June-22, 2011 (Compatibility Mode)
No ratings yet
1 - Introduction To Statistics - June-22, 2011 (Compatibility Mode)
12 pages
Business Research 2
No ratings yet
Business Research 2
8 pages
Statistic e Book
No ratings yet
Statistic e Book
61 pages
Introduction Key Concepts
No ratings yet
Introduction Key Concepts
37 pages
Statistics
No ratings yet
Statistics
53 pages
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
4.1.1. Introduction To Unsupervised Learning
No ratings yet
4.1.1. Introduction To Unsupervised Learning
26 pages
1.1.2. Modern AI - Applications and Machine Learning Workflow
No ratings yet
1.1.2. Modern AI - Applications and Machine Learning Workflow
26 pages
1.2.1. Retrieving Data - 1.2.2. Cleaning Data
No ratings yet
1.2.1. Retrieving Data - 1.2.2. Cleaning Data
35 pages
1.3.1. Exploratory Data Analysis
No ratings yet
1.3.1. Exploratory Data Analysis
24 pages
Chapman-Kolmogorov Equations 28 Likelihood Intervals Are 48511
No ratings yet
Chapman-Kolmogorov Equations 28 Likelihood Intervals Are 48511
9 pages
A Tutorial On MM Algorithms
No ratings yet
A Tutorial On MM Algorithms
9 pages
ODE Estimation PDF
No ratings yet
ODE Estimation PDF
41 pages
Presentation Generalized Linear Model Theory
No ratings yet
Presentation Generalized Linear Model Theory
77 pages
Detail Syllabus For B.A. Part II Honours Anthropology Honours
No ratings yet
Detail Syllabus For B.A. Part II Honours Anthropology Honours
36 pages
Psychophysics Notes
No ratings yet
Psychophysics Notes
48 pages
ML and Ls
No ratings yet
ML and Ls
2 pages
Subject Guidelines Maths Literacy L4.GSeditV2
No ratings yet
Subject Guidelines Maths Literacy L4.GSeditV2
13 pages
Quantum Informatics View of Statistical Data Processing: Yu. I. Bogdanov, N. A. Bogdanova
No ratings yet
Quantum Informatics View of Statistical Data Processing: Yu. I. Bogdanov, N. A. Bogdanova
7 pages
601 sp09 Midterm Solutions
No ratings yet
601 sp09 Midterm Solutions
14 pages
Uncertainty Notes
No ratings yet
Uncertainty Notes
7 pages
UNIT1
No ratings yet
UNIT1
38 pages
Gumbel-Weibull Distribution - Properties and Applications
No ratings yet
Gumbel-Weibull Distribution - Properties and Applications
26 pages
DS100-2-Grp#4 Chapter 6 Advanced Analytical Theory and Methods Regression (CADAY, CASTOR, CRUZ, SANORIA, TAN)
No ratings yet
DS100-2-Grp#4 Chapter 6 Advanced Analytical Theory and Methods Regression (CADAY, CASTOR, CRUZ, SANORIA, TAN)
4 pages
Unit 2 Probabilistic Reasoning
No ratings yet
Unit 2 Probabilistic Reasoning
21 pages
Mixed Logit or Random Parameter Logit Model
No ratings yet
Mixed Logit or Random Parameter Logit Model
12 pages
Statistical Learning in Genetics An Introduction Using R Reference Book Download
No ratings yet
Statistical Learning in Genetics An Introduction Using R Reference Book Download
16 pages
RP ch07
No ratings yet
RP ch07
29 pages
What Uncertainties Do We Need in Bayesian Deep Learning For Computer Vision?
No ratings yet
What Uncertainties Do We Need in Bayesian Deep Learning For Computer Vision?
12 pages
Liu Columbia 0054D 10924 PDF
No ratings yet
Liu Columbia 0054D 10924 PDF
148 pages
Ejercicios Resueltos de Inferencia Estadistica
No ratings yet
Ejercicios Resueltos de Inferencia Estadistica
229 pages
Chapter 10
No ratings yet
Chapter 10
36 pages
The Math Behind TrueSkill
No ratings yet
The Math Behind TrueSkill
57 pages
Identifying Growth Patterns of The High-Tech Manufacturing Industry Across The Seoul Metropolitan Area Using Latent Class Analysis
No ratings yet
Identifying Growth Patterns of The High-Tech Manufacturing Industry Across The Seoul Metropolitan Area Using Latent Class Analysis
10 pages
Likelihood Ratio Tests: Instructor: Songfeng Zheng
No ratings yet
Likelihood Ratio Tests: Instructor: Songfeng Zheng
9 pages
Edu 2008 Spring C Questions
No ratings yet
Edu 2008 Spring C Questions
180 pages
How To Do A Logistic Regression in Excel
No ratings yet
How To Do A Logistic Regression in Excel
13 pages
BenchmarkingDefaultPredictionModels TR030124
No ratings yet
BenchmarkingDefaultPredictionModels TR030124
37 pages
Instrumental Variables Bowden Download
100% (1)
Instrumental Variables Bowden Download
81 pages

1.4.1. Estimation and Inference

Uploaded by

1.4.1. Estimation and Inference

Uploaded by

Estimation and Inference

Inference: involves putting an accuracy on the estimate

In both cases, we're using data to learn/infer qualities of a distribution that

Machine learning applications that focus on understanding parameters and

Data related to churn may include a target variable for

Features could include:

Churn prediction is often approached by predicting a score

Inference involves determining whether these

- Includes account type, customer characteristics,

- Includes information on whether customer

If inference is about trying to find out the Data-Generating Process

In non-parametric statistics, we make fewer assumptions.

An example of non-parametric inference is creating a distribution of

Data related to customer lifetime value might include:

To estimate lifetime value, we make assumptions

These assumptions can be parametric (assuming a

The likelihood function is related to probability and is a function of the

Processes may have true frequencies, but we're interested in modeling

Before seeing any data, a prior distribution (based on the

This prior distribution is then updated after seeing data (a sample

After updating, the distribution is called the posterior distribution.

The element that differs is the interpretation.

We will point out the difference in interpretation, where appropriate.

You might also like