Lecture 1and2-Revision Part1
REVISION -1

12 Oct 2023, Dinesh Babu J


SOLVING PREDICTIVE ANALYTICS PROBLEMS USING MACHINE LEARNING…

What is Machine Learning?
◻ “Systematic study of algorithms and systems that improve their
knowledge or performance with experience on certain tasks” [Prof.
Tom Mitchell, CMU]

◻ In a Machine Learning framework, Predictive Analytics problems (Spam detection; ML Score prediction; News article grouping) become tasks

◻ Experience is in the form of data, say past data

◻ Performance: how well does the algorithm predict?


HOW TO BUILD AN ML ALGORITHM
• The previous viewpoint was a requirement viewpoint
• Let us take the engineering viewpoint, i.e. how to build a Machine Learning system
• Machine learning formulation consists of “tasks, dataset, features, and models”
• To start: Pose a suitable task, Collect a good dataset, Extract relevant features
• To solve: Choose a model to implement, Learn a model using the dataset (learning algorithm), use the model to predict (inference algorithm)

…Let’s get started
INFERENCE USING ML

Input: Image → Feature extraction → Feature: Hair length → ML Algo → Output: MALE/FEMALE

What is this ML algorithm? How do we build this algo?
INFERENCE USING ML

Input: Image → Feature extraction → Feature: Hair length → ML Algo → Output: MALE/FEMALE
Input: Voice → Feature extraction → Feature: Pitch → ML Algo → Output: MALE/FEMALE


PREDICTIVE ANALYTICS PROBLEMS E.G.

• Spam detection prediction
• Input: email; Output: Spam or not
• Score prediction (out of 100) in ML Course
• Input: 10th, 12th math marks; Output: Predicted score
• News article group prediction
• Input: Set of news articles; Output: Cluster ID

(Each follows the pipeline: Input → Feature extraction → Feature → ML Algo → Output)
TASK, DATASET, FEATURES
SPAM DETECTION PROBLEM

• Task – Classification {SPAM(+1), HAM(-1)}
• Dataset – {Emails, SPAM/HAM Label}
• Gmail: User flagging
• Features – {x1, x2, …} e.g. frequency of occurrence of certain words (LOTTERY – 10; VIAGRA – 8, …)

Past Email features → Learning Algorithm → Model   (Answers: SPAM/HAM)
New Email features → Inference Algorithm → Question: SPAM/HAM?
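To make the task–dataset–features–model framing concrete, here is a minimal sketch of this pipeline (assuming scikit-learn; the emails, labels, and the test email are invented for illustration):

```python
# A minimal sketch of the spam-detection pipeline, assuming scikit-learn.
# The emails, labels, and the test email are invented for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

past_emails = ["win the lottery now", "meeting at noon",
               "cheap viagra offer", "lunch tomorrow?"]
labels = [+1, -1, +1, -1]                      # SPAM(+1), HAM(-1)

vectorizer = CountVectorizer()                 # features: word-frequency counts
X = vectorizer.fit_transform(past_emails)

model = MultinomialNB().fit(X, labels)         # learning algorithm -> model

x_new = vectorizer.transform(["claim your lottery prize"])
print(model.predict(x_new))                    # inference algorithm, e.g. [1] -> SPAM
```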
TASK, DATASET, FEATURES
ML SCORE PREDICTION PROBLEM

• Task – Regression [0, 100]
• Dataset – {10th, 12th Math scores; ML score}
• Questionnaire to past students
• Features – {x1, x2} e.g. {75, 80}

Past students’ Math marks → Learning Algorithm → Model   (Answers: ML score)
New student’s Math marks → Inference Algorithm → Question: ML score?

Credit: Prof. Srihari
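A matching sketch for the regression pipeline, again assuming scikit-learn; all marks below are invented for illustration:

```python
# A minimal sketch of the score-prediction pipeline, assuming scikit-learn.
# All marks below are invented for illustration.
import numpy as np
from sklearn.linear_model import LinearRegression

X_past = np.array([[75, 80], [60, 65], [90, 92], [70, 68]])  # {10th, 12th math marks}
y_past = np.array([78, 62, 95, 70])                          # ML scores of past students

model = LinearRegression().fit(X_past, y_past)   # learning algorithm -> model

x_new = np.array([[82, 85]])                     # new student's marks
print(model.predict(x_new))                      # inference: predicted ML score
```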
TASK, DATASET, FEATURES
NEWS ITEM GROUPING PROBLEM

• Predictive Clustering task:
• Assign one of {1, 2, …, K} to a news article
• Dataset – set of news articles
• Query Google with “news”
• Features – {x1, x2, x3, x4} e.g. {topic distributions}

Features from collection of news items → Learning Algorithm → Model   (Cluster IDs for every news item)
New article features → Inference Algorithm → Question: Which cluster is best suited?
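A matching sketch for the clustering pipeline, assuming scikit-learn; TF-IDF term weights stand in for the topic-distribution features mentioned above, and the one-line “articles” stand in for real news items:

```python
# A minimal sketch of the news-grouping pipeline, assuming scikit-learn.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

articles = ["election results announced", "team wins championship final",
            "parliament passes new bill", "star striker scores twice"]

vectorizer = TfidfVectorizer()                   # term weights stand in for
X = vectorizer.fit_transform(articles)           # the slides' topic distributions

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)  # learning algorithm
print(kmeans.labels_)                            # cluster ID for every news item

x_new = vectorizer.transform(["minister wins confidence vote"])
print(kmeans.predict(x_new))                     # inference: best-suited cluster
```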
TASKS

             | Output = continuous      | Output = discrete | Solutions
Supervised   | Regression               | Classification    | Deep NN (hierarchical, non-linear model)
             |                          |                   | SVR/SVM, NN (non-linear model)
             |                          |                   | Linear model
Unsupervised | Dimensionality reduction | Clustering        |

HOW TO BUILD AN ML ALGORITHM
• The previous viewpoint was a requirement viewpoint
• Let us take an engineering viewpoint of Machine Learning
• Machine learning consists of “tasks, models, features, and datasets”
• To start: Pose a suitable task, Collect a good dataset, Extract relevant features
• To solve: Choose a model to implement, Learn a model using the dataset (learning algorithm), use the model to predict (inference algorithm)

…Let’s solve
MODEL, LEARNING, INFERENCE ALGORITHM
SOLUTION 1: CLASSIFICATION PROBLEM

• e.g. Spam detection problem
  Past Emails → Learning Algorithm → Answers: Spam or not
• Models: straight line to divide (a linear model)

[Two scatter plots in the (x1, x2) feature plane, each with a candidate separating line]
MODEL, LEARNING, INFERENCE ALGORITHM
SOLUTION 2: REGRESSION PROBLEM

• e.g. ML1 Score prediction problem
  Past students’ 10th Math marks → Learning Algorithm → Answers: ML1 score?
• Models: straight line to fit the data

[Two plots of y vs x1, each with a candidate straight-line fit]
MODEL, LEARNING, INFERENCE ALGORITHM
SOLUTION 3: CLUSTERING PROBLEM

• e.g. News grouping problem
  Collection of news items → Learning Algorithm → Cluster IDs for every news item
• Model: distance based

[Two scatter plots in the (x1, x2) plane: the raw points, then the points grouped into clusters]
EVALUATION METRIC: CLASSIFICATION PROBLEM

• How do we know the solution on the right is better than the one on the left?
• Total misclassifications (left) = 2
• Total misclassifications (right) = 0; so the right is better

[Two scatter plots in the (x1, x2) plane, each with a candidate decision boundary]
EVALUATION METRIC: REGRESSION PROBLEM

$y_{pred_i} = m x_i + b$

$L(m, b) = \sum_{i=1}^{N} (y_i - y_{pred_i})^2$

[Two plots of y vs x, each marking the vertical gap between $y_i$ and $y_{pred_i}$ at $x_i$]
ML EXPERIMENTS

Training Features + Training Labels → Learning Algorithm → Model
Test Features → Inference Algorithm → Predicted Labels
Predicted Labels + Test Labels → Evaluation Algorithm (e.g. Accuracy)
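A minimal sketch of this experiment loop, assuming scikit-learn; the digits dataset stands in for any labelled dataset:

```python
# A minimal sketch of the train/test experiment, assuming scikit-learn;
# the digits dataset is a stand-in for any labelled dataset.
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

model = LogisticRegression(max_iter=5000).fit(X_train, y_train)  # learning algorithm
predicted = model.predict(X_test)                                # inference algorithm
print(accuracy_score(y_test, predicted))                         # evaluation algorithm
```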
REVISIT: SOLVING THE PREDICTIVE ANALYTICS PROBLEM

Machine learning consists of “tasks, models, features, and datasets”

• To start: Pose a suitable task, Collect a good dataset, Extract relevant features
• To solve: Choose a model to implement, Learn a model using the dataset
• Search the space of model parameters and optimise the error measure; e.g. find the best line that minimises the classification error
MODELS AND MODEL PARAMETERS

Bayes Classifier · Logistic Regression · Support Vector Machine
OPTIMIZATION METHODS

• Unconstrained minimisation
• Constrained minimisation
OPTIMIZATION METHODS

• Global vs local optimum – Neural networks
• Single optimum – SVM
MODEL SPACE vs DATA SPACE

$y_{pred_i} = m x_i + b$

$L(m, b) = \sum_{i=1}^{N} (y_i - y_{pred_i})^2$
FITTING A STRAIGHT LINE

Data: $\{x_i, y_i\}_{i=1:N}$
Model: $y_{pred} = a + b x$
Loss: $J(a, b) = \sum_{i=1}^{N} (y_i - (a + b x_i))^2$

[Plot: score in ML course vs 10th math marks, with a candidate straight line]
FITTING A STRAIGHT LINE – COST FUNCTION

CLOSED FORM – MINIMIZE SUM OF SQUARE ERROR

GRADIENT DESCENT – MINIMIZE SUM OF SQUARE ERROR
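A minimal NumPy sketch of the closed-form fit, using invented (x, y) pairs; the formulas below follow from setting the derivatives of J(a, b) to zero:

```python
# A minimal NumPy sketch of the closed-form least-squares fit for
# y_pred = a + b*x; the (x, y) pairs are invented for illustration.
import numpy as np

x = np.array([55.0, 60.0, 70.0, 80.0, 90.0])   # e.g. 10th math marks
y = np.array([58.0, 61.0, 72.0, 79.0, 91.0])   # e.g. ML course scores

# Setting dJ/da = 0 and dJ/db = 0 for J(a,b) = sum_i (y_i - (a + b*x_i))^2
# gives the usual normal equations:
b = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
a = y.mean() - b * x.mean()

print(a, b)              # fitted intercept and slope
print(a + b * 75.0)      # predicted score for a new student with 75 marks
```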
FN MINIMIZATION

Exercise

CLOSED FORM
ITERATIVE METHOD

$\theta^{(new)} = \theta^{(old)} - \mu \, \nabla_\theta J(\theta)$
GRADIENT DESCENT

Cost function: $J(\theta) = 1.2\,(\theta - 2)^2$
Gradient of the cost function: $J'(\theta) = 2.4\,(\theta - 2)$

Gradient descent update, with the gradient evaluated at $\theta = \theta^{(old)}$:
$\theta^{(new)} = \theta^{(old)} - \mu\, J'(\theta^{(old)}) = \theta^{(old)} - \mu \cdot 2.4\,(\theta^{(old)} - 2)$

With $\theta^{(old)} = 1$:
$\theta^{(new)} = 1 - \mu \cdot 2.4\,(1 - 2)$

Case 1: $\mu = 0.1$:  $\theta^{(new)} = 1 - 0.1 \cdot 2.4\,(1 - 2) = 1.24$
Case 2: $\mu = 0.5$:  $\theta^{(new)} = 1 - 0.5 \cdot 2.4\,(1 - 2) = 2.2$
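The same two cases can be checked in a few lines of Python; the cost and gradient are exactly the ones above:

```python
# A short sketch reproducing the worked example: J(theta) = 1.2*(theta - 2)^2,
# so the gradient is J'(theta) = 2.4*(theta - 2).
def grad(theta):
    return 2.4 * (theta - 2.0)

# One update step theta(new) = theta(old) - mu * J'(theta(old)) from theta = 1
for mu in (0.1, 0.5):
    print(mu, 1.0 - mu * grad(1.0))   # 0.1 -> 1.24, 0.5 -> 2.2

# Iterating the update with mu = 0.1 converges to the minimiser theta = 2
theta, mu = 1.0, 0.1
for _ in range(50):
    theta = theta - mu * grad(theta)
print(theta)                          # ~2.0
```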
FITTING A GAUSSIAN
PROBABILISTIC CLASSIFIERS

• Probabilistic classifiers estimate $P(C = C_k \mid x)$
• Given the features x, we want to estimate the probability that the class label C is, say, C1 (i.e. MALE) or C2 (i.e. FEMALE)
DETERMINISTIC VS PROBABILISTIC CLASSIFIERS

Deterministic: Input: Hair length, x → ML Algo → Output: MALE/FEMALE, C
Probabilistic: Input: random variable X → ML Algo → Output: P(MALE|x), P(FEMALE|x)
BAYES CLASSIFIER

• Probabilistic classifiers estimate $P(C = C_k \mid x)$
• Given the feature x, we want to estimate the probability that the class label C is, say, C1 (i.e. MALE) or C2 (i.e. FEMALE)
• Use Bayes theorem, assuming a 1-D feature vector where x1 is continuous
NORMALIZED HISTOGRAM

• To start with, we can bin the continuous values and observe the histogram (after normalizing, i.e. so the elements sum up to 1)
FITTING A GAUSSIAN DISTRIBUTION

• It makes sense to keep only the two parameters μ and σ of the Gaussian distribution and throw away the original data
• Question: How do we estimate μ and σ?
FITTING A GAUSSIAN
DENSITY ESTIMATION TASK: WHICH GAUSSIAN IS THE BEST?

Data: $\{x_i\}_{i=1:N}$   (hair lengths of women)

Model: $p(x_i \mid \mu, \sigma) = \frac{1}{\sigma \sqrt{2\pi}} \, e^{-\frac{(x_i - \mu)^2}{2\sigma^2}}$
MAXIMUM LIKELIHOOD FUNCTION

$p(X \mid \theta) = p(x_1, x_2, \ldots, x_N \mid \theta)$

• Let us define a cost function in terms of the parameters to be estimated
• We should find the value of the parameter which maximizes the probability of observing the given N samples
MAXIMUM LIKELIHOOD

$p(X \mid \theta) = p(x_1, x_2, \ldots, x_N \mid \theta)$   (Definition)

$p(X \mid \theta) = \prod_{i=1}^{N} p(x_i \mid \theta)$   (IID assumption)

$l(\theta) = \ln p(X \mid \theta) = \sum_{i=1}^{N} \ln p(x_i \mid \theta)$   (Take log)

$\hat{\theta} = \arg\max_\theta \, l(\theta)$   (Cost function: maximise log-likelihood)

• Assuming the samples are independently drawn
• Take the logarithm (makes our life easier for further steps; as it is a monotonic function, we are allowed to do so)
MAXIMUM LIKELIHOOD (CLOSED FORM)

$p(X \mid \theta) = p(x_1, x_2, \ldots, x_N \mid \theta)$

$p(X \mid \theta) = \prod_{i=1}^{N} p(x_i \mid \theta)$

$l(\theta) = \ln p(X \mid \theta)$

$\hat{\theta} = \arg\max_\theta \, l(\theta)$; at the maxima the gradient also vanishes:

$\nabla_\theta \, l = \sum_{i=1}^{N} \nabla_\theta \ln p(x_i \mid \theta) = 0$
ML – SINGLE DIMENSIONAL GAUSSIAN

[Derivation slides; completing the estimate is left as home work]

INFERENCE ALGORITHM

[Figure: evaluating the fitted density at a test point, e.g. 0.25 → 25%]
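For the 1-D Gaussian, solving the zero-gradient condition above gives the standard closed-form estimates (sample mean and 1/N sample variance); a minimal NumPy sketch with invented hair-length data:

```python
# A minimal sketch of the closed-form ML estimates for a 1-D Gaussian:
# setting the gradient of the log-likelihood to zero gives the sample mean
# and the 1/N sample variance. The hair lengths are invented for illustration.
import numpy as np

x = np.array([24.0, 30.0, 27.5, 33.0, 26.0, 29.5])   # hair lengths of women

mu_hat = x.mean()                                # (1/N) * sum x_i
sigma_hat = np.sqrt(np.mean((x - mu_hat) ** 2))  # sqrt((1/N) * sum (x_i - mu_hat)^2)
print(mu_hat, sigma_hat)

# Inference: evaluate the fitted density p(x | mu_hat, sigma_hat) at a new point
x_new = 28.0
p = np.exp(-(x_new - mu_hat) ** 2 / (2 * sigma_hat ** 2)) / (sigma_hat * np.sqrt(2 * np.pi))
print(p)
```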
NAÏVE BAYES CLASSIFIER

• Probabilistic classifiers estimate $P(C = C_k \mid x)$
• Use Bayes theorem:

$P(C = C_k \mid x) = \frac{p(x_1, x_2 \mid C = C_k) \, P(C = C_k)}{p(x)}$

• Use the Naïve assumption: given the class label, the features are independent (class-conditional independence):

$P(C = C_k \mid x) = \frac{p(x_1 \mid C = C_k) \, p(x_2 \mid C = C_k) \, P(C = C_k)}{p(x)}$
BAYES CLASSIFIER

• Probabilistic classifiers estimate $P(C = C_k \mid x)$
• Given the 2-D features, we want to estimate the probability that the class label C is, say, C1 (i.e. MALE) or C2 (i.e. FEMALE)
• Use Bayes theorem, assuming a 2-D feature vector:

$P(C = C_k \mid x) = \frac{p(x_1, x_2 \mid C = C_k) \, P(C = C_k)}{p(x)}$
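A minimal sketch of the naïve Bayes computation with Gaussian class conditionals for the two features (hair length, pitch); all parameter values and the test point are invented for illustration:

```python
# A minimal sketch of a naive Bayes classifier with Gaussian class
# conditionals for the two features (hair length, pitch); all parameter
# values and the test point are invented for illustration.
import numpy as np

def gauss(x, mu, sigma):
    # 1-D Gaussian density p(x | mu, sigma)
    return np.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (sigma * np.sqrt(2 * np.pi))

# Class-conditional (mu, sigma) per feature, as would be estimated from labelled data
params = {
    "MALE":   {"hair": (10.0, 5.0), "pitch": (120.0, 20.0), "prior": 0.5},
    "FEMALE": {"hair": (30.0, 8.0), "pitch": (210.0, 30.0), "prior": 0.5},
}

x1, x2 = 25.0, 190.0   # new sample: hair length, pitch

# Naive assumption: p(x1, x2 | C) = p(x1 | C) * p(x2 | C)
scores = {c: gauss(x1, *p["hair"]) * gauss(x2, *p["pitch"]) * p["prior"]
          for c, p in params.items()}
evidence = sum(scores.values())          # p(x), the normalising constant
for c, s in scores.items():
    print(c, s / evidence)               # posterior P(C = C_k | x)
```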
