Unit 7 - 2
• P(Y, X) = P(Y, x1, x2, …, xn)
Discriminative and Generative Models
• Now, our goal is to estimate the probability that an email is spam, i.e., P(Y=1|X).
• Both generative and discriminative models can solve this problem but in
different ways.
• The approach of Generative Models
• In the case of generative models, to find the conditional probability P(Y|X), they estimate the prior probability P(Y) and the likelihood P(X|Y) from the training data and use Bayes' theorem to calculate the posterior probability P(Y|X):
• P(Y|X) = P(X|Y) · P(Y) / P(X)
Discriminative and Generative Models
• In the case of discriminative models, to find the probability, they directly
assume some functional form for P(Y|X) and then estimate the parameters
of P(Y|X) with the help of the training data.
• The discriminative model refers to a class of models used in Statistical
Classification, mainly used for supervised machine learning.
• These types of models are also known as conditional models since they
learn the boundaries between classes or labels in a dataset.
Discriminative and Generative Models
• Discriminative models (as the literal meaning suggests) learn the boundary that separates the classes rather than modeling how the data itself is distributed, and they make few assumptions about the data points.
• However, these models are not capable of generating new data points. The ultimate objective of discriminative models is therefore to separate one class from another.
• If some outliers are present in the dataset, discriminative models work better than generative models, i.e., discriminative models are more robust to outliers. However, one major drawback of these models is the misclassification problem, i.e., wrongly classifying a data point.
Discriminative and Generative Models
Generative Models
• Generative models are a class of statistical models that can generate new data instances. They are often used in unsupervised machine learning for tasks such as estimating probabilities and likelihoods, modeling data points, and distinguishing between classes.
• A generative classifier works as follows:
• Assume some functional form for the probabilities P(Y) and P(X|Y)
• With the help of the training data, estimate the parameters of P(X|Y) and P(Y)
• Use Bayes' theorem to calculate the posterior probability P(Y|X)
• Some Examples of Generative Models
• Naïve Bayes
• Generative Adversarial Networks (GANs)
• Hidden Markov Models (HMMs)
Difference between Discriminative and
Generative Models
• Discriminative models draw boundaries in the data space, while
generative models try to model how data is placed throughout the space.
• A generative model focuses on explaining how the data was generated,
while a discriminative model focuses on predicting the labels of the data.
• In mathematical terms, a discriminative model is trained by learning parameters that maximize the conditional probability P(Y|X), while a generative model learns its parameters by maximizing the joint probability P(X, Y).
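• As a quick illustration (not from the slides), the following Python sketch fits both kinds of model on the same data: LogisticRegression is a discriminative model that learns P(Y|X) directly, while GaussianNB is a generative model that estimates P(Y) and P(X|Y) and applies Bayes' theorem. The dataset and settings are illustrative only.

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB

# Illustrative synthetic data (not the spam example from the slides).
X, y = make_classification(n_samples=200, n_features=4, random_state=0)

discriminative = LogisticRegression().fit(X, y)   # learns P(Y|X) directly
generative = GaussianNB().fit(X, y)               # learns P(Y) and P(X|Y)

# Both can report a posterior P(Y|X); they just arrive at it differently.
print(discriminative.predict_proba(X[:1]))
print(generative.predict_proba(X[:1]))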
Difference between Discriminative and
Generative Models
• Discriminative models work with existing data, i.e., discriminative modeling identifies tags and sorts data and can be used to classify data, while generative modeling produces new data.
• Outliers have a larger impact on generative models than on discriminative models.
• Discriminative models are computationally cheaper than generative models.
Naïve Bayes Algorithm
• The Naïve Bayes algorithm is a supervised learning algorithm based on Bayes' theorem and used for solving classification problems.
• It is mainly used in text classification, which involves high-dimensional training datasets.
• The Naïve Bayes classifier is one of the simplest and most effective classification algorithms, and it helps in building fast machine learning models that can make quick predictions.
• It is a probabilistic classifier, which means it predicts on the basis of the probability of an object.
Naïve Bayes Algorithm
• Bayes Theorem
• Bayes’ Theorem is a simple mathematical formula used for calculating
conditional probabilities.
• Conditional probability is a measure of the probability of an event
occurring given that another event has (by assumption, presumption,
assertion, or evidence) occurred.
Naïve Bayes Algorithm
Naïve Bayes Algorithm
• Bayes' Theorem: P(A|B) = P(B|A) · P(A) / P(B)
• It tells us how often A happens given that B happens, written P(A|B) and also called the posterior probability, when we know: how often B happens given that A happens, written P(B|A); how likely A is on its own, written P(A); and how likely B is on its own, written P(B).
• In simpler terms, Bayes' Theorem is a way of finding a probability when we know certain other probabilities.
Naïve Bayes Algorithm
• The fundamental Naïve Bayes assumption is that each feature makes an:
• independent
• equal
contribution to the outcome.
• Let us take an example to get some better intuition. Consider the car theft problem with attributes Color, Type, and Origin, and the target Stolen, which can be either Yes or No.
Naïve Bayes Algorithm
Naïve Bayes Algorithm
• Concerning our dataset, the concept of assumptions made by the
algorithm can be understood as:
• We assume that no pair of features are dependent. For example, the color
being ‘Red’ has nothing to do with the Type or the Origin of the car.
Hence, the features are assumed to be Independent.
• Secondly, each feature is given the same influence (or importance). For example, knowing only the Color and Type alone can't predict the outcome perfectly. So none of the attributes is irrelevant, and all are assumed to contribute equally to the outcome.
Naïve Bayes Algorithm
• The assumptions made by Naïve Bayes are generally not correct in real-world situations. The independence assumption is never correct but often works well in practice. Hence the name 'Naïve'.
• Here in our dataset, we need to classify whether the car is stolen, given
the features of the car. The columns represent these features and the rows
represent individual entries. If we take the first row of the dataset, we can observe that the car is stolen if the Color is Red, the Type is Sports, and the Origin is Domestic. So we want to classify whether a Red Domestic SUV will get stolen or not. Note that there is no example of a Red Domestic SUV in our dataset.
Naïve Bayes Algorithm
• Now, you can obtain the values for each by looking at the dataset and
substitute them into the equation.
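• The equation referred to here is the Naïve Bayes rule for features X = (x1, x2, …, xn); under the independence assumption it reduces to P(y | x1, …, xn) ∝ P(y) · P(x1 | y) · P(x2 | y) · … · P(xn | y), and we predict the class y with the largest value.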
Naïve Bayes Algorithm
• The posterior probability P(y|X) can be calculated by first creating a Frequency Table for each attribute against the target.
• Then, the Frequency Tables are converted into Likelihood Tables, and finally the Naïve Bayes equation is used to calculate the posterior probability for each class.
• The class with the highest posterior probability is the outcome of the prediction.
• Below are the Frequency and likelihood tables for all three predictors.
Naïve Bayes Algorithm
• Since 0.144 > 0.048, given the features Red, SUV, and Domestic, our example gets classified as 'No', i.e., the car is not stolen.
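• A minimal Python sketch of this computation is shown below. The 10-row car-theft table is the one this example is usually stated with; it is reproduced here as an assumption (the slide's own table is not included in this text), and with these rows it yields exactly the 0.144 vs 0.048 comparison above.

# By-hand Naive Bayes for the car-theft example.
data = [
    # (Color, Type, Origin, Stolen)
    ("Red",    "Sports", "Domestic", "Yes"),
    ("Red",    "Sports", "Domestic", "No"),
    ("Red",    "Sports", "Domestic", "Yes"),
    ("Yellow", "Sports", "Domestic", "No"),
    ("Yellow", "Sports", "Imported", "Yes"),
    ("Yellow", "SUV",    "Imported", "No"),
    ("Yellow", "SUV",    "Imported", "Yes"),
    ("Yellow", "SUV",    "Domestic", "No"),
    ("Red",    "SUV",    "Imported", "No"),
    ("Red",    "Sports", "Imported", "Yes"),
]

def likelihood(feature_index, value, label):
    # P(feature = value | Stolen = label), estimated from frequency counts.
    rows = [r for r in data if r[3] == label]
    return sum(r[feature_index] == value for r in rows) / len(rows)

query = ("Red", "SUV", "Domestic")          # the Red Domestic SUV
for label in ("Yes", "No"):
    score = 1.0
    for i, value in enumerate(query):
        score *= likelihood(i, value, label)
    # The priors P(Yes) = P(No) = 0.5 are equal, so they do not change the ranking.
    print(label, round(score, 3))           # Yes -> 0.048, No -> 0.144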
Support Vector Machines
• A Support Vector Machine (SVM) is a powerful and versatile Machine
Learning model, capable of performing linear or nonlinear classification,
regression, and even outlier detection.
• SVMs are particularly well suited for classification of complex small- or medium-sized datasets.
• Linear SVM Classification
Linear SVM Classification
• The two classes can clearly be separated easily with a straight line (they are
linearly separable). The left plot shows the decision boundaries of three possible
linear classifiers. The model whose decision boundary is represented by the
dashed line is so bad that it does not even separate the classes properly. The other
two models work perfectly on this training set, but their decision boundaries
come so close to the instances that these models will probably not perform as
well on new instances.
• In contrast, the solid line in the plot on the right represents the decision boundary
of an SVM classifier; this line not only separates the two classes but also stays as
far away from the closest training instances as possible. You can think of an SVM
classifier as fitting the widest possible street (represented by the parallel dashed
lines) between the classes. This is called large margin classification.
Linear SVM Classification
• Notice that adding more training instances “off the street” will not affect
the decision boundary at all: it is fully determined (or “supported”) by the
instances located on the edge of the street. These instances are called the
support vectors (they are circled in Figure 5-1).
Linear SVM Classification
• SVMs are sensitive to the feature scales, as you can see in Figure 5-2: in
the left plot, the vertical scale is much larger than the horizontal scale, so
the widest possible street is close to horizontal.
• After feature scaling (e.g., using Scikit-Learn’s StandardScaler), the
decision boundary in the right plot looks much better.
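• A minimal scikit-learn sketch of this preprocessing step (the variable X stands for whatever training features you are using):

from sklearn.preprocessing import StandardScaler

# Rescale every feature to zero mean and unit variance so that no single
# feature dominates the width of the street.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)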
Soft Margin Classification
• If we strictly impose that all instances must be off the street and on the
right side, this is called hard margin classification. There are two main
issues with hard margin classification. First, it only works if the data is
linearly separable. Second, it is sensitive to outliers. Figure 5-3 shows the
iris dataset with just one additional outlier: on the left, it is impossible to
find a hard margin; on the right, the decision boundary ends up very
different from the one we saw in Figure 5-1 without the outlier, and it will
probably not generalize as well.
Soft Margin Classification
Soft Margin Classification
• To avoid these issues, use a more flexible model. The objective is to find
a good balance between keeping the street as large as possible and
limiting the margin violations (i.e., instances that end up in the middle of
the street or even on the wrong side).
• This is called soft margin classification
Soft Margin Classification
• When creating an SVM model using Scikit-Learn, we can specify a
number of hyperparameters. C is one of those hyperparameters. If we set
it to a low value, then we end up with the model on the left of Figure 5-4.
• With a high value, we get the model on the right. Margin violations are
bad. It’s usually better to have few of them. However, in this case the
model on the left has a lot of margin violations but will probably
generalize better.
• If your SVM model is overfitting, you can try regularizing it by reducing
C.
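• A minimal sketch of soft margin classification in scikit-learn, following the iris setup the figures describe (petal length and width, Iris virginica vs. the rest); C=1 is a fairly low value, so expect a wide street with some margin violations:

import numpy as np
from sklearn import datasets
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVC

iris = datasets.load_iris()
X = iris["data"][:, (2, 3)]                       # petal length, petal width
y = (iris["target"] == 2).astype(np.float64)      # Iris virginica

svm_clf = Pipeline([
    ("scaler", StandardScaler()),                 # SVMs are sensitive to scale
    ("linear_svc", LinearSVC(C=1, loss="hinge")), # lower C -> wider street
])
svm_clf.fit(X, y)
print(svm_clf.predict([[5.5, 1.7]]))              # e.g. [1.]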
Soft Margin Classification
Nonlinear SVM Classification
• Although linear SVM classifiers are efficient and work surprisingly well in many cases, many datasets are not even close to being linearly separable. One approach to handling nonlinear datasets is to add more features, such as polynomial features; in some cases this can result in a linearly separable dataset.
• Consider the left plot in Figure 5-5: it represents a simple dataset with just
one feature, x1. This dataset is not linearly separable, as you can see. But
if you add a second feature x2 = (x1)^2, the resulting 2D dataset is perfectly linearly separable.
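• One way to implement this idea is a Pipeline that adds polynomial features, scales them, and then fits a linear SVM. The sketch below uses the moons dataset and illustrative hyperparameter values, which are assumptions, not taken from the slides:

from sklearn.datasets import make_moons
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.svm import LinearSVC

X, y = make_moons(n_samples=100, noise=0.15, random_state=42)

polynomial_svm_clf = Pipeline([
    ("poly_features", PolynomialFeatures(degree=3)),  # add squared, cubed, cross terms
    ("scaler", StandardScaler()),
    ("svm_clf", LinearSVC(C=10, loss="hinge", max_iter=10000)),
])
polynomial_svm_clf.fit(X, y)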
Nonlinear SVM Classification
Nonlinear SVM Classification
Polynomial Kernel
• Adding polynomial features is simple to implement and can work great with
all sorts of Machine Learning algorithms (not just SVMs). That said, at a
low polynomial degree, this method cannot deal with very complex datasets,
and with a high polynomial degree it creates a huge number of features,
making the model too slow.
• Fortunately, when using SVMs you can apply an almost miraculous
mathematical technique called the kernel trick (explained in a moment). The
kernel trick makes it possible to get the same result as if you had added
many polynomial features, even with very high-degree polynomials, without
actually having to add them. So there is no combinatorial explosion of the
number of features because you don’t actually add any features.
Polynomial Kernel
• An SVM classifier using a third-degree polynomial kernel is shown on the left in Figure 5-7. On the right is another SVM classifier using a 10th-degree polynomial kernel. Obviously, if your model is overfitting, you might want to reduce the polynomial degree. Conversely, if it is underfitting, you can try increasing it.
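• A sketch of the third-degree polynomial kernel classifier described here (hyperparameter values are illustrative; X and y could be, for example, the moons dataset from the earlier sketch):

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

poly_kernel_svm_clf = Pipeline([
    ("scaler", StandardScaler()),
    # Kernel trick: behaves as if degree-3 polynomial features had been added,
    # without actually creating them; coef0 balances high- vs low-degree terms.
    ("svm_clf", SVC(kernel="poly", degree=3, coef0=1, C=5)),
])
poly_kernel_svm_clf.fit(X, y)   # X, y: any two-class training set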
Similarity Features
• Another technique to tackle nonlinear problems is to add features
computed using a similarity function, which measures how much each
instance resembles a particular landmark. For example, let’s take the 1D
dataset discussed earlier and add two landmarks to it at x1 = –2 and x1 =
1 (see the left plot in Figure 5-8). Next, let’s define the similarity function
to be the Gaussian Radial Basis Function (RBF) with γ = 0.3 (see
Equation 5-1).
Similarity Features
Similarity Features
• This is a bell-shaped function varying from 0 (very far away from the
landmark) to 1 (at the landmark). Now we are ready to compute the new
features. For example, let’s look at the instance x1 = –1: it is located at a
distance of 1 from the first landmark and 2 from the second landmark.
• Therefore its new features are x2 = exp(–0.3 × 1^2) ≈ 0.74 and x3 = exp(–0.3 × 2^2) ≈ 0.30. The plot on the right in Figure 5-8 shows the
transformed dataset (dropping the original features). As you can see, it is
now linearly separable.
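• A few lines of Python reproduce this worked example (gamma = 0.3, landmarks at –2 and 1, instance x1 = –1):

import numpy as np

def rbf_similarity(x, landmark, gamma=0.3):
    # Gaussian RBF from Equation 5-1: exp(-gamma * ||x - landmark||^2)
    return np.exp(-gamma * (x - landmark) ** 2)

x = -1.0
print(rbf_similarity(x, -2.0))   # x2 ≈ 0.74
print(rbf_similarity(x,  1.0))   # x3 ≈ 0.30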
Similarity Features
• You may wonder how to select the landmarks. The simplest approach is
to create a landmark at the location of each and every instance in the
dataset. Doing that creates many dimensions and thus increases the
chances that the transformed training set will be linearly separable.
• The downside is that a training set with m instances and n features gets
transformed into a training set with m instances and m features (assuming
you drop the original features).
• If your training set is very large, you end up with an equally large number
of features
Similarity Features
• The other plots show models trained with different values of
hyperparameters gamma (γ) and C. Increasing gamma makes the bell-
shaped curve narrower (see the lefthand plots in Figure 5-8). As a result,
each instance’s range of influence is smaller: the decision boundary ends
up being more irregular, wiggling around individual instances.
Conversely, a small gamma value makes the bell-shaped curve wider:
instances have a larger range of influence, and the decision boundary ends
up smoother. So γ acts like a regularization hyperparameter: if your
model is overfitting, you should reduce it; if it is underfitting, you should
increase it (similar to the C hyperparameter).
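• In scikit-learn this is done with the SVC class and the Gaussian RBF kernel; a sketch with illustrative gamma and C values (X and y as in the earlier nonlinear example):

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rbf_kernel_svm_clf = Pipeline([
    ("scaler", StandardScaler()),
    # Larger gamma -> narrower bells -> more irregular boundary (risk of overfitting);
    # smaller C -> more regularization.
    ("svm_clf", SVC(kernel="rbf", gamma=5, C=0.001)),
])
rbf_kernel_svm_clf.fit(X, y)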
Similarity Features
Decision Function and Predictions
• The linear SVM classifier model predicts the class of a new instance x by
simply computing the decision function w⊺ x + b = w1 x1 + ⋯ + wn xn +
b. If the result is positive, the predicted class ŷ is the positive class (1), and otherwise it is the negative class (0); see Equation 5-2.
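• A literal reading of this decision rule in Python, assuming an already trained weight vector w and bias b (the numbers below are placeholders, not the values from the figures):

import numpy as np

def predict(x, w, b):
    # Positive class if w^T x + b >= 0, negative class otherwise.
    return 1 if np.dot(w, x) + b >= 0 else 0

w = np.array([1.3, 2.1])   # hypothetical weights for two features
b = -4.0                   # hypothetical bias term
print(predict(np.array([5.5, 1.7]), w, b))   # -> 1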
Decision Function and Predictions
• Figure 5-12 shows the decision function that corresponds to the model on the left in Figure 5-4: it is a 2D plane because this dataset has two
features (petal width and petal length). The decision boundary is the set of
points where the decision function is equal to 0: it is the intersection of
two planes, which is a straight line (represented by the thick solid line).
Decision Function and Predictions
Decision Function and Predictions
• The dashed lines represent the points where the decision function is equal
to 1 or –1: they are parallel and at equal distance to the decision
boundary, and they form a margin around it. Training a linear SVM
classifier means finding the values of w and b that make this margin as
wide as possible while avoiding margin violations (hard margin) or
limiting them (soft margin).
Training Objective
• Consider the slope of the decision function: it is equal to the norm of the
weight vector, ∥ w ∥. If we divide this slope by 2, the points where the
decision function is equal to ±1 are going to be twice as far away from the
decision boundary. In other words, dividing the slope by 2 will multiply
the margin by 2. This may be easier to visualize in 2D, as shown in Figure
5-13. The smaller the weight vector w, the larger the margin.
Training Objective
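• Putting this together (stated here for completeness, using the notation of the decision function above, with t(i) = –1 for negative instances and t(i) = +1 for positive instances):
• Hard margin linear SVM objective: minimize (1/2) w⊺w over w and b, subject to t(i)(w⊺x(i) + b) ≥ 1 for every training instance i.
• Soft margin linear SVM objective: introduce a slack variable ζ(i) ≥ 0 for each instance and minimize (1/2) w⊺w + C Σ ζ(i) over w, b, and ζ, subject to t(i)(w⊺x(i) + b) ≥ 1 – ζ(i); the hyperparameter C trades off a wide street against margin violations.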
Quadratic Programming
• The hard margin and soft margin problems are both convex quadratic
optimization problems with linear constraints. Such problems are known
as Quadratic Programming (QP) problems.
• Many off-the-shelf solvers are available to solve QP problems by using a
variety of techniques.
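• As a sketch of what an off-the-shelf solver looks like in practice, the hard margin primal can be handed to a generic QP solver such as cvxopt (assumed to be installed; the formulation below is a standard one, not taken from the slides):

import numpy as np
from cvxopt import matrix, solvers

def hard_margin_svm(X, t):
    # Decision variable p = [w_1, ..., w_n, b]; minimize (1/2) w^T w
    # subject to t_i (w^T x_i + b) >= 1 for every training instance.
    m, n = X.shape
    H = np.zeros((n + 1, n + 1))
    H[:n, :n] = np.eye(n)          # quadratic term acts on w only, not on b
    H[n, n] = 1e-9                 # tiny value keeps the KKT system non-singular
    f = np.zeros(n + 1)
    # cvxopt expects constraints as G p <= h, i.e. -t_i (x_i^T w + b) <= -1
    G = -t[:, None] * np.hstack([X, np.ones((m, 1))])
    h = -np.ones(m)
    sol = solvers.qp(matrix(H), matrix(f), matrix(G), matrix(h))
    p = np.array(sol["x"]).ravel()
    return p[:n], p[n]             # w, b

# Usage: X is an (m, n) float array, t holds class labels -1.0 or +1.0.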
Quadratic Programming