Machine Learning Unit 1
Learning
• Introduction to Machine Learning, comparison of machine learning with traditional programming, ML vs AI vs Data Science.
• Types of learning: supervised, unsupervised, semi-supervised, and reinforcement learning techniques.
• Models of machine learning: geometric models, probabilistic models, logical models, grouping and grading models.
• Parametric and non-parametric models.
• Important elements of machine learning: data formats.
• Learnability.
• Statistical learning approaches.
Introduction to Machine Learning
• Arthur Samuel, an early American leader in the field of computer gaming and artificial intelligence, coined the term "Machine Learning" in 1959 while at IBM.
• He defined machine learning as "the field of study that gives computers the ability to learn without being explicitly programmed". However, there is no universally accepted definition of machine learning.
• Different authors define the term differently.
Introduction to Machine Learning
• Machine learning is programming computers to optimize a performance criterion using example data or past experience.
• We have a model defined up to some parameters, and learning is the execution of a computer program to optimize the parameters of the model using the training data or past experience.
• The model may be predictive, to make predictions in the future, or descriptive, to gain knowledge from data.
• The field of study known as machine learning is concerned with the question of how to construct computer programs that automatically improve with experience.
Introduction to Machine Learning
• Definition of learning: A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.
• Example: a handwriting recognition learning problem
– Task T: recognizing and classifying handwritten words within images
– Performance P: percent of words correctly classified
– Training experience E: a dataset of handwritten words with given classifications
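As a toy illustration (a sketch, not from the slides) of the performance measure P above, "percent of words correctly classified" can be computed like this in Python:

def percent_correct(predicted, actual):
    # Performance P: percent of items correctly classified
    matches = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * matches / len(actual)

print(percent_correct(["cat", "dog", "cow"], ["cat", "dog", "dog"]))  # about 66.7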
Introduction to Machine Learning
A robot driving learning problem
– Task T: driving on highways using vision sensors
– Performance P: average distance traveled before an error
– Training experience E: a sequence of images and steering commands recorded while observing a human driver
• Definition: A computer program which learns from experience is called a machine learning program or simply a learning program.
Introduction to Machine Learning
• However, when we need to predict something, we need to use an algorithm with a variety of input parameters.
• In the case of predicting an exchange rate, it is mandatory to add details such as yesterday's rate and the external and internal economic changes in the country that issues the currency, among others.
• In short, we may need to add hundreds or thousands of parameters, whereas a limited set of them allows building only a very basic and unscalable model.
Introduction to Machine Learning
• How a data engineer develops a solution using
machine learning https://morioh.com/p/1063d43ef15e
Ref- https://pythongeeks.org/ai-vs-data-science-vs-deep-learning-vs-ml/
Introduction to Machine Learning
• Artificial Intelligence
• It focuses mainly on building smart machines that are capable of performing tasks that replicate human intelligence without any human interference.
• These systems, built using Artificial Intelligence models, tend to mimic human cognitive functions, which allows decision making and helps to improve learning.
• Example: forecasting financial and business outcomes and efficiently providing solutions for businesses.
• We can demonstrate the working of AI in brief with the following steps:
1. Collect data
2. Clean and prepare data
3. Train the model
4. Test the data
5. Improve
• Machine learning algorithms make use of computational methods and try to "learn" from the input data without requiring any predetermined equation.
• Machine learning is an application of AI that allows systems to learn and improve significantly from past data experience.
• The working of machine learning models is simply put as (see the sketch after this list):
1. Gather data from the source
2. Clean and filter the data
3. Choose an effective algorithm according to your problem
4. Train the model
5. Tune the parameters for best performance
6. Test the model and try to improve its efficiency
7. Deploy the final model having precise outputs
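A minimal sketch of this workflow in Python with scikit-learn; the built-in iris dataset stands in for steps 1-2, and the algorithm and parameter choices are illustrative, not prescribed by the slides:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)            # 1-2. gather (already clean) data

# 3. Choose an algorithm suited to the problem (here, a decision tree)
model = DecisionTreeClassifier(max_depth=3)  # 5. max_depth is a tunable parameter

# 4. Train on one part of the data, holding out the rest for testing
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model.fit(X_train, y_train)

# 6. Test and try to improve; 7. deploy (e.g., serialize) the final model
print("test accuracy:", model.score(X_test, y_test))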
• Deep learning is again a subfield of the artificial intelligence domain.
• It makes use of a multi-layered structure of algorithms, more commonly known as a neural network.
• Deep Learning, like Machine Learning algorithms, also needs data for learning and solving problems like classification and prediction.
• We can even consider Deep Learning a subdomain of machine learning.
DEEP LEARNING
• Unlike Machine Learning, we do not need labelled data for model training in Deep Learning, since we can use this technology even when we do not have well-classified data.
• The Deep Learning system looks for appropriate differentiators in the given data points without considering any external classification.
• This is how we can avoid human interference in the training of the Deep Learning model.
• These models analyze new entities for new features at each layer and use them to choose the way in which we can classify the entries.
• The system keeps checking itself in order to look for new classifications or categories that can be generated from the new entities.
DATA SCIENCE
How Supervised Learning Works?
• Suppose we have a dataset of different types of shapes
which includes square, rectangle, triangle, and Polygon. Now
the first step is that we need to train the model for each
shape.
• If the given shape has four sides, and all the sides are equal,
then it will be labelled as a Square.
• If the given shape has three sides, then it will be labelled as
a triangle.
• If the given shape has six equal sides, then it will be labelled as a hexagon.
• Now, after training, we test our model using the test set, and the task of the model is to identify the shape.
• The machine is already trained on all types of shapes, and when it finds a new shape, it classifies the shape on the basis of the number of sides and predicts the output.
How Supervised Learning Works?
• Steps involved in supervised learning:
• First, determine the type of training dataset.
• Collect/gather the labelled training data.
• Split the dataset into a training dataset, a test dataset, and a validation dataset.
• Determine the input features of the training dataset, which should have enough knowledge so that the model can accurately predict the output.
• Determine the suitable algorithm for the model, such as a support vector machine, decision tree, etc.
• Execute the algorithm on the training dataset. Sometimes we need validation sets as the control parameters, which are subsets of the training dataset.
• Evaluate the accuracy of the model by providing the test set. If the model predicts the correct output, our model is accurate. (A minimal sketch of these steps follows.)
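A short sketch of the steps above; the synthetic data and the 60/20/20 split are illustrative assumptions, not values given in the slides:

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))               # 300 samples, 4 input features
y = (X[:, 0] + X[:, 1] > 0).astype(int)     # labelled training data

# Split into training, validation, and test sets (60/20/20)
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.4, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

# Choose a suitable algorithm (here a support vector machine) and train it
model = SVC(kernel="rbf", C=1.0)
model.fit(X_train, y_train)

# Use the validation set to compare settings, then evaluate on the test set
print("validation accuracy:", model.score(X_val, y_val))
print("test accuracy:", model.score(X_test, y_test))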
Types of supervised Machine learning Algorithms:
1. Regression
• Regression algorithms are used if there is a
relationship between the input variable and the
output variable.
• It is used for the prediction of continuous variables, such as weather forecasting, market trends, etc. (a short sketch follows the list below).
• Below are some popular regression algorithms which come under supervised learning:
• Linear Regression
• Regression Trees
• Non-Linear Regression
• Bayesian Linear Regression
• Polynomial Regression
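A small regression sketch, with synthetic data standing in for, e.g., a market trend:

import numpy as np
from sklearn.linear_model import LinearRegression

X = np.arange(10).reshape(-1, 1)    # input variable
y = 3.0 * X.ravel() + 2.0 + np.random.default_rng(0).normal(0, 0.5, 10)

model = LinearRegression().fit(X, y)                       # learn the line from data
print("slope:", model.coef_[0], "intercept:", model.intercept_)
print("prediction for x=12:", model.predict([[12]])[0])    # a continuous output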
Supervised Machine Learning
2. Classification
• Classification algorithms are used when the output variable is categorical, meaning there are discrete classes such as Yes-No, Male-Female, True-False, etc. A typical application is spam filtering (a small sketch follows the list below).
• Popular classification algorithms:
• Random Forest
• Decision Trees
• Logistic Regression
• Support Vector Machines
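A tiny spam-filtering sketch using logistic regression; the example messages and labels are made up for illustration:

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

texts = ["win a free prize now", "meeting at noon tomorrow",
         "free cash click here", "lunch with the team"]
labels = [1, 0, 1, 0]                        # 1 = spam, 0 = not spam

vec = CountVectorizer().fit(texts)           # bag-of-words features
model = LogisticRegression().fit(vec.transform(texts), labels)
print(model.predict(vec.transform(["free prize inside"])))  # likely [1] (spam)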
Advantages of Supervised learning:
• With the help of supervised learning, the model can predict the output on the basis of prior experience.
• In supervised learning, we can have an exact idea about the classes of objects.
• Supervised learning models help us to solve various real-world problems such as fraud detection, spam filtering, etc.
Disadvantages of supervised learning:
• Supervised learning models are not suitable for handling complex tasks.
• Supervised learning cannot predict the correct output if the test data is different from the training dataset.
• Training requires a lot of computation time.
• In supervised learning, we need enough knowledge about the classes of objects.
Unsupervised Machine Learning
• Unsupervised learning is where you only have input
data (X) and no corresponding output variables.
• As the name suggests, unsupervised learning is a
machine learning technique in which models are
not supervised using training dataset.
• Instead, models itself find the hidden patterns and
insights from the given data. It can be compared to
learning which takes place in the human brain
• The goal of unsupervised learning is to find the
underlying structure of dataset, group that data
according to similarities, and represent that
dataset in a compressed format.
31
Working of Unsupervised Learning
• Here, we have taken unlabeled input data, which means it is not categorized and the corresponding outputs are also not given.
• Now, this unlabeled input data is fed to the machine learning model in order to train it. First, it will interpret the raw data to find the hidden patterns in the data and then will apply suitable algorithms such as k-means clustering, hierarchical clustering, etc. (see the sketch below).
• Once it applies the suitable algorithm, the algorithm divides the data objects into groups according to the similarities and differences between the objects.
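A minimal k-means sketch of this working; the two synthetic blobs are illustrative:

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (50, 2)),    # one blob near (0, 0)
               rng.normal(5, 0.5, (50, 2))])   # another blob near (5, 5)

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.cluster_centers_)   # roughly the two blob centres
print(kmeans.labels_[:5])        # group assignment per point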
Types of Unsupervised Learning Algorithms:
• Clustering: clustering is a method of grouping objects into clusters such that objects with the most similarities remain in a group and have few or no similarities with the objects of another group.
• Cluster analysis finds the commonalities between the data objects and categorizes them as per the presence and absence of those commonalities.
• Association: an association rule is an unsupervised learning method used for finding relationships between variables in a large database. It determines the set of items that occur together in the dataset.
• Association rules make marketing strategy more effective; for example, people who buy item X (say, bread) also tend to purchase item Y (butter/jam). A typical example of an association rule is Market Basket Analysis (see the sketch below).
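A toy market-basket sketch that counts how often pairs of items occur together; the transactions are made up, and real association-rule mining would also compute confidence and lift:

from itertools import combinations
from collections import Counter

transactions = [
    {"bread", "butter", "jam"},
    {"bread", "butter"},
    {"milk", "bread"},
    {"bread", "jam"},
]

pair_counts = Counter()
for basket in transactions:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# Support of a pair = fraction of transactions containing both items
for pair, count in pair_counts.most_common(3):
    print(pair, "support:", count / len(transactions))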
• Unsupervised Learning algorithms:
• Below is a list of some popular unsupervised learning algorithms (a PCA sketch follows the list):
• K-means clustering
• KNN (k-nearest neighbors)
• Hierarchical clustering
• Anomaly detection
• Neural Networks
• Principal Component Analysis
• Independent Component Analysis
• Apriori algorithm
• Singular value decomposition
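As one example from the list, a short Principal Component Analysis sketch; the synthetic 3-D data, which actually lies on a 2-D plane, is illustrative:

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2)) @ np.array([[1.0, 0.5, 0.2],
                                          [0.0, 1.0, 0.3]])  # 3-D data on a 2-D plane

pca = PCA(n_components=2).fit(X)
print(pca.explained_variance_ratio_)   # the two components explain nearly all variance
X_reduced = pca.transform(X)           # compressed representation of the dataset
print(X_reduced.shape)                 # (100, 2)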
Advantages of Unsupervised learning:
• Unsupervised learning is used for more complex tasks as compared to supervised learning because, in unsupervised learning, we don't have labeled input data.
• Unsupervised learning is preferable because it is easier to get unlabeled data than labeled data.
Disadvantages of Unsupervised Learning
• Unsupervised learning is intrinsically more difficult than supervised learning, as it does not have corresponding outputs.
• The result of an unsupervised learning algorithm might be less accurate, as the input data is not labeled and the algorithm does not know the exact output in advance.
Difference between Supervised vs Unsupervised Learning
[Comparison table shown in the original slides]
Semi-Supervised Learning
Semi-supervised learning
• Semi-supervised learning bridges supervised learning and unsupervised learning techniques to solve their key challenges.
• With it, you train an initial model on a few labeled samples and then iteratively apply it to a larger amount of unlabeled data.
• Unlike unsupervised learning, SSL works for a variety of problems, from classification and regression to clustering and association.
• Unlike supervised learning, the method uses small amounts of labeled data together with large amounts of unlabeled data, which reduces expenses on manual annotation and cuts data preparation time.
How Semi-supervised Learning Works
• You pick a small amount of labeled data, e.g., images showing cats and
dogs with their respective tags, and you use this dataset to train a
base model with the help of ordinary supervised methods.
• Then you apply the process known as pseudo-labeling — when you
take the partially trained model and use it to make predictions for
the rest of the database which is yet unlabeled. The labels generated
thereafter are called pseudo as they are produced based on the
originally labeled data that has limitations (say, there may be an
uneven representation of classes in the set resulting in bias — more
dogs than cats).
• From this point, you take the most confident predictions made with
your model (for example, you want the confidence of over 80 percent
that a certain image shows a cat, not a dog). If any of the pseudo-
labels exceed this confidence level, you add them into the labeled
dataset and create a new, combined input to train an improved model.
• The process can go through several iterations (10 is often a standard amount) with more and more pseudo-labels being added every time. Provided the data is suitable for the process, the performance of the model will keep increasing at each iteration.
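A compact pseudo-labeling sketch of the process described above; the data is synthetic, and the 80 percent confidence threshold mirrors the text's example:

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 2))
y_true = (X[:, 0] > 0).astype(int)       # ground truth (unknown in practice)

labels = np.full(500, -1)                # -1 marks "unlabeled"
labels[:20] = y_true[:20]                # a small labeled seed set (both classes assumed present)

model = LogisticRegression()
for _ in range(10):                      # 10 iterations is a common choice
    mask = labels != -1
    model.fit(X[mask], labels[mask])     # train on the currently labeled data
    proba = model.predict_proba(X[~mask])
    confident = proba.max(axis=1) > 0.8  # keep only confident predictions
    idx = np.flatnonzero(~mask)[confident]
    if idx.size == 0:
        break
    labels[idx] = proba[confident].argmax(axis=1)  # add pseudo-labels to the pool

print("labeled so far:", (labels != -1).sum(), "of", len(labels))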
Semi-supervised learning examples
• Speech Recognition- Facebook (now Meta)
has successfully applied semi-supervised
learning (namely the self-training method) to its
speech recognition models and improved them.
• Web content classification- Many search
engines, including Google, apply SSL to their
ranking component to better understand
human language and the relevance of candidate
search results to queries.
• Text document classification: building a text document classifier.
Reinforcement Learning Techniques
• Reinforcement learning is an area of Machine
Learning. It is about taking suitable action to
maximize reward in a particular situation.
• In reinforcement learning, there is no answer key; the reinforcement agent decides what to do to perform the given task. In the absence of a training dataset, it is bound to learn from its experience.
• Example: We have an agent and a reward, with many hurdles in between. The agent is supposed to find the best possible path to reach the reward.
Types of Reinforcement:
• There are two types of reinforcement:
• Positive
Positive reinforcement is defined as when an event, occurring due to a particular behavior, increases the strength and frequency of that behavior. In other words, it has a positive effect on behavior.
• Advantages of positive reinforcement:
– Maximizes performance
– Sustains change for a long period of time
• Drawback: too much reinforcement can lead to an overload of states, which can diminish the results.
Types of Reinforcement:
• Negative
Negative reinforcement is defined as the strengthening of a behavior because a negative condition is stopped or avoided.
• Advantages of negative reinforcement:
– Increases behavior
– Helps enforce a minimum standard of performance
• Drawback: it only provides enough to meet the minimum behavior.
• State-Action-Reward-State-Action (SARSA): SARSA is an on-policy algorithm based on the Markov decision process. It uses the action performed by the current policy to learn the Q-value. The name SARSA stands for State-Action-Reward-State-Action, symbolizing the tuple (s, a, r, s', a').
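A minimal tabular SARSA sketch of the update for the (s, a, r, s', a') tuple; the environment (number of states and actions, and the env.step call) is a hypothetical placeholder:

import numpy as np

n_states, n_actions = 5, 2
alpha, gamma, epsilon = 0.1, 0.9, 0.1   # learning rate, discount, exploration
Q = np.zeros((n_states, n_actions))

def choose_action(s):
    # Epsilon-greedy action under the current policy
    if np.random.random() < epsilon:
        return np.random.randint(n_actions)
    return int(Q[s].argmax())

def sarsa_update(s, a, r, s_next, a_next):
    # On-policy update: uses the action actually chosen in s_next
    Q[s, a] += alpha * (r + gamma * Q[s_next, a_next] - Q[s, a])

# Usage inside an episode loop (env.step is a hypothetical environment call):
# a = choose_action(s)
# s_next, r = env.step(s, a)
# a_next = choose_action(s_next)
# sarsa_update(s, a, r, s_next, a_next)
# s, a = s_next, a_next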
Practical Applications of Reinforcement Learning
• RL can be used in robotics for industrial automation.
• RL can be used in machine learning and data processing
• RL can be used to create training systems that provide
custom instruction and materials according to the
requirement of students.
• RL can be used in large environments in the following situations:
– A model of the environment is known, but an analytic solution is not available;
– Only a simulation model of the environment is given (the subject of simulation-based optimization);
– The only way to collect information about the environment is to interact with it.
Models of Machine learning: Geometric model
• Some broad categories of models:
1. Geometric models
• E.g. k-nearest neighbors, linear regression, support vector machine, logistic regression, …
2. Probabilistic models
• E.g. Naïve Bayes, Gaussian process regression, conditional random field, …
3. Logical models
• E.g. decision tree, random forest, …
4. Compositional models
• E.g. neural networks, logistic regression, …
5. Ensemble models: boosting, bagging, random forest.
6. Grading vs grouping models
Models of Machine learning: Geometric model
• Machine learning is concerned with using the right features to build the right models that achieve the right tasks.
• For a given problem, the collection of all possible outcomes represents the sample space or instance space.
• The basic idea of learning models falls into the following categories:
• Using a logical expression (logical models)
• Using the geometry of the instance space (geometric models)
• Using probability to classify the instance space (probabilistic models)
• Grouping and grading
Logical models
• Logical models use a logical expression to divide the
instance space into segments and hence construct
grouping models.
• A logical expression is an expression that returns a
Boolean value, i.e., a True or False outcome.
• Once the data is grouped using a logical expression, the
data is divided into homogeneous groupings for the
problem we are trying to solve.
• For example, for a classification problem, all the
instances in the group belong to one class.
• There are mainly two kinds of logical models: Tree
models and Rule models.
Logical models
• Rule models consist of a collection of implications or IF-THEN rules.
• In tree-based models, the 'if-part' defines a segment and the 'then-part' defines the behavior of the model for this segment.
• Rule models follow the same reasoning.
• Tree models can be seen as a particular type of rule model where the if-parts of the rules are organized in a tree structure.
• Both tree models and rule models use the same approach to supervised learning.
Logical models
• Logical models and Concept learning
• To understand logical models further, we need to understand
the idea of Concept Learning.
• Concept Learning involves learning logical expressions or
concepts from examples.
• Concept learning forms the basis of both tree-based and rule-
based models.
• More formally, Concept Learning involves acquiring the
definition of a general category from a given set of positive and
negative training examples of the category.
• A Formal Definition for Concept Learning is “The inferring of a
Boolean-valued function from training examples of its input
and output.”
• In concept learning, we only learn a description for the positive class and label everything that doesn't satisfy that description as negative.
Geometric model
• We have seen that with logical models, such as decision trees, a logical expression is used to partition the instance space.
• Two instances are similar when they end up in the same logical segment.
• In this section, we consider models that define similarity by considering the geometry of the instance space.
• In geometric models, features can be described as points in two dimensions (x- and y-axis) or in a three-dimensional space (x, y, and z).
• E.g., temperature as a function of time can be modelled in two axes.
Geometric model
• There are two ways we could impose similarity.
• We could use geometric concepts like lines or planes
to segment (classify) the instance space. These are
called Linear models.
• Alternatively, we can use the geometric notion of
distance to represent similarity.
• In this case, if two points are close together, they have similar values for features and thus can be classed as similar. We call such models Distance-based models.
Geometric model
• 1. Linear models are relatively simple.
• In this case, the function is represented as a linear
combination of its inputs.
• Thus, if x1 and x2 are two scalars or vectors of the same
dimension and a and b are arbitrary scalars,
then ax1 + bx2 represents a linear combination of x1 and x2.
• In the simplest case where f(x) represents a straight line,
we have an equation of the form
• f (x) = mx + c where c represents the intercept
and m represents the slope.
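A small sketch of learning m and c from data by least squares; the synthetic points are illustrative:

import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = 2.0 * x + 1.0 + np.random.default_rng(0).normal(0, 0.1, x.size)

m, c = np.polyfit(x, y, deg=1)   # learn slope m and intercept c from the data
print(f"f(x) = {m:.2f}*x + {c:.2f}")   # close to f(x) = 2.00*x + 1.00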
Geometric model
• Linear models are parametric, which means that they
have a fixed form with a small number of numeric
parameters that need to be learned from data.
• For example, in f(x) = mx + c, m and c are the parameters that we are trying to learn from the data.
• Linear models are stable, i.e., small variations in the
training data have only a limited impact on the learned
model.
• In contrast, tree models tend to vary more with the
training data, as the choice of a different split at the
root of the tree typically means that the rest of the
tree is different as well.
Geometric model
• Linear models have low variance and high bias.
• This implies that Linear models are less likely to
overfit the training data than some other models.
• However, they are more likely to underfit. For example,
if we want to learn the boundaries between countries
based on labelled data, then linear models are not
likely to give a good approximation
• Errors in Machine Learning?
• Reducible errors: These errors can be reduced to
improve the model accuracy.
• Irreducible errors: These errors will always be present
in the model
Geometric model
• What is Bias?
• In general, a machine learning model analyses the data, finds patterns in it, and makes predictions.
• While training, the model learns these patterns in the dataset and applies them to test data for prediction.
• While making predictions, a difference occurs between the prediction values made by the model and the actual/expected values, and this difference is known as bias error or error due to bias.
• Low bias: a low-bias model makes fewer assumptions about the form of the target function.
• High bias: a high-bias model makes more assumptions, and the model becomes unable to capture the important features of our dataset. A high-bias model also cannot perform well on new data.
Geometric model
• Some examples of machine learning algorithms with low bias are Decision Trees, k-Nearest Neighbours and Support Vector Machines. Algorithms with high bias include Linear Regression, Linear Discriminant Analysis and Logistic Regression.
• Ways to reduce high bias:
• High bias mainly occurs due to an overly simple model. Below are some ways to reduce high bias:
• Increase the input features, as the model is underfitted.
• Use more complex models, such as including some polynomial features.
Geometric model
• What is a Variance Error?
• Variance specifies the amount by which the prediction would change if different training data were used.
• In simple words, variance tells how much a random variable differs from its expected value.
• Ideally, a model should not vary too much from one training dataset to another, which means the algorithm should be good at understanding the hidden mapping between input and output variables.
• Variance errors are either low variance or high variance.
• Low variance means there is a small variation in the prediction of the target function with changes in the training dataset, while high variance shows a large variation in the prediction of the target function with changes in the training dataset.
Models of Machine learning: Geometric model
• A model that shows high variance learns a lot and performs well on the training dataset, but does not generalize well to unseen data.
• As a result, such a model gives good results with the training dataset but shows high error rates on the test dataset.
• Since, with high variance, the model learns too much from the dataset, it leads to overfitting.
• A model with high variance has the following problems:
• A high-variance model leads to overfitting.
• Increased model complexity.
Models of Machine learning: Geometric model
Ways to Reduce High Variance:
• Reduce the input features or the number of parameters, as the model is overfitted.
• Do not use an overly complex model.
• Increase the training data.
• Increase the regularization term.
Different Combinations of Bias-Variance
Low-Bias, Low-Variance:
The combination of low bias and low variance shows an ideal machine learning model. However, it is not achievable in practice.
Models of Machine learning: Geometric model
• Low-Bias, High-Variance: With low bias and high variance, model predictions are inconsistent but accurate on average. This case occurs when the model learns with a large number of parameters and hence leads to overfitting.
• High-Bias, Low-Variance: With high bias and low variance, predictions are consistent but inaccurate on average. This case occurs when a model does not learn well from the training dataset or uses a small number of parameters. It leads to underfitting problems in the model.
• High-Bias, High-Variance: With high bias and high variance, predictions are inconsistent and also inaccurate on average.
Geometric model
• 2. Distance-based models are the second class of geometric models.
• Like linear models, distance-based models are based on the geometry of data.
• As the name implies, distance-based models work on the notion of distance between points.
• In the context of machine learning, the concept of distance is not based on merely the physical distance between two points.
• Instead, we could think of the distance between two points considering the mode of transport between them.
• Travelling between two cities by plane covers less distance physically than by train, because the plane is unrestricted.
Models of Machine learning: Geometric model
– Euclidean distance: the length of a segment connecting two points.
– Most useful for low-dimensional data; as the dimensionality of your data increases, distances might become skewed and less useful.
Minkowski distance
• Minkowski distance is a metric used in a normed vector space (n-dimensional real space), which means it can be used in a space where distances can be represented as a vector that has a length.
• This measure has three requirements:
• Zero Vector — The zero vector has a length of zero, whereas every other vector has a positive length. For example, if we travel from one place to another, that distance is always positive; however, if we travel from one place to itself, that distance is zero.
• Scalar Factor — When you multiply the vector by a positive number, its length is changed while keeping its direction. For example, if we go a certain distance in one direction and add the same distance, the direction does not change.
• Triangle Inequality — The shortest distance between two points is a straight line.
Models of Machine learning: Geometric model
• The most interesting aspect of this distance measure is the use of the parameter p: the Minkowski distance of order p between points x and y is D(x, y) = (Σ_i |x_i − y_i|^p)^(1/p).
• We can use this parameter to manipulate the distance metric to closely resemble others.
• Common values of p are:
• p = 1 — Manhattan distance
• p = 2 — Euclidean distance
• p = ∞ — Chebyshev distance
Chebyshev distance is simply the maximum distance along one axis. Due to its nature, it is often referred to as Chessboard distance, since the minimum number of moves needed by a king to go from one square to another is equal to the Chebyshev distance.
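A quick sketch of the Minkowski distance and its special cases:

import numpy as np

def minkowski(x, y, p):
    # Minkowski distance of order p between two points
    return np.sum(np.abs(x - y) ** p) ** (1.0 / p)

x, y = np.array([0.0, 0.0]), np.array([3.0, 4.0])
print(minkowski(x, y, 1))        # Manhattan distance: 7.0
print(minkowski(x, y, 2))        # Euclidean distance: 5.0
print(np.max(np.abs(x - y)))     # Chebyshev distance (the p -> infinity limit): 4.0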
Probabilistic Models
• Probabilistic models use the idea of probability to
classify new entities.
• Probabilistic models see features and target variables
as random variables.
• The process of modelling represents and manipulates
the level of uncertainty with respect to these variables.
• There are two types of probabilistic models: predictive and generative.
• Predictive probability models use the idea of a conditional probability distribution P(Y|X), from which Y can be predicted from X.
• Generative models estimate the joint distribution P(Y, X).
Probabilistic Models
• Once we know the joint distribution for the generative
models, we can derive any conditional or marginal
distribution involving the same variables.
• Thus, the generative model is capable of creating new
data points and their labels, knowing the joint
probability distribution.
• The joint distribution looks for a relationship between
two variables.
• Once this relationship is inferred, it is possible to infer
new data points.
• Naïve Bayes is an example of a probabilistic classifier
Naïve Bayes Classifier Algorithm
• Naïve Bayes algorithm is a supervised learning
algorithm, which is based on Bayes theorem and used
for solving classification problems.
• It is mainly used in text classification that includes a
high-dimensional training dataset.
• It is a probabilistic classifier, which means it predicts
on the basis of the probability of an object.
• Some popular examples of Naïve Bayes Algorithm
are spam filtration, Sentimental analysis, and
classifying articles.
• Naïve: It is called Naïve because it assumes that the occurrence of a certain feature is independent of the occurrence of other features. For example, if a fruit is identified on the basis of color, shape, and taste, then a red, spherical, and sweet fruit is recognized as an apple.
Naïve Bayes Classifier Algorithm
• Bayes: It is called Bayes because it depends on the
principle of Bayes' Theorem.
• Bayes' Theorem is used to determine the probability of a hypothesis with prior knowledge. It depends on conditional probability.
• The formula for Bayes' theorem is given as:

P(A|B) = P(B|A) * P(A) / P(B)

• Where,
• P(A|B) is the Posterior probability: the probability of hypothesis A given the observed event B.
• P(B|A) is the Likelihood probability: the probability of the evidence given that the hypothesis is true.
Naïve Bayes Classifier Algorithm
• P(A) is Prior Probability: Probability of hypothesis
before observing the evidence.
• P(B) is Marginal Probability: Probability of Evidence.
• Suppose we have a dataset of weather conditions and a corresponding target variable "Play". Using this dataset, we need to decide whether we should play on a particular day according to the weather conditions. To solve this problem, we need to follow the below steps:
• Convert the given dataset into frequency tables.
• Generate a likelihood table by finding the probabilities of the given features.
Naïve Bayes Classifier Algorithm
• Now, use Bayes theorem to calculate the posterior probability.
• Problem: If the weather is sunny, should the player play or not?
• Solution: To solve this, first build the frequency table from the weather dataset of 14 days (the full dataset appears as a table in the original slides; the counts below follow from the numbers used in the calculation):

Weather    Play = Yes    Play = No
Sunny          3             2
Other          7             2
Total         10             4
Naïve Bayes Classifier Algorithm
• Applying Bayes' theorem:
• P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny)
• P(Sunny|Yes) = 3/10 = 0.3
• P(Sunny) = 5/14 = 0.35
• P(Yes) = 10/14 = 0.71
• So P(Yes|Sunny) = 0.3 * 0.71 / 0.35 = 0.60
• P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny)
• P(Sunny|No) = 2/4 = 0.5
• P(No) = 4/14 = 0.29
• P(Sunny) = 0.35
• So P(No|Sunny) = 0.5 * 0.29 / 0.35 = 0.41
• As we can see from the above calculation, P(Yes|Sunny) > P(No|Sunny).
• Hence, on a sunny day, the player can play the game.
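The same calculation reproduced in a few lines of Python; with exact fractions the posteriors come to 0.60 and 0.40 (the slides' rounded intermediate values give 0.41 for the second):

p_sunny_yes, p_yes = 3 / 10, 10 / 14    # likelihood and prior for "Yes"
p_sunny_no,  p_no  = 2 / 4,  4 / 14     # likelihood and prior for "No"
p_sunny = 5 / 14                        # evidence

p_yes_sunny = p_sunny_yes * p_yes / p_sunny
p_no_sunny  = p_sunny_no  * p_no  / p_sunny
print(round(p_yes_sunny, 2), round(p_no_sunny, 2))  # 0.6 0.4 -> play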
Advantages of Naïve Bayes Classifier:
• Naïve Bayes is one of the fast and easy ML algorithms to predict a
class of datasets.
• It can be used for Binary as well as Multi-class Classifications.
• It performs well in Multi-class predictions as compared to the
other Algorithms.
• It is the most popular choice for text classification problems.
Disadvantages of Naïve Bayes Classifier:
• Naive Bayes assumes that all features are independent or
unrelated, so it cannot learn the relationship between features.
• Applications of Naïve Bayes Classifier:
• It is used for Credit Scoring.
• It is used in medical data classification.
• It can be used in real-time predictions because Naïve Bayes
Classifier is an eager learner.
• It is used in text classification, such as spam filtering and sentiment analysis.
Grouping and Grading Model
• Grading vs grouping is an orthogonal categorization to
geometric-probabilistic-logical-compositional.
• Grouping models break the instance space up into
groups or segments and in each segment apply a very
simple method (such as majority class).
• E.g. decision tree, KNN.
• Grading models form one global model over the
instance space.
• E.g. Linear classifiers – Neural networks
Parametric Machine Learning Algorithms
• A learning model that summarizes data with a set of
parameters of fixed size (independent of the number
of training examples) is called a parametric model.
• No matter how much data you throw at a parametric
model, it won’t change its mind about how many
parameters it needs.
• The algorithms involve two steps:
• Select a form for the function.
• Learn the coefficients for the function from the training
data.
• An easy-to-understand functional form for the mapping function is a line, as used in linear regression:
• b0 + b1*x1 + b2*x2 = 0
Parametric Machine Learning Algorithms
• Where b0, b1 and b2 are the coefficients of the line that
control the intercept and slope, and x1 and x2 are two
input variables.
• The prediction is a linear combination of the input variables, and as such parametric machine learning algorithms are often also called "linear machine learning algorithms".
• Some more examples of parametric machine learning algorithms include:
• Logistic Regression
• Linear Discriminant Analysis
• Perceptron
• Naive Bayes
• Simple Neural Networks
• For example, the parameters for the normal distribution are:
• Mean
• Standard Deviation
Parametric Machine Learning Algorithms
• Benefits of Parametric Machine Learning Algorithms:
• Simpler: These methods are easier to understand and interpret
results.
• Speed: Parametric models are very fast to learn from data.
• Less Data: They do not require as much training data and
can work well even if the fit to the data is not perfect.
• Limitations of Parametric Machine Learning Algorithms:
• Constrained: By choosing a functional form these methods are
highly constrained to the specified form.
• Limited Complexity: The methods are more suited to simpler
problems.
• Poor Fit: In practice the methods are unlikely to match the
underlying mapping function.
Nonparametric Machine Learning Algorithms
• Nonparametric methods are good when you have a lot
of data and no prior knowledge, and when you don’t
want to worry too much about choosing just the right
features.
• Nonparametric methods seek to best fit the training
data in constructing the mapping function, whilst
maintaining some ability to generalize to unseen data.
As such, they are able to fit a large number of
functional forms.
• An easy to understand nonparametric model is the k-
nearest neighbors algorithm that makes predictions
based on the k most similar training patterns for a new
data instance.
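A tiny k-nearest-neighbours sketch of this idea (k = 3, majority vote over Euclidean distances; the data is synthetic):

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    dists = np.linalg.norm(X_train - x_new, axis=1)   # distance to each training point
    nearest = np.argsort(dists)[:k]                   # indices of the k closest
    return Counter(y_train[nearest]).most_common(1)[0][0]   # majority vote

X_train = np.array([[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]])
y_train = np.array([0, 0, 0, 1, 1, 1])
print(knn_predict(X_train, y_train, np.array([4.5, 5.0])))  # -> 1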
Nonparametric Machine Learning Algorithms
• Some more examples of popular nonparametric machine learning algorithms are:
• k-Nearest Neighbors
• Decision Trees, like CART and C4.5
• Support Vector Machines
• Benefits of Nonparametric Machine Learning Algorithms:
• Flexibility: capable of fitting a large number of functional forms.
• Power: no assumptions (or weak assumptions) about the underlying function.
• Performance: can result in higher-performance models for prediction.
• Limitations of Nonparametric Machine Learning Algorithms:
• More data: require a lot more training data to estimate the mapping function.
• Slower: a lot slower to train, as they often have far more parameters to train.
• Overfitting: more of a risk of overfitting the training data, and it is harder to explain why specific predictions are made.
Important Elements in Machine Learning
• Data formats
• In a supervised learning problem, there will always be a dataset, defined as a finite set of real vectors with m features each:

X = {x1, x2, …, xn}, where each xi is a vector in R^m
Data Format
• Labeled data: data consisting of a set of training examples, where each example is a pair consisting of an input and a desired output value (also called the supervisory signal, labels, etc.)
• Classification: The goal is to predict discrete
values, e.g. {1,0}, {True, False}, {spam, not
spam}.
• Regression: The goal is to predict continuous
values, e.g. home prices.
• Feature vector: a typical setting for machine learning is to be given a collection of objects (or data points), each of which is characterised by several different features.
• Features can be of different sorts: e.g., they might be continuous (say, real- or integer-valued) or categorical (for instance, a feature for colour can have values like green, blue, red).
• A vector containing all of the feature values for a given data point is called the feature vector.
• If this is a vector of length m, then one can think of each data point as being mapped to an m-dimensional vector space (in the case of real-valued features, this is R^m), called the feature space.
• The samples are assumed to be independent and identically distributed (i.i.d.). This means all variables belong to the same distribution D, and considering an arbitrary subset of m values, it happens that:

P(x1, x2, …, xm) = P(x1) · P(x2) · … · P(xm)
• Categorical examples are y ∈ {red, green, blue} or y ∈ {0, 1}.
• For continuous outputs, a common interpretation can be expressed in terms of additive noise: y = f(x) + n, where f is the underlying function and n is a noise term.
• Consider an example of a dataset whose points must be classified as red (Class A) or blue (Class B).
• Three hypotheses are shown: the first one (the middle line starting from the left) misclassifies one sample, while the lower and upper ones misclassify 13 and 23 samples respectively.
• The first hypothesis is optimal and should be selected; however, it's important to understand an essential concept which can determine potential overfitting.
[Figure: a dataset fitted by a linear classifier (blue) and a cubic classifier (red)]
• The blue classifier is linear while the red one
is cubic. At a glance, non-linear strategy
seems to perform better, because it can
capture more expressivity, thanks to its
concavities.
• However, if new samples are added following
the trend defined by the last four ones (from
the right), they'll be completely misclassified.
• In fact, while a linear function is globally better but cannot capture the initial oscillation between 0 and 4, a cubic approach can fit this data almost perfectly but, at the same time, loses its ability to keep a global trend.
Error measures
• In general, when working with a supervised scenario, we define a non-negative error measure e_m which takes two arguments (expected and predicted output) and allows us to compute a total error value over the whole dataset (made up of n samples):

Error(H) = Σ (i = 1 to n) e_m(predicted_i, expected_i) ≥ 0
• This value is also implicitly dependent on the specific hypothesis H through the parameter set; therefore, optimizing the error implies finding an optimal hypothesis:

H* = argmin_H Error(H)
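A small sketch tying these together: a mean-squared error measure over a synthetic dataset, and a search for the hypothesis (here just a slope m) that minimizes it; the data and the candidate grid are illustrative:

import numpy as np

x = np.linspace(0, 1, 50)
y = 2.0 * x + np.random.default_rng(0).normal(0, 0.05, 50)  # expected outputs

def total_error(m):
    # Non-negative error of hypothesis f(x) = m*x over the whole dataset
    predicted = m * x
    return np.mean((predicted - y) ** 2)

candidates = np.linspace(0.0, 4.0, 401)          # a simple family of hypotheses
best_m = candidates[np.argmin([total_error(m) for m in candidates])]
print("optimal hypothesis slope:", best_m)       # close to 2.0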
Statistical learning approaches
• Imagine that you need to design a spam-filtering algorithm, starting from this initial (over-simplistic) classification based on two parameters:

Parameter          Spam emails (X1)    Regular emails (X2)