Introduction To Machine Learning Notes
The breakthrough comes with the idea that a machine can learn on its own
from data (i.e., from examples) to produce accurate results. Machine learning
is closely related to data mining and data science. The machine receives data
as input and uses an algorithm to formulate answers.
Machine Learning | Traditional Programming | Artificial Intelligence
Machine learning is a subset of AI. It is now used in various AI-based tasks such as chatbot question answering, self-driving cars, etc. | Traditional programming is often used to build applications and software systems that have specific functionality. | AI is a broad field that includes many different applications, including natural language processing, computer vision, and robotics.
Machine learning uses a data-driven approach. A model is typically trained on historical data and then used to make predictions on new data. | Traditional programming is typically rule-based and deterministic. It has no self-learning features, unlike machine learning and AI. | AI can involve many different techniques, including machine learning and deep learning, as well as traditional rule-based programming.
ML can find patterns and insights ... | Traditional programming is ... | Sometimes AI uses a combination ...
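The contrast between a hand-written rule and a rule learned from historical data can be sketched as follows. This is only an illustration: the temperature data, the 30.0 threshold, and the function names are hypothetical, and the "learner" is deliberately simple (the midpoint between the two class means).

```python
# Traditional programming: the rule (a threshold of 30.0) is written by hand.
def is_hot_rule_based(temperature_c):
    return temperature_c > 30.0


# Machine learning: the threshold is estimated from labeled historical data.
def learn_threshold(temperatures, labels):
    """Return the midpoint between the mean of each class."""
    hot = [t for t, is_hot in zip(temperatures, labels) if is_hot]
    cold = [t for t, is_hot in zip(temperatures, labels) if not is_hot]
    return (sum(hot) / len(hot) + sum(cold) / len(cold)) / 2


# Hypothetical historical data: temperatures and whether people called them hot.
history_temps = [18.0, 22.0, 25.0, 31.0, 34.0, 36.0]
history_labels = [False, False, False, True, True, True]
threshold = learn_threshold(history_temps, history_labels)


def is_hot_learned(temperature_c):
    # Prediction on new data uses the threshold learned above.
    return temperature_c > threshold
```

If the historical data changes, the learned threshold changes with it, while the hand-written rule stays fixed until someone edits the code.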
1. Study the Problem: The first step is to study the problem. This step
involves understanding the business problem and defining the
objectives of the model.
2. Data Collection: Once the problem is well defined, we can collect
the relevant data required for the model. The data could come from
various sources such as databases, APIs, or web scraping.
3. Data Preparation: Once the problem-related data is collected, it is
a good idea to check the data properly and convert it into the desired
format so that the model can use it to find hidden patterns.
This can be done in the following steps:
o Data cleaning
o Data transformation
o Exploratory data analysis and feature engineering
o Splitting the dataset for training and testing
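The data preparation steps above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library. The records, the choice of min-max scaling, the 75/25 split ratio, and the seed are all assumptions made for the example, not part of the notes.

```python
import random

# Hypothetical raw records: (feature_1, feature_2, label). None marks a missing value.
raw_rows = [
    (1.0, 200.0, 0), (2.0, None, 1), (3.0, 180.0, 0), (4.0, 220.0, 1),
    (5.0, 210.0, 1), (None, 190.0, 0), (7.0, 230.0, 1), (8.0, 170.0, 0),
    (9.0, 240.0, 1), (10.0, 160.0, 0),
]

# Data cleaning: drop every row that contains a missing value.
clean_rows = [row for row in raw_rows if None not in row]


# Data transformation: rescale a feature column to the range [0, 1].
def min_max_scale(column):
    lo, hi = min(column), max(column)
    return [(value - lo) / (hi - lo) for value in column]


feature_1 = min_max_scale([row[0] for row in clean_rows])
feature_2 = min_max_scale([row[1] for row in clean_rows])
labels = [row[2] for row in clean_rows]
dataset = list(zip(feature_1, feature_2, labels))

# Split: shuffle with a fixed seed, then keep 75% for training and 25% for testing.
rng = random.Random(42)
rng.shuffle(dataset)
split_at = int(0.75 * len(dataset))
train_set, test_set = dataset[:split_at], dataset[split_at:]
```

The sketch follows the order of the list above. In practice, scaling parameters are often computed from the training portion only, so that no information from the test set influences the model.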
4. Semi-Supervised Learning
To work with the unlabeled dataset, there must be a relationship between the
objects. To understand this, semi-supervised learning uses any of the following
assumptions:
o Continuity assumption:
As per the continuity assumption, objects near each other tend to share
the same group or label. This assumption is also used in supervised
learning, where the classes are separated by decision boundaries. In
semi-supervised learning, the smoothness assumption is combined with the
requirement that decision boundaries lie in low-density regions.
o Cluster assumption: The data are divided into different discrete clusters,
and points in the same cluster share the same output label.
o Manifold assumption: The data lie on a manifold of fewer dimensions than
the input space, which makes it possible to use distances and densities
defined on that manifold.
o High-dimensional data are often generated by a process that has fewer
degrees of freedom and may be hard to model directly. (This assumption
becomes practical when the dimensionality is high.)
Semi-supervised learning uses pseudo-labeling to train the model with less
labeled training data than supervised learning requires. The process can
combine various neural network models and training methods. The working of
semi-supervised learning is explained in the points below:
o First, the model is trained on the small amount of labeled training data,
as in supervised learning. Training continues until the model gives
accurate results.
o Next, the trained model predicts labels for the unlabeled dataset. These
predictions are called pseudo labels, and they may not be accurate.
o The labels from the labeled training data and the pseudo labels are then
linked together.
o The input data from the labeled and unlabeled training sets are also
linked.
o Finally, the model is trained again on the combined input, as in the first
step. This reduces errors and improves the accuracy of the model.
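The pseudo-labeling steps above can be sketched as follows. The notes mention neural network models; to keep the example short and self-contained, this sketch swaps in a nearest-centroid classifier instead. The data points and the helper names `fit_centroids` and `predict` are hypothetical.

```python
def fit_centroids(X, y):
    """Train a nearest-centroid 'model': the mean feature vector of each class."""
    sums, counts = {}, {}
    for features, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(centroids, X):
    """Assign each point the label of its closest centroid."""
    def squared_distance(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [min(centroids, key=lambda c: squared_distance(centroids[c], x)) for x in X]


# Small labeled set and a set of unlabeled points (both made up for illustration).
X_labeled = [[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [4.8, 5.2]]
y_labeled = ["a", "a", "b", "b"]
X_unlabeled = [[0.9, 1.1], [5.1, 4.9], [1.5, 1.4], [4.5, 4.6]]

# Step 1: train on the labeled data only.
model = fit_centroids(X_labeled, y_labeled)

# Step 2: predict pseudo labels for the unlabeled data.
pseudo_labels = predict(model, X_unlabeled)

# Steps 3 and 4: link the inputs and the labels of both sets.
X_combined = X_labeled + X_unlabeled
y_combined = y_labeled + pseudo_labels

# Step 5: retrain on the combined data.
model = fit_centroids(X_combined, y_combined)
```

Practical pseudo-labeling usually keeps only the predictions the model is confident about. That filtering step is omitted here for brevity.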
Here are some specific areas where machine learning is being used:
Linear regression is the most basic form of regression analysis and is used
to model a linear relationship between a single dependent variable and one or
more independent variables.
Here, a linear regression model is instantiated to fit a linear relationship
between input features (X) and target values (y). This code is used for simple
demonstration of the approach.
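A minimal sketch of such a model, assuming ordinary least squares solved through the normal equations, is given below. The class name `LinearRegressionSketch`, its attribute names, and the sample data are hypothetical and chosen only for illustration.

```python
class LinearRegressionSketch:
    """Ordinary least squares via the normal equations (X^T X) b = X^T y."""

    def fit(self, X, y):
        # Prepend a constant 1.0 to every row so the first coefficient is the intercept.
        rows = [[1.0] + list(x) for x in X]
        n = len(rows[0])
        # Build the augmented matrix [X^T X | X^T y].
        A = [
            [sum(r[i] * r[j] for r in rows) for j in range(n)]
            + [sum(r[i] * target for r, target in zip(rows, y))]
            for i in range(n)
        ]
        # Solve with Gauss-Jordan elimination and partial pivoting.
        for col in range(n):
            pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
            A[col], A[pivot] = A[pivot], A[col]
            for r in range(n):
                if r != col:
                    factor = A[r][col] / A[col][col]
                    A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
        coefficients = [A[i][n] / A[i][i] for i in range(n)]
        self.intercept_, self.coef_ = coefficients[0], coefficients[1:]
        return self

    def predict(self, X):
        return [
            self.intercept_ + sum(c * v for c, v in zip(self.coef_, x))
            for x in X
        ]


# Input features (X) and target values (y) generated from y = 1 + 2*x1 + 3*x2.
X = [[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]]
y = [1.0 + 2.0 * x1 + 3.0 * x2 for x1, x2 in X]

model = LinearRegressionSketch().fit(X, y)
predictions = model.predict([[6.0, 1.0]])
```

Libraries such as scikit-learn expose the same fit/predict pattern through `sklearn.linear_model.LinearRegression`, which is usually preferable to a hand-written solver.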
Types of Linear Regression
There are two main types of linear regression:
Simple Linear Regression
This is the simplest form of linear regression, and it involves only one
independent variable and one dependent variable. The equation for simple
linear regression is:
Y = β0 + β1X
where:
Y is the dependent variable
X is the independent variable
β0 is the intercept
β1 is the slope
Multiple Linear Regression
This involves more than one independent variable and one dependent variable.
The equation for multiple linear regression is:
Y = β0 + β1X1 + β2X2 + … + βpXp
where:
Y is the dependent variable
X1, X2, …, Xp are the independent variables
β0 is the intercept
β1, β2, …, βp are the slopes
What is the Best-Fit Line?
Our primary objective when using linear regression is to locate the best-fit
line, that is, the line for which the error between the predicted and actual
values is as small as possible.
The best-fit line equation provides a straight line that represents the
relationship between the dependent and independent variables. The slope of
the line indicates how much the dependent variable changes for a unit change
in the independent variable(s).
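For a single independent variable, the best-fit line can be computed directly. The sketch below assumes that "error" means the sum of squared differences between predicted and actual values (the least squares criterion). The data points and function names are hypothetical.

```python
def best_fit_line(xs, ys):
    """Return (intercept, slope) of the least squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by the variance of x.
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def sum_squared_error(xs, ys, intercept, slope):
    """Sum of squared differences between actual and predicted values."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

b0, b1 = best_fit_line(xs, ys)
best_error = sum_squared_error(xs, ys, b0, b1)

# Any other line, for example one with a slightly different slope, has a larger error.
other_error = sum_squared_error(xs, ys, b0, b1 + 0.1)
```

Here the slope `b1` says how much the predicted value changes when the independent variable increases by one unit.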
Linear Regression | Logistic Regression
Linear regression is used for solving regression problems. | Logistic regression is used for solving classification problems.
The least squares estimation method is used to estimate the model parameters. | The maximum likelihood estimation method is used to estimate the model parameters.
It requires a linear relationship between the dependent and independent variables. | It does not require a linear relationship between the dependent and independent variables.