Null 5
Null 5
Machine Learning:
• Machine Learning is said as a subset of artificial intelligence that is mainly
concerned with the development of algorithms which allow a computer to learn
from the data and past experiences on their own.
• Machine Learning is the field of study that gives computers the capability to
learn without being explicitly programmed.
• A Machine Learning system learns from historical data, builds the
prediction models, and whenever it receives new data, predicts the output
for it.
There are four key steps you would follow when creating a machine learning
model.
For labeled, data should be divided into a training subset and a testing subset.
The former is used to train the model and the latter to evaluate the effectiveness
of the model and find ways to improve it.
2. Select an Algorithm to Apply to the Training Data Set
The type of machine learning algorithm you choose will primarily depend on a
few aspects:
For prediction or classification use cases, you would usually use regression
algorithms such as ordinary least square regression or logistic regression. With
unlabeled data, you are likely to rely on clustering algorithms such as k -means or
nearest neighbor. Some algorithms like neural networks can be configured to
work with both clustering and prediction use cases.
Training the algorithm is the process of tuning model variables and parameters to
more accurately predict the appropriate results. Training the machine learning
algorithm is usually iterative and uses a variety of optimization methods
depending upon the chosen model. These optimization methods do not require
human intervention which is part of the power of machine learning. The machine
learns from the data you give it with little to no specific direction from the user.
The last step is to feed new data to the model as a means of improving its
effectiveness and accuracy over time. Where the new information will come from
depends on the nature of the problem to be solved. For instance, a machine
learning model for self-driving cars will ingest real-world information on road
conditions, objects and traffic laws.
The need for machine learning is increasing day by day. The reason behind the
need for machine learning is that it is capable of doing tasks that are too complex
for a person to implement directly. As a human, we have some limitations as we
cannot access the huge amount of data manually, so for this, we need some
computer systems and here comes the machine learning to make things easy for
us.
We can train machine learning algorithms by providing them the huge
amount of data and let them explore the data, construct the models, and predict
the required output automatically. The performance of the machine learning
algorithm depends on the amount of data, and it can be determined by the cost
function. With the help of machine learning, we can save both time and money.
The importance of machine learning can be easily understood by its uses
cases, Currently, machine learning is used in self-driving cars, cyber fraud
detection, face recognition, and friend suggestion by Facebook, etc. Various
top companies such as Netflix and Amazon have build machine learning models
that are using a vast amount of data to analyze the user interest and recommend
product accordingly.
1. Supervised learning
2. Semi-supervised learning
3. Unsupervised learning
4. Reinforcement learning
1) Supervised Learning
Supervised learning is a type of machine learning method in which we
provide sample labeled data to the machine learning system in order to train it,
and on that basis, it predicts the output.
The goal of supervised learning is to map input data with the output data.
The supervised learning is based on supervision, and it is the same as when a
student learns things in the supervision of the teacher. Supervised learning can be
grouped further in two categories of algorithms:
o Classification
o Regression
2) Semi-supervised learning:
o Clustering
o Association
4) Reinforcement Learning
Reinforcement learning is a feedback-based learning method, in which a
learning agent gets a reward for each right action and gets a penalty for each
wrong action. The agent learns automatically with these feedbacks and improves
its performance. In reinforcement learning, the agent interacts with the
environment and explores it. The goal of an agent is to get the most reward points,
and hence, it improves its performance.
Supervised learning
Supervised learning, as the name indicates, has the presence of a supervisor as a
teacher. Basically supervised learning is when we teach or train the machine using
data that is well labelled. Which means some data is already tagged with the correct
answer. After that, the machine is provided with a new set of examples(data) so that
the supervised learning algorithm analyses the training data(set of training examples)
and produces a correct outcome from labelled data.
1. Regression
Regression algorithms are used if there is a relationship between the input variable
and the output variable. It is used for the prediction of continuous variables, such as
Weather forecasting, Market Trends, etc. Below are some popular Regression
algorithms which come under supervised learning:
o Linear Regression
o Regression Trees
o Non-Linear Regression
o Bayesian Linear Regression
o Polynomial Regression
2. Classification
Classification algorithms are used when the output variable is categorical, which
means there are two classes such as Yes-No, Male-Female, True-false, etc.
Spam Filtering,
o Random Forest
o Decision Trees
o Logistic Regression
o Support vector Machines
Advantages of Supervised learning:
o With the help of supervised learning, the model can predict the output on the
basis of prior experiences.
o In supervised learning, we can have an exact idea about the classes of objects.
o Supervised learning model helps us to solve various real-world problems such
as fraud detection, spam filtering, etc.
Unsupervised learning
Unsupervised learning is the training of a machine using information that is neither
classified nor labeled and allowing the algorithm to act on that information without
guidance. Here the task of the machine is to group unsorted information according to
similarities, patterns, and differences without any prior training of data. Unsupervised
learning cannot be directly applied to a regression or classification problem because
unlike supervised learning, we have the input data but no corresponding output data.
The goal of unsupervised learning is to find the underlying structure of dataset,
group that data according to similarities, and represent that dataset in a
compressed format.
o Unsupervised learning is helpful for finding useful insights from the data.
o Unsupervised learning is much similar as a human learns to think by their own
experiences, which makes it closer to the real AI.
o Unsupervised learning works on unlabeled and uncategorized data which make
unsupervised learning more important.
o In real-world, we do not always have input data with the corresponding output
so to solve such cases, we need unsupervised learning.
Here, we have taken an unlabeled input data, which means it is not categorized and
corresponding outputs are also not given. Now, this unlabeled input data is fed to the
machine learning model in order to train it. Firstly, it will interpret the raw data to find
the hidden patterns from the data and then will apply suitable algorithms such as k-
means clustering, Decision tree, etc.
o K-means clustering
o KNN (k-nearest neighbors)
o Hierarchal clustering
o Anomaly detection
o Neural Networks
o Principle Component Analysis
o Independent Component Analysis
o Apriori algorithm
o Singular value decomposition
Supervised Unsupervised
Parameters machine learning machine learning
Computational
Complexity Simpler method Computationally complex
Semi-supervised Learning
• Semi-supervised machine learning is a combination
of supervised and unsupervised learning.
• It uses a small amount of labeled data and a large amount of unlabeled data,
which provides the benefits of both unsupervised and supervised learning while
avoiding the challenges of finding a large amount of labeled data.
• That means you can train a model to label data without having to use as much
labeled training data.
• The basic disadvantage of supervised learning is that it requires hand-labeling
by ML specialists or data scientists, and it also requires a high cost to process.
• Further unsupervised learning also has a limited spectrum for its applications.
• To overcome these drawbacks of supervised learning and unsupervised
learning algorithms, the concept of Semi-supervised learning is
introduced.
• In this algorithm, training data is a combination of both labeled and unlabeled
data.
• However, labeled data exists with a very small amount while it consists of a
huge amount of unlabeled data.
• Initially, similar data is clustered along with an unsupervised learning algorithm,
and further, it helps to label the unlabeled data into labeled data.
• It is why label data is a comparatively, more expensive acquisition than
unlabeled data.
Reinforcement learning
• Reinforcement learning is an area of Machine Learning.
• It is about taking suitable action to maximize reward in a particular situation.
• It is employed by various software and machines to find the best possible
behaviour or path it should take in a specific situation.
• Reinforcement learning differs from supervised learning in a way that in
supervised learning the training data has the answer key with it so the model is
trained with the correct answer itself whereas in reinforcement learning, there
is no answer but the reinforcement agent decides what to do to perform the
given task.
• In the absence of a training dataset, it is bound to learn from its experience.
Example: The problem is as follows: We have an agent and a reward, with many
hurdles in between. The agent is supposed to find the best possible path to reach the
reward. The following problem explains the problem more easily.
The above image shows the robot, diamond, and fire. The goal of the robot is to get
the reward that is the diamond and avoid the hurdles that are fired. The robot learns
by trying all the possible paths and then choosing the path which gives him the reward
with the least hurdles. Each right step will give the robot a reward and each wrong step
will subtract the reward of the robot. The total reward will be calculated when it reaches
the final reward that is the diamond.
Main points in Reinforcement learning –
• Input: The input should be an initial state from which the model will start
• Output: There are many possible outputs as there are a variety of solutions to
a particular problem
• Training: The training is based upon the input, The model will return a state and
the user will decide to reward or punish the model based on its output.
• The model keeps continues to learn.
• The best solution is decided based on the maximum reward.
o Resource Management:
o Robotics:
RL is widely being used in Robotics applications. Robots are used in the
industrial and manufacturing area, and these robots are made more powerful
with reinforcement learning. There are different industries that have their vision
of building intelligent robots using AI and Machine learning technology.
o Text Mining
Text-mining, one of the great applications of NLP, is now being implemented
with the help of Reinforcement Learning by Salesforce company.
Advantages and Disadvantages of Reinforcement Learning
Advantages
Disadvantage
o Amazon Sagemaker
o TIBCO Software
o Alteryx Analytics
o SAS
o H2O.ai
o DataRobot
o RapidMiner