UNIT-1
Machine learning
Machine learning is a growing technology that enables computers to learn automatically from past
data. Machine learning uses various algorithms to build mathematical models and make
predictions using historical data or information. Currently, it is used for various tasks such
as image recognition, speech recognition, email filtering, Facebook auto-tagging, recommender
systems, and many more.
Machine Learning is a subset of artificial intelligence that is mainly concerned with
the development of algorithms that allow a computer to learn from data and past
experience on its own. The term machine learning was first introduced by Arthur
Samuel in 1959. We can define it in a summarized way as:
Machine learning enables a machine to automatically learn from data, improve performance
from experiences, and predict things without being explicitly programmed.
With the help of sample historical data, which is known as training data, machine learning
algorithms build a mathematical model that helps in making predictions or decisions
without being explicitly programmed. Machine learning brings computer science and
statistics together for creating predictive models. Machine learning constructs or uses the
algorithms that learn from historical data. The more data we provide, the better the
performance.
A machine has the ability to learn if it can improve its performance by gaining more data.
Suppose we have a complex problem in which we need to make some predictions. Instead
of writing code for it, we just need to feed the data to generic algorithms; with the help
of these algorithms, the machine builds the logic from the data and predicts the output. Machine
learning has changed the way we think about such problems. The block diagram below
explains the working of a machine learning algorithm:
Features of Machine Learning:
o Machine learning uses data to detect various patterns in a given dataset.
o It can learn from past data and improve automatically.
o It is a data-driven technology.
Machine learning is much similar to data mining, as both deal with huge amounts of data.
Classification of Machine Learning
Machine learning can be broadly classified into three types:
1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning
1) Supervised Learning
Supervised learning is a type of machine learning method in which we provide sample
labeled data to the machine learning system in order to train it, and on that basis, it predicts
the output.
The system creates a model using labeled data to understand the dataset and learn about each
example. Once training and processing are done, we test the model by providing sample
data to check whether it predicts the correct output.
The goal of supervised learning is to map input data to output data. Supervised
learning is based on supervision, much as a student learns things under the
supervision of a teacher. An example of supervised learning is spam filtering.
Supervised learning can be further divided into two categories of algorithms:
o Classification
o Regression
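The following is a minimal, hedged sketch of supervised learning (a classification task) using the scikit-learn library, assuming it is installed. The tiny "spam" dataset and its two features are invented purely for illustration.

```python
# A minimal supervised-learning sketch (classification) with scikit-learn.
# The tiny "spam" dataset below is invented purely for illustration.
from sklearn.linear_model import LogisticRegression

# Labeled training data: [number_of_links, number_of_spam_words] per email
X_train = [[0, 1], [1, 0], [8, 9], [7, 10], [1, 2], [9, 7]]
y_train = [0, 0, 1, 1, 0, 1]           # 0 = not spam, 1 = spam

model = LogisticRegression()
model.fit(X_train, y_train)            # learn a mapping from inputs to labels

# Predict the class of a new, unseen email
print(model.predict([[6, 8]]))         # expected output: [1] (spam)
```

Once trained on the labeled examples, the model predicts the output for new data points, which is exactly the input-to-output mapping described above.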
2) Unsupervised Learning
Unsupervised learning is a learning method in which a machine learns without any
supervision.
The training is provided to the machine with the set of data that has not been labeled,
classified, or categorized, and the algorithm needs to act on that data without any supervision.
The goal of unsupervised learning is to restructure the input data into new features or a group
of objects with similar patterns.
In unsupervised learning, we don't have a predetermined result. The machine tries to find
useful insights from the huge amount of data. It can be further classified into two categories
of algorithms:
o Clustering
o Association
3) Reinforcement Learning
Reinforcement learning is a feedback-based learning method, in which a learning agent gets a
reward for each right action and a penalty for each wrong action. The agent learns
automatically from this feedback and improves its performance. In reinforcement learning,
the agent interacts with the environment and explores it. The goal of an agent is to get the
most reward points, and hence, it improves its performance.
A robotic dog that automatically learns the movements of its limbs is an example of
reinforcement learning.
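Below is a minimal sketch of one common reinforcement learning algorithm, tabular Q-learning, to illustrate the reward-and-penalty feedback loop described above. The corridor environment, reward values, and hyperparameters are all invented for illustration and are not from the text.

```python
# A minimal tabular Q-learning sketch on an invented 5-state corridor:
# the agent starts at state 0 and gets a reward of +1 for reaching state 4,
# with a small penalty for every other step. All numbers are illustrative.
import random

n_states, actions = 5, [-1, +1]        # move left or move right
Q = {(s, a): 0.0 for s in range(n_states) for a in actions}
alpha, gamma, epsilon = 0.5, 0.9, 0.1  # learning rate, discount, exploration

for episode in range(200):
    s = 0
    while s != n_states - 1:
        if random.random() < epsilon:                      # explore
            a = random.choice(actions)
        else:                                              # exploit best known action
            a = max(actions, key=lambda act: Q[(s, act)])
        s_next = min(max(s + a, 0), n_states - 1)
        r = 1.0 if s_next == n_states - 1 else -0.01       # reward / penalty feedback
        # Q-learning update: improve the estimate from the feedback received
        Q[(s, a)] += alpha * (r + gamma * max(Q[(s_next, b)] for b in actions) - Q[(s, a)])
        s = s_next

# Learned policy: the best action in each non-terminal state (should be +1, i.e. move right)
print({s: max(actions, key=lambda act: Q[(s, act)]) for s in range(n_states - 1)})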
History of Machine Learning
o 1834: In 1834, Charles Babbage, the father of the computer, conceived a device that
could be programmed with punch cards. The machine was never built, but
all modern computers rely on its logical structure.
o 1936: In 1936, Alan Turing described how a machine can determine and
execute a set of instructions.
o 1950: In 1950, Alan Turing published a seminal paper, "Computing Machinery and
Intelligence," on the topic of artificial intelligence. In his paper, he asked, "Can
machines think?"
o 1952: Arthur Samuel, the pioneer of machine learning, created a program
that helped an IBM computer play checkers. It performed better the more it
played.
o 1959: In 1959, the term "Machine Learning" was first coined by Arthur Samuel.
o The period from 1974 to 1980 was a tough time for AI and ML researchers, and this
period was called the AI winter.
o During this period, machine translation failed and people lost interest in AI,
which led to reduced government funding for research.
o 1959: In 1959, the first neural network was applied to a real-world problem to remove
echoes over phone lines using an adaptive filter.
o 1985: In 1985, Terry Sejnowski and Charles Rosenberg invented a neural
network NETtalk, which was able to teach itself how to correctly pronounce 20,000
words in one week.
o 1997: IBM's Deep Blue intelligent computer won a chess match against chess
expert Garry Kasparov and became the first computer to beat a human chess expert.
Machine Learning in the 21st century
o 2006: In 2006, computer scientist Geoffrey Hinton gave neural-net research the new
name "deep learning," and nowadays it has become one of the most
trending technologies.
o 2012: In 2012, Google created a deep neural network that learned to recognize
images of humans and cats in YouTube videos.
o 2014: In 2014, the chatbot "Eugene Goostman" was claimed to have passed the Turing Test. It was
the first chatbot to convince 33% of the human judges that it was not a machine.
o 2014: DeepFace was a deep neural network created by Facebook, which claimed
that it could recognize a person with the same precision as a human.
o 2016: AlphaGo beat the world's second-ranked Go player, Lee Sedol. In
2017, it beat the top-ranked player, Ke Jie.
o 2017: In 2017, Alphabet's Jigsaw team built an intelligent system able to
learn about online trolling. It read millions of comments from different websites in
order to learn to stop online trolling.
Well-Posed Learning Problem
A computer program is said to learn from experience E with respect to some class of tasks T and
performance measure P if its performance at tasks in T, as measured by P, improves with
experience E. A well-posed learning problem is therefore identified by three features:
Task
Performance Measure
Experience
Some examples that clearly define a well-posed learning problem are –
1. A checkers learning problem
Task – playing the game of checkers
Performance Measure – percent of games won against opponents
Experience – playing practice games against itself
2. A handwriting recognition problem
Task – recognizing handwritten words within images
Performance Measure – percent of words correctly classified
Experience – a database of handwritten words with given classifications
3. A robot driving problem
Task – driving on public four-lane highways using vision sensors
Performance Measure – average distance travelled before an error
Experience – a sequence of images and steering commands recorded while
observing a human driver
The steps for designing a learning system are:
Step 1 - Choosing the Training Experience: The first and most important
task is to choose the training data or training experience that will be fed to
the machine learning algorithm. It is important to note that the data or
experience we feed to the algorithm has a significant impact on the
success or failure of the model, so the training data or experience should be
chosen wisely.
Step 2 - Choosing the Target Function: The next important step is choosing the
target function. Based on the knowledge fed to the algorithm, the
machine learning system chooses a NextMove function that describes which
legal move should be taken. For example, while playing chess against
an opponent, when the opponent makes a move, the machine learning algorithm
decides which of the possible legal moves to take in order to succeed.
Step 3 - Choosing a Representation for the Target Function: Once the machine
knows all the possible legal moves, the next step is to choose a representation
for the target function, e.g., linear equations, a hierarchical graph representation,
a tabular form, etc. Using this representation, the NextMove function
selects, out of all the legal moves, the move that offers the highest
success rate. For example, while playing chess, if the machine has four possible
moves, it will choose the move that is most likely to lead to success.
Step 4 - Choosing a Function Approximation Algorithm: An optimized move
cannot be chosen from the training data alone. The training data has to be
worked through as a set of examples; from these examples the machine
approximates which moves to choose and then receives feedback on them.
For example, when training data for playing chess is fed to the algorithm, the
machine does not initially know whether a move will fail or succeed; from each
failure or success it learns which move to choose next and what its
success rate is.
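To make Steps 3 and 4 concrete, here is a small hedged sketch of a linear representation of the target function together with a Least Mean Squares (LMS) weight update, in the spirit of the classic checkers design. The board features (x1..x3), the sample training values, and the learning rate are all invented for illustration.

```python
# Sketch of a linear target-function representation and the LMS weight-update
# rule. The board features and the sample training values are illustrative only.

def v_hat(weights, features):
    """Approximate board value: V_hat(b) = w0 + w1*x1 + w2*x2 + w3*x3."""
    return weights[0] + sum(w * x for w, x in zip(weights[1:], features))

def lms_update(weights, features, v_train, lr=0.01):
    """LMS rule: w_i <- w_i + lr * (V_train(b) - V_hat(b)) * x_i."""
    error = v_train - v_hat(weights, features)
    weights[0] += lr * error                      # bias term uses x0 = 1
    for i, x in enumerate(features, start=1):
        weights[i] += lr * error * x
    return weights

# Toy training examples: (board features, training value assigned to that board)
examples = [([3, 1, 0], 10.0), ([1, 3, 2], -8.0), ([2, 2, 1], 0.0)]
weights = [0.0, 0.0, 0.0, 0.0]
for _ in range(100):                              # repeatedly adjust the weights
    for features, v_train in examples:
        weights = lms_update(weights, features, v_train)
print([round(w, 2) for w in weights])
```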
Step 5 - Final Design: The final design is created at last, after the system has gone
through a number of examples, failures and successes, correct and incorrect
decisions, and so on. Example: Deep Blue, an ML-based intelligent computer,
won a chess match against chess expert Garry Kasparov and became the first
computer to beat a human chess expert.
Artificial Neural Network (ANN)
The given figure illustrates the typical diagram of a biological neural network, and a typical
artificial neural network looks something like the given figure.
Dendrites from Biological Neural Network represent inputs in Artificial Neural
Networks, cell nucleus represents Nodes, synapse represents Weights, and Axon
represents Output.
Relationship between biological neural network and artificial neural network:
Biological Neural Network    Artificial Neural Network
Dendrites                    Inputs
Cell nucleus                 Nodes
Synapse                      Weights
Axon                         Output
Input Layer:
As the name suggests, it accepts inputs in several different formats provided by the
programmer.
Hidden Layer:
The hidden layer is present between the input and output layers. It performs all the calculations
to find hidden features and patterns.
Output Layer:
The input goes through a series of transformations using the hidden layer, which finally
results in output that is conveyed using this layer.
The artificial neural network takes input and computes the weighted sum of the inputs and
includes a bias. This computation is represented in the form of a transfer function.
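Below is a hedged NumPy sketch of the computation just described: a weighted sum of the inputs plus a bias, passed through a transfer (activation) function. The sigmoid choice and the example numbers are assumptions made for illustration.

```python
# Sketch of a single artificial neuron: weighted sum of inputs plus a bias,
# passed through a transfer (activation) function. The sigmoid choice and
# the example numbers are illustrative assumptions.
import numpy as np

def neuron(inputs, weights, bias):
    z = np.dot(weights, inputs) + bias      # weighted sum + bias
    return 1.0 / (1.0 + np.exp(-z))         # sigmoid transfer function

x = np.array([0.5, 0.2, 0.8])               # inputs from the input layer
w = np.array([0.4, -0.6, 0.9])              # connection weights (synapses)
print(neuron(x, w, bias=0.1))               # output passed to the next layer
```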
Advantages of Artificial Neural Network
Parallel processing capability:
Artificial neural networks can perform more than one task simultaneously.
Capability to work with incomplete knowledge:
After ANN training, the network may produce output even with inadequate data. The loss
of performance here depends on the importance of the missing data.
Having a memory distribution:
For an ANN to be able to adapt, it is important to determine the examples and to train
the network according to the desired output by showing these examples to the network.
The success of the network is directly proportional to the selected instances; if the
event cannot be shown to the network in all its aspects, the network can produce false output.
Having fault tolerance:
Corruption of one or more cells of an ANN does not prevent it from generating output, and this
feature makes the network fault-tolerant.
Disadvantages of Artificial Neural Network
Assurance of proper network structure:
There is no particular guideline for determining the structure of artificial neural networks;
the appropriate network structure is achieved through experience and trial and error.
Unrecognized behavior of the network:
This is the most significant issue with ANNs. When an ANN produces a solution, it does not
give any insight into why and how, which decreases trust in the network.
Hardware dependence:
Artificial neural networks need processors with parallel processing power, in accordance with their
structure. Therefore, realizing them depends on suitable hardware.
Difficulty of presenting the problem to the network:
ANNs can work only with numerical data, so problems must be converted into numerical values
before being introduced to the ANN. The representation mechanism chosen here will
directly impact the performance of the network, and it relies on the user's abilities.
Clustering
Clustering is an unsupervised learning technique that groups the unlabelled dataset. It does this
by finding similar patterns in the unlabelled dataset, such as shape, size, color, behavior,
etc., and divides the data according to the presence or absence of those patterns.
After applying this clustering technique, each cluster or group is given a cluster-ID, and
the ML system can use this ID to simplify the processing of large and complex datasets.
The clustering technique can be widely used in various tasks. Some of the most common uses of
this technique are:
o Market Segmentation
o Statistical data analysis
o Social network analysis
o Image segmentation
o Anomaly detection, etc.
Apart from these general usages, it is used by Amazon in its recommendation system to
provide recommendations based on a user's past product searches. Netflix also uses this
technique to recommend movies and web series to its users based on their watch history.
The diagram below explains the working of the clustering algorithm: different
fruits are divided into several groups with similar properties.
Types of Clustering Methods
The clustering methods are broadly divided into hard clustering (each data point belongs to only
one group) and soft clustering (a data point can also belong to more than one group). Various
other approaches to clustering also exist. Below are the main clustering methods
used in machine learning:
1. Partitioning Clustering
2. Density-Based Clustering
3. Distribution Model-Based Clustering
4. Hierarchical Clustering
5. Fuzzy Clustering
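As a concrete illustration of the first method above, here is a minimal, hedged sketch of partitioning clustering (K-Means) using scikit-learn, assuming it is installed. The 2-D points are invented; in practice the features could be shape, size, or color measurements.

```python
# A minimal partitioning-clustering (K-Means) sketch with scikit-learn.
# The 2-D points below are invented for illustration.
from sklearn.cluster import KMeans

X = [[1, 2], [1, 4], [2, 3],      # one natural group
     [9, 8], [8, 9], [10, 10]]    # another natural group

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)                     # cluster-ID assigned to each data point
print(kmeans.predict([[0, 0], [9, 9]]))   # cluster-IDs for new points
```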
o Decision Tree is a Supervised learning technique that can be used for both classification
and Regression problems, but mostly it is preferred for solving Classification problems. It is a
tree-structured classifier, where internal nodes represent the features of a dataset,
branches represent the decision rules and each leaf node represents the outcome.
o In a Decision tree, there are two types of nodes: the Decision Node and the Leaf
Node. Decision nodes are used to make decisions and have multiple branches, whereas
leaf nodes are the outputs of those decisions and do not contain any further branches.
o The decisions or the test are performed on the basis of features of the given dataset.
o It is a graphical representation for getting all the possible solutions to a problem/decision
based on given conditions.
o It is called a decision tree because, similar to a tree, it starts with the root node, which
expands on further branches and constructs a tree-like structure.
o In order to build a tree, we use the CART algorithm, which stands for Classification and
Regression Tree algorithm.
o A decision tree simply asks a question, and based on the answer (Yes/No), it further splits the
tree into subtrees.
o Below diagram explains the general structure of a decision tree:
Note: A decision tree can contain categorical data (YES/NO) as well as numeric data.
o Decision Trees usually mimic human thinking while making a decision, so they are easy to
understand.
o The logic behind the decision tree can be easily understood because it shows a tree-like
structure.
Leaf Node: Leaf nodes are the final output nodes, and the tree cannot be split further after
reaching a leaf node.
Splitting: Splitting is the process of dividing the decision node/root node into sub-nodes according
to the given conditions.
Parent/Child node: The root node of the tree is called the parent node, and other nodes are called
the child nodes.
In a decision tree, for predicting the class of the given dataset, the algorithm starts from the
root node of the tree. This algorithm compares the values of root attribute with the record
(real dataset) attribute and, based on the comparison, follows the branch and jumps to the
next node.
For the next node, the algorithm again compares the attribute value with the other sub-nodes
and moves further. It continues this process until it reaches a leaf node of the tree. The
complete process can be better understood using the below algorithm:
o Step-1: Begin the tree with the root node, say S, which contains the complete
dataset.
o Step-2: Find the best attribute in the dataset using an Attribute Selection Measure
(ASM).
o Step-3: Divide S into subsets that contain possible values for the best attribute.
o Step-4: Generate the decision tree node, which contains the best attribute.
o Step-5: Recursively make new decision trees using the subsets of the dataset created
in Step-3. Continue this process until a stage is reached where you cannot further
classify the nodes; call the final node a leaf node.
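Step-2 above relies on an Attribute Selection Measure; a common choice is information gain based on entropy. Below is a small hedged sketch of that calculation; the tiny salary/distance dataset is invented for illustration and is not the author's example.

```python
# A small sketch of an Attribute Selection Measure (ASM): information gain
# based on entropy. The tiny yes/no dataset is invented for illustration.
from collections import Counter
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(rows, labels, attribute_index):
    """Gain = entropy(parent) - weighted entropy of the subsets after the split."""
    subsets = {}
    for row, label in zip(rows, labels):
        subsets.setdefault(row[attribute_index], []).append(label)
    weighted = sum(len(s) / len(labels) * entropy(s) for s in subsets.values())
    return entropy(labels) - weighted

# Toy data: each row is [Salary, Distance]; label = whether the offer is accepted
rows   = [["high", "near"], ["high", "far"], ["low", "near"], ["low", "far"]]
labels = ["yes", "yes", "no", "no"]
print(information_gain(rows, labels, 0))   # splitting on Salary: gain = 1.0
print(information_gain(rows, labels, 1))   # splitting on Distance: gain = 0.0
```

The attribute with the higher gain (here, Salary) would be chosen as the decision node.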
Example: Suppose there is a candidate who has a job offer and wants to decide whether he
should accept the offer or not. To solve this problem, the decision tree starts with the root
node (Salary attribute by ASM). The root node splits further into the next decision node
(distance from the office) and one leaf node based on the corresponding labels. The next
decision node further gets split into one decision node (Cab facility) and one leaf node.
Finally, the decision node splits into two leaf nodes (Accepted offers and Declined offer).
Consider the below diagram:
Advantages of the Decision Tree
o It is simple to understand, as it follows the same process that a human follows while
making a decision in real life.
o It can be very useful for solving decision-related problems.
o It helps to think about all the possible outcomes for a problem.
o There is less requirement of data cleaning compared to other algorithms.
Bayesian Network
A Bayesian network is a probabilistic graphical model that represents a set of variables and their
conditional dependencies using a directed graph. It is also called a Bayes network, belief
network, decision network, or Bayesian model.
Bayesian networks are probabilistic, because these networks are built from a probability
distribution, and also use probability theory for prediction and anomaly detection.
Real world applications are probabilistic in nature, and to represent the relationship between
multiple events, we need a Bayesian network. It can also be used in various tasks
including prediction, anomaly detection, diagnostics, automated insight, reasoning, time
series prediction, and decision making under uncertainty.
A Bayesian Network can be used for building models from data and expert opinions, and it
consists of two parts:
o Directed Acyclic Graph
o Table of conditional probabilities
The generalized form of a Bayesian network that represents and solves decision problems under
uncertain knowledge is known as an Influence diagram.
A Bayesian network graph is made up of nodes and arcs (directed links), where each node
corresponds to a random variable and each arc represents a direct causal relationship or
conditional dependence between the connected variables.
Note: The Bayesian network graph does not contain any cycles. Hence, it is known as
a directed acyclic graph, or DAG.
The Bayesian network has mainly two components:
o Causal Component
o Actual numbers
Each node in the Bayesian network has a conditional probability distribution P(Xi | Parent(Xi)),
which determines the effect of the parents on that node.
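To make the conditional probability tables concrete, here is a tiny hedged sketch of a two-node network, Cloudy -> Rain. Every probability value is invented for illustration; the point is only that each node stores P(Xi | Parent(Xi)) and that joint probabilities are products of these local distributions.

```python
# A tiny Bayesian-network sketch with two nodes, Cloudy -> Rain.
# All probability values are invented for illustration.

p_cloudy = {True: 0.5, False: 0.5}                     # P(Cloudy)
p_rain_given_cloudy = {True: {True: 0.8, False: 0.2},  # P(Rain | Cloudy=True)
                       False: {True: 0.1, False: 0.9}} # P(Rain | Cloudy=False)

def joint(cloudy, rain):
    """P(Cloudy=cloudy, Rain=rain) = P(Cloudy) * P(Rain | Cloudy)."""
    return p_cloudy[cloudy] * p_rain_given_cloudy[cloudy][rain]

# P(Rain=True), obtained by summing out the parent variable
p_rain = sum(joint(c, True) for c in (True, False))
print(joint(True, True))   # 0.5 * 0.8 = 0.4
print(p_rain)              # 0.4 + 0.05 = 0.45
```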
Support Vector Machine (SVM)
Support Vector Machine is a supervised learning algorithm used for classification as well as
regression problems, though it is mostly used for classification. The goal of the SVM algorithm
is to create the best line or decision boundary that can segregate n-dimensional space into
classes so that we can easily put a new data point in the correct category in the future. This
best decision boundary is called a hyperplane.
SVM chooses the extreme points/vectors that help in creating the hyperplane. These extreme
cases are called support vectors, and hence the algorithm is termed Support Vector
Machine. Consider the below diagram, in which two different categories are
classified using a decision boundary or hyperplane:
Example: SVM can be understood with the example that we used in the KNN
classifier. Suppose we see a strange cat that also has some features of a dog. If we want a
model that can accurately identify whether it is a cat or a dog, such a model can be created
using the SVM algorithm. We first train our model with lots of images of cats and
dogs so that it can learn their different features, and then we test it with this
strange creature. The SVM creates a decision boundary between the two classes
(cat and dog) and chooses the extreme cases (support vectors) of each class; on the basis of
these support vectors, it will classify the creature as a cat. Consider the below
diagram:
SVM algorithm can be used for Face detection, image classification, text
categorization, etc.
Types of SVM
SVM can be of two types:
o Linear SVM: Linear SVM is used for linearly separable data, which means that if a
dataset can be classified into two classes by using a single straight line, then such data
is termed linearly separable data, and the classifier used is called a Linear SVM
classifier.
o Non-linear SVM: Non-linear SVM is used for non-linearly separable data, which
means that if a dataset cannot be classified by using a straight line, then such data is
termed non-linear data, and the classifier used is called a Non-linear SVM classifier.
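The following is a minimal, hedged scikit-learn sketch contrasting the two types above: a linear SVM on linearly separable points and an RBF-kernel SVM on points that no single straight line can separate. All toy points and the kernel parameter are assumptions for illustration.

```python
# A minimal SVM sketch with scikit-learn: a linear SVM for linearly separable
# data and an RBF-kernel (non-linear) SVM for data a straight line cannot separate.
from sklearn.svm import SVC

# Linearly separable toy data
X_lin = [[0, 0], [1, 1], [8, 8], [9, 9]]
y_lin = [0, 0, 1, 1]
linear_svm = SVC(kernel="linear").fit(X_lin, y_lin)
print(linear_svm.support_vectors_)          # the extreme points defining the hyperplane

# XOR-like data that no single straight line can separate
X_xor = [[0, 0], [1, 1], [0, 1], [1, 0]]
y_xor = [0, 0, 1, 1]
nonlinear_svm = SVC(kernel="rbf", gamma=1.0).fit(X_xor, y_xor)
print(nonlinear_svm.predict([[0.9, 0.1]]))  # likely [1], the class of the nearby point
```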
Genetic Algorithm
The genetic algorithm works on an evolutionary generational cycle to generate high-quality
solutions. It uses different operations that either enhance or replace the
population to give an improved fit solution.
It basically involves five phases to solve complex optimization problems, which are given
below:
o Initialization
o Fitness Assignment
o Selection
o Reproduction
o Termination
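The following is a minimal, hedged sketch of a genetic algorithm that walks through the five phases above. The toy goal (maximizing the number of 1-bits in a 10-bit string) and all parameter values are invented purely for illustration.

```python
# A minimal genetic-algorithm sketch illustrating the five phases above.
# The toy fitness goal and all parameters are illustrative assumptions.
import random

GENES, POP_SIZE, GENERATIONS, MUTATION_RATE = 10, 20, 50, 0.05

def fitness(individual):                    # Fitness assignment
    return sum(individual)

def select(population):                     # Selection (tournament of size 2)
    a, b = random.sample(population, 2)
    return a if fitness(a) >= fitness(b) else b

def reproduce(p1, p2):                      # Reproduction: crossover + mutation
    cut = random.randint(1, GENES - 1)
    child = p1[:cut] + p2[cut:]
    return [1 - g if random.random() < MUTATION_RATE else g for g in child]

# Initialization: a random population of bit strings
population = [[random.randint(0, 1) for _ in range(GENES)] for _ in range(POP_SIZE)]

for _ in range(GENERATIONS):                # Termination: fixed number of generations
    population = [reproduce(select(population), select(population))
                  for _ in range(POP_SIZE)]

best = max(population, key=fitness)
print(best, fitness(best))                  # the fittest solution found
```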
Issues in Machine Learning
1. Poor quality of data
Data plays a significant role in the machine learning process. One of the
significant issues that machine learning professionals face is the absence of
good quality data. Unclean and noisy data can make the whole process
extremely exhausting. We don't want our algorithm to make inaccurate or
faulty predictions, so the quality of the data is essential for enhancing the output.
2. Non-representative training data
To make sure our training model generalizes well, we have to ensure that the
sample training data is representative of the new cases to which we need to
generalize. The training data must cover the cases that have already occurred as
well as those that are occurring.
3. Lack of quality and quantity of data
A major issue that arises while using machine learning algorithms is the lack of quality as
well as quantity of data. Although data plays a vital role in machine learning, several
problems with the data itself can occur:
o Noisy data - It is responsible for inaccurate predictions that affect decisions as
well as the accuracy of classification tasks.
o Incorrect data - It is also responsible for faulty programming and faulty results obtained from
machine learning models; hence, incorrect data may affect the accuracy of the results.
o Generalizing output data - Sometimes generalizing output data becomes complex,
which results in comparatively poorer future actions.
4. Overfitting
Overfitting is one of the most common issues faced by machine learning engineers and data
scientists. When a machine learning model is trained with a huge amount of data, it starts
capturing the noise and inaccurate entries in the training data set, which negatively affects the
performance of the model.
5. Underfitting:
Underfitting is just the opposite of overfitting. When a machine learning model is trained with
too little data, it produces incomplete and inaccurate results and destroys the
accuracy of the machine learning model.
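A small hedged sketch can make the overfitting/underfitting contrast concrete: fitting polynomials of different degrees to noisy data drawn from a quadratic curve. The degrees, noise level, and random seed are arbitrary choices for illustration.

```python
# A small sketch contrasting underfitting and overfitting: fitting polynomials
# of different degrees to noisy data drawn from a quadratic curve.
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3, 3, 20)
y = x**2 + rng.normal(scale=1.0, size=x.size)   # true pattern + noise

x_test = np.linspace(-3, 3, 200)
y_test_true = x_test**2

for degree in (1, 2, 9):                        # underfit, good fit, overfit
    coeffs = np.polyfit(x, y, degree)
    y_pred = np.polyval(coeffs, x_test)
    error = np.mean((y_pred - y_test_true) ** 2)
    print(f"degree {degree}: test error = {error:.2f}")
# Typically the too-simple degree-1 model (underfitting) and the degree-9 model
# that captures the noise (overfitting) both show larger test error than degree 2.
```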
Comparison between Data Science and Machine Learning:

Data Science: It deals with understanding and finding hidden patterns or useful insights from the
data, which helps in taking smarter business decisions.
Machine Learning: It is a subfield of data science that enables the machine to learn from past data
and experiences automatically.

Data Science: It is used for discovering insights from the data.
Machine Learning: It is used for making predictions and classifying the result for new data points.

Data Science: It is a broad term that includes various steps to create a model for a given problem
and deploy the model.
Machine Learning: It is used in the data modeling step of the data science process.

Data Science: A data scientist needs to have skills to use big data tools like Hadoop, Hive and Pig,
statistics, and programming in Python, R, or Scala.
Machine Learning: A machine learning engineer needs to have skills such as computer science
fundamentals, programming skills in Python or R, statistics and probability concepts, etc.

Data Science: It can work with raw, structured, and unstructured data.
Machine Learning: It mostly requires structured data to work on.

Data Science: Data scientists spend a lot of time handling the data, cleansing the data, and
understanding its patterns.
Machine Learning: ML engineers spend a lot of time managing the complexities that occur during
the implementation of algorithms and the mathematical concepts behind them.