0% found this document useful (0 votes)

19 views

Machine - Learning - Unit - 1

Uploaded by

Santhosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views

Machine - Learning - Unit - 1

Uploaded by

Santhosh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 70

An introduction to

Machine Learning

Ms. Deepa A

Assistant Professor

Kongu Engineering College, Perundurai

AI, ML, DL
Artificial Programs with the ability to learn and
Intelligence
reason like humans

Machine Learning
Algorithms with the ability to learn
without being explicitly
programmed

Deep Learning Subset of Machine Learning in which

artificial neural networks adapt and
learn from vast amount of data
Machine Learning
• Machine Learning is the science of getting computers to act
without being explicitly programmed-Andrew Ng

• Machine Learning is the Study of algorithms that

✔ Improve their performance(P)

✔ At some task(T)

✔ With experience(E)-Tom Mitchell

Machine Learning Vs Programming
Machine Learning Types
Supervised Learning
• Given:
-a set of input features X1,….Xn
-A target feature Y
-a set of training examples where the values for the input features
and the target features are given for each example
-a new example, where the values for the input features are given
• Predict the values for the target features for the new example.
-Classification when Y is discrete
-Regression when Y is continuous
Supervised Learning
Unsupervised Learning
• Data with no target attribute. Describe hidden structure from
unlabelled data.
• Explore the data to find some intrinsic structures in them.
• Clustering: the task of grouping a set of objects in such a way that
objects in the same group(called a cluster) are more similar to each
other than to those in other clusters.
• Useful for
Automatically organizing data
Understanding hidden structure in data
Preprocessing for further analysis.
Reinforcement Learning
• Reinforcement-anything that strengthens or increases behaviour or
giving praise
• Reinforcement learning is a sub-branch of ML that trains a model to
return an optimum solution for a problem by taking a sequence of
decision by itself.
• Decision process
• Reward system
• Learn series of actions
Reinforcement Learning Analogy
Machine Learning Applications
• Traffic Alerts
• Image Recognition
• Online supporting Chatbots
• Google Translate
• Online Video streaming applications
• Stock Market Analysis
• Social Media Analysis
Most popular programming languages for
ML
• Python
•R
• C++
• Java
• JavaScript
Python Libraries for ML
Open Source tools for ML Implementation
• Anaconda(Jupyter, Spyder, Orange3,etc.)-Offline Resources

• Google Colab-Online Resources

• Kaggle
UNIT 1
INTRODUCTION
• What is Machine Learning?
• Types of Machine Learning
• Supervised Learning: Regression and Classification
• Machine Learning Process
• Some Terminology
• Testing ML algorithm
• Turning data into probabilities
• Naïve Bayes Classifier
• The brain and Neuron
• Neural Networks
• Perceptron
Types of Machine Learning

Machine Learning

Supervised Unsupervised Reinforcement Evolutionary

Learning Learning Learning Learning
Supervised Learning
• Set of Training data + Target data

• Also known as learning from exemplars

• Usually written with

• Supervised Learning can be implemented in two ways:

✔ Regression-Value of the Output(Continuous one)

✔ Classification-Target Class(Discrete one)

Regression

Find y when
x=0.44
Contd..
-a statistical technique that relates a dependent variable to one or
more independent variables.
-ultimate goal of the regression algorithm is to plot a best-fit line or a
curve between the data.

Regression-fit a mathematical function describing a

curve, so that the curve passes as close as possible to
all of the datapoints. It is generally a problem of
function approximation or interpolation, working out
the value between values that we know.
Classification
• Classification is a supervised machine learning method where
the model tries to predict the correct label of a given input data.
• In classification, the model is fully trained using the training
data, and then it is evaluated on test data before being used to
perform prediction on new unseen data.
• Novelty detection is the process of identifying new or unknown data
or patterns in a dataset that a machine learning system has not been
exposed to during training.
Contd..
• Curse of dimensionality:
• As the dimensionality of the features space
increases, the number configurations can
grow exponentially, and thus the number of
configurations covered by an observation
decreases.

Example: Predicting house price

Evolutionary learning
• Biological evolution can be seen as learning process.

• Biological organisms adapt to improve their survival rates and chance

of having offspring in their environment.

• This solves problems by employing processes that mimic the

behaviours of living things.
Machine Learning Process

Data • Data can be collected from various

Collection and
Preparation sources and prepare for next process

• Is the method of reducing the input variable to your model by

Feature using only the relevant data and getting rid of noise in data
Selection

• Given the data set the choice of an

Algorithm
Choice appropriate algorithm

Parameter
and model • Selecting the best algorithm and model architecture
Selection suited for a particular task

• Given the dataset,algorithm,

Training
and parameters,training
should be simply the use of
computational resources. • The model is evaluated to
Evaluation test if the model is any
good.
Some Terminology
(terminologies, Weight space, curse of
dimensionality)
• Inputs-an input vector is the data given as one input to algorithm(x)

• Weights-weighted connections between node i and j(wij)

• Outputs-Output vector if y

• Targets-The target vector t

• Activation function-g(.)

• Error E
Weight Space
• Since we are using neural network to implement the solution, we
need to find the distance between the input and the neuron. This is
computed by Euclidean distance,

If the neuron is close to the

input then it should fire or else
shouldn’t.
Curse of dimensionality
• If the number of dimensions increases, the volume of the unit
hypersphere does not increases with it.
Testing Machine Learning Algorithm
• Overfitting
• Training, Testing and Validation Sets
• The Confusion Matrix
• Accuracy metrics
• Receiver Operator Characteristic(ROC) Curve
• Unbalanced Datasets
• Measurement Precision
Overfitting
• If we train the machine with too many training data it may led to
overfitting.
Training, Testing and Validation sets
The Confusion Matrix
• Confusion Matrix is a table used in ML and statistics to assess the
performance of a classification model.
• Compare the results of actual output and predicted output.
• Accuracy can be calculated by dividing
the sum of elements on the leading diagonal
by the sum of all of the elements in the matrix.
Accuracy metrics
• Accuracy is one metric for evaluating classification model.
• It is defined as the sum of the number of true positives and true negatives
divided by the total number of examples.

1. True positive: An instance for which both predicted and actual values are
positive.
2. True negative: An instance for which both predicted and actual values are
negative.
3. False Positive: An instance for which predicted value is positive but actual
value is negative.
4. False Negative: An instance for which predicted value is negative but actual
value is positive.
• Confusion Matrix

• There are two complementary pair of measurements that can help us to

interpret the performance of a classifier
• Sensitivity and Specificity
• Precision and Recall
• Sensitivity(TPR) is the ratio of number of correct positive examples to the
number classified as positive.
• Specificity(FPR) is the ratio of number of correct negative examples to the
number classified as negative.
• Precision: ratio of the number of correct positive examples to the number
of actual positive examples
• Recall: ratio of the number of correct positive examples to the number that
are classified as positive.
Precision and recall can be combined to give
a single measure called F1 measure,
The Receiver Operator Characteristic Curve
• ROC is a graph showing the performance of a classification model.
This curve plots two parameters: TPR and FPR. It is computed by Area
Under Curve(AUC).
Unbalanced Datasets
• In some cases the datasets is not balanced one(i.e. Not contain same
number of positive and negative examples).At this time, a more
correct measure is Matthew’s Correlation Coefficient, which is
computed as,
Turning Data into Probabilities
• In ML turning data into probabilities often involves using probabilistic
models or algorithms to make predictions or classifications.
• Now worked out two things from our training data:
• the joint probability and
• The conditional probability

There is a link between the joint probability and conditional probability

By equating the above 2 equations we get Bayes’ rule,

• However, if we notice that any observation Xk has to belong to some
class Ci ,then we can marginalise over the classes to compute.

• Where x is a vector of feature values

Instead of just one feature. This is known
as maximum a posteriori or MAP hypothesis
Naïve Bayes’ Classifier
• It is a supervised learning algorithm, which is based on bayes theorem
and used for solving classification problems.
• The crux of the classifier is based on Bayes theorem
• It describes the probability of an event, based on prior knowledge of
conditions that might be related to the event.
• The classifier calculates the conditional probability of each feature
given a class and then combines these probabilities to determine the
overall probability of the observation belonging to a certain class.
Example
Brain and the Neuron
• In ML, the inspiration drawn from the human brain, especially the
neural networks in the brain, has led to the development of Artificial
neural network(ANN)
• The fundamental building block of both ANN and the human brain is
the neuron
• Neurons receive signals from other neurons through dendrites,
process these signals in the cell body, and then transmit signals to
other neurons through an axon.
• The connections between neurons are known as synapses where
information is transmitted through the release of neurotransmitters.
•Synaptic Plasticity:
•Modifying the strength of synaptic connections
between the neurons and creating new connections
•The strength of connections between neurons can be
adjusted over time in a process is called Synaptic
Plasticity
Hebb’s Rule
• Hebb’s rule says that the changes in the strength of synaptic
connections are proportional to the correlation in the firing of the
two connecting neurons.
• So, if two neurons consistently fire simultaneously, then any
connection between them will change in strength, becoming
stronger.
• However, if the two neurons never fire simultaneously the
connection between them will die away.
• It is also known as long-term potentiation and neural plasticity, and it
does appear to have correlates in real brains.
McCulloch and Pitts Neurons
• Studying neurons isn’t actually easy. You need to be able to extract
the neuron from the brain and then keep it alive so that you can see
how it reacts in controlled circumstances.
• McCulloch and Pitts produced a perfect example of t his when they
modelled a neuron as:
1. A set of weighted inputs wi that correspond to the synapses
2. An adder that sums the input signals (equivalent to the membrane
of the cell that collects electrical charge)
3. An activation function that decides whether the neuron fires for the
current inputs.
Contd..

• A picture of McCulloch and Pitts mathematical model of a neuron.

The inputs are multiplied by the weights and the neurons sum their
values. If the sum is greater then the threshold Ɵ then the neuron
fires; otherwise it does not.
Contd..

o is the activation function if the obtained

value is greater than the threshold value
then the neuron should fire otherwise does
not.
Limitations of McCulloch and Pitts Neuron
Model
• This model uses a binary activation function(output is either 0 or 1).
This binary nature over simplifies the behaviour of real neurons,
which exhibit graded responses and can transmit continuous signals
• The original model does not include weights on connection between
neurons. In real neural networks, the strength of connection plays a
crucial role in information processing
• The neurons are not updated sequentially according to a computer
clock, but update themselves randomly whereas in many of our
models we will update themselves randomly.
Neural Networks
• A neural network is a computational mode inspired by the structure
and function of the human brain’s neural networks.
• It consists of interconnected nodes called neurons.
• Each neuron receives input signals, processes them and then
produces an output signal that may be passed to other neurons.
• These networks are trained using algorithms to recognize patterns
and relationships in data, making them particularly useful for tasks
like classification, regression, clustering and pattern recognition.
Perceptron
• Perceptron is Machine Learning algorithm for supervised learning of
various binary classification tasks.
• This model consists of four main parameters
• Input values
• Weights
• Net Sum
• Activation function
Perceptron Network
Contd..
• Suppose if we are getting the output for the above model is
(0,1,0,0,1) the neuron 2,5 should fire others shouldn’t
• In some cases, the output of each neuron may be incorrect. This may
be corrected in two ways
1. By changing the weight value of input and neuron
2. By multiplying a parameter(ƞ-Learning rate) with weight and input value.
The weight of the network can be changed using the following
equation:

New weight value =Obtained value plus old weight value

Contd..
•In some other cases weight of the value may be correct, if a
neuron is wrong, changing the relevant weight doesn’t do
anything; we need to change the threshold value.
•This is done by multiplying a parameter called learning rate
ƞ with the McCulloch and Pitts Neuron model,

•By using different values for the learning rate tends to make
the network unstable, so that it never settles down. We
therefore use a moderate learning rate, typically 0.1<ƞ<0.4,
depending on how much error we expect in the inputs.
Bias Input
• When we discussed the McCulloch and Pitts neuron, we gave each
neuron a firing threshold Ɵ that determined what value it needed
before it should fire.
• This threshold should be adjustable, so that we can change the value
that the neuron fires at.
• In a network, if all the input value is zero, no matter what is the value
of weights were set.
• Suppose if we set all the threshold value for neuron at zero. Now we
add extra input weight to the neuron, with the value of the input to
that weight always being fixed (usually +-1 )
Contd..
• Usually we will take -1, even when all other inputs are zero. This input
is called a bias node.
The Perceptron Learning Algorithm
• The perceptron algorithm is divided into two parts: a training phase
and a recall phase
• Training phase-
Complexity O(mn)
• Recall phase-
Complexity O(Tmn)
Example of Perceptron Learning: Logic
Functions
Contd..
• Initially assign weights to small random numbers ,
w0=-0.05 w1=-0.02, w2=0.02
So the value reaches to the neuron for(0,0)
-0.05*-1+-0.02*-1+0.02*-1=0.05. So this value is 1, so the neuron fires
and output is 1.It is wrong.
Hence we will apply learning rate to find it out (0,0)
0.2*-1+-0.02*-1+0.02*-1=-0.2
So this value is 0- neuron does
not fires
Output is 0

aiml manual 6th sem
No ratings yet
aiml manual 6th sem
15 pages
Microsoft AI-900 Vfeb-2024 by - Xakinato 110q
No ratings yet
Microsoft AI-900 Vfeb-2024 by - Xakinato 110q
69 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
ML-chap-2
No ratings yet
ML-chap-2
60 pages
Classification
No ratings yet
Classification
53 pages
Unit 1ML
No ratings yet
Unit 1ML
12 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
Unit Ii
No ratings yet
Unit Ii
118 pages
Basics of ML and Evaluation
No ratings yet
Basics of ML and Evaluation
42 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Presentation on ML - Copy
No ratings yet
Presentation on ML - Copy
469 pages
Data in ML
No ratings yet
Data in ML
26 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
32 pages
dbms-10 marks
No ratings yet
dbms-10 marks
32 pages
Unit4_PPT (2)
No ratings yet
Unit4_PPT (2)
126 pages
Machine Learning
No ratings yet
Machine Learning
54 pages
Module 1 ML Mumbai University
No ratings yet
Module 1 ML Mumbai University
47 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
UNIT-I
No ratings yet
UNIT-I
132 pages
Module 2 - ML
No ratings yet
Module 2 - ML
53 pages
0 Machine Learning Overview and Metrics LT
No ratings yet
0 Machine Learning Overview and Metrics LT
84 pages
Machine Learning Updated
No ratings yet
Machine Learning Updated
14 pages
FML - KNN
No ratings yet
FML - KNN
64 pages
ML Fundamentals by Bitspace
No ratings yet
ML Fundamentals by Bitspace
19 pages
Lecture 11
No ratings yet
Lecture 11
18 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Intro To Machine Learning With PyTorch
No ratings yet
Intro To Machine Learning With PyTorch
48 pages
Chapter 7 - LAST
No ratings yet
Chapter 7 - LAST
29 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Lec1 Intoduction
No ratings yet
Lec1 Intoduction
34 pages
Unit I
No ratings yet
Unit I
44 pages
Machine Learning HC
No ratings yet
Machine Learning HC
4 pages
Chapter 4- Machine Learning
No ratings yet
Chapter 4- Machine Learning
81 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Lec 8
No ratings yet
Lec 8
35 pages
Classification
100% (2)
Classification
105 pages
July4 SaketAnand FriendlyIntroToML
No ratings yet
July4 SaketAnand FriendlyIntroToML
84 pages
ML Unit-1
No ratings yet
ML Unit-1
39 pages
Lecture 2
No ratings yet
Lecture 2
26 pages
Machine Learning - Unit - 1
100% (1)
Machine Learning - Unit - 1
58 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
04 Machine Learning Overview
No ratings yet
04 Machine Learning Overview
109 pages
04 Machine Learning Overview
No ratings yet
04 Machine Learning Overview
109 pages
module3_DS_ppt
No ratings yet
module3_DS_ppt
68 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
24 pages
Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
10 pages
Air quality prediction using machine learning
No ratings yet
Air quality prediction using machine learning
29 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
ML Intro Theory
No ratings yet
ML Intro Theory
10 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
ML 2
No ratings yet
ML 2
4 pages
Introduction to ML Unit-1 PPT
No ratings yet
Introduction to ML Unit-1 PPT
90 pages
APS1070 Lecture (3) Slides
No ratings yet
APS1070 Lecture (3) Slides
70 pages
03-Introduction To Machine Learning - DNN
No ratings yet
03-Introduction To Machine Learning - DNN
35 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Ensemble of CNN Models For Identifying Stages of Alzheimers Disease An Approach Using MRI Scans
No ratings yet
Ensemble of CNN Models For Identifying Stages of Alzheimers Disease An Approach Using MRI Scans
5 pages
Research Report
No ratings yet
Research Report
42 pages
Data4800 Report Ai
No ratings yet
Data4800 Report Ai
8 pages
BA4027 Datamining For BI
100% (1)
BA4027 Datamining For BI
67 pages
HACKATHON
No ratings yet
HACKATHON
6 pages
The Linguistics of Sentiment Analysis
No ratings yet
The Linguistics of Sentiment Analysis
34 pages
Efficient Graph-Friendly COCO Metric Computation For Train-Time Model Evaluation
No ratings yet
Efficient Graph-Friendly COCO Metric Computation For Train-Time Model Evaluation
7 pages
Cs8080 Unit3 Text Classification and Clustering
No ratings yet
Cs8080 Unit3 Text Classification and Clustering
171 pages
Proposal Defense v6
No ratings yet
Proposal Defense v6
55 pages
Predicting Travel Insurance Purchases in An Insura
No ratings yet
Predicting Travel Insurance Purchases in An Insura
16 pages
Customer Churn Prediction On E-Commerce Using Machine Learning
No ratings yet
Customer Churn Prediction On E-Commerce Using Machine Learning
8 pages
Course Plan 21CSC307P - Machine Learning For Data Analytics
No ratings yet
Course Plan 21CSC307P - Machine Learning For Data Analytics
13 pages
SR Internship
No ratings yet
SR Internship
25 pages
Trendsin Stock Market
No ratings yet
Trendsin Stock Market
7 pages
Final 011
No ratings yet
Final 011
47 pages
Information Storage, Retrieval, Indexing
No ratings yet
Information Storage, Retrieval, Indexing
20 pages
Yousof Haghshenas 2020
No ratings yet
Yousof Haghshenas 2020
11 pages
Review of Text Classification Methods On Deep Learning
No ratings yet
Review of Text Classification Methods On Deep Learning
13 pages
Unit - 3:: Explain Briefly About Automatic Indexing? Explain About Types of Classes Automatic Indexing?
No ratings yet
Unit - 3:: Explain Briefly About Automatic Indexing? Explain About Types of Classes Automatic Indexing?
28 pages
Heart Disease Prediction Using Machine Learning IJERTV9IS040614
No ratings yet
Heart Disease Prediction Using Machine Learning IJERTV9IS040614
4 pages
Study of Multiclass Classification For Imbalanced Biomedical Data
No ratings yet
Study of Multiclass Classification For Imbalanced Biomedical Data
5 pages
Zoeynull - 1500 - Big Data Analytics - Jisha - Jisha - 1may
No ratings yet
Zoeynull - 1500 - Big Data Analytics - Jisha - Jisha - 1may
5 pages
Semantics Analysis of Agricultural Experts Opinions For Crop Productivity Through Machine Learning
No ratings yet
Semantics Analysis of Agricultural Experts Opinions For Crop Productivity Through Machine Learning
17 pages
paper-4
No ratings yet
paper-4
14 pages
(IJCST-V11I1P5) :jitendra Maan, Harsh Maan
No ratings yet
(IJCST-V11I1P5) :jitendra Maan, Harsh Maan
6 pages
Example of A Qa Project Plan Review Checklist
No ratings yet
Example of A Qa Project Plan Review Checklist
15 pages
Automatic Image Segmentation by Dynamic Region Merging
No ratings yet
Automatic Image Segmentation by Dynamic Region Merging
28 pages
Mini Project Report Format
No ratings yet
Mini Project Report Format
16 pages

Machine - Learning - Unit - 1

Uploaded by

Machine - Learning - Unit - 1

Uploaded by

An introduction to

Kongu Engineering College, Perundurai

Deep Learning Subset of Machine Learning in which

• Machine Learning is the Study of algorithms that

✔ With experience(E)-Tom Mitchell

• Google Colab-Online Resources

Supervised Unsupervised Reinforcement Evolutionary

• Also known as learning from exemplars

• Usually written with

• Supervised Learning can be implemented in two ways:

✔ Classification-Target Class(Discrete one)

Regression-fit a mathematical function describing a

Example: Predicting house price

• Biological organisms adapt to improve their survival rates and chance

• This solves problems by employing processes that mimic the

Data • Data can be collected from various

• Is the method of reducing the input variable to your model by

• Given the data set the choice of an

• Given the dataset,algorithm,

• Weights-weighted connections between node i and j(wij)

• Targets-The target vector t

If the neuron is close to the

• There are two complementary pair of measurements that can help us to

There is a link between the joint probability and conditional probability

By equating the above 2 equations we get Bayes’ rule,

• Where x is a vector of feature values

• A picture of McCulloch and Pitts mathematical model of a neuron.

o is the activation function if the obtained

New weight value =Obtained value plus old weight value

You might also like