Machine Learning

AI aims to create intelligent machines that mimic human behavior through techniques like machine learning. Machine learning uses sample data to automatically learn patterns and make better decisions. There are several types of machine learning including supervised learning which uses labeled training data, unsupervised learning which makes inferences from unlabeled data, and reinforcement learning which learns through rewards and punishments. Machine learning algorithms like neural networks, logistic regression, random forests, and gradient boosted trees are commonly used for classification and regression tasks.

Uploaded by

Daksh

Artificial Intelligence (AI)

AI is a branch of computer science that aims to create intelligent machines that mimic
human capabilities such as knowledge, reasoning, problem-solving, perception, learning,
planning, and the ability to manipulate and move objects.

AI is an area of computer science that emphasizes the creation of intelligent machines
that work and react like humans.

Machine Learning (ML)

Machine learning falls under the umbrella of AI. It gives systems the ability to
automatically learn and improve from experience without being explicitly programmed.

The process of learning begins with observations or data, such as examples, direct
experience, or instruction, in order to look for patterns in data and make better decisions
in the future based on the examples we provide.

The primary aim is to allow the computers to learn automatically without human
intervention or assistance and adjust actions accordingly.

Supervised Learning
Supervised learning is a machine learning task of learning a function that maps an input
to an output based on example input-output pairs. A supervised learning algorithm
analyzes the training data and produces an inferred function, which can be used for
mapping new examples.

In supervised learning, we have labelled training data.

Supervised learning
Supervised learning is the machine learning task of learning a function that maps an input to an output based on…
en.wikipedia.org

Unsupervised Learning
Unsupervised learning is a machine learning task that draws inferences from datasets
consisting of input data without labelled responses. The goal of unsupervised learning is
to model the underlying structure or distribution in the data in order to learn more about
the data.

Clustering and association are some of the unsupervised learning subcategories.

Neural Network or Artificial Neural Network (ANN)

A neural network is a biologically inspired programming paradigm that enables a
computer to learn from observational data. The design of an artificial neural network is
inspired by the biological neural networks of the human brain, leading to a learning
process that, on some tasks, can be more capable than that of standard machine learning
models.

Neural networks, also known as artificial neural networks, consist of an input layer, an
output layer, and one or more hidden layers whose units transform the input into
something that the output layer can use. They perform very well in tasks that require
finding patterns.

Back-propagation
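Back-propagation is the algorithm neural networks use to learn from their mistakes. When
the output does not match the expected result, the error is propagated backwards through
the network, layer by layer, to work out how much each weight contributed to it. The
weights of the hidden and output layers are then adjusted to reduce the error.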
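
As a rough illustration of the idea, the sketch below trains a deliberately tiny network (one tanh hidden unit and one linear output, both assumptions made only for this example) on a single made-up data point. The names `forward`, `backward`, and `loss`, the starting weights, and the learning rate are all hypothetical choices, not part of the original notes.

```python
import math

def forward(x, w1, w2):
    """Tiny network: one tanh hidden unit followed by a linear output."""
    h = math.tanh(w1 * x)   # hidden activation
    y_hat = w2 * h          # network output
    return h, y_hat

def loss(x, y, w1, w2):
    """Half squared error between the network output and the target."""
    _, y_hat = forward(x, w1, w2)
    return 0.5 * (y_hat - y) ** 2

def backward(x, y, w1, w2):
    """Propagate the output error back through the network (chain rule)."""
    h, y_hat = forward(x, w1, w2)
    d_yhat = y_hat - y                 # dL/dy_hat
    d_w2 = d_yhat * h                  # dL/dw2
    d_h = d_yhat * w2                  # error passed back to the hidden unit
    d_w1 = d_h * (1.0 - h ** 2) * x    # tanh'(z) = 1 - tanh(z)^2
    return d_w1, d_w2

# One made-up training example and arbitrary starting weights.
x, y = 1.5, 0.8
w1, w2 = 0.3, 0.4
learning_rate = 0.1

initial_loss = loss(x, y, w1, w2)
for _ in range(200):
    g1, g2 = backward(x, y, w1, w2)
    w1 -= learning_rate * g1   # adjust hidden-layer weight
    w2 -= learning_rate * g2   # adjust output-layer weight
final_loss = loss(x, y, w1, w2)
```

Real networks apply the same chain-rule bookkeeping to many weights at once, usually through a library that computes the gradients automatically.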

Deep Neural Network (DNN) or Deep Learning

Deep learning is a subset of machine learning in which many layers of neurons are
stacked into one large network that maps the input to the output. Successive layers
extract increasingly abstract features until the network can recognize what it is
looking for.

Linear regression
Linear regression is a machine learning algorithm based on supervised learning. It
performs a regression task. Regression models a target prediction value based on
independent variables. It is mostly used for finding out the relationship between variables
and forecasting. One example of a task where linear regression can be used is forecasting
housing prices based on past values.

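The cost function usually minimized in linear regression is the Mean Squared Error (MSE)
between the predicted y value (pred) and the true y value (y). Its square root, the Root
Mean Squared Error (RMSE), is often reported instead because it is in the same units as y.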
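
A minimal sketch of this, assuming a single input feature: the hypothetical helper `fit_line` below fits a straight line by ordinary least squares, and the error between pred and y is then reported as RMSE. The floor areas and prices are made-up numbers used only for illustration.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up data: floor area versus price.
areas = [50.0, 60.0, 80.0, 100.0, 120.0]
prices = [150.0, 180.0, 240.0, 310.0, 350.0]

slope, intercept = fit_line(areas, prices)
pred = [slope * a + intercept for a in areas]

# Root mean squared error between predicted and true values.
error = math.sqrt(sum((p - y) ** 2 for p, y in zip(pred, prices)) / len(prices))
```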

Figure: Linear regression (image by Sewaqu, own work)

Logistic regression
Logistic regression is a supervised machine learning algorithm used for classification
problems: it assigns observations to a discrete set of classes. Examples of classification
problems are deciding whether an email is spam or not spam, and whether an online
transaction is fraudulent or not.

Logistic regression transforms its output using the logistic sigmoid function to return a
probability value.
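
The sketch below shows how the sigmoid turns a weighted sum of features into a probability. The function names, the weights, and the 0.5 decision threshold are assumptions chosen for illustration; in practice the weights would be learned from training data.

```python
import math

def sigmoid(z):
    """Logistic sigmoid, written to avoid overflow for large negative z."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(features, weights, bias):
    """Probability of the positive class for one observation."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)

def predict_class(features, weights, bias, threshold=0.5):
    """Assign class 1 when the probability reaches the threshold, else class 0."""
    return 1 if predict_proba(features, weights, bias) >= threshold else 0
```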

There are two types of logistic regression:

1. Binary
2. Multi-class

K-Nearest Neighbors (K-NN)

The k-nearest neighbors (KNN) algorithm is a simple, easy-to-implement supervised
machine learning algorithm that can be used to solve both classification and regression
problems.

The KNN algorithm assumes that similar things exist in close proximity. In other words,
similar things are near to each other.

KNN can be used in recommendation systems.

KNN works by finding the distances between a query and all the examples in the data,
selecting the specified number of examples (K) closest to the query, and then voting for
the most frequent label (in the case of classification) or averaging the labels (in the
case of regression).
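
The steps above can be sketched as follows. This is a minimal version that assumes Euclidean distance and numeric feature tuples; `knn_predict` and its `task` parameter are hypothetical names introduced for this example.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3, task="classification"):
    """Predict a label for `query` from its k nearest training examples."""
    # Sort the training examples by Euclidean distance to the query, keep the k closest.
    neighbours = sorted(
        zip(train_points, train_labels),
        key=lambda pair: math.dist(pair[0], query),
    )[:k]
    labels = [label for _, label in neighbours]
    if task == "classification":
        return Counter(labels).most_common(1)[0][0]   # most frequent label
    return sum(labels) / len(labels)                  # average label for regression

# Made-up 2-D points in two groups.
points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6), (6, 7)]
labels = ["a", "a", "a", "b", "b", "b"]
nearest_group = knn_predict(points, labels, (1.5, 1.5), k=3)
```

Because every query is compared against all stored examples, this direct approach becomes slow on large datasets.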

Random forest
Random forest is a versatile machine learning technique that can be used for both
regression and classification. It consists of a large number of individual decision
trees that operate as an ensemble. For classification, each decision tree in the random
forest outputs a class prediction and the class with the most votes becomes the model’s
prediction; for regression, the trees' predictions are averaged.
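
A small sketch of the voting step only, not of how the trees are trained. The three lambda "trees" are hand-written stand-ins for trained decision trees, and the feature names (`links`, `exclamations`, `known_sender`) are invented for the example.

```python
from collections import Counter

def forest_classify(trees, sample):
    """Return the class that receives the most votes across all trees."""
    votes = [tree(sample) for tree in trees]
    return Counter(votes).most_common(1)[0][0]

def forest_regress(trees, sample):
    """Return the average of the trees' numeric predictions."""
    predictions = [tree(sample) for tree in trees]
    return sum(predictions) / len(predictions)

# Hypothetical hand-written rules standing in for trained decision trees.
toy_trees = [
    lambda email: "spam" if email["links"] > 3 else "not spam",
    lambda email: "spam" if email["exclamations"] > 5 else "not spam",
    lambda email: "spam" if email["known_sender"] == 0 else "not spam",
]

email = {"links": 5, "exclamations": 1, "known_sender": 0}
prediction = forest_classify(toy_trees, email)   # two of the three trees vote "spam"
```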

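In general, a random forest is less prone to overfitting than a single decision tree,
because averaging many trees reduces variance. It can still overfit, but this is usually
easy to limit, for example by restricting tree depth or requiring more samples per leaf.

Because each tree is trained on a bootstrap sample, the examples left out of that sample
(the out-of-bag examples) can be used to estimate performance, which reduces the need for
a separate validation set.

It makes only a few statistical assumptions. It does not assume that your data is normally
distributed, nor does it assume that the relationships are linear.

It requires very little feature engineering.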

Ensemble learning
Ensemble learning improves machine learning results by combining several models. The
combined model often performs better than any single model on its own.

Ensemble methods are meta-algorithms that combine several machine learning
techniques into one predictive model in order to decrease variance (bagging), decrease
bias (boosting), or improve predictions (stacking).

Examples are random forest, gradient boosted decision trees, and AdaBoost.

Gradient boosted decision trees

Boosting is an ensemble technique in which the predictors are not made independently,
but sequentially, with each new predictor trained to correct the errors of the ones
before it.

It is a method of converting weak learners into strong learners. Gradient boosting is an
example of boosting. It is a machine learning technique for regression and classification
problems, which produces a prediction model in the form of an ensemble of weak
prediction models, typically decision trees.
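
To make the sequential idea concrete, here is a simplified sketch of gradient boosting for regression with squared error. It assumes a single numeric feature with at least two distinct values, uses one-split "stumps" as the weak learners, and the names `fit_stump` and `gradient_boost`, the data, and the hyperparameters are all illustrative choices rather than a reference implementation.

```python
def fit_stump(xs, residuals):
    """Fit a one-split regression tree (stump) to the residuals of a 1-D feature."""
    best = None
    for threshold in sorted(set(xs))[:-1]:   # every split leaves both sides non-empty
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean

def gradient_boost(xs, ys, n_rounds=20, learning_rate=0.5):
    """Build an additive model: each stump is fitted to the current residuals."""
    base = sum(ys) / len(ys)                 # start from the mean prediction
    stumps = []
    predictions = [base] * len(xs)
    for _ in range(n_rounds):
        # For squared error, the negative gradient is simply the residual.
        residuals = [y - p for y, p in zip(ys, predictions)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        predictions = [p + learning_rate * stump(x) for x, p in zip(xs, predictions)]

    def model(x):
        return base + learning_rate * sum(s(x) for s in stumps)
    return model

# Made-up data with a jump between x = 4 and x = 5.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [1.0, 1.2, 0.9, 1.1, 3.0, 3.2, 2.9, 3.1]
model = gradient_boost(xs, ys)

mean_y = sum(ys) / len(ys)
baseline_mse = sum((y - mean_y) ** 2 for y in ys) / len(ys)
boosted_mse = sum((y - model(x)) ** 2 for x, y in zip(xs, ys)) / len(ys)
```

Each stump alone is a weak predictor; the ensemble becomes strong because every round targets what the previous rounds got wrong.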

Overfitting
Overfitting happens when a model fits the training data too closely.

Overfitting happens when a model learns the detail and noise in the training data to the
extent that it negatively impacts the performance of the model on new data. It negatively
impacts the model's ability to generalize.

It can be prevented by:

1. Cross-validation

2. Regularization

Underfitting
Underfitting refers to a model that can neither model the training data nor generalize to
new data. It will have poor performance on the training data.

Regularization
Regularization is a technique for modifying machine learning models to avoid the problem
of overfitting. It can be applied to most machine learning models. Regularization
simplifies overly complex models that are prone to overfitting by adding a penalty term to
the objective function. If a model is overfitted, it will have problems generalizing and
thus will give inaccurate predictions when it is exposed to new data sets.

L1 vs L2 regularization
A regression model that uses the L1 regularization technique is called Lasso Regression. A
model which uses the L2 regularization technique is called Ridge Regression.

The key difference between the two is the penalty term which is added to the loss
function.

Ridge regression adds the “squared magnitude” of the coefficients as a penalty term to the
loss function. Lasso regression (Least Absolute Shrinkage and Selection Operator) adds the
“absolute value of magnitude” of the coefficients as a penalty term to the loss function.
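
The two penalty terms can be written out directly. In this sketch the base loss is assumed to be mean squared error, and `lam` is a hypothetical name for the regularization strength that controls how heavily the coefficients are penalized.

```python
def mean_squared_error(y_true, y_pred):
    """Unregularized loss: average squared difference."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def l1_penalty(weights):
    """Sum of absolute coefficient values (used by lasso)."""
    return sum(abs(w) for w in weights)

def l2_penalty(weights):
    """Sum of squared coefficient values (used by ridge)."""
    return sum(w * w for w in weights)

def lasso_objective(y_true, y_pred, weights, lam):
    """Loss plus L1 penalty."""
    return mean_squared_error(y_true, y_pred) + lam * l1_penalty(weights)

def ridge_objective(y_true, y_pred, weights, lam):
    """Loss plus L2 penalty."""
    return mean_squared_error(y_true, y_pred) + lam * l2_penalty(weights)
```

For the same coefficients, the L2 penalty grows much faster for large values, while the L1 penalty treats every unit of magnitude equally, which is why lasso tends to push some coefficients to exactly zero.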

Cross-validation
Cross-validation is a technique for evaluating machine learning models by training
several ML models on subsets of the available input data and evaluating them on the
complementary subset of the data. It is used to detect overfitting, because it shows how
well the model performs on data it was not trained on.

Different types of cross-validation techniques are:

1. Holdout method

2. K-fold (most popular)

3. Leave-P-out
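
As an illustration of the K-fold method, the sketch below splits sample indices into k folds and yields one train/validation pair per fold. `k_fold_indices` is a hypothetical helper; it assumes the data has already been shuffled and does no shuffling itself.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size

# Each sample is used for validation exactly once and for training k - 1 times.
folds = list(k_fold_indices(10, 3))
```

A model would be trained and scored once per fold, and the k scores averaged to give the cross-validated estimate.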

Performance metrics for regression

Mean Absolute Error (MAE): measures the average of the absolute differences
between actual and predicted values.

Root Mean Squared Error (RMSE): measures the square root of the average of the
squared differences between the actual and the predicted values.
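
Both metrics are short enough to write out by hand. The function names and the four made-up value pairs below are only for illustration.

```python
import math

def mean_absolute_error(y_true, y_pred):
    """Average of the absolute differences."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def root_mean_squared_error(y_true, y_pred):
    """Square root of the average of the squared differences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

# Made-up values: the errors are 1, 0, 2 and 1.
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 8.0]
mae = mean_absolute_error(y_true, y_pred)        # (1 + 0 + 2 + 1) / 4 = 1.0
rmse = root_mean_squared_error(y_true, y_pred)   # sqrt((1 + 0 + 4 + 1) / 4) = sqrt(1.5)
```

RMSE is larger than MAE here because squaring gives the single error of 2 more weight than the smaller errors.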

Performance metrics for classification problems

Confusion matrix: It is one of the most intuitive and easiest tools for assessing
the correctness and accuracy of a model. It is used for classification problems where the
output can be of two or more classes.

Confusion Matrix (Source)

True Positives (TP): are the cases when the actual class of the data point was 1 (True)
and the predicted is also 1 (True).

True Negatives (TN): are the cases when the actual class of the data point was 0 (False)
and the predicted is also 0 (False).

False Positives (FP): are the cases when the actual class of the data point was 0 (False)
and the predicted is 1 (True). False because the model has predicted incorrectly and
positive because the class predicted was a positive one (1).

False Negatives (FN): are the cases when the actual class of the data point was 1 (True)
and the predicted is 0 (False). False because the model has predicted incorrectly and
negative because the class predicted was a negative one (0).
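
The four counts can be tallied with a single pass over the labels. This sketch assumes binary labels encoded as 1 (True) and 0 (False); `confusion_counts` and the example labels are hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP and FN for binary labels encoded as 1 and 0."""
    counts = {"TP": 0, "TN": 0, "FP": 0, "FN": 0}
    for actual, predicted in zip(y_true, y_pred):
        if actual == 1 and predicted == 1:
            counts["TP"] += 1
        elif actual == 0 and predicted == 0:
            counts["TN"] += 1
        elif actual == 0 and predicted == 1:
            counts["FP"] += 1
        else:                       # actual == 1 and predicted == 0
            counts["FN"] += 1
    return counts

# Made-up labels for eight data points.
actual_labels = [1, 0, 1, 1, 0, 0, 1, 0]
predicted_labels = [1, 0, 0, 1, 0, 1, 1, 0]
counts = confusion_counts(actual_labels, predicted_labels)
```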

Accuracy: Accuracy in classification problems is the number of correct predictions made
by the model divided by the total number of predictions made.

Accuracy in the confusion matrix (Source)

When to use accuracy: accuracy is a good measure when the target variable classes in
the data are nearly balanced.

When not to use accuracy: accuracy is misleading when one class makes up the large
majority of the data, because a model that always predicts the majority class still
scores highly.

Precision: Precision is a measure that tells us what proportion of the values predicted
as True are actually True, that is, TP / (TP + FP).

Recall or sensitivity: Recall is a measure that tells us what proportion of the values
that are actually True were predicted as True by the model, that is, TP / (TP + FN).

F1 score: Combines precision and recall into a single number, their harmonic mean.
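
Given the four confusion-matrix counts, these metrics reduce to a few divisions. In this sketch, returning 0.0 when a denominator is zero is an assumed convention, and the counts in the example are made up.

```python
def accuracy(tp, tn, fp, fn):
    """Correct predictions over all predictions."""
    return (tp + tn) / (tp + tn + fp + fn)

def precision(tp, fp):
    """Share of predicted positives that are actually positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(tp, fn):
    """Share of actual positives that the model found."""
    return tp / (tp + fn) if (tp + fn) else 0.0

def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall."""
    p, r = precision(tp, fp), recall(tp, fn)
    return 2 * p * r / (p + r) if (p + r) else 0.0

# Made-up counts for an imbalanced problem: 12 actual positives out of 100.
tp, tn, fp, fn = 8, 86, 2, 4
acc = accuracy(tp, tn, fp, fn)   # 0.94
prec = precision(tp, fp)         # 0.8
rec = recall(tp, fn)             # 8 / 12
f1 = f1_score(tp, fp, fn)        # 16 / 22
```

The example shows why accuracy alone can flatter a model on imbalanced data: accuracy is 0.94 even though a third of the actual positives were missed.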

F1 Score (Source)

Receiver Operating Characteristic (ROC) curve: An ROC curve is a graph showing
the performance of a classification model at all classification thresholds.

The curve plots two parameters:

1. True Positive Rate (recall, or sensitivity)

2. False Positive Rate (1 - specificity)

ROC Curve (Source)

AUC (Area Under the ROC Curve): AUC measures the entire two-dimensional area
underneath the entire ROC curve.

It provides an aggregate measure of the performance across all possible classification
thresholds.
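
A minimal sketch of how the curve and its area can be computed by hand, assuming binary labels (1 and 0) with at least one example of each class. `roc_points` and `auc` are hypothetical helper names, the scores are made up, and the area is approximated with the trapezoidal rule.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) points obtained by using every distinct score as a threshold."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points

def auc(points):
    """Area under the curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area

# Made-up labels and predicted scores.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
curve = roc_points(y_true, scores)
area = auc(curve)
```

An area of 1.0 means the positives are always scored above the negatives, while 0.5 corresponds to random ranking.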

Area under ROC curve (Source)

The topics discussed above are the basics of machine learning. We discussed basic
terms such as AI, machine learning and deep learning; the main types of machine
learning, supervised and unsupervised; some machine learning algorithms such
as linear regression, logistic regression, k-NN, and random forest; and performance
evaluation metrics for different algorithms.
