Machine Learning Algorithms

The document provides an overview of machine learning algorithms, including supervised, unsupervised, and reinforcement learning, along with their applications and challenges. It details supervised learning techniques such as regression and classification, and unsupervised learning methods like clustering, specifically K-means clustering. Key concepts such as overfitting, bias, and the importance of data quality are also discussed.

Uploaded by

sanjudxbreddy

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Machine Learning Algorithms

Uploaded by

sanjudxbreddy

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

MACHINE LEARNING ALGORITHMS

MACHINE LEARNING
ML: improve automatically through experience and by the use of data. ML
algorithms include decision trees, neural networks. Trained models serve as
representations of the learned data
Applications: Netflix, speech recognition, medical diagnosis, and autonomous
vehicles, Chatbots, personalized ads, and fraud detection systems.
Problems: Overfitting, where models become too specialized on training data,
lead to poor performance on new data. Bias in training data causes boxes.

TYPES OF MACHINE LEARNING

Supervised learning: Labelled data. Learns to map input to output labels based
on examples in training. Eg: linear regression, decision trees etc.
Unsupervised learning: unlabelled data. Finds hidden patterns or structures.
Eg: k means clustering, clustering etc.
Reinforcement learning: trial and error method. Reward system and penalty
system.

SUPERVISED LEARNING: allows machines to learn from labeled data, making

predictions or decisions based on that learning.
1. Regression – works with continuous data
2. Classification – works with discrete data

REGRESSION CORRELATION:
measure of the strength of a linear relationship between two quantitative
variables (e.g. price, sales). If the change in one variable appears to be
accompanied by a change in the other variable the two variables are said to be
correlated and this is called correlation.
Causation: one event is the result of the occurrence of the other event.
Pearson’s R: measures the strength and direction of the linear relationship
between two continuous variables. Pearson correlation coefficient is 0.35
1. Scale of measurement should be interval or ratio.
2. Variables should be approximately normally distributed.
3. The association should be linear.
4. There should be no outliers in the data.
May not be suitable in situations like: No correlation, outliners, non linear
relationships and violation of assumptions.
When we make a distribution in which there is an involvement of more than one
variable, then such an analysis is called Regression Analysis. Depends on
regression line or curve.
The least squares method is commonly employed to find this best-fit line or
curve. This method minimizes the squared differences between observed and
predicted values

Linear Regression: consists of a predictor variable and a dependent variable

related linearly to each other
a) Simple Linear Regression: Value is predicted using a single independent
variable in simple linear regression.
b) Multiple Linear Regression: More than one independent variable is used to
predict the value of the dependent variable

Applications: market analysis, sales forecasting, prediction salary, sports and

med research.
Advantages: simple, easy, efficient to train
Disadvantages: sensitive to outliners, which impacts analysis. Limited to linear
relations btw variables.

CLASSIFITCATION: categorizing data into predefined classes or categories.

Assign labels based on features.
Working:
 classes or categories
 features/attributes
 training data
 classification model
 prediction
Types
1) Binary Classification: with 2 class labels. Email spam, exam result
2) Multi-Class Classification: more than 2 class labels. Img classification
3) Multi-Label Classification: each example may belong to multiple class labels.
Photo classification
4) Imbalanced Classification: unequally distributed class, like majority and
minority. Fraud det

KNN (k nearest neighbour algorithm): operates based on the principle of

proximity, making predictions or classifications by considering the similarity
between data points.
Need: useful with classification problems where the decision boundaries are not
clearly defined or when the dataset does not have a well-defined structure.
Provides a simple yet effective method for identifying the category.
UNSUPERVISED LEARNING:
CLUSTERING: group unlabelled dataset into clusters or groups based on
similarity. It is unsupervised learning. The clustering technique is commonly
used for statistical data analysis.
How it works:
1) Prepare the Data: Select the right features for clustering
2) Create Similarity Metrics: Define how similar data points are by comparing
their features.
3) Run the Clustering Algorithm: Apply a clustering algorithm to group the data.
4) Interpret the Results: Analyse the clusters to understand what they
represent.
Types:
 Partitioning Clustering: divides the data into non-hierarchical groups. It
is also known as the centroid based method. Eg: k means clustering.
 Density Based clustering: connects the highly-dense areas into clusters,
and the arbitrarily shaped distributions are formed as long as the dense
region can be connected.
 Distribution model based: data is divided based on the probability of how
a dataset belongs to a particular distribution. Also called gaussian
distribution. Eg: GMM
 Hierarchical Clustering: the dataset is divided into clusters to create a
tree-like structure, which is also called a dendrogram.
________________________________________________________________
K MEANS CLIUSTERING: unsupervised learning algorithm that is used to solve
the clustering problems in machine learning. Classifies the dataset by dividing
the samples into different clusters of equal variances.
Applications: Market segmentation, Image segmentation, document clustering
and customer segmentation.
Advantages: easy to implement, handles large datasets, easy to understand,
works well w various features.
Limitations: results vary on centroid placement, no of clusters must be known
beforehand, outliners distort clusters.

Unit 6
No ratings yet
Unit 6
22 pages
Evolutional Study On KNN and K-Means Algorithms (SP)
No ratings yet
Evolutional Study On KNN and K-Means Algorithms (SP)
9 pages
Introduction To Basics of Machine Learning Algorithms: Pankaj Oli
100% (1)
Introduction To Basics of Machine Learning Algorithms: Pankaj Oli
13 pages
ML - Machine Learning PDF
No ratings yet
ML - Machine Learning PDF
13 pages
Algorithms 1
No ratings yet
Algorithms 1
23 pages
Module 3 (1)
No ratings yet
Module 3 (1)
63 pages
Machine Learning Theory
100% (1)
Machine Learning Theory
12 pages
Machine_Learning
No ratings yet
Machine_Learning
35 pages
ML notes
No ratings yet
ML notes
10 pages
Unit 1
No ratings yet
Unit 1
15 pages
Colloquium Evaluation: Faculty of Computer Science and Engineering To:Kanika Gupta Ma'Am Bhavya Sethi 16csu082
No ratings yet
Colloquium Evaluation: Faculty of Computer Science and Engineering To:Kanika Gupta Ma'Am Bhavya Sethi 16csu082
12 pages
Machine Learning File
No ratings yet
Machine Learning File
7 pages
Unit 4 Supervised Learning
100% (1)
Unit 4 Supervised Learning
75 pages
Machine Learning Clustering AlgorithmsI
No ratings yet
Machine Learning Clustering AlgorithmsI
129 pages
Machine Learning
No ratings yet
Machine Learning
22 pages
Unit 3 big data
No ratings yet
Unit 3 big data
50 pages
Classification
No ratings yet
Classification
50 pages
Fulldoc - Dsec Mca - Crime Prediction (1) - 051521
No ratings yet
Fulldoc - Dsec Mca - Crime Prediction (1) - 051521
65 pages
Machine Learning
No ratings yet
Machine Learning
15 pages
Unit - 2 ML notes
No ratings yet
Unit - 2 ML notes
14 pages
FAM_QUESTION_BANK_CT[1]
No ratings yet
FAM_QUESTION_BANK_CT[1]
14 pages
MACHINE LEARNING
No ratings yet
MACHINE LEARNING
5 pages
Machine Learning QNA
No ratings yet
Machine Learning QNA
1 page
Machine Learning Notes
No ratings yet
Machine Learning Notes
17 pages
3.popular Machine Learning Algorithm
No ratings yet
3.popular Machine Learning Algorithm
11 pages
Unit V - Big Data Programming
No ratings yet
Unit V - Big Data Programming
22 pages
DOC-20241106-WA0007
No ratings yet
DOC-20241106-WA0007
48 pages
UNIT-2 Material
No ratings yet
UNIT-2 Material
71 pages
Classifying in Machine Learning
No ratings yet
Classifying in Machine Learning
26 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
(KtabPDF Com) xrwA7TEBGp
No ratings yet
(KtabPDF Com) xrwA7TEBGp
32 pages
Chapter Four
No ratings yet
Chapter Four
75 pages
2nd Unit NN Final Class Notes (1)
No ratings yet
2nd Unit NN Final Class Notes (1)
50 pages
Lect8 IoT BigDataAnalyticsTechniques
No ratings yet
Lect8 IoT BigDataAnalyticsTechniques
20 pages
Chapter - 4
No ratings yet
Chapter - 4
14 pages
Unit 4
No ratings yet
Unit 4
23 pages
Module 1 & 2
No ratings yet
Module 1 & 2
21 pages
MLT Unit 1
No ratings yet
MLT Unit 1
15 pages
UNIT1
No ratings yet
UNIT1
38 pages
machine learning
No ratings yet
machine learning
37 pages
New Classification and Regression Models
No ratings yet
New Classification and Regression Models
7 pages
Unit 4 Introduction to Algorithm
No ratings yet
Unit 4 Introduction to Algorithm
10 pages
Untitled Document 15
No ratings yet
Untitled Document 15
7 pages
Module 1 ML Mumbai University
No ratings yet
Module 1 ML Mumbai University
47 pages
Machine Learning in A Nutshell
No ratings yet
Machine Learning in A Nutshell
36 pages
Ch4
No ratings yet
Ch4
8 pages
ML Algorithms
No ratings yet
ML Algorithms
12 pages
Machine Learning (Part 1) : Iykra Data Fellowship Batch 3
No ratings yet
Machine Learning (Part 1) : Iykra Data Fellowship Batch 3
28 pages
BUSINESS ANALYTICS Assignment
No ratings yet
BUSINESS ANALYTICS Assignment
14 pages
COMP1801 - Copy 1
No ratings yet
COMP1801 - Copy 1
18 pages
Machine Learning Concepts
No ratings yet
Machine Learning Concepts
68 pages
APznzab0G8iLD5cDfn798Gn-fXshRpam8ullbf6ZS5Hd4l0BEcKNHy9gDG24DS66RfgvnKXAQjMAivMmmi5cmDWF9tqOaPMy3afuzafCU1kpG1xfQIr7b98q406ZWiqt50nL8WhMI6azoYzWSgf7c7khnqww3VlQ9I90ROmc0QL4DbmipYYoLleGYR6TO4UYmc_PsaQB5v0XmLUwPEub3QuwGdUnUEr2dp_hV4bds0MuRbpJ
No ratings yet
APznzab0G8iLD5cDfn798Gn-fXshRpam8ullbf6ZS5Hd4l0BEcKNHy9gDG24DS66RfgvnKXAQjMAivMmmi5cmDWF9tqOaPMy3afuzafCU1kpG1xfQIr7b98q406ZWiqt50nL8WhMI6azoYzWSgf7c7khnqww3VlQ9I90ROmc0QL4DbmipYYoLleGYR6TO4UYmc_PsaQB5v0XmLUwPEub3QuwGdUnUEr2dp_hV4bds0MuRbpJ
34 pages
Business Analytics MGN801-CA2 KAJAL (11917586) Section - Q1959
No ratings yet
Business Analytics MGN801-CA2 KAJAL (11917586) Section - Q1959
14 pages
Machine Learning
No ratings yet
Machine Learning
33 pages
AIML Unit-IV & V
100% (1)
AIML Unit-IV & V
47 pages
overview_basics
No ratings yet
overview_basics
16 pages
Classification
No ratings yet
Classification
7 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
1.1. Holding Risky Financial Assets and Subjective Wellbeing Empirical Evidence From China
No ratings yet
1.1. Holding Risky Financial Assets and Subjective Wellbeing Empirical Evidence From China
14 pages
Amiblu Stream Magazine November19
No ratings yet
Amiblu Stream Magazine November19
18 pages
Vif Procedure
No ratings yet
Vif Procedure
4 pages
Mathematical Modeling of Drying Characteristics of Tropical Fruits
No ratings yet
Mathematical Modeling of Drying Characteristics of Tropical Fruits
6 pages
778-Article Text-2137-1-10-20240630
No ratings yet
778-Article Text-2137-1-10-20240630
12 pages
Fevo 10 1063736
No ratings yet
Fevo 10 1063736
11 pages
Logistic Regression Essentials in R - Articles - STHDA
No ratings yet
Logistic Regression Essentials in R - Articles - STHDA
10 pages
Transportation Planning Process: CE - 751, SLD, Class Notes, Fall 2006, IIT Bombay
No ratings yet
Transportation Planning Process: CE - 751, SLD, Class Notes, Fall 2006, IIT Bombay
38 pages
The Ultimate Guide To Data Cleaning
No ratings yet
The Ultimate Guide To Data Cleaning
18 pages
Computational Methods For Mixed Models
No ratings yet
Computational Methods For Mixed Models
21 pages
Non Invasive Blood Glucose Monitoring
No ratings yet
Non Invasive Blood Glucose Monitoring
3 pages
CH 3 & 4 Practice Test Resit Version
No ratings yet
CH 3 & 4 Practice Test Resit Version
6 pages
Probability+&+Statistics Formulas
No ratings yet
Probability+&+Statistics Formulas
47 pages
Amidst The Online Education The Healthy Lifestyle and Its Influence On The Psychological Well-Being of Filipino Tertiary Students
No ratings yet
Amidst The Online Education The Healthy Lifestyle and Its Influence On The Psychological Well-Being of Filipino Tertiary Students
11 pages
Do UN Interventions Cause Peace Using Matching To
No ratings yet
Do UN Interventions Cause Peace Using Matching To
43 pages
Statistical Modeling for Biomedical Researchers A Simple Introduction to the Analysis of Complex Data 2nd Edition William D. Dupont 2024 Scribd Download
100% (7)
Statistical Modeling for Biomedical Researchers A Simple Introduction to the Analysis of Complex Data 2nd Edition William D. Dupont 2024 Scribd Download
40 pages
Eva Output
No ratings yet
Eva Output
24 pages
Midterm 2008s Solution
No ratings yet
Midterm 2008s Solution
12 pages
Why More Intelligent Individuals Like Classical Music
No ratings yet
Why More Intelligent Individuals Like Classical Music
12 pages
Statistics and Epidemiology Courses Studied
No ratings yet
Statistics and Epidemiology Courses Studied
2 pages
Investment Management Assignment
No ratings yet
Investment Management Assignment
7 pages
Challenges For Small and Micro Enterprises in Accessing Finance (Case of Wolaita Soddo Town)
No ratings yet
Challenges For Small and Micro Enterprises in Accessing Finance (Case of Wolaita Soddo Town)
10 pages
07 Hogg Fuerstenau
No ratings yet
07 Hogg Fuerstenau
8 pages
Theory and Problems For The Final Exam - 1
No ratings yet
Theory and Problems For The Final Exam - 1
3 pages
MATH 231-Statistics-Dr. Hanif Mian
No ratings yet
MATH 231-Statistics-Dr. Hanif Mian
3 pages
Variable Selection 8.1 The Model Building Problem
No ratings yet
Variable Selection 8.1 The Model Building Problem
18 pages
Quantile Regression Models and Their Applications A Review 2155 6180 1000354
No ratings yet
Quantile Regression Models and Their Applications A Review 2155 6180 1000354
6 pages
DNN Merged Sugata
No ratings yet
DNN Merged Sugata
243 pages
3 Residual Analysis
No ratings yet
3 Residual Analysis
5 pages
Xu Et Al. - 2010-Information Seeking in An Information Systems Project Team
No ratings yet
Xu Et Al. - 2010-Information Seeking in An Information Systems Project Team
12 pages

Machine Learning Algorithms

Uploaded by

Machine Learning Algorithms

Uploaded by

MACHINE LEARNING ALGORITHMS

TYPES OF MACHINE LEARNING

SUPERVISED LEARNING: allows machines to learn from labeled data, making

Linear Regression: consists of a predictor variable and a dependent variable

Applications: market analysis, sales forecasting, prediction salary, sports and

CLASSIFITCATION: categorizing data into predefined classes or categories.

KNN (k nearest neighbour algorithm): operates based on the principle of

You might also like