ICT202B AI ML and Emerging Technologies UNIT 3 (Classification and Regression) 2

The document outlines a course on AI, ML, and emerging technologies, focusing on classification and regression techniques. It covers various machine learning algorithms, including K-Nearest Neighbors, Naive Bayes, and Support Vector Machines, along with regression analysis methods like linear and polynomial regression. The course aims to equip students with skills in data analysis, programming in Python, and applying machine learning to biological data.


ICT202B: AI, ML and Emerging technologies

UNIT 3 Classification & Regression


AI, ML and Emerging technologies
Course Outcomes (Credits: 02)

CO1 :: Remember the concepts of data analysis in genomics and proteomics

CO2 :: Discuss the advanced topics of Python language used for programming

CO3 :: Apply neural networks for medical diagnosis by medical image classification

CO4 :: Analyze clustering of unlabeled data used for training

CO5 :: Evaluate the concepts of machine learning

CO6 :: Validate the application of machine learning for biological data analysis
UNIT I
Data & Feature Engineering: Data vs. information; types of data: numerical data (discrete and continuous), categorical data (ordinal and nominal), time-series data, unstructured data; data labelling; what a feature is; importance of feature selection; feature selection algorithms: sequential forward selection, sequential backward selection, bidirectional feature selection; feature extraction

UNIT II

Advanced Python packages: Introduction to NumPy; creation and accessing of nD arrays; operations on nD arrays; introduction to pandas; DataFrame; reading CSV/Excel data; dimensionality reduction, PCA and LDA; visualization using Matplotlib: line plot, subplots, scatter plot, bar graph, histogram, pie chart

UNIT III

Classification & Regression : Introduction to classification, KNN, Decision Tree, Naive Bayes classifier, Support
Vector Machine classifier, classification on a given dataset, Introduction to regression, linear regression,
Polynomial regression, regression on a given dataset
Machine Learning
Introduction to classification
The K-Nearest Neighbors (K-NN) algorithm is a versatile and widely used
machine learning algorithm, valued primarily for its simplicity and ease of
implementation.

• It does not require any assumptions about the underlying data distribution.
• It can also handle both numerical and categorical data, making it a flexible
choice for various types of datasets in classification and regression tasks.
• It is a non-parametric method that makes predictions based on the
similarity of data points in a given dataset.
• With a suitably large value of K, K-NN is relatively robust to noisy training points, since each prediction is a vote over several neighbours rather than a single point.
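The K-NN idea above can be sketched from scratch with NumPy on a hypothetical toy dataset (in practice, a library classifier such as scikit-learn's KNeighborsClassifier would usually be used):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    """Predict the class of x_new by majority vote among its k nearest neighbours."""
    # Euclidean distance from x_new to every training point
    dists = np.linalg.norm(X_train - x_new, axis=1)
    # Indices of the k smallest distances
    nearest = np.argsort(dists)[:k]
    # Majority vote over the neighbours' labels
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Toy 2-D dataset: class 0 clustered near the origin, class 1 near (5, 5)
X = np.array([[0, 0], [1, 0], [0, 1], [5, 5], [6, 5], [5, 6]])
y = np.array([0, 0, 0, 1, 1, 1])

print(knn_predict(X, y, np.array([0.5, 0.5])))  # → 0
print(knn_predict(X, y, np.array([5.5, 5.5])))  # → 1
```

Note that K-NN is non-parametric: nothing is "trained" here; the whole dataset is consulted at prediction time.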
Introduction to classification
Naive Bayes classifier

Naive Bayes classifiers are a family of classification algorithms based on
Bayes' Theorem. It is not a single algorithm but a family of algorithms that
share a common principle: every pair of features being classified is assumed
to be independent of each other.

Despite this "naive" assumption of feature independence, these classifiers
are widely used for their simplicity and efficiency in machine learning.
Basically, we are trying to find the probability of event A, given that event B is true. Event B is also termed the evidence. Bayes' Theorem states:

P(A|B) = P(B|A) · P(A) / P(B)

• P(A) is the prior probability of A (the probability of the event before the evidence is seen). The evidence is an attribute value of an unknown instance (here, event B).
• P(B) is the marginal probability: the probability of the evidence.
• P(A|B) is the posterior probability of A, i.e. the probability of the event after the evidence is seen.
• P(B|A) is the likelihood: the probability of observing the evidence given that hypothesis A is true.
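The terms above can be made concrete with a small numeric sketch in plain Python. The scenario and all the probabilities below are hypothetical, chosen only to illustrate the calculation:

```python
# Bayes' Theorem: P(A|B) = P(B|A) * P(A) / P(B)
# Hypothetical example: A = "email is spam", B = "email contains the word 'offer'"

p_a = 0.2               # prior: 20% of emails are spam
p_b_given_a = 0.6       # likelihood: 60% of spam emails contain 'offer'
p_b_given_not_a = 0.05  # 5% of non-spam emails contain 'offer'

# Marginal probability of the evidence, by the law of total probability
p_b = p_b_given_a * p_a + p_b_given_not_a * (1 - p_a)

# Posterior: probability an email is spam given that it contains 'offer'
p_a_given_b = p_b_given_a * p_a / p_b
print(round(p_a_given_b, 3))  # → 0.75
```

Seeing the evidence raises the probability of spam from the prior of 0.2 to a posterior of 0.75; a Naive Bayes classifier repeats this update once per feature, multiplying the likelihoods under the independence assumption.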
Support Vector Machines (SVM)

• In SVM, the boundary used to separate the classes is
referred to as a hyperplane. The data points on either side
of the hyperplane that lie closest to it are called
support vectors; these are the points that determine
where the boundary is drawn.
• In SVM classification, the data can be either linearly or
non-linearly separable. Different kernels can be set in
an SVM classifier. For a linearly separable dataset, we can
set the kernel to 'linear'.
• For a non-linear dataset, two common kernels are
'rbf' and 'polynomial'. These map the data to a higher
dimension, where it is easier to draw a separating
hyperplane; the resulting boundary is then projected back
down to the original dimension.
Consider a dataset with two classes of shapes, rectangles and circles. When it is difficult to draw a separating
line in the 2D plane, we map the data points to a higher dimension (a 3D space) and draw the hyperplane there.
The resulting decision boundary is then projected back down to the original plane.
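The kernel choice described above can be sketched as follows, assuming scikit-learn is available (the toy dataset is hypothetical):

```python
import numpy as np
from sklearn.svm import SVC

# Toy linearly separable dataset: class 0 near the origin, class 1 near (4, 4)
X = np.array([[0, 0], [1, 0], [0, 1], [4, 4], [5, 4], [4, 5]], dtype=float)
y = np.array([0, 0, 0, 1, 1, 1])

# Linear kernel, since this dataset is linearly separable;
# for non-linear data we would set kernel='rbf' or kernel='poly' instead.
clf = SVC(kernel='linear')
clf.fit(X, y)

# The support vectors are the training points closest to the hyperplane
print(clf.support_vectors_)
print(clf.predict([[0.5, 0.5], [4.5, 4.5]]))  # → [0 1]
```

Only the support vectors influence the fitted boundary; removing any other training point would leave the hyperplane unchanged.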
Introduction to regression

Regression is a statistical method that tries to determine the strength and
character of the relationship between one dependent variable and a
series of other (independent) variables.

or

Regression analysis is a set of statistical operations for estimating the
relationship between an independent variable (X) and a dependent
variable (Y).
A regression line is defined as a
statistical concept that describes and
predicts the relationship between two or
more variables: a straight line that
reflects the best-fit relationship
between the independent and dependent
variables in a dataset.

The equation of a simple linear regression line is given by:

Y = a + bX + ε

Here,
• Y is the dependent variable
• X is the independent variable
• a is the y-intercept, which represents the value of Y when X is 0
• b is the slope, which represents the change in Y for a unit change in X
• ε is the residual error
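Fitting the line Y = a + bX can be sketched with NumPy's polyfit on hypothetical noise-free data, so the fit recovers the generating coefficients exactly:

```python
import numpy as np

# Toy data generated from Y = 2 + 3X (a hypothetical example, no noise),
# so the fit should recover intercept a = 2 and slope b = 3.
X = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
Y = 2 + 3 * X

# np.polyfit returns coefficients highest degree first: [b, a]
b, a = np.polyfit(X, Y, deg=1)
print(a, b)  # intercept ≈ 2, slope ≈ 3
```

With real data the points would not lie exactly on the line, and the fit minimises the sum of squared residuals ε instead of reproducing the coefficients exactly.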
Polynomial regression

Polynomial regression is a statistical and machine learning technique that
models the relationship between variables using higher-degree
polynomials: it fits the data using higher-degree functions of the
independent variable, such as squares and cubes.
When to use it
Polynomial regression is useful when there is a non-linear relationship between the
variables, like when predicting how many likes a social media post will get over time.

Model complexity
As the degree of the model increases, its fit to the training data may improve, but so does
the risk of over-fitting; too low a degree, by contrast, risks under-fitting the data.

Model selection
There are two approaches to choosing the order of a polynomial model: forward selection
and backward elimination.

Bias and variance

A model with high bias is too simple to capture the patterns in the data (under-fitting),
while a model with high variance fits the training points too closely, including their noise
(over-fitting). Ideally, a model should have both low bias and low variance, but in practice
there is a trade-off between the two.
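The points above can be illustrated with NumPy on hypothetical quadratic data: a degree-1 (linear) model under-fits the curve, while a degree-2 polynomial recovers it:

```python
import numpy as np

# Toy non-linear data from Y = 1 + 2X + 0.5X^2 (hypothetical, noise-free),
# which a straight line cannot fit but a degree-2 polynomial can.
X = np.linspace(-3, 3, 20)
Y = 1 + 2 * X + 0.5 * X**2

# Degree-1 (linear) fit: high bias, leaves large residual error on curved data
lin_resid = np.sum((np.polyval(np.polyfit(X, Y, 1), X) - Y) ** 2)

# Degree-2 polynomial fit recovers the generating coefficients
coeffs = np.polyfit(X, Y, 2)  # highest degree first: ≈ [0.5, 2, 1]
quad_resid = np.sum((np.polyval(coeffs, X) - Y) ** 2)

print(lin_resid > quad_resid)  # → True
```

Raising the degree further would keep driving the training residual toward zero, which is exactly the over-fitting risk described above: the extra flexibility would start fitting noise rather than signal on real data.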
