18.1 - How "Classification" Works - mp4

knn

Uploaded by

NAKKA PUNEETH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views5 pages

18.1 - How "Classification" Works - mp4

knn

Uploaded by

NAKKA PUNEETH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

So in this chapter, we will learn about what is classification, what is regression, and we'll

learn a very, very simple, yet very powerful machine learning technique called the knee rest
neighbors. It's also often called as KNN. But before we go and understand all of it, first, let's
take classification itself. Let's understand how classification works. How classification
works. So let's take our Amazon food reviews data set. We had our Amazon fine food
reviews data set that we saw as an example. So we will keep using this Amazon fine foods
data set as a running real world example. We will use this as a real world example
throughout this course. And it will be a running example so that you know how different
algorithms perform. Well. Why are we learning new algorithms? We learn all of that using
Amazon fine food review as a running example. We'll also use mnest, if you recall our mnest,
mnest that we learned when we learned about PCA and when we learned about t snee. So
just to quickly recap, MnisT is basically you're given a vector representation of images of
handwritten characters, and you have to determine whether it is, whether the character is
0123 or nine, so on and so forth or nine. Right. So that's the MNIST data set. So we'll keep
using Amazon fine foods data set and the MNIST data set as running examples when we
learn different algorithms. Before we go, what does classification actually mean? Right.
Classification, to put it very simply, let's take our Amazon fine Foods review for
classification to understand classification, right. So we have reviews. We have multiple
reviews. We have like 360K, roughly about 360K reviews here. For each review, we are
using the review text because we felt that review text is the most informative signal or
informative feature or informative variable. Each text, we got it converted to a vector. Right?
So we converted into a vector using multiple techniques. Right? Either bag of words or tfidf
or word to. Right. We learned multiple techniques on how to convert the text into a vector.
Now, for each review, I have a vector, right. And for each review, again, I also have the data
whether it is a positive review or negative review. Right? So what is classification all about?
The problem of classification. So the problem of classification is this. This is very, very
important. This is the crux of whole of machine learning algorithms which fall under
classification, right? So now we have our three hundred and sixty k, three hundred and sixty
four K reviews for which for each review, we have a vector representation. And we also
have whether the review is positive or negative. Right? So this is the data that we have now,
what classification means is in this data. Okay, so classification is all about finding a
function. So if I think about it, classification is all about given a new review, given a new
review, given a new review text. In our case, determine, determine, or predict or predict if
the review is positive or not. If the review is positive or negative. This is the task of
classification. Because why is it called classification? Because given a new review, given a
new review, let's call it RQ R query. You're querying and you're asking, given this review,
tell me whether it is a positive review or a negative review. This is what we want to find. So,
we are classifying a new review. We are classifying a new review into two classes. The first
class is positive class. The second class is negative class. That's why it's called classification,
right? Very simple. So classification can be thought of. Let's try to put all of this
mathematically. Classification can be thought of as finding a function. Let me explain what it
means. Right? So it can be thought of as finding a function like this. Actually, most of
machine learning is about finding a function. Let me explain that. Let me connect those two
mathematical dots. So, imagine each review is represented by a vector called X, okay?
Instead of Vi, let's just say the notation is not Vi. The notation is actually xi. It's a standard
notation. Every data point, or every review in our case, or every data point we are given, is
represented using a mathematical vector called xi. Now, given an x, given an x, I want to find
a function. I want to find a function f that will return a y for me. What is y here y is whether
the review is positive or negative. What is x here x is my review text. So, mathematically
speaking, my classification, the objective of my classification, is all about finding this magic
function f. Finding this function f, such that given a review text, if I apply this function f, the
mathematical function f, on the review text, I would get y, which says whether the review is
positive or not. This is the crux. This is the central concept. Crux, in English, basically means
the central concept. For those of you who didn't know that this is the central concept of all
of machine learning and specifically classification, right? It's all about given a new review.
Given a new review. Let's call it xq review. This is basically a query review. This is basically
a query review. Why is it called a query review? Because you're querying your machine
learning algorithm, saying, this is the review that I have. Now tell me, now tell me, what is
its class? You're querying it, asking it what is its class? It will return me a YQ. This YQ should
say whether it is positive or negative. This is the whole objective. This is a problem we are
trying to solve. In most problems in machine learning, not all the problems, there are
problems where it's not exactly this, but classification is all about this classification is all
about where YQ takes a few classes, right? In our case, what classes we have, we have the
positive class and the negative class. We'll come to understand what if YQ is something else?
We'll come to that little later in this chapter. But for now, all you have to remember is this is
the crux of machine learning. And how does machine learning work? How does
classification algorithms work? How does classification algorithms work? How does
classification algorithms work? So let's assume this is your algorithm. Let's assume this is
your classification algorithm. We don't understand. Let's keep it as a black box. Right now.
Let's not worry. What is there inside? You give it something called a training data set. You
give it something called a data set, right? Or it's also called as a training data set. When you
give this data set, the algorithm, the training data set contains all your pairs of Xi and Yi. It
has Xi pairs of Yi and Xi, many, many such pairs. So it says, if this is the Xi, what is the
corresponding Yi? So let's say I goes from one to, let's say 100K. Okay? You give it lot of
data. You give it lot of. So to put it again in our perspective, you give it lot of data, where you
say, this is my review and this is the result, whether it's positive or not. The algorithm now
takes all of this input. This is called the training data, because the algorithm trains on this
data. Here you're giving Xi and Yi both. Right? Now the algorithm learns the function f.
Algorithm learns because it is seeing lot of examples, right? It's seeing lot of examples of xi
and yi. By looking at all these examples, it learns this function. Right? Now, when I take this
function, now, after it has learned this function f, if I give it any new point, if I give it any
new point, it will return me its corresponding class. This is the crux of classification. This is
how classification works. This stage is called the training stage. This stage is called the
training stage. This stage is called the testing or the evaluation stage. This stage is called the
testing or the evaluation stage. Because here, what are we doing here? We are giving it some
data. And we are saying, here is the mapping. Here is my x one. Here is my y one. Here is my
x two, here is my y two, here is my x three, y three, so on and so forth, x n, y n. Now, using all
of this data, try to learn the mapping or the function. Try to learn the function such that f of
x I equals to Y. I try to learn this, and that's what the algorithm tries to do. That's what the
algorithm tries to do. It trains on this data and learns this new function. Once it learns this
function, once it learns this function, our training is over. Now, in test or evaluation stage,
we give it new points that it has not seen. Remember, this xq may not be there in this data
set. So if you give it new data set and if it can predict yq accurately, then you say, wow, I've
learned the right function that I care about. Of course, machine learning is not perfect. It will
not learn the perfect function here. It will try to do its best job using various techniques. Of
course, some techniques will be able to learn better functions, some techniques will be able
to learn worse off functions. But this is the core idea of whole of classification and.

So in this chapter, we will learn about what is classification, what is regression, and we'll
learn a very, very simple, yet very powerful machine learning technique called the knee rest
neighbors. It's also often called as KNN. But before we go and understand all of it, first, let's
take classification itself. Let's understand how classification works. How classification
works. So let's take our Amazon food reviews data set. We had our Amazon fine food
reviews data set that we saw as an example. So we will keep using this Amazon fine foods
data set as a running real world example. We will use this as a real world example
throughout this course. And it will be a running example so that you know how different
algorithms perform. Well. Why are we learning new algorithms? We learn all of that using
Amazon fine food review as a running example. We'll also use mnest, if you recall our mnest,
mnest that we learned when we learned about PCA and when we learned about t snee. So
just to quickly recap, MnisT is basically you're given a vector representation of images of
handwritten characters, and you have to determine whether it is, whether the character is
0123 or nine, so on and so forth or nine. Right. So that's the MNIST data set. So we'll keep
using Amazon fine foods data set and the MNIST data set as running examples when we
learn different algorithms. Before we go, what does classification actually mean? Right.
Classification, to put it very simply, let's take our Amazon fine Foods review for
classification to understand classification, right. So we have reviews. We have multiple
reviews. We have like 360K, roughly about 360K reviews here. For each review, we are
using the review text because we felt that review text is the most informative signal or
informative feature or informative variable. Each text, we got it converted to a vector. Right?
So we converted into a vector using multiple techniques. Right? Either bag of words or tfidf
or word to. Right. We learned multiple techniques on how to convert the text into a vector.
Now, for each review, I have a vector, right. And for each review, again, I also have the data
whether it is a positive review or negative review. Right? So what is classification all about?
The problem of classification. So the problem of classification is this. This is very, very
important. This is the crux of whole of machine learning algorithms which fall under
classification, right? So now we have our three hundred and sixty k, three hundred and sixty
four K reviews for which for each review, we have a vector representation. And we also
have whether the review is positive or negative. Right? So this is the data that we have now,
what classification means is in this data. Okay, so classification is all about finding a
function. So if I think about it, classification is all about given a new review, given a new
review, given a new review text. In our case, determine, determine, or predict or predict if
the review is positive or not. If the review is positive or negative. This is the task of
classification. Because why is it called classification? Because given a new review, given a
new review, let's call it RQ R query. You're querying and you're asking, given this review,
tell me whether it is a positive review or a negative review. This is what we want to find. So,
we are classifying a new review. We are classifying a new review into two classes. The first
class is positive class. The second class is negative class. That's why it's called classification,
right? Very simple. So classification can be thought of. Let's try to put all of this
mathematically. Classification can be thought of as finding a function. Let me explain what it
means. Right? So it can be thought of as finding a function like this. Actually, most of
machine learning is about finding a function. Let me explain that. Let me connect those two
mathematical dots. So, imagine each review is represented by a vector called X, okay?
Instead of Vi, let's just say the notation is not Vi. The notation is actually xi. It's a standard
notation. Every data point, or every review in our case, or every data point we are given, is
represented using a mathematical vector called xi. Now, given an x, given an x, I want to find
a function. I want to find a function f that will return a y for me. What is y here y is whether
the review is positive or negative. What is x here x is my review text. So, mathematically
speaking, my classification, the objective of my classification, is all about finding this magic
function f. Finding this function f, such that given a review text, if I apply this function f, the
mathematical function f, on the review text, I would get y, which says whether the review is
positive or not. This is the crux. This is the central concept. Crux, in English, basically means
the central concept. For those of you who didn't know that this is the central concept of all
of machine learning and specifically classification, right? It's all about given a new review.
Given a new review. Let's call it xq review. This is basically a query review. This is basically
a query review. Why is it called a query review? Because you're querying your machine
learning algorithm, saying, this is the review that I have. Now tell me, now tell me, what is
its class? You're querying it, asking it what is its class? It will return me a YQ. This YQ should
say whether it is positive or negative. This is the whole objective. This is a problem we are
trying to solve. In most problems in machine learning, not all the problems, there are
problems where it's not exactly this, but classification is all about this classification is all
about where YQ takes a few classes, right? In our case, what classes we have, we have the
positive class and the negative class. We'll come to understand what if YQ is something else?
We'll come to that little later in this chapter. But for now, all you have to remember is this is
the crux of machine learning. And how does machine learning work? How does
classification algorithms work? How does classification algorithms work? How does
classification algorithms work? So let's assume this is your algorithm. Let's assume this is
your classification algorithm. We don't understand. Let's keep it as a black box. Right now.
Let's not worry. What is there inside? You give it something called a training data set. You
give it something called a data set, right? Or it's also called as a training data set. When you
give this data set, the algorithm, the training data set contains all your pairs of Xi and Yi. It
has Xi pairs of Yi and Xi, many, many such pairs. So it says, if this is the Xi, what is the
corresponding Yi? So let's say I goes from one to, let's say 100K. Okay? You give it lot of
data. You give it lot of. So to put it again in our perspective, you give it lot of data, where you
say, this is my review and this is the result, whether it's positive or not. The algorithm now
takes all of this input. This is called the training data, because the algorithm trains on this
data. Here you're giving Xi and Yi both. Right? Now the algorithm learns the function f.
Algorithm learns because it is seeing lot of examples, right? It's seeing lot of examples of xi
and yi. By looking at all these examples, it learns this function. Right? Now, when I take this
function, now, after it has learned this function f, if I give it any new point, if I give it any
new point, it will return me its corresponding class. This is the crux of classification. This is
how classification works. This stage is called the training stage. This stage is called the
training stage. This stage is called the testing or the evaluation stage. This stage is called the
testing or the evaluation stage. Because here, what are we doing here? We are giving it some
data. And we are saying, here is the mapping. Here is my x one. Here is my y one. Here is my
x two, here is my y two, here is my x three, y three, so on and so forth, x n, y n. Now, using all
of this data, try to learn the mapping or the function. Try to learn the function such that f of
x I equals to Y. I try to learn this, and that's what the algorithm tries to do. That's what the
algorithm tries to do. It trains on this data and learns this new function. Once it learns this
function, once it learns this function, our training is over. Now, in test or evaluation stage,
we give it new points that it has not seen. Remember, this xq may not be there in this data
set. So if you give it new data set and if it can predict yq accurately, then you say, wow, I've
learned the right function that I care about. Of course, machine learning is not perfect. It will
not learn the perfect function here. It will try to do its best job using various techniques. Of
course, some techniques will be able to learn better functions, some techniques will be able
to learn worse off functions. But this is the core idea of whole of classification and.

ML Unit 4
No ratings yet
ML Unit 4
76 pages
Inductive Learning and Machine Learning
100% (1)
Inductive Learning and Machine Learning
321 pages
CS585 Lecture October03rd
No ratings yet
CS585 Lecture October03rd
146 pages
Learning AI
No ratings yet
Learning AI
34 pages
Date: Venue:: 28-11-2023, Saveetha School of Engineering
No ratings yet
Date: Venue:: 28-11-2023, Saveetha School of Engineering
100 pages
The Hundred-Page Machine Learning Book-Andriy Burkov (2019) - Removed
No ratings yet
The Hundred-Page Machine Learning Book-Andriy Burkov (2019) - Removed
145 pages
Topic 08 - Data Modelling - Part II
No ratings yet
Topic 08 - Data Modelling - Part II
59 pages
Machine Learning SELF
No ratings yet
Machine Learning SELF
29 pages
Supervised Learning
No ratings yet
Supervised Learning
30 pages
Chapter 5: Database Design 1: Normalization True / False: Cengage Learning Testing, Powered by Cognero
100% (1)
Chapter 5: Database Design 1: Normalization True / False: Cengage Learning Testing, Powered by Cognero
6 pages
Dmunit 4
No ratings yet
Dmunit 4
23 pages
BSC ML CH1
No ratings yet
BSC ML CH1
63 pages
OSHÚN
100% (1)
OSHÚN
10 pages
DM Assignment 2
No ratings yet
DM Assignment 2
23 pages
Notes
No ratings yet
Notes
35 pages
Machinelearning GateNotes
No ratings yet
Machinelearning GateNotes
105 pages
AI ML Concepts
No ratings yet
AI ML Concepts
97 pages
Module 1 Lab 2
No ratings yet
Module 1 Lab 2
7 pages
Domingos
No ratings yet
Domingos
9 pages
ML Mid Syllabus
No ratings yet
ML Mid Syllabus
182 pages
Unit 3
No ratings yet
Unit 3
123 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
21 pages
IntroClassificationDA 2024
No ratings yet
IntroClassificationDA 2024
129 pages
Unit Ii
No ratings yet
Unit Ii
118 pages
ML Sample PDF
No ratings yet
ML Sample PDF
5 pages
4 DL
No ratings yet
4 DL
81 pages
Lec 2
No ratings yet
Lec 2
15 pages
Basics of Machine Learning and Classifications: Dr. Helal Uddin Ahmed
No ratings yet
Basics of Machine Learning and Classifications: Dr. Helal Uddin Ahmed
18 pages
Data Science Introduction
No ratings yet
Data Science Introduction
6 pages
Introduction Class
No ratings yet
Introduction Class
134 pages
M2 Transcript
No ratings yet
M2 Transcript
11 pages
ML Week 3
No ratings yet
ML Week 3
6 pages
Machine Learning
No ratings yet
Machine Learning
95 pages
OOSD All Units Notes by MultiAtoms
No ratings yet
OOSD All Units Notes by MultiAtoms
93 pages
ML Chap 2
No ratings yet
ML Chap 2
60 pages
ML Unit 2
No ratings yet
ML Unit 2
31 pages
Unit 5 Classification PDF
No ratings yet
Unit 5 Classification PDF
131 pages
Univan Ship Management LTD.: Chennai Office
No ratings yet
Univan Ship Management LTD.: Chennai Office
17 pages
18.2 - Data Matrix Notation - mp4
No ratings yet
18.2 - Data Matrix Notation - mp4
3 pages
Pattern Recognition 14
No ratings yet
Pattern Recognition 14
46 pages
Spam Not Spam
No ratings yet
Spam Not Spam
7 pages
Machine Lar Arii
No ratings yet
Machine Lar Arii
9 pages
Classification FoundationalMathofAI S24
No ratings yet
Classification FoundationalMathofAI S24
6 pages
ML 4
No ratings yet
ML 4
32 pages
Unit 3
No ratings yet
Unit 3
27 pages
ML
No ratings yet
ML
49 pages
Chapter 2 Machine Learning Draft-85-172
No ratings yet
Chapter 2 Machine Learning Draft-85-172
88 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
STA404 Exam Booklet - 20.03.2023
No ratings yet
STA404 Exam Booklet - 20.03.2023
153 pages
08 Class Basic
No ratings yet
08 Class Basic
141 pages
Day35 Classification Algorithm
No ratings yet
Day35 Classification Algorithm
5 pages
4.18.2024 Impartiality-Confidentiality
No ratings yet
4.18.2024 Impartiality-Confidentiality
24 pages
Definition of Cryptocurrency
No ratings yet
Definition of Cryptocurrency
13 pages
ML 7th Sem AIML ITE Notes Complete LONG (1) - 10-33
No ratings yet
ML 7th Sem AIML ITE Notes Complete LONG (1) - 10-33
24 pages
ML Unit I
No ratings yet
ML Unit I
14 pages
Classification
No ratings yet
Classification
15 pages
ML Notes
No ratings yet
ML Notes
10 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
5700 13024 1 PB
No ratings yet
5700 13024 1 PB
12 pages
Supervised Learning Final With Diagrams Cleaned
No ratings yet
Supervised Learning Final With Diagrams Cleaned
7 pages
CCTR-809 Asset GPS Tracker User Manual
No ratings yet
CCTR-809 Asset GPS Tracker User Manual
16 pages
ABP DWDM UNIT 4 Classification 1
No ratings yet
ABP DWDM UNIT 4 Classification 1
51 pages
FS - 720 - Общее описание - A6V10210355
No ratings yet
FS - 720 - Общее описание - A6V10210355
182 pages
Machine Learning - Brief
No ratings yet
Machine Learning - Brief
12 pages
Junos Genius PDF
No ratings yet
Junos Genius PDF
12 pages
Machine Learning Tutorial For Beginners
No ratings yet
Machine Learning Tutorial For Beginners
15 pages
Machine Learning HC
No ratings yet
Machine Learning HC
4 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
API Checklist INFO
No ratings yet
API Checklist INFO
17 pages
21st Century Learning For 21st Century Skills 7th European Conference Of Technology Enhanced Learning Ectel 2012 Saarbrcken Germany September 1821 2012 Proceedings 1st Edition Richard Noss Auth instant download
No ratings yet
21st Century Learning For 21st Century Skills 7th European Conference Of Technology Enhanced Learning Ectel 2012 Saarbrcken Germany September 1821 2012 Proceedings 1st Edition Richard Noss Auth instant download
77 pages
Exam 2-1-25
No ratings yet
Exam 2-1-25
4 pages
Led 08 02 2020
No ratings yet
Led 08 02 2020
41 pages
Minchenkov 2022
No ratings yet
Minchenkov 2022
6 pages
How To Make Speakers
No ratings yet
How To Make Speakers
4 pages
105 Machine Learning Paper
No ratings yet
105 Machine Learning Paper
6 pages
GAIA Liste Public2025.03.28
No ratings yet
GAIA Liste Public2025.03.28
6 pages
Das 350
No ratings yet
Das 350
6 pages
Add 020264
No ratings yet
Add 020264
20 pages
Aishwarya Digitec Profile Present
No ratings yet
Aishwarya Digitec Profile Present
11 pages
Horus Heresy Cost Efficiency
No ratings yet
Horus Heresy Cost Efficiency
11 pages
FYP Final Report Preparation 2019-2020 - MKMJ PDF
No ratings yet
FYP Final Report Preparation 2019-2020 - MKMJ PDF
10 pages
18.15 - Visualizing Train, Validation and Test Datasets - mp4
No ratings yet
18.15 - Visualizing Train, Validation and Test Datasets - mp4
3 pages
28.7 - Polynomial Kernel - mp4
No ratings yet
28.7 - Polynomial Kernel - mp4
3 pages
2.7 - Operators - mp4
No ratings yet
2.7 - Operators - mp4
3 pages
PP 2500PC 20221010
No ratings yet
PP 2500PC 20221010
2 pages
38.1 - Problem Formulation Movie Reviews - mp4
No ratings yet
38.1 - Problem Formulation Movie Reviews - mp4
5 pages
Advanced Sessions STEAM
No ratings yet
Advanced Sessions STEAM
9 pages
57.7 - USE, DESCRIBE, SHOW TABLES - mp4
No ratings yet
57.7 - USE, DESCRIBE, SHOW TABLES - mp4
4 pages
Distributed Generation and Microturbines
No ratings yet
Distributed Generation and Microturbines
5 pages
28.13 - Cases - mp4
No ratings yet
28.13 - Cases - mp4
3 pages
56.11 - PageRank - mp4
No ratings yet
56.11 - PageRank - mp4
3 pages
57.10 - ORDER BY - mp4
No ratings yet
57.10 - ORDER BY - mp4
2 pages
Dpa M.tech
No ratings yet
Dpa M.tech
3 pages
2.2 - Why Learn Python - mp4
No ratings yet
2.2 - Why Learn Python - mp4
1 page
2.4 - Comments, Indentation and Statements - mp4
No ratings yet
2.4 - Comments, Indentation and Statements - mp4
2 pages
Data Dictionary Example
No ratings yet
Data Dictionary Example
3 pages
Resume - Taha - Taha Jamal
No ratings yet
Resume - Taha - Taha Jamal
1 page
p102613 Docjl Burnerspec Sheet 3
No ratings yet
p102613 Docjl Burnerspec Sheet 3
2 pages
MOX+ 16mb
No ratings yet
MOX+ 16mb
1 page
GROKKING ALGORITHM BLUEPRINT: A Comprehensive Beginner's Guide to Learn the Realms of Grokking Algorithms from A-Z and Become Efficient Programmers
From Everand
GROKKING ALGORITHM BLUEPRINT: A Comprehensive Beginner's Guide to Learn the Realms of Grokking Algorithms from A-Z and Become Efficient Programmers
William Turner
No ratings yet
egghead's Guide to Algebra
From Everand
egghead's Guide to Algebra
Peterson's
No ratings yet

18.1 - How "Classification" Works - mp4

Uploaded by

18.1 - How "Classification" Works - mp4

Uploaded by

So in this chapter, we will learn about what is classification, what is regression, and we'll

You might also like