Data Imputation With KNN: E (A, B) X X E (A, B) X X

The document discusses using K-nearest neighbors (KNN) imputation to fill in missing data values. It explains that KNN imputation works by calculating the Euclidean distance between points to identify the K nearest neighbors, then imputing the missing value as the mean of the known values for those neighbors. The document provides an example using a dataset with missing values, calculating distances to identify the 2 nearest neighbors for each missing value, then imputing the mean of those neighbors.

Uploaded by

Hsu Let Yee Hnin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views2 pages

Data Imputation With KNN: E (A, B) X X E (A, B) X X

Uploaded by

Hsu Let Yee Hnin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Data Imputation with KNN

 The K Nearest Neighbor is the assigning a value based on how nearly it similar the points in the
training set.
 The data is imputed with the mean of nearest neighbors.
2
 E ( a , b) = √ ∑ (x −x )
i∈ D
ai bi

 E ( a , b ) is the distance between the two cases a and b

 x ai and x bi are the values of attribute i in cases a and b respectively,

 D is the set of attributes with non_missing values in both cases

No TotayDayMinutes TotalDayCalls TotalDayCharge

1 100.0 30.0 NaN
2 90.0 45.0 40.0
3 NaN 56.0 80.0
4 95.0 NaN 98.0

In this example calculation, k is set to 2.

TodayDayMinutes
E( r 3 , r 1)=√(56−30)2 =26
2 2
E( r 3 , r 2)=√ (56−45 ) + ( 80−40 ) =41.48

E( r 3 , r 4)=√(80−98)2=18
Select the first two values of the ascending Euclidean distance.
The first two values are 100 and 95.
The mean value of these is 97.5.

No TotayDayMinutes TotalDayCalls TotalDayCharge

1 100.0 30.0 NaN
2 90.0 45.0 40.0
3 97.5 56.0 80.0
4 95.0 NaN 98.0

TotalDayCalls
2
E( r 4 , r 1)= √( 95−100 ) =5
2 2
E( r 4 , r 2)= √( 95−90 ) + ( 98−40 ) =58.21
2 2
E( r 4 , r 3)=√ ( 95−97.5 ) + ( 98−80 ) =18.17

The selected values are 30 and 56.

The imputed data is 43.
No TotayDayMinutes TotalDayCalls TotalDayCharge
1 100.0 30.0 NaN
2 90.0 45.0 40.0
3 97.5 56.0 80.0
4 95.0 43.0 98.0

TotalDayCharge
2 2
E( r 1 , r 2)= √( 100−9 0 ) −( 30−45 ) =15.81
2 2
E( r 1 , r 3)=√( 100−97.5 ) + ( 30−56 ) =26.1 1
2 2
E( r 1 , r 4)= √ ( 100−95 ) + ( 30−43 ) =13.92
The selected values are 40 and 98.
The imputed data (mean of neighbors) is 69.
No TotayDayMinutes TotalDayCalls TotalDayCharge
1 100.0 30.0 69
2 90.0 45.0 40.0
3 97.5 56.0 80.0
4 95.0 43.0 98.0

Caie As Level Psychology 9990 Methodology 63d5229efa0a7313631e05cb 853
No ratings yet
Caie As Level Psychology 9990 Methodology 63d5229efa0a7313631e05cb 853
9 pages
ANSYS Workbench: Mechanical Examples
No ratings yet
ANSYS Workbench: Mechanical Examples
54 pages
Tiger Tools
No ratings yet
Tiger Tools
2 pages
KNN Imputer For Multivariate Missing Data Imputation
No ratings yet
KNN Imputer For Multivariate Missing Data Imputation
5 pages
K-Nearest Neighbor (KNN) : Non-Parametric Algorithm
No ratings yet
K-Nearest Neighbor (KNN) : Non-Parametric Algorithm
7 pages
KNN Imputation Details and Results
No ratings yet
KNN Imputation Details and Results
1 page
Road Traffic Algorithm
No ratings yet
Road Traffic Algorithm
5 pages
A Novel K NN Algorithm With Data Driven
No ratings yet
A Novel K NN Algorithm With Data Driven
12 pages
KNN Assignment Report
No ratings yet
KNN Assignment Report
3 pages
What Is KNN
No ratings yet
What Is KNN
9 pages
Lecture#2. K Nearest Neighbors
No ratings yet
Lecture#2. K Nearest Neighbors
10 pages
K-Nearest Neighbours (KNN)
No ratings yet
K-Nearest Neighbours (KNN)
10 pages
KNN - Algorithm - SVM - Algorithm
No ratings yet
KNN - Algorithm - SVM - Algorithm
27 pages
Supervised Learning and K Nearest Neighbors: Business Intelligence For Managers
No ratings yet
Supervised Learning and K Nearest Neighbors: Business Intelligence For Managers
15 pages
K Nearestneighborknnalgorithm 241117075907 d767c46d
No ratings yet
K Nearestneighborknnalgorithm 241117075907 d767c46d
13 pages
KNN Algorithm - PPT (Autosaved)
0% (1)
KNN Algorithm - PPT (Autosaved)
8 pages
Week 3. K-Nearest Neighbours (KNN) : Dr. Shuo Wang
No ratings yet
Week 3. K-Nearest Neighbours (KNN) : Dr. Shuo Wang
18 pages
A Complete Guide To K Nearest Neighbors Algorithm 1598272616
No ratings yet
A Complete Guide To K Nearest Neighbors Algorithm 1598272616
13 pages
Ch2 - Lec2 - K Nearest Neighbour (KNN)
No ratings yet
Ch2 - Lec2 - K Nearest Neighbour (KNN)
18 pages
Introduction To Classification - KNN
No ratings yet
Introduction To Classification - KNN
29 pages
KNN
No ratings yet
KNN
53 pages
KNN 2
No ratings yet
KNN 2
53 pages
K-Nearest Neighbors Algorithm
No ratings yet
K-Nearest Neighbors Algorithm
7 pages
Chap7 KNN
No ratings yet
Chap7 KNN
15 pages
Decision Tree KNN
No ratings yet
Decision Tree KNN
9 pages
KNN Algorithm
No ratings yet
KNN Algorithm
15 pages
INSY446 - 5 - Classification Part 2
No ratings yet
INSY446 - 5 - Classification Part 2
37 pages
Saputra 2019 J. Phys. Conf. Ser. 1235 012006
No ratings yet
Saputra 2019 J. Phys. Conf. Ser. 1235 012006
7 pages
Machine Learning KNN Presentation
No ratings yet
Machine Learning KNN Presentation
28 pages
Machine Learning KNN Presentation
No ratings yet
Machine Learning KNN Presentation
28 pages
CSE445 NSU Week - 5
No ratings yet
CSE445 NSU Week - 5
26 pages
K-Nearest Neighbor in Missing Data Imputation: Ms.R.Malarvizhi, DR - Antony Selvadoss Thanamani
No ratings yet
K-Nearest Neighbor in Missing Data Imputation: Ms.R.Malarvizhi, DR - Antony Selvadoss Thanamani
3 pages
K-Nearest Neighbour Classifier: Prerequisite
No ratings yet
K-Nearest Neighbour Classifier: Prerequisite
6 pages
KNN Dan KMeans
No ratings yet
KNN Dan KMeans
37 pages
K Nearest Neighbours Based On Mutual Inf
No ratings yet
K Nearest Neighbours Based On Mutual Inf
6 pages
K Nearest Neighbor KNN
No ratings yet
K Nearest Neighbor KNN
18 pages
KNN With Example
No ratings yet
KNN With Example
21 pages
Supervised Learning KNN
No ratings yet
Supervised Learning KNN
23 pages
30 Interview Questions To Test Your Skills On KNN Algorithm
No ratings yet
30 Interview Questions To Test Your Skills On KNN Algorithm
12 pages
Chapter 7 - K-Nearest-Neighbor: Data Mining For Business Analytics in Python
No ratings yet
Chapter 7 - K-Nearest-Neighbor: Data Mining For Business Analytics in Python
21 pages
4 KNN Classifier
No ratings yet
4 KNN Classifier
6 pages
Why Do We Need A K-NN Algorithm?
No ratings yet
Why Do We Need A K-NN Algorithm?
11 pages
4 KNN Classifier
No ratings yet
4 KNN Classifier
6 pages
K Nearest Neighbors - Classification: Algorithm
No ratings yet
K Nearest Neighbors - Classification: Algorithm
4 pages
Chapter 4. K Nearest Neighbors
No ratings yet
Chapter 4. K Nearest Neighbors
55 pages
12 ML KNN
No ratings yet
12 ML KNN
28 pages
K Nearest Neighbors: Probably A Duck."
No ratings yet
K Nearest Neighbors: Probably A Duck."
14 pages
K-Means and KNN
No ratings yet
K-Means and KNN
11 pages
Instance-Based Learning: K-Nearest Neighbour Learning
No ratings yet
Instance-Based Learning: K-Nearest Neighbour Learning
21 pages
Lec 7
No ratings yet
Lec 7
40 pages
Part A 3. KNN Classification
No ratings yet
Part A 3. KNN Classification
35 pages
Machine Learning: Lecture # 2 Data Normalization, KNN & Minimum Distance
No ratings yet
Machine Learning: Lecture # 2 Data Normalization, KNN & Minimum Distance
74 pages
ML 06 KNN
No ratings yet
ML 06 KNN
28 pages
Aiml Unit 3 2
No ratings yet
Aiml Unit 3 2
21 pages
1694600817-Unit2.3 KNN CU 2.0
No ratings yet
1694600817-Unit2.3 KNN CU 2.0
25 pages
K-Nearest Neighbor Classification-Algorithm and Characteristics
No ratings yet
K-Nearest Neighbor Classification-Algorithm and Characteristics
6 pages
ML Lec-13
No ratings yet
ML Lec-13
17 pages
Dynamic KNNF
No ratings yet
Dynamic KNNF
3 pages
S3 K Nearest Neighbor LKW 15jan2025
No ratings yet
S3 K Nearest Neighbor LKW 15jan2025
16 pages
K Nearest Neighbors (KNN)
No ratings yet
K Nearest Neighbors (KNN)
14 pages
Unit II 2 Mark Answers ML
No ratings yet
Unit II 2 Mark Answers ML
3 pages
K-Nearest Neighbor (KNN) ..: Class or Value
No ratings yet
K-Nearest Neighbor (KNN) ..: Class or Value
18 pages
Core Concepts in Real Analysis
From Everand
Core Concepts in Real Analysis
Roshan Trivedi
No ratings yet
Feature Selection: Mean Square Between Groups Mean Square Within Groups
No ratings yet
Feature Selection: Mean Square Between Groups Mean Square Within Groups
4 pages
Api Ocr
No ratings yet
Api Ocr
60 pages
SMOTE Samples Calculation: X - Class (Y 1)
No ratings yet
SMOTE Samples Calculation: X - Class (Y 1)
2 pages
Comparisons of Bi Tools: Presented by Hsu Let
No ratings yet
Comparisons of Bi Tools: Presented by Hsu Let
6 pages
BIM-FDB-BI Tools Quotation
No ratings yet
BIM-FDB-BI Tools Quotation
1 page
Without SMOTE - Data - Imputation - LR - D2
No ratings yet
Without SMOTE - Data - Imputation - LR - D2
7 pages
SMOTE Samples Calculation: X - Class (Y 1)
No ratings yet
SMOTE Samples Calculation: X - Class (Y 1)
2 pages
Logistic: Regression Sigmoid Function
No ratings yet
Logistic: Regression Sigmoid Function
4 pages
Answer
No ratings yet
Answer
3 pages
University of Computer Studies, Mandalay (UCSM)
No ratings yet
University of Computer Studies, Mandalay (UCSM)
12 pages
Chapter 6 - Exercises
No ratings yet
Chapter 6 - Exercises
5 pages
Chapter 2 Digital Image Fundamentals
No ratings yet
Chapter 2 Digital Image Fundamentals
85 pages
Chapter 9 Morphological Image Processing1
No ratings yet
Chapter 9 Morphological Image Processing1
79 pages
Mesh PR Ofile Est Imation Using Logistic Regressi On and Random Forest
No ratings yet
Mesh PR Ofile Est Imation Using Logistic Regressi On and Random Forest
17 pages
University of Computer Studies, Mandalay (UCSM)
No ratings yet
University of Computer Studies, Mandalay (UCSM)
23 pages
Comparison of Learning Techniques For Prediction of Customer Churn in Telecommunication
No ratings yet
Comparison of Learning Techniques For Prediction of Customer Churn in Telecommunication
36 pages
Cut & Bent Reinforcement
No ratings yet
Cut & Bent Reinforcement
3 pages
CS Practice Set-Quiz
No ratings yet
CS Practice Set-Quiz
6 pages
02 Chapter 3 - Weight Volume Relationships
No ratings yet
02 Chapter 3 - Weight Volume Relationships
42 pages
Summative MATH 6 Q1
0% (1)
Summative MATH 6 Q1
4 pages
물리 교재 28단원
No ratings yet
물리 교재 28단원
26 pages
Reuleaux Triangle Summary
No ratings yet
Reuleaux Triangle Summary
4 pages
Studying The Factors Affecting The Settling Velocity of Solid Particles in Non-Newtonian Fluids
No ratings yet
Studying The Factors Affecting The Settling Velocity of Solid Particles in Non-Newtonian Fluids
10 pages
Convolution Sum PDF
No ratings yet
Convolution Sum PDF
17 pages
LectureSeven UnitCommitment PDF
No ratings yet
LectureSeven UnitCommitment PDF
15 pages
Integrating Factors Found by Inspection
50% (2)
Integrating Factors Found by Inspection
14 pages
Scan 9 Apr 2019 PDF
No ratings yet
Scan 9 Apr 2019 PDF
26 pages
g8 q3 Week Lesson If Then Statements
100% (2)
g8 q3 Week Lesson If Then Statements
14 pages
M2 - T4 - Cell Number Formats
No ratings yet
M2 - T4 - Cell Number Formats
2 pages
Submitted in Partial Fulfilment For The Award of Degree of
No ratings yet
Submitted in Partial Fulfilment For The Award of Degree of
13 pages
On The Impact 0F Uni-Directional Forces On High-Voltage Towers Following Con Ductor-Breakâge
No ratings yet
On The Impact 0F Uni-Directional Forces On High-Voltage Towers Following Con Ductor-Breakâge
10 pages
Time Complexity: Dr. Zahid Halim
No ratings yet
Time Complexity: Dr. Zahid Halim
32 pages
High Voltage Transformer
No ratings yet
High Voltage Transformer
12 pages
PrOBLEM Reading and Measuring THERMOMETER
No ratings yet
PrOBLEM Reading and Measuring THERMOMETER
16 pages
2 Limits To Post
No ratings yet
2 Limits To Post
3 pages
Allied Radio Data Handbook 1943
No ratings yet
Allied Radio Data Handbook 1943
52 pages
Torque Rotation
No ratings yet
Torque Rotation
6 pages
Elementary Statistics A Step by Step Approach 9th Edition Bluman Test Bank PDF Download
100% (2)
Elementary Statistics A Step by Step Approach 9th Edition Bluman Test Bank PDF Download
65 pages
ChE 3323 Syllabus 2016
No ratings yet
ChE 3323 Syllabus 2016
5 pages
Acceleration
No ratings yet
Acceleration
4 pages
Trigonometry Sheet - 05
No ratings yet
Trigonometry Sheet - 05
10 pages
Quarter 2 Module 5
50% (2)
Quarter 2 Module 5
4 pages
4 Volume of A Prism Ws
No ratings yet
4 Volume of A Prism Ws
3 pages