0% found this document useful (0 votes)

32 views

Lecture 2.3 Data Normalization

Feature normalization is a preprocessing step essential for ensuring that features with different scales contribute equally in machine learning algorithms, particularly those that compute distances. Two common methods for normalization are min-max scaling, which rescales features to a range of [0, 1], and standardization, which involves subtracting the mean and dividing by the standard deviation. It is important to note that the output or target variable should not be rescaled or standardized.

Uploaded by

homerajasekhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views

Lecture 2.3 Data Normalization

Uploaded by

homerajasekhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Note about Normalization

Features Normalization
• Feature normalization is a preprocessing step used to normalize the range
of the features.

• It is important when the features have very different scales.

– For example, if the values of feature 𝟏𝟏 are ∈ [0, 1] but the values of feature 𝟐𝟐
are ∈ [120, 190], then normalizing the features is important.

• Motivation:
– Suppose that some ML algorithm computes the Euclidean distance between two
points. If one of the features has a broad range of values, the distance will be
governed by this particular feature. Therefore, the range of all features should be
normalized so that each feature contributes approximately proportionately to
the final distance.

0.11 0.52
− = 3.027
182 179

2
min-max Features scaling

𝑣𝑣𝑗𝑗 − min(𝑣𝑣𝑗𝑗 )
𝑣𝑣𝑣𝑗𝑗 =
max 𝑣𝑣𝑗𝑗 − min(𝑣𝑣𝑗𝑗 )

• 𝑣𝑣𝑗𝑗 is a column (corresponding to feature 𝑗𝑗) from the data matrix 𝑋𝑋.
• 𝑣𝑣′𝑗𝑗 are the normalized values of feature 𝑗𝑗. These values will be ∈ [0, 1]

3
min-max Features scaling
• Before Features Scaling • After Features Scaling

4
Features Standardization

𝑣𝑣𝑗𝑗 − mean(𝑣𝑣𝑗𝑗 )
𝑣𝑣𝑣𝑗𝑗 =
stdev 𝑣𝑣𝑗𝑗

• 𝑣𝑣𝑗𝑗 is a column (corresponding to feature 𝑗𝑗) from the data matrix 𝑋𝑋.
• 𝑣𝑣′𝑗𝑗 are the normalized values of feature 𝑗𝑗. These values will be ∈ [0, 1].
• To normalize, we just subtract the mean and divide by the standard deviation.

5
Features Standardization
• Before • After

6
• NOTE: do not rescale or standardize the output (target variable).

𝑣𝑣𝑗𝑗 − min(𝑣𝑣𝑗𝑗 )
𝑣𝑣𝑣𝑗𝑗 =
max 𝑣𝑣𝑗𝑗 − min(𝑣𝑣𝑗𝑗 )
𝑣𝑣𝑗𝑗 − mean(𝑣𝑣𝑗𝑗 )
𝑣𝑣𝑣𝑗𝑗 =
stdev 𝑣𝑣𝑗𝑗

MTU Engine 4000-Series Functional Description
100% (5)
MTU Engine 4000-Series Functional Description
42 pages
Yamato PPC-200W Manual Tecnico
100% (1)
Yamato PPC-200W Manual Tecnico
17 pages
ML Unit 2
No ratings yet
ML Unit 2
90 pages
Normalization Vs Standardization
No ratings yet
Normalization Vs Standardization
2 pages
Feature Scaling (Standardization & Normalization)
No ratings yet
Feature Scaling (Standardization & Normalization)
35 pages
Standardization & Normalization In: ML With Python Example
No ratings yet
Standardization & Normalization In: ML With Python Example
8 pages
Unit 2 ML 2019
No ratings yet
Unit 2 ML 2019
91 pages
8 Normalization Methods
No ratings yet
8 Normalization Methods
10 pages
Presentation #1 Data Mining Minahel Khan BSIT(E)22!11!1
No ratings yet
Presentation #1 Data Mining Minahel Khan BSIT(E)22!11!1
7 pages
Seven Lab Instruction
No ratings yet
Seven Lab Instruction
38 pages
Standardization vs Normalization in Pattern Recognition
No ratings yet
Standardization vs Normalization in Pattern Recognition
1 page
23.-Scaling-Techniques
No ratings yet
23.-Scaling-Techniques
30 pages
Practical 6
No ratings yet
Practical 6
6 pages
Data Preparation
No ratings yet
Data Preparation
11 pages
ML - WEEK 04
No ratings yet
ML - WEEK 04
33 pages
Summary Chap 1 & 2
No ratings yet
Summary Chap 1 & 2
5 pages
Lecture-11 - Feature Scaling
No ratings yet
Lecture-11 - Feature Scaling
26 pages
1737527078055
No ratings yet
1737527078055
111 pages
Normalization and Standardization: Methods To Preprocess Data To Have Consistent Scales and Distributions
No ratings yet
Normalization and Standardization: Methods To Preprocess Data To Have Consistent Scales and Distributions
10 pages
Normalization: Normalization Techniques at A Glance
No ratings yet
Normalization: Normalization Techniques at A Glance
5 pages
Lec7 (1)
No ratings yet
Lec7 (1)
9 pages
04_data-normalization-in-python.en
No ratings yet
04_data-normalization-in-python.en
1 page
Lecture 7 Data Transformation and Dimensionality Reduction
No ratings yet
Lecture 7 Data Transformation and Dimensionality Reduction
22 pages
Data Normalization and Standardization
No ratings yet
Data Normalization and Standardization
6 pages
Feature Scaling Techniques: Machine Learning
No ratings yet
Feature Scaling Techniques: Machine Learning
27 pages
3_AML _Lecture 3_Feature Engg
No ratings yet
3_AML _Lecture 3_Feature Engg
39 pages
Standardisation vs Normalisation
No ratings yet
Standardisation vs Normalisation
6 pages
3 1 Chapter 3 Normalization
No ratings yet
3 1 Chapter 3 Normalization
22 pages
ML Normalization Techniques_ Overview & Practical Guide
No ratings yet
ML Normalization Techniques_ Overview & Practical Guide
5 pages
Session 7 Feature Selection & Dimensionality Reduction
No ratings yet
Session 7 Feature Selection & Dimensionality Reduction
20 pages
CH1
No ratings yet
CH1
64 pages
Data Normalization in Data Mining
No ratings yet
Data Normalization in Data Mining
8 pages
5.Feauture Engineering
No ratings yet
5.Feauture Engineering
34 pages
Well Posed Learning Problem
100% (1)
Well Posed Learning Problem
4 pages
Data Normalizationand Standardization ATechnical Report
No ratings yet
Data Normalizationand Standardization ATechnical Report
6 pages
Feature Scaling
No ratings yet
Feature Scaling
13 pages
5 Data Preprocessing III Editted Notes
No ratings yet
5 Data Preprocessing III Editted Notes
17 pages
Lecture 10 -Data Transformation-M
No ratings yet
Lecture 10 -Data Transformation-M
8 pages
Example Data mining
No ratings yet
Example Data mining
4 pages
ML Lecture # 04 Multiple Regression
No ratings yet
ML Lecture # 04 Multiple Regression
29 pages
Unit 3-2
No ratings yet
Unit 3-2
15 pages
Unit 4
No ratings yet
Unit 4
33 pages
Feature Engineering: Getting The Most Out of Data For Predictive Models
No ratings yet
Feature Engineering: Getting The Most Out of Data For Predictive Models
75 pages
Standar Ization
No ratings yet
Standar Ization
7 pages
Feature Engineering
No ratings yet
Feature Engineering
18 pages
Normalization A Preprocessing Stage
No ratings yet
Normalization A Preprocessing Stage
5 pages
Preprocessing
No ratings yet
Preprocessing
5 pages
Data Normalization
No ratings yet
Data Normalization
7 pages
Data Normalization and Standardization - Google Docs
No ratings yet
Data Normalization and Standardization - Google Docs
6 pages
Feature Engineering PDF
100% (1)
Feature Engineering PDF
75 pages
FeatureEngineering (1)
No ratings yet
FeatureEngineering (1)
50 pages
Machine Learning - Lec4 - 5
No ratings yet
Machine Learning - Lec4 - 5
41 pages
Explore Feature Engineering
No ratings yet
Explore Feature Engineering
10 pages
Chapter 6: Data Preprocessing, Parameter Selection, and Inductive Conformal Prediction
No ratings yet
Chapter 6: Data Preprocessing, Parameter Selection, and Inductive Conformal Prediction
56 pages
Week 10
No ratings yet
Week 10
50 pages
Lecture 1.3
No ratings yet
Lecture 1.3
11 pages
Summery of Feature Eng
No ratings yet
Summery of Feature Eng
4 pages
PPA Data Preparation
No ratings yet
PPA Data Preparation
31 pages
Lec3 4 ML Project
No ratings yet
Lec3 4 ML Project
26 pages
Lecture01 &02 (1)
No ratings yet
Lecture01 &02 (1)
77 pages
CH 2
No ratings yet
CH 2
121 pages
Ordered Weighted Averaging Aggregation Operator: Fundamentals and Applications
From Everand
Ordered Weighted Averaging Aggregation Operator: Fundamentals and Applications
Fouad Sabry
No ratings yet
Pro Series Datasheet 0
No ratings yet
Pro Series Datasheet 0
9 pages
Lumprem 2
No ratings yet
Lumprem 2
15 pages
4_Gradient Descent and Stochastic GD
No ratings yet
4_Gradient Descent and Stochastic GD
37 pages
AI for beginners
No ratings yet
AI for beginners
105 pages
9626_s24_qp_11[1]
No ratings yet
9626_s24_qp_11[1]
12 pages
Deadlocks Concept 2
No ratings yet
Deadlocks Concept 2
5 pages
Ewx 11831
No ratings yet
Ewx 11831
106 pages
Configuration Presentation
No ratings yet
Configuration Presentation
17 pages
SSC-2000 SSC - 1000: Features Specifications Features Specifications
No ratings yet
SSC-2000 SSC - 1000: Features Specifications Features Specifications
1 page
Brilliant - Chebyshev Polynomials
No ratings yet
Brilliant - Chebyshev Polynomials
8 pages
COSC 101- Computer Networks and the Internet
No ratings yet
COSC 101- Computer Networks and the Internet
39 pages
Network Operations Center: Best Practices
No ratings yet
Network Operations Center: Best Practices
22 pages
Nosql Databases Unit-1
No ratings yet
Nosql Databases Unit-1
16 pages
2021COAZ4108
No ratings yet
2021COAZ4108
228 pages
ME8097 Notes
No ratings yet
ME8097 Notes
101 pages
Swift Mailer
No ratings yet
Swift Mailer
65 pages
Platform 2
No ratings yet
Platform 2
5 pages
Tech Note - MXview License Activation Process - v1.2
No ratings yet
Tech Note - MXview License Activation Process - v1.2
16 pages
LCD HD44780U (LCD-II) PC 1601-F + PC 0802-A - Codes PDF
No ratings yet
LCD HD44780U (LCD-II) PC 1601-F + PC 0802-A - Codes PDF
1 page
Software Risk, Configuration Management
No ratings yet
Software Risk, Configuration Management
35 pages
P5 Mathematics CA1 Paper1
No ratings yet
P5 Mathematics CA1 Paper1
7 pages
A Smart E-Learning System For Social Networking June1
No ratings yet
A Smart E-Learning System For Social Networking June1
15 pages
Prasad CV Sap b1 Functional Consultant
No ratings yet
Prasad CV Sap b1 Functional Consultant
4 pages
Poweredge With DCW and MX Server: Isabelle Kispotta
No ratings yet
Poweredge With DCW and MX Server: Isabelle Kispotta
3 pages
Lab 6: User Management: Goals
No ratings yet
Lab 6: User Management: Goals
14 pages
Create A Hunger Games Tier List - TierMaker
No ratings yet
Create A Hunger Games Tier List - TierMaker
1 page
Direct Cache Mapping
No ratings yet
Direct Cache Mapping
23 pages
Multiplex
No ratings yet
Multiplex
17 pages

Lecture 2.3 Data Normalization

Uploaded by

Lecture 2.3 Data Normalization

Uploaded by

Note about Normalization

• It is important when the features have very different scales.

You might also like