Feature Selection
Reference:
https://machinelearningmastery.com/feature-selection-machine-learning-python/
https://www.datacamp.com/tutorial/feature-selection-python
https://towardsdatascience.com/feature-selection-methods-and-how-to-choose-them-1e7469100e7e
https://www.youtube.com/watch?v=za1aA9U4kbI
Feature selection is the process of automatically selecting the features that are most important to the problem. It is also known as variable selection or attribute selection.
Unsupervised methods
Unsupervised feature selection methods do not require any labels; they never look at the target variable. Typical techniques include (a sketch follows the list):
Discarding almost-constant (near zero variance) variables.
Dropping incomplete features (too many missing values).
Dropping highly multicollinear variables.
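Below is a minimal sketch of these three unsupervised steps using pandas and scikit-learn; the toy DataFrame and the thresholds (50% missing, variance 0.1, correlation 0.95) are illustrative assumptions, not fixed rules.

```python
# Unsupervised feature selection sketch: no target variable is used.
import pandas as pd
from sklearn.feature_selection import VarianceThreshold

df = pd.DataFrame({
    "almost_constant": [3, 3, 3, 3, 3],
    "mostly_missing":  [None, None, None, 4.0, 5.0],
    "x1":              [1.0, 2.0, 3.0, 4.0, 5.0],
    "x2":              [2.1, 4.0, 6.2, 7.9, 10.1],  # ~collinear with x1
})

# 1) Drop incomplete features (more than ~50% missing values).
df = df.loc[:, df.isna().mean() < 0.5]

# 2) Discard almost-constant features via a variance threshold.
selector = VarianceThreshold(threshold=0.1)
selector.fit(df.fillna(df.mean()))
df = df.loc[:, selector.get_support()]

# 3) Drop one of each pair of highly correlated (multicollinear) features.
corr = df.corr().abs()
to_drop = [c for i, c in enumerate(corr.columns)
           if any(corr.iloc[:i][c] > 0.95)]
df = df.drop(columns=to_drop)

print(df.columns.tolist())
```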
Supervised methods
Wrapper methods
![[Feature Selection-4.png]]
Wrapper methods use a model to evaluate the performance of different feature subsets, and the best-performing subset is selected. Because the subset is tuned to one model, there is a risk of overfitting, so it is recommended to validate the selected subset with another model. Another disadvantage is the large computational cost. Popular wrapper methods are:
Backward Selection
In backward selection, we start with the full model containing all features. In each iteration, the feature that contributes least to the performance is removed. The process is repeated until the desired number of features remains.
Forward Selection
Forward selection starts with a null (empty) model, and features are added one by one, each time choosing the feature that most improves the performance of the model. Both directions are sketched below.
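A minimal sketch of both directions using scikit-learn's SequentialFeatureSelector; the breast-cancer dataset, the logistic-regression estimator and n_features_to_select=5 are assumptions for illustration.

```python
# Forward and backward selection with scikit-learn's SequentialFeatureSelector.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X = StandardScaler().fit_transform(X)  # scaled for quick convergence; in practice put scaling inside a Pipeline
model = LogisticRegression(max_iter=5000)

# Forward selection: start from an empty set and add features one by one.
forward = SequentialFeatureSelector(model, n_features_to_select=5,
                                    direction="forward", cv=5)
forward.fit(X, y)

# Backward selection: start from all features and remove one per iteration.
backward = SequentialFeatureSelector(model, n_features_to_select=5,
                                     direction="backward", cv=5)
backward.fit(X, y)

print("forward: ", forward.get_support(indices=True))
print("backward:", backward.get_support(indices=True))
```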
Recursive Feature Elimination
Recursive Feature Elimination (RFE) is similar to backward selection; the difference is how the features to discard are chosen. RFE uses the model's own importance scores to decide which features to drop: the coefficients (weights) in linear models, the impurity decrease in tree-based models, etc.
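A minimal RFE sketch with scikit-learn follows; the random-forest estimator and the number of features kept are assumptions.

```python
# Recursive Feature Elimination: repeatedly fit the model and drop the least
# important feature(s) according to the estimator's own importances
# (impurity decrease for the random forest used here).
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE

X, y = load_breast_cancer(return_X_y=True)

rfe = RFE(estimator=RandomForestClassifier(n_estimators=100, random_state=0),
          n_features_to_select=5, step=1)
rfe.fit(X, y)

print(rfe.get_support(indices=True))
```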
Filter Methods
![[Feature Selection-3.png]]
In filter methods, the statistical relationship of each feature with the target variable is analysed using measures like correlation or mutual information. This is simpler, faster and more model-agnostic than wrapper methods, and less prone to overfitting. The major drawback is that each feature is scored in isolation, so a feature that is a weak predictor on its own is discarded even if it would be useful in combination with other features.
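A minimal filter-method sketch, assuming the breast-cancer dataset, mutual information as the scoring measure and k=10 as the number of features to keep.

```python
# Filter method: score each feature against the target with mutual information
# and keep the top k, without fitting any predictive model.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, mutual_info_classif

X, y = load_breast_cancer(return_X_y=True)

selector = SelectKBest(score_func=mutual_info_classif, k=10)
X_selected = selector.fit_transform(X, y)

print(X_selected.shape)           # (n_samples, 10)
print(selector.scores_.round(3))  # per-feature mutual information with y
```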
Embedded Methods
The idea of embedded methods is to combine the benefits of filter methods and wrapper methods: the selection happens as part of model training, so it is faster than wrapper methods while still searching for a good subset. There are not many dedicated embedded algorithms available. One example is LASSO regression, where the L1 penalty gradually shrinks feature weights towards zero; features whose weights reach zero are removed, while the features with non-zero weights remain. LASSO-based selection is used in domains such as computer vision.
Implementation
![[Feature Selection-6.png]]
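As one possible implementation (not necessarily the one shown in the screenshot above), here is a hedged end-to-end sketch that combines a filter step with a classifier inside a scikit-learn Pipeline; the dataset, k=10 and the logistic-regression estimator are assumptions.

```python
# End-to-end sketch: feature selection inside a Pipeline so that the selection
# is refit on each cross-validation fold and does not leak information.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("select", SelectKBest(score_func=f_classif, k=10)),
    ("clf", LogisticRegression(max_iter=1000)),
])

scores = cross_val_score(pipe, X, y, cv=5)
print("mean CV accuracy with 10 selected features:", scores.mean().round(3))
```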