
Feature Selection

In this chapter the focus will be on an important component of dataset preparation for data science: feature selection. An overused rubric in data science circles is that 80% of the analysis effort is spent on data cleaning and preparation and only 20% is typically spent on modeling. In light of this it may seem strange that this book has devoted more than a dozen chapters to modeling techniques and only a couple to data preparation! However, data cleansing and preparation are things that are better learned through experience and not so much from a book. That said, it is essential to be conversant with the many techniques that are available for these important early process steps. In this chapter the focus will not be on data cleaning, as it was partially covered in Chapter 2, Data Science Process, but rather on reducing a dataset to its essential characteristics or features. This process is known by various terms: feature selection, dimension reduction, variable screening, key parameter identification, attribute weighting, or regularization. Regularization was briefly covered in Chapter 5 as applied to multiple linear regression. There it was introduced as a process that helps to reduce overfitting, which is essentially what feature selection techniques implicitly achieve. [Technically, there is a subtle difference between dimension reduction and feature selection. Dimension reduction methods, such as principal component analysis (PCA) discussed in Section 14.2, combine or merge actual attributes in order to reduce the number of attributes of a raw dataset. Feature selection methods work more like filters that eliminate some attributes.]
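To make that distinction concrete, here is a minimal sketch, assuming Python with NumPy and scikit-learn and a small synthetic dataset (none of which the chapter itself prescribes). PCA merges five raw attributes into two new components, each a blend of every original column, whereas a selection step simply keeps a subset of the original columns untouched.

```python
# A minimal sketch contrasting dimension reduction with feature selection.
# scikit-learn and the synthetic data are illustrative assumptions.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))          # 100 examples, 5 raw attributes

# Dimension reduction: PCA *merges* the 5 attributes into 2 new
# components, each a linear combination of all original columns.
X_pca = PCA(n_components=2).fit_transform(X)

# Feature selection: a filter simply *keeps* a subset of the original
# columns unchanged (here columns 0 and 3, chosen arbitrarily).
X_selected = X[:, [0, 3]]

print(X_pca.shape, X_selected.shape)   # (100, 2) (100, 2)
```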

First a brief introduction to feature selection along with the need for this preprocessing step is given. There are fundamentally two types of feature selection processes: filter type and wrapper type. Filter approaches work by selecting only those attributes that rank among the top in meeting certain stated criteria (Blum & Langley, 1997; Yu & Liu, 2003). Wrapper approaches work by iteratively selecting, via a feedback loop, only those attributes that improve the performance of an algorithm (Kohavi & John, 1997).
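Both styles can be illustrated with a short sketch, again assuming scikit-learn; the scoring criterion (an F-statistic), the model (linear regression), and the choice of k = 3 are arbitrary placeholders rather than anything the chapter mandates.

```python
# A minimal sketch of the two selection styles, assuming scikit-learn.
from sklearn.datasets import make_regression
from sklearn.feature_selection import (SelectKBest, f_regression,
                                       SequentialFeatureSelector)
from sklearn.linear_model import LinearRegression

X, y = make_regression(n_samples=200, n_features=10,
                       n_informative=3, random_state=0)

# Filter type: rank every attribute on a stated criterion (here an
# F-statistic) and keep the top k; no learning algorithm is consulted.
filt = SelectKBest(score_func=f_regression, k=3).fit(X, y)
print("filter keeps:", filt.get_support(indices=True))

# Wrapper type: grow the attribute set one feature at a time, keeping a
# candidate only if it improves the model's cross-validated score.
wrap = SequentialFeatureSelector(LinearRegression(),
                                 n_features_to_select=3,
                                 direction="forward").fit(X, y)
print("wrapper keeps:", wrap.get_support(indices=True))
```

Note the design difference: the filter scores each attribute once and never consults a model, while the wrapper repeatedly refits the model inside its feedback loop, which ties the selection more closely to the final task but is far more expensive.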

Among the filter-type methods, one can further classify based on the data types: numeric versus nominal. The most common wrapper-type methods are the ones associated with multiple regression: stepwise regression, forward selection, and backward elimination. A few numeric filter-type methods will be explored: PCA, which is strictly speaking a dimension reduction method; information gain-based filtering; and one categorical filter-type method: chi-square-based filtering.
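As a preview of the categorical case, a chi-square filter scores each attribute by how strongly its values depend on the class label and keeps the highest-scoring ones. A minimal sketch, assuming scikit-learn's chi2 scorer and the iris data as a stand-in (chi2 expects non-negative, count-like values, which the iris measurements satisfy):

```python
# A minimal sketch of chi-square-based filtering, assuming scikit-learn.
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, chi2

X, y = load_iris(return_X_y=True)

# Score each attribute against the class label and keep the top 2.
selector = SelectKBest(score_func=chi2, k=2).fit(X, y)
print("chi2 scores: ", selector.scores_)
print("kept columns:", selector.get_support(indices=True))
```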
