Overview: Feature selection reduces the number of features in a dataset in order to improve machine learning models. It is needed when a dataset has too many features, which can reduce accuracy and require large training sets. Feature selection methods evaluate features individually and in combination in order to choose an optimal subset of informative features. Correlation-based ranking is commonly used but is not comprehensive, because it does not consider how features interact. More rigorous methods evaluate all possible feature subsets, which becomes computationally infeasible with many features. Forward selection grows the feature set incrementally to overcome this problem.


Feature Selection

These slides are compiled from different resources on the web.


Why

• Too many features:
  – Require large training databases
  – Increase training time
[Figure: "The accuracy of all test Web URLs when changing the number of top words for category file". Y-axis: Accuracy (74%-90%); x-axis: number of top words for category file (top 10, top 20, ...).]

David Corne, and Nick Taylor, Heriot-Watt University - [email protected]


These slides and related resources: http://www.macs.hw.ac.uk/~dwcorne/Teaching/dmml.html
From http://elpub.scix.net/data/works/att/02-28.content.pdf
Slide Credit: David Corne, and Nick Taylor
It is quite easy to find many more cases in papers where experiments show that accuracy goes down when more features are used.

Slide Credit: David Corne, and Nick Taylor


• Why does accuracy reduce with more features?
• How does it depend on the specific choice of features?
• What else changes if we use more features?
• So, how do we choose the right features?
Why accuracy reduces:
• Note: suppose the best feature set has 20 features. If you add another 5 features, the accuracy of the learned model will typically drop. But you still have the original 20 features! Why does this happen?
Noise / Spurious Correlations /
Explosion
• The additional features typically add noise. Machine
learning will pick up on spurious correlations, that might
be true in the training set, but not in the test set.
• For some ML methods, more features means more
parameters to learn (more NN weights, more decision tree
nodes, etc…) – the increased space of possibilities is more
difficult to search.
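To see the spurious-correlation effect concretely, here is a minimal sketch (assuming scikit-learn and NumPy are available, and using a synthetic dataset rather than any dataset from these slides): the same learner is trained twice, once on the informative features alone and once with a block of pure-noise features appended, and the held-out accuracy typically drops in the second case.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.RandomState(0)
X, y = make_classification(n_samples=300, n_features=20, n_informative=10,
                           n_redundant=0, random_state=0)
X_noisy = np.hstack([X, rng.normal(size=(X.shape[0], 200))])  # append 200 pure-noise columns

for name, data in [("20 original features", X), ("plus 200 noise features", X_noisy)]:
    X_tr, X_te, y_tr, y_te = train_test_split(data, y, test_size=0.5, random_state=1)
    acc = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr).score(X_te, y_te)
    # the tree can split on noise columns that happen to correlate with y in the
    # training half; those splits do not generalise to the test half
    print(name, "test accuracy:", round(acc, 2))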
Feature selection therefore helps by:
– removing irrelevant data
– increasing the predictive accuracy of learned models
– reducing the cost of the data
– improving learning efficiency, e.g. reducing storage requirements and computational cost
– reducing the complexity of the resulting model description, improving understanding of the data and the model
What to do?

• Feature Selection:
  – a process that chooses an optimal subset of the original features according to a certain criterion
• Dimensionality Reduction:
  – transforms the data from a high-dimensional space to a low-dimensional space
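A small sketch of the difference between the two options (scikit-learn assumed; the breast-cancer dataset is just a convenient stand-in): feature selection keeps 10 of the original 30 columns, while dimensionality reduction (here PCA) replaces them with 10 new columns that are linear combinations of all 30.

from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_breast_cancer(return_X_y=True)              # 569 examples, 30 features

X_selected = SelectKBest(score_func=f_classif, k=10).fit_transform(X, y)  # subset of the original columns
X_reduced = PCA(n_components=10).fit_transform(X)                         # 10 new, transformed columns

print(X.shape, X_selected.shape, X_reduced.shape)        # (569, 30) (569, 10) (569, 10)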
Feature Selection: What

You have some data, and you want to use it to


build a classifier, so that you can predict
something (e.g. likelihood of cancer)

Slide Credit: David Corne, and Nick Taylor


Feature Selection: What

You have some data, and you want to use it to


build a classifier, so that you can predict something
(e.g. likelihood of cancer)

The data has 10,000 fields (features)

Slide Credit: David Corne, and Nick Taylor


Feature Selection: What
You have some data, and you want to use it to
build a classifier, so that you can predict something
(e.g. likelihood of cancer)

The data has 10,000 fields (features)

you need to cut it down to 1,000 fields before


you try machine learning. Which 1,000?

Slide Credit: David Corne, and Nick Taylor


Feature Selection: What
You have some data, and you want to use it to
build a classifier, so that you can predict something
(e.g. likelihood of cancer)

The data has 10,000 fields (features)

you need to cut it down to 1,000 fields before


you try machine learning. Which 1,000?
The process of choosing the 1,000 fields to use is called
Feature Selection
Slide Credit: David Corne, and Nick Taylor
Datasets with many features

Gene expression datasets (~10,000 features)


http://www.ncbi.nlm.nih.gov/sites/entrez?db=gds

Proteomics data (~20,000 features)


http://www.ebi.ac.uk/pride/

Slide Credit: David Corne, and Nick Taylor


Feature selection methods

Slide Credit: David Corne, and Nick Taylor


Correlation-based feature ranking
It is indeed used often, by practitioners (who
perhaps don’t understand the issues
involved in FS)

It is actually fine for certain datasets.


It is not even considered in Dash & Liu’s
survey.
A made-up dataset
f1 f2 f3 f4 … class
0.4 0.6 0.4 0.6 1
0.2 0.4 1.6 -0.6 1
0.5 0.7 1.8 -0.8 1
0.7 0.8 0.2 0.9 2
0.9 0.8 1.8 -0.7 2
0.5 0.5 0.6 0.5 2
Correlated with the class
f1 f2 f3 f4 … class
0.4 0.6 0.4 0.6 1
0.2 0.4 1.6 -0.6 1
0.5 0.7 1.8 -0.8 1
0.7 0.8 0.2 0.9 2
0.9 0.8 1.8 -0.7 2
0.5 0.5 0.6 0.5 2
uncorrelated with the class /
seemingly random
f1 f2 f3 f4 … class
0.4 0.6 0.4 0.6 1
0.2 0.4 1.6 -0.6 1
0.5 0.7 1.8 -0.8 1
0.7 0.8 0.2 0.9 2
0.9 0.8 1.8 -0.7 2
0.5 0.5 0.6 0.5 2
David Corne, and Nick Taylor, Heriot-Watt University - [email protected]
These slides and related resources: http://www.macs.hw.ac.uk/~dwcorne/Teaching/dmml.html
Correlation-based FS reduces the dataset to this:
f1 f2 … class
0.4 0.6 1
0.2 0.4 1
0.5 0.7 1
0.7 0.8 2
0.9 0.8 2
0.5 0.5 2
But column 5 below (f3 + f4) is perfectly correlated with the class!
f1 f2 f3 f4 f3+f4 class
0.4 0.6 0.4 0.6 1 1
0.2 0.4 1.6 -0.6 1 1
0.5 0.7 1.8 -0.8 1 1
0.7 0.8 0.2 0.9 1.1 2
0.9 0.8 1.8 -0.7 1.1 2
0.5 0.5 0.6 0.5 1.1 2
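The point is easy to reproduce on the toy dataset above (a minimal sketch, assuming NumPy is available): ranked by correlation with the class, f1 and f2 come out on top and f3, f4 look weak on their own, yet the sum f3 + f4 is perfectly correlated.

import numpy as np

# the made-up dataset from the slides: columns f1..f4, last column is the class
data = np.array([
    [0.4, 0.6, 0.4,  0.6, 1],
    [0.2, 0.4, 1.6, -0.6, 1],
    [0.5, 0.7, 1.8, -0.8, 1],
    [0.7, 0.8, 0.2,  0.9, 2],
    [0.9, 0.8, 1.8, -0.7, 2],
    [0.5, 0.5, 0.6,  0.5, 2],
])
X, y = data[:, :4], data[:, 4]

for i in range(4):                      # correlation of each single feature with the class
    print(f"corr(f{i + 1}, class) = {np.corrcoef(X[:, i], y)[0, 1]:.2f}")
print(f"corr(f3 + f4, class) = {np.corrcoef(X[:, 2] + X[:, 3], y)[0, 1]:.2f}")  # prints 1.00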
Good FS Methods therefore:
• Need to consider how well features work
together
• As we have noted before, if you take 100 features that are each well correlated with the class, they may simply be strongly correlated with each other, and so provide no more information than just one of them
`Complete’ methods
Original dataset has N features
You want to use a subset of k features
A complete FS method means: try every
subset of k features, and choose the
best!
The number of subsets is N! / k!(N−k)!
What is this when N is 100 and k is 5?
David Corne, and Nick Taylor, Heriot-Watt University - [email protected]
These slides and related resources: http://www.macs.hw.ac.uk/~dwcorne/Teaching/dmml.html
`Complete’ methods
Original dataset has N features
You want to use a subset of k features
A complete FS method means: try
every subset of k features, and choose
the best!
The number of subsets is N! / k!(N−k)!
What is this when N is 100 and k is 5?
75,287,520 -- almost nothing
`Complete’ methods
Original dataset has N features
You want to use a subset of k features
A complete FS method means: try every
subset of k features, and choose the best!
The number of subsets is N! / k!(N−k)!
What is this when N is 10,000 and k is 100?

David Corne, and Nick Taylor, Heriot-Watt University - [email protected]


These slides and related resources: http://www.macs.hw.ac.uk/~dwcorne/Teaching/dmml.html
`Complete’ methods
Original dataset has N features
You want to use a subset of k features
A complete FS method means: try every
subset of k features, and choose the best!
The number of subsets is N! / k!(N−k)!
What is this when N is 10,000 and k is 100?

Actually it is about 6.5 × 10^241 -- roughly 10^242

(there are only around 10^80 atoms in the universe)
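Both counts above can be checked directly in Python (a small sketch; math.comb requires Python 3.8 or newer):

import math

print(math.comb(100, 5))                    # 75287520 -- still searchable
n_subsets = math.comb(10_000, 100)          # every way to pick 100 features out of 10,000
print(f"about {n_subsets:.1e} -- a {len(str(n_subsets))}-digit number")   # about 6.5e+241, 242 digits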
Feature Search Space
`forward’ methods

These methods `grow’ a set S of features:

1. S starts empty.
2. Find the best feature to add (by checking which one gives the best performance on a test set when combined with S).
3. If overall performance has improved, return to step 2; else stop.

Slide Credit: David Corne, and Nick Taylor


Forward selection illustrated
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

Selected feature set {}


Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

Selected feature set {}


Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

65%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

58%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

54%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

72%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

64%
Selected feature set {}
Etc

F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
65% 58% 54% 72% 64% 61% 62% 25% 49% ….

Selected feature set {}


Add the winning feature to the selected
feature set
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

65% 58% 54% 72% 64% 61% 62% 25% 49% ….


Selected feature set {F4}
We have completed one ‘round’ of forward
selection
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

65% 58% 54% 72% 64% 61% 62% 25% 49% ….


Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

Selected feature set {F4}


Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

61%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

59%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

58%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

Selected feature set {F4}


Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …

66%
Selected feature set {F4}
Etc

F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
61% 59% 58% (F4: already selected) 66% 68% 75% 47% 49% ….

Selected feature set {F4}


Add the winning feature to the selected
feature set
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
61% 59% 58% (F4: already selected) 66% 68% 75% 47% 49% ….

Selected feature set {F4, F7}


We have completed the second ‘round’ of
forward selection
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
61% 59% 58% (F4: already selected) 66% 68% 75% 47% 49% ….

Selected feature set {F4, F7}


Continue…
adding one feature after each round,
until overall accuracy starts to reduce
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc

2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
61% 59% 58% (F4: already selected) 66% 68% 75% 47% 49% ….

Selected feature set {F4, F7}
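The rounds illustrated above fit in a few lines of code. This is a minimal sketch: evaluate(subset) is a stand-in (an assumption, not something defined in these slides) for "train on the training set using only those features and return accuracy on the test set".

def forward_selection(all_features, evaluate):
    """Greedy forward selection: grow the set while accuracy keeps improving."""
    selected, best_score = [], 0.0
    while True:
        candidates = [f for f in all_features if f not in selected]
        if not candidates:
            break
        # one 'round': score every remaining feature combined with the current set
        scores = {f: evaluate(selected + [f]) for f in candidates}
        winner = max(scores, key=scores.get)
        if scores[winner] <= best_score:      # adding the winner no longer helps
            break
        selected.append(winner)               # e.g. F4 after round 1, F7 after round 2
        best_score = scores[winner]
    return selected, best_score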


`backward’ methods

These methods remove features one by one:

1. S starts with the full feature set.
2. Find the best feature to remove (by checking which removal from S gives the best performance on a test set).
3. If overall performance has improved, return to step 2; else stop.

Slide Credit: David Corne, and Nick Taylor
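Both greedy directions are also available off the shelf; for example, scikit-learn's SequentialFeatureSelector (version 0.24 or newer) implements them, although it stops at a requested number of features rather than when accuracy starts to drop. A sketch, with the dataset and learner chosen arbitrarily:

from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.neighbors import KNeighborsClassifier

X, y = load_breast_cancer(return_X_y=True)
sfs = SequentialFeatureSelector(KNeighborsClassifier(),
                                n_features_to_select=10,
                                direction="backward",      # or "forward"
                                cv=5)                      # subsets scored by cross-validation
sfs.fit(X, y)
print(sfs.get_support(indices=True))                       # indices of the 10 surviving features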


– Bidirectional Generation (BG): begins the search in both directions, performing SFG (sequential forward generation) and SBG (sequential backward generation) concurrently. The searches stop in two cases: (1) when one search finds the best subset of m features before it reaches the exact middle of the search space, or (2) when both searches reach the middle of the search space. It takes advantage of both SFG and SBG.

– Random Generation (RG): starts the search in a random direction. The choice of adding or removing a feature is a random decision. RG tries to avoid getting stuck in a local optimum by not following a fixed path for subset generation. Unlike SFG or SBG, the size of the resulting feature subset cannot be specified in advance.
Selection Criteria
– Information Measures.
  • Information measures the uncertainty of the receiver when he or she receives a message.
  • Shannon's entropy: H(C) = − Σ_c p(c) log2 p(c)
  • Information gain of a feature F: IG(C, F) = H(C) − H(C | F)
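A small sketch of the two quantities (standard definitions, computed from scratch so nothing beyond the Python standard library is assumed):

from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy H(C) of a list of class labels, in bits."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(feature_values, labels):
    """IG(C, F) = H(C) - H(C | F) for a discrete feature."""
    n = len(labels)
    h_cond = 0.0
    for v in set(feature_values):
        subset = [c for f, c in zip(feature_values, labels) if f == v]
        h_cond += len(subset) / n * entropy(subset)     # weighted H(C | F = v)
    return entropy(labels) - h_cond

print(entropy([1, 1, 1, 2, 2, 2]))                                            # 1.0 bit
print(information_gain(["a", "a", "a", "b", "b", "b"], [1, 1, 1, 2, 2, 2]))   # 1.0 (F determines C)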
Selection Criteria
– Distance Measures.
  • Measures of separability, discrimination or divergence. The most typical ones are derived from the distance between the class-conditional density functions.
Selection Criteria
– Dependence Measures.
  • Also known as measures of association or correlation.
  • Their main goal is to quantify how strongly two variables are correlated or associated with each other, such that knowing the value of one of them we can predict the value of the other.
  • Pearson correlation coefficient: r = Σ_i (x_i − x̄)(y_i − ȳ) / sqrt( Σ_i (x_i − x̄)² · Σ_i (y_i − ȳ)² )
Selection Criteria
– Consistency Measures.
  • They attempt to find a minimum number of features that separate the classes as well as the full set of features can.
  • They aim to achieve P(C | FullSet) = P(C | SubSet).
  • An inconsistency is defined as two examples with the same inputs (the same feature values) but different output values (different classes, in classification).
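A minimal sketch of this criterion (pandas assumed; the tiny DataFrame is made up for illustration): for a candidate feature subset, group examples with identical feature values and count how many fall outside their group's majority class.

import pandas as pd

def inconsistency_count(df, feature_subset, class_col="class"):
    groups = df.groupby(feature_subset)[class_col]
    majority = groups.agg(lambda s: s.value_counts().max())    # size of the majority class per group
    return int((groups.size() - majority).sum())               # rows that disagree with their group

df = pd.DataFrame({"f1": [0, 0, 1, 1], "f2": [1, 1, 0, 0], "class": [1, 2, 2, 2]})
print(inconsistency_count(df, ["f1", "f2"]))    # 1: the first two rows share inputs but differ in class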
Selection Criteria
– Accuracy Measures: this form of evaluation relies on the classifier or learner itself. Among the various possible subsets of features, the subset that yields the best predictive accuracy is chosen.
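A sketch of this wrapper-style criterion (scikit-learn assumed; the two column subsets are arbitrary examples): candidate subsets are compared by the cross-validated accuracy of the learner that will actually be used.

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
clf = make_pipeline(StandardScaler(), LogisticRegression())

for name, cols in [("subset A (columns 0-3)", [0, 1, 2, 3]),
                   ("subset B (columns 20-23)", [20, 21, 22, 23])]:
    acc = cross_val_score(clf, X[:, cols], y, cv=5).mean()   # mean accuracy over 5 folds
    print(name, "->", round(acc, 3))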
