Features Election
Features Election
90%
88%
86%
Accuracy
84%
82%
80%
78%
76%
74%
10
20
30
40
50
60
70
80
90
0
10
11
12
13
14
15
16
17
18
19
20
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
top
Number of top words for category file
• Feature Selection:
– Feature Selection is a process that
chooses an optimal subset of features
according to a certain criterion
• Dimensionality Reduction:
– Transforms data from a high dimensional
space to a low dimensional space
Feature Selection: What
65%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
58%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
54%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
72%
Selected feature set {}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
64%
Selected feature set {}
Etc
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
65% 58% 54% 72% 64% 61% 62% 25% 49% ….
61%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
59%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
58%
Selected feature set {F4}
Test each feature in turn to find out which
works best with current feature set …
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
66%
Selected feature set {F4}
Etc
F1 F2 F3 F4 F5 F6 F7 F8 F9 Etc
…
2 5 65 67 2 2 12 2 234 …
1 2 4 5 13 1 1 43 12 …
4 3 43 2 4 6 2 2 1 …
5 4 2 3 5 5 13 1 2 …
3 5 1 4 7 3 4 6 13 …
2 2 6 5 7 1 5 4 4 …
1 3 4 4 55 4 7 55 43 …
61% 59% 58% 66% 68% 75% 47% 49% ….
• Information gain:
Selection Criteria
Distance Measures.
Measures of separability, discrimination or
divergence measures . The most typical is
derived from distance between the class
conditional density functions.
Selection Criteria
Dependence Measures.
• known as measures of association or correlation.
• Its main goal is to quantify how strongly two
variables are correlated or present some
association with each other, in such way that
knowing the value of one of them, we can derive
the value for the other.
• Pearson correlation coefficient:
Selection Criteria
– Consistency Measures.
• They attempt to find a minimum number of
features that separate classes as the full set of
features can.