0% found this document useful (0 votes)

1 views

Hands on Practical Examples On Sequential Feature Selection in Python

The document provides practical examples of Sequential Feature Selection (SFS) using Python, specifically with the Iris dataset and the KNeighborsClassifier from scikit-learn. It demonstrates how to implement SFS, SBS, SFFS, and SBFS, showcasing the selection of the best features and their corresponding scores, as well as visualizing results in DataFrames. The examples highlight the effectiveness of different selection methods and the importance of cross-validation in feature selection.

Uploaded by

paulrajarshi7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1 views

Hands on Practical Examples On Sequential Feature Selection in Python

Uploaded by

paulrajarshi7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Hands on Practical Examples On Sequential Feature

Selection in Python
Example 1 - A simple Sequential Forward Selection example
Initializing a simple classifier from scikit-learn:

Code:

from sklearn.neighbors import KNeighborsClassifier

from sklearn.datasets import load_iris

iris = load_iris()
X = iris.data
y = iris.target
knn = KNeighborsClassifier(n_neighbors=4)

We start by selection the "best" 3 features from the Iris dataset via Sequential Forward
Selection (SFS). Here, we set forward=True and floating=False. By choosing cv=0, we don't
perform any cross-validation, therefore, the performance (here: 'accuracy') is computed
entirely on the training set.

Code:

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

sfs1 = SFS(knn,
k_features=3,
forward=True,
floating=False,
verbose=2,
scoring='accuracy',
cv=0)

sfs1 = sfs1.fit(X, y)

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 4 out of 4 | elapsed: 0.0s finished

[2022-09-06 20:51:22] Features: 1/3 -- score: 0.96[Parallel(n_jobs=1)]:

Using backend SequentialBackend with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 3 out of 3 | elapsed: 0.0s finished

[2022-09-06 20:51:22] Features: 2/3 -- score:

0.9733333333333334[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 2 out of 2 | elapsed: 0.0s finished

[2022-09-06 20:51:22] Features: 3/3 -- score: 0.9733333333333334

Via the subsets_ attribute, we can take a look at the selected feature indices at each step:

Code:

sfs1.subsets_

Output:

{1: {'feature_idx': (3,),

'cv_scores': array([0.96]),
'avg_score': 0.96,
'feature_names': ('3',)},
2: {'feature_idx': (2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('2', '3')},
3: {'feature_idx': (1, 2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('1', '2', '3')}}

Note that the 'feature_names' entry is simply a string representation of the 'feature_idx' in this
case. Optionally, we can provide custom feature names via the fit method's
custom_feature_names parameter:

Code:

feature_names = ('sepal length', 'sepal width', 'petal length', 'petal

width')
sfs1 = sfs1.fit(X, y, custom_feature_names=feature_names)
sfs1.subsets_

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 4 out of 4 | elapsed: 0.0s finished

[2022-09-06 20:51:22] Features: 1/3 -- score: 0.96[Parallel(n_jobs=1)]:

Using backend SequentialBackend with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 3 out of 3 | elapsed: 0.0s finished

[2022-09-06 20:51:22] Features: 2/3 -- score:

{1: {'feature_idx': (3,),

'cv_scores': array([0.96]),
'avg_score': 0.96,
'feature_names': ('petal width',)},
2: {'feature_idx': (2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('petal length', 'petal width')},
3: {'feature_idx': (1, 2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('sepal width', 'petal length', 'petal width')}}

Furthermore, we can access the indices of the 3 best features directly via the k_feature_idx_
attribute:

Code:

sfs1.k_feature_idx_

Output:

(1, 2, 3)

And similarly, to obtain the names of these features, given that we provided an argument to
the custom_feature_names parameter, we can refer to the
sfs1.k_feature_names_ attribute:

Code:

sfs1.k_feature_names_

Output:

('sepal width', 'petal length', 'petal width')

Finally, the prediction score for these 3 features can be accesses via k_score_:

Code:

sfs1.k_score_

Output:

0.9733333333333334
Example 2 - Toggling between SFS, SBS, SFFS, and SBFS
Using the forward and floating parameters, we can toggle between SFS, SBS, SFFS, and
SBFS as shown below. Note that we are performing (stratified) 4-fold cross-validation for
more robust estimates in contrast to Example 1. Via n_jobs=-1, we choose to run the cross-
validation on all our available CPU cores.

Code:

# Sequential Forward Selection

sfs = SFS(knn,
k_features=3,
forward=True,
floating=False,
scoring='accuracy',
cv=4,
n_jobs=-1)
sfs = sfs.fit(X, y)

print('\nSequential Forward Selection (k=3):')

print(sfs.k_feature_idx_)
print('CV Score:')
print(sfs.k_score_)

###################################################

# Sequential Backward Selection

sbs = SFS(knn,
k_features=3,
forward=False,
floating=False,
scoring='accuracy',
cv=4,
n_jobs=-1)
sbs = sbs.fit(X, y)

print('\nSequential Backward Selection (k=3):')

print(sbs.k_feature_idx_)
print('CV Score:')
print(sbs.k_score_)

###################################################

# Sequential Forward Floating Selection

sffs = SFS(knn,
k_features=3,
forward=True,
floating=True,
scoring='accuracy',
cv=4,
n_jobs=-1)
sffs = sffs.fit(X, y)

print('\nSequential Forward Floating Selection (k=3):')

print(sffs.k_feature_idx_)
print('CV Score:')
print(sffs.k_score_)
###################################################

# Sequential Backward Floating Selection

sbfs = SFS(knn,
k_features=3,
forward=False,
floating=True,
scoring='accuracy',
cv=4,
n_jobs=-1)
sbfs = sbfs.fit(X, y)

print('\nSequential Backward Floating Selection (k=3):')

print(sbfs.k_feature_idx_)
print('CV Score:')
print(sbfs.k_score_)

Output:

/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
/Users/sebastianraschka/miniforge3/lib/python3.9/site-
packages/scipy/__init__.py:146: UserWarning: A NumPy version >=1.16.5 and
<1.23.0 is required for this version of SciPy (detected version 1.23.1
warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"

Sequential Forward Selection (k=3):

(1, 2, 3)
CV Score:
0.9731507823613088

Sequential Backward Selection (k=3):

(1, 2, 3)
CV Score:
0.9731507823613088
Sequential Forward Floating Selection (k=3):
(1, 2, 3)
CV Score:
0.9731507823613088

Sequential Backward Floating Selection (k=3):

(1, 2, 3)
CV Score:
0.9731507823613088

In this simple scenario, selecting the best 3 features out of the 4 available features in the Iris
set, we end up with similar results regardless of which sequential selection algorithms we
used.

Example 3 - Visualizing the results in DataFrames

For our convenience, we can visualize the output from the feature selection in a pandas
DataFrame format using the get_metric_dict method of the SequentialFeatureSelector
object. The columns std_dev and std_err represent the standard deviation and standard
errors of the cross-validation scores, respectively.

Below, we see the DataFrame of the Sequential Forward Selector from Example 2:

Code:

import pandas as pd
pd.DataFrame.from_dict(sfs.get_metric_dict()).T

Output:

feature_id avg_scor feature_name ci_boun

cv_scores std_dev std_err
x e s d
[0.973684210526315
8, 0.04831 0.03014 0.01740
1 (3,) 0.959993 (3,)
0.9473684210526315 9 3 3
, 0.918...
[0.973684210526315 0.04831 0.03014 0.01740
2 (2, 3) 0.959993 (2, 3)
8, 9 3 3
feature_id avg_scor feature_name ci_boun
cv_scores std_dev std_err
x e s d
0.9473684210526315
, 0.918...
[0.973684210526315
8, 1.0, 0.03063 0.01911 0.01103
3 (1, 2, 3) 0.973151 (1, 2, 3)
0.9459459459459459 9 3 5
, ...

Now, let's compare it to the Sequential Backward Selector:

Code:

pd.DataFrame.from_dict(sbs.get_metric_dict()).T

Output:

feature_id avg_scor feature_name ci_boun

cv_scores std_dev std_err
x e s d
[0.973684210526315
8, 0.02247 0.01297
4 (0, 1, 2, 3) 0.953236 (0, 1, 2, 3) 0.03602
0.9473684210526315 1 4
, 0.918...
[0.973684210526315
8, 1.0, 0.01911 0.01103
3 (1, 2, 3) 0.973151 (1, 2, 3) 0.030639
0.9459459459459459 3 5
, ...

We can see that both SFS and SBFS found the same "best" 3 features, however, the
intermediate steps where obviously different.

The ci_bound column in the DataFrames above represents the confidence interval around the
computed cross-validation scores. By default, a confidence interval of 95% is used, but we
can use different confidence bounds via the confidence_interval parameter. E.g., the
confidence bounds for a 90% confidence interval can be obtained as follows:

Code:

pd.DataFrame.from_dict(sbs.get_metric_dict(confidence_interval=0.90)).T

Output:

feature_id avg_scor feature_name ci_boun

cv_scores std_dev std_err
x e s d
[0.973684210526315
0.02247 0.01297
4 (0, 1, 2, 3) 8, 0.953236 (0, 1, 2, 3) 0.027658
1 4
0.9473684210526315
feature_id avg_scor feature_name ci_boun
cv_scores std_dev std_err
x e s d
, 0.918...
[0.973684210526315
8, 1.0, 0.01911 0.01103
3 (1, 2, 3) 0.973151 (1, 2, 3) 0.023525
0.9459459459459459 3 5
, ...

Example 4 - Plotting the results

After importing the little helper function plotting.plot_sequential_feature_selection
, we can also visualize the results using matplotlib figures.

Code:

from mlxtend.plotting import plot_sequential_feature_selection as plot_sfs

import matplotlib.pyplot as plt

sfs = SFS(knn,
k_features=4,
forward=True,
floating=False,
scoring='accuracy',
verbose=2,
cv=5)

sfs = sfs.fit(X, y)

fig1 = plot_sfs(sfs.get_metric_dict(), kind='std_dev')

plt.ylim([0.8, 1])
plt.title('Sequential Forward Selection (w. StdDev)')
plt.grid()
plt.show()

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 4 out of 4 | elapsed: 0.0s finished

[2022-09-06 20:51:24] Features: 1/4 -- score: 0.96[Parallel(n_jobs=1)]:

Using backend SequentialBackend with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 3 out of 3 | elapsed: 0.0s finished

[2022-09-06 20:51:24] Features: 2/4 -- score:

0.9666666666666668[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 2 out of 2 | elapsed: 0.0s finished

[2022-09-06 20:51:24] Features: 3/4 -- score:

0.9533333333333334[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s finished

[2022-09-06 20:51:24] Features: 4/4 -- score: 0.9733333333333334

Output:

Example 5 - Sequential Feature Selection for Regression

Similar to the classification examples above, the SequentialFeatureSelector also supports
scikit-learn's estimators for regression.

Code:

from sklearn.linear_model import LinearRegression

from sklearn.datasets import fetch_california_housing

data = fetch_california_housing()
X, y = data.data, data.target

lr = LinearRegression()

sfs = SFS(lr,
k_features=8,
forward=True,
floating=False,
scoring='neg_mean_squared_error',
cv=10)
sfs = sfs.fit(X, y)
fig = plot_sfs(sfs.get_metric_dict(), kind='std_err')

plt.title('Sequential Forward Selection (w. StdErr)')

plt.grid()
plt.show()

Example 6 -- Feature Selection with Fixed Train/Validation

Splits
If you do not wish to use cross-validation (here: k-fold cross-validation, i.e., rotating training
and validation folds), you can use the PredefinedHoldoutSplit class to specify your own,
fixed training and validation split.

Code:

from sklearn.datasets import load_iris

from mlxtend.evaluate import PredefinedHoldoutSplit
import numpy as np

iris = load_iris()
X = iris.data
y = iris.target

rng = np.random.RandomState(123)
my_validation_indices = rng.permutation(np.arange(150))[:30]
print(my_validation_indices)

Output:

[ 72 112 132 88 37 138 87 42 8 90 141 33 59 116 135 104 36 13

63 45 28 133 24 127 46 20 31 121 117 4]
Code:

from sklearn.neighbors import KNeighborsClassifier

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

knn = KNeighborsClassifier(n_neighbors=4)
piter = PredefinedHoldoutSplit(my_validation_indices)

sfs1 = SFS(knn,
k_features=3,
forward=True,
floating=False,
verbose=2,
scoring='accuracy',
cv=piter)

sfs1 = sfs1.fit(X, y)

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 4 out of 4 | elapsed: 0.0s finished

[2022-09-06 20:51:25] Features: 1/3 -- score:

0.9666666666666667[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 3 out of 3 | elapsed: 0.0s finished

[2022-09-06 20:51:25] Features: 2/3 -- score:

[2022-09-06 20:51:25] Features: 3/3 -- score: 0.9666666666666667

Example 7 -- Using the Selected Feature Subset For Making

New Predictions
Code:

# Initialize the dataset

from sklearn.neighbors import KNeighborsClassifier

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
iris = load_iris()
X, y = iris.data, iris.target
X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=0.33, random_state=1)

knn = KNeighborsClassifier(n_neighbors=4)

Code:

# Select the "best" three features via

# 5-fold cross-validation on the training set.

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

sfs1 = SFS(knn,
k_features=3,
forward=True,
floating=False,
scoring='accuracy',
cv=5)
sfs1 = sfs1.fit(X_train, y_train)
print('Selected features:', sfs1.k_feature_idx_)

Output:

Selected features: (1, 2, 3)

Code:

# Generate the new subsets based on the selected features

# Note that the transform call is equivalent to
# X_train[:, sfs1.k_feature_idx_]

X_train_sfs = sfs1.transform(X_train)
X_test_sfs = sfs1.transform(X_test)

# Fit the estimator using the new feature subset

# and make a prediction on the test data
knn.fit(X_train_sfs, y_train)
y_pred = knn.predict(X_test_sfs)

# Compute the accuracy of the prediction

acc = float((y_test == y_pred).sum()) / y_pred.shape[0]
print('Test set accuracy: %.2f %%' % (acc * 100))

Output:

Test set accuracy: 96.00 %

Example 8 -- Sequential Feature Selection and GridSearch
In the following example, we are tuning the SFS's estimator using GridSearch. To avoid
unwanted behavior or side-effects, it's recommended to use the estimator inside and outside
of SFS as separate instances.

Code:

# Initialize the dataset

from sklearn.neighbors import KNeighborsClassifier

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

iris = load_iris()
X, y = iris.data, iris.target
X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=0.2, random_state=123)

Code:

from sklearn.model_selection import GridSearchCV

from sklearn.pipeline import Pipeline
from mlxtend.feature_selection import SequentialFeatureSelector as SFS
import mlxtend

knn1 = KNeighborsClassifier()
knn2 = KNeighborsClassifier()

sfs1 = SFS(estimator=knn1,
k_features=3,
forward=True,
floating=False,
scoring='accuracy',
cv=5)

pipe = Pipeline([('sfs', sfs1),

('knn2', knn2)])

param_grid = {
'sfs__k_features': [1, 2, 3],
'sfs__estimator__n_neighbors': [3, 4, 7], # inner knn
'knn2__n_neighbors': [3, 4, 7] # outer knn
}

gs = GridSearchCV(estimator=pipe,
param_grid=param_grid,
scoring='accuracy',
n_jobs=1,
cv=5,
refit=False)

# run gridearch
gs = gs.fit(X_train, y_train)

Let's take a look at the suggested hyperparameters below:

for i in range(len(gs.cv_results_['params'])): print(gs.cv_results_['params'][i], 'test acc.:',

gs.cv_results_['mean_test_score'][i])

The "best" parameters determined by GridSearch are ...

Code:

print("Best parameters via GridSearch", gs.best_params_)

Output:

Best parameters via GridSearch {'knn2__n_neighbors': 7,

'sfs__estimator__n_neighbors': 3, 'sfs__k_features': 3}

Code:

pipe.set_params(**gs.best_params_).fit(X_train, y_train)
Pipeline(steps=[('sfs',

SequentialFeatureSelector(estimator=KNeighborsClassifier(n_neighbors=3),
k_features=3,
scoring='accuracy')),
('knn2', KNeighborsClassifier(n_neighbors=7))])

Example 9 -- Selecting the "best" feature combination in a k-

range

If k_features is set to to a tuple (min_k, max_k) (new in 0.4.2), the SFS will now
select the best feature combination that it discovered by iterating from k=1 to max_k
(forward), or max_k to min_k (backward). The size of the returned feature subset is then
within max_k to min_k, depending on which combination scored best during cross
validation.

Code:

X.shape

Output:
(150, 4)

Code:
from mlxtend.feature_selection import SequentialFeatureSelector as SFS
from sklearn.neighbors import KNeighborsClassifier
from mlxtend.data import wine_data
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

X, y = wine_data()
X_train, X_test, y_train, y_test= train_test_split(X, y,
stratify=y,
test_size=0.3,
random_state=1)

knn = KNeighborsClassifier(n_neighbors=2)

sfs1 = SFS(estimator=knn,
k_features=(3, 10),
forward=True,
floating=False,
scoring='accuracy',
cv=5)

pipe = make_pipeline(StandardScaler(), sfs1)

pipe.fit(X_train, y_train)

print('best combination (ACC: %.3f): %s\n' % (sfs1.k_score_,

sfs1.k_feature_idx_))
print('all subsets:\n', sfs1.subsets_)
plot_sfs(sfs1.get_metric_dict(), kind='std_err');

Output:

best combination (ACC: 0.992): (0, 1, 2, 3, 6, 8, 9, 10, 11, 12)

all subsets:
{1: {'feature_idx': (6,), 'cv_scores': array([0.84 , 0.64 , 0.84 , 0.8 ,
0.875]), 'avg_score': 0.799, 'feature_names': ('6',)}, 2: {'feature_idx':
(6, 9), 'cv_scores': array([0.92 , 0.88 , 1. , 0.96 ,
0.91666667]), 'avg_score': 0.9353333333333333, 'feature_names': ('6',
'9')}, 3: {'feature_idx': (6, 9, 12), 'cv_scores': array([0.92 , 0.92
, 0.96 , 1. , 0.95833333]), 'avg_score': 0.9516666666666665,
'feature_names': ('6', '9', '12')}, 4: {'feature_idx': (3, 6, 9, 12),
'cv_scores': array([0.96 , 0.96 , 0.96 , 1. ,
0.95833333]), 'avg_score': 0.9676666666666666, 'feature_names': ('3', '6',
'9', '12')}, 5: {'feature_idx': (3, 6, 9, 10, 12), 'cv_scores':
array([0.92, 0.96, 1. , 1. , 1. ]), 'avg_score': 0.976, 'feature_names':
('3', '6', '9', '10', '12')}, 6: {'feature_idx': (2, 3, 6, 9, 10, 12),
'cv_scores': array([0.92, 0.96, 1. , 0.96, 1. ]), 'avg_score': 0.968,
'feature_names': ('2', '3', '6', '9', '10', '12')}, 7: {'feature_idx': (0,
2, 3, 6, 9, 10, 12), 'cv_scores': array([0.92, 0.92, 1. , 1. , 1. ]),
'avg_score': 0.968, 'feature_names': ('0', '2', '3', '6', '9', '10',
'12')}, 8: {'feature_idx': (0, 2, 3, 6, 8, 9, 10, 12), 'cv_scores':
array([1. , 0.92, 1. , 1. , 1. ]), 'avg_score': 0.984, 'feature_names':
('0', '2', '3', '6', '8', '9', '10', '12')}, 9: {'feature_idx': (0, 2, 3,
6, 8, 9, 10, 11, 12), 'cv_scores': array([1. , 0.92, 1. , 1. , 1. ]),
'avg_score': 0.984, 'feature_names': ('0', '2', '3', '6', '8', '9', '10',
'11', '12')}, 10: {'feature_idx': (0, 1, 2, 3, 6, 8, 9, 10, 11, 12),
'cv_scores': array([1. , 0.96, 1. , 1. , 1. ]), 'avg_score': 0.992,
'feature_names': ('0', '1', '2', '3', '6', '8', '9', '10', '11', '12')}}

Example 10 -- Using other cross-validation schemes

In addition to standard k-fold and stratified k-fold, other cross validation schemes can be used
with SequentialFeatureSelector. For example, GroupKFold or LeaveOneOut cross-
validation from scikit-learn.

Using GroupKFold with SequentialFeatureSelector

Code:

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

from sklearn.neighbors import KNeighborsClassifier
from mlxtend.data import iris_data
from sklearn.model_selection import GroupKFold
import numpy as np

X, y = iris_data()
groups = np.arange(len(y)) // 10
print('groups: {}'.format(groups))

Output:

groups: [ 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 2 2
2 2
2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4
4 4 5 5 5 5 5 5 5 5 5 5 6 6 6 6 6 6 6 6 6 6 7 7
7 7 7 7 7 7 7 7 8 8 8 8 8 8 8 8 8 8 9 9 9 9 9 9
9 9 9 9 10 10 10 10 10 10 10 10 10 10 11 11 11 11 11 11 11 11 11 11
12 12 12 12 12 12 12 12 12 12 13 13 13 13 13 13 13 13 13 13 14 14 14 14
14 14 14 14 14 14]

Calling the split() method of a scikit-learn cross-validator object will return a generator
that yields train, test splits

Code:

cv_gen = GroupKFold(4).split(X, y, groups)

cv_gen

<generator object _BaseKFold.split at 0x2877b27b0>

The cv parameter of SequentialFeatureSelector must be either an int or an

iterable yielding train, test splits. This iterable can be constructed by passing the train, test
split generator to the built-in list() function.

Code:

cv = list(cv_gen)
knn = KNeighborsClassifier(n_neighbors=2)
sfs = SFS(estimator=knn,
k_features=2,
scoring='accuracy',
cv=cv)

sfs.fit(X, y)

print('best combination (ACC: %.3f): %s\n' % (sfs.k_score_,

sfs.k_feature_idx_))

Output:

best combination (ACC: 0.940): (2, 3)

Example 11 - Interrupting Long Runs for Intermediate Results

If your run is taking too long, it is possible to trigger a KeyboardInterrupt (e.g., ctrl+c
on a Mac, or interrupting the cell in a Jupyter notebook) to obtain temporary results.

Toy dataset

Code:

from sklearn.datasets import make_classification

from sklearn.model_selection import train_test_split

X, y = make_classification(
n_samples=20000,
n_features=500,
n_informative=10,
n_redundant=40,
n_repeated=25,
n_clusters_per_class=5,
flip_y=0.05,
class_sep=0.5,
random_state=123,
)

X_train, X_test, y_train, y_test = train_test_split(

X, y, test_size=0.2, random_state=123
)

Output:

Long run with interruption

Code:

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

from sklearn.linear_model import LogisticRegression

model = LogisticRegression()

sfs1 = SFS(model,
k_features=10,
forward=True,
floating=False,
verbose=2,
scoring='accuracy',
cv=5)

sfs1 = sfs1.fit(X_train, y_train)

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 500 out of 500 | elapsed: 7.8s finished

[2022-09-13 21:10:39] Features: 1/10 -- score: 0.5965[Parallel(n_jobs=1)]:

Using backend SequentialBackend with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.2s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 499 out of 499 | elapsed: 25.5s finished

[2022-09-13 21:11:04] Features: 2/10 -- score:

0.6256875000000001[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.1s remaining:
0.0s
STOPPING EARLY DUE TO KEYBOARD INTERRUPT...

Finalizing the fit

Note that the feature selection run hasn't finished, so certain attributes may not be available.
In order to use the SFS instance, it is recommended to call finalize_fit, which will make
SFS estimator appear as "fitted" process the temporary results:

Code:

sfs1.finalize_fit()
print(sfs1.k_feature_idx_)
print(sfs1.k_score_)

Output:

(128, 160)
0.6256875000000001

Example 12 - Using Pandas DataFrames

Optionally, we can also use pandas DataFrames and pandas Series as input to the fit
function. In this case, the column names of the pandas DataFrame will be used as feature
names. However, note that if custom_feature_names are provided in the fit function,
these custom_feature_names take precedence over the DataFrame column-based
feature names.

Code:

import pandas as pd
from sklearn.neighbors import KNeighborsClassifier
from sklearn.datasets import load_iris
from mlxtend.feature_selection import SequentialFeatureSelector as SFS

iris = load_iris()
X = iris.data
y = iris.target
knn = KNeighborsClassifier(n_neighbors=4)

sfs1 = SFS(knn,
k_features=3,
forward=True,
floating=False,
scoring='accuracy',
cv=0)

Code:

X_df = pd.DataFrame(X, columns=['sepal len', 'petal len',

'sepal width', 'petal width'])
X_df.head()

Output:

sepal len petal len sepal width petal width

0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2

Also, the target array, y, can be optionally be cast as a Series:

Code:

y_series = pd.Series(y)
y_series.head()

Output:

0 0
1 0
2 0
3 0
4 0
dtype: int64

Code:

sfs1 = sfs1.fit(X_df, y_series)

Note that the only difference of passing a pandas DataFrame as input is that the
sfs1.subsets_ array will now contain a new column,

Code:

sfs1.subsets_

Output:

{1: {'feature_idx': (3,),

'cv_scores': array([0.96]),
'avg_score': 0.96,
'feature_names': ('petal width',)},
2: {'feature_idx': (2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('sepal width', 'petal width')},
3: {'feature_idx': (1, 2, 3),
'cv_scores': array([0.97333333]),
'avg_score': 0.9733333333333334,
'feature_names': ('petal len', 'sepal width', 'petal width')}}
In mlxtend version >= 0.13 pandas DataFrames are supported as feature inputs to the
SequentianFeatureSelector instead of NumPy arrays or other NumPy-like array
types.

Example 13 - Specifying Fixed Feature Sets

Often, it may be useful to specify a fixed set of features we want to use for a given model
(e.g., determined by prior knowledge or domain knowledge). Since MLxtend v 0.18.0, it is
now possible to specify such features via the fixed_features attribute. This will mean that
these features are guaranteed to be included in the selected subsets.

Note that this feature works for all options regarding forward and backward selection, and
using floating selection or not.

The example below illustrates how we can set the features 0 and 2 in the dataset as fixed:

Code:

from sklearn.neighbors import KNeighborsClassifier

from sklearn.datasets import load_iris

iris = load_iris()
X = iris.data
y = iris.target
knn = KNeighborsClassifier(n_neighbors=3)

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

sfs1 = SFS(knn,
k_features=4,
forward=True,
floating=False,
verbose=2,
scoring='accuracy',
fixed_features=(0, 2),
cv=3)

sfs1 = sfs1.fit(X, y)

Output:

[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent

workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 2 out of 2 | elapsed: 0.0s finished

[2022-09-13 21:17:21] Features: 3/4 -- score:

0.9733333333333333[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s finished

[2022-09-13 21:17:21] Features: 4/4 -- score: 0.9733333333333333

Code:

sfs1.subsets_

Output:

{2: {'feature_idx': (0, 2),

'cv_scores': array([0.98, 0.92, 0.94]),
'avg_score': 0.9466666666666667,
'feature_names': ('0', '2')},
3: {'feature_idx': (0, 2, 3),
'cv_scores': array([0.98, 0.96, 0.98]),
'avg_score': 0.9733333333333333,
'feature_names': ('0', '2', '3')},
4: {'feature_idx': (0, 1, 2, 3),
'cv_scores': array([0.98, 0.96, 0.98]),
'avg_score': 0.9733333333333333,
'feature_names': ('0', '1', '2', '3')}}

If the input dataset is a pandas DataFrame, we can also use the column names directly:

Code:

import pandas as pd
X_df = pd.DataFrame(X, columns=['sepal len', 'petal len',
'sepal width', 'petal width'])
X_df.head()

Output:

sepal len petal len sepal width petal width

0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2

Code:

sfs2 = SFS(knn,
k_features=4,
forward=True,
floating=False,
verbose=2,
scoring='accuracy',
fixed_features=('sepal len', 'petal len'),
cv=3)

sfs2 = sfs2.fit(X_df, y_series)

Output:
[Parallel(n_jobs=1)]: Using backend SequentialBackend with 1 concurrent
workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 2 out of 2 | elapsed: 0.0s finished

[2022-09-13 21:17:25] Features: 3/4 -- score:

0.9466666666666667[Parallel(n_jobs=1)]: Using backend SequentialBackend
with 1 concurrent workers.
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s remaining:
0.0s
[Parallel(n_jobs=1)]: Done 1 out of 1 | elapsed: 0.0s finished

[2022-09-13 21:17:25] Features: 4/4 -- score: 0.9733333333333333

Code:

sfs2.subsets_

Output:

{2: {'feature_idx': (0, 1),

'cv_scores': array([0.72, 0.74, 0.78]),
'avg_score': 0.7466666666666667,
'feature_names': ('sepal len', 'petal len')},
3: {'feature_idx': (0, 1, 2),
'cv_scores': array([0.98, 0.92, 0.94]),
'avg_score': 0.9466666666666667,
'feature_names': ('sepal len', 'petal len', 'sepal width')},
4: {'feature_idx': (0, 1, 2, 3),
'cv_scores': array([0.98, 0.96, 0.98]),
'avg_score': 0.9733333333333333,
'feature_names': ('sepal len', 'petal len', 'sepal width', 'petal
width')}}

Example 13 - Working with Feature Groups

Since mlxtend v0.21.0, it is possible to specify feature groups. Feature groups allow you to
group certain features together, such that they are always selected as a group. This can be
very useful in contexts similar to one-hot encoding -- if you want to treat the one-hot encoded
feature as a single feature:
In the following example, we specify sepal length and sepal width as a feature group so that
they are always selected together:

Code:

from sklearn.datasets import load_iris

import pandas as pd

iris = load_iris()
X = iris.data
y = iris.target

X_df = pd.DataFrame(X, columns=['sepal len', 'petal len',

'sepal wid', 'petal wid'])
X_df.head()

Output:

sepal len petal len sepal wid petal wid

0 5.1 3.5 1.4 0.2
1 4.9 3.0 1.4 0.2
2 4.7 3.2 1.3 0.2
3 4.6 3.1 1.5 0.2
4 5.0 3.6 1.4 0.2

Code:

from sklearn.neighbors import KNeighborsClassifier

from mlxtend.feature_selection import SequentialFeatureSelector as SFS

knn = KNeighborsClassifier(n_neighbors=3)
sfs1 = SFS(knn,
k_features=2,
scoring='accuracy',
feature_groups=(['sepal len', 'sepal wid'], ['petal len'],
['petal wid']),
cv=3)

Code:
sfs1 = sfs1.fit(X_df, y)
sfs1 = SFS(knn,
k_features=2,
scoring='accuracy',
feature_groups=[[0, 2], [1], [3]],
cv=3)

sfs1 = sfs1.fit(X, y)

API
SequentialFeatureSelector(estimator, k_features=1, forward=True, floating=False,
verbose=0, scoring=None, cv=5, n_jobs=1, pre_dispatch='2n_jobs', clone_estimator=True,
fixed_features=None, feature_groups=None)*

Sequential Feature Selection for Classification and Regression.

Methods

fit(X, y, custom_feature_names=None, groups=None, fit_params)

Perform feature selection and learn model from training data.

fit_transform(X, y, groups=None, fit_params)

Fit to training data then reduce X to its most important features.

get_metric_dict(confidence_interval=0.95)

Return metric dictionary

get_params(deep=True)

Get parameters for this estimator.

set_params(params)

Set the parameters of this estimator. Valid parameter keys can be listed with get_params().

Returns

self
transform(X)

Reduce X to its most important features.

Properties

named_estimators

Returns

List of named estimator tuples, like [('svc', SVC(...))]

ython

Multirate Digital Signal Processing Crochiere-Rabiner
100% (4)
Multirate Digital Signal Processing Crochiere-Rabiner
431 pages
Minutely
No ratings yet
Minutely
1 page
Backward && Forward Feature Selection PART-2
No ratings yet
Backward && Forward Feature Selection PART-2
6 pages
5) Randomforest - Ipynb - Colaboratory
No ratings yet
5) Randomforest - Ipynb - Colaboratory
12 pages
MDS372_LAB4_2448001
No ratings yet
MDS372_LAB4_2448001
17 pages
Know Your Dataset: Season Holiday Weekday Workingday CNT 726 727 728 729 730
No ratings yet
Know Your Dataset: Season Holiday Weekday Workingday CNT 726 727 728 729 730
1 page
Programs Lab Bca
No ratings yet
Programs Lab Bca
16 pages
Homework: Grid Search For Hyperparameter Tuning: From Sklearn - Model - Selection Import Train - Test - Split
No ratings yet
Homework: Grid Search For Hyperparameter Tuning: From Sklearn - Model - Selection Import Train - Test - Split
9 pages
Warpper Method
No ratings yet
Warpper Method
8 pages
FB Models PDF
No ratings yet
FB Models PDF
14 pages
Smart Factory Energy Prediction.ipynb
No ratings yet
Smart Factory Energy Prediction.ipynb
355 pages
MlLabManualdocx 2024 09 04 22 02 58
No ratings yet
MlLabManualdocx 2024 09 04 22 02 58
19 pages
Lab 4_Logistic Regression_kNN_Notes
No ratings yet
Lab 4_Logistic Regression_kNN_Notes
6 pages
Codigo Modelo
No ratings yet
Codigo Modelo
5 pages
COMPARISON - Jupyter Notebook
No ratings yet
COMPARISON - Jupyter Notebook
5 pages
This Study Resource Was
No ratings yet
This Study Resource Was
5 pages
Import Numpy As NP Import Pandas As PD
No ratings yet
Import Numpy As NP Import Pandas As PD
7 pages
vertopal.com_project
No ratings yet
vertopal.com_project
16 pages
MLA Lab 6:-Implementation of Decision Tree
No ratings yet
MLA Lab 6:-Implementation of Decision Tree
16 pages
ML0101EN Clas K Nearest Neighbors CustCat Py v1
100% (1)
ML0101EN Clas K Nearest Neighbors CustCat Py v1
11 pages
05 E RandomForest LoanData
No ratings yet
05 E RandomForest LoanData
8 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
lab manual ML
No ratings yet
lab manual ML
23 pages
ML
No ratings yet
ML
11 pages
Decision Tree Algorithm in Machine Learning
No ratings yet
Decision Tree Algorithm in Machine Learning
13 pages
FDP Session 4 (Decision Tree)
No ratings yet
FDP Session 4 (Decision Tree)
1 page
Code:: To Find Frequent Itemsets and Association Between Different Itemsets Using Apriori Algorithm
No ratings yet
Code:: To Find Frequent Itemsets and Association Between Different Itemsets Using Apriori Algorithm
28 pages
ai int-1
No ratings yet
ai int-1
6 pages
MANUAL (2)
No ratings yet
MANUAL (2)
33 pages
Slip
No ratings yet
Slip
5 pages
Decision tree
No ratings yet
Decision tree
4 pages
v
No ratings yet
v
8 pages
graph_analysis3_code
No ratings yet
graph_analysis3_code
2 pages
1
No ratings yet
1
13 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
External
No ratings yet
External
11 pages
AI Assignment-6
No ratings yet
AI Assignment-6
7 pages
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 7
No ratings yet
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 7
23 pages
MLT - Lab - Manual FINAL
No ratings yet
MLT - Lab - Manual FINAL
38 pages
Mini Project With Output
No ratings yet
Mini Project With Output
8 pages
CO3
No ratings yet
CO3
8 pages
CSET301 LabW8L2
No ratings yet
CSET301 LabW8L2
1 page
vertopal.com_Untitled57
No ratings yet
vertopal.com_Untitled57
4 pages
Feature Select
No ratings yet
Feature Select
13 pages
ml.yogesh
No ratings yet
ml.yogesh
23 pages
Tensor Flow and Keras Sample Programs
No ratings yet
Tensor Flow and Keras Sample Programs
22 pages
Machine learning lab manual
No ratings yet
Machine learning lab manual
9 pages
23ucc542_ml9
No ratings yet
23ucc542_ml9
6 pages
ML Lab File[1]
No ratings yet
ML Lab File[1]
43 pages
Aiml Lab
No ratings yet
Aiml Lab
14 pages
AIML CODES
No ratings yet
AIML CODES
11 pages
Machine Learning Practice
No ratings yet
Machine Learning Practice
17 pages
Implementing Custom Randomsearchcv: 'Red' 'Blue'
No ratings yet
Implementing Custom Randomsearchcv: 'Red' 'Blue'
1 page
Pramkk
No ratings yet
Pramkk
10 pages
Ex 6,EX 7 AIML
No ratings yet
Ex 6,EX 7 AIML
9 pages
MLLabManual
No ratings yet
MLLabManual
24 pages
m1
No ratings yet
m1
10 pages
Ml Manual
No ratings yet
Ml Manual
30 pages
Tutorial Classification Py
No ratings yet
Tutorial Classification Py
7 pages
ML NEW Final Format
No ratings yet
ML NEW Final Format
37 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
4th_ETE
No ratings yet
4th_ETE
4 pages
4th ME
No ratings yet
4th ME
4 pages
MC
No ratings yet
MC
79 pages
2310.06422v2
No ratings yet
2310.06422v2
7 pages
Pollution and congestion in urban areas
No ratings yet
Pollution and congestion in urban areas
19 pages
python filtering
No ratings yet
python filtering
7 pages
Chi-Square Test
No ratings yet
Chi-Square Test
6 pages
group_by python code
No ratings yet
group_by python code
11 pages
Merge Append Python Code
No ratings yet
Merge Append Python Code
5 pages
Covariance Matrix
No ratings yet
Covariance Matrix
6 pages
RF Filter Design PPT
100% (2)
RF Filter Design PPT
120 pages
Handout For Diploma in Medical Image Processing
No ratings yet
Handout For Diploma in Medical Image Processing
4 pages
Question Paper
No ratings yet
Question Paper
3 pages
Adaptive Quadrature
No ratings yet
Adaptive Quadrature
8 pages
8 Puzzle
No ratings yet
8 Puzzle
2 pages
hw1 Sols PDF
No ratings yet
hw1 Sols PDF
5 pages
Chapter 6: Test Practice Problems
No ratings yet
Chapter 6: Test Practice Problems
6 pages
Module 6
No ratings yet
Module 6
5 pages
MMC Unit III-1
No ratings yet
MMC Unit III-1
122 pages
FEM - TYME - Lecture 2 PDF
No ratings yet
FEM - TYME - Lecture 2 PDF
5 pages
Digital Signal Processing Module 1 BPUT Syllabus
No ratings yet
Digital Signal Processing Module 1 BPUT Syllabus
80 pages
Hmmorgan
No ratings yet
Hmmorgan
26 pages
pereira2021
No ratings yet
pereira2021
7 pages
2021ece-,2022,2023-8th-EE-DIP-PYQ BEU
No ratings yet
2021ece-,2022,2023-8th-EE-DIP-PYQ BEU
13 pages
Analysis and Design of Algorithm Notes
No ratings yet
Analysis and Design of Algorithm Notes
44 pages
1-s2.0-S2772941924000358-main
No ratings yet
1-s2.0-S2772941924000358-main
16 pages
Ibrahim Chapter6c
No ratings yet
Ibrahim Chapter6c
15 pages
Bagging and Boosting
No ratings yet
Bagging and Boosting
32 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
6 pages
CEDRON - OLA-LP Model Formulation
No ratings yet
CEDRON - OLA-LP Model Formulation
7 pages
Chapter 13 - Association Rules: Data Mining For Business Intelligence
No ratings yet
Chapter 13 - Association Rules: Data Mining For Business Intelligence
22 pages
Chap3 HW
No ratings yet
Chap3 HW
2 pages
Simplified AES
No ratings yet
Simplified AES
9 pages
Biomedical Signal Processing
No ratings yet
Biomedical Signal Processing
72 pages
Semester - 3 EE 3101 Introduction To System Theory Distribution Theory and Fourier Transform of Some Special Functions
No ratings yet
Semester - 3 EE 3101 Introduction To System Theory Distribution Theory and Fourier Transform of Some Special Functions
20 pages
Data Science Final Exam Fall 2023 SOL
No ratings yet
Data Science Final Exam Fall 2023 SOL
6 pages
Shape Functions and Numerical Integration
No ratings yet
Shape Functions and Numerical Integration
3 pages
100 Days of ML
100% (1)
100 Days of ML
15 pages
Session 3 - Local Search
No ratings yet
Session 3 - Local Search
34 pages