Feature Selection Using Forest Optimization Algorithm
Article history: Received 17 August 2015; Received in revised form 26 March 2016; Accepted 11 May 2016; Available online 24 May 2016.

Keywords: Feature selection; Forest Optimization Algorithm (FOA); KNN classifier; Dimension reduction; FSFOA

Abstract

Feature selection, as a combinatorial optimization problem, is an important preprocessing step in data mining; it improves the performance of learning algorithms by removing irrelevant and redundant features. As evolutionary algorithms are reported to be suitable for optimization tasks, the Forest Optimization Algorithm (FOA) - which was initially proposed for continuous search problems - is adapted here for feature selection as a discrete search space problem. As the result, Feature Selection using Forest Optimization Algorithm (FSFOA) is proposed in this article in order to select the more informative features from the datasets. The proposed FSFOA is validated on several real-world datasets and compared with other methods, including HGAFS, PSO and SVM-FuzCoc. The results of the experiments show that FSFOA can improve the classification accuracy of classifiers on some of the selected datasets. We also compare the dimensionality reduction of the proposed FSFOA with that of other available methods.
In Section 4, the application of FOA to feature selection (FSFOA) is presented, and Section 5 is devoted to the experiments and results on the proposed FSFOA. Finally, Section 6 summarizes the main conclusions.

2. An overview of feature selection methods

Many researchers have addressed the feature selection (FS) problem up to now, and more effort is still needed to further speed up the process of selecting informative and useful features in databases for data mining.

The earliest methods in the FS literature based on machine learning algorithms are filters [11,12]. In all filters, heuristic techniques based on general characteristics of the data, such as information gain and distance, are used instead of learning algorithms. Another approach in feature selection is wrapper methods [11,19]. In contrast to filters, wrappers use learning algorithms to investigate the worth of the selected features [41]. Generally, wrappers produce better results than filters, because the wrapper approach takes the relationship between the learning algorithm and the training data into account. The well-known drawback of wrappers is that they are slower than filters, because the learning algorithm must be executed repeatedly for every selected feature subset. Sometimes a hybrid of filter and wrapper methods is used; hybrid methods integrate feature selection within the learning algorithm in order to exploit the advantages of both wrappers and filters [11]. Regardless of the filter or wrapper approach, feature selection methods fall into one of the following groups: complete search, heuristic search and meta-heuristic methods.

Almuallim and Dietterich presented the FOCUS method, which completely searches the search space until it reaches the smallest set of features that divides the training data into pure classes [2,3]. But with n features to handle, there are 2^n - 1 possible subsets of features, so evaluating all of the subsets is practically impossible in datasets with many features. As a result, complete search methods are seldom used for feature selection in large datasets with many features.

Heuristic methods for the feature selection problem include the greedy hill climbing algorithm [25,26], the branch and bound method, beam search and the best first algorithm. The greedy hill climbing algorithm evaluates all local changes in order to select the relevant features [11,25]. SFS (Sequential Forward Selection) and SBS (Sequential Backward Selection) are two kinds of hill climbing methods. SFS starts with an empty set of selected features, and each step of the algorithm adds one of the informative features to the selected set; SBS, in contrast, starts with the full set of features, and in each step one of the redundant or irrelevant features is omitted. Bi-directional search is another method, which considers both adding and deleting features simultaneously [11]. The main drawback of both the SFS and SBS algorithms is the "nesting effect" problem: once a change is considered positive (either the addition or the deletion of a feature), there is no chance of re-evaluating that feature. Later, in order to overcome the "nesting effect" of the SFS and SBS algorithms, SFFS (Sequential Forward Floating Selection) and SBFS (Sequential Backward Floating Selection) were introduced [24]. Best first search is another method which, like hill climbing, considers local changes in the search space, but unlike hill climbing methods it allows backtracking in the search space [11].
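To make the wrapper idea and the SFS procedure concrete, the following minimal Python sketch performs sequential forward selection with a KNN wrapper. The scikit-learn classifier, the 3-NN setting and the 5-fold cross-validation protocol are illustrative assumptions only and are not settings taken from this article.

# Minimal sketch of wrapper-based Sequential Forward Selection (SFS).
# Assumes scikit-learn; the 3-NN classifier and 5-fold cross-validation
# are illustrative choices, not settings taken from this article.
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

def sequential_forward_selection(X, y, max_features=None):
    n_features = X.shape[1]
    if max_features is None:
        max_features = n_features
    selected, best_score = [], 0.0
    while len(selected) < max_features:
        best_candidate, candidate_score = None, best_score
        for f in range(n_features):
            if f in selected:
                continue
            subset = selected + [f]
            # Wrapper evaluation: the learning algorithm itself scores the subset.
            score = cross_val_score(KNeighborsClassifier(n_neighbors=3),
                                    X[:, subset], y, cv=5).mean()
            if score > candidate_score:
                best_candidate, candidate_score = f, score
        if best_candidate is None:       # no single addition improves accuracy
            break
        # Once a feature is added it is never re-evaluated (the "nesting effect").
        selected.append(best_candidate)
        best_score = candidate_score
    return selected, best_score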
Heuristic algorithms perform better than complete search methods when time complexities are compared, but recently meta-heuristic algorithms such as the Genetic Algorithm (GA), Particle Swarm Optimization (PSO) and Ant Colony Optimization (ACO) have shown more desirable results. The main advantage of the meta-heuristic methods is their acceptable time complexity. Owing to the random nature of meta-heuristic search methods, the application of genetic algorithms, particle swarm optimization and ant colony optimization in the feature selection domain has shown promising results [18]; some of these are summarized in the following.

Hamdani et al. proposed a new algorithm based on hierarchical genetic algorithms with a bi-coded chromosome representation and a new evaluation function [13]. In order to minimize the computational cost and also increase the convergence speed, they used a hierarchical algorithm with homogeneous and heterogeneous populations. In another attempt, Zhu et al. proposed a new algorithm which combines a genetic algorithm with a local search method [40]. At first, the GA population is generated randomly; then local search is applied to all individuals of the population in order to improve the classification accuracy and speed up the search process. Tan et al. used an SVM (Support Vector Machine) within a wrapper approach [31] in a GA; in their algorithm, the GA searches for the best feature subset and the classification accuracy of the SVM guides the search process. Gheyas et al. combined simulated annealing (SA) and GA to use the advantages of both [10]. In their proposed SAGA, the GA helps to escape from local optima of SA with the crossover operator. Nemati et al. proposed a new hybrid algorithm of GA and ACO in order to use the advantages of both algorithms [22]; in their algorithm, ACO performs a local search, while the GA performs a global search. Sivagaminathan et al. used ACO to search for a near-optimum solution, with an ANN used as the classifying function [28]. ElAlami et al. proposed an algorithm based on GA which optimizes the output nodes of an ANN [7]; in their method, the ANN is used to give a weight to each feature and the GA finds the optimal relevant features. Kabir et al. proposed a new hybrid algorithm that combines GA with a local search method (HGAFS) [17]. Their method selects a feature subset with a limited size, which is its important aspect; it is a wrapper-based method that uses both GA and ANN. In another attempt, Tabakhi et al. presented an unsupervised feature selection method based on ant colony optimization, called UFSACO [29]. UFSACO is a filter-based method, and the search space is represented as a fully connected undirected weighted graph. Xue et al. proposed a series of methods based on PSO with novel initialization and updating mechanisms [35]. In their algorithm, three new initialization strategies and three new personal best and global best updating mechanisms in PSO are presented to develop novel feature selection approaches, in which maximizing the classification performance, minimizing the number of features and reducing the computational time are the main goals.

Despite good progress in solving the feature selection problem, more study is welcome to further optimize the solutions. In all the proposed methods, one has to trade off computational feasibility against the optimality of the selected features. Further research is needed to develop more promising feature selection methods that provide very good results. In the present work, the FSFOA algorithm is proposed to further optimize the results of feature selection methods in terms of improving classification accuracy.

3. An overview of the Forest Optimization Algorithm (FOA)

The Forest Optimization Algorithm is an evolutionary algorithm which is inspired by the procedure of a few trees in the forests [9]. FOA was proposed to solve continuous search space problems, but in this article we have attempted to adapt it to discrete search space problems like feature selection. FOA involves three main stages: 1 - Local seeding of the trees, 2 - Population limiting, and 3 - Global seeding of the trees. In nature, some
seeds fall just beneath the parent tree and then they turn into
young trees [9]; which is simulated by local seeding in FOA. After
initialization of the trees, the local seeding stage will operate on
trees with “Age” ‘0’ to simulate the nearby seeds of the parent
trees. Then all the trees, except the newly generated ones, get older and
their “Age” increases by ‘1’. This stage simulates the local search of
the algorithm.
Next stage is population limiting in which the trees with “Age”
bigger than “life time” parameter will be omitted from the forest
and they will form the candidate population [9]. Also in population
limiting stage, the rest of the trees of the forest are sorted ac-
cording to their fitness value and if the number of whole trees of
the forest exceeds the pre-defined “area limit” parameter, the extra
trees will join the candidate population too. In the global
seeding stage, a percentage of the candidate population is chosen.
The selected trees from the candidate population will be used in
the global seeding stage. Global seeding stage simulates the global
search of FOA [9]. Next stage in FOA is updating the best tree in
which the best solution is selected according to its fitness value
and its "Age" is set to 0 in order to prevent the best tree from aging and later being removed from the forest. These stages continue iteratively until the termination criterion is met. The Forest Optimization Algorithm has five parameters which should be initialized at the start of the algorithm [9]: "life time", "LSC", "GSC", "transfer rate" and "area limit", as listed in the signature of Algorithm 1.
Algorithm 1. FSFOA (life time, LSC, GSC, transfer rate, area limit)
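For illustration, a minimal Python sketch of the FSFOA main loop is given below, with parameter names taken from the signature of Algorithm 1. The stage helpers follow the descriptions of Sections 4.1-4.4 and are sketched after the corresponding subsections below; the list-based tree representation, the initial forest size and the fixed iteration count used as the termination criterion are assumptions of this sketch, and fitness(tree) is assumed to return the classification accuracy obtained with the features whose bits are set to 1.

# Minimal sketch of the FSFOA main loop; parameter names follow Algorithm 1.
# Each tree is assumed to be a Python list of n 0/1 feature bits plus a
# trailing "Age" entry.  The stage helpers (initialize_forest, local_seeding,
# population_limiting, global_seeding) are sketched in Sections 4.1-4.4 below.
import random

def fsfoa(fitness, n_features, life_time, lsc, gsc, transfer_rate, area_limit,
          max_iterations=100):
    forest = initialize_forest(area_limit, n_features)          # Section 4.1
    best_tree = max(forest, key=fitness)[:]
    for _ in range(max_iterations):
        forest += local_seeding(forest, lsc)                    # Section 4.2
        forest, candidates = population_limiting(               # Section 4.3
            forest, life_time, area_limit, fitness)
        n_chosen = int(transfer_rate * len(candidates))         # transfer_rate in [0, 1]
        forest += global_seeding(random.sample(candidates, n_chosen), gsc)  # Section 4.4
        # Update the best tree and reset its "Age" so it is never discarded.
        current_best = max(forest, key=fitness)
        current_best[-1] = 0
        if fitness(current_best) > fitness(best_tree):
            best_tree = current_best[:]
    return best_tree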
4. The proposed feature selection using forest optimization algorithm (FSFOA)

The stages of FOA are adapted to the feature selection problem as follows.

4.1. Initialize trees

The forest is initialized by randomly generated trees [9]. At first, each variable of each tree in FSFOA is initialized randomly with either '0' or '1'. If a dataset has n features, the size of each tree will be 1 x (n + 1), where one of the variables shows the "Age" of that tree. Each '1' in a tree indicates that the corresponding feature is selected and therefore involved in the machine learning process, and each '0' shows the exclusion of the related feature from the learning process. At first, the "Age" of each tree is considered to be '0', but local seeding in each iteration of the algorithm will increase the "Age" of all trees except the ones newly generated in the local seeding stage.
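As an illustration of this encoding, the sketch below builds a random forest of such binary trees. The representation (n feature bits plus a trailing "Age" value) follows the description above, while the use of Python lists and the assumption that the initial number of trees equals the "area limit" parameter are choices made for this sketch.

# Sketch of tree initialization (Section 4.1): every tree is a random binary
# vector of length n plus one trailing "Age" entry that starts at 0.  The
# initial forest size is assumed here to equal the "area limit" parameter.
import random

def initialize_forest(area_limit, n_features):
    forest = []
    for _ in range(area_limit):
        bits = [random.randint(0, 1) for _ in range(n_features)]  # 1 = feature selected
        forest.append(bits + [0])                                  # trailing "Age" = 0
    return forest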
4.2. Local seeding

This stage adds some neighbors of each tree with "Age" 0 to the forest [9]. In order to simulate this stage in FSFOA, for each tree of the forest with "Age" 0, some variables are selected randomly (the "LSC" parameter determines the number of selected variables). Then the values of the selected variables are changed from 0 to 1 or vice versa. This procedure simulates local search in the search space, because each time the importance of one feature is evaluated by adding or removing that feature prior to running the learning algorithm. Fig. 2 shows an example of the local seeding operator on one tree, where the number of features of the dataset is 5 and the value of "LSC" is considered to be 2. After performing the local seeding stage, the "Age" of all trees except the newly generated ones is increased by '1'.

Fig. 2. An example of the local seeding operation on one tree with "LSC" = 2.
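A minimal sketch of this operator under the same list-based representation is given below; the reading that each selected variable yields one new neighbouring tree (so that every new tree adds or removes exactly one feature) is an interpretation of the description above.

# Sketch of the local seeding operator (Section 4.2).  For every tree with
# "Age" 0, LSC variables are picked at random and, for each of them, a copy
# of the tree with that single bit negated is added as a new tree of "Age" 0.
# Afterwards all previously existing trees get older by '1'.
import random

def local_seeding(forest, lsc):
    new_trees = []
    for tree in forest:
        if tree[-1] == 0:                                  # only "Age" 0 trees are seeded
            n_features = len(tree) - 1
            for pos in random.sample(range(n_features), lsc):
                neighbour = tree[:]
                neighbour[pos] = 1 - neighbour[pos]        # flip the chosen feature bit
                neighbour[-1] = 0                          # new trees start at "Age" 0
                new_trees.append(neighbour)
    for tree in forest:
        tree[-1] += 1                                      # existing trees age by '1'
    return new_trees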
4.3. Population limiting

In this stage, two series of trees will be omitted from the forest to form the candidate population: 1 - trees with "Age" bigger than the "life time" parameter, and 2 - the extra trees that exceed the "area limit" parameter after sorting the trees according to their fitness values. This stage forms the candidate population, and a pre-defined percentage of the candidate population is used later in the global seeding stage.
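A minimal sketch of this stage, assuming the same list-based trees and that the function returns both the trimmed forest and the candidate population:

# Sketch of population limiting (Section 4.3).  Trees older than "life time"
# are moved to the candidate population; the remaining trees are sorted by
# fitness and any trees beyond the "area limit" join the candidates as well.
def population_limiting(forest, life_time, area_limit, fitness):
    candidates = [tree for tree in forest if tree[-1] > life_time]
    survivors = [tree for tree in forest if tree[-1] <= life_time]
    survivors.sort(key=fitness, reverse=True)     # best trees first
    candidates += survivors[area_limit:]          # extras beyond "area limit"
    return survivors[:area_limit], candidates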
4.4. Global seeding

In order to perform this stage in FSFOA, at first, for each selected tree from the candidate population, some of the variables are selected randomly. The number of selected variables is determined by the "GSC" parameter. Then the value of each selected variable is negated (changed from 0 to 1 or vice versa). This time, however, the addition or deletion of several features is considered simultaneously, not just one feature at a time. This operator performs a global search in the search space. An example of performing this operator on one tree is shown in Fig. 3. In Fig. 3, the value of the "GSC" parameter is considered to be 3 (3 variables are selected and negated).
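A minimal sketch of this operator follows; the assumption that the newly produced trees enter the forest with "Age" 0 is made for this sketch and is not stated explicitly above.

# Sketch of the global seeding operator (Section 4.4): for each tree chosen
# from the candidate population, GSC variables are picked at random and all
# of them are negated at once, so several features are added or deleted
# simultaneously.
import random

def global_seeding(chosen_trees, gsc):
    new_trees = []
    for tree in chosen_trees:
        n_features = len(tree) - 1
        new_tree = tree[:]
        for pos in random.sample(range(n_features), gsc):
            new_tree[pos] = 1 - new_tree[pos]      # negate the selected variable
        new_tree[-1] = 0                           # assumed: re-enters with "Age" 0
        new_trees.append(new_tree)
    return new_trees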
Table 1
Summary of the selected datasets.

Dataset         #Features   #Instances   #Classes
Heart-statlog   13          270          2
Vehicle         18          846          4
Cleveland       13          303          5
Dermatology     34          366          6
Ionosphere      34          351          2
Sonar           60          208          2
Glass           9           214          7
Wine            13          178          3
Segmentation    19          2310         7
SRBCT           2308        63           4
Hepatitis       19          155          2

Table 2
The value of the "LSC" and "GSC" parameters for each dataset.

Dataset         #Features   "LSC"   "GSC"
Heart-statlog   13          3       6
Vehicle         18          4       9
Cleveland       13          3       6
Dermatology     34          7       15
Ionosphere      34          7       15
Sonar           60          12      30
Glass           9           2       4
Wine            13          3       6
Segmentation    19          4       9
SRBCT           2308        460     700
Hepatitis       19          4       10

Table 3
Summary of the methods used in our comparisons.

Method name      Dataset splitting   Description/year
SFS, SBS, SFFS   70-30               Greedy hill climbing methods(a) [21]/2010
NSM              10-fold             Neighborhood soft margin [14]/2010
SVM-FuzCoc       70-30%              A novel SVM-based FS [21]/2010
HGAFS            2-fold              Hybrid genetic algorithm for FS [16]/2007
FS-NEIR          10-fold             Neighborhood effective information ratio based FS [40]/2013
UFSACO           70-30               Unsupervised FS algorithm based on ACO [29]/2014
PSO(4-2)         10-fold             Particle swarm optimization for feature selection [35]/2013

(a) Sequential Forward Selection, Sequential Backward Selection and Sequential Floating Forward Selection, reported from [21].

Fig. 4. Comparison between the Classification Accuracy (Accuracy) and Dimension Reduction (DR) obtained by FSFOA and other available methods on the "Heart-statlog", "Cleveland", "Vehicle", "Dermatology", "Sonar", "Ionosphere", "Glass", "Segmentation", "Hepatitis", "SRBCT" and "Wine" datasets.

Fig. 5. Graphical comparisons according to Accuracy for each dataset. "Hepatitis" and "Vehicle" are compared according to J48, and the others according to KNN classification accuracy.
5.3. Results and comparisons

We have compared our proposed FSFOA method with some other methods. All the results of our experiments are reported with 95% confidence intervals. The feature selection algorithms selected for the comparisons are: the Neighborhood soft margin (NSM) method proposed by Hu et al. [14], SVM-FuzCoc by Moustakidis et al. [21], the hybrid genetic algorithm for FS (HGAFS) by Huang et al. [16], FS-NEIR, which uses a different feature evaluation criterion, by Zhu et al. [40], an unsupervised feature selection algorithm based on ant colony optimization (UFSACO) proposed by Tabakhi et al. [29], and PSO(4-2), a PSO-based method by Xue et al. [35]. Among the methods, HGAFS uses a support vector machine. SFS, SBS and SFFS are greedy methods and are taken from [21]. SVM-FuzCoc, PSO(4-2) and NSM use 1NN, 5NN and 3NN classifiers, respectively. UFSACO and FS-NEIR report the classification accuracy of the J48 classifier. A summary of these methods is given in Table 3, which also shows how each method used the datasets (10-fold cross-validation, 70% training and 30% testing, or 2-fold cross-validation for training and testing).

The classification accuracy and dimensionality reduction of FSFOA and of the other methods of Table 3 are reported in the tables of Fig. 4. The results reported for FSFOA in Fig. 4 are over 10 independent runs. For each dataset, the best classification accuracy and the best dimension reduction (DR) are highlighted in bold. Dimension Reduction (DR) in Fig. 4 is calculated by Eq. (2). In order to provide fair comparisons, for each dataset multiple results according to dataset splittings with different percentages are reported and considered in our comparisons. Also, for each method the classifier used (i.e. KNN, SVM or J48) is indicated in each table. A summary of the configuration parameters of the classifiers is presented in Table 4, which indicates that the KNN classifier is used with different values of K where needed for the comparisons (K in {1, 3, 5}) and that the J48 classifier of Weka is used as the decision tree based method. The kernel function for the SVM classifier is the radial basis function (RBF-SVM).

As is obvious from the tables of Fig. 4, the Classification Accuracy (CA) on the "Heart-statlog", "Ionosphere" and "Segmentation" datasets improved in comparison with all the selected methods, among which there are GA-based and ACO-based methods. This shows that FOA could improve the performance of the KNN, J48 and SVM classifiers by reducing the redundant features in these datasets. On the "Sonar", "Wine", "Hepatitis", "Cleveland", "Dermatology" and "Glass" datasets, FSFOA outperforms almost all of the selected methods; where it does not outperform, it holds the second rank. On the "Vehicle" dataset, FSFOA outperforms just one of the methods with the same partitioning and classifier. FSFOA did not show a good performance on the "SRBCT" dataset. "SRBCT" is the only dataset where the number of features is much larger than the number of samples, which makes it difficult to select the proper features for prediction: the number of samples is not sufficient for selecting the more informative features, and applying the traditional methods yields poor results. Also, this dataset is partitioned into 70%-30% training and testing sets, which makes the problem worse, because part of the dataset is ignored during the training phase. This shows that feature selection in large datasets with many features and a limited number of samples is a challenging problem that deserves more research.

Comparing the DR of the methods in Fig. 4, it is obvious that FSFOA could not outperform the selected methods because, as mentioned before, the number of selected features is not involved in the fitness evaluation of each potential solution and classification accuracy alone is considered as the fitness function. For a better illustration of the performance, we show the results graphically in the charts of Fig. 5. For the "Hepatitis" and "Vehicle" datasets the selected methods are compared according to the J48 classifier, and for the "Dermatology", "Sonar", "SRBCT", "Wine", "Heart-statlog", "Ionosphere", "Glass", "Cleveland" and "Segmentation" datasets the KNN classifier is chosen in the graphical comparisons of Fig. 5.

Comparing the results of Fig. 4 and the charts of Fig. 5, FSFOA performs clearly better than the other methods on 3 of the 11 datasets ("Heart-statlog", "Ionosphere" and "Segmentation") according to classification accuracy. On the "Dermatology", "Sonar", "Wine", "Glass", "Cleveland" and "Hepatitis" datasets, FSFOA outperforms many of the other methods and ranks second behind a single method. On the other two datasets FSFOA could not achieve the desirable performance. Among the methods selected for comparison there are methods which employ the well-known GA, PSO and ACO algorithms. These results show that FSFOA has acceptable performance in solving feature selection as a real optimization problem.

6. Conclusion

Feature selection is considered to be an important preprocessing step in machine learning and pattern recognition. Many heuristic and meta-heuristic methods have been proposed to address this problem.

In this article, we have attempted to use the Forest Optimization Algorithm (FOA) for solving the feature selection problem. As FOA was proposed for continuous search space problems, we have adjusted the stages of FOA to the discrete search space of the feature selection problem and proposed the FSFOA algorithm.

In order to investigate the performance of FSFOA, we selected some well-known datasets from the UCI repository and compared the results of FSFOA with other methods, among them GA-, ACO- and PSO-based algorithms. The results of the experiments showed the superiority of our method on most of the selected datasets. In this article, we have used the KNN, SVM and J48 classifiers of the WEKA software to evaluate the fitness of each potential solution, and classification accuracy is considered as our fitness function.

This study shows that FOA is an effective search technique for feature selection problems, but further research is also welcome. In future research, we will investigate the performance of FSFOA on very large datasets with a huge number of features (e.g. over 10,000), because the size of datasets, in both the number of features and the number of instances, keeps growing, and data mining in very large datasets is a big concern. Also, involving the number of selected features in the fitness function, with the aim of improving the dimension reduction rate (DR), will be our future attempt. This can be implemented by a multi-objective fitness function which takes into account the classification accuracy and the number of selected features simultaneously.

Conflict of interest

None declared.
References

[1] David W. Aha, Feature weighting for lazy learning algorithms, in: Huan Liu, Hiroshi Motoda (Eds.), Feature Extraction, Construction and Selection: A Data Mining Perspective, Kluwer Academic Publishers, Massachusetts, 1998, pp. 13-32.
[2] Hussein Almuallim, Thomas G. Dietterich, Learning Boolean concepts in the presence of many irrelevant features, Artif. Intell. 69 (1) (1994) 279-305.
[3] H. Almuallim, T.G. Dietterich, Learning with many irrelevant features, in: Proceedings of the AAAI, vol. 91, July 1991, pp. 547-552.
[4] C. Blake, E. Keogh, C.J. Merz, UCI Repository of Machine Learning Databases, University of California, Irvine, <http://www.ics.uci.edu/~mlearn/MLRepository.html>.
[5] Jose M. Cadenas, M. Carmen Carrido, Raquel Martinez, Feature subset selection filter-wrapper based on low quality data, Expert Syst. Appl. 40 (2013) 6241-6252.
[6] K.J. Cios, G. William Moore, Uniqueness of medical data mining, Artif. Intell. Med. 26 (1) (2002) 1-24.
[7] M.E. ElAlami, A filter model for feature subset selection based on genetic algorithm, Knowl.-Based Syst. 22 (5) (2009) 356-362.
[8] E. Gasca, J.S. Sanchez, R. Alonso, Eliminating redundancy and irrelevance using a new MLP-based feature selection method, Pattern Recognit. 39 (2006) 313-315.
[9] Manizheh Ghaemi, Mohammad-Reza Feizi-Derakhshi, Forest optimization algorithm, Expert Syst. Appl. 41 (15) (2014) 6676-6687.
[10] Iffat A. Gheyas, Leslie S. Smith, Feature subset selection in large dimensionality domains, Pattern Recognit. 43 (2010) 5-13.
[11] Mark A. Hall, Correlation-based Feature Selection for Machine Learning (Ph.D. thesis), Hamilton, New Zealand, 1999.
[12] Mark A. Hall, Correlation-based feature selection for discrete and numeric class machine learning, in: Proceedings of the 17th International Conference on Machine Learning, 2000, pp. 359-366.
[13] Tarek M. Hamdani, Jin-Myung Won, Adel M. Alimi, Fakhri Karray, Hierarchical genetic algorithm with new evaluation function and bi-coded representation for the selection of features considering their confidence rate, Appl. Soft Comput. 11 (2011) 2501-2509.
[14] Q. Hu, X. Che, L. Zhang, D. Yu, Feature evaluation and selection based on neighborhood soft margin, Neurocomputing 73 (10) (2010) 2114-2124.
[15] Q.H. Hu, D. Yu, J.F. Liu, C. Wu, Neighborhood rough set based heterogeneous feature subset selection, Inf. Sci. 178 (2008) 3577-3594.
[16] J. Huang, Y. Cai, X. Xu, A hybrid genetic algorithm for feature selection wrapper based on mutual information, Pattern Recognit. Lett. 28 (2007) 1825-1844.
[17] Md. Monirul Kabir, Md. Shahjahan, Kazuyuki Murase, A new local search based hybrid genetic algorithm for feature selection, Neurocomputing 74 (2011) 2914-2928.
[18] Md. Monirul Kabir, Md. Shahjahan, Kazuyuki Murase, A new hybrid ant colony optimization algorithm for feature selection, Expert Syst. Appl. 39 (2012) 3747-3763.
[19] Ron Kohavi, George H. John, Wrappers for feature subset selection, Artif. Intell. 97 (1-2) (1997) 273-324.
[20] N. Lavrac, Selected techniques for data mining in medicine, Artif. Intell. Med. 16 (1) (1999) 3-23.
[21] S.P. Moustakidis, J.B. Theocharis, SVM-FuzCoC: a novel SVM-based feature selection method using a fuzzy complementary criterion, Pattern Recognit. 43 (2010) 3712-3729.
[22] Shahla Nemati, Mohammad Ehsan Basiri, Nasser Ghasem-Aghaee, Mehdi Hosseinzadeh Aghdam, A novel ACO-GA hybrid algorithm for feature selection in protein function prediction, Expert Syst. Appl. 36 (2009) 12086-12094.
[23] G.A. Papakostas, A.S. Polydoros, D.E. Koulouriotis, V.D. Tourassis, Evolutionary Feature Subset Selection for Pattern Recognition Applications, INTECH Open Access Publisher, 2011.
[24] P. Pudil, J. Novovicova, J. Kittler, Floating search methods in feature selection, Pattern Recognit. Lett. 15 (1994) 1119-1125.
[25] Bart Selman, Carla P. Gomes, Hill-climbing search, in: Encyclopedia of Cognitive Science, 2006.
[26] B. Selman, H.J. Levesque, D.G. Mitchell, A new method for solving hard satisfiability problems, in: Proceedings of the AAAI, vol. 92, July 1992, pp. 440-446.
[27] O. Seral, S. Gunes, Attribute weighting via genetic algorithms for attribute weighted artificial immune system (AWAIS) and its application to heart disease and liver disorders problems, Expert Syst. Appl. 36 (2009) 386-392.
[28] Rahul Karthik Sivagaminathan, Sreeram Ramakrishnan, A hybrid approach for feature subset selection using neural networks and ant colony optimization, Expert Syst. Appl. 33 (2007) 49-60.
[29] S. Tabakhi, P. Moradi, F. Akhlaghian, An unsupervised feature selection algorithm based on ant colony optimization, Eng. Appl. Artif. Intell. 32 (2014) 112-123.
[30] M.A. Tahir, A. Bouridane, F. Kurugollu, Simultaneous feature selection and feature weighting using hybrid tabu search/K-nearest neighbor classifier, Pattern Recognit. Lett. 28 (2007) 438-446.
[31] K.C. Tan, E.J. Teoh, Q. Yu, K.C. Goh, A hybrid evolutionary algorithm for attribute selection in data mining, Expert Syst. Appl. (2009) 8616-8630.
[32] A. Tosun, B. Turhan, A.B. Bener, Feature weighting heuristics for analogy-based effort estimation models, Expert Syst. Appl. 36 (2009) 10325-10333.
[33] I. Triguero, J. Derrac, S. Garcia, F. Herrera, Integrating a differential evolution feature weighting scheme into prototype generation, Neurocomputing 97 (2012) 332-343.
[34] Dietrich Wettschereck, David W. Aha, Takao Mohri, A review and empirical evaluation of feature weighting methods for a class of lazy learning algorithms, Artif. Intell. Rev. 11 (1997) 273-314.
[35] B. Xue, M. Zhang, W.N. Browne, Particle swarm optimisation for feature selection in classification: novel initialisation and updating mechanisms, Appl. Soft Comput. 18 (2013) 261-276.
[36] Zhi-Min Yang, Jun-Yun He, Yuan-Hai Shao, Feature selection based on linear twin support vector machine, Proc. Comput. Sci. 17 (2013) 1039-1046.
[37] J.Y. Yeh, T.H. Wu, C.W. Tsao, Using data mining techniques to predict hospitalization of hemodialysis patients, Decis. Support Syst. 50 (2) (2011) 439-448.
[38] Y. Zhang, A. Yang, C. Xiong, T. Wang, Z. Zhang, Feature selection using data envelopment analysis, Knowl.-Based Syst. 64 (2014) 70-80.
[39] Mingyuan Zhao, Chong Fu, Luping Ji, Ke Tang, Mingtian Zhou, Feature selection and parameter optimization for support vector machines: a new approach based on genetic algorithm with feature chromosomes, Expert Syst. Appl. 38 (5) (2011) 5197-5204.
[40] Wenzhi Zhu, Gangquan Si, Yanbin Zhang, Jingcheng Wang, Neighborhood effective information ratio for hybrid feature evaluation and selection, Neurocomputing 99 (2013) 25-37.
[41] Zexuan Zhu, Yew-Soon Ong, Manoranjan Dash, Wrapper-filter feature selection algorithm using a memetic framework, IEEE Trans. Syst. Man Cybern. 37 (2007) 70-76.
[42] Alexandros Kalousis, Julien Prados, Melanie Hilario, Stability of feature selection algorithms: a study on high-dimensional spaces, Knowl. Inf. Syst. 12 (1) (2007) 95-116.
Manizheh Ghaemi received her B.S. and M.S. degrees in Computer Science from the University of Tabriz, Iran. She is now a Ph.D. student in Artificial Intelligence at K.N. Toosi University of Technology, Iran. Her research interests include nature-based evolutionary algorithms, optimization and machine learning algorithms.
Mohammad-Reza Feizi-Derakhshi received his B.S. in Software Engineering from the University of Isfahan. He received his M.S. and Ph.D. in AI from the Iran University of
Science and Technology. He is currently a faculty member at the University of Tabriz. His research interests include: NLP, optimization algorithms and intelligent databases.