SVOH: Rigorous Selection Approach For Optimal Hyperparameter Values
Abstract:- The problem we address in this paper is a model selection problem. We consider the k-fold cross-validation (KCV) technique, applied to the Gaussian support vector machine (SVM) classification algorithm. In the cross-validation process, the value of k, the number of subsets, is generally chosen and set a priori (without any experiment). However, the value of k affects the choice of the best compromise between the estimation error and the approximation error of the model. In this way, the value of k can severely influence the optimal values of the SVM classifier's hyperparameters and consequently affect the performance of the selected model and its ability to generalize.

In this work, we propose a rigorous approach, called SVOH (Selection of Optimal Hyperparameter Values), for finding the values of the hyperparameters of the Gaussian SVM in the context of protein-protein interaction (PPI) prediction, where the pairs of proteins that interact must be separated from the pairs that do not. The proposed approach treats the value of k, the number of subsets, as an influential parameter of the model and therefore learns an optimal value of k.

Keywords:- Machine Learning, Model Selection, Cross-Validation, Prediction of Protein-Protein Interactions

I. INTRODUCTION

The support vector machine (SVM) is one of the most widely used algorithms for classification tasks, particularly for the classification of protein-protein interactions [1]–[3]. SVM belongs to the field of artificial neural networks (ANN) [4] but is characterised by the solid foundations of statistical learning theory. SVMs are learned by searching for a set of parameters obtained by solving a constrained convex quadratic programming problem (CCQP), for which a number of efficient techniques have been developed. The search for optimal parameters does not, however, complete the learning process, because there is a set of additional variables, the hyperparameters, which must be set to achieve optimal classification performance; for ANNs, for example, a hyperparameter is the number of hidden nodes. In the Gaussian SVM framework, these are the regularisation parameter C and the kernel parameter γ. This setting is not trivial and is an open research problem [5]–[8]. The process of finding the best hyperparameters is generally referred to in the machine learning literature as the model selection phase [9] and is strictly linked to the evaluation of the SVM's generalisation capacity or, in other words, the error rate that the SVM can achieve on new (unknown) data. In fact, it is common practice to select the optimal SVM (i.e. the optimal hyperparameters) by choosing the one with the lowest generalisation error. The methods for carrying out the model selection phase can be divided into two categories according to [9]: theoretical methods [10] and methods based on resampling techniques [11].

Theoretical methods provide in-depth information about classification algorithms but are often too difficult to apply or to compute to be of any practical use. On the other hand, as mentioned by [5], practitioners have adopted procedures based on resampling techniques, which work well in practice but offer no theoretical guarantee on the generalisation error. One of the most popular resampling techniques is the k-fold cross-validation (KCV) procedure [8], which is simple, effective and reliable. The KCV technique consists of dividing a data set into k independent subsets. All but one of these subsets are used to train a classifier, while the remaining subset is used to evaluate the generalisation error. After training, it is possible to calculate an upper bound on the generalisation error for each of the trained classifiers.

In the literature, the value of k is usually fixed at 5 or 10. Choosing a fixed number of subsets for cross-validation can produce a model with high bias and variance [9]. Cross-validation takes the average of several estimates of the hold-out risk corresponding to different splits of the data. In [5] it can be verified that the value of k influences the stability of the mean error. Still according to [9], model selection performance with cross-validation is generally optimal when the variance is as low as possible. This variance generally decreases as the number k of subsets increases, for a fixed training sample size n. When k is fixed, the variance of the cross-validation estimator also depends on n; in fact, in [8], we can see that this variance depends strongly on the training set used. The choice of k therefore influences the variance of the cross-validation estimator and, according to [6], [12], can have a significant impact on the search for the optimal values of the hyperparameters.
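As a minimal illustration of this sensitivity (not part of the original study), the sketch below estimates the cross-validated accuracy of a Gaussian SVM for every k from 3 to 10; scikit-learn, the synthetic dataset and the toy hyperparameter values are assumptions made only for the example.

```python
# Minimal sketch (not from the paper): how the k-fold cross-validation estimate
# of a Gaussian SVM's accuracy varies with the number of subsets k.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# Synthetic stand-in for a real training set.
X, y = make_classification(n_samples=600, n_features=20, random_state=0)

for k in range(3, 11):  # k = 3, ..., 10
    scores = cross_val_score(SVC(kernel="rbf", C=10, gamma=0.01), X, y, cv=k)
    print(f"k={k:2d}  mean accuracy={scores.mean():.3f}  std={scores.std():.3f}")
```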
In the following section, we present a new approach called SVOH for selecting the optimal values of the hyperparameters of the Gaussian SVM. We first present the problem to be solved and then show how the SVOH algorithm works.

II. PROPOSED APPROACH
A. Problem to be Solved
In order not to fix the value of k when searching for the optimal values of the Gaussian SVM hyperparameters with k-fold cross-validation, we propose to consider several possible numbers of subsets into which the original training set can be divided. The aim is to choose a better cross-validation estimation procedure, one with the lowest bias and variance, allowing a good combination of hyperparameters (C, γ) to be identified so that the SVM classifier has a low generalisation error and predicts unknown data with a higher accuracy rate.

For the proposed approach, we consider the number k as a hyperparameter, as in [6], which can take any value in the set {3, ..., 10}. The smallest value of k is set to 3 because, for each subset, the training data must represent more than 60% of the training set, as shown by [13]; with k = 3 the training portion is (k − 1)/k ≈ 67% of the data, whereas k = 2 would leave only 50%. The highest value of k is set to 10 to remain within the range used by empirical practice. This limited range of test values for k (up to 10) also means that, even when the training set is large, the technique is not very computationally intensive. Assuming that there are q parameters and that each of them has m distinct values, the computational complexity of the grid search increases exponentially, at a rate of O(m^q), as shown by [14], [15]. In addition, in [6], we can see that more than 10 different databases have produced an optimal value of k smaller than 10. The set of parameters to optimise in our case therefore becomes the triplet (C, γ, k), given that our decision function f uses a Gaussian kernel which itself operates with the parameter pair (C, γ).

B. Functioning of the SVOH Algorithm
Let {C} and {γ} correspond respectively to the set of values for parameter C and the set of values for parameter γ. Let DZ be our training set of Z observations and f our SVM model obtained with the hyperparameters (C, γ); let DZE denote the Z(k−1)/k observations used for training after subdividing DZ into k subsets, and DZS the remaining Z/k observations reserved for testing. The algorithm takes as input DZ, {C} and {γ}. For each number of subdivisions k ∈ {3, ..., 10} of the training set DZ, the algorithm trains a classifier f using the values of {C} and {γ} on DZE, then evaluates the correctness rate of f on DZS. Finally, the algorithm selects the triplet (C, γ, k) that gives the highest correctness rate. A pseudo-code of the SVOH algorithm is shown below:
SVOH Algorithm
Input:  DZ: learning set
        {C}: set of values for C
        {γ}: set of values for γ
Output: {k*, C*, γ*}
1: f = ∅
2: for C ∈ {C}, γ ∈ {γ}, k ∈ {3, ..., 10} do
3:     DZE, DZS = subdivision(DZ, k)
4:     fE = SVM(DZE, C, γ)
5:     Er = evaluate the accuracy rate(fE, DZS)
6:     f = f ∪ {Er}
7: end for
8: {k*, C*, γ*} = the combination with the best accuracy rate in f
9: Return {k*, C*, γ*}
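A possible Python rendering of this pseudo-code is sketched below; it is not the authors' implementation. It assumes scikit-learn, reads the mean accuracy over the k folds as the score Er of a combination (consistent with the average correctness rates reported in Table 2), and uses illustrative names such as svoh.

```python
# Sketch of the SVOH search (assumptions: scikit-learn, mean accuracy over the
# k folds as the score of a (C, gamma, k) combination).
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.model_selection import StratifiedKFold
from sklearn.svm import SVC

def svoh(X, y, C_grid, gamma_grid, k_values=range(3, 11)):
    best = (None, None, None, -np.inf)  # (k*, C*, gamma*, best accuracy)
    for k in k_values:
        folds = StratifiedKFold(n_splits=k, shuffle=True, random_state=0)
        for C in C_grid:
            for gamma in gamma_grid:
                rates = []
                for train_idx, test_idx in folds.split(X, y):
                    f_E = SVC(kernel="rbf", C=C, gamma=gamma)   # train on DZE
                    f_E.fit(X[train_idx], y[train_idx])
                    rates.append(accuracy_score(                 # evaluate on DZS
                        y[test_idx], f_E.predict(X[test_idx])))
                Er = float(np.mean(rates))                       # score of (C, gamma, k)
                if Er > best[3]:                                 # keep the best triplet
                    best = (k, C, gamma, Er)
    return best                                                  # (k*, C*, gamma*, accuracy)
```

Called with the grids used later in Table 1, this sketch returns the selected triplet together with its average accuracy rate.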
III. MATERIALS AND METHODS

A. Learning Data
In this work, the data used for the experiments come from the work of Kopoin et al. [3], [16]. Kopoin et al. used the BP (Bigram Physicochemical) feature extraction technique to produce numerical data from three protein-protein interaction (PPI) reference datasets [17]. PPI refers to whether two proteins interact: in the case of interaction we speak of a positive PPI, and in the opposite case of a negative PPI. The first dataset is the HPRD PPI data [1], consisting of 10,000 samples divided into 5,000 positive PPI pairs and 5,000 negative PPI pairs, as in [2], [18], [19]. The S. Cerevisiae PPI dataset [20], [1], consisting of 11,188 samples (5,594 positive pairs and 5,594 negative pairs), and the H. Pylori PPI dataset [21], consisting of 2,916 samples (1,458 positive pairs and 1,458 negative pairs), were also used. Four other PPI datasets, also used for interaction prediction, were used to test the SVOH approach. The first is the Homo sapiens (H. sapiens) dataset, collected from the HPRD database as described in [22]; it contains 8,161 protein pairs, including 3,899 positive PPI pairs and 4,262 negative PPI pairs. The second is the Escherichia coli (E. coli) dataset, consisting solely of 6,594 positive pairs [23]. The third is the C. elegans dataset [24], which contains 4,013 positive pairs. Finally, the fourth is the M. musculus dataset, which contains 313 positive pairs [25].

B. SVM Algorithm
The PPI prediction phase starts from an optimal SVM, obtained by selecting the optimal hyperparameters, i.e. those that give the SVM the lowest generalisation error.

Consider a learning set $\mathcal{Z} = \{(x_i, y_i),\ i \in [1, n]\}$, where each vector $x_i \in \mathbb{R}^p$ is assigned a label $y_i \in \{-1, +1\}$. The relationship between x and y is encapsulated in an unknown distribution p(x, y), which is the source of the data. The aim of learning is to find a function $f\colon \mathbb{R}^p \to Y_f \subset \mathbb{R}$ which approximates this relationship. The SVM algorithm [26] can be used for this purpose, where the classifier is identified during the hyperparameter search phase by solving the following convex quadratic problem:

$\max_{\alpha}\ \sum_{i=1}^{n} \alpha_i - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \alpha_j\, y_i y_j\, h(x_i, x_j)$

$\text{subject to } 0 \le \alpha_i \le C,\ i = 1, \dots, n, \qquad \sum_{i=1}^{n} \alpha_i y_i = 0$

where the α_i are the Lagrange multipliers, C is one of the hyperparameters and controls the trade-off between the margin and the misclassification error, and h(x_i, x_j) is the kernel function. The kernel considered here is the Gaussian kernel. The Gaussian kernel is derived from the RBF (Radial Basis Function) and depends on the Euclidean distance between the vectors in the input space. It is defined as follows:

$h(x_i, x_j) = \exp\!\left(-\frac{\lVert x_i - x_j \rVert^2}{2\gamma^2}\right)$

with γ an additional hyperparameter which determines the extent of the influence of a single training example [18]. Solving the convex quadratic problem yields a classifier defined as follows:

$f(x) = \sum_{i=1}^{n} \alpha_i y_i\, h(x_i, x) + b$

where b represents the bias.

The two hyperparameters C and γ are therefore the influential parameters of the SVM classifier, allowing it to estimate the generalization error.
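For concreteness, a small sketch of this kernel and of the resulting decision function is given below (an illustration, not the authors' code); note that scikit-learn's gamma parameter corresponds to 1/(2γ²) under the parameterisation used above.

```python
# Sketch (numpy only) of the Gaussian kernel and of the SVM decision function,
# written with the paper's parameterisation h(x_i, x_j) = exp(-||x_i - x_j||^2 / (2*gamma^2)).
import numpy as np

def gaussian_kernel(x_i, x_j, gamma):
    return np.exp(-np.linalg.norm(x_i - x_j) ** 2 / (2.0 * gamma ** 2))

def decision_function(x, support_vectors, alpha_times_y, b, gamma):
    # f(x) = sum_i alpha_i * y_i * h(x_i, x) + b, summed over the support vectors
    return sum(a * gaussian_kernel(x_i, x, gamma)
               for x_i, a in zip(support_vectors, alpha_times_y)) + b
```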
C. Bigram Physicochemical Method
The Bigram Physicochemical (BP) method is a feature extraction method based on protein sequences. The BP method calculates the bigram of two amino acids (the frequency of two amino acids) using the values of a distance function obtained from the hydrophobic and hydrophilic properties of the amino acids [27], stored in a matrix called the physicochemical matrix (MSP).

Consider a protein P composed of L amino acid residues:

$R_1 R_2 R_3 \dots R_{L-1} R_L$

The value of the bigram between amino acids i and j, represented by the frequency of occurrence of the transition from the amino acid at position i to the amino acid at position j, is calculated as follows:

$BP_{i,j} = \sum_{k=1}^{L-1} C_{k,i} \times C_{k+1,j}, \qquad 1 \le i \le 20,\ 1 \le j \le 20$

where C_{k,i} is the value of the MSP in row k and column i and C_{k+1,j} the value of the MSP in row k + 1 and column j, calculated as follows:

$C_{k,i} = \frac{1}{j}\, f(R_i, R_j)$

with

$f(R_i, R_j) = H_1^*(R_i) \times H_1^*(R_j) + H_2^*(R_i) \times H_2^*(R_j), \qquad 1 \le i \le L,\ 1 \le j \le 20$

where H_1^*(R_i) and H_2^*(R_i) are respectively the normalised hydrophobicity and hydrophilicity functions of amino acid i, obtained as follows:

$H_1^*(R_i) = \dfrac{H_1^0(R_i) - \varphi_1}{\sqrt{\sum_{i=1}^{20} \left[H_1^0(R_i) - \varphi_1\right]^2 / 20}}, \qquad H_2^*(R_i) = \dfrac{H_2^0(R_i) - \varphi_2}{\sqrt{\sum_{i=1}^{20} \left[H_2^0(R_i) - \varphi_2\right]^2 / 20}}$

with H_1^0(R_i) the hydrophobicity value of amino acid i, H_2^0(R_i) the hydrophilicity value of amino acid i, and φ_1 and φ_2 respectively the averages of the hydrophobicity and hydrophilicity values of the 20 amino acids.

The BP method applied to a protein sequence generates a 400-D vector as follows:

$V_{BP} = [\Phi_1, \Phi_2, \Phi_3, \dots, \Phi_\mu, \dots, \Phi_\Psi]^T$

where Ψ = r × s = 400 (r = s = 20) is the dimensionality of the characteristic vector V_BP. To represent a pair of proteins, the vectors of the two proteins are concatenated, resulting in a final 800-D vector.
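The following sketch is one possible reading of the BP equations above and is not the authors' code: the hydrophobicity and hydrophilicity tables are placeholders to be filled with real property values, the normalising factor in the definition of C_{k,i} is omitted, and the names bp_vector and pair_vector are illustrative.

```python
# One possible reading of the BP equations (a sketch, not the authors' code).
# HYDROPHOBICITY and HYDROPHILICITY are placeholder dictionaries mapping the
# 20 standard amino-acid letters to property values; real tables must be supplied.
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
HYDROPHOBICITY = {aa: 0.0 for aa in AMINO_ACIDS}   # placeholder H1^0 values
HYDROPHILICITY = {aa: 0.0 for aa in AMINO_ACIDS}   # placeholder H2^0 values

def normalise(table):
    # H*(R_i) = (H^0(R_i) - mean) / sqrt(sum((H^0 - mean)^2) / 20)
    values = np.array([table[aa] for aa in AMINO_ACIDS])
    centred = values - values.mean()
    scale = np.sqrt((centred ** 2).sum() / 20.0) or 1.0
    return dict(zip(AMINO_ACIDS, centred / scale))

def bp_vector(sequence):
    """400-D bigram physicochemical vector for one protein sequence
    (the sequence is assumed to use the 20 standard amino-acid letters)."""
    h1, h2 = normalise(HYDROPHOBICITY), normalise(HYDROPHILICITY)
    # MSP matrix: row k = residue position, column = amino-acid type,
    # C[k, j] = H1*(R_k) * H1*(A_j) + H2*(R_k) * H2*(A_j)
    C = np.array([[h1[r] * h1[a] + h2[r] * h2[a] for a in AMINO_ACIDS]
                  for r in sequence])
    # BP[i, j] = sum_k C[k, i] * C[k+1, j]
    bp = C[:-1].T @ C[1:]
    return bp.flatten()                              # Psi = 20 x 20 = 400

def pair_vector(seq_a, seq_b):
    # A protein pair is represented by concatenation, giving an 800-D vector.
    return np.concatenate([bp_vector(seq_a), bp_vector(seq_b)])
```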
IV. RESULTS

The HPRD PPI dataset was used as training data, while the other two datasets, S. Cerevisiae and H. Pylori, were used as test data.

A. Evaluation Metrics Used
To evaluate the robustness of the proposed approach, we used the metrics generally employed to measure the performance of a classifier: Accuracy (Acc), Precision (Pre), Sensitivity (Sen) and AUC. Some of these measures are defined as follows:

$Acc = \dfrac{TP + TN}{TP + TN + FP + FN}, \qquad Pre = \dfrac{TP}{TP + FP}, \qquad Sen = \dfrac{TP}{TP + FN}$

TP (true positives) is the number of PPIs predicted positive that really interact, FP (false positives) is the number of PPIs predicted positive that are really negative, TN (true negatives) is the number of PPIs predicted negative that are really negative, and FN (false negatives) is the number of PPIs predicted negative that are really positive. The ROC curve and the AUC value graphically illustrate the performance of a binary classification system.
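A short sketch of these measures, computed from a confusion matrix, is given below (an illustration only; scikit-learn's confusion_matrix and the −1/+1 label coding of Section III.B are assumed).

```python
# Sketch: Acc, Pre and Sen computed from the confusion matrix of a binary
# classifier whose labels are coded -1 (negative PPI) and +1 (positive PPI).
from sklearn.metrics import confusion_matrix

def acc_pre_sen(y_true, y_pred):
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[-1, 1]).ravel()
    acc = (tp + tn) / (tp + tn + fp + fn)
    pre = tp / (tp + fp)
    sen = tp / (tp + fn)
    return acc, pre, sen
```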
B. Train Results
The training was conducted on the HPRD data and consisted in searching, with the SVOH algorithm, for optimal values of k, C and γ among the grid of potential values given in Table 1. We used the accuracy rate as the performance metric to find the optimal hyperparameter values. The generalisability of the trained model is assessed on the S. Cerevisiae and H. Pylori datasets.

Table 1: Range of Hyperparameter Values
Hyperparameter    Grid values
C                 {1; 3; 10; 32; 50; 100}
γ                 {10^-4; 10^-3; 10^-2; 10^-1; 1}
k                 {3; 4; 5; 6; 7; 8; 9; 10}

Application of the SVOH algorithm yielded the following optimal hyperparameter values: (C*, γ*, k*) = (32; 0.01; 7). Table 2 shows the best values of the average correctness rate for different combinations of the (C, γ, k) triplet.

Table 2: Results of the Accuracy Rate after Application of the SVOH Approach
k     (C, γ)           Acc (%)
3     (10; 0.1)        91.92
4     (50; 0.01)       92.36
5     (100; 0.001)     92.70
6     (10; 0.001)      91.13
7     (32; 0.01)       93.69
8     (32; 0.001)      92.36
9     (100; 0.1)       92.21
10    (100; 0.01)      92.49

The results in Table 2 show that for values of k ∈ {3; 4; 5; 6}, the accuracy rates lie between 91% and 93%, while from k = 7 onwards they lie between 92% and 94%. On the whole, the accuracy rates are roughly equal; however, among k = {5; 7; 10}, where 5 and 10 are the a priori values, the model formed with k = 7 subsets obtains the best accuracy score, 93.69%, against 92.70% for k = 5 and 92.49% for k = 10. These first results show that the best performance of the SVM model is obtained for the triplet (k, C, γ) = (7; 32; 0.01).

In Table 3, we compare the scores obtained for the subdivision value k = 7 determined by the SVOH approach with those obtained for the values k = {5; 10}, which are the values generally applied, on the other metrics: precision, sensitivity and AUC.

Table 3: Results for Other Metrics
k     Pre (%)    Sen (%)    AUC (%)
5     92.90      92.15      96.36
7     94.09      93.16      97.88
10    92.87      92.67      95.58

The scores obtained for the a priori values of the number of subsets are approximately the same on all metrics. For a subdivision of the training set into k = 5 subsets, the hyperparameter values obtained are (C, γ) = (100; 0.001); the scores on the precision, sensitivity and AUC metrics are 92.90%, 92.15% and 96.36% respectively. For a subdivision k = 10, the hyperparameter values obtained are (C, γ) = (100; 0.01); the scores on the same metrics in Table 3 are 92.87%, 92.67% and 95.58% respectively. However, the rates obtained for the subdivision k = 7, with hyperparameter values (C, γ) = (32; 0.01), are 94.09%, 93.16% and 97.88% respectively. In addition, although the difference between the rates is not very large, we note that subdividing the training set into 7 subsets improves the accuracy rate by around 1% compared with the rates obtained with the a priori subdivision values (see Table 2). We also observe better scores on the precision and sensitivity metrics, with an average improvement of more than 0.7% over those obtained with the a priori values (Table 3). The results show that the best rates are obtained with the k = 7 subdivision, i.e. the one determined by the SVOH approach.

C. Other Results

Results with Other PPI Datasets
Table 4 shows the results obtained on datasets other than the training data.

Table 4: Results on Different PPI Datasets
PPI Data        (k*, C*, γ*)      Acc (%)
H. sapiens      (4; 32; 0.01)     90.92
E. coli         (5; 10; 0.001)    90.36
C. elegans      (7; 50; 0.001)    88.49
M. musculus     (6; 10; 0.1)      74.43

The results in Table 4 indicate that the hyperparameter triplets (k*, C*, γ*) that achieve the best performance on the H. sapiens, E. coli, C. elegans and M. musculus datasets are (4; 32; 0.01), (5; 10; 0.001), (7; 50; 0.001) and (6; 10; 0.1), respectively. We can see that, apart from the E. coli data, where the best performance is obtained with an a priori value for the subdivision of the training set (k = 5), the other datasets reach their best performance for subdivision values that differ from the usual ones. These results show that the number of subdivisions of the training set matters for finding the optimal values of the SVM classifier's hyperparameters.

Results Obtained with the ANN Algorithm
The architecture of an artificial neural network (ANN) [28] is a multi-layer stack of simple modules. The input layer receives the data, then the information in the data is transformed in a non-linear way through several hidden layers. The average gradient [29] is calculated and the weights are adjusted accordingly, before the final outputs are computed in the output layer. For example, consider learning an artificial neural network with λ hidden layers, where each layer computes H^α, α ∈ [1, ..., λ]. The first layer receives the network inputs, while the last layer returns the outputs H^λ as a posteriori probabilities. Let {N^1, ..., N^α, ..., N^λ} be the numbers of neurons of the layers. The intermediate layers return H^α = {h_i^α}, where h_i^α represents the output of the i-th neuron of H^α. This output is determined according to the following expression:

$h_i^{\alpha} = f^{\lambda}\!\left(\sum_{j=1}^{N^{\alpha-1}} \omega_{i,j}^{\alpha}\, h_j^{\alpha-1} + b^{\alpha-1}\right), \qquad \forall i \in \{1, \dots, N^{\alpha}\},\ \forall j \in \{1, \dots, N^{\alpha-1}\}$

where the ω_{i,j}^α are the weights, b^{α−1} is the bias (one per layer) and f^λ is a non-linear function applied to the weighted sum.
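As an illustration of this layer computation (a sketch under the assumption of a simple fully connected network, not the authors' architecture):

```python
# Sketch (numpy) of the forward pass h^alpha = f(W^alpha @ h^(alpha-1) + b^(alpha-1)).
import numpy as np

def forward(x, weights, biases, f=np.tanh):
    """weights[a] has shape (N^a, N^(a-1)); biases[a] has shape (N^a,)."""
    h = x
    for W, b in zip(weights, biases):
        h = f(W @ h + b)   # output of all neurons of the current layer
    return h               # H^lambda; a sigmoid/softmax would turn it into probabilities
```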
REFERENCES

[5] D. Anguita, A. Ghio, S. Ridella, and D. Sterpi, "K-Fold Cross Validation for Error Rate Estimate in Support Vector Machines," in DMIN, 2009, pp. 291-297.
[6] D. Anguita, L. Ghelardoni, A. Ghio, L. Oneto, and S. Ridella, "The 'K' in K-fold Cross Validation," in ESANN, 2012, pp. 441-446.
[7] J. D. Rodriguez, A. Perez, and J. A. Lozano, "Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 3, pp. 569-575, Mar. 2010, doi: 10.1109/TPAMI.2009.187.
[8] Y. Bengio and Y. Grandvalet, "No unbiased estimator of the variance of k-fold cross-validation," J. Mach. Learn. Res., vol. 5, pp. 1089-1105, 2004.
[9] S. Arlot and A. Celisse, "A survey of cross-validation procedures for model selection," Stat. Surv., vol. 4, pp. 40-79, Jan. 2010, doi: 10.1214/09-SS054.
[10] V. Vapnik, The Nature of Statistical Learning Theory. Springer Science & Business Media, 2013.
[11] J. L. Rodgers, "The bootstrap, the jackknife, and the randomization test: A sampling taxonomy," Multivar. Behav. Res., vol. 34, no. 4, pp. 441-456, 1999.
[12] C.-W. Hsu, C.-C. Chang, and C.-J. Lin, A practical guide to support vector classification. Taipei, 2003.
[13] G. A. Y. Laura, "Algorithme de descente du gradient stochastique," 2015.
[14] J. A. A. Brito, F. E. McNeill, C. E. Webber, and D. R. Chettle, "Grid search: an innovative method for the estimation of the rates of lead exchange between body compartments," J. Environ. Monit., vol. 7, no. 3, pp. 241-247, Feb. 2005, doi: 10.1039/B416054A.
[15] L. Yang and A. Shami, "On hyperparameter optimization of machine learning algorithms: Theory and practice," Neurocomputing, vol. 415, pp. 295-316, Nov. 2020, doi: 10.1016/j.neucom.2020.07.061.
[16] C. N. Kopoin, A. K. Atiampo, B. G. N'Guessan, and M. Babri, "Prediction of Protein-Protein Interactions from Sequences using a Correlation Matrix of the Physicochemical Properties of Amino Acids," Int. J. Comput. Sci. Netw. Secur., vol. 21, no. 3, pp. 41-47, Mar. 2021, doi: 10.22937/IJCSNS.2021.21.3.6.
[17] J. R. Bock and D. A. Gough, "Predicting protein–protein interactions from primary structure," Bioinformatics, vol. 17, no. 5, pp. 455-460, May 2001, doi: 10.1093/bioinformatics/17.5.455.
[18] Y. E. Göktepe and H. Kodaz, "Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method," Neurocomputing, vol. 303, pp. 68-74, Aug. 2018, doi: 10.1016/j.neucom.2018.03.062.
[19] Z.-H. You, Y.-K. Lei, L. Zhu, J. Xia, and B. Wang, "Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis," BMC Bioinformatics, vol. 14, no. S8, p. S10, May 2013, doi: 10.1186/1471-2105-14-S8-S10.
[20] Z.-H. You, S. Li, X. Gao, X. Luo, and Z. Ji, "Large-Scale Protein-Protein Interactions Detection by Integrating Big Biosensing Data with Computational Model," BioMed Research International. Accessed: Jan. 5, 2019. [Online]. Available: https://www.hindawi.com/journals/bmri/2014/598129/abs/
[21] J. Martin, "Prédiction de la structure locale des protéines par des modèles de chaînes de Markov cachées," PhD Thesis, Citeseer, 2005.
[22] C.-H. Huang, H.-S. Peng, and K.-L. Ng, "Prediction of Cancer Proteins by Integrating Protein Interaction, Domain Frequency, and Domain Interaction Data Using Machine Learning Algorithms," BioMed Research International. Accessed: May 28, 2018. [Online]. Available: https://www.hindawi.com/journals/bmri/2015/312047/abs/
[23] M. Riley, "Functions of the Gene Products of Escherichia coli," Microbiol. Rev., vol. 57, p. 91, 1993.
[24] X.-T. Huang, Y. Zhu, L. L. H. Chan, Z. Zhao, and H. Yan, "An integrative C. elegans protein–protein interaction network with reliability assessment based on a probabilistic graphical model," Mol. Biosyst., vol. 12, no. 1, pp. 85-92, 2016.
[25] Y. Z. Zhou, Y. Gao, and Y. Y. Zheng, "Prediction of Protein-Protein Interactions Using Local Description of Amino Acid Sequence," in Advances in Computer Science and Education Applications, vol. 202, M. Zhou and H. Tan, Eds., Berlin, Heidelberg: Springer Berlin Heidelberg, 2011, pp. 254-262, doi: 10.1007/978-3-642-22456-0_37.
[26] V. N. Vapnik, "An overview of statistical learning theory," IEEE Trans. Neural Netw., vol. 10, no. 5, pp. 988-999, 1999.
[27] C. J. van Oss, "Hydrophobicity and hydrophilicity of biosurfaces," Curr. Opin. Colloid Interface Sci., vol. 2, no. 5, pp. 503-512, 1997.
[28] P. Wira, "Réseaux de neurones artificiels : architectures et applications," Cours en ligne, Univ. Haute-Alsace, 2009.
[29] L. Bottou, "Stochastic gradient descent tricks," in Neural Networks: Tricks of the Trade, Springer, 2012, pp. 421-436.
[30] X. Cao, W. Zhang, and Y. Yu, "A Bootstrapping Framework With Interactive Information Modeling for Network Alignment," IEEE Access, vol. 6, pp. 13685-13696, 2018, doi: 10.1109/ACCESS.2018.2811721.