Enhancing the decoding accuracy of EEG signals by the introduction of anchored-STFT and adversarial data augmentation method

1Faculty of Medicine, Department of Neurosurgery, University Hospital Knappschaftskrankenhaus Bochum GmbH, Bochum, Germany. 2Institut für Neuroinformatik, Ruhr University Bochum, Bochum, Germany. 3Department of Computer Science, Ruhr-West University of Applied Science, Mülheim an der Ruhr, Germany. 4Department of Electrical Engineering and Information Technology, Ruhr-University Bochum, Bochum, Germany. 5These authors contributed equally: Omair Ali and Muhammad Saif-ur-Rehman. *email: [email protected]
A brain computer interface (BCI) is used to translate neural signals into command signals to control an extra-
corporeal robotic device1. Henceforth, a BCI establishes an alternative pathway of communication and control
between the user and an external machine. The successful translation of neural signals into command signals
is vital in the rehabilitation of physically disabled people2–7. The first step is to record neural signals from the areas of the brain which process the user's intent3,8–13. These neural signals are recorded either by invasive4,5 or non-invasive methods8,12,13. Invasive methods implant electrodes in the brain at the area of interest, whereas most non-invasive BCI systems use EEG signals, i.e., the electrical brain activity recorded from electrodes placed on the scalp. In the next stage, the recorded signals are digitized and preprocessed using
digital signal processors (DSPs). The preprocessed signals are then utilized to extract feature vectors, which are
further fed to a decoding algorithm to finally map the given neural activity into corresponding intended actions.
The output of the decoding algorithm is then transformed into control signals to control the external device.
In this study, we focus on non-invasive BCI systems using EEG signals to decode movement related
information14. Movement-related signals that are generated in the motor cortex by merely imagining movements, without any overt limb movement, are called motor imagery (MI) signals15–17. Classifying MI-EEG signals is challenging for two main reasons: they have a low signal-to-noise ratio, and they are non-linear and non-stationary.
The successful classification of a MI-EEG signal into a corresponding control signal mainly depends on
feature extraction techniques and machine learning algorithms. The current state-of-the-art feature extraction
algorithms include common spatial patterns (CSP)9,12, the short-time Fourier transform (STFT)15 and the wavelet transform (WT)16. The conventional classifiers used to classify EEG signals12,18,19 include linear discriminant analysis (LDA)17, Bayesian classifiers20 and support vector machines (SVM)2,21.
Recently, deep-learning algorithms have produced state-of-the-art results in several computer vision tasks22,23. Deep learning has also gained popularity in BCI and spike sorting studies24–26. In27, a deep belief network (DBN) outperformed an SVM in the classification of MI-EEG tasks. In another study28, a DBN was used to detect anomalies in EEG signals. In29, a DBN was also used to extract feature vectors for the classification algorithm. Convolutional neural networks (CNNs) have also been used successfully for decoding in BCI applications. In30, a CNN was employed to classify MI-EEG signals. To model cognitive events from EEG signals, a novel multi-dimensional feature extraction technique using recurrent convolutional neural networks was proposed in31.
Today, algorithms based on the CNN architecture are among the most successful algorithms in image rec-
ognition tasks. One reason behind this success is the translation invariance of CNNs. Therefore, in a few BCI
studies, algorithms that convert the EEG signal into an image representation have been proposed. In15, the information about location, time, and frequency is combined using the short-time Fourier transform (STFT) to convert an EEG signal into an image structure. In16, the MI-EEG signal is transformed into an image using a wavelet transform, which is then used by a CNN for the classification of the signal. In32, a hybrid-scale CNN architecture is presented for
MI-EEG classification, which extracts the features from different frequency bands using multiple kernel scales.
Furthermore, Zhang et al.33 reported the current state-of-the-art results for MI-EEG signal classification. There, the authors presented a deep learning model named EEG-Inception, which uses inception layers for feature extraction; it takes the raw EEG signals as inputs and maps them to intended actions.
In this study, we present a pipeline for MI-EEG classification, which outperforms the current state-of-
the-art studies on two publicly available datasets. The contributions of this study are as follows:
1. Conventional STFT uses a fixed-length window for the mapping of a time-domain signal into the frequency domain and consequently imposes a trade-off between temporal and spectral resolution, which is critical for feature extraction. Therefore, an extension of the short-time Fourier transform (STFT) that uses multiple windows of variable sizes for the transformation, called anchored-STFT, is proposed for better feature extraction.
2. Obtaining large, labeled datasets is still a challenge in training deep learning models for BCI applications. Therefore, a generative model-based data augmentation method called gradient norm adversarial augmentation (GNAA) is proposed, which enhances both the robustness and the classification accuracy of the classifier.
3. Since accurate predictions are critical for BCI applications, a shallow CNN-based architecture with few trainable parameters, called Skip-Net, is proposed, which enhances the classification accuracy and avoids overfitting by adding a skip connection to a shallow CNN architecture.
The proposed pipeline outperforms the current state-of-the-art studies by achieving average classification accuracies of 89.5%, 81.8% and 76.0% on different data distributions of BCI Competition IV dataset 2b. It also outperforms the state-of-the-art studies on BCI Competition IV dataset 2a across its different data distributions, achieving average classification accuracies of 85.4%, 69.1% and 80.9%.
Figure 1. The workflow of the MI-EEG signal classification process in this study. Features are extracted from raw EEG signals using anchored-STFT. During training, the GNAA method is applied to the extracted features to generate adversarial inputs and to enlarge the training data for the Skip-Net algorithm. During testing, the extracted features are fed directly to the Skip-Net algorithm to perform classification, and voting is applied to the outputs of the Skip-Net algorithm to obtain the final classification result.
Figure 2. Representation of the time–frequency resolution of standard STFT and anchored-STFT. (a) The time–frequency resolution of a fixed-length window K of STFT. (a 1.1) A fixed-length window K is convolved with the time-series signal with a fixed stride (s). (a 1.2) The spectrogram obtained by convolving the window K with the time-series signal; here, the frequency resolution remains the same for all locations of the spectrogram. (b) The time–frequency resolution of anchored-STFT. (b 1.1) Anchors of different lengths are convolved with the time-series signal using stride (s). (b 1.2) Anchor K1, with a short length, results in a spectrogram with better time resolution but lower frequency resolution; anchor K3, with a longer length, provides better frequency but lower time resolution. The green and black boxes show a frequency component computed for anchors of different lengths, which in turn provides a different frequency resolution for each anchor length.
As STFT uses a fixed-length window (see Fig. 2(a 1.1)), the frequency resolution of the STFT remains the same for all locations in the spectrogram (see Fig. 2(a 1.2)). STFT therefore only provides a suboptimal trade-off between time and frequency resolution. Here, an extension of STFT is proposed to address this trade-off by defining multiple anchors of variable lengths (see Fig. 2b). The proposed algorithm is named anchored-STFT. Anchored-STFT is inspired by the wavelet transform36 and Faster R-CNN23.
The working principle of anchored-STFT is as follows:
1. First, K anchors of the same shape but different lengths are defined. All the defined anchors have the same focal point (anchor position). The focal point can be defined either at the center or at the left corner of the anchors (see Fig. 2b).
2. K is the maximum number of possible anchors, which is mathematically defined in Eq. (1):

$K = \dfrac{\log(sL)}{\log(2)} \quad (1)$
3. The shape of the anchors can be selected from the windows normally used by STFT, e.g., the Hann window, where:

• sL = length of the signal
• $aL_i$ = length of anchor i = $2^i$; i = 1, 2, …, K
• minimum length of an anchor: $minL = 2^1$
• maximum length of an anchor: $maxL = 2^K$
• when the focal point is defined at the centre of the anchors, the length of the anchors is given by $aL_i = 2^i + 1$; i = 1, 2, …, K
4. N anchors are then selected from the K anchors using a grid search, where N ≤ K.
5. The stride s by which the anchors are slid over the time-series signal is half the length of the smallest of the N selected anchors when the focal point is defined at the left corner of the anchors. When the focal point is at the center of the anchors, the stride s is defined as (minL_N ± 1)/2, where minL_N is the minimum length among the N selected anchors. The same stride is used for all N anchors. The length of the anchors and the stride determine the number of anchor positions and consequently the number of segments of the time-series signal that are extracted by the anchors.
6. Zero-padding is applied to the signal to ensure that the same number of signal segments (frames) is extracted for anchors of different lengths. Zero-padding is applied either on both ends of the signal or on just one end, depending on whether the anchors are centered on or cornered at the anchor position.
7. The Fourier transform is applied to each segment of the time-series signal extracted by the anchors, converting it to the frequency domain (see Supplementary Fig. 1).
8. A separate spectrogram of the time-series signal is generated for each anchor length by aligning the spectra of adjacent, overlapping signal segments obtained with that anchor length, as shown in Supplementary Fig. 1. For example, if anchors of 4 different lengths are used, then 4 spectrograms of the time-series signal are generated.
9. The overlap between anchors at adjacent anchor locations and the number of anchor locations are obtained by Eqs. (2) and (3), respectively:

$\text{overlap} = aL - \text{stride} \quad (2)$

$\text{no. of anchor locations} = 1 + \dfrac{sL - minL_N}{s} \quad (3)$
It is clear from Fig. 2(a 1.2) that the frequency resolution of the STFT remains the same for all locations in the spectrogram. However, Fig. 2(b 1.2) shows that an anchor (K1) of smaller length provides better time resolution and lower frequency resolution, whereas an anchor (K3) of longer length provides better frequency resolution and lower time resolution. The green and black boxes show the same frequency component computed for anchors of different lengths. Each frequency component has a different resolution for each anchor length, which consequently provides a better combined time–frequency resolution, as also shown in Fig. 4. Figure 4 shows the input images of different time–frequency resolution generated by 5 anchors of different lengths for a right-hand MI task performed by subject 4 of BCI competition IV dataset 2b.
An intuitive explanation of the workflow of anchored-STFT is provided in Supplementary Section ‘Workflow
of anchored-STFT’.
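To make the procedure concrete, the following is a minimal NumPy sketch of anchored-STFT for corner-anchored Hann windows, using the parameter values adopted later in this paper (a 625-sample SOI, anchor lengths 16–256, 257 FFT points). The function and variable names are our own illustration, not the authors' released code, and the grid-search selection of anchors (step 4) is omitted.

```python
import numpy as np

def anchored_stft(signal, anchor_lengths, n_fft=512):
    """Minimal corner-anchored sketch of steps 1-9: every anchor shares the
    anchor positions fixed by the shortest anchor and a stride of half its
    length, so each anchor length yields a spectrogram with the same number
    of time frames."""
    sL = len(signal)
    stride = min(anchor_lengths) // 2                               # step 5
    n_locs = 1 + int(np.ceil((sL - min(anchor_lengths)) / stride))  # Eq. (3)
    spectrograms = []
    for aL in anchor_lengths:
        window = np.hanning(aL)                              # step 3: e.g. Hann
        pad = stride * (n_locs - 1) - sL + aL                # Eq. (8), tail only
        x = np.concatenate([signal, np.zeros(max(pad, 0))])  # step 6
        frames = np.stack([x[i * stride : i * stride + aL] * window
                           for i in range(n_locs)])          # step 5
        spec = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)).T  # steps 7-8
        spectrograms.append(spec)                            # one per anchor length
    return spectrograms

# A 2.5 s SOI at 250 Hz (625 samples), five anchors, stride 8
soi = np.random.randn(625)
specs = anchored_stft(soi, [16, 32, 64, 128, 256])
print([s.shape for s in specs])   # each (257, 78): frequency x time
```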
Gradient norm adversarial augmentation (GNAA). In this study, we used the proposed GNAA method to harvest new training inputs from the existing training inputs of the EEG data. The proposed data augmentation algorithm differs from existing data augmentation techniques. First, it requires a trained neural network for the selection of meaningful features. It then calculates the gradient of the cost function (of the trained neural network) with respect to a given training input. We used the Frobenius (L2) norm for the normalization in Eqs. (4) and (5). This gradient provides the direction towards the decision boundary. The given training input x is slightly perturbed (by a factor ε) in the direction of the decision boundary. As a result, it generates new inputs x_new, as shown in Eq. (4). The gradient norm method is not only a method of generating
new inputs, but it also ensures the selection of features in the given feature vector that play a pivotal role in the
prediction.
$x_{\text{new}} = x + \varepsilon \, \dfrac{\partial(\text{cost}) / \partial x}{\left\lVert \partial(\text{cost}) / \partial x \right\rVert} \quad (4)$
We used Eq. (4) not only for data generation but also to study the existence of adversarial inputs in the domain of BCI studies. In this study, we define 'adversarial inputs' as inputs that are modified versions of the original inputs and remain highly correlated with them, yet the employed classification algorithm fails to predict them correctly. The term β in Eq. (5) defines the minimum amount of perturbation required such that the difference between the two inputs (original and perturbed) remains indistinguishable in terms of correlation but the classifier is fooled by the perturbed input. The value of β (0.01) was determined empirically.
$x_{\text{adv}} = x + \beta \, \dfrac{\partial(\text{cost}) / \partial x}{\left\lVert \partial(\text{cost}) / \partial x \right\rVert} \quad (5)$
Here, we also determine the 'pockets' of adversarial inputs. The 'pockets' are defined as the number of inputs in the training dataset that can be converted into adversarial inputs (using the trained classifier) by applying the amount of perturbation defined by β in Eq. (5).
Additionally, we compared the perturbation applied by the gradient norm method with an existing method of crafting adversarial inputs called the 'gradient sign' method37, defined in Eq. (6). The perturbations applied by the two methods are significantly different, as shown in Fig. 3. The original input, the applied perturbation and the newly generated perturbed input for the gradient norm method are shown in Fig. 3(a), and for the fast gradient sign method in Fig. 3(b). The perturbation applied by the gradient norm method carefully selects only the features that are important for the employed classification algorithm, as shown in Fig. 3(a.2): the more important features receive higher perturbation values, while the least important features are changed only slightly. The direction of the perturbation tends to be towards the decision boundary.
In contrast, the perturbation applied by the fast gradient sign method is less informative (see Fig. 3b.2). The meaningful information in the perturbation is lost because of the signum operator in Eq. (6). The signum operator maps all values greater than 0 to 1 and all values less than 0 to −1 in the perturbation matrix (see Fig. 3b.2). Mathematically, the signum operator is defined in Eq. (7). As a result, the perturbation matrix is filled with values of either 1 or −1, and the importance of each feature is disregarded.
$x_{\text{adv}} = x + \varepsilon \, \operatorname{sign}\!\left( \dfrac{\partial(\text{cost})}{\partial x} \right) \quad (6)$
$\operatorname{sign}(x) := \begin{cases} -1 & \text{if } x < 0 \\ 0 & \text{if } x = 0 \\ 1 & \text{if } x > 0 \end{cases} \quad (7)$
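The contrast between Eqs. (4)-(5) and Eqs. (6)-(7) can be made concrete in a few lines. Below is a hedged NumPy sketch in which a toy linear softmax classifier stands in for the trained network; cost_gradient, the ε and β values and all names are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def cost_gradient(W, x, y):
    """Gradient of the cross-entropy cost w.r.t. the input x for a toy
    linear softmax classifier (a stand-in for the trained network)."""
    logits = W @ x
    p = np.exp(logits - logits.max())
    p /= p.sum()
    return W.T @ (p - y)            # d(cost)/dx

def gradient_norm_perturbation(grad, scale):
    """Eqs. (4)/(5): the Frobenius-normalized gradient, scaled by eps or beta."""
    return scale * grad / np.linalg.norm(grad)

def gradient_sign_perturbation(grad, eps):
    """Eqs. (6)/(7): the gradient sign method keeps only the sign of each entry."""
    return eps * np.sign(grad)

rng = np.random.default_rng(0)
W = rng.normal(size=(2, 10))        # toy trained weights (2 classes, 10 features)
x = rng.normal(size=10)             # one training input (feature vector)
y = np.array([1.0, 0.0])            # its one-hot label
g = cost_gradient(W, x, y)

x_new = x + gradient_norm_perturbation(g, 0.1)    # GNAA augmentation, Eq. (4)
x_adv = x + gradient_norm_perturbation(g, 0.01)   # adversarial probe, Eq. (5), beta = 0.01
x_fgsm = x + gradient_sign_perturbation(g, 0.01)  # gradient-sign baseline, Eq. (6)
print(np.corrcoef(x, x_adv)[0, 1])  # the perturbed input stays highly correlated
```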
Feature formation. In this study, we used a convolutional neural network (CNN)-based algorithm called Skip-Net for the classification of MI-EEG signals. Since CNN-based algorithms have shown state-of-the-art results in image recognition, we converted the EEG signals into images for classification by the Skip-Net algorithm.
In this study, three publicly available datasets (BCI competition IV dataset 2a, BCI competition IV dataset
2b, BCI competition II dataset III) are used. The data acquisition, preprocessing and the other significant details
about the datasets are discussed in detail in Supplementary Section ‘Datasets & Preprocessing’. This section
contains only the necessary information to extract the features from the raw EEG signals.
In the case of BCI competition IV dataset 2b, the EEG signal from second 3 to second 5.5 (2.5 s in total) is considered for each trial and converted into the frequency domain using anchored-STFT (see the description of anchored-STFT above). We call this interval (from second 3 to second 5.5) of the EEG signal the signal of interest (SOI) in the rest of the document. The SOI for BCI competition IV dataset 2a lasts from second 2 to second 4.5, whereas the SOI for dataset III of BCI competition II lasts from second 2.75 to second 7.25. At a sampling frequency of 250 Hz, each SOI consists of 625 samples. Anchors of five different lengths are used to transform each SOI into the frequency domain. As a result, five spectrograms of different time–frequency resolution are obtained for each SOI. We treat these spectra as images. The lengths (in samples) of the anchors used are 16, 32, 64, 128 and 256; all lengths are powers of 2. A stride of 8 samples is used to slide each anchor across the SOI. Here the anchors are cornered at the anchor positions. The anchor with the shortest length (16 samples) and the stride (8 samples) determine the number of anchor positions for all anchors and consequently the number of segments into which each SOI is divided. This results in 78 anchor locations, or segments, per SOI. Since the first anchor position considered is the first sample of the SOI, zero-padding is applied only after the last sample of the SOI, such that 78 segments are extracted from the SOI for each anchor. Equation (8) is used to calculate the required zero-padding. As in15, 257 unique FFT points are used to obtain the frequency components. This leads to a 257 × 78 image (spectrogram) for each anchor, where 257 and 78 are the number of samples along the frequency and time axes, respectively.
Figure 3. Comparison of the perturbations produced by two methods: the gradient norm method and the gradient sign method. (a) The original image, the perturbation produced by the gradient norm method, and the newly generated perturbed input. (b) The original image, the perturbation produced by the gradient sign method, and the newly generated perturbed input.
$\text{zero-padding} = \text{stride} \times (\text{no. of anchor locations} - 1) - \text{signal length} + \text{anchor length} \quad (8)$
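Plugging the dataset 2b values just described (625-sample SOI, stride 8, anchors of 16–256 samples) into Eqs. (2), (3) and (8) reproduces the stated 78 anchor locations; a small sketch, with Eq. (3) rounded up to the zero-padded value, is given here as an illustration:

```python
import math

sL, stride = 625, 8                # SOI length at 250 Hz; stride 's'
anchors = [16, 32, 64, 128, 256]   # anchor lengths in samples

# Eq. (3), rounded up to the zero-padded number of anchor positions
n_locs = 1 + math.ceil((sL - min(anchors)) / stride)
print(n_locs)                      # -> 78 segments per SOI

for aL in anchors:
    overlap = aL - stride                      # Eq. (2)
    pad = stride * (n_locs - 1) - sL + aL      # Eq. (8)
    print(f"anchor {aL:3d}: overlap {overlap:3d}, zero-padding {pad:3d}")
```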
Pfurtscheller and Lopes da Silva38 showed that the mu band (8–13 Hz) and the beta band (13–30 Hz) are of high interest for the classification of MI-EEG signals. Since there is an event-related desynchronization (ERD) in the mu band and an event-related synchronization (ERS) in the beta band when an MI task is performed, these bands are vital for the classification of MI-EEG signals. We therefore considered these bands for further processing. Here, the mu band is represented by frequencies between 4 and 15 Hz and the beta band by frequencies between 19 and 30 Hz. We then extracted the mu and beta frequency bands from each spectrogram of an SOI. The sizes of the extracted mu- and beta-band images are 22 × 78 and 23 × 78, respectively. To obtain an equal representation of each band, we resized the beta band to 22 × 78 using cubic interpolation.
Finally, we combined these images to obtain an image of size Nfr × Nt (44 × 78), where Nfr = 44 (number of frequency components) and Nt = 78 (number of time sample points). Since the dataset contains the EEG signals from Nc = 3 electrodes (C3, Cz and C4), we repeat the same process for all three electrodes and combine the images from the three electrodes, which results in a final image of size Nh × Nt (132 × 78), where Nh = Nfr × Nc = 132, for one anchor (a sketch of this assembly is given at the end of this section). We then repeat the whole process for all five anchors and obtain 5 images of size 132 × 78 for each SOI. Figure 4 shows the input images generated using the 5 anchors for an SOI of a right-hand MI task performed by subject 4.
Figure 4. Spectral representation obtained by anchored-STFT. Input images generated by the 5 anchors from an SOI of a right-hand MI task performed by subject 4.
The decrease of energy in the mu band (4–15 Hz) and the increase of energy in the beta band (19–30 Hz) in the C3 channel clearly show the ERD and ERS effects, respectively, for this right-hand MI task, as is typical when performing an MI task.
The same process is applied to BCI competition IV dataset 2a and dataset III of BCI competition II to obtain the input features.
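The band extraction and stacking described above can be sketched as follows. The sketch assumes a 250 Hz sampling rate and a 512-point FFT, so the 257 bins span 0–125 Hz; the band-edge indices are our reading of the stated 22- and 23-row band sizes, and SciPy's cubic zoom stands in for the cubic interpolation:

```python
import numpy as np
from scipy.ndimage import zoom

FS, N_FFT = 250, 512
freqs = np.fft.rfftfreq(N_FFT, d=1.0 / FS)        # 257 bins covering 0-125 Hz

def band_rows(spec, lo, hi):
    """Rows of a (257 x 78) spectrogram whose bin centres lie in [lo, hi] Hz."""
    return spec[(freqs >= lo) & (freqs <= hi), :]

def soi_image(specs_per_channel):
    """Stack the mu (4-15 Hz) and beta (19-30 Hz) bands of C3, Cz and C4
    into one 132 x 78 image for a single anchor."""
    parts = []
    for spec in specs_per_channel:                # one 257 x 78 spectrum per channel
        mu = band_rows(spec, 4, 15)               # 22 x 78
        beta = band_rows(spec, 19, 30)            # 23 x 78
        beta = zoom(beta, (mu.shape[0] / beta.shape[0], 1.0), order=3)  # cubic resize
        parts.append(np.vstack([mu, beta]))       # 44 x 78 per channel
    return np.vstack(parts)                       # 3 channels -> 132 x 78

specs = [np.abs(np.random.randn(257, 78)) for _ in range(3)]  # C3, Cz, C4 (dummy)
print(soi_image(specs).shape)                                 # (132, 78)
```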
Skip-Net. In this study, we propose a shallow CNN-based architecture for the classification of MI-EEG signals which contains one skip connection, hence the name Skip-Net.
The Skip-Net comprises two convolutional layers. The first convolutional layer uses filters that convolve along the time axis and extract frequency-domain features, whereas the second convolutional layer extracts time-domain features. We use an additive skip connection to combine the extracted frequency- and time-domain features, preventing the loss of information and in turn improving the classification performance of the Skip-Net compared to other classifiers. The proposed architecture contains significantly fewer trainable parameters than its counterparts proposed in15,29,32,33. The skip connection as well as the small number of parameters also reduce the risk of overfitting.
The architecture of the Skip-Net is shown in Fig. 5. The first layer in the Skip-Net architecture is the input layer, with dimensions Nh × Nt. The second layer is a convolutional layer which uses 16 kernels of size Nh × 1 to convolve the input image with a stride of 1 in both the horizontal and vertical directions. Rectified linear units (ReLUs) are used as the activation functions. The output of this convolutional layer is of size 1 × Nt × 16, and batch normalization is applied to it. The next layer is the second convolutional layer, which uses 16 kernels of size 1 × 3 to convolve the output of the previous layer in the horizontal direction with a stride of 1. ReLUs are again used as the activation function, and batch normalization is also applied to the output of the second convolutional layer. The next layer is the addition layer, which adds the outputs of the first and second ReLU functions. Same-padding is applied in the second convolutional layer to keep the dimensions of the second convolutional feature map equal to those of the first, so that both feature maps are compatible with the addition layer. The output of the addition layer is then fed to a fully connected layer with 128 neurons, which uses a dropout of 50% as regularization to avoid overfitting; ReLUs are used as the activation function here as well. The last layer is the output layer, which uses the softmax function to output the predictions. The proposed architecture is inspired by the residual learning framework39.
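A hedged PyTorch sketch of the described architecture follows. The text adds the outputs of the two ReLUs; here the post-batch-normalization maps are summed as one plausible reading, and details the text does not state (initialization, optimizer) are left at library defaults:

```python
import torch
import torch.nn as nn

class SkipNet(nn.Module):
    """Sketch of Skip-Net as described in the text: two convolutional layers,
    an additive skip connection combining the two branches, a 128-unit fully
    connected layer with 50% dropout, and a softmax output."""
    def __init__(self, n_h=132, n_t=78, n_classes=2):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 16, kernel_size=(n_h, 1))  # frequency features
        self.bn1 = nn.BatchNorm2d(16)
        # 'same' padding along time keeps the map at 1 x Nt for the addition
        self.conv2 = nn.Conv2d(16, 16, kernel_size=(1, 3), padding=(0, 1))
        self.bn2 = nn.BatchNorm2d(16)
        self.relu = nn.ReLU()
        self.fc = nn.Linear(16 * n_t, 128)
        self.drop = nn.Dropout(0.5)
        self.out = nn.Linear(128, n_classes)

    def forward(self, x):                          # x: (batch, 1, Nh, Nt)
        a = self.bn1(self.relu(self.conv1(x)))     # (batch, 16, 1, Nt)
        b = self.bn2(self.relu(self.conv2(a)))     # time-domain branch, same shape
        h = a + b                                  # additive skip connection
        h = self.drop(self.relu(self.fc(h.flatten(1))))
        return torch.softmax(self.out(h), dim=1)

model = SkipNet()
print(model(torch.randn(4, 1, 132, 78)).shape)     # torch.Size([4, 2])
```

For training, one would typically feed logits to a cross-entropy loss rather than softmax outputs; the explicit softmax layer is kept here only to mirror Fig. 5.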
Figure 5. Skip-Net architecture. Illustration of the Skip-Net architecture for the classification of MI-EEG
signals.
Workflow at inference time. Figure 1 shows that the features (spectrograms) generated by anchored-STFT are used directly by the Skip-Net algorithm to produce the classification results in test mode. As mentioned in the section Feature formation, each SOI is transformed into 5 spectrograms of different time–frequency resolutions, as graphically represented in Fig. 6. Skip-Net classifies each spectrogram into one class, which results in 5 predicted outputs for each SOI (one per spectrogram). The final classification is based on majority voting over the 5 predicted outputs; a sketch of this step is given below. The number of anchors (N) used must be odd to prevent ties. The graphical representation of the forward pass of the whole pipeline during testing is shown in Fig. 6.
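A minimal sketch of the voting step (our illustration; with two classes and an odd number of anchors, a tie cannot occur):

```python
import numpy as np

def majority_vote(per_spectrogram_classes):
    """Final class for one SOI from the N per-spectrogram predictions."""
    values, counts = np.unique(np.asarray(per_spectrogram_classes),
                               return_counts=True)
    return values[np.argmax(counts)]

# Five anchors -> five predictions per SOI
print(majority_vote([1, 0, 1, 1, 0]))  # -> 1
```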
Source code. We will upload the code and the trained models to GitHub after publication of the manuscript so that others can use them.
Results
A detailed performance comparison of the proposed pipeline with several existing state-of-the-art studies is presented here. This section also includes the ablation studies, which briefly explain the tuning process of the hyperparameters of the proposed pipeline. A comprehensive explanation of the ablation study is provided in the Supplementary Materials. Three publicly available benchmark datasets are used for the validation of the results, as follows.
BCI competition IV dataset 2b. A benchmark dataset for most of the published MI BCI studies, used here with three different data distributions. This dataset contains a total of five recording sessions, 01T, 02T, 03T, 04E and 05E, for each subject. The most recent studies (Zhang et al.33 and Dai et al.32) first combined the data of all the available recording sessions (01T, 02T, 03T, 04E, 05E) and then randomly split the data into training and evaluation datasets. We named this data distribution Data-Distribution 1. Other studies11,15,40–42 used the first three recording sessions (01T, 02T, 03T) for training and the last two recording sessions (04E, 05E) for evaluation. We named this data distribution Data-Distribution 2. In addition, a few studies11,43 also reported cross-validation results using the first three recording sessions (01T, 02T, 03T). We named this Data-Distribution 3. In this study, to allow a fair comparison, the proposed pipeline is evaluated on all three data distributions separately.
BCI competition IV dataset 2a. Another often-used benchmark dataset for the decoding of MI tasks in BCI studies, used here with four different data distributions. This dataset contains two recording sessions for each subject. The most recent studies—SW-LCR and SW-Mode in44, and RK-SVM in45—used session 1 for training and session 2 for evaluation. We named this data distribution Data-Distribution 4. The DeepConvNet and EEGNet methods46 reported cross-validation performance on sessions 1 and 2; we named it Data-Distribution 5. MMI-LinT, MMI-nonLinT, FBCSP and CSP47 performed across-session analyses (training on session 1 and evaluation on session 2, and vice versa); we named it Data-Distribution 6. In addition, Ozdenizi and Erdogmus47 also reported cross-validation performance on session 1 only; we named it Data-Distribution 7. In this study, we evaluated our pipeline on all of these data distributions.
Figure 6. Graphical representation of the whole pipeline in testing mode. Five spectrograms are computed for each SOI and channel. Each spectrogram is then fed to Skip-Net, yielding five predictions in total for each SOI. Voting is performed on the five output predictions; the class with the maximum number of occurrences is the final predicted class for the trial.
BCI competition II dataset III. This dataset is provided with two sessions, where one session is used for training and the second for evaluation. The majority of studies used the training session for training the algorithms and the evaluation session for validation of the results. We named this distribution Data-Distribution 8 and used the same criteria to validate our findings.
Evaluation metrics. We used accuracy and kappa values as the evaluation metrics. The kappa value is calculated by Eq. (9).
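Eq. (9) itself appears in a part of the paper not reproduced here; however, the accuracy–kappa pairs reported in the tables below are consistent with kappa computed against the chance level 1/C for C classes. A hedged sketch of that relationship:

```python
def kappa_from_accuracy(accuracy, n_classes):
    """Kappa relative to chance level 1/C; consistent with the reported
    pairs (e.g. two-class accuracy 0.799 -> kappa of about 0.598)."""
    p_chance = 1.0 / n_classes
    return (accuracy - p_chance) / (1.0 - p_chance)

print(round(kappa_from_accuracy(0.799, 2), 3))  # -> 0.598
```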
Table 1 (excerpt). Accuracy (%) and kappa values for subjects 1–9 (S1–S9) and their average (AVG) on BCI competition IV dataset 2b.

Data-Distribution 2
  FBCSP11                        Acc    70.0  60.4  60.9  97.5  92.8  80.7  77.5  92.5  87.2 | 79.9
                                 Kappa  0.400 0.207 0.219 0.950 0.856 0.613 0.550 0.850 0.744 | 0.599
  anchored-STFT + Skip-Net-GNAA  Acc    75.0  61.6  59.7  96.9  92.2  87.2  81.9  93.4  87.8 | 81.8
                                 Kappa  0.500 0.232 0.194 0.938 0.844 0.744 0.638 0.868 0.756 | 0.635

Data-Distribution 3
  TLCSD43                        Acc    70.4  61.2  56.7  88.9  76.2  70.7  84.3  61.8  66.3 | 70.7
                                 Kappa  0.408 0.224 0.134 0.778 0.524 0.414 0.686 0.236 0.326 | 0.414
  FBCSP11                        Acc    77.3  60.4  62.2  94.4  84.6  76.7  70.5  70.7  79.2 | 75.1
                                 Kappa  0.546 0.208 0.244 0.888 0.692 0.534 0.409 0.413 0.583 | 0.502
  anchored-STFT + Skip-Net-GNAA  Acc    79.9  57.3  56.2  95.1  87.5  83.1  75.6  71.4  77.9 | 76.0
                                 Kappa  0.598 0.145 0.124 0.902 0.749 0.662 0.512 0.427 0.558 | 0.520

Anchors = [16, 32, 64, 128, 256]; stride = 8.
Comparison with state-of-the-art studies using Data-Distribution 1. In order to have a fair comparison with state-of-the-art methods using Data-Distribution 1, we restructured the data as reported in the most recent study33. The performance comparison is shown in Table 1.
Table 1 shows that, for Data-Distribution 1, the proposed pipeline outperformed all the state-of-the-art studies by yielding an average classification accuracy of 89.5%, which is 1% higher than the most recent results produced by the EEG-Inception method reported in33 and 1.9% higher than the HS-CNN method reported in32. In addition, Fig. 7 shows the performance comparison of anchored-STFT + Skip-Net-GNAA with two well-known state-of-the-art architectures, namely EEGNet46 and DeepConvNet48, for Data-Distribution 1. Figure 7 shows that
the proposed method obtained 3.7% and 4.8% higher average classification accuracy than the EEGNet and DeepConvNet architectures, respectively. It is evident from Table 1 and Fig. 7 that the proposed method also yields the highest kappa value of 0.790, which is 5% higher than that of HS-CNN and 2.6% higher than that of the EEG-Inception model; these numbers are 10.1% and 13.8% for the EEGNet and DeepConvNet architectures, respectively. The proposed pipeline outperformed the EEG-Inception method for 7 out of 9 subjects, HS-CNN for 6 out of 9 subjects, and EEGNet for 8 out of 9 subjects. For Data-Distribution 1, our proposed method significantly outperformed EEGNet and DeepConvNet (p < 0.05), whereas its performance is statistically similar to EEG-Inception and HS-CNN.
Comparison with state-of-the-art studies using Data-Distribution 2. The performance comparison of anchored-STFT + Skip-Net-GNAA with state-of-the-art methods using Data-Distribution 2 is also presented in Table 1. Table 1 shows that the proposed method achieved the highest average classification accuracy of 81.8%, which is 1.9% higher than the FBCSP method and 5.8% higher than the CSP method; these numbers are 1.8%, 3.9% and 12.1% for the Bi-Spectrum, RSMM and TLCSD methods, respectively. In addition, Fig. 7 also shows the performance comparison of anchored-STFT + Skip-Net-GNAA with the EEGNet and DeepConvNet architectures for Data-Distribution 2. Figure 7 indicates that the proposed method outperformed EEGNet by yielding an improvement of 3.5% in average classification performance, and improved the accuracy by 1.9% compared to DeepConvNet. The results also indicate that anchored-STFT + Skip-Net-GNAA significantly outperformed (p < 0.05) the EEGNet, TLCSD, RSMM and CSP methods, whereas it performed statistically similar to the Bi-Spectrum and FBCSP algorithms.
Table 1 and Fig. 7 also show that anchored-STFT + Skip-Net-GNAA provided the highest kappa value, 0.635, of all the compared methods. The presented method provided 22.1%, 6.0%, 5.8%, 61.1%, 13.8%, 12.2% and 6.2% improvements in average kappa value with respect to the CSP, FBCSP, Bi-Spectrum, TLCSD, RSMM, EEGNet and DeepConvNet methods, respectively.
Table 1 shows that our method outperformed the FBCSP algorithm and Bi-Spectrum for 6 out of 9 subjects, EEGNet for 7 out of 9 subjects and RSMM for 8 out of 9 subjects. It outperformed the CSP algorithm and TLCSD for all subjects.
Comparison with state-of-the-art studies using Data-Distribution 3. Ang et al.11 introduced the filter bank common spatial pattern (FBCSP) algorithm, the winning algorithm of BCI Competition IV dataset 2b, and performed tenfold cross-validation on the training data (Data-Distribution 3). The cross-validation performance of the proposed pipeline is compared with the FBCSP algorithm and TLCSD43 in Table 1 in terms of accuracy and kappa values.
Here, the average kappa values of the FBCSP method and TLCSD are 0.502 and 0.414, respectively, whereas anchored-STFT + Skip-Net-GNAA obtained an average kappa value of 0.520. The higher kappa value of the proposed method in comparison with the other methods indicates better generalization quality. The proposed pipeline increased the kappa value by 25.6% and 3.6% with respect to TLCSD and FBCSP, respectively. Table 1 shows that the proposed approach outperformed the FBCSP method for 6 out of 9 subjects. For Data-Distribution 3, all methods performed statistically similar.
In addition to average kappa values for tenfold cross-validation, we also compared the performance of our approach with other methods15,40–42 that provided the best kappa values for dataset 2b of BCI competition IV, using the best kappa values of the proposed method for this comparison. Our approach outperformed the existing studies in this comparison by yielding the maximum average kappa value of 0.737. The detailed comparison is shown in Supplementary Table 11.
BCI IV, dataset 2a, Data-Distribution 4:

Methods                            Metric  LH/RH  LH/BF  LH/TO  RH/BF  RH/TO  BF/TO | AVG
CSP + LDA45                        Acc     75.9   80.8   82.9   84.2   81.9   73.4  | 79.9
                                   Kappa   0.518  0.616  0.658  0.684  0.638  0.468 | 0.598
SVM vec45                          Acc     73.0   78.2   81.0   77.0   77.5   68.5  | 75.9
                                   Kappa   0.460  0.564  0.620  0.540  0.550  0.370 | 0.518
RK-SVM45, Cref = Identity          Acc     80.6   85.0   85.0   80.5   83.6   75.5  | 81.7
                                   Kappa   0.612  0.700  0.700  0.610  0.672  0.510 | 0.634
RK-SVM45, Cref = Arithmetic mean   Acc     80.6   85.8   85.6   83.6   83.5   74.2  | 82.2
                                   Kappa   0.612  0.716  0.712  0.672  0.670  0.484 | 0.644
RK-SVM45, Cref = Geometric mean    Acc     79.9   87.3   86.9   85.9   86.0   77.2  | 83.9
                                   Kappa   0.598  0.746  0.738  0.718  0.720  0.544 | 0.678
SW-LCR44                           Acc     80.0   83.6   86.2   84.6   83.5   73.0  | 81.8
                                   Kappa   0.600  0.672  0.724  0.692  0.670  0.460 | 0.636
SW-Mode44                          Acc     79.8   83.7   86.0   85.0   84.0   73.4  | 82.0
                                   Kappa   0.596  0.674  0.720  0.700  0.680  0.468 | 0.640
anchored-STFT + Skip-Net-GNAA      Acc     81.2   87.6   89.6   86.8   89.0   79.2  | 85.4
                                   Kappa   0.624  0.752  0.792  0.736  0.780  0.584 | 0.708
Table 2. Performance comparison of anchored-STFT + Skip-Net-GNAA with other methods on all pairwise two-class MI tasks of dataset 2a from BCI competition IV. Significant values are in bold.
In addition, a comparison was made between the proposed pipeline and the algorithms presented in15. The proposed method outperformed all the algorithms presented in15, including its counterparts (CNN and CNN-SAE), by providing 5.6% and 2.9% higher average accuracy, respectively. The detailed comparison is given in Supplementary Table 10.
1. RK-SVM, SW-LCR and SW-Mode reported the performance on all possible pairwise two-class MI tasks averaged over all subjects using Data-Distribution 4.
2. The DeepConvNet and EEGNet methods reported a four-class decoding analysis using fourfold cross-validation on Data-Distribution 5.
3. MMI-LinT, MMI-nonLinT, FBCSP and CSP performed four-class decoding on Data-Distribution 6. In addition, a two-class (LH/RH) decoding analysis on Data-Distribution 7 is also reported for MMI-LinT, MMI-nonLinT, FBCSP and CSP.
Comparison with state-of-the-art studies using Data-Distribution 4. Here, a performance comparison is made between our proposed method and the SW-Mode, SW-LCR, RK-SVM, SVM vec and CSP + LDA methods in pairwise MI-task decoding. Table 2 shows the performance comparison of our proposed pipeline with the other methods on all pairwise two-class MI tasks averaged over all subjects. Anchored-STFT + Skip-Net-GNAA outperformed SW-Mode, SW-LCR, RK-SVM, SVM vec and CSP + LDA by yielding the highest average classification accuracy of 85.4%, which is 3.4% and 3.6% higher than SW-Mode and SW-LCR, respectively. It is 1.5% higher than RK-SVM when the geometric mean is used as the reference point, and 3.2% and 3.7% higher when the arithmetic mean and identity are used as reference points, respectively. The difference is 5.5% for CSP + LDA and 9.5% for SVM vec (half-vectorization of the covariance matrices).
Similarly, anchored-STFT + Skip-Net-GNAA showed an improvement in average kappa value compared to the other methods by producing the highest average kappa value of 0.708. The improvement is 10.63% and 11.3% compared to the SW-Mode and SW-LCR methods, and 4.4%, 9.9% and 11.7% compared to RK-SVM with the geometric mean, arithmetic mean and identity as reference points, respectively. There is a substantial increase of 36.7% and 18.4% compared to the SVM vec and CSP + LDA methods, respectively. For Data-Distribution 4, anchored-STFT + Skip-Net-GNAA significantly outperformed the CSP + LDA, SVM vec, RK-SVM, SW-LCR and SW-Mode methods (p < 0.05).
Comparison with state-of-the-art studies in four-class decoding analysis using fourfold cross-validation on Data-Distribution 5. In addition to pairwise decoding of MI tasks, we also evaluated our pipeline in the four-class decoding protocol and compared its performance with the well-known state-of-the-art methods EEGNet and DeepConvNet. In46, the authors performed fourfold cross-validation on sessions 1 and 2 to evaluate the performance of the EEGNet and DeepConvNet architectures for four-class MI decoding. We therefore performed the same experiment; the performance comparison of our proposed pipeline with EEGNet and DeepConvNet is shown in Fig. 8(a). It is evident from Fig. 8(a) that anchored-STFT + Skip-Net-GNAA obtained the highest average classification accuracy as well as the highest average kappa value compared to both variants of EEGNet and to DeepConvNet. The proposed method yielded an average accuracy improvement of 1% and 5.6% compared to EEGNet-8,2 and EEGNet-4,2, respectively, and a substantial improvement of 19% compared to the DeepConvNet architecture. These numbers are 2.4%, 14.6% and 75.5% in terms of kappa values compared to EEGNet-8,2, EEGNet-4,2 and DeepConvNet, respectively. Here, the proposed method significantly outperformed DeepConvNet and EEGNet-4,2 (p < 0.05), whereas its performance is statistically similar to EEGNet-8,2.

Table 3. Comparison of accuracy and kappa results on BCI competition II dataset III produced by anchored-STFT + Skip-Net-GNAA, CNN, CNN-SAE15 and the winner algorithm51. Significant values are in bold.
Comparison with state-of-the-art studies in four-class decoding analysis using Data-Distribution 6. An analysis was made to compare the performance of the proposed pipeline with the state-of-the-art methods CSP, FBCSP, MMI-LinT and MMI-nonLinT in four-class decoding across sessions. The performance comparison is presented in Fig. 8(b). The results in Fig. 8(b) show that anchored-STFT + Skip-Net-GNAA outperformed the other methods by producing the highest average classification accuracy of 69.1% and an average kappa value of 0.588. The proposed method enhanced the average classification accuracy by 15.4%, 13.2%, 12% and 12.2% compared to CSP, FBCSP, MMI-LinT and MMI-nonLinT, respectively. The same trend is also seen in the kappa values, where the improvement is 53.9%, 42.7%, 37.4% and 38.4% compared to CSP, FBCSP, MMI-LinT and MMI-nonLinT, respectively. For Data-Distribution 6, the proposed pipeline significantly outperformed all the competing methods CSP, FBCSP, MMI-LinT and MMI-nonLinT (p < 0.05).
Comparison with state-of-the-art studies in two-class (LH/RH) decoding analysis using cross-validation on Data-Distribution 7. In addition to four-class decoding, we also performed two-class decoding (left hand vs right hand) to compare the performance of our proposed pipeline with CSP, FBCSP, MMI-LinT and MMI-nonLinT. The results are presented in Fig. 8(b). In this analysis, anchored-STFT + Skip-Net-GNAA performed best in terms of average classification accuracy and kappa values compared to the other methods. Figure 8(b) shows that anchored-STFT + Skip-Net-GNAA achieved 80.9% average classification accuracy, which is 3.8% to 5.2% higher than the other methods; this range is 14.0% to 20.2% in terms of kappa values. Anchored-STFT + Skip-Net-GNAA performed significantly better than CSP, FBCSP, MMI-LinT and MMI-nonLinT (p < 0.05) for Data-Distribution 7.
Comparison of the proposed pipeline with state-of-the-art studies on BCI II, dataset III using Data-Distribution 8. To further validate the performance of our method, we employed the proposed pipeline on another publicly available dataset, dataset III from BCI competition II. Since this dataset is already divided into training and test data, the evaluation of the presented pipeline is straightforward. Here, we only performed the evaluation on the unseen (test) dataset. The input images are computed as explained in the section Feature formation. Table 3 compares the classification accuracies and kappa values on this dataset produced by the proposed method, the methods presented in15 (CNN, CNN-SAE) and the winner algorithm51 of BCI competition II, dataset III.
Table 3 illustrates that the proposed method outperformed the winner algorithm, providing 1.4% and 3.9% improvements in accuracy and kappa value, respectively. It also outperformed the CNN and CNN-SAE methods by 1.4% and 0.7% in accuracy and by 3.56% and 1.75% in kappa value, respectively.
Ablation study. BCI competition IV dataset 2b is originally provided with Data-Distribution 2 by the organizers of the competition. Using Data-Distribution 2 for tuning the parameters ensures that the evaluation data remain unseen by the classifier during the training process, since there is no overlap between the two; it is therefore a more transparent way to validate the results. Hence, we used Data-Distribution 2 for the ablation study. As shown in Supplementary Table 4 of the Supplementary Materials, the total number of anchors selected is 5 and the combination used is 16, 32, 64, 128, 256.
The selection of the stride is also a hyperparameter, which affects the evaluation accuracy as well as the computational cost. The stride is selected based on the anchor with the smallest length; the criterion is that the overlap between the smallest anchor at adjacent anchor locations is at least 50%. A detailed analysis of strides resulting in overlaps of 100%, 75%, 50%, 25% and 0% on the overall evaluation accuracy is presented in Supplementary Table 5 of the Supplementary Materials. Based on this analysis, the selected stride is 8, which ensures at least 50% overlap between the anchors of smallest length at adjacent anchor locations. This stride provides an optimized trade-off between evaluation accuracy and computational cost.
In all the remaining analyses, the hyperparameter values used are: anchors = [16, 32, 64, 128, 256]; stride = 8.
Performance comparison of anchored-STFT with the continuous wavelet transform (CWT) and STFT feature extraction methods, and the effect of adding a skip connection to the CNN architecture. Since our method is inspired by the wavelet transform and is an extension of STFT, a comparison of these methods with anchored-STFT was performed to validate the findings. Data-Distribution 2 of dataset 2b from BCI competition IV is used for this analysis. The comparison is made on two CNN-based architectures, i.e., the proposed CNN architecture with a skip connection (Skip-Net) and a standard CNN architecture. Anchored-STFT using Skip-Net outperformed the STFT and CWT methods by 3.7% and 3.6%, respectively, while anchored-STFT using the standard CNN architecture outperformed the STFT and CWT methods by 3.1% and 5.4%, respectively. The comprehensive comparison for each subject is presented in Supplementary Table 6. This analysis shows that adding a skip connection to the standard CNN architecture improves the performance of the classifier.
Hyperparameter tuning during training for Skip-Net. The Skip-Net explained in the section Skip-Net is a deep-learning model. It involves several hyperparameters; these are tuned using a grid search, and the tuned values are then used to train the Skip-Net algorithm.
Evaluation of the robustness of the classifier using inputs generated by GNAA. It is of cardinal importance to enhance the robustness of the classifier at inference time. The data generated by GNAA improve the robustness as well as the classification accuracy. This is validated by a comprehensive analysis of the impact of the new inputs generated by the GNAA method on the robustness of the classifier, shown in the section 'Impact of inputs generated by GNAA on robustness of classifier' of the Supplementary Materials. In addition, a quantitative comparison of the perturbations generated by GNAA and the gradient sign method is performed (see the same section of the Supplementary Materials).
The following conclusions are drawn from the aforementioned analyses:
1. The existence of adversarial inputs is not random in nature (Fig. 3a.2), as it is for the gradient sign method, which uses the sign operator (see Fig. 3b.2). The GNAA method instead selects only the meaningful features to perturb when generating adversarial inputs, as shown in Fig. 3(a).
2. Training the classifier on the original training data plus the perturbed inputs generated by the GNAA method slightly improves the overall average classification accuracy compared to the gradient sign method, since the carefully perturbed inputs provide additional training inputs that closely resemble the distribution of the original training data.
3. Training the model on the perturbed inputs along with the original training data enhances the robustness against adversarial attacks.
4. The perturbations applied by GNAA and the gradient sign method can provide insight into the quality of the training data. As shown in Supplementary Table 8, subjects 2 and 3 yielded a greater number of adversarial examples than subjects 4 and 5. It can be concluded that the discrimination between the different classes of subjects 2 and 3 is lower than for subjects 4 and 5, which is also evident from the classification accuracies of these subjects reported in Supplementary Table 7. It can also be inferred that, for subjects 2 and 3, the feature vectors of distinct classes lie quite close to the decision boundary determined by the classifier, which results in a greater number of adversarial inputs when they are slightly perturbed.
On BCI competition II dataset III, the proposed method thus provided 1.4% and 3.9% improvements over the winner algorithm in accuracy and kappa value, respectively, and outperformed the CNN and CNN-SAE methods by 1.4% and 0.7% in accuracy and by 3.56% and 1.75% in kappa value, respectively. We conclude that the proposed method systematically improves the state of the art, and that in some cases the improvements are substantial.
The results generated by using different data distributions for training and evaluation differ considerably, as shown in Tables 1, 2 and 3 and Figs. 7 and 8. Therefore, in our view, using a standardized data distribution, as provided and recommended by the organizers of the datasets, would allow fairer comparisons.
The current version of anchored-STFT constructs a separate feature matrix for each defined anchor, and each feature matrix is provided to the classifier; a voting strategy is then applied to take the final decision. In the future, we aim to construct a single but more meaningful feature matrix from all the anchors. We believe that if all the necessary information is provided at once, the generalization quality of deep learning models can increase, and the computational cost of the proposed pipeline can also be reduced. Here, we briefly investigated the existence of adversarial inputs in neural data; however, a more thorough investigation is required. Therefore, in the future we aim to extract adversarial inputs created by different methods and to train a more robust classifier on data with greater variability.
References
1. Graimann, B., Allison, B. & Pfurtscheller, G. Brain-Computer Interfaces: A Gentle Introduction (Springer, 2010).
2. Kübler, A. et al. A brain-computer interface controlled auditory event-related potential (p300) spelling system for locked-in patients.
Ann. N. Y. Acad. Sci. https://doi.org/10.1111/j.1749-6632.2008.04122.x (2009).
3. Klaes, C. et al. Hand shape representations in the human posterior parietal cortex. J. Neurosci. 35, 15466–15476 (2015).
4. Kellis, S. et al. Decoding spoken words using local field potentials recorded from the cortical surface. J. Neural Eng. 7, 056007
(2010).
5. Aflalo, T. et al. Decoding motor imagery from the posterior parietal cortex of a tetraplegic human. Science 348, 906–910 (2015).
6. Ajiboye, A. B. et al. Restoration of reaching and grasping movements through brain-controlled muscle stimulation in a person
with tetraplegia: A proof-of-concept demonstration. Lancet 389(10081), 1821–1830 (2017).
7. Choi, J., Kim, S., Ryu, R., Kim, S. & Sohn, J. Implantable neural probes for brain-machine interfaces - current developments and
future prospects. Exp. Neurobiol. 27(6), 453–471 (2018).
8. Pfurtscheller, G. & Lopes da Silva, F. Event-related EEG/MEG synchronization and desynchronization: Basic principles. Clin.
Neurophysiol. 110, 1842–1857 (1999).
9. Müller-Gerking, J., Pfurtscheller, G. & Flyvbjerg, H. Designing optimal spatial filters for single-trial EEG classification in a move-
ment task. Clin. Neurophysiol. 110, 787–798 (1999).
10. Grosse-Wentrup, M. & Buss, M. Multiclass common spatial patterns and information theoretic feature extraction. IEEE Trans.
Biomed. Eng. 55, 1991–2000 (2008).
11. Ang, K., Chin, Z., Wang, C., Guan, C. & Zhang, H. Filter bank common spatial pattern algorithm on BCI competition IV Datasets
2a and 2b. Front. Neurosci. https://doi.org/10.3389/fnins.2012.00039 (2012).
12. Ramoser, H., Muller-Gerking, J. & Pfurtscheller, G. Optimal spatial filtering of single trial EEG during imagined hand movement.
IEEE Trans. Rehabil. Eng. https://doi.org/10.1109/86.895946 (2000).
13. Mousavi, E. A., Maller, J. J., Fitzgerald, P. B. & Lithgow, B. J. Wavelet Common Spatial Pattern in asynchronous offline brain com-
puter interfaces. Biomed. Signal Process. Control 6, 121–128 (2011).
14. Nicolas-Alonso, L. F. & Gomez-Gil, J. Brain computer interfaces, a review. Sensors https://doi.org/10.3390/s120201211 (2012).
15. Tabar, Y. R. & Halici, U. A novel deep learning approach for classification of EEG motor imagery signals. J. Neural Eng. https://
doi.org/10.1088/1741-2560/14/1/016003 (2017).
16. Li, F. et al. A novel simplified convolutional neural network classification algorithm of motor imagery EEG signals based on deep
learning. Appl. Sci. 10, 1605 (2020).
17. Fukunaga, K. Introduction to Statistical Pattern Recognition (Elsevier, 2013).
18. Firat Ince, N., Arica, S. & Tewfik, A. Classification of single trial motor imagery EEG recordings with subject adapted non-dyadic
arbitrary time-frequency tilings. J. Neural Eng. https://doi.org/10.1088/1741-2560/3/3/006 (2006).
19. Schlögl, A., Lee, F., Bischof, H. & Pfurtscheller, G. Characterization of four-class motor imagery EEG data for the BCI-competition
2005. J. Neural Eng. https://doi.org/10.1088/1741-2560/2/4/L02 (2005).
20. Nielsen, T. D. & Jensen, F. V. Bayesian Networks and Decision Graphs (Springer, 2001).
21. Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
22. Shah, Z. H. et al. Deep-learning based denoising and reconstruction of super-resolution structured illumination microscopy
images. bioRxiv 12, 988 (2020).
23. Ren, S., He, K., Girshick, R. & Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE
Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017).
24. Saif-ur-Rehman, M. et al. SpikeDeeptector: A deep-learning based method for detection of neural spiking activity. J. Neural Eng.
16, 5 (2019).
25. Saif-ur-Rehman, M. et al. SpikeDeep-Classifier: A deep-learning based fully automatic offline spike sorting algorithm. J. Neural
Eng. https://doi.org/10.1088/1741-2552/abc8d4 (2020).
26. Issar, D., Williamson, R. C., Khanna, S. B. & Smith, M. A. A neural network for online spike classification that improves decoding
accuracy. J. Neurophysiol. 123(4), 1472–1485 (2020).
27. An, X., Kuang, D., Guo, X., Zhao, Y. & He, L. A deep learning method for classification of EEG data based on motor imagery. Intell.
Comput. Bioinform. https://doi.org/10.1007/978-3-319-09330-7_25 (2014).
28. Wulsin, D. F., Gupta, J. R., Mani, R., Blanco, J. A. & Litt, B. Modeling electroencephalography waveforms with semi-supervised deep
belief nets: Fast classification and anomaly measurement. J. Neural Eng. https://doi.org/10.1088/1741-2560/8/3/036015 (2011).
29. Ren, Y. & Wu, Y. Convolutional deep belief networks for feature extraction of EEG signal. In International Joint Conference on
Neural Networks (IJCNN), Beijing (2014).
30. Yang, H., Sakhavi, S., Ang, K. K. & Guan, C. On the use of convolutional neural networks and augmented CSP features for multi-
class motor imagery of EEG signals classification. In Annual International Conference of the IEEE Engineering in Medicine and
Biology Society (EMBC), Milan (2015).
31. Bashivan, P., Rish, I., Yeasin, M. & Codella, N. Learning representations from EEG with deep recurrent-convolutional neural
networks. arXiv, https://arxiv.org/abs/1511.06448 (2015)
32. Dai, G., Zhou, J., Huang, J. & Wang, N. HS-CNN: A CNN with hybrid convolution scale for EEG motor imagery classification. J.
Neural Eng. 17, 016025 (2020).
33. Zhang, C., Kim, Y.-K. & Eskandarian, A. EEG-inception: An accurate and robust end-to-end neural network for EEG-based motor
imagery classification. J. Neural Eng. 18(4), 046014 (2021).
34. Tangermann, M. et al. Review of the BCI competition IV. Front. Neurosci. 6, 00055 (2012).
35. Schlögl, A. Outcome of the BCI-competition 2003 on the Graz data set (Graz University of Technology, 2003).
36. Debnath, L. & Antoine, J.-P. Wavelet Transforms and Their Applications. Physics Today (2003).
37. Goodfellow, I. J., Shlens, J. & Szegedy, C. Explaining and Harnessing Adversarial Examples. arXiv, https://arxiv.org/abs/1412.6572
(2014).
38. Pfurtscheller, G. & Lopes Da Silva, F. H. Event-related EEG/MEG synchronization and desynchronization: Basic principles. Clin.
Neurophysiol. 110(11), 1842–1857 (1999).
39. He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and
Pattern Recognition (CVPR) (2016).
40. Suk, H.-I. & Lee, S.-W. Data-driven frequency bands selection in EEG-based brain-computer interface. In International Workshop on Pattern Recognition in NeuroImaging 25–28 (IEEE, 2011).
41. Gandhi, V., Arora, V., Behera, L., Prasad, G., Coyle, D. & McGinnity, T. EEG denoising with a recurrent quantum neural network for a brain-computer interface. In The 2011 International Joint Conference on Neural Networks (IEEE, 2011).
42. Shahid, S., Sinha, R. & Prasad, G. A bispectrum approach to feature extraction for a motor imagery based brain-computer interfac-
ing system. In 18th European Signal Processing Conference. IEEE, 2010 (2010).
43. Raza, H., Cecotti, H. & Li, Y. Adaptive learning with covariate shift-detection for motor imagery-based brain–computer interface.
Soft Comput. 20, 3085 (2016).
44. Gaur, P. et al. A sliding window common spatial pattern for enhancing motor imagery classification in EEG-BCI. IEEE Trans.
Instrum. Meas. 70, 1–9 (2021).
45. Barachant, A., Bonnet, S., Congedo, M. & Jutten, C. Classification of covariance matrices using a Riemannian-based kernel for
BCI applications. Neurocomputing 112, 172–178 (2013).
46. Lawhern, V. J. et al. EEGNet: A compact convolutional neural network for EEG-based brain–computer interfaces. J. Neural Eng.
15, 0560 (2018).
47. Ozdenizi, O. & Erdogmus, D. Information theoretic feature transformation learning for brain interfaces. IEEE Trans. Biomed. Eng.
67(1), 69–78 (2020).
48. Tibor Schirrmeister, R. et al. Deep learning with convolutional neural networks for EEG decoding and visualization. Hum. Brain
Mapp. 38(11), 5391–5420 (2017).
49. Zheng, Q., Zhu, F., & Heng, P.-A. Robust support matrix machine for single trial EEG classification. In IEEE Transactions on Neural
Systems and Rehabilitation Engineering (2018).
50. Shahid, S. & Prasad, G. Bispectrum-based feature extraction technique for devising a practical brain–computer interface. J. Neural
Eng. 8, 025014 (2011).
51. Lemm, S., Schäfer, C. & Curio, G. BCI competition 2003-data set III: Probabilistic modeling of sensorimotor μ rhythms for clas-
sification of imaginary hand movements. IEEE Trans. Biomed. Eng. 51, 1077–1080 (2004).
52. Ren, S., He, K., Girshick, R. & Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv,
https://arxiv.org/abs/1506.01497 (2015).
53. Jiang, X., Zhang, X. & Wu, D. Active learning for black-box adversarial attacks in EEG-based brain-computer interfaces. In IEEE Symposium Series on Computational Intelligence, Xiamen, China (2019).
54. Feng, B., Wang, Y. & Ding, Y. Saga: Sparse Adversarial Attack on EEG-Based Brain Computer Interface. In ICASSP 2021 - 2021
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada (2021).
Acknowledgements
This work is supported by the Ministry of Economics, Innovation, Digitization and Energy of the State of North
Rhine-Westphalia and the European Union, Grants GE-2-2-023A (REXO) and IT-2-2-023 (VAFES).
Author contributions
O.A. and M.S. jointly performed the analysis and wrote the main manuscript. Also, they prepared all figures
included in this work. S.D. reviewed the manuscript. T.G., I.I. and C.K. are the senior authors. They supervised
the entire work and the process of data analysis. They also streamlined the ideas and reviewed the manuscript.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Competing interests
The authors declare no competing interests.
Additional information
Supplementary Information The online version contains supplementary material available at https://doi.org/
10.1038/s41598-022-07992-w.
Correspondence and requests for materials should be addressed to O.A.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons licence, and indicate if changes were made. The images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.