A Dual-Branch Dynamic Graph Convolution Based Adaptive Transformer Feature Fusion Network for EEG Emotion Recognition
Abstract—Electroencephalograph (EEG) emotion recognition plays an important role in the brain-computer interface (BCI) field. However, most recent methods adopt shallow graph neural networks with a single temporal feature, which limits emotion classification performance. Furthermore, existing methods generally ignore the individual divergence between subjects, resulting in poor transfer performance. To address these deficiencies, we propose a dual-branch dynamic graph convolution based adaptive transformer feature fusion network with adapter-finetuned transfer learning (DBGC-ATFFNet-AFTL) for EEG emotion recognition. Specifically, a dual-branch graph convolution network (DBGCN) is first designed to effectively capture the temporal and spectral characterizations of EEG simultaneously. Second, the adaptive Transformer feature fusion network (ATFFNet) integrates the obtained feature maps with a channel-weight unit, highlighting the significant differences between channels. Finally, the adapter-finetuned transfer learning method (AFTL) is applied to cross-subject emotion recognition and proves parameter-efficient with few samples of the target subject. Competitive experimental results on three datasets show that our proposed method achieves promising emotion classification performance compared with state-of-the-art methods. The code of our proposed method will be available at: https://github.com/smy17/DANet.
Index Terms—EEG, emotion recognition, graph neural network, Transformer, transfer learning
spectral features simultaneously, which results in the insufficient extraction of EEG emotional information.

In addition, recent studies have demonstrated that deep learning methods are able to learn more discriminative features from data automatically [15]. Many researchers have started to explore high-level information in EEG emotion recognition with graph neural networks [16], fusing the extracted feature maps with shallow 2D convolution. For instance, Song et al. [17] applied a graph neural network to classify DE features, which efficiently modeled the connections between EEG channels with a dynamic adjacency matrix. To take the distribution of brain regions into account simultaneously, a novel representation of EEG features was discussed [18] and 4D convolution was applied to extract high-level feature maps. However, conventional CNNs can only focus on local spatial features of the brain network, which leads to the loss of patterns in higher-dimensional space. To avoid this limitation, attention units were applied with LSTM to improve invariance against emotional intensity fluctuations and to automatically adjust channel weights [35], which may provide a solution for efficiently fusing the feature maps obtained from GNNs.

Some recent studies have started to introduce cross-subject experiments to enable rapid application of emotion recognition in BCIs. Although classical subject-dependent methods [18] have been proved to show outstanding performance in emotion recognition, the risk of overfitting and the dependence on the amount of data limit their flexible application. To overcome these problems, Song et al. [17] and Li et al. [35] applied the leave-one-subject-out method to transfer the general emotion pattern of source subjects to the target subject. Although these methods achieved promising classification results, the non-negligible divergence between individuals and the simple transfer without adaptation may lead to misjudgment by the pretrained model, resulting in poor classification performance on the target subject.

To address the issues above, in this article, we propose a novel dual-branch dynamic graph convolution based adaptive transformer feature fusion network with adapter-finetuned transfer learning, namely DBGC-ATFFNet-AFTL, for EEG emotion recognition. First, both differential entropy and power spectral density features of each EEG segment are computed, and the dual-branch dynamic graph convolution (DBGC) network is designed to capture deeper temporal and spectral features in different frequency bands from the dual branches respectively. Second, the adaptive Transformer feature fusion network (ATFFNet) applies a self-attention mechanism to the different kinds of feature maps in consideration of the channel connections, in order to effectively capture the global pattern of the emotion status. Moreover, the adapter-finetuned transfer learning (AFTL) algorithm is proposed to efficiently avoid the overfitting problem caused by insufficient samples of the target subject, through finetuning only the Adapter modules. The proposed DBGC-ATFFNet-AFTL method is evaluated on three public datasets and gains promising performance compared with the state-of-the-art methods, demonstrating its efficacy in EEG emotion recognition.

Main contributions of this paper are summarized as follows:

1) We propose a novel DBGC-ATFFNet-AFTL method for EEG emotion recognition, which integrates high-level features from dual branches into the deep learning network. The proposed DBGC-ATFFNet-AFTL method performs emotion classification more accurately and efficiently than the widely used dynamic graph neural network.

2) A dual-branch dynamic graph convolution block is developed to acquire the temporal and spectral characteristics through dual branches, which overcomes the weakness of insufficient emotional information extraction with a single encoding path.

3) We design an adaptive Transformer feature fusion network to fuse high-level temporal and spectral features simultaneously, efficiently associating the spatial distribution of EEG channels with deeply encoded emotion characteristics and thus boosting the classification performance.

4) We propose an adapter-finetuned transfer learning algorithm to realize rapid cross-subject EEG emotion recognition through finetuning the Adapter modules. It effectively avoids the overfitting problem of the subject-dependent method and shows outstanding performance with quite a small number of trainable parameters.

2 METHODOLOGY

The overall architecture of our proposed DBGC-ATFFNet-AFTL is outlined in Fig. 1 and summarized as follows. First, differential entropy (DE) and power spectral density (PSD) features of the EEG segments are calculated by the inverse fast Fourier transform (IFFT) and the short-time Fourier transform (STFT), and the DBGC module captures both the temporal DE and spectral PSD information of the EEG signals using dual branches of graph convolution; second, the ATFFNet, which consists of a multi-head self-attention mechanism and a subject-adaptive unit (SAU), effectively fuses the obtained feature maps; finally, the classification block gives the final recognition results. Additionally, the pipeline of our proposed adapter-finetuned transfer learning is as follows: 1) We first divide the whole dataset into two parts: a randomly selected i-th subject is the target subject, while the remaining (N-1) subjects are the source subjects. Specifically, half of the target subject's samples are used for finetuning, while the other half are used to evaluate the classification performance; 2) We then pretrain the proposed model on all samples of the source subjects and update all trainable parameters; 3) Based on the training samples of the target subject, we finetune only the parameters of the Adapter, which is embedded in the SAU, to bridge the gap between the target subject and the source subjects; 4) Finally, the well-trained model is evaluated on the test samples of the target subject. In the following three subsections, we discuss in detail the specific implementation of the proposed innovation modules.
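To make the finetuning step (3) above concrete, the following is a minimal PyTorch-style sketch of adapter-only finetuning; `finetune_adapters_only` and the convention of marking Adapter parameters by the substring "adapter" in their names are illustrative assumptions, not the authors' released code.

```python
import torch
from torch import nn

def finetune_adapters_only(model: nn.Module, x: torch.Tensor, y: torch.Tensor,
                           epochs: int = 200, lr: float = 1e-3) -> nn.Module:
    """Freeze the pretrained backbone and update only Adapter parameters
    on the target subject's finetuning half (step 3 of the pipeline).
    Selecting Adapters via the substring 'adapter' in parameter names is
    an illustrative convention, not the authors' implementation."""
    adapter_params = []
    for name, p in model.named_parameters():
        p.requires_grad = "adapter" in name
        if p.requires_grad:
            adapter_params.append(p)

    opt = torch.optim.Adam(adapter_params, lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(model(x), y)   # x: EEG features, y: emotion labels
        loss.backward()
        opt.step()
    return model
```

Because only the bottleneck Adapter weights receive gradients, the number of trainable parameters during transfer stays small, which is what makes the AFTL step fast and resistant to overfitting on the target subject's few samples.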
2.1 Dual-Branch Dynamic Graph Convolution Network

The original EEG signals are defined as $E = \{(X_i, y_i) \mid i = 1, 2, \ldots, K\}$, where $X_i \in \mathbb{R}^{C \times S}$ is a two-dimensional array representing the $i$-th EEG trial with $C$ channels and $S$ samples, $K$ is the total number of EEG trials, and $y_i$ is the label of $X_i$, taking its value from a label set $L$ of $M$ classes in an emotion recognition task. For example, the label set for the discrete datasets (i.e., SEED) consists of the explicit emotion statuses $L = \{l_1 = \text{"neutral"}, l_2 = \text{"happy"}, l_3 = \text{"sad"}\}$.

As in previous works [13], [20], we first split the original EEG signals with a non-overlapping window of length $T$ s; it has been proved that real-time emotion recognition can be approximately realized, with emotional information largely preserved, when $T$ is set to 1 [17]. Each segment is assigned the same label as the original EEG signal.

According to the experimental results in recent studies [17], [18], differential entropy (DE) and power spectral density (PSD) features have been proved to achieve promising performance in depicting emotional fluctuations. However, for different types of emotional stimulation, the two features contribute differently to the final recognition result. Under video stimulation (i.e., the SEED dataset), for example, DE features perform better in identifying the emotions of subjects [12], while PSD features prove to give outstanding results under music stimulation (i.e., the DEAP dataset) [19]. Therefore, we choose both DE and PSD as the basic input features for our proposed model.

The two types of features of all EEG channels are computed for the five frequency bands [20] (i.e., $\delta$ [1-4 Hz], $\theta$ [4-8 Hz], $\alpha$ [8-14 Hz], $\beta$ [14-31 Hz] and $\gamma$ [31-51 Hz]), all of which have been proven to be effective for emotion recognition [5]. The DE feature for a Gaussian distribution is defined as follows [12]:

$$h_d(X) = -\int_{-\infty}^{+\infty} \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right) \ln\left[\frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)\right] dx = \frac{1}{2}\ln\left(2\pi e \sigma^2\right) \qquad (1)$$

where $X$ denotes the Gaussian distribution $N(\mu, \sigma^2)$, $x$ is a variable, and $\pi$ and $e$ are constants.

The PSD features can be computed using the short-time Fourier transform (STFT) and are defined by [17]:

$$h_p(X) = E\left[x^2\right] \qquad (2)$$

where $x$ is a signal variable acquired from a certain frequency band on a certain EEG channel.
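For concreteness, the band-wise feature extraction can be sketched as follows. This is a simplified illustration that uses time-domain band-pass filtering with the closed forms of Eqs. (1) and (2); the paper's own IFFT/STFT-based extraction may differ in detail.

```python
import numpy as np
from scipy.signal import butter, filtfilt

# The five frequency bands used in the paper.
BANDS = {"delta": (1, 4), "theta": (4, 8), "alpha": (8, 14),
         "beta": (14, 31), "gamma": (31, 51)}

def de_psd_features(segment: np.ndarray, fs: int = 200):
    """Return the DE and PSD feature tensors F_D, F_P of shape (C, B)
    for one EEG segment of shape (C, S). Band-pass filtering plus a
    time-domain variance estimate is used here, with DE given by the
    closed form of Eq. (1) and PSD by Eq. (2)."""
    F_D = np.zeros((segment.shape[0], len(BANDS)))
    F_P = np.zeros_like(F_D)
    for j, (lo, hi) in enumerate(BANDS.values()):
        b, a = butter(4, [lo, hi], btype="bandpass", fs=fs)
        x = filtfilt(b, a, segment, axis=1)
        F_D[:, j] = 0.5 * np.log(2 * np.pi * np.e * x.var(axis=1))  # Eq. (1)
        F_P[:, j] = np.mean(x ** 2, axis=1)                          # Eq. (2)
    return F_D, F_P
```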
Therefore, we extract the DE feature tensor and the PSD feature tensor, $F_D, F_P \in \mathbb{R}^{C \times B}$, from each segment, where $C$ is the number of EEG channels and $B$ is the number of frequency bands, respectively. Since these extracted features are relatively independent and fail to fully consider the effects of different brain regions on emotion, a DBGC module is designed to further explore deeper temporal and spectral features together with the channel relationships. The layout of the DBGC is depicted in Fig. 2; it includes two synchronized graph-convolution branches sharing a deeply encoded adjacency matrix.

Fig. 2. The layout of the dual-branch dynamic graph convolution network.

Based on the previously extracted DE and PSD features, a dynamic adjacency matrix is proposed to model the connections among EEG channels [21], [22]. We first randomly initialize an adjacency matrix $A \in \mathbb{R}^{C \times C}$, whose $(i,j)$-th element measures the coupling strength between the $i$-th and $j$-th EEG channels. In this way, $A$ treats every channel as densely related to every other, taking direction and strength into account simultaneously. The matrix is then encoded using a Tanh nonlinearity to simulate the directional dependencies between different channels as follows:

$$\tilde{A}_d = \sigma\left(W_2 \, \delta\left(W_1 \tilde{A}\right)\right) \qquad (3)$$

where $\tilde{A} \in \mathbb{R}^{(C \times C) \times 1}$ is vectorized from $A$, $W_1 \in \mathbb{R}^{\frac{C \times C}{r} \times (C \times C)}$ and $W_2 \in \mathbb{R}^{(C \times C) \times \frac{C \times C}{r}}$ are weight matrices, $\delta(\cdot)$ and $\sigma(\cdot)$ are the ELU and Tanh functions respectively, and $r$ is the reduction ratio. A dense adjacency matrix $A_d \in \mathbb{R}^{C \times C}$ is therefore obtained by reshaping $\tilde{A}_d \in \mathbb{R}^{(C \times C) \times 1}$ into $\mathbb{R}^{C \times C}$, where the $(i,j)$-th entry is learnable and reflects the directional dependency between the $i$-th and $j$-th EEG channels. We then adopt a rectified linear unit (ReLU) to penalize weak channel couplings, and as a result a non-negative adjacency matrix $A_{ds}$ is achieved. Thus, the graph is defined as $G = \{V, F_D, F_P, A_{ds}\}$, where $V$ is the vertex set with $|V| = C$ nodes, node attributes …
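The adjacency construction above can be summarized in a short PyTorch sketch; the reduction ratio default and the class name `DynamicAdjacency` are assumptions for illustration.

```python
import torch
from torch import nn
import torch.nn.functional as F

class DynamicAdjacency(nn.Module):
    """Sketch of the encoded adjacency matrix of Eq. (3): a randomly
    initialized dense matrix A is vectorized, passed through a
    bottleneck with reduction ratio r (ELU, then Tanh), reshaped back
    to C x C, and rectified so that weak couplings are suppressed."""
    def __init__(self, n_channels: int, r: int = 4):
        super().__init__()
        c2 = n_channels * n_channels
        self.n = n_channels
        self.A = nn.Parameter(torch.randn(n_channels, n_channels))  # A
        self.w1 = nn.Linear(c2, c2 // r, bias=False)                # W_1
        self.w2 = nn.Linear(c2 // r, c2, bias=False)                # W_2

    def forward(self) -> torch.Tensor:
        a_vec = self.A.flatten()                                    # vectorized A
        a_enc = torch.tanh(self.w2(F.elu(self.w1(a_vec))))          # Eq. (3)
        return torch.relu(a_enc.view(self.n, self.n))               # A_ds >= 0
```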
2.2 Adaptive Transformer Feature Fusion Network

$$H_T' = \mathrm{MHSA}\left(\mathrm{LN}\left(H_C\right)\right) + H_C$$
$$H_T = \mathrm{SAU}\left(\mathrm{LN}\left(H_T'\right)\right) + H_T' \qquad (5)$$
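A minimal sketch of the pre-norm residual updates in Eq. (5) follows; the SAU is passed in as a module (e.g., an MLP containing the Adapter described below), and the head count and widths are assumptions rather than the paper's exact configuration.

```python
import torch
from torch import nn

class ATFFBlock(nn.Module):
    """Sketch of Eq. (5): multi-head self-attention and the
    subject-adaptive unit (SAU) each act on a layer-normalized input
    and are added back through skip-connections."""
    def __init__(self, dim: int, heads: int, sau: nn.Module):
        super().__init__()
        self.ln1 = nn.LayerNorm(dim)
        self.ln2 = nn.LayerNorm(dim)
        self.mhsa = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.sau = sau

    def forward(self, h_c: torch.Tensor) -> torch.Tensor:
        z = self.ln1(h_c)
        attn_out, _ = self.mhsa(z, z, z)           # MHSA(LN(H_C))
        h_t_prime = attn_out + h_c                 # H_T' = MHSA(LN(H_C)) + H_C
        return self.sau(self.ln2(h_t_prime)) + h_t_prime  # Eq. (5)
```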
$$Z = A_{ds} \, \mathrm{softmax}\left(\frac{QK^T}{\sqrt{d_k}}\right) V \qquad (7)$$

Eq. (7) calculates the weights of the global spatial connections between all EEG channels and fuses the different kinds of extracted features efficiently.
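The adjacency-modulated attention of Eq. (7) can be sketched as follows, assuming per-channel query/key/value projections are already computed.

```python
import torch
import torch.nn.functional as F

def adjacency_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor,
                        a_ds: torch.Tensor) -> torch.Tensor:
    """Sketch of Eq. (7): scaled dot-product attention whose output is
    modulated by the learned non-negative adjacency A_ds, so channel
    couplings reweight the global attention pattern.
    q, k, v: (C, d_k) per-channel projections; a_ds: (C, C)."""
    d_k = q.size(-1)
    attn = F.softmax(q @ k.transpose(-2, -1) / d_k ** 0.5, dim=-1)
    return a_ds @ attn @ v      # Z = A_ds softmax(QK^T / sqrt(d_k)) V
```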
Moreover, we add an Adapter module to the subject-adaptive unit (SAU), alongside the common multi-layer perceptron [23], for the purpose of transfer learning. The architecture of the Adapter module is depicted in Fig. 3; it consists of a bottleneck formed by two feed-forward layers with an ELU activation unit. To make it more parameter-efficient, the Adapter also contains a skip-connection. Adapter modules make general architectural modifications that adapt a pretrained model to the target subject while adding only a small number of new parameters. A detailed description of the transfer learning procedure is given in the next section. Finally, the fused feature map is obtained by processing the output of the self-attention mechanism with matrix multiplication and layer normalization.
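The Adapter described above maps directly to a small module; this is a minimal sketch in which the bottleneck width (16) is an assumed value.

```python
import torch
from torch import nn
import torch.nn.functional as F

class Adapter(nn.Module):
    """Sketch of the Adapter: a bottleneck of two feed-forward layers
    with an ELU activation in between and a skip-connection, adding
    only a small number of new parameters per block."""
    def __init__(self, dim: int, bottleneck: int = 16):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)    # down-projection
        self.up = nn.Linear(bottleneck, dim)      # up-projection

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        return h + self.up(F.elu(self.down(h)))   # skip-connection
```

The skip-connection means that an Adapter initialized near zero leaves the pretrained block's behavior almost unchanged, which is why only these few parameters need finetuning per target subject.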
2.3 Classification Block

The classification block in the proposed model is designed to give the final affective computing results based on the fused high-level features. We first flatten all the feature maps into a one-dimensional vector and feed them into two fully connected layers. Then, the Softmax function computes the classification probabilities from the output vectors, and the class with the maximum probability is taken as the classification result.
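A minimal sketch of this block follows; the hidden width (128) and the activation between the two fully connected layers are assumptions.

```python
import torch
from torch import nn

class ClassificationBlock(nn.Module):
    """Sketch of the classification block: flatten the fused feature
    maps, apply two fully connected layers, and take the class with
    the maximum Softmax probability."""
    def __init__(self, in_features: int, n_classes: int):
        super().__init__()
        self.fc1 = nn.Linear(in_features, 128)
        self.fc2 = nn.Linear(128, n_classes)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        logits = self.fc2(torch.relu(self.fc1(h.flatten(1))))
        probs = torch.softmax(logits, dim=-1)     # classification probabilities
        return probs.argmax(dim=-1)               # predicted emotion class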
Algorithm 1. The Learning Rate and Parameter Update in the Proposed DBGC-ATFFNet-AFTL Method

Input: Raw EEG signals $x$, the corresponding class labels $y$, the maximum epoch $t$, learning rate $\eta$, regularization weight $\lambda$, the DBGC-ATFFNet-AFTL $\mathrm{Net}(\cdot)$;
Output: The affective computing result $O$;
1: Initialize the parameters of the network as $\theta^{(0)}$;
2: Initialize the data $D_i$ in one batch with $i = 1, 2, \ldots, N$;
3: Initialize $t = 200$, $\eta = 1 \times 10^{-3}$, $q = 0$;
4: while $q \neq t$ do
5:   for $i = 1$ to $N$ do
6:     Generate the conditional probabilities $p_j = \mathrm{Net}(\theta^{(q)}, D_i)$;
7:     Calculate the loss $J^{(q)}$ on $x$ by Eq. (8);
8:     Calculate the gradient $g = \nabla J^{(q)}$;
9:     Update the parameters: $\theta^{(q+1)} \leftarrow \theta^{(q)} - \eta g$;
10:  end for
11:  $q \leftarrow q + 1$;
12: end while
13: Get the classification result $O = \mathrm{Net}(\theta^{(t)}, x)$;
To summarize, the optimization procedure of our proposed DBGC-ATFFNet-AFTL is shown in Algorithm 1. The model is trained by minimizing the cross-entropy loss $J$ between the model predictions and the labels, defined by:

$$J = -\sum_{j=1}^{N} \sum_{i=1}^{M} \log\left(p_i\right) v\left(y_i = l_i\right) + \lambda \lVert \theta \rVert \qquad (8)$$

where $p_i$ is the $i$-th conditional probability generated by the model, $l_i$ is the $i$-th class from the label set $L$, and $v(\cdot)$ is the indicator function. In order to avoid overfitting of the proposed model, we also introduce a trade-off regularization term in Eq. (8), where $\theta$ refers to the learnable parameters of the model and $\lambda$ is the regularization weight.
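A minimal PyTorch-style sketch of Algorithm 1 with the loss of Eq. (8) is given below; the choice of plain SGD and the value of the regularization weight `lam` are assumptions.

```python
import torch
from torch import nn

def train_net(model: nn.Module, loader, epochs: int = 200,
              lr: float = 1e-3, lam: float = 1e-4) -> nn.Module:
    """Sketch of Algorithm 1: minimize the cross-entropy loss of
    Eq. (8) plus a norm penalty over the learnable parameters as the
    trade-off regularization term."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    ce = nn.CrossEntropyLoss()
    for _ in range(epochs):                       # while q != t
        for x, y in loader:                       # for each batch D_i
            opt.zero_grad()
            loss = ce(model(x), y)                # cross-entropy part of Eq. (8)
            loss = loss + lam * sum(p.norm() for p in model.parameters())
            loss.backward()
            opt.step()                            # theta <- theta - eta * g
    return model
```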
3 EXPERIMENTAL RESULTS

3.1 EEG Datasets

The effectiveness of the proposed DBGC-ATFFNet-AFTL is evaluated on three public EEG datasets, described below:

1) SJTU Emotion EEG Dataset (SEED) [12]: contains EEG data of 15 subjects (7 males and 8 females), collected via 62 EEG electrodes while the subjects watched fifteen Chinese film clips covering three types of emotions, i.e., negative, positive and neutral. Each subject has three sessions and there are 15 trials (5 for each class) per session. The EEG signals were recorded by 62 electrodes and downsampled to 200 Hz.

2) SJTU Emotion EEG Dataset IV (SEED IV) [25]: comprises EEG data of 15 subjects (7 males and 8 females) recorded over 62 channels. The experimental setting is the same as SEED. The data were collected while participants watched movies evoking four types of emotions, namely neutral, sad, fear, and happy; the eye movement features are not used in this paper. Each movie lasts around 2 minutes. Three sessions of data were collected, and each session comprises 24 trials/movies per subject.

3) Database for Emotion Analysis using Physiological Signals (DEAP) [19]: consists of EEG data from 32 subjects (16 males and 16 females) who were shown 40 music videos while their physiological signals were recorded. Additionally, the subjects gave rating values for four emotional dimensions (valence, arousal, liking, and dominance) using continuous values between 1 and 9. Each DEAP recording is 63 s long, sampled at 128 Hz over 32 channels.

As discussed in [26], there are repeated sessions in SEED and the first one reflects stronger emotion feedback that is more reliable than the latter two sessions; we therefore only use the first session of each subject to ensure consistency of evaluation in the experiments of this paper. We strictly follow the evaluation protocol in [5] for all three datasets, and detailed information on the three datasets can be found in Table 1.

TABLE 1
Details of SEED, SEED IV, and DEAP

Item                 SEED        SEED-IV      DEAP
Channel Num          62          62           32
Subject Num          15          15           32
Video Num            15          24           40
Stimulus Materials   Film Clips  Film Clips   Music Videos
Emotion Status       3 classes   4 classes    —
TABLE 2
The Overall Comparison of Classification Performance on SEED Dataset
3.2 Evaluation Metrics and Models

The proposed DBGC-ATFFNet-AFTL is evaluated by the classification accuracy (Acc) [12], F1-score (F1) [27], and area under the curve (AUC) [28], respectively. Moreover, the standard deviation (Std) [29] is introduced to assess the robustness of our proposed model. Specifically, for a certain subject, the Std value is calculated over the accuracies of multiple testing folds, while for the average result, Std is obtained over the average accuracies of all subjects. For comparison with the proposed method, we use three baseline models, all retested on the three datasets in the same experimental environment. We faithfully reproduced these deep learning models; brief introductions of the three baselines are given as follows:

1) DGCNN [17]: This method uses a dynamic adjacency matrix to simulate the channel relationships with shallow DE features, and has shown its ability in EEG feature extraction.

2) 4D-CRNN [18]: This is a CNN-based model which also combines a recurrent neural network and DE features to integrate both spatial and temporal information.

3) resHGCN [27]: This is a deep learning model with a residual architecture and an adjacency matrix encoded by multiple fully connected layers, which largely retains the DE features of the preprocessed EEG signals.

Moreover, to further investigate the importance of each feature in our DBGC-ATFFNet-AFTL, we carry out an ablation study with two simplified models, each of which consists of only one feature branch.

3.3 Performance of Subject-Dependent Experiments

In order to evaluate the efficacy of the proposed framework, we compare our DBGC-ATFFNet-AFTL with the above baseline models. Tables 2 and 3 list the classification results of the subject-dependent experiments on the SEED and SEED IV datasets. From Table 2 we can learn that all three baseline models achieve relatively high average accuracies on the 15 subjects of the SEED dataset. Notably, our proposed DBGC-ATFFNet-AFTL has the highest average Acc, F1, and AUC. In terms of Acc, our DBGC-ATFFNet-AFTL reaches 97.31%, which is 1.93%, 2.87%, and 7.92% higher than the baseline methods respectively. The obvious improvement in Acc shows that our proposed method better fuses the two types of features; in the meantime, the relatively low standard deviation of our method indicates its robust high-level feature extraction.

In terms of the SEED IV dataset, our proposed approach achieves 89.97%, 0.898 and 0.971 on Acc, F1, and AUC, performing better than all three baseline models. Moreover, a relatively low standard deviation of 2.85% demonstrates the robustness of our proposed method. Since there are in total 32 subjects in the DEAP dataset, we summarize those experimental results in Fig. 4 due to space limitations. The confusion matrices are utilized for validation in Fig. 5. We find that our proposed DBGC-ATFFNet-AFTL achieves encouraging results for VA-space (4-class: HVHA, HVLA, LVHA, LVLA) classification on the Acc, F1 and AUC metrics. Our DBGC-ATFFNet gains outstanding accuracy results on all three datasets, with an impressive recognition performance close to 1 for the three kinds of emotions on the SEED dataset. As a result, the above experimental results prove the ability of our proposed method in EEG emotion classification.

To demonstrate the superiority of our DBGC-ATFFNet-AFTL on emotion recognition, Table 4 compares classification results reported in recent years on both the SEED and SEED IV datasets. Since most methods only report Acc and Std in their papers, we use these two metrics for comparison here. From Table 4, we can learn that our proposed method has a better classification performance than most of these methods on the SEED and SEED IV datasets. Compared with the methods which adopt a single DE feature with GNNs, our method fuses both the DE and PSD features and gains accuracy increases of 1.21% and 7.24% on the subject-dependent task of the SEED and SEED IV datasets. Moreover, in comparison with 4D-CRNN, which uses convolution operations, our proposed method with the Adaptive Transformer attends not only to the spatial location relationships but also to the global connections of
TABLE 3
The Overall Comparison of Classification Performance on SEED IV Dataset

different EEG channels, reaching 3.32% higher accuracy. Especially on the SEED IV dataset, which contains shorter EEG recordings, our method learns the common pattern of the feature distribution across subjects and shows a more robust classification performance. Additionally, the performance comparison on the three classification tasks of the DEAP dataset is listed in Table 5; our proposed method performs better on the Valence, Arousal and VA-Space classification tasks and gains relatively low standard deviations of 3.27%, 2.89% and 3.10% respectively. Consequently, it is clear that our DBGC-ATFFNet-AFTL can distinguish between EEG signals of different emotions more accurately and effectively.

TABLE 5
Comparison With the State-of-the-Art Methods of Subject-Dependent on DEAP Dataset

3.4 Performance of Cross-Subject Experiments

Our proposed method also achieves an impressive classification performance in cross-subject experiments. As DEAP is a dimensional dataset [19] and there was no previous work adopting this dataset for similar testing, the experiments are carried out on the SEED and SEED IV datasets for further comparison. There are in total 15 subjects in each dataset; to apply the adapter-finetuned transfer learning algorithm, we split the whole dataset into two parts: the EEG data of 14 subjects is used for pretraining, and the EEG data of the remaining subject is used for finetuning and for evaluating the performance of the proposed method. The detailed process is depicted in Fig. 1.

Firstly, our proposed approach is pretrained on the source data; we then freeze all the learnable parameters except the Adapter modules. It is proven through a series of experiments that our adapter-finetuned transfer learning strategy achieves the optimal classification performance with a relatively low amount of data when 50 percent of the target samples are randomly chosen to finetune the parameters of the Adapters. Finally, based on the well-trained model, we carry out the evaluation experiments on the remaining EEG data of the target subject. To demonstrate the superiority of our Adapter-finetuning method on cross-subject emotion recognition, Table 6 compares classification results reported in recent years on both the SEED and SEED IV datasets. Our method gains accuracies of 94.39% and 89.78% on the SEED and SEED IV datasets respectively. Especially on the SEED IV dataset, which contains shorter EEG recordings, our proposed method learns the common pattern of the feature distribution from the source subjects and shows a more robust classification performance on the target subject.

TABLE 6
Comparison With the State-of-the-Art Methods of Cross-Subject on SEED and SEED IV Datasets

Method       Year   SEED Acc (Std %)   SEED-IV Acc (Std %)
DGCNN [17]   2018   79.95 (9.02)       —
RGNN [30]    2020   85.30 (6.72)       73.84 (8.02)
TANN [34]    2021   84.41 (8.75)       68.00 (8.35)
BiHDM [35]   2021   85.40 (7.53)       69.03 (8.66)
Ours         2022   94.39 (3.23)       89.78 (3.09)

Bold fonts indicate best results.
4 DISCUSSIONS

4.1 Efficacy of the Dual-Branch Dynamic Graph Convolution Network

To validate the superiority of our extracted features, t-SNE visualization is utilized. t-SNE visualizes the extracted EEG features in a 2D embedding space. The experimental results for all three datasets are presented in Fig. 6. From the t-SNE plots, compared with the methods that apply DGCNN [17], 4D-CRNN [18] and resHGCN [27] to single DE features, our proposed method obtains an outstanding classification performance with considerable inter-class distance on the discrete emotion datasets SEED and SEED IV, while there is a relatively dense distribution of features for DEAP, which is a dimensional emotion dataset. It is obvious that the introduction of PSD features compensates for the lack of information in the spectral domain of the EEG signals. Additionally, we convolve these two types of pre-computed features with dual branches of graph convolution to make the most of their different distributions in the temporal and spectral domains. This enhancement indicates that it is helpful to input more pre-processed features, and that there is probably complementary information in these two types of features, both beneficial to the extraction of high-level features.

Fig. 6. The t-SNE visualization in 2D embedding space of different types of features on all three datasets.
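A visualization of this kind can be reproduced with a short sketch; the perplexity value and plotting style are assumed defaults, not the paper's exact settings.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

def plot_tsne(features: np.ndarray, labels: np.ndarray, title: str) -> None:
    """Sketch of the Fig. 6 visualization: project extracted EEG
    features into a 2D embedding with t-SNE and color the points by
    emotion label."""
    emb = TSNE(n_components=2, perplexity=30.0, init="pca").fit_transform(features)
    plt.scatter(emb[:, 0], emb[:, 1], c=labels, cmap="tab10", s=5)
    plt.title(title)
    plt.show()
```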
TABLE 7
Ablation Study of the Adaptive Transformer Feature Fusion Network on All Three Datasets

Method        SEED Acc (Std %)   SEED IV Acc (Std %)   DEAP Acc (Std %)
w/o ATFFNet   95.44 (3.06)       88.75 (3.72)          90.46 (3.98)
w/ ATFFNet    97.31 (1.47)       89.97 (2.85)          91.98 (3.10)

Bold fonts indicate best results; "w/o" denotes "without" and "w/" denotes "with".

4.2 Efficacy of the Adaptive Transformer Feature Fusion Network

To assess the effectiveness of the Adaptive Transformer Feature Fusion Network, we conduct ablation experiments. From the results listed in Table 7, we can find that accuracy increases from 95.44%, 88.75% and 90.46% to 97.31%,
89.97%, and 91.98% on the three datasets, with relatively low standard deviations. In general, the outstanding performance of our DBGC-ATFFNet-AFTL method indicates that the ATFFNet unit effectively fuses the previously extracted features while exploring the connections between EEG channels more comprehensively. Unlike the 2D and 3D convolutions commonly used in previous works [18], the Adaptive Transformer in our ATFFNet avoids over-dependence on the explicit spatial location distribution. Moreover, features in sequential form help decrease the number of trainable parameters of the model and reduce the risk of overfitting. Additionally, the Channel-Weight unit reinforces the model's learning of the channel connections with the existing trainable parameters.

4.3 Efficacy of the Adapter-Finetuned Transfer Learning

In order to assess the effectiveness of our adapter-finetuned transfer learning method, we apply the same protocol to the corresponding subject-dependent experiments; the results of the two training methods on the SEED and SEED IV datasets are compared in Fig. 7 and Table 8, respectively. We explore the effectiveness of the Adapter-finetuning method from two perspectives. In terms of classification performance, from Fig. 7 we can learn that the proposed DBGC-ATFFNet-AFTL with finetuned Adapters reaches average accuracies of 94.39% and 89.78% on the two datasets, both higher than the non-transfer DBGC-ATFFNet method. Especially on the SEED IV dataset, which has a relatively small number of EEG samples, the accuracy of the model using transfer learning is 4.81% higher than the non-transfer one. The experimental results indicate that the transfer learning strategy indeed helps the model extract higher-level features that are easily missed when analyzing a single subject. Thus, it is clear that transfer learning based on finetuning the Adapter modules contributes to cross-subject EEG emotion classification. In terms of computational cost, as shown in Table 8, our proposed Adapter-finetuned DBGC-ATFFNet spends less time on training and involves fewer parameters, which enhances the efficiency of transfer learning. It is worth mentioning that the training and testing of the deep learning model are performed on an Nvidia Tesla V100 GPU with 32 GB memory. In this environment, it takes on average just 53.3 s and 22.2 s to finish the finetuning step and apply the pretrained model to the target subject on the SEED and SEED IV datasets respectively, while also significantly reducing the number of trainable parameters compared with the non-transfer model.

TABLE 8
Comparison of Parameters and Computational Cost on SEED and SEED IV Datasets

4.4 Saliency Map Analysis of EEG Channels

In order to demonstrate the interpretability of our proposed approach, we apply the saliency map method [36], [37] based on gradient propagation, which is widely used in the computer vision area. Fig. 8 depicts the average neural patterns for positive, neutral and negative emotions in the five frequency bands. The visualization on the scalp maps presents the spatial distributions relevant to the emotion recognition tasks, directly reflecting how our proposed method behaves when making inferences on EEG signals. For example, from Fig. 8, the emotionally active areas are mainly concentrated at the left prefrontal and parietal sites. Brain activities associated with emotions generally have a significant response in the β band. The findings of these saliency maps are in line with existing emotion studies [38], [39], [40]. Beyond these findings, we can further learn that the responses of both the β and γ bands are more notable in all three kinds of emotional states. For neutral emotions, the neural patterns tend to be gentler compared with the positive and negative ones, while for negative emotions, there
are significantly higher β responses at the prefrontal cortex, and the lateral temporal and parietal areas activate more, similarly to positive emotions.
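A per-channel gradient saliency of the kind underlying Fig. 8 can be sketched as follows; the function name and the reduction over frequency bands are illustrative assumptions, and the model is assumed to output class logits.

```python
import torch
from torch import nn

def channel_saliency(model: nn.Module, x: torch.Tensor,
                     target_class: int) -> torch.Tensor:
    """Sketch of the gradient-based saliency map behind Fig. 8:
    backpropagate the target-class score to the input features and
    take the absolute gradient, reduced to one value per channel,
    which can then be rendered as a scalp topography.
    x: a single input of shape (1, C, B)."""
    x = x.clone().requires_grad_(True)
    score = model(x)[0, target_class]             # class score for this input
    score.backward()
    return x.grad.abs().amax(dim=-1).squeeze(0)   # (C,) per-channel saliency
```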
4.5 Limitations and Future Directions

Although the proposed method achieves outstanding classification results, our present work still suffers from several limitations. First, the features extracted from the preprocessed EEG signals are constrained by prior knowledge, which may lead to the loss of useful information hidden in the original EEG signals. Therefore, an important piece of future work is to handle affective computing tasks with different types of feature maps extracted directly from the raw signals by end-to-end deep learning models. Second, although our method shows its effectiveness in cross-subject emotion classification, we involve all the available EEG channels in transfer learning. It is possible that several channels make a greater contribution to affective computing while others provide noisy signals that are harmful to the analysis. Thus, in future work we will consider adaptively selecting a few EEG channels with outstanding performance to improve our method.

5 CONCLUSION

In this article, we propose a novel dual-branch dynamic graph convolution based adaptive Transformer feature fusion network (DBGC-ATFFNet-AFTL) for EEG emotion recognition. Specifically, our proposed DBGC-ATFFNet-AFTL first computes differential entropy and power spectral density features from the preprocessed EEG signals. Next, the dual-branch dynamic graph convolution network extracts high-level EEG information of different emotions through two branches of feature inputs. Moreover, the adaptive Transformer feature fusion network is further employed to fuse the obtained features while simultaneously learning the connection relationships of the EEG channels. Additionally, we introduce adapter-finetuned transfer learning to the cross-subject emotion classification task by finetuning the Adapter modules. We conduct experiments on three public emotional EEG datasets to evaluate the effectiveness of our proposed DBGC-ATFFNet-AFTL method. The experimental results show that our proposed method performs better in accuracy, F1-score and AUC value. Detailed discussions demonstrate that our proposed approach is able to effectively learn emotion information from EEG signals, giving it a promising application prospect in the affective computing area.

REFERENCES

[1] L. H. He, D. Hu, M. Wan, Y. Wen, K. M. Von Deneen, and M. C. Zhou, "Common Bayesian network for classification of EEG-based multiclass motor imagery BCI," IEEE Trans. Syst., Man, Cybern.: Syst., vol. 46, no. 6, pp. 843-854, Jun. 2016.
[2] L. Fiorini, G. Mancioppi, F. Semeraro, H. Fujita, and F. Cavallo, "Unsupervised emotional state classification through physiological parameters for social robotics applications," Knowl.-Based Syst., vol. 190, 2020, Art. no. 105217.
[3] S. Katsigiannis and N. Ramzan, "DREAMER: A database for emotion recognition through EEG and ECG signals from wireless low-cost off-the-shelf devices," IEEE J. Biomed. Health Inform., vol. 22, no. 1, pp. 98-107, Jan. 2018.
[4] J. J. Yan, W. M. Zheng, Q. Y. Xu, G. M. Lu, H. B. Li, and B. Wang, "Sparse kernel reduced-rank regression for bimodal emotion recognition from facial expression and speech," IEEE Trans. Multimedia, vol. 18, no. 7, pp. 1319-1329, Jul. 2016.
[5] W. L. Zheng, J. Y. Zhu, and B. L. Lu, "Identifying stable patterns over time for emotion recognition from EEG," IEEE Trans. Affect. Comput., vol. 10, no. 3, pp. 417-429, Jul.-Sep. 2017.
[6] J. C. Britton, K. L. Phan, S. F. Taylor, R. C. Welsh, K. C. Berridge, and I. Liberzon, "Neural correlates of social and nonsocial emotions: An fMRI study," Neuroimage, vol. 31, no. 1, pp. 397-409, 2006.
[7] E. Lotfi and M.-R. Akbarzadeh-T, "Practical emotional neural networks," Neural Netw., vol. 59, pp. 61-72, 2014.
[8] G. Pfurtscheller et al., "The hybrid BCI," Front. Neurosci., vol. 4, 2010, Art. no. 3.
[9] S. Alhagry, A. A. Fahmy, and R. A. El-Khoribi, "Emotion recognition based on EEG using LSTM recurrent neural network," Emotion, vol. 8, no. 10, pp. 355-358, 2017.
[10] M. Murugappan, M. Rizon, R. Nagarajan, and S. Yaacob, "Inferring of human emotional states using multichannel EEG," Eur. J. Sci. Res., vol. 48, no. 2, pp. 281-299, 2010.
[11] B. Reuderink, C. Mühl, and M. Poel, "Valence, arousal and dominance in the EEG during game play," Int. J. Auton. Adaptive Commun. Syst., vol. 6, no. 1, pp. 45-62, 2013.
[12] W. L. Zheng and B. L. Lu, "Investigating critical frequency bands and channels for EEG-based emotion recognition with deep neural networks," IEEE Trans. Auton. Ment. Develop., vol. 7, no. 3, pp. 162-175, Sep. 2015.
[13] W. L. Zheng, J. Y. Zhu, Y. Peng, and B. L. Lu, "EEG-based emotion classification using deep belief networks," in Proc. IEEE Int. Conf. Multimedia Expo, 2014, pp. 1-6.
[14] G. W. Xiao, M. Shi, M. W. Ye, B. W. Xu, Z. D. Chen, and Q. S. Ren, "4D attention-based neural network for EEG emotion recognition," Cogn. Neurodyn., vol. 16, pp. 805-818, 2022.
[15] H. Zeng et al., "EEG emotion classification using an improved SincNet-based deep learning model," Brain Sci., vol. 9, no. 11, 2019, Art. no. 326.
[16] Y. Li, J. Y. Liu, Z. Y. Tang, and B. Y. Lei, "Deep spatial-temporal feature fusion from adaptive dynamic functional connectivity for MCI identification," IEEE Trans. Med. Imag., vol. 39, no. 9, pp. 2818-2830, Sep. 2020.
[17] T. F. Song, W. M. Zheng, P. Song, and Z. Cui, "EEG emotion recognition using dynamical graph convolutional neural networks," IEEE Trans. Affect. Comput., vol. 11, no. 3, pp. 532-541, Third Quarter 2020.
[18] F. Y. Shen, G. J. Dai, G. Lin, J. H. Zhang, W. Z. Kong, and H. Zeng, "EEG-based emotion recognition using 4D convolutional recurrent neural network," Cogn. Neurodyn., vol. 14, no. 6, pp. 815-828, 2020.
[19] S. Koelstra et al., "DEAP: A database for emotion analysis using physiological signals," IEEE Trans. Affect. Comput., vol. 3, no. 1, pp. 18-31, Jan.-Mar. 2011.
[20] L. N. Wang et al., "Automatic epileptic seizure detection in EEG signals using multi-domain feature extraction and nonlinear analysis," Entropy, vol. 19, no. 6, 2017, Art. no. 222.
[21] D. A. Spielman, "Spectral graph theory and its applications," in Proc. 48th Annu. IEEE Symp. Foundations Comput. Sci., 2007, pp. 29-38.
[22] Y. Li, Y. Liu, W. G. Cui, Y. Z. Guo, H. Huang, and Z. Y. Hu, "Epileptic seizure detection in EEG signals using a unified temporal-spectral squeeze-and-excitation network," IEEE Trans. Neural Syst. Rehabil. Eng., vol. 28, no. 4, pp. 782-794, Apr. 2020.
[23] A. Vaswani et al., "Attention is all you need," in Proc. 31st Int. Conf. Neural Inf. Process. Syst., 2017, pp. 5998-6008.
[24] Z. Liu et al., "Swin transformer: Hierarchical vision transformer using shifted windows," in Proc. IEEE/CVF Int. Conf. Comput. Vis., 2021, pp. 10012-10022.
[25] W. L. Zheng, W. Liu, Y. F. Lu, B. L. Lu, and A. Cichocki, "EmotionMeter: A multimodal framework for recognizing human emotions," IEEE Trans. Cybern., vol. 49, no. 3, pp. 1110-1122, Mar. 2019.
[26] G. H. Zhang, M. J. Yu, Y. J. Liu, G. Z. Zhao, D. Zhang, and W. M. Zheng, "SparseDGCNN: Recognizing emotion from multichannel EEG signals," IEEE Trans. Affect. Comput., to be published, doi: 10.1109/TAFFC.2021.3051332.
[27] Y. Li, Y. Liu, Y. Z. Guo, X. F. Liao, B. Hu, and T. Yu, "Spatio-temporal-spectral hierarchical graph convolutional network with semisupervised active learning for patient-specific seizure prediction," IEEE Trans. Cybern., to be published, doi: 10.1109/TCYB.2021.3071860.
[28] K. Li et al., "Multi-label spacecraft electrical signal classification method based on DBN and random forest," PLoS One, vol. 12, no. 5, 2017, Art. no. e0176614.
[29] Y. Li, L. H. Guo, Y. Liu, J. Y. Liu, and F. G. Meng, "A temporal-spectral-based squeeze-and-excitation feature fusion network for motor imagery EEG decoding," IEEE Trans. Neural Syst. Rehabil. Eng., vol. 29, pp. 1534-1545, 2021.
[30] P. X. Zhong, D. Wang, and C. Y. Miao, "EEG-based emotion recognition using regularized graph neural networks," IEEE Trans. Affect. Comput., to be published, doi: 10.1109/TAFFC.2020.2994159.
[31] J. Y. Liu, Y. X. Zhao, H. Wu, and D. M. Jiang, "Positional-spectral-temporal attention in 3D convolutional neural networks for EEG emotion recognition," 2021, arXiv:2110.09955.
[32] J. X. Ma, H. Tang, W. L. Zheng, and B. L. Lu, "Emotion recognition using multimodal residual LSTM network," in Proc. 27th ACM Int. Conf. Multimedia, 2019, pp. 176-183.
[33] W. Tao et al., "EEG-based emotion recognition via channel-wise attention and self attention," IEEE Trans. Affect. Comput., to be published, doi: 10.1007/S42486-021-00078-Y.
[34] Y. Li, B. X. Fu, F. Li, G. M. Shi, and W. M. Zheng, "A novel transferability attention neural network model for EEG emotion recognition," Neurocomputing, vol. 447, pp. 92-101, 2021.
[35] Y. Li et al., "A novel bi-hemispheric discrepancy model for EEG emotion recognition," IEEE Trans. Cogn. Develop. Syst., vol. 13, no. 2, pp. 354-367, Jun. 2021.
[36] Y. Li, M. Y. Lei, W. G. Cui, Y. Z. Guo, and H. L. Wei, "A parametric time-frequency conditional Granger causality method using ultra-regularized orthogonal least squares and multiwavelets for dynamic connectivity analysis in EEGs," IEEE Trans. Biomed. Eng., vol. 66, no. 12, pp. 3509-3525, Dec. 2019.
[37] Y. Li, H. Yang, B. Y. Lei, J. Y. Liu, and C. Y. Wee, "Novel effective connectivity inference using ultra-group constrained orthogonal forward regression and elastic multilayer perceptron classifier for MCI identification," IEEE Trans. Med. Imag., vol. 38, no. 5, pp. 1227-1239, May 2019.
[38] S. K. Hadjidimitriou and L. J. Hadjileontiadis, "Toward an EEG-based recognition of music liking using time-frequency analysis," IEEE Trans. Biomed. Eng., vol. 59, no. 12, pp. 3498-3510, Dec. 2012.
[39] R. Jenke, A. Peer, and M. Buss, "Feature extraction and selection for emotion recognition from EEG," IEEE Trans. Affect. Comput., vol. 5, no. 3, pp. 327-339, Jul.-Sep. 2014.
[40] M. Balconi and C. Lucchiari, "Consciousness and arousal effects on emotional face processing as revealed by brain oscillations. A gamma band analysis," Int. J. Psychophysiol., vol. 67, no. 1, pp. 41-46, 2008.

Mingyi Sun received the bachelor's degree in engineering from Beihang University, Beijing, China, in 2021, where he is currently working toward the master's degree with the Department of Automation Science and Electrical Engineering. His current research interests include signal processing, machine learning, and brain-computer interface.

Shuyue Yu received the BS degree in measurement, control technology, and instrument and the MS degree in control science and engineering from the Beijing University of Posts and Telecommunications, Beijing, China, in 2016 and 2019, respectively. She is currently an engineer with Beijing Aerospace Measurement and Control Technology Company, Ltd. Her current research interests include robotics and BCI.

Hongbin Han received the undergraduate and graduate degrees in clinical medicine from Dalian Medical University, Dalian, China, in 1996, and the doctor of radiology degree from Peking University Health Center, Beijing, China, in 1998. He completed the radiology residency with the Peking University Health Center, joined the faculty of the Department of Radiology, Peking University Third Hospital in 1998, and is currently a professor with the Radiology Department. He established the Beijing MRI Technology Research Laboratory and has served as its Director since 2010. His research interests include the development and application of advanced MRI techniques for the diagnosis and therapy of human brain diseases.

Bin Hu (Member, IEEE) received the PhD degree in computer science from the Institute of Computing Technology, Chinese Academy of Sciences, China, in 1998. Since 2008, he has been a professor with the School of Information Science and Engineering, Lanzhou University, China. He also held a guest professorship at ETH Zurich, Switzerland, until 2011. His research interests include pervasive computing, computational psychophysiology, and data modeling.

Yang Li received the PhD degree in automatic control and systems engineering from the University of Sheffield, Sheffield, U.K., in 2011. He did post-doctoral research with the Department of Computer and Biomedical Engineering, University of North Carolina at Chapel Hill, Chapel Hill, NC, for one year. In 2013, he joined the Department of Automation Sciences and Electrical Engineering, Beihang University, Beijing, China, as a professor. His current research interests include system identification and modeling for complex nonlinear processes (NARMAX methodology and applications), nonstationary signal processing and sparse representation, medical image analysis, and brain-computer interface.