Emotion Recognition Based On EEG Features in Movie Clips With Channel Selection
DOI 10.1007/s40708-017-0069-3
Received: 12 April 2017 / Accepted: 7 July 2017 / Published online: 15 July 2017
© The Author(s) 2017. This article is an open access publication
M. S. Özerdem
Electrical and Electronics Engineering, Dicle University, 21000 Diyarbakır, Turkey

H. Polat (✉)
Electrical and Electronics Engineering, Mus Alparslan University, 49000 Muş, Turkey

Abstract Emotion plays an important role in human interaction. People can express their emotions through words, voice intonation, facial expression, and body language. However, brain–computer interface (BCI) systems have not reached the desired level to interpret emotions. Automatic emotion recognition based on BCI systems has been a topic of great research in the last few decades. Electroencephalogram (EEG) signals are one of the most crucial resources for these systems. The main advantage of using EEG signals is that they reflect real emotion and can easily be processed by computer systems. In this study, EEG signals related to positive and negative emotions have been classified with preprocessing of channel selection. The Self-Assessment Manikin (SAM) was used to determine emotional states. We have employed discrete wavelet transform and machine learning techniques such as multilayer perceptron neural network (MLPNN) and k-nearest neighborhood (kNN) algorithm to classify EEG signals. The classifier algorithms were initially used for channel selection. EEG channels for each participant were evaluated separately, and five EEG channels that offered the best classification performance were determined. Thus, final feature vectors were obtained by combining the features of EEG segments belonging to these channels. The final feature vectors related to positive and negative emotions were classified separately using MLPNN and kNN algorithms. The classification performances obtained with both algorithms were computed and compared. The average overall accuracies were 77.14 and 72.92% using MLPNN and kNN, respectively.

Keywords Emotion · EEG · Classification · Wavelet transform · Channel selection

1 Introduction

Emotion is part of human consciousness and plays a critical role in rational decision-making, perception, human interaction, and human intelligence. Emotions can be reflected through non-physiological signals such as words, voice intonation, facial expression, and body language, and many studies on emotion recognition based on these non-physiological signals have been reported in recent decades [1, 2].

Signals obtained by recording the voltage changes that occur on the scalp surface as a result of the electrical activity of active neurons in the brain are called EEG [3]. From the clinical point of view, EEG is the most widely used brain-activity-measuring technique for emotion recognition. Furthermore, EEG-based BCI systems provide a new communication channel by detecting the variation in the underlying pattern of brain activities while performing different tasks [4]. However, BCI systems have not reached the desired level to interpret people’s emotions.

The interpretation of people’s different emotional states via BCI systems and the automatic identification of emotions may enable robotic systems to react emotionally to humans in the future. Such systems will be especially useful in fields such as medicine, entertainment, and education, among many other areas [5]. BCI systems need various resources
that can be taken from humans and processed to understand emotions. The EEG signal is one of the most important resources for achieving this target. Emotion recognition combines different areas of knowledge such as psychology, neurology, and engineering. SAM questionnaires are usually used to classify the affective responses of subjects in the design of emotion recognition systems [6]. However, affective responses are not easily classified into distinct emotion responses due to the overlapping of emotions.

Emotions can be discriminated with either discrete classification spaces or dimensional spaces. A discrete space allows the assessment of a few basic emotions such as happiness and sadness and is more suitable for unimodal systems [7]. A dimensional space (valence–arousal plane) allows a continuous representation of emotions on two axes. The valence dimension ranges from unpleasant to pleasant, and the arousal dimension ranges from calm to excited [8]. A higher-dimensional space gives a better understanding of different states, but the classification accuracy in applications can be lower, as reported in Ref. [7]. Thus, in this study, EEG signals related to positive and negative emotions have been classified with channel selection for the valence dimension only.

In the literature, there are studies in which various signals obtained or measured from people are used in order to determine emotions automatically. These studies can be gathered under three approaches [9]. The first approach includes studies intended to predict emotions using facial expressions and/or speech signals [10]. However, the main disadvantage of this approach is that reliably detecting spontaneous facial expressions that do not reflect real emotions is quite difficult. Speech and facial expressions also vary across cultures and nations. The second main approach is based on emotion prediction by tracking the changes in the autonomic nervous system [11, 12]. Various signals such as electrocardiogram (ECG), skin conductance response (SCR), breath rate, and pulse are recorded, and emotion recognition is applied by processing them. The third approach includes studies intended for EEG-based emotion recognition.

In order to recognize emotions, a large variety of studies have been conducted specifically within the scope of EEG signals. These studies can be gathered under three main areas: health, games, and advertisement. Studies in health are generally conducted by physicians to help in disease diagnosis [13–15]. The game sector involves studies in which people use EEG recordings instead of joysticks and keyboards [16, 17]. As in this study, the advertisement sector generally involves studies that aim at recognizing emotions from EEG signals.

There are several studies in which different algorithms related to EEG-based classification of emotional states were used. Some of the main studies related to emotions are given below.

• Murugappan et al. [18] classified five emotions based on EEG signals. They used EEG signals that were recorded from 64, 24, and 8 EEG channels, respectively. They achieved maximum classification accuracies of 83.26 and 75.21% using kNN and linear discriminant analysis (LDA) algorithms, respectively. The researchers employed the DWT method for decomposing the EEG signal into alpha, beta, and gamma bands. These frequency bands were analyzed for feature extraction.
• Chanel et al. [19] classified two emotions. They employed SAM to determine participant emotions. They used Naive Bayes (NB) and Fisher discriminant analysis (FDA) as the classification algorithms. Classification accuracies of 72 and 70% were obtained for NB and FDA, respectively.
• Zhang et al. [20] applied the PCA method for feature extraction. The features were extracted from two channels (F3 and F4). The researchers obtained a classification accuracy of 73%.
• Bhardwaj et al. [21] recognized seven emotions using support vector machine (SVM) and LDA. Three EEG channels (Fp1, P3 and O1) were used in their experiment. The researchers investigated sub-bands (theta, alpha and beta) of the EEG signal. The overall average accuracies obtained were 74.13% using SVM and 66.50% using LDA.
• Lee et al. [22] classified positive and negative emotions. A classification accuracy of 78.45% was obtained by using an adaptive neuro-fuzzy inference system.

In reference to the literature, it is seen that a limited number of EEG channels (e.g., two or three) have been used to detect different emotional states with different classification algorithms such as SVM, MLPNN, and kNN.

The aim of this study was to classify EEG signals related to different emotions based on audiovisual stimuli with the preprocessing of channel selection. SAM was used to determine participants’ emotional states. Participants rated each audiovisual stimulus in terms of the level of valence, arousal, like/dislike, dominance and familiarity. EEG signals related to positive and negative emotions have been classified according to participants’ valence ratings. The DWT method was used for feature extraction from EEG signals. Wavelet coefficients of EEG signals were taken as feature vectors, and statistical features were used to reduce the dimension of those feature vectors. EEG signals related to the positive and negative emotion groups have been classified by MLPNN and kNN algorithms. After the preprocessing and feature extraction stages, classifiers were used for channel selection (Fig. 1). EEG channels that offer
the best classification performance were determined. Thus, final feature vectors were obtained by combining the features of those EEG channels. The final feature vectors were classified and their performances were compared. The steps followed in the classification process are depicted in Fig. 1.

The remainder of this paper is organized as follows. Sections 2 and 3 describe the materials and methods employed in the proposed EEG-based emotion recognition system. Section 4 presents the experimental results. Section 5 presents the results and discussion. Finally, Sect. 6 provides the conclusion of this paper.

the authors selected 120 music clips. Half of these stimuli were selected semi-automatically and the other half were selected manually [24]. From the initial collection of 120 music clips, the final 40 test music clips were chosen for presentation in the paradigm. These music clips were selected to elicit emotion prominently. A 1-min segment containing the maximum emotional content was extracted from each music clip, and these segments were presented in the final experiment.

2.3 Task
3 Methods
Fig. 4 Sample EEG signals related to positive and negative emotions. The sample EEG signals were measured from channel Fp1. The time axis is in seconds and the amplitude axis is in µV
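The feature extraction used in this study (a DWT followed by a small set of statistics per EEG segment) can be illustrated with a minimal sketch. The wavelet family, decomposition level, and exact statistics are not restated in this part of the text, so the Haar wavelet, level 4, and the five statistics below (maximum, minimum, mean, standard deviation, energy) are assumptions chosen only for illustration, as are all function names and the synthetic test signal.

```python
import math

def haar_dwt_step(signal):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    n = len(signal) - (len(signal) % 2)  # ignore a trailing unpaired sample
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, n, 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, n, 2)]
    return approx, detail

def wavelet_band(signal, level):
    """Detail coefficients at the requested decomposition level (1 = finest)."""
    approx, detail = list(signal), []
    for _ in range(level):
        approx, detail = haar_dwt_step(approx)
    return detail

def five_features(coeffs):
    """Reduce a coefficient vector to five statistics (an assumed feature set)."""
    mean = sum(coeffs) / len(coeffs)
    std = math.sqrt(sum((c - mean) ** 2 for c in coeffs) / len(coeffs))
    energy = sum(c * c for c in coeffs)
    return [max(coeffs), min(coeffs), mean, std, energy]

# Synthetic stand-in for one EEG segment: a 6 Hz sine sampled at 128 Hz.
segment = [math.sin(2 * math.pi * 6 * t / 128) for t in range(1024)]
features = five_features(wavelet_band(segment, level=4))
print(features)
```

Whatever wavelet and statistics are actually used, the point of the reduction is the same: each segment, regardless of its length, ends up as a fixed five-element vector that a classifier can take as input.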
At the end of the DWT and statistical procedures used for feature extraction, five-dimensional feature vectors belonging to the EEG segments related to each emotional state were obtained.

training algorithm technique adjusts the weights to obtain a network output that is close to the desired output [32].

3.3.2 k-nearest neighborhood
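As a rough illustration of the k-nearest neighborhood rule, a test vector is assigned the majority label among its k closest training vectors. The Euclidean distance, the value k = 3, and the toy two-dimensional vectors below are assumptions for this sketch and are not the settings of the study.

```python
import math
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training vectors closest to the query."""
    by_distance = sorted(
        (math.dist(x, query), label) for x, label in zip(train_x, train_y)
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Toy feature vectors (hypothetical values) with their emotion labels.
train_x = [[0.0, 0.1], [0.2, 0.0], [1.0, 1.1], [0.9, 1.0]]
train_y = ["negative", "negative", "positive", "positive"]

prediction = knn_predict(train_x, train_y, [0.95, 1.0], k=3)
print(prediction)
```

An odd k is a common choice for two-class problems because it avoids tied votes.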
(FP), the number of false decisions related to positive emotion by the automated system; true negative (TN), the number of true decisions related to negative emotion; and false negative (FN), the number of false decisions related to negative emotion.

4 Experimental results

4.1 Channel selection

In this study, MLPNN was first used for channel selection. EEG recordings measured from 32 channels for every participant were evaluated separately, and the five EEG channels having the highest performance were dynamically determined. When the results for all participants were evaluated, the same channels provided the highest performances. The classification of EEG signals was achieved by a dynamic model in which the channels were selected for each participant. The dynamic selection process is given below.

For every participant, feature vectors related to EEG segments consisting of positive and negative emotions were classified by an MLPNN. The feature vectors obtained by DWT with statistical calculation were used as input sets for the MLPNN. The number of neurons in the input layer of the MLPNN was five due to the size of the feature vector. MLPNN output vectors were defined as [1 0] for positive emotion and [0 1] for negative emotion. Thus, the number of neurons in the output layer of the network was two. Hence, the structure used in this study was (5 × n × 2), which is shown in Fig. 6, where n represents the number of neurons in the hidden layer. The number of neurons used in the hidden layer was determined separately for each participant. Each participant had 40 EEG segments (training and testing patterns) in total. Thirty EEG segments were randomly selected for the network training stage, and the remaining 10 EEG segments were selected for the testing stage. In order to increase the reliability of the classification results, the training and testing data were randomly changed four times.

A single hidden layer with a 5 × n × 2 architecture was used in the MLPNN for determining the five EEG channels having the best classification performances. Accuracy (Eq. 6) was taken as the model success criterion for determining the channels. In the training stage of the MLPNN, the learning coefficient was 0.7 and the momentum coefficient was 0.9.

In this study, EEG signals recorded from 32 channels were examined, and the five EEG channels having the highest performance in emotion recognition were dynamically determined. It was generally observed that the same channels, with a few exceptions, provided the highest performances. The main intention of determining the EEG channels is the simultaneous processing of EEG signals recorded from different regions of the brain. EEG signals recorded from different regions provide a more comprehensive and dynamic solution to the description of emotional state. At the end of the process, the classification results revealed that the channels having high performances were P3, FC2, AF3, O1 and Fp1 (Fig. 7). From this point of the study, those five channels were used instead of 32.

4.2 Classification of emotions by using MLPNN

Final feature vectors were obtained by combining the features of EEG segments belonging to the selected channels (P3, FC2, AF3, O1 and Fp1). Thus, new feature vectors composed of 25 samples for every EEG segment
related to positive and negative emotions were obtained. The formation procedure of the final feature vectors with the selected five EEG channels is shown in Fig. 8. All procedures were applied separately for each participant.

The same procedure that was applied for channel selection was employed for the classification of emotions as well. MLPNN output vectors were defined as [1 0] for positive emotion and [0 1] for negative emotion. Thirty EEG segments were used for training the network and 10 EEG segments were used for testing. A single hidden layer with a 25 × n × 2 architecture was used to classify EEG signals related to emotional states. The number of neurons used in the hidden layer was determined separately for each participant. In the training stage, the learning and momentum coefficients were 0.7 and 0.9, respectively.

The classification process was applied for each participant, and the results are shown in Table 2. According to the SAM valence ratings, two of the 22 participants lacked a sound assessment, and for that reason the classification process was not applied to them.

As shown in Table 2, the percentages of accuracy, specificity, and sensitivity are in the ranges of [60 90], [62.5 100] and [58.3 89.4], respectively. To estimate the overall performance of the MLPNN model, the statistical measures (accuracy, specificity, sensitivity) were averaged.

Table 2 Classification of emotions for each participant using MLPNN
Participant  Accuracy (%)  Specificity (%)  Sensitivity (%)
1            77.5          78.9             76.2
2            72.5          76.4             69.5
3            80            89.4             74.1
4            7             64.2             83.3
5            80            100              71.4
6            75            81.2             70.8
7            90            100              83.3
8            80            80               80
9            60            62.5             58.3
10           65            63.6             66.6
11           80            74.1             89.4
12           85            93.7             79.1
13           75            91.6             67.8
14           67.5          62.9             76.9
15           80            87.5             75
16           72.5          76.4             69.5
17           80            80               80
18           77.5          86.6             72
19           80            100              71.4
20           77.5          78.9             76.1
Average      77.14         92               76.75
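The three measures reported in the table can be computed from the four decision counts (TP, TN, FP, FN). The sketch below assumes the standard definitions, with positive emotion as the positive class: accuracy = (TP + TN) / total, sensitivity = TP / (TP + FN), and specificity = TN / (TN + FP). The paper's own equations are not reproduced in this part of the text, so these formulas, the function name, and the example counts are assumptions for illustration.

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, sensitivity, and specificity in percent (standard definitions)."""
    total = tp + tn + fp + fn
    return {
        "accuracy": 100.0 * (tp + tn) / total,
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
    }

# Hypothetical counts for 40 test decisions (10 test segments x 4 random splits).
metrics = classification_metrics(tp=16, tn=16, fp=4, fn=4)
print(metrics)
```

With 40 test decisions per participant, accuracy can only move in steps of 2.5 percentage points, which is consistent with values such as 72.5, 77.5, and 80 in the table.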
Table 3 Classification of emotions for each participant using kNN
Participant  Accuracy (%)  Specificity (%)  Sensitivity (%)
1            77.5          78.9             76.2
2            75            75               75
3            75            81.2             70.1
4            55            53.5             72.7
5            80            87.5             75
6            70            81.2             64.7
7            82.5          100              74
8            77.5          82.3             73.9
9            65            71.4             61.5
10           65            71.4             61.5
11           67.5          73.3             64
12           85            100              74
13           70            91.6             63.1
14           62.5          60.8             64.7
15           75            85.7             69.2
16           72.5          53.5             58.33
17           75            90.9             65.5
18           82.5          93.3             76
19           77.5          86.6             72
20           85            85               85
Average      72.92         90               74.37

methods with channel selection. The discussion of the findings obtained from this study is presented below.

(a) Wavelet coefficients related to different emotions were obtained using the DWT method. These wavelet coefficients were evaluated as feature vectors. The size of the feature vectors was reduced by using five statistical parameters to get rid of the computing load.
(b) Thirty-two channels for every participant were evaluated separately, and five channels were determined. An MLPNN (5 × 10 × 2) structure was used for channel selection. The results revealed that similar channels (P3, FC2, AF3, O1 and Fp1) had the highest performance for every participant. The channels having the highest performance were selected for the classification of emotions. From this point of the study, five channels were used instead of 32. The results revealed that the brain regions represented by these channels are related to emotions. The channels selected in this study were also compatible with the channels selected in other papers [21].
(c) Combined feature vectors of the five selected channels were classified by using the MLPNN and kNN methods. In reference to the literature, MLPNN is one of the most popular tools for EEG analysis. In this study, MLPNN was employed for determining the emotional state from the EEG signals. The kNN algorithm was also used as a classifier to increase the reliability of the results obtained by MLPNN. kNN is one of the most fundamental and simple classification methods and is frequently used in EEG applications. The results produced by the two algorithms were close to each other, and this agreement supports the reliability of the process.
(d) As shown in Table 2, the percentages of accuracy, specificity, and sensitivity of MLPNN are in the ranges of [60 90], [62.5 100], and [58.3 89.4], respectively. These values indicate that the proposed MLPNN model is successful, and the test results also show that the generalization ability of MLPNN is good. On the other hand, the percentages of accuracy, specificity, and sensitivity of kNN are in the ranges of [55 85], [53.5 100], and [63.1 85], respectively (Table 3). It was observed that both classifiers showed parallel performance for each participant. The best classification accuracy of MLPNN was 90% (specificity: 100% and sensitivity: 83.3%) for participant 7. The best classification accuracy of kNN was 85% for participants 12 and 20.
(e) To estimate the overall performance of the MLPNN and kNN classifiers, the statistical measures (accuracy, specificity, sensitivity) were averaged. The comparison of the averaged values for the classification of emotions is shown in Fig. 10. As shown in the figure, the performance of MLPNN was higher than that of kNN. However, both methods can be accepted as successful.
(f) It was concluded that the MLPNN and kNN used in this study give good accuracy results for the classification of emotions.

The channel selection has opened the door to improving the performance of automatic emotion recognition. Recently, Zhang et al. [20] used two EEG channels (F3 and F4) for feature extraction. Bhardwaj et al. [21] used three EEG channels (Fp1, P3 and O1) in order to detect emotion. Murugappan et al. [18] extracted features from 64, 24 and 8 EEG channels, respectively. When these studies are reviewed, it can be observed that only the main EEG channels and brain regions were considered for the detection of emotions. However, in this study, all EEG channels were evaluated separately, and the five EEG channels that offered the best classification performance were determined. Final feature vectors were obtained by combining the features of those EEG channels, and the
Fig. 9 Comparison of classification performances for each participant in terms of (a) accuracy, (b) specificity, and (c) sensitivity (x-axis: participants, y-axis: performance)
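The channel selection and feature combination discussed above might be sketched as follows: rank the channels by the accuracy a classifier reaches on each one alone, keep the best five, and concatenate their five-element feature vectors into one 25-element vector. The per-channel accuracies and feature values below are made-up placeholders (only seven of the 32 channels are listed), and the function names are hypothetical.

```python
def select_top_channels(channel_accuracy, n_select=5):
    """Names of the n_select channels with the highest single-channel accuracy."""
    ranked = sorted(channel_accuracy, key=channel_accuracy.get, reverse=True)
    return ranked[:n_select]

def combine_features(features_by_channel, selected):
    """Concatenate the per-channel feature vectors in the order of selection."""
    combined = []
    for channel in selected:
        combined.extend(features_by_channel[channel])
    return combined

# Placeholder single-channel accuracies (%) for one participant.
channel_accuracy = {
    "Fp1": 72.5, "AF3": 75.0, "F3": 60.0, "FC2": 77.5,
    "P3": 80.0, "O1": 70.0, "Cz": 55.0,
}
# Placeholder five-element feature vector for each channel of one EEG segment.
features_by_channel = {ch: [0.0] * 5 for ch in channel_accuracy}

selected = select_top_channels(channel_accuracy)
final_vector = combine_features(features_by_channel, selected)
print(selected, len(final_vector))
```

Because the ranking is repeated for every participant, the selected set can differ between participants even when the same few channels tend to come out on top.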
6 Conclusion
neering, Dicle University, as an Assist. Prof. in 2006. His research