Real-Time Convolutional Neural Networks For Emotion and Gender Classification
I. INTRODUCTION
The success of service robotics decisively depends on a smooth robot-to-user interaction. Thus, a robot should be able to extract information just from the face of its user, e.g. identify the emotional state or deduce gender. Interpreting correctly any of these elements using machine learning (ML) techniques has proven to be complicated due to the high variability of the samples within each task [4]. This leads to models with millions of parameters trained with thousands of samples [3]. Furthermore, the human accuracy for classifying an image of a face into one of 7 different emotions is 65% ± 5% [4]. One can observe the difficulty of this task by trying to manually classify the FER-2013 dataset images in Figure 1 into the following classes {“angry”, “disgust”, “fear”, “happy”, “sad”, “surprise”, “neutral”}.

Fig. 2: Samples of the IMDB dataset [9].
In spite of these difficulties, robot platforms oriented to attend and solve household tasks require facial expression systems that are robust and computationally efficient. Moreover, the state-of-the-art methods in image-related tasks such as image classification [1] and object detection are all based on Convolutional Neural Networks (CNNs). These tasks require CNN architectures with millions of parameters; therefore, their deployment in robot platforms and real-time systems becomes unfeasible. In this paper we propose and implement a general CNN building framework for designing real-time CNNs. The implementations have been validated in a real-time facial expression system that provides face detection and gender classification, and that achieves human-level performance when classifying emotions. This system has been deployed in a Care-O-bot 3 robot, and has been extended for general robot platforms and the RoboCup@Home competition challenges.
Furthermore, CNNs are used as black-boxes and often their
learned features remain hidden, making it complicated to
establish a balance between their classification accuracy and
unnecessary parameters. Therefore, we implemented a real-
time visualization of the guided back-propagation method
proposed by Springenberg [11] in order to validate the features
learned by the CNN.
II. RELATED WORK
Commonly used CNNs for feature extraction include a
set of fully connected layers at the end. Fully connected
layers tend to contain most of the parameters in a CNN.
Specifically, VGG16 [10] contains approximately 90% of all
its parameters in its last fully connected layers. Recent
architectures such as Inception V3 [12] reduced the number
of parameters in their last layers by including a Global
Average Pooling operation. Global Average Pooling reduces
each feature map into a scalar value by taking the average over
all elements in the feature map. The average operation forces
the network to extract global features from the input image.
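As a minimal illustration of this operation (a NumPy sketch with an illustrative height × width × channels layout, independent of any particular framework):

import numpy as np

# Illustrative last-layer output: height x width x channels, e.g. 6 x 6 x 7,
# with one feature map per class as in the models of Section III.
feature_maps = np.random.rand(6, 6, 7)

# Global Average Pooling: average over the spatial dimensions only,
# leaving a single scalar per feature map (shape (7,)).
pooled = feature_maps.mean(axis=(0, 1))

# A softmax over these scalars then yields the class probabilities.
probabilities = np.exp(pooled) / np.sum(np.exp(pooled))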
Modern CNN architectures such as Xception [1] leverage
the combination of two of the most successful experimental
assumptions in CNNs: the use of residual modules [6] and
depth-wise separable convolutions [2]. Depth-wise separable
convolutions further reduce the number of parameters by
separating the processes of feature extraction and combination
within a convolutional layer.
Furthermore, the state-of-the-art model for the FER-2013 dataset is based on a CNN trained with a squared hinge loss [13]. This model achieved an accuracy of 71% [4] using approximately 5 million parameters. In this architecture 98% of all parameters are located in the last fully connected layers. The second-best methods presented in [4] achieved an accuracy of 66% using an ensemble of CNNs.
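For reference, the squared hinge loss mentioned above penalizes the squared margin violation; a minimal binary sketch (the exact multi-class formulation used in [13] may differ):

import numpy as np

def squared_hinge(y_true, y_pred):
    # y_true contains labels in {-1, +1}; y_pred contains raw scores.
    return np.mean(np.maximum(0.0, 1.0 - y_true * y_pred) ** 2)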
III. MODEL

We propose two models which we evaluated in accordance with their test accuracy and number of parameters. Both models were designed with the idea of creating the best accuracy over number of parameters ratio. Reducing the number of parameters helps us overcome two important problems. First, the use of small CNNs alleviates slow performance in hardware-constrained systems such as robot platforms. And second, the reduction of parameters provides a better generalization under an Occam's razor framework. Our first model relies on the idea of completely eliminating the fully connected layers. The second architecture combines the deletion of the fully connected layers with the inclusion of depth-wise separable convolutions and residual modules. Both architectures were trained with the ADAM optimizer [8].

Following the previous architecture schemas, our initial architecture used Global Average Pooling to completely remove any fully connected layers. This was achieved by having in the last convolutional layer the same number of feature maps as number of classes, and applying a softmax activation function to each reduced feature map. Our initial proposed architecture is a standard fully-convolutional neural network composed of 9 convolution layers, ReLUs [5], Batch Normalization [7] and Global Average Pooling. This model contains approximately 600,000 parameters. It was trained on the IMDB gender dataset, which contains 460,723 RGB images where each image belongs to the class “woman” or “man”, and it achieved an accuracy of 96% in this dataset. We also validated this model on the FER-2013 dataset. This dataset contains 35,887 grayscale images where each image belongs to one of the following classes {“angry”, “disgust”, “fear”, “happy”, “sad”, “surprise”, “neutral”}. Our initial model achieved an accuracy of 66% in this dataset. We will refer to this model as “sequential fully-CNN”.
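A rough sketch of such a sequential fully-CNN, assuming the Keras API (the filter counts below are illustrative and not necessarily the exact published configuration):

from tensorflow.keras import layers, Model

def sequential_fully_cnn(input_shape=(48, 48, 1), num_classes=7):
    inputs = layers.Input(shape=input_shape)
    x = inputs
    # Eight convolution -> batch normalization -> ReLU blocks.
    for filters in (8, 8, 16, 16, 32, 32, 64, 64):
        x = layers.Conv2D(filters, (3, 3), padding='same')(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation('relu')(x)
    # The ninth convolution outputs one feature map per class.
    x = layers.Conv2D(num_classes, (3, 3), padding='same')(x)
    # Global Average Pooling plus softmax replace any fully connected layers.
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Activation('softmax')(x)
    return Model(inputs, outputs)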
Our second model is inspired by the Xception [1] architecture. This architecture combines the use of residual modules [6] and depth-wise separable convolutions [2]. Residual modules modify the desired mapping between two subsequent layers, so that the learned features become the difference of the original feature map and the desired features. Consequently, the desired features H(x) are modified in order to solve an easier learning problem F(x) such that:

H(x) = F(x) + x    (1)
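Equation 1 translates directly into a residual block; a minimal sketch assuming the Keras API (the 1 × 1 convolution on the shortcut is only one common way to match channel dimensions):

from tensorflow.keras import layers

def residual_module(x, filters):
    # Shortcut branch: a 1 x 1 convolution matches the channel dimension.
    shortcut = layers.Conv2D(filters, (1, 1), padding='same')(x)
    # Residual branch learns F(x) only.
    fx = layers.Conv2D(filters, (3, 3), padding='same')(x)
    fx = layers.BatchNormalization()(fx)
    fx = layers.Activation('relu')(fx)
    fx = layers.Conv2D(filters, (3, 3), padding='same')(fx)
    fx = layers.BatchNormalization()(fx)
    # Output approximates H(x) = F(x) + x, as in Equation 1.
    return layers.Activation('relu')(layers.Add()([fx, shortcut]))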
Since our initial proposed architecture deleted the last fully connected layer, we reduced further the amount of parameters by eliminating them now from the convolutional layers. This was done through the use of depth-wise separable convolutions. Depth-wise separable convolutions are composed of two different layers: depth-wise convolutions and point-wise convolutions. The main purpose of these layers is to separate the spatial cross-correlations from the channel cross-correlations [1]. They do this by first applying a D × D filter on each of the M input channels and then applying N 1 × 1 × M convolution filters to combine the M input channels into N output channels. Applying 1 × 1 × M convolutions combines each value in the feature map without considering their spatial relation within the channel. Depth-wise separable convolutions reduce the computation with respect to standard convolutions by a factor of 1/N + 1/D² [2]. A visualization of the difference between a standard convolution layer and a depth-wise separable convolution can be observed in Figure 4.

Fig. 4: [2] Difference between (a) standard convolutions and (b) depth-wise separable convolutions.
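For instance, with D = 3, M = 64 input channels and N = 128 output channels, a standard convolution uses D · D · M · N = 73,728 weights while the depth-wise separable version uses D · D · M + M · N = 8,768, roughly the 1/N + 1/D² ≈ 0.12 fraction stated above. A minimal sketch assuming the Keras API:

from tensorflow.keras import layers, Model, Input

inputs = Input(shape=(48, 48, 64))  # illustrative feature map with M = 64 channels

# Standard convolution: one D x D x M filter per output channel (N = 128).
standard = layers.Conv2D(128, (3, 3), padding='same', use_bias=False)(inputs)

# Depth-wise separable convolution: a D x D spatial filter per input channel,
# followed by N point-wise (1 x 1 x M) filters that recombine the channels.
separable = layers.SeparableConv2D(128, (3, 3), padding='same', use_bias=False)(inputs)

print(Model(inputs, standard).count_params())   # 73728
print(Model(inputs, separable).count_params())  # 8768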
Our final architecture is a fully-convolutional neural network that contains 4 residual depth-wise separable convolutions where each convolution is followed by a batch normalization operation and a ReLU activation function. The last layer applies a global average pooling and a soft-max activation function to produce a prediction. This architecture has approximately 60,000 parameters, which corresponds to a reduction of 10× when compared to our initial naive implementation, and 80× when compared to the original CNN. Figure 3 displays our complete final architecture, which we refer to as mini-Xception.

Fig. 3: Our proposed model for real-time classification.

This architecture obtains an accuracy of 95% in the gender classification task, which corresponds to a reduction of one percent with respect to our initial implementation. Furthermore, we tested this architecture on the FER-2013 dataset and we obtained the same accuracy of 66% for the emotion classification task. Our final architecture weights can be stored in an 855-kilobyte file. By reducing our architectures' computational cost we are now able to join both models and use them consecutively on the same image without incurring any serious time penalty. Our complete pipeline, including the OpenCV face detection module, the gender classification and the emotion classification, takes 0.22 ± 0.0003 ms on an i5-4210M CPU. This corresponds to a speedup of 1.5× when compared to the original architecture of Tang.
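A compact sketch of the mini-Xception design described above, assuming the Keras API (input size, filter counts and the exact block layout are illustrative rather than the precise published configuration):

from tensorflow.keras import layers, Model

def mini_xception(input_shape=(64, 64, 1), num_classes=7):
    inputs = layers.Input(shape=input_shape)
    x = layers.Conv2D(8, (3, 3), padding='same')(inputs)
    x = layers.BatchNormalization()(x)
    x = layers.Activation('relu')(x)

    # Four residual depth-wise separable convolution blocks.
    for filters in (16, 32, 64, 128):
        shortcut = layers.Conv2D(filters, (1, 1), strides=(2, 2), padding='same')(x)
        x = layers.SeparableConv2D(filters, (3, 3), padding='same')(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation('relu')(x)
        x = layers.SeparableConv2D(filters, (3, 3), padding='same')(x)
        x = layers.BatchNormalization()(x)
        x = layers.MaxPooling2D((3, 3), strides=(2, 2), padding='same')(x)
        x = layers.Add()([x, shortcut])

    # Last convolution, Global Average Pooling and softmax prediction.
    x = layers.Conv2D(num_classes, (3, 3), padding='same')(x)
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Activation('softmax')(x)
    return Model(inputs, outputs)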
We also added to our implementation a real-time guided back-propagation visualization to observe which pixels in the image activate an element of a higher-level feature map. Given a CNN with only ReLUs as activation functions for the intermediate layers, guided back-propagation takes the derivative of every element (x, y) of the input image I with respect to an element (i, j) of the feature map f^L in layer L. The reconstructed image R filters out all the negative gradients; consequently, the remaining gradients are chosen such that they only increase the value of the chosen element of the feature map. Following [11], a fully ReLU CNN reconstructed image in layer l is given by:

R^l_{i,j} = (R^{l+1}_{i,j} > 0) * R^{l+1}_{i,j}    (2)
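A minimal NumPy sketch of the masking in Equation 2 for a single layer (the function name and arguments are ours; a complete implementation applies this rule at every ReLU during the backward pass):

import numpy as np

def guided_backprop_step(grad_upper, activation_lower):
    # Equation 2: keep only the positive gradients arriving from layer l+1 ...
    guided = (grad_upper > 0) * grad_upper
    # ... and, following [11], also zero the positions where the forward
    # ReLU activation of layer l was inactive.
    return guided * (activation_lower > 0)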
IV. RESULTS

Results of the real-time emotion classification task on unseen faces can be observed in Figure 5. Our complete real-time pipeline, including face detection, emotion classification and gender classification, has been fully integrated in our Care-O-bot 3 robot.

Fig. 5: Results of the real-time emotion classification provided in our public repository.

An example of our complete pipeline can be seen in Figure 6, in which we provide emotion and gender classification. In Figure 7 we provide the confusion matrix results of our emotion classification mini-Xception model. We can observe several common misclassifications such as predicting “sad” instead of “fear” and predicting “angry” instead of “disgust”.

Fig. 7: Normalized confusion matrix of our mini-Xception network.

A comparison of the learned features between several emotions and both of our proposed models can be observed in Figure 8. The white areas in Figure 8b correspond to the pixel values that activate a selected neuron in our last convolution layer. The selected neuron was always chosen in accordance with the highest activation. We can observe that the CNN learned to get activated by considering features such as the frown, the teeth, the eyebrows and the widening of one's eyes, and that each feature remains constant within the same class. These results reassure us that the CNN learned to interpret understandable human-like features that provide generalizable elements. These interpretable results have helped us understand several common misclassifications, such as persons with glasses being classified as “angry”. This happens since the label “angry” is highly activated when it believes a person is frowning, and frowning features get confused with darker glass frames. Moreover, we can also observe that the features learned in