0% found this document useful (0 votes)

7 views43 pages

Deep Learning Approaches To Face Expression Classification - 111124

The document discusses deep learning approaches for face expression classification, emphasizing the use of Convolutional Neural Networks (CNNs) for image classification tasks. It outlines the process of training models, including dataset preparation, model selection, evaluation, and deployment, while highlighting the importance of custom models and transfer learning. Additionally, it provides insights into the necessary dataset sizes and considerations for effective image classification.

Uploaded by

Sanynita Kiskindy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views43 pages

Deep Learning Approaches To Face Expression Classification - 111124

Uploaded by

Sanynita Kiskindy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

DEEP LEARNING APPROACHES TO

FACE EXPRESSION CLASSIFICATION

• Djoko Purwanto
• Artificial Intelligence and Health Technology Research Center
• Institut Teknologi Sepuluh Nopember (ITS)
DEEP LEARNING & IMAGE CLASSIFICATION
Deep Learning for Image Classification
Deep learning, particularly through Convolutional Neural Networks (CNNs), has
revolutionized image classification by automatically learning features from raw data, leading
to higher accuracy and efficiency compared to traditional machine learning methods

Key Points
 Feature Learning: Deep learning models can automatically learn and extract features from
images, eliminating the need for manual feature engineering.
 Accuracy: Deep learning models, especially CNNs, have achieved state-of-the-art
performance in various image classification tasks.
 Scalability: These models can handle large datasets and complex image classification
problems more effectively than traditional methods.
 Transfer Learning: Pre-trained deep learning models can be fine-tuned for specific tasks,
making them versatile and efficient.
2
Image Classification
Image classification using deep learning involves training a model to recognize and categorize images
into predefined classes.

Brief Overview
 Dataset: Collect a large set of labeled images, each tagged with the correct category.
 Preprocessing: Resize, normalize, and augment the images to prepare them for training.
 Model Selection: Choose a deep learning model, for image data.
 Training: Train the model with the preprocessed images to learn features and patterns,
minimizing errors through forward and backpropagation.
 Evaluation: Test the model on separate images to check its accuracy and performance using
metrics like accuracy, precision, recall, and F1-score.
 Fine-Tuning: Adjust the model’s parameters and architecture to improve performance,
possibly using transfer learning.
 Deployment: Deploy the model to classify new images in real-time applications once it
performs well

3
FACE EXPRESSION CLASSIFICATION
Face expression classification identifies emotions through facial cues, focusing on three classes: Angry,
Happy, and Sad. Angry expressions show furrowed brows and narrowed eyes, indicating tension;
Happy expressions feature raised mouth corners and bright eyes, conveying joy; and Sad expressions
are marked by downturned lips and drooping eyelids, reflecting sorrow. Deep learning techniques, can
be uset to analyze these expressions for applications such as customer service, mental health, and
interactive technologies.

Image input
predicted output
Model [ Happy ]

parameter

4
Dataset
The dataset comprises a diverse collection of facial expression images sourced from various online
platforms, aimed at enhancing the study of emotion recognition through deep learning. It includes
labeled images representing key emotional states: Angry, Happy, and Sad.

5
Custom Model
A custom deep learning model is specifically designed and tailored to meet the unique
requirements of a particular task or dataset. Unlike pre-trained models, trained on large, generic
datasets, custom models are built from scratch or fine-tuned to address specific problems.

Key Aspects
 Architecture Design: You can design the architecture of the neural network to suit your specific
needs. This includes choosing the number of layers, types of layers (e.g., convolutional, recurrent),
and the connections between them.
 Training from Scratch: Custom models can be trained from scratch using your own dataset. This is
useful when you have a unique dataset that doesn’t match the data used to train pre-existing
models.
 Transfer Learning: Often, custom models are built using transfer learning, where a pre-trained
model is adapted to a new task. This involves taking a model trained on a large dataset and fine-
tuning it with your specific data.

6
 Hyperparameter Tuning: Custom models allow for extensive hyperparameter tuning to
optimize performance. This includes adjusting learning rates, batch sizes, and other
parameters to achieve the best results.
 Specialized Layers and Functions: You can incorporate specialized layers or custom functions
that are not available in standard models. This might include custom loss functions, activation
functions, or other unique components.
 Application-Specific: Custom models are tailored to specific applications, such as medical
image analysis, natural language processing, or autonomous driving, ensuring they perform
optimally for the intended use case.

7
Custom Model Example

8
Custom Model using Transfer Learning

9
Deployment
Deployment of face expression classification models involves integrating the trained model into
a real-world application, ensuring it can process input data (like images or video) and return
accurate emotion predictions. This process includes setting up the deployment environment,
optimizing the model for performance, and conducting thorough testing to ensure reliability.

Image input
predicted output
Face
Face [ Happy ]
Expression
Detection
Classification

10
DEEP LEARNING ELEMENTS
Rescaling
 Definition: Adjusting the scale of data or images to fit a specific range or size.

 In Data Processing:
o Normalization: Scaling data values to a common range (e.g., 0 to 1).
o Standardization: Transforming data to have a mean of 0 and a standard deviation of 1.

 In Image Processing:
o Resizing: Changing the dimensions of an image (e.g., increasing or decreasing width and
height).
o Interpolation: Estimating pixel values during resizing using methods like nearest neighbor
or bilinear interpolation

11
Batch Normalization

 Definition: A technique used in neural networks to normalize the inputs of each layer, improving
training speed and stability.
 Purpose:
o Reduces internal covariate shift by normalizing layer inputs.
o Helps mitigate issues related to vanishing/exploding gradients.
 How It Works:
o Normalizes the output of a layer by subtracting the batch mean and dividing by the batch
standard deviation.
o Applies learnable parameters (scale and shift) to allow the model to retain the ability to
represent the original distribution.
 Implementation:
o Typically inserted after the linear transformation (e.g., before activation functions).
o Can be applied to both fully connected and convolutional layers.

12
Convolutional 2D
 Definition: A core operation in Convolutional Neural Networks (CNNs) for processing 2D data,
primarily images.
 Convolution Operation:
o Involves sliding a filter (kernel) over the input image.
o Performs element-wise multiplication and sums the results to produce an output value.
 Input and Output:
o Input: 2D array (grayscale) or 3D array (color images).
o Output: Feature map that highlights detected features.
 Stride and Padding:
o Stride: Determines how far the filter moves (e.g., stride of 1 moves one pixel at a time).
o Padding: Adds extra pixels around the image to control output size.
 Multiple Filters: Uses various filters to capture different features, resulting in multiple feature maps.
 Applications: Image classification, object detection, and image segmentation.
 Benefits: Captures spatial hierarchies of features, making CNNs effective for visual data tasks.

13
14
Pooling
 Definition: A downsampling operation used in Convolutional Neural Networks (CNNs) to reduce the
spatial dimensions of feature maps.
 Purpose:
o Decreases the number of parameters and computations in the network.
o Helps prevent overfitting by providing an abstracted representation of the input.
 Types of Pooling:
o Max Pooling: Takes the maximum value from a defined window (e.g., 2x2) of the feature map.
o Average Pooling: Computes the average value from the defined window.
o Global Average Pooling: Averages all values in the feature map, resulting in a single value per
feature map.
 Stride and Window Size:
o Stride: Determines how far the pooling window moves (e.g., a stride of 2 skips every other pixel).
o Window Size: Defines the dimensions of the pooling operation (e.g., 2x2, 3x3).
 Applications: Commonly used in CNN architectures for image classification and object detection.

15
16
Flatten

 Definition: A layer in neural networks that converts a multi-

dimensional input (e.g., a 2D feature map) into a one-
dimensional vector.
 Purpose: Prepares data for fully connected layers by
transforming the output of convolutional or pooling layers
into a flat format.
 Implementation: Typically used after convolutional and
pooling layers in CNN architectures.
 Applications: Commonly used in image classification tasks
where the output from convolutional layers needs to be fed
into dense layers.

17
Dropout
 Definition: A regularization technique used in neural networks to prevent overfitting by randomly
setting a fraction of the neurons to zero during training.
 Purpose:
o Reduces reliance on specific neurons, encouraging the network to learn more robust features.
o Improves generalization to unseen data.
 How It Works:
o During each training iteration, a specified percentage (e.g., 20%) of neurons are randomly
“dropped out” (set to zero).
o The remaining neurons continue to learn and update their weights.
 Implementation:
o Typically applied after activation functions in fully connected layers or convolutional layers.
o The dropout rate is a hyperparameter that can be tuned.

18
19
ReLU Activation
 Definition: ReLU (Rectified Linear Unit) is an activation
function used in neural networks that outputs the input
directly if it is positive; otherwise, it outputs zero.
 Purpose:
o Introduces non-linearity into the model, allowing it to
learn complex patterns.
o Helps mitigate the vanishing gradient problem
commonly seen with sigmoid or tanh functions.
 Characteristics:
o Sparsity: Activates only a portion of neurons, leading
to a sparse representation.
o Computational Efficiency: Simple to compute, making
it faster than other activation functions.
 Implementation: Widely used in hidden layers of deep
learning models, especially in convolutional neural networks
(CNNs).
20
PROGRAMMING
Face Expression Classification using Custom Model

21
22
23
24
25
26
27
28
29
30
31
32
33
34
Face Expression Classification using Transfer Learning
The transfer learning program is similar to the custom models program described earlier, with
the primary difference being the model architecture.

35
Performance Evaluation

36
37
38
Inference using Existing Model
Custom Model

39
40
Model from Transfer Learning
The program closely resembles the
one described earlier, with the key
difference being the need to modify
the model file declared in the main
function.

41
AMOUNT OF DATA FOR IMAGE CLASSIFICATION
1. Minimum Dataset Size
 Small Datasets: For simple tasks or when using transfer learning with pre-trained models, you might
get away with as few as 100-1,000 images per class.
 Moderate Datasets: For more complex tasks, aim for 1,000-10,000 images per class.
2. Ideal Dataset Size
Large Datasets: For robust performance, especially with deep learning models, having 10,000+ images
per class is ideal. Some successful models use hundreds of thousands of images.
3. Considerations
 Class Imbalance: Ensure that you have a balanced number of images across classes to avoid bias.
 Data Augmentation: Techniques like rotation, flipping, and scaling can effectively increase your
dataset size without needing more images.
 Quality Over Quantity: High-quality, well-labeled images are more beneficial than a large number
of poorly labeled ones.
4. Benchmarking
Look at similar projects or datasets in your domain to gauge what has worked well for others.
5. Experimentation
Start with a smaller dataset and gradually increase it while monitoring model performance to find the
sweet spot for your specific application.
42
THANK YOU

Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
Image Super Resolution Report
No ratings yet
Image Super Resolution Report
12 pages
Introduction To Convolutional Neural Networks
No ratings yet
Introduction To Convolutional Neural Networks
41 pages
Chapter #5 - Deep Learning
No ratings yet
Chapter #5 - Deep Learning
34 pages
Deep Learning For IoT Big Data and Streaming Analytics
No ratings yet
Deep Learning For IoT Big Data and Streaming Analytics
34 pages
Emotion Detection
No ratings yet
Emotion Detection
17 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
Brain Computer Interface
No ratings yet
Brain Computer Interface
1 page
Final Report On Facial Emotion Detection Using Machine Learning
No ratings yet
Final Report On Facial Emotion Detection Using Machine Learning
12 pages
Super VIP Cheetsheet - Deep Learning, AI, ML
No ratings yet
Super VIP Cheetsheet - Deep Learning, AI, ML
47 pages
CMR University School of Engineering and Technology Department of Cse and It
No ratings yet
CMR University School of Engineering and Technology Department of Cse and It
6 pages
Seminar Report cnn1
No ratings yet
Seminar Report cnn1
23 pages
Stranger Detection: Yada Arun Kumar
No ratings yet
Stranger Detection: Yada Arun Kumar
9 pages
Week8 - Machine Learning
No ratings yet
Week8 - Machine Learning
35 pages
An Overview of Convolutional Neural Network Architectures For Deep Learning
No ratings yet
An Overview of Convolutional Neural Network Architectures For Deep Learning
22 pages
Introduction To Deep Convolutional Neural Networks: March 2016
No ratings yet
Introduction To Deep Convolutional Neural Networks: March 2016
51 pages
Learning
No ratings yet
Learning
12 pages
Lecture 08 On Neural Networks 1
No ratings yet
Lecture 08 On Neural Networks 1
15 pages
Boosted Convolutional Neural Network For Real Time Facial Expression Recognition
No ratings yet
Boosted Convolutional Neural Network For Real Time Facial Expression Recognition
4 pages
CC511 Week 7 - Deep - Learning
No ratings yet
CC511 Week 7 - Deep - Learning
33 pages
Deep Learning Notes For Easy Access
No ratings yet
Deep Learning Notes For Easy Access
14 pages
L09-10 DL and CNN
No ratings yet
L09-10 DL and CNN
56 pages
Face Identification Based On K-Nearest Neighbor
No ratings yet
Face Identification Based On K-Nearest Neighbor
21 pages
Data Analytics With Cognos Questions
No ratings yet
Data Analytics With Cognos Questions
15 pages
DLCV Ch2 Neural Network
No ratings yet
DLCV Ch2 Neural Network
68 pages
Project Report On Emotion Aware Smart Music Recommended System Using CNN
No ratings yet
Project Report On Emotion Aware Smart Music Recommended System Using CNN
11 pages
Unit 4a - Convolutional Neural Networks
No ratings yet
Unit 4a - Convolutional Neural Networks
107 pages
Computer Vision NN Architecture
No ratings yet
Computer Vision NN Architecture
19 pages
03 Convolution Neural Networks and Computer Vision With Tensorflow
No ratings yet
03 Convolution Neural Networks and Computer Vision With Tensorflow
21 pages
UNIT 2 Self Notes
No ratings yet
UNIT 2 Self Notes
10 pages
ML Labs
No ratings yet
ML Labs
46 pages
6-DeepVisualLearning L6
No ratings yet
6-DeepVisualLearning L6
82 pages
Stage 424 June 2023
No ratings yet
Stage 424 June 2023
89 pages
Be - Computer Engineering - Semester 3 - 2022 - December - Engineering Mathematics III Rev 2019 C Scheme
No ratings yet
Be - Computer Engineering - Semester 3 - 2022 - December - Engineering Mathematics III Rev 2019 C Scheme
2 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
11 pages
Facial Emotion Detection
No ratings yet
Facial Emotion Detection
10 pages
IIS Question Bank Module 4-6
No ratings yet
IIS Question Bank Module 4-6
3 pages
Top 10 NLP Question - Answer
No ratings yet
Top 10 NLP Question - Answer
16 pages
Ann 5TH
No ratings yet
Ann 5TH
98 pages
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
No ratings yet
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
21 pages
Introduction To Convolutional Neural Networks
No ratings yet
Introduction To Convolutional Neural Networks
4 pages
Deep Learning Notes
No ratings yet
Deep Learning Notes
155 pages
Deep Learning Lab
No ratings yet
Deep Learning Lab
11 pages
CV PPT Mt101
No ratings yet
CV PPT Mt101
16 pages
Bootcamp Schedule
No ratings yet
Bootcamp Schedule
1 page
Steven Kolawole SOP
No ratings yet
Steven Kolawole SOP
2 pages
Unit2 CNN
No ratings yet
Unit2 CNN
34 pages
Assignment # 02
No ratings yet
Assignment # 02
1 page
The Evolution of AI
No ratings yet
The Evolution of AI
8 pages
CV - T3 - Unit-7
No ratings yet
CV - T3 - Unit-7
36 pages
DL Lab Ex - No.5
No ratings yet
DL Lab Ex - No.5
2 pages
L11 Learning III Neural Network Architectures
No ratings yet
L11 Learning III Neural Network Architectures
35 pages
MLT UNIT-4 & 5 Imp Sol
No ratings yet
MLT UNIT-4 & 5 Imp Sol
22 pages
Antim Prahar AI and ML For Business 2025
No ratings yet
Antim Prahar AI and ML For Business 2025
45 pages
Pro Rep
No ratings yet
Pro Rep
25 pages
Pattern Recognition
No ratings yet
Pattern Recognition
14 pages
M4 Ia2
No ratings yet
M4 Ia2
6 pages
Group 4
No ratings yet
Group 4
30 pages
Machine Learning (CSO851) - Lecture 10
No ratings yet
Machine Learning (CSO851) - Lecture 10
83 pages
Introai Last Edit
No ratings yet
Introai Last Edit
11 pages
Unit 3
No ratings yet
Unit 3
105 pages
Churn Forecasting Using Deep Ljearning Model
No ratings yet
Churn Forecasting Using Deep Ljearning Model
5 pages
03 Pytorch Computer Vision
No ratings yet
03 Pytorch Computer Vision
29 pages
ch4 CNN
No ratings yet
ch4 CNN
35 pages
CNN 3
No ratings yet
CNN 3
21 pages
Rec03 - Deep Architectures
No ratings yet
Rec03 - Deep Architectures
65 pages
DL Module - (4,5)
No ratings yet
DL Module - (4,5)
70 pages
Lecture 3
No ratings yet
Lecture 3
48 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Unit II
No ratings yet
Unit II
38 pages
Kusu Ai
No ratings yet
Kusu Ai
10 pages
GDG SOF WEEK 1 (Intro To GenAI)
No ratings yet
GDG SOF WEEK 1 (Intro To GenAI)
15 pages
CNN - NASA Battery Dataset
No ratings yet
CNN - NASA Battery Dataset
7 pages
BLIP-2: Bootstrapping Language-Image Pre-Training With Frozen Image Encoders and Large Language Models
No ratings yet
BLIP-2: Bootstrapping Language-Image Pre-Training With Frozen Image Encoders and Large Language Models
13 pages
Experiment 3.3
No ratings yet
Experiment 3.3
3 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
Unit 3 CNN 2024
No ratings yet
Unit 3 CNN 2024
58 pages
DL Unit Iv
No ratings yet
DL Unit Iv
18 pages
CS601 Machine Learning Unit 3
No ratings yet
CS601 Machine Learning Unit 3
47 pages
Autoencoders - by Kavishka Abeywardana - Medium
No ratings yet
Autoencoders - by Kavishka Abeywardana - Medium
19 pages
Lec14 CNNRNNModels
No ratings yet
Lec14 CNNRNNModels
64 pages
Convolution Neural Network
No ratings yet
Convolution Neural Network
6 pages
Unit 3
No ratings yet
Unit 3
59 pages
Unit Iv - NNDL
No ratings yet
Unit Iv - NNDL
32 pages
Lecture - 07 (Convolutional Neural Networks)
No ratings yet
Lecture - 07 (Convolutional Neural Networks)
57 pages
DeekshikaJadyada21 AP24LDS11
No ratings yet
DeekshikaJadyada21 AP24LDS11
5 pages
Combining Classifiers
No ratings yet
Combining Classifiers
12 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
Financial Documents For Effcient Retirval
No ratings yet
Financial Documents For Effcient Retirval
87 pages
Generative Ai: A Comprehensive Guide to Innovative Ai Models (A Step-by-step Understanding of Fundamental Concepts With Practical Applications)
From Everand
Generative Ai: A Comprehensive Guide to Innovative Ai Models (A Step-by-step Understanding of Fundamental Concepts With Practical Applications)
Anthony Phillips
No ratings yet

Deep Learning Approaches To Face Expression Classification - 111124

Uploaded by

Deep Learning Approaches To Face Expression Classification - 111124

Uploaded by

DEEP LEARNING APPROACHES TO

FACE EXPRESSION CLASSIFICATION

 Definition: A layer in neural networks that converts a multi-

You might also like