DL Mid

This document discusses experiments conducted to develop a YOLO model for face detection. It is divided into two parts where a basic YOLO implementation is first trained for face detection and then modifications are made to create a personalized YOLO model optimized specifically for face detection. The personalized model demonstrates improved accuracy and efficiency compared to the basic implementation according to the results.

Deep Learning

Mid Term

Instructor:

Dr. Mirza Mubasher Baig

Submitted by:

Sana Farooq 23L-8000

Amna Akbar 23L-7802

Contents

Introduction
Methodology
Results and Discussion
  Part A
  Part B
Conclusion
References

Introduction

Object detection is a core task in computer vision: it enables machines to identify and locate objects within images or video frames. Among object detection methods, the You Only Look Once (YOLO) model stands out for combining real-time speed with high accuracy. YOLO is a convolutional neural network that processes the whole image in a single forward pass, which makes it much faster than traditional detectors that rely on sliding windows.

YOLO divides the input image into a grid and predicts bounding boxes, confidence scores, and class probabilities for each grid cell. Because every cell is predicted in the same forward pass, YOLO can detect multiple objects of different classes in a single inference step, which suits real-time applications such as autonomous driving, surveillance, and robotics [1].
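The grid assignment described above can be illustrated with a minimal sketch. It assumes a square grid and box centres given in normalized image coordinates (0 to 1). The function names and the 13 x 13 grid size are illustrative choices, not values taken from the experiments in this report.

```python
def responsible_cell(cx, cy, grid_size=13):
    """Return (row, col) of the grid cell that contains a box centre.

    cx, cy: box centre in normalized image coordinates, each in [0, 1].
    grid_size: number of cells per side (13 is an assumed value).
    """
    if not (0.0 <= cx <= 1.0 and 0.0 <= cy <= 1.0):
        raise ValueError("centre must lie inside the image")
    # Clamp so a centre exactly on the right or bottom edge maps to the last cell.
    col = min(int(cx * grid_size), grid_size - 1)
    row = min(int(cy * grid_size), grid_size - 1)
    return row, col


def offset_in_cell(cx, cy, grid_size=13):
    """Return the centre's (x, y) offset measured from its cell's top-left corner."""
    row, col = responsible_cell(cx, cy, grid_size)
    return cx * grid_size - col, cy * grid_size - row
```

A face centred in the middle of the image, for example, falls in the middle cell of the grid, and that cell is the one expected to predict its bounding box.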

In this report, we apply YOLO to face detection, a task central to security systems, human-computer interaction, and biometric authentication. We undertake two main tasks:

1. Part A: Basic YOLO Implementation for Face Detection

In this task, we obtain a face detection dataset from Kaggle containing images annotated with bounding boxes around faces. We then implement a basic YOLOv3 model using OpenCV and fine-tune it to detect faces. The objective is to update only the final prediction layers, which this report calls the fully connected component (YOLOv3 itself is fully convolutional), so that the model specializes in human faces. We train the model on this dataset and evaluate its performance on a separate validation set.

2. Part B: Development of Personalized YOLO for Face Detection

Building on the basic implementation, we modify the YOLO architecture to develop a personalized version optimized for face detection. We experiment with removing certain pre-trained layers, adjusting network parameters, and introducing new components. The goal is a streamlined version of YOLO, tailored to human faces, with improved efficiency and accuracy.

Across both tasks, we aim not only for accurate face detection but also for models that can be deployed on resource-constrained devices such as mobile phones and edge computing platforms. By customizing YOLO for face detection, we seek to address the specific challenges of this task and support practical applications in real-world scenarios.

Experiment Description

In this experiment, we use the YOLO model for face detection. The basic YOLOv3 model is trained to detect a wide range of objects across 80 classes. For face detection, we need to adapt the model so that it detects faces exclusively.

For Part A, we download a face detection dataset from Kaggle, implement a basic YOLOv3 model, and fine-tune it on this dataset. Only the fully connected component (the final prediction layers) is updated, so that the model specializes in face detection.

For Part B, we modify the YOLO architecture to develop a personalized version. We examine the effect of removing certain pre-trained layers from the original YOLO network and compare the result with a single multi-layer perceptron. We also aim to reduce the number of trainable parameters while maintaining or improving performance.
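To make the move from 80 classes to a single face class concrete, the sketch below computes the number of output channels per prediction layer in the standard YOLOv3 design, where each grid cell predicts 3 anchor boxes and each box carries 4 coordinates, 1 objectness score, and one score per class. The function name is illustrative, and the exact layer configuration used in this experiment is not stated in the report.

```python
def yolo_head_channels(num_classes, anchors_per_cell=3):
    """Output channels of one YOLOv3 prediction layer.

    Each anchor box predicts 4 box coordinates, 1 objectness score,
    and one score per class.
    """
    if num_classes < 1 or anchors_per_cell < 1:
        raise ValueError("num_classes and anchors_per_cell must be positive")
    return anchors_per_cell * (5 + num_classes)


coco_channels = yolo_head_channels(80)  # standard 80-class model
face_channels = yolo_head_channels(1)   # single "face" class
```

Restricting the model to one class shrinks each prediction layer from 255 to 18 output channels, which is one reason a face-only model can have fewer trainable parameters in its final layers.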

Methodology

In Part A, we began by preparing the face detection dataset obtained from Kaggle. After organizing and preprocessing the dataset, which included resizing the images to a consistent resolution, we implemented the basic YOLOv3 model using the OpenCV library. We initialized the model with pre-trained weights and modified the fully connected component (the final prediction layers) to detect human faces only. The model was then trained on the annotated dataset, with its parameters updated iteratively to minimize the detection loss.

Part B, the development of a personalized YOLO model for face detection, followed a similar approach with additional attention to model modification and optimization. Because the face detection dataset is large (over 60,000 images), we trained the model on batches of 3,000 images at a time to keep memory and compute requirements manageable. This batch-wise strategy let us update the model parameters iteratively while monitoring performance metrics for convergence. During training, we experimented with different network architectures, layer configurations, and optimization techniques. We aimed to streamline the model for face detection by removing unnecessary layers, adjusting network parameters, and introducing components tailored to the task. Throughout, we documented each change to the model architecture and tracked its effect on the performance metrics to guide our decisions.
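The batch-wise strategy can be sketched as a generator that walks through the dataset in chunks of 3,000 images. This is a minimal sketch under stated assumptions: `train_on_chunk` is a hypothetical placeholder for the actual training step, the file names are made up, and the dataset size is rounded to 60,000.

```python
def iter_chunks(items, chunk_size=3000):
    """Yield successive slices of `items` with at most `chunk_size` elements."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


def train_on_chunk(chunk):
    """Hypothetical placeholder: a real version would load the images
    in `chunk` and update the model. Here it only reports the chunk size."""
    return len(chunk)


# Made-up file names standing in for the Kaggle dataset.
image_paths = [f"img_{i:05d}.jpg" for i in range(60000)]

chunks_seen = 0
images_seen = 0
for chunk in iter_chunks(image_paths):
    images_seen += train_on_chunk(chunk)
    chunks_seen += 1
```

Only one chunk of images needs to be held in memory at a time, and performance can be checked after each chunk before moving on to the next.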

After training and optimization, we evaluated the personalized YOLO model on a separate validation set. We compared it with the basic YOLO implementation on detection accuracy, inference speed, and model size, and we analyzed the trade-off between model complexity and performance to identify a suitable configuration for face detection. Through this evaluation, we aimed to demonstrate the effectiveness and efficiency of the personalized YOLO model for real-world applications and to lay groundwork for further work on object detection.
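Detection accuracy is commonly judged by the intersection over union (IoU) between a predicted box and a ground-truth box. The report does not state which criterion was used, so the sketch below is only an illustration. It assumes boxes in (x_min, y_min, x_max, y_max) format.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes.

    Each box is (x_min, y_min, x_max, y_max). Returns a value in [0, 1].
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region (zero if the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A prediction is often counted as correct when its IoU with a ground-truth face exceeds a chosen threshold such as 0.5. That threshold is a common convention, not a value reported here.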

Results and Discussion

Part A

The basic YOLO implementation detects faces in images from the dataset. Images with occlusions or complex backgrounds sometimes lead to false positives or missed detections. Overall, the model gives promising results across different poses and lighting conditions.

Part B

In Part B, our personalized YOLO model performed strongly, localizing human faces with tight bounding boxes. Although it was trained on batches of 3,000 images drawn from the full dataset of over 60,000 images, the model ran efficiently, was capable of real-time detection, and outperformed the basic YOLO implementation.

An essential aspect of face detection is the confidence score attached to each detection. The score ranges from 0 to 1 and reflects how certain the model is of its prediction. Clear, well-defined faces yielded higher confidence scores, while occluded or ambiguous faces yielded lower scores, reflecting the model's uncertainty in challenging scenarios. The confusion matrix shows that the model performed well on the validation set: all non-zero values lie on the diagonal, indicating that the model correctly classified every object in the validation set.
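The two ideas in this paragraph, filtering detections by confidence and reading a confusion matrix, can be sketched as follows. The 0.5 threshold, the dictionary layout of a detection, and the class names "face" and "background" are assumptions for illustration. The report does not state how its confusion matrices were computed.

```python
def filter_by_confidence(detections, threshold=0.5):
    """Keep only detections whose confidence is at least `threshold`."""
    return [d for d in detections if d["confidence"] >= threshold]


def confusion_matrix(true_labels, pred_labels, classes):
    """Count (true, predicted) label pairs. Rows are true classes, columns predicted."""
    index = {name: i for i, name in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for true, pred in zip(true_labels, pred_labels):
        matrix[index[true]][index[pred]] += 1
    return matrix


def is_diagonal(matrix):
    """True when every off-diagonal entry is zero, i.e. no misclassifications."""
    return all(
        value == 0
        for i, row in enumerate(matrix)
        for j, value in enumerate(row)
        if i != j
    )


# Made-up detections: a clear face and an occluded one.
detections = [
    {"box": (10, 10, 50, 50), "confidence": 0.92},
    {"box": (60, 20, 90, 55), "confidence": 0.31},
]
kept = filter_by_confidence(detections)

classes = ["face", "background"]
matrix = confusion_matrix(
    ["face", "face", "background"],
    ["face", "face", "background"],
    classes,
)
```

With these made-up inputs, only the high-confidence detection is kept, and the matrix has entries on the diagonal only, which is the pattern described above.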

The following figures show the original images, the images with faces bounded in rectangles, and their respective confusion matrices.


Conclusion

In conclusion, we have demonstrated that YOLO is effective for face detection. By fine-tuning the basic YOLOv3 model and developing a personalized version, we achieved accurate and efficient face detection. These models have applications in security, surveillance, and human-computer interaction. Future work may further optimize the personalized YOLO architecture and explore enhancements for robust face detection in challenging environments.

References

[1] F. Gurkan, B. Sagman and B. Gunsel, "YOLOv3 as a Deep Face Detector," 2019 11th International Conference on Electrical and Electronics Engineering (ELECO), Bursa, Turkey, 2019, pp. 605-609, doi: 10.23919/ELECO47770.2019.8990641.
