Week8 WEB

The document discusses convolutional neural networks (CNNs) and the YOLO object detection model. It provides an overview of CNN architecture including convolution, activation, pooling, flattening, and fully connected layers. It explains how CNNs use shared weights and biases to detect features across image regions. The document also describes how YOLO improves on previous models by predicting bounding boxes and class probabilities simultaneously for real-time object detection in images.

Uploaded by

Ankit Shaw

TECHIN513 – Managing Signal and Data Processing

Week 8
Today’s Agenda
• CNN
• YOLO
• ICTE
• FPDAWT
Today’s Agenda
• Convolutional Neural Network
• You Only Look Once
• In Class Team Exercise
• Final Project Discussion And Work Time
Announcement
• Purchasing supplies for final project
• Budget of $40 per team
• Requests must be made by Monday, February 26 at 9:59am

Link to Request Form: TECHIN513 Final Project Supply Request Form - Google Sheets
What is a convolutional neural network?
• A network architecture for deep learning
• CNNs can have tens or hundreds of hidden layers
• Includes a typical artificial neural network architecture
• Useful for finding patterns in images to recognize objects
Stages of a CNN
• Input image
• Convolution
• Activation
• Pooling
• Flattening
• Fully Connected ANN
• Activation
• Output

Convolutional Operations | Medium

Greyscale Image Data
• Pixel values range from 0 to 255
• Example image: a 24x16 matrix

How Do Machines Read and Store Images? | Analytics Vidhya

Color Image Data
• One image has three matrices, or “channels”
• Pixel values range from 0 to 255

How Do Machines Read and Store Images? | Analytics Vidhya
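
The two image slides above can be illustrated with a minimal NumPy sketch. The pixel values here are random placeholders, and reading "24x16" as 24 rows by 16 columns is an assumption; the slide does not say which dimension comes first.

```python
import numpy as np

rng = np.random.default_rng(0)

# Greyscale image: a single matrix of 8-bit pixel values (0 to 255).
# Assumption: 24 rows by 16 columns.
grey = rng.integers(0, 256, size=(24, 16), dtype=np.uint8)

# Color image of the same size: three matrices ("channels") stacked
# along the last axis, each with values from 0 to 255.
color = rng.integers(0, 256, size=(24, 16, 3), dtype=np.uint8)

# Selecting one channel gives back a single 24x16 matrix.
first_channel = color[:, :, 0]

print(grey.shape, grey.dtype)
print(color.shape)
print(first_channel.shape)
```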

CNN Overview
Feature Extraction

Feature Extraction with CNNs | Towards Data Science

Typical Artificial Neural Network
• Each neuron in the input layer is connected to every neuron in the hidden layer
• Each connection has a weight value
• Each neuron has a bias value
• The model learns these values during the training process
• Values are updated with each new training example

Introduction to Deep Learning - MATLAB

Convolutional Neural Network
• The weights and bias values are the same for all neurons in a hidden layer
• All hidden neurons in a layer are therefore detecting the same feature (e.g. an edge) in different regions of an image
• The network is better equipped to detect the feature regardless of its location in an image

Introduction to Deep Learning - MATLAB

Convolutional Operation

An operation on two functions which produces a third, combined function

Convolution Integral | Statistics How To
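
For reference, the standard continuous-time convolution of two functions f and g can be written as below. The symbols f, g, t, and τ are conventional notation and do not appear on the slide.

```latex
(f * g)(t) = \int_{-\infty}^{\infty} f(\tau)\, g(t - \tau)\, d\tau
```

In a CNN the same idea is applied in discrete form: the integral becomes a sum over the pixels covered by the kernel.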

Convolutional Operation
• A convolutional kernel is a small 2D matrix
• The kernel is applied to each region of the input image by element-wise multiplication and addition
• The output is a matrix of lower dimensions (the feature map)
• Sliding window protocol, where stride = 1

[Figure: kernel types]

Convolutional Operations | Medium
Convolving to Create Feature Maps

CNNs | simplilearn
45*0 + 12*(-1) + 5*0
+ 22*(-1) + 10*5 + 35*(-1)
+ 88*0 + 26*(-1) + 51*0
= -45
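
The arithmetic above can be reproduced with a short sliding-window sketch. The function name `conv2d_valid` is made up for illustration, and the 3x3 layout of the patch and kernel assumes the nine terms on the slide are listed row by row. Like most CNN libraries, the sketch does not flip the kernel; for this symmetric kernel the result is the same either way.

```python
import numpy as np

def conv2d_valid(image, kernel, stride=1):
    """Slide the kernel over the image with no padding and return the feature map."""
    kh, kw = kernel.shape
    out_h = (image.shape[0] - kh) // stride + 1
    out_w = (image.shape[1] - kw) // stride + 1
    out = np.zeros((out_h, out_w), dtype=np.int64)
    for i in range(out_h):
        for j in range(out_w):
            r, c = i * stride, j * stride
            window = image[r:r + kh, c:c + kw]
            # Element-wise multiplication, then sum of all products.
            out[i, j] = np.sum(window * kernel)
    return out

# Pixel values and kernel weights taken from the worked example above.
patch = np.array([[45, 12, 5],
                  [22, 10, 35],
                  [88, 26, 51]])
kernel = np.array([[0, -1, 0],
                   [-1, 5, -1],
                   [0, -1, 0]])

print(conv2d_valid(patch, kernel))  # [[-45]]

# A 5x5 input with a 3x3 kernel and stride 1 gives a smaller 3x3 feature map.
larger = np.arange(25).reshape(5, 5)
feature_map = conv2d_valid(larger, kernel)
print(feature_map.shape)  # (3, 3)
```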
Activation Step: Rectified Linear Unit (ReLU)
• The activation function takes the output of a neuron and keeps it unchanged if it is positive
• If the output is negative, the function maps it to zero
• ReLU is a commonly used activation function in deep learning

Introduction to Deep Learning - MATLAB

ReLU activation retains only positive values

CNNs | simplilearn
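
A minimal sketch of the ReLU step, applied to a small made-up feature map (the values are placeholders, apart from the -45 carried over from the convolution example):

```python
import numpy as np

def relu(x):
    """Keep positive values unchanged and map negative values to zero."""
    return np.maximum(x, 0)

feature_map = np.array([[-45, 12],
                        [0, 7]])
activated = relu(feature_map)
print(activated)
```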
CNN Overview
Pooling Step
• Pooling reduces the dimensionality of the feature map by using different filters, producing a new feature map
• Condenses regions of neurons into a single output
• Simplifies the model by reducing the number of parameters the model needs to learn
• Pooling retains the most important information but lowers resolution

Introduction to Deep Learning - MATLAB

Pooling Applies Various Filters

CNNs | simplilearn
Pooling Enhances Edges
• Three iterations of max pooling using a (2, 2) kernel
• Features (edges) are enhanced, but resolution is reduced

Pooling In Convolutional Neural Networks | paperspace
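
Max pooling with a (2, 2) kernel can be sketched as follows. The helper name `max_pool2d` and the sample values are illustrative only, and the sketch assumes non-overlapping windows (stride equal to the kernel size).

```python
import numpy as np

def max_pool2d(feature_map, size=2):
    """Non-overlapping max pooling: each size x size block becomes one value."""
    h, w = feature_map.shape
    h2, w2 = h // size, w // size
    # Drop any rows or columns that do not fill a complete block.
    trimmed = feature_map[:h2 * size, :w2 * size]
    return trimmed.reshape(h2, size, w2, size).max(axis=(1, 3))

fm = np.array([[1, 3, 2, 0],
               [4, 6, 1, 1],
               [0, 2, 9, 8],
               [1, 1, 7, 5]])
pooled = max_pool2d(fm)
print(pooled)  # [[6 2]
               #  [2 9]]

# Three iterations halve the resolution three times: 16x16 -> 8x8 -> 4x4 -> 2x2.
x = np.arange(256).reshape(16, 16)
for _ in range(3):
    x = max_pool2d(x)
print(x.shape)  # (2, 2)
```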

CNN Overview
Flattening
• The flatten layer lies between the CNN and the ANN
• Converts the feature map from the pooling layer into an input that the ANN can understand
• The ANN requires a one-dimensional array as input

Feature Maps | educative.io , Dense layers | Pysource
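
Flattening amounts to a reshape. In this sketch the pooled output is assumed to be a stack of three 2x2 feature maps; the shape is chosen only for illustration.

```python
import numpy as np

# Hypothetical pooled output: 2x2 spatial size with 3 feature maps.
pooled_maps = np.arange(2 * 2 * 3).reshape(2, 2, 3)

# Flatten into the one-dimensional array expected by the fully connected ANN.
flat = pooled_maps.reshape(-1)

print(pooled_maps.shape)  # (2, 2, 3)
print(flat.shape)         # (12,)
```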

Softmax Activation Step
• Often used as the last activation function to normalize the output of a network to a probability distribution over predicted output classes
• The output of a Softmax is a vector with probabilities of each possible outcome.

[Figure: mathematical representation; last fully connected layer]

Softmax Activation Function | Towards Data Science
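
A minimal Softmax sketch. The input scores are made-up outputs of a last fully connected layer for three classes; subtracting the maximum before exponentiating is a common numerical-stability step and does not change the result.

```python
import numpy as np

def softmax(scores):
    """Normalize raw scores into probabilities that sum to 1."""
    shifted = scores - np.max(scores)  # avoids overflow in exp
    exps = np.exp(shifted)
    return exps / exps.sum()

# Hypothetical raw scores for K = 3 classes.
scores = np.array([2.0, 1.0, 0.1])
probs = softmax(scores)

print(probs)
print(probs.sum())
print(int(np.argmax(probs)))  # index of the most likely class
```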

CNN Output Layer
The final layer of the CNN architecture provides the final classification output:
a vector of length K, equal to the number of classes

Introduction to Deep Learning - MATLAB

Classification, Detection, & Segmentation

or object localization

Object Segmentation vs. Object Detection | LinkedIn

You Only Look Once
• "You Only Look Once" (YOLO)
• YOLOv1 paper published May 2016
• Uses CNN as its backbone
network architecture
• YOLO predicts bounding boxes
and class probabilities for these
boxes simultaneously
• Improvement on previous model:
R-CNN

https://arxiv.org/abs/1506.02640
YOLO

https://pjreddie.com/darknet/yolo/

https://arxiv.org/abs/1506.02640
Previous Model for Image Detection: R-CNN
• Regions with CNN features
• Published Oct 2014
• link to article
• Proposes about 2000 regions per image as bounding boxes, then classifies each region
• Drawbacks:
  • Long time to train: must classify 2000 regions per image
  • Detection not in real time: 47 sec per test image
  • Bounding box inaccuracies

R-CNN | Towards Data Science

How does YOLO work?
• Resizes the input image to 448x448
• 1x1 convolutions are applied before the 3x3 convolutions to reduce the number of channels
• 24 convolutional layers
• 4 max pooling layers
• The activation function is leaky ReLU (the final layer uses a linear activation)
• Two fully connected layers

[Figure: YOLO Architecture]

https://arxiv.org/abs/1506.02640
What is Object Detection?
First let’s talk about object localization

What is object localization?
Object localization is finding what and where a (single) object exists in a single image

[Figure: bounding box with center (bx, by), width (bw), and height (bh)]
How is object localization described numerically in YOLO?
• The coordinates of a bounding box are described as a vector

x_train (input image) → y_train (label vector):

Pc   1     (probability of class)
Bx   0.5
By   0.6
Bw   0.4
Bh   0.3
C1   1
C2   0

C1 = car class
C2 = motorcycle class
How is object localization described numerically in YOLO?
• The coordinates of a bounding box are described as a vector

x_train (input image) → y_train (label vector):

Pc   1     (probability of class)
Bx   0.5
By   0.6
Bw   0.4
Bh   0.3
C1   1
C2   0

C1 = car class
C2 = motorcycle class

[Figure: image coordinates run from (0,0) to (1,1); the box center (bx, by) is at (0.5, 0.6), with width bw = 0.4 and height bh = 0.3]
How is object localization described numerically in YOLO?
• The coordinates of a bounding box are described as a vector

Output of Neural Network:

Pc   1     (probability of class)
Bx   0.5
By   0.6
Bw   0.4
Bh   0.3
C1   0.97
C2   0.03

C1 = car class
C2 = motorcycle class

[Figure: image coordinates run from (0,0) to (1,1); the box center (bx, by) is at (0.5, 0.6), with width bw = 0.4 and height bh = 0.3]
How is object localization described numerically in YOLO?
• The coordinates of a bounding box are described as a vector

x_train (input image with no object) → y_train (label vector):

Pc   0     (probability of class)
Bx   -
By   -
Bw   -
Bh   -
C1   -
C2   -

C1 = car class
C2 = motorcycle class
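
The label vectors on the slides above can be built with a small helper. This is a sketch under stated assumptions: the function names are made up, (Bx, By) is taken to be the box center and (Bw, Bh) the box size, all as fractions of the image size, and zeros stand in for the "-" (don't care) entries when no object is present.

```python
import numpy as np

def encode_label(box, class_index, num_classes=2):
    """Build [Pc, Bx, By, Bw, Bh, C1, ..., Cn] for one image."""
    y = np.zeros(5 + num_classes)
    if box is None:
        # No object: Pc = 0. The slides show "-" for the remaining entries;
        # zeros are used here only as a placeholder.
        return y
    y[0] = 1.0                   # Pc
    y[1:5] = box                 # Bx, By, Bw, Bh
    y[5 + class_index] = 1.0     # one-hot class entry
    return y

def to_pixel_corners(y, img_w, img_h):
    """Convert a normalized center/size box to pixel corner coordinates."""
    bx, by, bw, bh = y[1:5]
    x_min = (bx - bw / 2) * img_w
    y_min = (by - bh / 2) * img_h
    x_max = (bx + bw / 2) * img_w
    y_max = (by + bh / 2) * img_h
    return x_min, y_min, x_max, y_max

car = encode_label((0.5, 0.6, 0.4, 0.3), class_index=0)  # C1 = car
empty = encode_label(None, class_index=0)

print(car)
print(empty)
print(to_pixel_corners(car, img_w=448, img_h=448))
```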
What about multiple objects?

YOLO algorithm | YouTube

What about multiple objects?

Pc 0
Bx -
By -
Bw -
Bh -
C1 -
C2 -

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?
Person’s object belongs to this cell

Pc 1
Bx 0.05
By 0.3
Bw 2
Bh 1.3
C1 1
C2 0

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?

Pc 1
Bx 0.32
By 0.02
Bw 2.2
Bh 1.7
C1 0
C2 1

C1 = dog class
C2 = person class

YOLO algorithm | YouTube

What about multiple objects?

All other cells
4x4x7 matrix

Pc 0
Bx -
By -
Bw -
Bh -
C1 -
C2 -

C1 = dog class
C2 = person class

YOLO algorithm | YouTube
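
The 4x4x7 target can be sketched as a grid of per-cell vectors. The two non-empty vectors are copied from the slides above; the cell positions `dog_cell` and `person_cell` are hypothetical, since the slides show them only in the figure. Zeros stand in for the "-" entries of empty cells.

```python
import numpy as np

GRID = 4    # 4x4 grid of cells
DEPTH = 7   # Pc, Bx, By, Bw, Bh, C1 (dog), C2 (person)

# Every cell starts as "no object" (Pc = 0).
target = np.zeros((GRID, GRID, DEPTH))

# Hypothetical (row, column) positions of the cells that own each object.
dog_cell = (2, 0)
person_cell = (1, 2)

# Vectors as shown on the slides. Bw and Bh can exceed 1, which suggests
# they are measured relative to the cell size rather than the image size.
target[dog_cell] = [1, 0.05, 0.3, 2, 1.3, 1, 0]
target[person_cell] = [1, 0.32, 0.02, 2.2, 1.7, 0, 1]

print(target.shape)                        # (4, 4, 7)
print(np.argwhere(target[..., 0] == 1))    # cells that contain an object
```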

Training the YOLO Model

YOLO algorithm | YouTube

YOLO Prediction

YOLO algorithm | YouTube

Evaluating Image Detection Models
• Common Objects in Context (COCO) dataset
• Published by Microsoft
• Used to evaluate algorithms’ performance of real-time object detection
• 330,000 images
• 200,000 are labeled
• 1.5 million object instances
• 5 captions per image

COCO Dataset | viso.ai

Evaluating Image Detection Models
• Mean Average Precision (mAP)
• Benchmark metric used to evaluate the robustness of object detection models
• Incorporates mathematics from:
  • Error matrix (confusion matrix)
  • Intersection over union (IoU) ratio for bounding box

[Figures: error matrix; intersection over union]

Understanding Confusion Matrix | Towards Data Science
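
The two ingredients named above can be sketched as follows. This is not a full mAP implementation; it only shows IoU for two boxes and precision and recall from error-matrix counts. Boxes are assumed to be given as (x_min, y_min, x_max, y_max), and the function names are illustrative.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])
    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0, ix_max - ix_min) * max(0, iy_max - iy_min)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def precision_recall(tp, fp, fn):
    """Precision and recall from true positive, false positive, false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall

predicted = (1, 1, 3, 3)
ground_truth = (0, 0, 2, 2)
print(iou(predicted, ground_truth))   # overlap area 1, union area 7

# Hypothetical counts: 8 correct detections, 2 false alarms, 2 missed objects.
print(precision_recall(tp=8, fp=2, fn=2))
```

A detection is typically counted as a true positive only when its IoU with a ground-truth box exceeds a chosen threshold; the threshold value is a design choice and is not specified on the slide.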

Best Object Detection Models

Object Detection | viso.ai

YOLOv8

YOLOv8 Tutorial - Colaboratory (google.com)

YOLOv8

Ultralytics YOLOv8 | GitHub

ICTE
