Identify Web Cam Images Using Neural Networks

This document discusses classifying webcam images in real time using deep learning and convolutional neural networks. It describes using a pretrained AlexNet model to identify objects from webcam images. AlexNet has been trained on over 1 million images and can classify images into 1000 categories. The document outlines problems in image classification, such as lighting variation, and explains key aspects of convolutional neural networks: convolutional layers, ReLU activation, pooling, and fully connected layers. It also details the architecture of AlexNet, which combines convolutional, pooling, and fully connected layers to classify images.


CLASSIFY WEBCAM IMAGES USING DEEP LEARNING
ABSTRACT

• Deep learning has emerged as a new era in machine learning and is being applied to a number of signal and image processing applications. The main purpose of the work presented in this paper is to apply a deep learning algorithm, namely convolutional neural networks (CNNs), to classifying webcam images in real time. The pretrained deep convolutional neural network that we use here is AlexNet, which has been trained on over a million images and can classify images into 1000 object categories (such as keyboard, coffee mug, pencil, and many animals). AlexNet has learned rich feature representations for a wide range of images. Images will be captured from our system webcam, and our pretrained deep convolutional neural network, AlexNet, will identify objects in our surroundings.
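
A minimal sketch of the pipeline described above, assuming Python with OpenCV for the webcam capture and torchvision's pretrained AlexNet (the original work may have used a different toolchain; the webcam index 0 is also an assumption):

import cv2
import torch
from torchvision import models

weights = models.AlexNet_Weights.IMAGENET1K_V1
model = models.alexnet(weights=weights).eval()
preprocess = weights.transforms()        # resize, center-crop, normalize for AlexNet
labels = weights.meta["categories"]      # the 1000 ImageNet category names

cap = cv2.VideoCapture(0)                # system webcam (index 0 assumed)
ok, frame = cap.read()                   # one BGR frame from the camera
if ok:
    rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
    batch = preprocess(torch.from_numpy(rgb).permute(2, 0, 1)).unsqueeze(0)
    with torch.no_grad():
        probs = torch.softmax(model(batch), dim=1)[0]
    print("Predicted:", labels[int(probs.argmax())])
cap.release()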
PROBLEMS IN CLASSIFYING IMAGES

 Large amount of intra-class variability
 Different lighting conditions
 Misalignment
 Non-rigid deformation
 Occlusion
 Corruption
WHAT IS DEEP LEARNING?

• Deep learning (also known as deep structured learning or hierarchical learning) is part of a broader family of machine learning methods based on learning data representations, as opposed to task-specific algorithms. Learning can be supervised, semi-supervised, or unsupervised.
• Deep learning architectures such as deep neural networks, deep belief networks and
recurrent neural networks have been applied to fields including computer vision, speech
recognition, natural language processing, audio recognition, social network filtering,
machine translation, bioinformatics, drug design, medical image analysis, material inspection
and board game programs, where they have produced results comparable to and in some
cases superior to human experts.
WHY DEEP LEARNING?

• Learning features from the data of interest is considered a possible way of remedying the limitations of hand-crafted features.
• Deep networks discover multiple levels of representation, with the hope that higher-level features can represent more abstract semantics of the data. Such abstract representations learned from a deep network are expected to provide greater robustness to intra-class variability.
• One key ingredient in the success of deep learning in image classification is the use of convolutional architectures. A convolutional deep neural network (ConvNet) architecture consists of multiple trainable stages stacked on top of each other, followed by a supervised classifier, as the sketch below illustrates.
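
A minimal sketch of this idea, assuming PyTorch; the layer sizes, the 32×32 input, and the 10 classes are illustrative assumptions, not values from this work:

import torch.nn as nn

convnet = nn.Sequential(
    # stage 1: convolution + non-linearity + pooling (trainable)
    nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    # stage 2: a second trainable stage stacked on top of the first
    nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    # supervised classifier on top of the stacked stages
    nn.Flatten(),
    nn.Linear(32 * 8 * 8, 10),   # assumes 32x32 RGB inputs and 10 classes
)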
CNN

• A CNN is a class of feed-forward artificial neural networks, most commonly applied to analyzing visual imagery. Convolutional neural networks are inspired by biological processes: in a CNN, the connectivity pattern between neurons resembles the organization of the animal visual cortex. Individual cortical neurons respond to stimuli only in a restricted region of the visual field known as the receptive field. The receptive fields of different neurons partially overlap such that they cover the entire visual field.
• CNNs use relatively little pre-processing compared to other image classification algorithms, which means the network learns the filters that in traditional algorithms were hand-engineered. This independence from prior knowledge and human effort in feature design is a major advantage.
CNN-ALEXNET

• It was designed by Alex Krizhevsky and published with Ilya Sutskever and Geoffrey Hinton. AlexNet competed in the ImageNet Large Scale Visual Recognition Challenge in 2012.
• The network achieved a top-5 error of 15.3%, more than 10.8 percentage points lower than that of the runner-up.
• AlexNet reports probabilities for each image it captures from the camera: it shows the five categories with the highest probabilities, and a chart is prepared from them (a sketch of this step follows below). AlexNet has been trained over more than 50,000 iterations and gives more accurate results than previously trained models.
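
A sketch of how the top-five probabilities could be obtained, again assuming torchvision's pretrained AlexNet; the random tensor stands in for a preprocessed webcam frame:

import torch
from torchvision import models

weights = models.AlexNet_Weights.IMAGENET1K_V1
model = models.alexnet(weights=weights).eval()
labels = weights.meta["categories"]

frame = torch.rand(1, 3, 227, 227)                 # stand-in for a preprocessed webcam frame
with torch.no_grad():
    probs = torch.softmax(model(frame), dim=1)[0]  # probabilities over the 1000 categories
top5 = torch.topk(probs, k=5)                      # the five highest probabilities
for p, idx in zip(top5.values, top5.indices):
    print(f"{labels[int(idx)]}: {float(p):.3f}")   # data for the probability chart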
ARCHITECTURE OF CNN

• A CNN consists of a number of convolutional and subsampling layers, optionally followed by fully connected layers. The input to a convolutional layer is an m x m x r image, where m is the height and width of the image and r is the number of channels; e.g. an RGB image has r = 3.
• The convolutional layer has k filters (or kernels) of size n x n x q, where n is smaller than the dimension of the image and q can either be the same as the number of channels r or smaller, and may vary for each kernel. The size of the filters gives rise to the locally connected structure; each filter is convolved with the image to produce k feature maps of size (m − n + 1) x (m − n + 1). Each map is then subsampled, typically with mean or max pooling over p x p contiguous regions, where p typically ranges from 2 for small images up to no more than 5 for larger inputs.
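
A small worked example of the size arithmetic above (the concrete numbers are illustrative assumptions, not values from the slides):

m, r = 32, 3           # image height/width and number of channels (RGB)
k, n = 8, 5            # number of filters and filter size
p = 2                  # pooling window size

feature_map = m - n + 1        # each of the k feature maps is (m - n + 1) x (m - n + 1) = 28 x 28
pooled = feature_map // p      # after non-overlapping p x p pooling: 14 x 14
print(k, feature_map, pooled)  # 8 feature maps, 28x28 before pooling, 14x14 after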
A SIMPLE CONV-NET
OPERATIONS IN CONV-NET

• Convolution
• Non-Linearity (ReLU)
• Pooling or Sub Sampling
• Classification (Fully Connected Layer)
CONVOLUTION

• ConvNets derive their name from the “convolution” operator. The primary purpose of convolution in a ConvNet is to extract features from the input image. Convolution preserves the spatial relationship between pixels by learning image features using small squares of input data. In CNN terminology, a small matrix such as a 3×3 matrix is called a ‘filter’, ‘kernel’, or ‘feature detector’, and the matrix formed by sliding the filter over the image and computing the dot product is called the ‘Convolved Feature’, ‘Activation Map’, or ‘Feature Map’. It is important to note that filters act as feature detectors on the original input image. In practice, a CNN learns the values of these filters on its own during the training process (although we still need to specify parameters such as the number of filters, the filter size, and the architecture of the network before training). The more filters we have, the more image features get extracted and the better our network becomes at recognizing patterns in unseen images.
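
A minimal NumPy sketch of this sliding-dot-product operation; the 6×6 image and the hand-written 3×3 filter are illustrative, since in a real CNN the filter values are learned:

import numpy as np

def slide_filter(image, kernel):
    """Slide the filter over the image and take the dot product at each location."""
    m, n = image.shape[0], kernel.shape[0]
    fmap = np.zeros((m - n + 1, m - n + 1))          # feature map of size (m - n + 1)
    for i in range(fmap.shape[0]):
        for j in range(fmap.shape[1]):
            fmap[i, j] = np.sum(image[i:i + n, j:j + n] * kernel)
    return fmap

image = np.arange(36, dtype=float).reshape(6, 6)     # toy 6x6 single-channel image
vertical_edge = np.array([[1., 0., -1.],             # a hand-written 3x3 feature detector;
                          [1., 0., -1.],             # in a CNN these values are learned
                          [1., 0., -1.]])
print(slide_filter(image, vertical_edge))            # 4x4 feature map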
NON-LINEARITY (RELU)

• An additional operation called ReLU is used after every convolution operation. ReLU stands for Rectified Linear Unit and is a non-linear operation.
• ReLU is an element-wise operation (applied per pixel) that replaces all negative pixel values in the feature map with zero. The purpose of ReLU is to introduce non-linearity into our ConvNet, since most of the real-world data we would want our ConvNet to learn is non-linear (convolution is a linear operation – element-wise matrix multiplication and addition – so we account for non-linearity by introducing a non-linear function like ReLU).
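
A quick NumPy illustration of this element-wise operation on a toy feature map:

import numpy as np

feature_map = np.array([[-3.0,  1.0],
                        [ 2.0, -0.5]])
print(np.maximum(feature_map, 0))   # negatives become zero: [[0. 1.] [2. 0.]]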
POOLING STEP

• Spatial Pooling (also called subsampling or down-sampling) reduces the dimensionality of each feature map but retains the most important information. Spatial Pooling can be of different types: Max, Average, Sum, etc. In the case of Max Pooling, we define a spatial neighborhood (for example, a 2×2 window) and take the largest element from the rectified feature map within that window. Instead of taking the largest element we could also take the average (Average Pooling) or the sum of all elements in that window.
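
A short NumPy sketch of 2×2 max pooling on a toy rectified feature map (the values are illustrative):

import numpy as np

def max_pool_2x2(fmap):
    """Take the largest element in each non-overlapping 2x2 window."""
    h, w = fmap.shape
    return fmap[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

rectified = np.array([[1., 0., 2., 3.],
                      [4., 6., 6., 8.],
                      [3., 1., 1., 0.],
                      [1., 2., 2., 4.]])
print(max_pool_2x2(rectified))   # [[6. 8.] [3. 4.]]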
FULLY CONNECTED LAYER

• The Fully Connected layer is a traditional Multi-Layer Perceptron that uses a softmax activation function in the output layer (other classifiers such as SVM can also be used, but we stick to softmax here). The term “Fully Connected” implies that every neuron in the previous layer is connected to every neuron in the next layer. The output from the convolutional and pooling layers represents high-level features of the input image. The purpose of the Fully Connected layer is to use these features to classify the input image into various classes based on the training dataset.
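
A NumPy sketch of such a softmax classifier on flattened features; the 128-dimensional feature vector, the 10 classes, and the random weights are stand-in assumptions (in a trained network the weights are learned):

import numpy as np

def softmax(z):
    e = np.exp(z - z.max())      # subtract max for numerical stability
    return e / e.sum()

features = np.random.rand(128)             # flattened high-level features from conv/pool layers
W = np.random.randn(10, 128) * 0.01        # one row of weights per class
b = np.zeros(10)                           # biases
probs = softmax(W @ features + b)          # class probabilities
print(probs.argmax(), round(float(probs.sum()), 3))   # predicted class; probabilities sum to 1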
ALEXNET ARCHITECTURE
DESCRIBING THE NETWORK

• The net contains eight layers with weights; the first five are convolutional and the remaining three are fully connected. The output of the last fully connected layer is fed to a 1000-way softmax which produces a distribution over the 1000 class labels. Response-normalization layers follow the first and second convolutional layers. Max-pooling layers follow both of the response-normalization layers as well as the last (fifth) convolutional layer. The ReLU non-linearity is applied to the output of every convolutional and fully connected layer.

• The input to the net is a 227 × 227 × 3 image. The filters for each convolutional layer are:
• 96 kernels of size 11 × 11 × 3 with step size (stride) 4
• 256 kernels of size 5 × 5 × 48* with step size 1
• 384 kernels of size 3 × 3 × 256 with step size 1
• 384 kernels of size 3 × 3 × 192* with step size 1
• 256 kernels of size 3 × 3 × 192* with step size 1
• (* The reduced channel depths reflect the split of the feature maps across two GPUs in the original implementation.)
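
A PyTorch sketch of a single-GPU equivalent of the layers listed above; the two-GPU split indicated by the asterisks is ignored, and the padding values are assumptions chosen to match the standard AlexNet implementation so that the spatial sizes come out as 55, 27, 13, 13, 13, and finally 6:

import torch.nn as nn

alexnet_like = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4), nn.ReLU(),            # conv1: 227 -> 55
    nn.LocalResponseNorm(5), nn.MaxPool2d(kernel_size=3, stride=2),   # 55 -> 27
    nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),          # conv2
    nn.LocalResponseNorm(5), nn.MaxPool2d(kernel_size=3, stride=2),   # 27 -> 13
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),         # conv3
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),         # conv4
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),         # conv5
    nn.MaxPool2d(kernel_size=3, stride=2),                            # 13 -> 6
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),                          # fc6
    nn.Linear(4096, 4096), nn.ReLU(),                                 # fc7
    nn.Linear(4096, 1000),                                            # fc8, fed to 1000-way softmax
)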
THANK YOU
