AI MasterClass Day11Intern

Uploaded by

tasilapoornashree

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views20 pages

AI MasterClass Day11Intern

Uploaded by

tasilapoornashree

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

AI Master Class series –

Day 11
Object Recognition
Day-11 Agenda.

01. 02. 03.

Implementing Object
Object Recognition Recognition Deep Neural Network
Object Recognition Methodology DNN & Object
recognition Model

04. 05.
Deploying Real-time Object
MobileNetSSD recognition
Pre-trained Model Pre-Trained Model
Object Recognition.
Object recognition is a computer vision technique
for identifying objects in images or videos. Object
recognition is a key output of deep learning and
machine learning algorithms. When humans look at
a photograph or watch a video, we can readily spot
people, objects, scenes, and visual details.
Implementing Object
Recognition.
PRE-TRAINED
0
1 MODEL

0
TRANSFER
2 LEARNING

0
BUILDING FROM
3 SCRATCH
Deep Neural Network -
DNN.
• Solve Complex Task
• When it gets new information in the system, it
learns how to act accordingly to a new situation.
• Learning becomes deeper when tasks you solve
get harder.
• Helps to load pre-trained Model from DL
frameworks such as
 Tensorflow
 Caffe
 Darknet
 Torch
Speed Comparison on Image
Classification.
Pre-trained Model for Object
recognition.
• MobileNet-SSD
• GoogleNet
• Squeezenet
• Faster R-CNN
• ResNet
• Inception
• YOLO
• VGGNet
MobileNet SSD (Single shot Multibox
Detector).
• The MobileNet model is based on depthwise separable convolutions which are a
form of factorized convolutions. These factorize a standard convolution into a
depthwise convolution and a 1 × 1 convolution called a pointwise convolution.

• For MobileNets, the depthwise convolution applies a single filter to each input
channel. The pointwise convolution then applies a 1 × 1 convolution to combine
the outputs of the depthwise convolution.

• A standard convolution both filters and combines inputs into a new set of outputs
in one step. The depthwise separable convolution splits this into two layers
– a separate layer for filtering and a separate layer for combining. This
factorization has the effect of drastically reducing computation and model size.

• The SSD architecture is a single convolution network that learns to predict

bounding box locations and classify these locations in one pass. Hence, SSD can be
trained end-to-end.
MobileNet SSD.
MobileNet SSD
Architecture.
ReLu.
• The Rectified Linear Unit is the most commonly
used activation function in deep learning
models.

• The function returns 0 if it receives any negative

input, but for any positive value x it returns
that value back. So it can be written as
f(x)=max(0,x) .

• the ReLu function is able to accelerate the

training speed of deep neural networks
compared to traditional activation functions
since the derivative of ReLu is 1 for a positive
input.

• Due to a constant, deep neural networks do not

need to take additional time for computing error
terms during training phase.
OpenCV Basic Syntax for DNN.
Loading Image from Disk to DNN
cv2.dnn.blobFromImage
cv2.dnn.blobFromImages

Import Model from various Framework

cv2.dnn.createCaffeImporter
cv2.dnn.createTensorFlowImporter
cv2.dnn.createTorchImporter

cv2.dnn.readNetFromCaffe
cv2.dnn.readNetFromTensorFlow
cv2.dnn.readNetFromTorch
cv2.dnn.readhTorchBlob

.forward` method is used to forward-propagate our image and obtain

cv2.dnn.blobFromImage.

Mean Subtracted Normalized Image =

cv2.dnn.blobFromImage(resizedImage,scalingFactor,
Spatial Size, Mean Subtraction Values)

blob =
cv2.dnn.blobFromImage(imResizeBlob,0.007843,
(300, 300), 127.5)
Block Diagram – Workflow of DNN in
OpenCV.
Select Reading frame
Load Model Select target
Backend from camera

Convert to
Forward Post Process
Blob
Numpy Basic Syntax.
Numpy.array
arr = np.array([1, 2, 3, 4, 5])

Numpy.arrange

NumPy arange() is one of the array creation routines

based on numerical ranges. It creates an instance of
ndarray with evenly spaced values and returns the
reference to it.

for i in np.arange(0,detShape):
Practical
session using
Object Recognition
Pre-trained Model with DNN
in OpenCV
import numpy as np
import imutils
import time
import cv2

prototxt = "MobileNetSSD_deploy.prototxt.txt"
model = "MobileNetSSD_deploy.caffemodel"
confThresh = 0.2
CLASSES = ["background", "aeroplane", "bicycle", "bird", "boat",
"bottle", "bus", "car", "cat", "chair", "cow", "diningtable",
"dog", "horse", "motorbike", "person", "pottedplant", "sheep",
"sofa", "train", "tvmonitor"]
COLORS = np.random.uniform(0, 255, size=(len(CLASSES), 3))
print("Loading model...")
net = cv2.dnn.readNetFromCaffe(prototxt, model)
print("Model Loaded")
print("Starting Camera Feed...")
vs = cv2.VideoCapture(0)
time.sleep(2.0)
while True:
_,frame = vs.read()
frame = imutils.resize(frame, width=500)

(h, w) = frame.shape[:2]
imResizeBlob = cv2.resize(frame, (300, 300))
blob = cv2.dnn.blobFromImage(imResizeBlob,0.007843, (300, 300), 127.5)
net.setInput(blob)
detections = net.forward()
detShape = detections.shape[2]

for i in np.arange(0,detShape):
confidence = detections[0, 0, i, 2]
if confidence > confThresh:
idx = int(detections[0, 0, i, 1])
box = detections[0, 0, i, 3:7] * np.array([w, h, w, h])
(startX, startY, endX, endY) = box.astype("int")
label = "{}: {:.2f}%".format(CLASSES[idx],confidence * 100)
cv2.rectangle(frame, (startX, startY), (endX, endY),COLORS[idx], 2)
if startY - 15 > 15:
y = startY - 15
else:
startY + 15
cv2.putText(frame, label, (startX, y),
cv2.FONT_HERSHEY_SIMPLEX, 0.5, COLORS[idx], 2)
cv2.imshow("Frame", frame)
key = cv2.waitKey(1)
if key == 27:
break
vs.release()
cv2.destroyAllWindows()
AI News – Day 11.
Yesterday - 2020

American Institute of Physics

● The range of AI technologies available for
dealing with brain disease is growing fast,
and exciting new methods are being
applied to brain problems as computer
scientists gain a deeper understanding of
the capabilities of advanced algorithms.
● Researchers conducted a systematic
literature review to understand the state
of the art in the use of AI for brain
disease. Their qualitative review sheds
light on the most interesting corners of AI
development.
Tomorrow
Thanks! session
Do you have any questions? Image Classification using
[email protected] CNN

www.pantechsolutions.net

Machine Vison Homework 10
No ratings yet
Machine Vison Homework 10
11 pages
Object Detection With Deep Learning
No ratings yet
Object Detection With Deep Learning
3 pages
Transfer Learning Models
No ratings yet
Transfer Learning Models
5 pages
W11 Lecture ITS69204 Image Recognition
No ratings yet
W11 Lecture ITS69204 Image Recognition
44 pages
Helmet and Vehicle License Plate Detection System
No ratings yet
Helmet and Vehicle License Plate Detection System
26 pages
Real Time Object Detection and Recognition Using Mobilenet SSD With Opencv IJERTV11IS010070
No ratings yet
Real Time Object Detection and Recognition Using Mobilenet SSD With Opencv IJERTV11IS010070
2 pages
Unit Iv - NNDL
No ratings yet
Unit Iv - NNDL
32 pages
2802 8020 1 PB
No ratings yet
2802 8020 1 PB
3 pages
Deep Residual Learning
No ratings yet
Deep Residual Learning
80 pages
Object Detection With Deep Learning 221232297 1
No ratings yet
Object Detection With Deep Learning 221232297 1
19 pages
A Comparative Study On Convolutional Neural Network Based Face Recognition
No ratings yet
A Comparative Study On Convolutional Neural Network Based Face Recognition
5 pages
CV Unit V
No ratings yet
CV Unit V
18 pages
Deep Learning With Pytorch: Ai Courses by Opencv
No ratings yet
Deep Learning With Pytorch: Ai Courses by Opencv
9 pages
1 Realtimeobjectdetection
No ratings yet
1 Realtimeobjectdetection
6 pages
Faster R-CNN
No ratings yet
Faster R-CNN
20 pages
Scalable Object Detection
No ratings yet
Scalable Object Detection
8 pages
CNN Models To Detect Multiple Leds For Multilateral Occ.: Project: Ieee P802.15 Ig Vat
No ratings yet
CNN Models To Detect Multiple Leds For Multilateral Occ.: Project: Ieee P802.15 Ig Vat
9 pages
Deep Learning in Matlab
No ratings yet
Deep Learning in Matlab
36 pages
Vitamin Deficiency Detection (Base Paper)
No ratings yet
Vitamin Deficiency Detection (Base Paper)
3 pages
Neural Network Project Report.
No ratings yet
Neural Network Project Report.
12 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
Younis 2020
No ratings yet
Younis 2020
5 pages
Deep Object Pose Estimation For Semantic Robotic Grasping of Household Objects
No ratings yet
Deep Object Pose Estimation For Semantic Robotic Grasping of Household Objects
11 pages
Deep Learning Approach For Object Detection Using CNN: Abstract
No ratings yet
Deep Learning Approach For Object Detection Using CNN: Abstract
7 pages
M10 - Introduction To TensorFlow, Deep Learning and Application
No ratings yet
M10 - Introduction To TensorFlow, Deep Learning and Application
25 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
Object Detection Using Deep Learning Approach
100% (1)
Object Detection Using Deep Learning Approach
9 pages
Legodnn: Block-Grained Scaling of Deep Neural Networks For Mobile Vision
No ratings yet
Legodnn: Block-Grained Scaling of Deep Neural Networks For Mobile Vision
14 pages
DNN Architectures
No ratings yet
DNN Architectures
12 pages
Last Lab Report
No ratings yet
Last Lab Report
6 pages
Nivetha Me P2 PPT
No ratings yet
Nivetha Me P2 PPT
18 pages
CV Ss16 0609 Deep Learning
No ratings yet
CV Ss16 0609 Deep Learning
91 pages
Realtime Visual Recognition in Deep Convolutional Neural Networks
No ratings yet
Realtime Visual Recognition in Deep Convolutional Neural Networks
13 pages
Fulltext01 P
No ratings yet
Fulltext01 P
78 pages
Espinosa, Velastin, Branch - 2017 - Vehicle Detection Using Alex Net and Faster R-CNN Deep Learning Models A Comparative Study-Annotated
No ratings yet
Espinosa, Velastin, Branch - 2017 - Vehicle Detection Using Alex Net and Faster R-CNN Deep Learning Models A Comparative Study-Annotated
14 pages
Conference Paper
No ratings yet
Conference Paper
3 pages
Deep Learning For Vision Lab Manual 2024
100% (1)
Deep Learning For Vision Lab Manual 2024
25 pages
Mobilenet For Image Classification
No ratings yet
Mobilenet For Image Classification
3 pages
Task 9 Implementation of Object Detection and Localization
No ratings yet
Task 9 Implementation of Object Detection and Localization
7 pages
Lect11 Neural Nets2
No ratings yet
Lect11 Neural Nets2
48 pages
Under The Guidance of BY, Prof. Meiliu Lu Shekhar Shiroor
No ratings yet
Under The Guidance of BY, Prof. Meiliu Lu Shekhar Shiroor
17 pages
Mobile Net
No ratings yet
Mobile Net
9 pages
Tmp4e31 TMP
No ratings yet
Tmp4e31 TMP
7 pages
Computer Vision Engineer Interview Preparation Guide
No ratings yet
Computer Vision Engineer Interview Preparation Guide
20 pages
Advance Questions Answers
No ratings yet
Advance Questions Answers
4 pages
Object Detection and Recognition: Final Project Title
No ratings yet
Object Detection and Recognition: Final Project Title
6 pages
CNN 2
No ratings yet
CNN 2
47 pages
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
No ratings yet
Ding 2018 IOP Conf. Ser. Mater. Sci. Eng. 322 062024
6 pages
Brief Introduction of Mobilenetv1 V2 V3 Lightweight Network
No ratings yet
Brief Introduction of Mobilenetv1 V2 V3 Lightweight Network
29 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Tiny Object Recognition
No ratings yet
Tiny Object Recognition
8 pages
Week5 Computer Vision
No ratings yet
Week5 Computer Vision
58 pages
MYPPTT
No ratings yet
MYPPTT
19 pages
MobileNetV2 Inverted Residuals and Linear Bottlenecks
No ratings yet
MobileNetV2 Inverted Residuals and Linear Bottlenecks
11 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
Object Classification Using CNN
No ratings yet
Object Classification Using CNN
9 pages
Convolutional Neural Networks: CMSC 733 Fall 2015 Angjoo Kanazawa
No ratings yet
Convolutional Neural Networks: CMSC 733 Fall 2015 Angjoo Kanazawa
55 pages
Comprehensive Notes On Advanced CNN Concepts & Vision Tasks
No ratings yet
Comprehensive Notes On Advanced CNN Concepts & Vision Tasks
5 pages
Deepiris: Iris Recognition Using A Deep Learning Approach
No ratings yet
Deepiris: Iris Recognition Using A Deep Learning Approach
4 pages
Lecture 1 - Introduction To ML
No ratings yet
Lecture 1 - Introduction To ML
41 pages
Back Propagation Back Propagation Network Network Network Network
No ratings yet
Back Propagation Back Propagation Network Network Network Network
29 pages
Hands-On Machine Learning: Chapter 5: Support Vector Machines
No ratings yet
Hands-On Machine Learning: Chapter 5: Support Vector Machines
32 pages
Deep Learning Strategy For Braille Character Recognition
No ratings yet
Deep Learning Strategy For Braille Character Recognition
15 pages
NLP Lab2
No ratings yet
NLP Lab2
7 pages
SEMG Basedhandgesturesclassificationusingasemi Supervisedmulti LayerneuralnetworkswithAutoencoder
No ratings yet
SEMG Basedhandgesturesclassificationusingasemi Supervisedmulti LayerneuralnetworkswithAutoencoder
10 pages
NLP Text Summary
No ratings yet
NLP Text Summary
21 pages
Progress in Energy and Combustion Science: Masoud Aliramezani, Charles Robert Koch, Mahdi Shahbakhti
No ratings yet
Progress in Energy and Combustion Science: Masoud Aliramezani, Charles Robert Koch, Mahdi Shahbakhti
38 pages
Guo Generating Diverse and Natural 3D Human Motions From Text CVPR 2022 Paper
No ratings yet
Guo Generating Diverse and Natural 3D Human Motions From Text CVPR 2022 Paper
10 pages
딥러닝 기반 의미론적 분할 기법을 통한 건물 자동추출 연구 모델의 가중치 경중과 전이학습에
No ratings yet
딥러닝 기반 의미론적 분할 기법을 통한 건물 자동추출 연구 모델의 가중치 경중과 전이학습에
11 pages
LLM4TS-Aligning Pre-Trained LLMs As Data-Efficient Time-Series Forecasters
No ratings yet
LLM4TS-Aligning Pre-Trained LLMs As Data-Efficient Time-Series Forecasters
14 pages
Keras Cheat Sheet Python For Data Science: Model Architecture Inspect Model
No ratings yet
Keras Cheat Sheet Python For Data Science: Model Architecture Inspect Model
1 page
Literature Survey Ai Resume Screening
No ratings yet
Literature Survey Ai Resume Screening
3 pages
Object Detection
No ratings yet
Object Detection
57 pages
Machine Learning-5
No ratings yet
Machine Learning-5
89 pages
Breast Cancer Classification-Group240
No ratings yet
Breast Cancer Classification-Group240
4 pages
1694600937-Unit2.5 Support Vector Machine CU 2.0
No ratings yet
1694600937-Unit2.5 Support Vector Machine CU 2.0
26 pages
Students Placement Prediction System
No ratings yet
Students Placement Prediction System
5 pages
9036 - English
No ratings yet
9036 - English
2 pages
Machine Learning
No ratings yet
Machine Learning
256 pages
Artificial General Intelligence
No ratings yet
Artificial General Intelligence
6 pages
LAB MANUAL CST (Soft Computing) 12-02-2019
No ratings yet
LAB MANUAL CST (Soft Computing) 12-02-2019
68 pages
Basics of Artificial Intelligence (AI)
No ratings yet
Basics of Artificial Intelligence (AI)
2 pages
UNIT-5 Part1
No ratings yet
UNIT-5 Part1
15 pages
An Innovative Method For Hindi Word Sense Disambiguation: Binod Kumar Mishra Suresh Jain
No ratings yet
An Innovative Method For Hindi Word Sense Disambiguation: Binod Kumar Mishra Suresh Jain
17 pages
AICTE-vaani Proposal Template
100% (1)
AICTE-vaani Proposal Template
2 pages
Neural Networks-A Diffusion Model Changing The Landscape
No ratings yet
Neural Networks-A Diffusion Model Changing The Landscape
13 pages
Unit 4 - Week 3: Assignment 3
No ratings yet
Unit 4 - Week 3: Assignment 3
3 pages
Transformers
No ratings yet
Transformers
20 pages