Image and Vedio Anlaytics
Image and Vedio Anlaytics
22ADC
IMAGE AND VIDEO ANALYTICS
Instruction 3 L Hours per Week
Duration of SEE 3 Hours
SEE 60 Marks
CIE 40 Marks
Credits 3
Prerequisite: NIL.
Course Objectives:
This course aims to:
1. To impart knowledge on the basic principles and concepts in digital image and video analytics.
2. To explore and demonstrate real time image and video analytics in solving practical problems of
commercial and scientific interests.
Course Outcomes:
Upon completion of this course, students will be able to:
1. Understand the requirements of image processing for computer vision and video analysis.
2. Illustrate the image pre-processing methods.
3. Develop various object detection techniques.
4. Understand the various face recognition mechanisms.
5. Elaborate on deep learning-based video analytics.
PO/PS
PO PO PO PO PO PO PO PO PO PO PO PO PSO PSO PSO
O
1 2 3 4 5 6 7 8 9 10 11 12 1 2 3
CO
CO 1 3 3 2 2 3 2 3
CO 2 3 3 2 2 3 2 3
CO 3 3 3 2 2 3 2 3
CO 4 3 3 2 2 3 2 3
CO 5 3 3 3 2 3 2 3
UNIT - I
INTRODUCTION
Computer Vision – Image representation and image analysis tasks - Imagerepresentations –digitization – properties
– color images – Data structures for Image Analysis - Levels of image data representation - Traditional and
Hierarchical image data structures.
UNIT - II
IMAGE PRE-PROCESSING: Local pre-processing - Image smoothing - Edge detectors - Zero-crossings of the
second derivative- Scale in image processing - Canny edge detection -Parametric edge models -Local pre-
processing in the frequency domain - Line detection by local pre-processing operators - Image restoration.
UNIT – III
Object Detection using Machine Learning: Phasor Object detection- Object detection methods- Deep Learning
framework for Object detection- bounding box approach-Intersection over Union (IoU) – Deep Learning
Architectures- R-CNN-Faster-R-CNN-You Only Look Once (OYLO)-Salient features-Loss Functions-YOLO
architectures.
UNIT – IV
Face Recognition and Gesture Recognition: Face Recognition- Introduction-Applications of Face Recognition-
Process of Face Recognition- Deep Face solution by Facebook – FaceNet for Face Recognition- Implementation
using FaceNet-Gesture Recognition.
UNIT – V
Video Analytics: Video processing – use cases of video analytics - Vanishing Gradient and exploding gradient
problem – Restnet architecture – RestNet and skip connections – Inception Network – GoogleNet architecture –
Improvement in Inception v2-Video analytics – RestNet and Inception v3.
Text Books:
1. Milan Sonka, Vaclav Hlavac, Roger Boyle, Image Processing, Analysis and Machine Vision”, 4 t h
edition, Thamson Learning, 2013.
2. Vaibhav Verdhan, “Computer Vision Using Deep Learning” Neural Networks Architectures with Python
and Keras, Apress 2021.
Suggested Reading:
1. Richard Szeliski, : Computer Vision:Algorithms and Applications”, Springer Verlag London Limited 2011.
2. Caifeng Shan, Faith Porikli, Tao Xiang Shaogang Gong, “ Video Analytics for Business Intelligence”,
Springer 2012.
3. D.A. Forsyth, J.Ponce, “ Computer Vision: A Modern Approach”, Pearson Education, 2003.