1 OpenCV
We used OpenCV to capture the real-time video feed from the camera, which enabled us to access
the camera and read the video frames.
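The capture-and-read loop can be sketched against OpenCV's VideoCapture interface; the generator below is a minimal illustration (the helper name stream_frames is ours, not part of OpenCV):

```python
def stream_frames(capture):
    """Yield frames from a cv2.VideoCapture-style object until read() fails."""
    while True:
        ok, frame = capture.read()
        if not ok:          # read() returns False when the stream ends or fails
            break
        yield frame
    capture.release()       # free the camera or file handle when done

# Typical use with a live camera (requires opencv-python):
#   import cv2
#   for frame in stream_frames(cv2.VideoCapture(0)):  # 0 = default camera
#       ...process the frame...
```

Passing a filename instead of a device index reads a pre-recorded video through the same loop.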
3 YOLO-NAS
After preprocessing, we input the frames into our model for object detection.
The model processed each frame and returned the coordinates of the bounding boxes for the
detected objects.
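The exact structure of the returned coordinates depends on the detection library and its version, but a common convention is one (x1, y1, x2, y2, confidence, class_id) row per object; the hypothetical helper below shows how such rows can be turned into labeled records:

```python
def parse_detections(rows, class_names):
    """Turn raw (x1, y1, x2, y2, confidence, class_id) rows into labeled records.

    The row layout is an assumption; match it to your model's actual output.
    """
    detections = []
    for x1, y1, x2, y2, conf, cls in rows:
        detections.append({
            "box": (int(x1), int(y1), int(x2), int(y2)),  # pixel coordinates
            "confidence": float(conf),
            "label": class_names[int(cls)],               # e.g. COCO class names
        })
    return detections
```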
We then used OpenCV's drawing functions to render these bounding boxes on the original frames,
and we labeled each box with the class of the detected object.
6 Sort Algorithm
After obtaining the bounding boxes and segmentation masks, we had a large number of data points
to manage. The Sort Algorithm helped us organize the bounding boxes and segmentation masks
based on various criteria such as their size, position, and class.
This sorting process made it easier for us to analyze the results and also improved the
efficiency of subsequent operations like tracking objects across frames.
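A minimal sketch of this sorting step, assuming each detection is a dict with a class label and an (x1, y1, x2, y2) box; the particular criteria and tie-breaking order below are our illustration, not the project's exact code:

```python
def sort_detections(detections):
    """Order detections by class label, then by descending box area,
    then left-to-right by x1 as a final tie-break."""
    def area(d):
        x1, y1, x2, y2 = d["box"]
        return (x2 - x1) * (y2 - y1)
    # A tuple key sorts by each criterion in turn; -area gives biggest first.
    return sorted(detections, key=lambda d: (d["label"], -area(d), d["box"][0]))
```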
Finally, writing the annotated frames back out allowed us to create a video that visually
represented the results of our object detection and segmentation tasks.
reasons behind our choice of YOLO-NAS
Secondly, YOLO-NAS significantly improves small object detection and localization accuracy³,
and it also delivers strong performance in post-training quantization and real-time
edge-device applications¹.
Thirdly, YOLO-NAS is open-source⁴ and is 10-20% faster than the pre-existing YOLO models⁴. It
was designed using AutoNAC⁴, a neural architecture search engine, which sets a new record in
object detection by providing the best accuracy and latency trade-off performance⁴.
Now, let’s talk about the accuracy of YOLO-NAS. The model, when converted to its INT8
quantized version, experiences a minimal precision drop, a significant improvement over other
models². In terms of pure numbers, YOLO-NAS is ~0.5 mAP point more accurate and 10-20%
faster than equivalent variants of YOLOv8 and YOLOv7⁵,⁶.
role of OpenCV
OpenCV, or Open Source Computer Vision, is a powerful library that we utilized
extensively in our project. It provided us with tools to process images and videos, which
are crucial in the field of object detection.
One of the primary uses of OpenCV in our project was in preprocessing the input
images. Before feeding the images into our YOLO-NAS model, we used OpenCV to
resize the images to the required dimensions, and normalize the pixel values. This
ensured that our model received inputs in a consistent and standardized format,
thereby improving its performance.
We also used OpenCV for drawing bounding boxes and labels on our output images.
Once our YOLO-NAS model detected objects in an image, we used OpenCV’s drawing
functions to visually represent these detections. This involved drawing bounding
boxes around the detected objects and labeling them with their respective classes.
This visual representation made it easier for us to interpret the model’s output and
verify its accuracy.
In addition, OpenCV was instrumental in handling real-time video streams. We used
its VideoCapture and VideoWriter classes to read input from a video file or a camera
in real-time, run our YOLO-NAS model on each frame, and then write the output
frames (with the detections drawn on them) to an output video file. This allowed us
to use our model for real-time object detection tasks.
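The read-process-write loop described above can be sketched as follows; the wiring shown in the comments (file names, MJPG codec) is illustrative:

```python
def process_video(reader, writer, process_frame):
    """Read every frame, transform it with process_frame, and write it out."""
    while True:
        ok, frame = reader.read()
        if not ok:
            break
        writer.write(process_frame(frame))
    reader.release()
    writer.release()

# Typical wiring with OpenCV (paths and codec are illustrative):
#   import cv2
#   cap = cv2.VideoCapture("input.mp4")
#   w = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
#   h = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
#   fps = cap.get(cv2.CAP_PROP_FPS) or 30
#   out = cv2.VideoWriter("out.avi", cv2.VideoWriter_fourcc(*"MJPG"), fps, (w, h))
#   process_video(cap, out, lambda f: f)  # swap the identity for detect + draw
```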
Lastly, OpenCV’s vast array of image-processing functions was invaluable
in augmenting our training data. We used transformations like rotation, translation, scaling,
and flipping to artificially increase the size of our training dataset and introduce
more variability into it. This helped improve the robustness of our model and its
ability to generalize to new, unseen data.
Segment Anything Model (SAM)
Secondly, SAM's high accuracy³ in segmenting objects in images was a key advantage. Trained on
over 1 billion masks from 11 million licensed and privacy-respecting images, SAM's zero-shot
performance is often competitive with, or even superior to, prior fully supervised results². This
ensured that our project benefited from precise and reliable segmentation results.
In conclusion, the Segment Anything Model (SAM) has been an invaluable tool in our project. Its
versatility, high accuracy, ease of use, and real-time interaction capabilities have significantly
enhanced the performance and efficiency of our project.
Sort Algorithm
Firstly, the Sort Algorithm was used to organize our data. Unorganized data can lead to
inefficiency and inaccuracies, but by using the Sort Algorithm we were able to arrange our
data in a specific order, be it numerical, alphabetical, or based on any other relevant
criterion. This made our data more manageable and accessible.
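One concrete payoff of keeping data in sorted order is fast lookup; the sketch below, using hypothetical bounding-box areas, shows a binary search over a sorted list with Python's standard bisect module:

```python
from bisect import bisect_left

# Hypothetical bounding-box areas, sorted once so later lookups are O(log n).
areas = sorted([1200, 80, 560, 3020, 150])

def count_smaller(area):
    """How many stored detections have a smaller area than this one?"""
    return bisect_left(areas, area)
```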
Thirdly, the Sort Algorithm was instrumental in identifying patterns and making
predictions. Once our data was sorted, it was easier to observe trends and patterns. This was
particularly useful in the data analysis phase of our project, where we needed to make
informed decisions based on our data.
Lastly, the Sort Algorithm helped in improving the user experience. For instance, if our
project involved displaying information to the user, sorted data allowed us to present this
information in a more structured and understandable manner.
In conclusion, the Sort Algorithm was an invaluable tool in our project. Its ability to organize
data, optimize search operations, identify patterns, and improve user experience significantly
enhanced the efficiency and effectiveness of our project.
a responsive alerting mechanism
In our project, we developed a responsive alerting mechanism for timely threat notification.
This process began with the detection of potential threats in the video feed using our
YOLO-NAS model. When a threat was identified, the model generated a bounding box around it and
classified it.
Following the detection, the Segment Anything Model (SAM) was used to segment the
detected threat from the rest of the image. This segmentation provided us with more granular
information about the threat, such as its exact shape and size.
Once the threat was detected and segmented, we evaluated it based on various criteria such as
its size, position, and class. This evaluation helped us to determine the severity of the threat.
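A severity score combining these criteria might be sketched as follows; the weights and the high-risk class list are our illustrative assumptions, not values from the project:

```python
import numpy as np

def severity(mask, label, high_risk=("person", "weapon")):
    """Score a segmented threat from its mask size, position, and class.

    `mask` is a 2-D boolean array from the segmentation step; the weighting
    scheme and high-risk classes are illustrative.
    """
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        return 0.0                                  # nothing segmented
    h, w = mask.shape
    area_ratio = xs.size / (h * w)                  # size: fraction of frame covered
    cy, cx = ys.mean() / h, xs.mean() / w           # position: mask centroid
    centrality = 1.0 - 2.0 * max(abs(cy - 0.5), abs(cx - 0.5))  # 1 center, ~0 edge
    class_weight = 2.0 if label in high_risk else 1.0
    return float(area_ratio * (0.5 + centrality) * class_weight)
```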
If the threat was severe enough, an alert was generated. This alert contained information about
the threat, such as its class, position, and size. The generated alert was then sent to the
relevant parties through an appropriate notification channel, such as an email, a push
notification, or any other form of communication suitable for the situation.
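Putting the pieces together, an alert payload might be assembled like this minimal sketch (the detection dict shape, message format, and channel names are illustrative):

```python
def make_alert(detection, channel="email"):
    """Build a human-readable alert payload for a severe threat.

    `detection` is assumed to carry a class "label" and an (x1, y1, x2, y2)
    "box"; the message wording and channel names are illustrative.
    """
    x1, y1, x2, y2 = detection["box"]
    size = (x2 - x1) * (y2 - y1)
    return {
        "channel": channel,   # e.g. "email" or "push", routed by the caller
        "message": (
            f"Threat detected: {detection['label']} at "
            f"({x1}, {y1})-({x2}, {y2}), size {size} px"
        ),
    }
```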