Vyorius Test (Computer Vision Intern)

The Vyorius Test involves using a zero-shot vision model to recognize custom object categories from real-time or pre-recorded video, ensuring that none of the detected objects are from the COCO dataset. The project requires implementation in Python with OpenCV and PyTorch, and includes tasks such as displaying annotated video with bounding boxes and confidence scores. Deliverables include a GitHub repository with code, a README, a short write-up on the project's workings and challenges, and a video demonstration.

Use a zero-shot vision model to recognize objects from a real-time or pre-recorded video.
The twist: the object categories provided will not be part of the common COCO dataset.
This tests the model's generalization ability in a live setting.

Task Breakdown

1. Accept input from a webcam or a local video file.

2. Use a list of custom object categories as text prompts (examples below).

3. Run each frame through a zero-shot model.

4. Display annotated video with:

   ◦ Bounding boxes (if supported by the model)

   ◦ Labels & confidence scores

5. Ensure none of the detected objects are from the COCO dataset.

Object Categories (Not in COCO):

You must detect objects that are not in COCO. Here are some examples you can use:

• A lightbulb

• A matchstick

• A monitor

• A lion

• A gaming console

You're welcome to add more, as long as they're not in COCO (no chairs, people, dogs, etc.).
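The COCO check from step 5 can be enforced with a simple label filter. A minimal sketch, assuming detections arrive as `(label, score)` pairs; the `COCO_CLASSES` set below is only a small excerpt of the full 80-class list, and `filter_non_coco` is this sketch's own helper name:

```python
# Excerpt of COCO class names (the real list has 80 entries); any detection
# whose label matches one of these is rejected.
COCO_CLASSES = {
    "person", "bicycle", "car", "dog", "cat", "chair", "tv",
    "bottle", "laptop", "cell phone", "book",
}


def filter_non_coco(detections):
    """Keep only detections whose label is not a COCO class.

    Comparison is case-insensitive and ignores leading articles,
    so "A Dog" is still rejected.
    """
    def norm(label):
        label = label.lower().strip()
        for article in ("a ", "an ", "the "):
            if label.startswith(article):
                label = label[len(article):]
        return label

    return [d for d in detections if norm(d[0]) not in COCO_CLASSES]
```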

Technical Requirements

• Must be implemented in Python

• Use OpenCV for video input

• Use PyTorch and pre-trained zero-shot models like CLIP or OWL-ViT

• Write clean, modular, and well-commented code

• Either display results in a live window or print predictions to the console
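With OWL-ViT, single-frame detection might look like the sketch below. It assumes the `transformers`, `torch`, and `Pillow` packages and the public `google/owlvit-base-patch32` checkpoint; the `detect_owlvit` name and the 0.1 score threshold are this sketch's own choices. In a real pipeline the processor and model would be loaded once, not per frame.

```python
# Prompts taken from the example categories above.
PROMPTS = ["a lightbulb", "a matchstick", "a monitor", "a lion",
           "a gaming console"]


def detect_owlvit(frame_bgr, prompts=PROMPTS, threshold=0.1):
    """Run OWL-ViT zero-shot detection on one BGR frame.

    Returns a list of (label, score, [x1, y1, x2, y2]) tuples.
    Heavy imports are deferred so the module loads without torch installed.
    """
    import torch
    from PIL import Image
    from transformers import OwlViTProcessor, OwlViTForObjectDetection

    processor = OwlViTProcessor.from_pretrained("google/owlvit-base-patch32")
    model = OwlViTForObjectDetection.from_pretrained(
        "google/owlvit-base-patch32")

    image = Image.fromarray(frame_bgr[..., ::-1].copy())  # BGR -> RGB
    inputs = processor(text=[prompts], images=image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    target_sizes = torch.tensor([image.size[::-1]])  # (height, width)
    results = processor.post_process_object_detection(
        outputs, threshold=threshold, target_sizes=target_sizes)[0]
    return [(prompts[int(label)], float(score), [int(v) for v in box])
            for score, label, box in zip(results["scores"],
                                         results["labels"],
                                         results["boxes"])]
```

A CLIP-based variant would instead score image crops against the text prompts, since CLIP alone does not localize objects; OWL-ViT is the simpler fit for bounding boxes.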


Bonus Points

• Live prompt editing (change detection classes during runtime)

• Frame rate optimization (>=10 FPS)

• Logging predictions to a file (JSON or CSV)

• Using ONNX or TorchScript to accelerate inference

• Visualize detections with a minimal dashboard or UI
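For the logging bonus, one line of JSON per frame is easy to append and to parse later. A minimal sketch; the `log_predictions` name and the record layout are this sketch's own choices, and detections are assumed to be `(label, score, [x1, y1, x2, y2])` tuples:

```python
import json
import time


def log_predictions(path, frame_idx, detections):
    """Append one JSON line per frame: timestamp, frame index, detections."""
    record = {
        "time": time.time(),
        "frame": frame_idx,
        "detections": [
            {"label": label, "score": round(float(score), 4), "box": box}
            for label, score, box in detections
        ],
    }
    with open(path, "a") as f:
        f.write(json.dumps(record) + "\n")
```

The same records could be flattened to CSV rows (one row per detection) if a spreadsheet-friendly format is preferred.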

Deliverables

• GitHub repo or zipped folder

• Code + README with:

   ◦ Setup instructions

   ◦ Model download/usage steps

• Short write-up (1–2 paragraphs) on:

   ◦ How it works

   ◦ Challenges faced

   ◦ What could be improved or added next

• A video demonstration of the above
