
Bangabandhu Sheikh Mujibur Rahman
Science and Technology University, Gopalganj

Project Report
Course Code: CSE278
VisionSense: Real-Time Object Recognition on
Android
By
Student Name: Bondhon Das
ID: 20CSE016
Session: 2020-2021

Department of Computer Science and Engineering


Bangabandhu Sheikh Mujibur Rahman Science and
Technology University
VisionSense: Real-Time Object Recognition on Android

(This report is submitted in fulfilment of the requirements for the project
of “Second Year Second Semester” in Computer Science and Engineering.)

By

Bondhon Das
ID: 20CSE016
(Second Year Second Semester)
Session: 2020-2021

Supervised By

Abu Bakar Muhammad Abdullah


Assistant Professor

Department of Computer Science and Engineering


Bangabandhu Sheikh Mujibur Rahman Science and Technology
University
Declaration

The project work entitled “VisionSense: Real-Time Object Recognition on Android” has
been carried out in the Department of Computer Science and Engineering, Bangabandhu
Sheikh Mujibur Rahman Science and Technology University; it is original and conforms
to the regulations of this University.
I understand the University’s policy on plagiarism and declare that no part of this project has
been copied from other sources or been previously submitted elsewhere for the award of any
degree or diploma.

Signature of the Candidate Signature of the Supervisor


Bondhon Das Abu Bakar Muhammad Abdullah
ID: 20CSE016 Assistant Professor
Date: 12.05.2024
Contents
Abstract

1 Introduction
1.1 Project Objectives
1.2 Project Overview
1.3 Scope

2 Methodology
2.1 Camera Integration
2.2 TensorFlow Lite Model
2.3 Object Detection
2.4 Visualization
2.5 Real-Time Rendering

3 Implementation
3.1 Permissions
3.2 Camera Initialization
3.3 Model Loading
3.4 Real-Time Inference
3.5 Annotation Overlay
3.6 Display

4 Results
4.1 Performance
4.2 Accuracy
4.3 Application Screenshots

5 Conclusion

6 References
Abstract
The project “VisionSense: Real-Time Object Recognition on Android” leverages deep learning
and mobile computing to enable real-time object detection on smartphones. It integrates the
efficient and accurate SsdMobilenetV1 model using TensorFlow Lite, ensuring real-time perfor-
mance without compromising accuracy. The user-friendly interface provides instant feedback
on detected objects, enhancing user interaction. The project’s practical applicability spans
augmented reality, image recognition, and security systems, offering innovative solutions to
real-world challenges. By democratizing advanced computer vision capabilities on Android
smartphones, the project empowers users and opens new avenues for intelligent applications.
In summary, “VisionSense” marks a significant step forward in mobile computing and AI,
redefining object recognition possibilities on Android with its efficiency, accuracy, and practicality.

1 Introduction
This section introduces the VisionSense project, outlining its objectives, overall design, and scope.

1.1 Project Objectives


The VisionSense project aims to develop a sophisticated Android application capable of real-
time object recognition using the device’s camera. Leveraging machine learning algorithms,
specifically TensorFlow Lite, the application can detect objects in real-time, annotate them
with bounding boxes and labels, and display the results to the user.

1.2 Project Overview


This report provides a detailed overview of the VisionSense project, including its objectives,
architecture, functionality, implementation details, challenges faced, and future prospects.

1.3 Scope
In recent years, the field of computer vision has witnessed significant advancements, particularly
in the domain of object recognition. With the proliferation of smartphones equipped with high-
performance processors, cameras, and machine learning frameworks, there is immense potential
to bring these capabilities to handheld devices. VisionSense seeks to harness this potential by
developing a real-time object recognition system for Android smartphones. By enabling users
to detect and classify objects in real-time, VisionSense opens up new possibilities for intelligent
applications across diverse domains.

2 Methodology
The core methodology of VisionSense involves several key components:

2.1 Camera Integration


VisionSense integrates with the Android camera API to capture live video frames from the
device’s camera. The Camera2 API provides ways to query the available extensions, configure
an extension camera session, and communicate with the Camera Extensions OEM library,
which allows the application to use extensions such as Night, HDR, Auto, Bokeh, or Face Retouch.

2.2 TensorFlow Lite Model
The SsdMobilenetV1 model is loaded into the application using TensorFlow Lite, enabling
efficient inference on mobile devices. TensorFlow Lite is specially optimized for on-device
machine learning (Edge ML), making it suitable for deployment to resource-constrained edge
devices. Edge intelligence, the ability to move deep learning tasks (object detection, image
recognition, etc.) from the cloud onto the device itself, reduces latency and keeps user data local.

2.3 Object Detection


Each frame captured from the camera is processed using the loaded model to detect objects
within the scene. Given an image or a video stream, an object detection model can identify
which of a known set of objects might be present and provide information about their positions
within the image.
The standard SSD MobileNet model, trained on the COCO dataset, can detect 80 object
categories, and these existing pretrained models can be used inside Android applications for
custom use cases to build smart mobile applications. The categories are:
1. Person 21. Elephant 41. Wine glass 61. Dining table
2. Bicycle 22. Bear 42. Cup 62. Toilet
3. Car 23. Zebra 43. Fork 63. TV
4. Motorcycle 24. Giraffe 44. Knife 64. Laptop
5. Airplane 25. Backpack 45. Spoon 65. Mouse
6. Bus 26. Umbrella 46. Bowl 66. Remote
7. Train 27. Handbag 47. Banana 67. Keyboard
8. Truck 28. Tie 48. Apple 68. Cell phone
9. Boat 29. Suitcase 49. Sandwich 69. Microwave
10. Traffic light 30. Frisbee 50. Orange 70. Oven
11. Fire hydrant 31. Skis 51. Broccoli 71. Toaster
12. Stop sign 32. Snowboard 52. Carrot 72. Sink
13. Parking meter 33. Sports ball 53. Hot dog 73. Refrigerator
14. Bench 34. Kite 54. Pizza 74. Book
15. Bird 35. Baseball bat 55. Donut 75. Clock
16. Cat 36. Baseball glove 56. Cake 76. Vase
17. Dog 37. Skateboard 57. Chair 77. Scissors
18. Horse 38. Surfboard 58. Couch 78. Teddy bear
19. Sheep 39. Tennis racket 59. Potted plant 79. Hair drier
20. Cow 40. Bottle 60. Bed 80. Toothbrush
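This report does not include the application’s source code, but the post-processing step that follows inference can be sketched in plain Python. The class IDs, scores, and the small label map below are illustrative stand-ins, not the model’s actual output tensors:

```python
# Illustrative sketch of SSD-style detector post-processing: keep detections
# whose confidence exceeds a threshold and map numeric class IDs to labels.
# The label map here is a small hypothetical subset of the full COCO list.

COCO_LABELS = {1: "person", 2: "bicycle", 3: "car", 44: "bottle", 64: "laptop"}

def filter_detections(class_ids, scores, threshold=0.5, labels=COCO_LABELS):
    """Return (label, score) pairs for detections above the threshold."""
    results = []
    for class_id, score in zip(class_ids, scores):
        if score >= threshold:
            results.append((labels.get(class_id, "unknown"), score))
    return results

# Example: three raw detections, one of which falls below the threshold.
print(filter_detections(class_ids=[1, 3, 44], scores=[0.91, 0.32, 0.75]))
# [('person', 0.91), ('bottle', 0.75)]
```

In the Android application, the same filtering is performed on the detector’s output before any annotations are drawn.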

2.4 Visualization
Detected objects are visually represented on the live video feed using bounding boxes and
corresponding labels.
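SSD-style models typically return bounding boxes in normalized [ymin, xmin, ymax, xmax] form, which must be scaled to the preview’s pixel dimensions before the boxes can be drawn. A minimal Python sketch of this conversion (the box and frame sizes are hypothetical, not taken from the application):

```python
# Illustrative sketch: convert a normalized [ymin, xmin, ymax, xmax] box
# (values in 0..1) into pixel coordinates for drawing on a video frame.

def to_pixel_box(norm_box, frame_width, frame_height):
    """Return (left, top, right, bottom) pixel coordinates for a frame."""
    ymin, xmin, ymax, xmax = norm_box
    return (round(xmin * frame_width), round(ymin * frame_height),
            round(xmax * frame_width), round(ymax * frame_height))

# A box covering the central quarter of a 640x480 preview frame:
print(to_pixel_box([0.25, 0.25, 0.75, 0.75], 640, 480))
# (160, 120, 480, 360)
```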

2.5 Real-Time Rendering
The processed video stream with overlaid annotations is rendered in real time on the device’s
screen.

3 Implementation
The implementation section provides an overview of the steps involved in developing the Vi-
sionSense application.

3.1 Permissions
The application requests camera permission from the user to access the device’s camera
hardware. The Android framework supports capturing images and video through the
android.hardware.camera2 API or a camera Intent.

3.2 Camera Initialization


Upon permission approval, VisionSense initializes the camera hardware and sets up a camera
capture session.

3.3 Model Loading


The SsdMobilenetV1 model and label file are loaded into memory using TensorFlow Lite.

3.4 Real-Time Inference


As each frame becomes available from the camera, VisionSense performs object detection in-
ference using the loaded model. Real-time inference refers to the process of running predictions
or making decisions using a machine learning model with minimal delay, typically within mil-
liseconds or microseconds.
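The real-time constraint can be made concrete with a small back-of-the-envelope sketch: at a given camera frame rate, each frame has a fixed time budget, and the average inference time must fit within it. A minimal Python illustration (the timing values below are hypothetical, not measurements from VisionSense):

```python
# Illustrative sketch: at a target frame rate, each frame has a budget of
# 1000 / fps milliseconds; if average inference time exceeds it, frames
# must be dropped or the input downscaled to stay real-time.

def frame_budget_ms(fps):
    """Per-frame time budget in milliseconds for a target frame rate."""
    return 1000.0 / fps

def meets_realtime(inference_times_ms, fps=30):
    """True if the average measured inference time fits the frame budget."""
    avg = sum(inference_times_ms) / len(inference_times_ms)
    return avg <= frame_budget_ms(fps)

# 30 fps leaves roughly 33.3 ms per frame; these sample timings fit.
print(meets_realtime([22.0, 25.5, 28.1], fps=30))
# True
```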

3.5 Annotation Overlay


Detected objects are annotated with bounding boxes and labels, which are overlaid onto the
live video feed.

3.6 Display
The annotated video stream is displayed on the device’s screen using a TextureView and Im-
ageView combination.

4 Results
VisionSense successfully achieves real-time object recognition on Android devices, providing
users with an intuitive interface for identifying objects in their environment. The application
demonstrates high performance and accuracy in detecting various objects, making it suitable
for a wide range of practical applications. Representative screenshots are shown in Section 4.3.

4.1 Performance
In terms of performance, VisionSense compares favourably with existing object recognition
applications on Android. Through efficient implementation and optimization techniques,
VisionSense achieves real-time object detection with minimal latency. The application utilizes
the device’s hardware resources effectively, ensuring a smooth and responsive user experience
even on lower-end devices.

4.2 Accuracy
The accuracy of VisionSense’s object recognition capabilities is commendable. Leveraging state-
of-the-art machine learning models, VisionSense consistently achieves high detection accuracy
across various object categories. The application’s ability to accurately identify objects in
diverse environments enhances its utility for users across different use cases.

4.3 Application Screenshots

(a) Detect Bicycle (b) Detect Book (c) Detect Laptop and Keyboard

5 Conclusion
In conclusion, “VisionSense” represents a significant advancement in the realm of mobile-based
real-time object recognition, demonstrating the fusion of cutting-edge machine learning tech-
niques with the convenience and ubiquity of Android devices. By harnessing the power of
TensorFlow Lite and deploying the SsdMobilenetV1 model directly on mobile hardware, Vi-
sionSense showcases the practicality and efficiency of on-device AI inference. This approach
not only reduces reliance on cloud-based processing but also enhances user privacy by keeping
data localized.
The successful implementation of VisionSense underscores its potential to revolutionize var-
ious domains, including accessibility, augmented reality, and computer vision. In the realm of
accessibility, VisionSense has the capacity to empower visually impaired individuals by pro-
viding them with instant object recognition capabilities, thereby enhancing their independence
and quality of life. Moreover, in augmented reality applications, VisionSense can serve as a

cornerstone for creating immersive experiences that seamlessly integrate virtual and real-world
elements, opening up new avenues for entertainment, education, and commerce.

Furthermore, VisionSense holds immense promise in the field of computer vision, offering a
powerful tool for tasks such as object tracking, scene understanding, and automated content
tagging. Its ability to perform real-time inference directly on Android devices enables applica-
tions to respond swiftly to dynamic environments, making it suitable for a wide range of use
cases, from industrial automation to interactive gaming.

Looking ahead, VisionSense is poised to inspire further innovation in mobile-based AI
applications, spurring the development of even more sophisticated models and intelligent systems.
As the capabilities of mobile hardware continue to evolve, VisionSense stands at the forefront
of a new era in which AI-driven solutions are seamlessly integrated into everyday mobile ex-
periences, enriching lives and transforming industries. With its robust methodology, tangible
outcomes, and far-reaching implications, VisionSense represents a paradigm shift in how we
perceive and interact with technology, setting the stage for a future defined by intelligent,
responsive, and accessible mobile applications.

6 References
• https://developer.android.com/media/camera/camera2
• https://www.tensorflow.org/lite/android/tutorials/object_detection
