
Computer Vision

Primary Tasks: There are primarily three tasks that computer vision accomplishes:

1. Semantic Segmentation (Image Classification)
2. Classification + Localization
3. Object Detection
Semantic Segmentation
Semantic segmentation is also referred to here as image classification. It is a process in computer vision where an image is classified according to its visual content. Basically, a set of classes (objects to identify in images) is defined, and a model is trained to recognize them with the help of labelled example photos.
In simple terms, it takes an image as input and outputs a class, i.e. cat, dog, etc., or a probability for each class, from which the one with the highest probability is most likely to be correct. For humans, this ability comes naturally and effortlessly, but for machines it is a fairly complicated process.
• Image Classification: Predict the type or class of an object in an image.
• Input: An image with a single object, such as a photograph.
• Output: A class label (e.g. one or more integers that are mapped to class labels).
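The "probability of classes" idea above boils down to picking the class with the highest predicted probability. The sketch below uses made-up class names and scores, not real model output:

```python
# Hypothetical probabilities, as a trained classifier might output them.
class_probs = {"cat": 0.82, "dog": 0.15, "bird": 0.03}

def classify(probs):
    """Return the class label with the highest predicted probability."""
    return max(probs, key=probs.get)

print(classify(class_probs))  # -> cat
```

A real model would produce these scores from the image's pixels; only the final argmax step is shown here.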
Classification and Localization
Once the object is classified and labelled, the localization task is invoked, which puts a bounding box around the object in the picture. The term 'localization' refers to where the object is in the image. Say we have a dog in an image: the algorithm predicts the class and draws a bounding box around the object.
• Object Localization: Locate the presence of objects in an image and indicate their location with a bounding box.
• Input: An image with one or more objects, such as a photograph.
• Output: One or more bounding boxes (e.g. defined by a point, width, and height).
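A bounding box "defined by a point, width, and height" can be held in a small data structure. This is a minimal sketch; the field names and the top-left-corner convention are assumptions, not a standard:

```python
from dataclasses import dataclass

@dataclass
class BoundingBox:
    x: int       # x-coordinate of the top-left corner
    y: int       # y-coordinate of the top-left corner
    width: int
    height: int

    def area(self):
        """Area of the box in pixels."""
        return self.width * self.height

box = BoundingBox(x=50, y=30, width=200, height=150)
print(box.area())  # -> 30000
```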
Object Detection
• When human beings see a video or an image, they immediately identify the objects present in it. This intelligence can be duplicated using a computer. If we have multiple objects in the image, the algorithm will identify all of them and localise (put a bounding box around) each one. You will therefore have multiple bounding boxes and labels around the objects.
• Object Detection: Locate the presence of objects with a bounding box, and the types or classes of the located objects in an image.
• Input: An image with one or more objects, such as a photograph.
• Output: One or more bounding boxes (e.g. defined by a point, width, and height), and a class label for each bounding box.
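Since detection pairs every bounding box with a class label, its output is naturally a list of (label, box) pairs. The boxes and labels below are made up for illustration:

```python
# Hypothetical detections: each entry is (class_label, (x, y, width, height)).
detections = [
    ("dog", (40, 60, 180, 120)),
    ("cat", (300, 80, 90, 110)),
]

def summarize(dets):
    """Return one 'label at (x, y)' string per detected object."""
    return [f"{label} at ({x}, {y})" for label, (x, y, w, h) in dets]

print(summarize(detections))  # -> ['dog at (40, 60)', 'cat at (300, 80)']
```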


Pixel: The word "pixel" means a picture element. Every photograph, in digital form, is made up of pixels. They are the smallest units of information that make up a picture. Usually round or square, they are typically arranged in a 2-dimensional grid.
Resolution: The number of pixels in an image is called the resolution.
Another convention is to express the number of pixels as a single number, as in a 5-megapixel camera (a megapixel is a million pixels). This means that the pixels along the width multiplied by the pixels along the height of the image taken by the camera equal 5 million pixels. In the case of a 1280×1024 monitor, the resolution could also be expressed as 1280 × 1024 = 1,310,720 pixels, or about 1.31 megapixels.
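The megapixel arithmetic above is simply width times height divided by one million:

```python
def megapixels(width, height):
    """Resolution in megapixels: total pixel count divided by one million."""
    return width * height / 1_000_000

print(megapixels(1280, 1024))  # -> 1.31072
```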
Each pixel of such an image uses 1 byte, which is equivalent to 8 bits of data. Since each bit can take two possible values, 8 bits can represent 2^8 = 256 possible values, which start at 0 and end at 255.
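The 8-bit value range can be checked directly, since each additional bit doubles the number of combinations:

```python
bits = 8
num_values = 2 ** bits  # each bit doubles the number of combinations
print(num_values)                        # -> 256
print(f"0 to {num_values - 1}")          # -> 0 to 255
```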

Grayscale Images
Grayscale images are images which have a range of shades of gray without apparent colour. The darkest possible shade is black, which is the total absence of colour, or a pixel value of zero. The lightest possible shade is white, which is the total presence of colour, or a pixel value of 255. Intermediate shades of gray are represented by equal brightness levels of the three primary colours.
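The claim that a gray shade has equal brightness in all three primary colours can be sketched as a simple check; the function name here is an assumption for illustration:

```python
def is_gray(r, g, b):
    """A pixel is a shade of gray when its R, G, and B values are equal."""
    return r == g == b

print(is_gray(128, 128, 128))  # -> True  (mid gray)
print(is_gray(255, 0, 0))      # -> False (pure red)
```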
Coloured Images
All the images that we see around us are coloured images. These images are made up of the three primary colours: red, green, and blue. All the colours that are present can be made by combining different intensities of red, green, and blue.
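Combining different intensities of the three channels can be written as an (R, G, B) triple. A few familiar combinations, assuming 8-bit channels (0 to 255 each):

```python
# Each colour is a triple of 8-bit intensities (red, green, blue).
colours = {
    "red":    (255, 0, 0),
    "yellow": (255, 255, 0),    # full red + full green
    "white":  (255, 255, 255),  # full intensity in all three channels
    "black":  (0, 0, 0),        # zero intensity in all three channels
}
print(colours["yellow"])  # -> (255, 255, 0)
```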
