0% found this document useful (0 votes)
24 views3 pages

PartA Unit5 Ass01

Uploaded by

jomepec352
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views3 pages

PartA Unit5 Ass01

Uploaded by

jomepec352
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 3

PartA_Unit5_Computer Vision

Q1. Define Computer Vision.

Computer Vision is a domain of Artificial Intelligence that enables machines to see


through images or visual data, process and analyze them using algorithms and
methods in order to understand and analyze actual phenomena with images.

Q2. Write about any four applications of computer vision.

Four applications of computer vision mentioned in the text are:

1. Facial Recognition: Used in smart homes and cities for security, guest
recognition, and attendance systems in schools.

2. Self-Driving Cars: Fundamental technology for developing autonomous vehicles,


involving object identification, navigation, and environment monitoring.

3. Medical Imaging: Helps create and analyze images, converting 2D scans into
interactive 3D models for detailed understanding of patients' health conditions.

4. Google Translate App: Uses optical character recognition and augmented reality
to translate text in images to different languages in real-time.

Q3. How is Computer Vision being applied in the field of medical imaging? What
advantages does it offer to medical professionals?

Computer Vision in medical imaging helps create and analyze images, converting 2D
scans into interactive 3D models. This enables medical professionals to gain a
detailed understanding of a patient's health condition.

Q4. What role does Computer Vision play in the development of self-driving cars?
What specific tasks does it perform in this context?

In self-driving cars, Computer Vision is used for identifying objects, getting


navigational routes, and environment monitoring. It's fundamental for developing
autonomous vehicles.

Q5. Describe the process of how the Google Translate app uses Computer Vision to
translate text in images. What technologies are involved in this application?

The Google Translate app uses optical character recognition to see and interpret
text in images. It then uses augmented reality to overlay an accurate translation
of the text in the user's preferred language.

Q6. How is computer vision useful in the Retail field?

Computer Vision is useful in the Retail field in several ways:

1. Customer Movement Tracking: Retailers can track customers' movements through


stores, analyze navigational routes, and detect walking patterns.

2. Inventory Management: Through security camera image analysis, Computer Vision


algorithms can generate accurate estimates of available items in the store.

3. Shelf Space Optimization: It can analyze the use of shelf space to identify
suboptimal configurations and suggest better item placement.
Q7. What are the four main Computer Vision tasks and how do they differ from each
other?

The four main Computer Vision tasks are:


Classification: Assigning a label to an input image from a fixed set of categories.
Classification + Localization: Identifying what object is in the image and its
location (for single objects).
Object Detection: Finding instances of real-world objects in images or videos (for
multiple objects).
Instance Segmentation: Detecting object instances, categorizing them, and labeling
each pixel accordingly.

Q8. Explain the concept of instance segmentation in Computer Vision. How does it
differ from simple object detection?

Instance segmentation involves detecting object instances, categorizing them, and


then labeling each pixel based on that categorization. It goes beyond object
detection by providing pixel-level information about each detected object.

Q9. What is the difference between image classification and object detection in
Computer Vision?

Image classification assigns a single label to an entire image, while object


detection identifies and locates multiple objects within an image.

Q10. Define the terms pixel and resolution.

Pixel: A pixel, short for "picture element," is the smallest unit of information
that makes up a digital image. Pixels are typically round or square and are
arranged in a 2-dimensional grid.

Resolution: Resolution refers to the number of pixels in an image. It can be


expressed in two ways:
1. As width by height (e.g., 1280�1024), indicating the number of pixels
horizontally and vertically.
2. As a single number (e.g., 5 megapixels), representing the total number of pixels
in the image.

Q11. What is a grayscale image? What is the darkest and lightest possible color in
a grayscale image? How is its size defined?

A grayscale image is an image that has a range of shades of gray without apparent
color.

The darkest possible shade in a grayscale image is black, which represents the
total absence of color or a pixel value of zero.

The lightest possible shade is white, which represents the total presence of color
or a pixel value of 255.

The size of a grayscale image is defined as the Height x Width of that image. Each
pixel in a grayscale image is represented by 1 byte, and the image consists of a
single plane of a 2D array of pixels.

Q12. What is an RGB image?

An RGB image is a color image made up of three primary colors: Red, Green, and
Blue. Here are the key points about RGB images:
1. All colors in an RGB image can be created by combining different intensities of
red, green, and blue.

2. Computers store RGB images in the form of three different channels: the R
channel, G channel, and B channel.

3. Each channel is a plane of pixels, with each pixel value ranging from 0 to 255.

4. When the three planes are combined, they form a color image.

5. In an RGB image, each pixel has a set of three different values (one for each
channel) which together determine the color of that particular pixel.

6. The size of an RGB image is three times that of a grayscale image of the same
dimensions, as it has three channels instead of one.

Q13. How does pixel value work in digital images? What is the range of possible
values for a pixel in a typical byte image?

Each of the pixels that represents an image stored inside a computer has a pixel
value which describes how bright that pixel is, and/or what colour it should be.
The most common pixel format is the byte image

In a typical byte image, each pixel is represented by an 8-bit integer. This gives
a range of possible values from 0 to 255. Usually, 0 represents black (no color)
and 255 represents white (full color).

Q14. Why do we use 255 as the maximum value for pixel intensity in 8-bit images?
Explain the relationship between bits and pixel values.

We use 255 because an 8-bit system can represent 256 different values (2^8 = 256),
ranging from 0 to 255. Each bit in a computer system can have either a zero or a
one, so 8 bits together can have 256 possible combinations.

Q15. Describe how RGB values combine to create different colors. How would you
represent pure red, pure green, and pure blue in the RGB color model?

In the RGB color model, colors are created by combining different intensities of
red, green, and blue. Each color channel has a value from 0 to 255.
- Pure red would be represented as (255, 0, 0)
- Pure green as (0, 255, 0)
- Pure blue as (0, 0, 255)

You might also like