PartA Unit5 Ass01
PartA Unit5 Ass01
1. Facial Recognition: Used in smart homes and cities for security, guest
recognition, and attendance systems in schools.
3. Medical Imaging: Helps create and analyze images, converting 2D scans into
interactive 3D models for detailed understanding of patients' health conditions.
4. Google Translate App: Uses optical character recognition and augmented reality
to translate text in images to different languages in real-time.
Q3. How is Computer Vision being applied in the field of medical imaging? What
advantages does it offer to medical professionals?
Computer Vision in medical imaging helps create and analyze images, converting 2D
scans into interactive 3D models. This enables medical professionals to gain a
detailed understanding of a patient's health condition.
Q4. What role does Computer Vision play in the development of self-driving cars?
What specific tasks does it perform in this context?
Q5. Describe the process of how the Google Translate app uses Computer Vision to
translate text in images. What technologies are involved in this application?
The Google Translate app uses optical character recognition to see and interpret
text in images. It then uses augmented reality to overlay an accurate translation
of the text in the user's preferred language.
3. Shelf Space Optimization: It can analyze the use of shelf space to identify
suboptimal configurations and suggest better item placement.
Q7. What are the four main Computer Vision tasks and how do they differ from each
other?
Q8. Explain the concept of instance segmentation in Computer Vision. How does it
differ from simple object detection?
Q9. What is the difference between image classification and object detection in
Computer Vision?
Pixel: A pixel, short for "picture element," is the smallest unit of information
that makes up a digital image. Pixels are typically round or square and are
arranged in a 2-dimensional grid.
Q11. What is a grayscale image? What is the darkest and lightest possible color in
a grayscale image? How is its size defined?
A grayscale image is an image that has a range of shades of gray without apparent
color.
The darkest possible shade in a grayscale image is black, which represents the
total absence of color or a pixel value of zero.
The lightest possible shade is white, which represents the total presence of color
or a pixel value of 255.
The size of a grayscale image is defined as the Height x Width of that image. Each
pixel in a grayscale image is represented by 1 byte, and the image consists of a
single plane of a 2D array of pixels.
An RGB image is a color image made up of three primary colors: Red, Green, and
Blue. Here are the key points about RGB images:
1. All colors in an RGB image can be created by combining different intensities of
red, green, and blue.
2. Computers store RGB images in the form of three different channels: the R
channel, G channel, and B channel.
3. Each channel is a plane of pixels, with each pixel value ranging from 0 to 255.
4. When the three planes are combined, they form a color image.
5. In an RGB image, each pixel has a set of three different values (one for each
channel) which together determine the color of that particular pixel.
6. The size of an RGB image is three times that of a grayscale image of the same
dimensions, as it has three channels instead of one.
Q13. How does pixel value work in digital images? What is the range of possible
values for a pixel in a typical byte image?
Each of the pixels that represents an image stored inside a computer has a pixel
value which describes how bright that pixel is, and/or what colour it should be.
The most common pixel format is the byte image
In a typical byte image, each pixel is represented by an 8-bit integer. This gives
a range of possible values from 0 to 255. Usually, 0 represents black (no color)
and 255 represents white (full color).
Q14. Why do we use 255 as the maximum value for pixel intensity in 8-bit images?
Explain the relationship between bits and pixel values.
We use 255 because an 8-bit system can represent 256 different values (2^8 = 256),
ranging from 0 to 255. Each bit in a computer system can have either a zero or a
one, so 8 bits together can have 256 possible combinations.
Q15. Describe how RGB values combine to create different colors. How would you
represent pure red, pure green, and pure blue in the RGB color model?
In the RGB color model, colors are created by combining different intensities of
red, green, and blue. Each color channel has a value from 0 to 255.
- Pure red would be represented as (255, 0, 0)
- Pure green as (0, 255, 0)
- Pure blue as (0, 0, 255)