Ai CV Notes
Ai CV Notes
Computer Vision
Computer Vision
Facial Recognition
The most frequently used technology is smartphones. It is a technology to
remember and verify a person, object, etc from the visuals from the given pre-defined
data. Such kinds of mechanics are often used for security and safety purposes.
For eg
: Face security lock-in devices and traffic cameras are some examples using facial
Recognition.
Face Filters
Modern days social media apps like Snapchat and Instagram use such kinds of
technology that extract facial landmarks and process them using AI to get the best
result.
To search data, Google uses computer vision for capturing and analyzing different
features of the input image to the database of images and then gives us the search.
For eg:Companies like Tesla are now interested in developing self-driving cars.
Medical Imaging
For the last decades, computer vision medical imaging application has been a
trustworthy help for physicians and doctors. It creates and analyzes images and helps
doctors with their interpretation. The application is used to read and convert 2D scan
images into interactive 3D models.
Google Translate App
To read signs written in a foreign language, all you have to do is point the
camera on your phone at the text, and the Google Translate software will very
immediately translate them into the language of your choice. This is a useful
application that makes use of Computer Vision, utilizing optical character recognition
to view the image and augmented reality to overlay an accurate translation.
Computer Vision Tasks
The Application of the computer is performed by certain tasks on the data or
input provided by the user so it can process and analyze the situation and predict the
outcome.
I. Single object
1) Image Classification
Image Classification is the task of identifying an object in the input
image and label from a predefined category.
2) Classification + Localization
This is the task which involves both processes of identifying what object is
present in the image and at the same time identifying at what location that object is
present in that image. It is used only for single objects.
Basics of Images
The word "pixel" means a picture element.
Pixels
● Pixels are the fundamental element of a photograph.
● They are the smallest unit of information that make up a picture.
● They are typically arranged in a 2-dimensional grid.
● In general terms, The more pixels you have, the more closely the image
resembles the original.
Resolution
Pixel value
Grayscale Images
● Grayscale images are images which have a range of shades of gray without
apparent color.
● The lightest shade is white total presence of color or 255 and darkest color is
black at 0.
● Intermediate shades of gray have equal brightness levels of the three primary
colors RGB.
● The computers store the images we see in the form of these numbers.
RBG Images
● All the coloured images are made up of three primary colors Red, Green and
Blue.
● All the other colors are formed by using these primary colors at different
proportions.
● Computer stores RGB Images in three different channels called the R channel,
G channel and the B channel.