0% found this document useful (0 votes)
34 views14 pages

Computer Vision

Uploaded by

firaolgonde96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views14 pages

Computer Vision

Uploaded by

firaolgonde96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

Computer Vision

Group 2 Name ID
Name ID 5,
1, Dereje Shibiru 2558/14 6,
2, 7,
3, 8,
4, 9,
5, 10.
Computer Vision
Computer vision
is a subfield of artificial intelligence that deals with acquiring, processing,
analyzing, and making sense of visual data such as digital images and
videos.
it enables computers and systems to derive meaningful information from
digital images, videos, and other visual inputs — and take actions or make
recommendations based on that information.
 If AI enables computers to think, computer vision enables them to see,
observe or analyzing, and understand.
History of Computer Vision

• The evolution of computer vision is classified as follows:

• 1959: The first experiment with computer vision was initiated in 1959, where
they showed a cat as an array of images. Initially, they found that the system
reacts first to hard edges or lines, and scientifically, this means that image
processing begins with simple shapes such as straight edges.

• 1960: In 1960, artificial intelligence was added as a field of academic study to


solve human vision problems.
• 1963: This was another great achievement for scientists when they developed
computers that could transform 2D images into 3-D images.

• 1974: This year, optical character recognition (OCR) and intelligent character
recognition (ICR) technologies were successfully discovered. The OCR has solved the
problem of recognizing text printed in any font or typeface, whereas ICR can decrypt
handwritten text. These inventions are one of the greatest achievements in document
and invoice processing, vehicle number plate recognition, mobile payments, machine
translation, etc.
• 1982: In this year, the algorithm was developed to detect edges, corners, curves, and other shapes.
Further, scientists also developed a network of cells that could recognize patterns.

• 2000: In this year, scientists worked on a study of object recognition.

• 2001: The first real-time face recognition application was developed.

• 2010: The ImageNet data set became available to use with millions of tagged images, which can
be considered the foundation for recent Convolutional Neural Network (CNN) and deep learning
models.

• 2012: CNN has been used as an image recognition technology with a reduced error rate.

• 2014: COCO has also been developed to offer a dataset for object detection and support future
research.
• 2015-2020
• Advancements in real-time object detection with systems like YOLO
(You Only Look Once).
• in facial recognition technology, used in various applications like
unlocking smartphones and surveillance.
• Integration of computer vision in augmented reality (AR) and virtual
reality (VR) systems.
• Use of computer vision in medical imaging for disease diagnosis.
• 2020-2025 (Predicted)
• Further advancements in real-time object detection and image
recognition.
• More sophisticated use of computer vision in autonomous vehicles.
• Increased use of computer vision in healthcare for early disease
detection and treatment.
• Integration of computer vision in more consumer products, like smart
home devices.
How does Computer Vision Work?
• Computer vision systems use algorithms to process visual information.

• The steps typically involve:

 Image Acquisition:
The process begins with image acquisition, which could be a single frame from a
video, a photo from a camera, or even a live feed.

Preprocessing:
The image may need to be converted from RGB to grayscale, normalized, or resized to
prepare for analysis.
Cont...

Feature Extraction:
The system identifies key features in the image that are necessary for
understanding its content.
Recognition:
After feature extraction, the system compares the features to known labels
to recognize objects or patterns.
Interpretation:
The final step involves making sense of the recognized objects, often
involving complex decision-making or learning over time.
Cont
Task Associated with Computer
Vision
 Object CLASSIFICATION : Classifying objects from a predefined set of
classes.
 Object LOCALIZATION : Finding the location of the object within the
image.
 Object DETECTION : Finding the possible present object location and
classify the object.
 Object SEGMENTATION : Finding the pixels belonging to that object.
 Object TRACKING : Analyzing the video/stream, and tracking the object
such as velocity, relative position of the object.
 Optical Character Recognition (OCR) : Identifying printed or hand-written
characters in image.
 Facial Recognition : Identifying face of individual and variety type of facial
expressions.
 Pose estimation : Estimating the object position or orientation of a specific
Computer Vision Applications
• As per the increasing demand for AI and Machine Learning technologies, computer vision also
has a great demand among different sectors. It has a massive impact on different industries,
including retail, security, healthcare, automotive, agriculture, etc. Below are some most popular
applications of computer vision:
• Healthcare: From analyzing X-rays and MRI scans to monitoring patients to detect
symptoms that require care, computer vision is revolutionizing medicine.
•Automotive: Self-driving cars use computer vision to navigate and avoid obstacles.
•Manufacturing: In quality control, computer vision systems identify defects and
irregularities in products on assembly lines with high accuracy.
• Agriculture: Farmers use computer vision to monitor crops and livestock, often using
drones for image capture.
Conclusion

Computer vision is a transformative technology that bridges the gap between


visual data and actionable insight.

It is a cornerstone of modern AI applications, enabling machines to interpret


and make decisions based on visual input.

As the technology continues to mature, we can anticipate even more


innovative applications that will further integrate AI into the fabric of
society.
Reference
• https://www.ibm.com/think/topics/computer-vision
• https://deepai.org/machine-learning-glossary-and-terms/computer-
vision
• https://medium.com/@rajshekhar_k/introduction-to-computer-
vision-7cfdacfc3f2e

You might also like