0% found this document useful (0 votes)

25 views25 pages

Unit 1 Introduction

The document provides an overview of computer vision, detailing its importance in artificial intelligence and its distinction from image processing. It explains the hierarchy of computer vision, including low-level, mid-level, and high-level processes, and how machines interpret visual data through pattern recognition. Additionally, it discusses the evolution of computer vision technology and its applications in understanding and analyzing visual information.

Uploaded by

aryansuthar194

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views25 pages

Unit 1 Introduction

Uploaded by

aryansuthar194

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 25

Unit 1: Introduction

Contents

◉ Computer Vision
◉ Image Processing
◉ Low-level Computer Vision
◉ Mid-level Computer Vision
◉ High-level Computer Vision
◉ Overview of Diverse Computer Vision
Computer Vision
Computer Vision (Cont.)

◉ Computer vision vs human vision

What we see What a computer sees

Computer Vision (Cont.)

◉ Computer vision is one of the most important fields of artificial intelligence (AI) and
computer science engineering that focuses on creating digital systems that can process,
analyze, and make sense of visual data (images or videos) in the same way that humans
do.
◉ The concept of computer vision is based on teaching computers to process an image at a pixel level
and understand it.
◉ Further, it also helps to take appropriate actions and make recommendations based on the extracted
information.
◉ If artificial intelligence enables computer systems to think intelligently, computer vision makes them
capable of seeing, analyzing, and understanding.
◉ The image data can take many forms, such as a video sequence, depth images, views from multiple
cameras, or multi-dimensional data from a medical scanner
Image Processing

How computers see image?

Pixel
[10,250,0]

What Is an Image?
◉ An image is represented by its dimensions (height and width) based on the number of pixels. For
example, if the dimensions of an image are 500 x 400 (width x height), the total number of pixels in
the image is 200000.
◉ This pixel is a point on the image that takes on a specific shade, opacity or color.
Image Processing (Cont.)

◉ Pixel is usually represented in one of the following:

⮚ Grayscale - A pixel is an integer with a value between 0 to 255 (0 is completely black and 255 is
completely white).
⮚ RGB - A pixel is made up of 3 integers between 0 to 255 (the integers represent the intensity of
red, green, and blue).
⮚ RGBA - It is an extension of RGB with an added alpha field, which represents the opacity of the
image.
Image Processing (Cont.)
Image processing is the process of transforming an image into a digital form and performing
certain operations to get some useful information from it.

Enhanced image

Mathematical
Digital
operation or
Image
algorithm

Edge
Image Processing (Cont.)

◉ Image processing requires fixed sequences of operations that are performed at each pixel of an
image.
◉ The image processor performs the first sequence of operations on the image, pixel by pixel. Once this
is fully done, it will begin to perform the second operation, and so on.
◉ The output value of these operations can be computed at any pixel of the image.
◉ Image Processing Techniques
⮚ Image Segmentation
⮚ Color Image Processing
⮚ Image Restoration
⮚ Object Detection
⮚ Morphological Operations
Computer Vision
Why Computer Vision?

◉ Computer vision helps us solve some of the most difficult problems there are in
computer science related to real-time processing and understanding of visual
information such as an image, a video stream, etc. These problems were hard to
solve in the past because we did not, at that time, have the processing power
required to process such data at a fast enough speed. Also, we did not have any
way for our machines to be able to understand what a particular object looked like
and what it should be called.
◉ Because of these issues, even though our machines were becoming quite good at
tasks such as loading, transferring, and displaying data in visual formats like videos
and images, we were not able to build systems that could understand this kind of
data in any meaningful way. Tasks such as figuring out the text contained in an
image or being able to recognize a number in an image looked simple but were
quite hard practically. Even a simple task like detecting the presence of human
faces in a photo or a video was very hard to accomplish and was done after a lot of
research and failed attempts.
Why Computer Vision (Cont.)

◉ Giving machines the ability to understand these kinds of visual images has become
even more important in today’s digital age, where everyone has access to the
Internet and can put any content on any of the online social media platforms.
◉ For example, if someone tries to put some false information in a textual format on
any of these platforms, then most of these platforms are smart enough to either tag
it as unverified or even remove it. However, if the same information is put online as
an image or a video, then these systems, without computer vision, would not be
able to understand its content and would, therefore, have to publish it until
someone reports it.
Computer Vision (Cont.)

How does Computer Vision Work?

◉ Computer vision technology tends to mimic the way the human brain works. But how does our brain
solve visual object recognition? One of the popular hypothesis states that our brains rely on patterns
to decode individual objects. This concept is used to create computer vision systems.
◉ Computer vision algorithms that we use today are based on pattern recognition. We train computers
on a massive amount of visual data—computers process images, label objects on them, and find
patterns in those objects.
◉ Firstly, a vast amount of visual labelled data is provided to machines to train it. This labeled data
enables the machine to analyse different patterns in all the data points and can relate to those labels.
E.g., suppose we provide visual data of millions of dog images. In that case, the computer learns from
this data, analyzes each photo, shape, the distance between each shape, color, etc., and hence
identifies patterns similar to dogs and generates a model. As a result, this computer vision model can
now accurately detect whether the image contains a dog or not for each input image.
Computer Vision (Cont.)

◉ Machines interpret images as a series of pixels, each with their own set of color values.
◉ For example, below is a picture of Abraham Lincoln. Each pixel’s brightness in this image is
represented by a single 8-bit number, ranging from 0 (black) to 255 (white). These numbers are what
software sees when you input an image. This data is provided as an input to the computer vision
algorithm that will be responsible for further analysis and decision making.
Computer Vision v/s Image Processing

◉ Computer vision is quite a different field from image processing, and these two things should not be
considered as being similar. Digital image processing is the process of creating new images from an
existing image. The new images are created using special algorithms designed for achieving a specific
output from an image. This includes tasks such as creating a black and white version of an image,
removing noise from an image, etc. This is similar to digital signal processing. In other words, digital
image processing is used for the generation of new images and does not in any way try to understand
the content of an image, i.e., it has no idea what object an image contains. It only knows how to
convert it from one form to another.
◉ Computer vision, on the other hand, is used for understanding the content of an image or a video. It
deals with extracting useful information out of images, e.g., if an image contains a human face,
whether it was taken during the day or the night, what the objects are there in the image, etc.
Computer vision does not manipulate images or create new ones in any way.
Computer Vision Hierarchy

◉ The continuum from image processing to computer vision can be broken up into low-, mid- and high-
level processes
◉ Low-level vision − It includes processing image for feature extraction.
◉ Intermediate-level vision − It includes object recognition and segmentation
◉ High-level vision − It includes conceptual description of a scene like activity, intention and
behaviour.
Low Level Vision

◉ Set of operations performed on images aiming at enhancing their quality and selecting useful
information, which will be processed by humans or other algorithms.
◉ It is mainly concerned with extracting descriptions from images (that are usually represented as
images themselves). The analysis usually does not know anything about what objects are actually in
the scene, nor where the scene is relative to the observer. There may be multiple, largely independent
descriptions, such as edge fragments, spots, reflectances, line fragments, etc.
◉ For example, if one was looking at an image of a coffee mug on a desk, the low level descriptions
would make explicit where the mug edges were, where specular highlights were on the mug surface,
what the colours on the mug were. As this description is still linked to an image, these descriptions
would apply everywhere in the image, not just to the mug.
◉ Tasks:
⮚ Primitive operations such as image processing to reduce noise, contrast enhancement, and image
sharpening

Image Image
Image processing
Low Level Vision (Cont.)

sharpening

blurring
Low Level Vision (Cont.)

◉ Noise Removal Example

Calculate dirty Background Subtract

Dirty Image background Median Filtering
image’s Noise
background noise from denoised image
noise dirty image
Mid level vision

◉ In the mid-level process, inputs are generally images but its output are generally image attributes
(e.g., edges, contours, identity of individual objects)
◉ Includes extraction of symbolic information from pre-processed images (low-level vision output) and
analysis techniques of the visual characteristics of the objects that are in the images.
◉ It is mainly concerned with extracting descriptions of the scene from the image descriptions extracted
at the low level. The output is usually in some more symbolic form, describing the position and shape
of portions of the scene. The analysis usually does not know anything about what objects are in the
scene, but does use a lot of knowledge of scene shape and how shape appears in an image.
◉ In our coffee mug example, the kinds of descriptions one might expect are 3D position of the edges of
the mug, portions of its surface shape, depth relationships between adjacent surface patches, which
features are moving and where, etc.
◉ Tasks:
⮚ Segmentation(Partitioning an image into regions or objects)
⮚ Description of those objects to reduce them to a form suitable for computer processing.
⮚ Classifications (recognition) of objects
High-level vision

◉ High-level vision is to infer the semantics, for example, object recognition and scene understanding
◉ In this input is attribute and output is understanding
◉ It includes interpretation of the evolving information provided by the middle level vision as well as
directing what middle and low level vision tasks should be performed. Interpretation may include
conceptual description of a scene like activity, intention, and behaviour.
◉ High level vision is concerned mainly with the interpretation of scene in term of the objects in it, and is
usually based on knowledge of specific objects and relationships. The analysis usually involves
symbolic descriptions, although it might make reference to results from the low and middle levels to
verify hypotheses.
◉ Typical results of high level analysis are a naming of objects present in the scene, estimates of their
position, identification of objects that can satisfy a particular function, descriptions of what sorts of
motions are occurring, or summaries of what sort of scene it is (e.g. an office scene).
◉ In the coffee mug example, the results might say that we are looking at a coffee mug, sitting in a desk
at a given position, the mug is half-full, there is nothing else nearby that could be used to hold coffee,
and the desk is cluttered.
High-level vision
Overview
Overview of Computer Vision

◉ Computer vision is a field of artificial intelligence that trains computers to interpret and understand
the visual world. Machines can accurately identify and locate objects then react to what
they “see” using digital images from cameras, videos, and deep learning models.
◉ As computer vision evolved, programming algorithms were created to solve individual challenges.
Machines became better at doing the job of vision recognition with repetition. Over the years, there
has been a huge improvement of deep learning techniques and technology. We now have the ability to
program supercomputers to train themselves, self-improve over time and provide capabilities to
businesses as online applications.
Thank You

FarmBeats Student Kit Build Instructions
No ratings yet
FarmBeats Student Kit Build Instructions
28 pages
Computer Vision Presentation AI
83% (6)
Computer Vision Presentation AI
16 pages
Unit 4_ Image Segmentation
No ratings yet
Unit 4_ Image Segmentation
24 pages
Unit 1
No ratings yet
Unit 1
186 pages
108103174
No ratings yet
108103174
1,559 pages
Lecture Notes
No ratings yet
Lecture Notes
144 pages
Report on Computer Vision
No ratings yet
Report on Computer Vision
33 pages
Image Processing and Computer Vision (Notes)
No ratings yet
Image Processing and Computer Vision (Notes)
64 pages
Computer Vision
No ratings yet
Computer Vision
19 pages
Sample Computer Practical File 12
No ratings yet
Sample Computer Practical File 12
130 pages
CS 474 Lec 01 Introduction
No ratings yet
CS 474 Lec 01 Introduction
69 pages
Agile_Modeling_with_the_UML
100% (1)
Agile_Modeling_with_the_UML
15 pages
Regionalization PDF
No ratings yet
Regionalization PDF
15 pages
C10_AI_COMPUTER VISION (1)
No ratings yet
C10_AI_COMPUTER VISION (1)
40 pages
Image Processing and Computer Vision Both Are Very Exciting Field of Computer Science
No ratings yet
Image Processing and Computer Vision Both Are Very Exciting Field of Computer Science
56 pages
Intro To Quantum Neuoroscience
No ratings yet
Intro To Quantum Neuoroscience
26 pages
What is Computer Vision
No ratings yet
What is Computer Vision
18 pages
Chapter - 2 - Week 4-11 Feb
No ratings yet
Chapter - 2 - Week 4-11 Feb
45 pages
Chapter 1 [CV & IP]
No ratings yet
Chapter 1 [CV & IP]
41 pages
1 Intro to CV
No ratings yet
1 Intro to CV
76 pages
IPCV Unit 01
No ratings yet
IPCV Unit 01
18 pages
Computer Science & Mathematics Major for College_ Mathematics by Slidesgo
No ratings yet
Computer Science & Mathematics Major for College_ Mathematics by Slidesgo
21 pages
E-Notes_2079_Content_Document_20250402084536AM
No ratings yet
E-Notes_2079_Content_Document_20250402084536AM
38 pages
20200829_meterOS_Energy_Data_Hackdays_2020_Pitch
No ratings yet
20200829_meterOS_Energy_Data_Hackdays_2020_Pitch
9 pages
CV-1.1
No ratings yet
CV-1.1
18 pages
Enter The Input Data in Memory Location 6200 AND 6201. Enter The Above Opcodes From 6000. Execute The Program. Result Stored in 6202
No ratings yet
Enter The Input Data in Memory Location 6200 AND 6201. Enter The Above Opcodes From 6000. Execute The Program. Result Stored in 6202
27 pages
B2C Commerce Variation Group Guide
No ratings yet
B2C Commerce Variation Group Guide
18 pages
Computer Vision Class X
No ratings yet
Computer Vision Class X
39 pages
Supplier Quality General Requirements
No ratings yet
Supplier Quality General Requirements
19 pages
502355296-Computer-Vision-Presentation-AI
No ratings yet
502355296-Computer-Vision-Presentation-AI
16 pages
Machine - Learning (Computer Vision)
No ratings yet
Machine - Learning (Computer Vision)
56 pages
Assignment 1 Final
No ratings yet
Assignment 1 Final
52 pages
COMPUTER VISION Intro
No ratings yet
COMPUTER VISION Intro
7 pages
Amplify Your Thumbnails With AI Thumbnail Game Maker
No ratings yet
Amplify Your Thumbnails With AI Thumbnail Game Maker
10 pages
Computer Vision
No ratings yet
Computer Vision
34 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Chapter One
No ratings yet
Chapter One
17 pages
CV (Unit1&2ans)
No ratings yet
CV (Unit1&2ans)
32 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
IT5409 Ch1 Intro
No ratings yet
IT5409 Ch1 Intro
14 pages
AAST-CC312-Fall 21-Lec 11
No ratings yet
AAST-CC312-Fall 21-Lec 11
17 pages
Computer Vision
No ratings yet
Computer Vision
10 pages
CV Unit 1
No ratings yet
CV Unit 1
30 pages
Flutter-PortfolioApp-Development-Report
No ratings yet
Flutter-PortfolioApp-Development-Report
8 pages
Computer Vision Introduction
No ratings yet
Computer Vision Introduction
11 pages
Unit 1 Chapter 1
No ratings yet
Unit 1 Chapter 1
27 pages
Iceberg 3.0 - Auto Signup Process
No ratings yet
Iceberg 3.0 - Auto Signup Process
4 pages
IT5409 Ch1 Intro New Template
No ratings yet
IT5409 Ch1 Intro New Template
14 pages
CCS340 Compressed
No ratings yet
CCS340 Compressed
50 pages
Chapter One
No ratings yet
Chapter One
47 pages
Computer Vision in Aritificial Intelligence
No ratings yet
Computer Vision in Aritificial Intelligence
33 pages
Lec1 - Computer Vision - v1
No ratings yet
Lec1 - Computer Vision - v1
38 pages
Practical No 02 Gad 22034
No ratings yet
Practical No 02 Gad 22034
4 pages
Computer Vision
No ratings yet
Computer Vision
30 pages
Y10 05 CT27 Lesson Plan
No ratings yet
Y10 05 CT27 Lesson Plan
2 pages
CV_UNIT_1
No ratings yet
CV_UNIT_1
17 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
CH 1
No ratings yet
CH 1
20 pages
Computer Vision
No ratings yet
Computer Vision
35 pages
AJAX & PHP Question Bank
No ratings yet
AJAX & PHP Question Bank
18 pages
Notes
No ratings yet
Notes
34 pages
AI-Computer Vision
No ratings yet
AI-Computer Vision
16 pages
Designing of Microstrip Patch Antenna For X-Band Application
No ratings yet
Designing of Microstrip Patch Antenna For X-Band Application
7 pages
Hi-Scan 100100v-2is: Heimann X-Ray Technology New: 160 KV X-Ray Source - Typical Steel Penetration 37 MM
No ratings yet
Hi-Scan 100100v-2is: Heimann X-Ray Technology New: 160 KV X-Ray Source - Typical Steel Penetration 37 MM
2 pages
Table of Contents
No ratings yet
Table of Contents
9 pages
A computer vision system processes images acquired
No ratings yet
A computer vision system processes images acquired
4 pages
Lect 1 Computervision Student PPT 16-9-2017
No ratings yet
Lect 1 Computervision Student PPT 16-9-2017
143 pages
Computer Vision and Data Science Notes
No ratings yet
Computer Vision and Data Science Notes
11 pages
Computer Vision Class 10 Notes
No ratings yet
Computer Vision Class 10 Notes
5 pages
AI CV NOTES
No ratings yet
AI CV NOTES
6 pages
Chapter 1 PDF
No ratings yet
Chapter 1 PDF
17 pages
Computer Vision: Chapter 1: Introduction
No ratings yet
Computer Vision: Chapter 1: Introduction
7 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
Uit 1 & Unit 2 Notes
No ratings yet
Uit 1 & Unit 2 Notes
79 pages
Computer Vision PDF
No ratings yet
Computer Vision PDF
6 pages
Konfiguration Movilizer MFS Solution
No ratings yet
Konfiguration Movilizer MFS Solution
3 pages
Computer Vision Class 10 Notes
100% (5)
Computer Vision Class 10 Notes
7 pages
IGCSE ICT - Unit 1 - Chapter 1 (Edexel Pearson)
No ratings yet
IGCSE ICT - Unit 1 - Chapter 1 (Edexel Pearson)
41 pages
OSN 9800 U32 Enhanced Subrack Quick Installation Guide 03
No ratings yet
OSN 9800 U32 Enhanced Subrack Quick Installation Guide 03
78 pages
Introduction of Computer Vision
No ratings yet
Introduction of Computer Vision
5 pages
CV
No ratings yet
CV
9 pages
CS312 Module 4
No ratings yet
CS312 Module 4
21 pages
Tutorial 3
No ratings yet
Tutorial 3
3 pages
Revit Reviewer
No ratings yet
Revit Reviewer
7 pages
Sample Question Paper 1
No ratings yet
Sample Question Paper 1
2 pages
Chapter One-3
No ratings yet
Chapter One-3
8 pages
Computer Vision Report
No ratings yet
Computer Vision Report
31 pages
Unit-5 Computer Vision
No ratings yet
Unit-5 Computer Vision
3 pages
A Brief Introduction To Computer Vision
100% (1)
A Brief Introduction To Computer Vision
3 pages
Specifications-PC212DC212Fom212 Amended 02.2
No ratings yet
Specifications-PC212DC212Fom212 Amended 02.2
8 pages
DSP QB Updated - New
No ratings yet
DSP QB Updated - New
7 pages
Study Guide in Mil
No ratings yet
Study Guide in Mil
3 pages
Cursors in Dbms
No ratings yet
Cursors in Dbms
4 pages
Manual Arcadis Varic
No ratings yet
Manual Arcadis Varic
36 pages
ETN-24-SUPER-SF Series: Owner's Manual - Installation and Operating Instructions
100% (1)
ETN-24-SUPER-SF Series: Owner's Manual - Installation and Operating Instructions
6 pages
Imaging the World: Unlocking the Secrets of Digital Images
From Everand
Imaging the World: Unlocking the Secrets of Digital Images
Pasquale De Marco
No ratings yet

Unit 1 Introduction

Uploaded by

Unit 1 Introduction

Uploaded by

Unit 1: Introduction

◉ Computer vision vs human vision

What we see What a computer sees

How computers see image?

◉ Pixel is usually represented in one of the following:

How does Computer Vision Work?

◉ Noise Removal Example

Calculate dirty Background Subtract

You might also like