Lecture 4

The document discusses image representation and storage in computer vision, focusing on raster and vector images, their encoding, and techniques used in machine learning. It highlights the importance of color depth and various image formats, as well as representation methods such as pixel-based representation and feature extraction. A case study is included to illustrate the application of these techniques in developing an object recognition system for self-driving cars.

Uploaded by

najeebullah2637674


Image Processing & Computer Vision
Lecture #4
Content
• Image representation & storage
• Image Representation
• Raster Images
• Vector Images
• Image Representation Techniques in Machine Learning
What is the purpose of image encoding in computer science?

• Image encoding or image compression is the process of converting an image into a series of
bytes and codes, reducing the file size for storage or transmission and enhancing the efficiency
of computer resources.

• Image encoding in computer science is solely for data encryption and maintaining the privacy
of image content.

• The purpose of image encoding in computer science is to enlarge the file size for better image
quality.

• Image encoding or image compression is the process of editing an image to enhance its visual
elements like contrast and brightness.
Image Representation

Definition
Image representation refers to the way in which images are stored, processed, and displayed in
digital systems.

It encompasses the encoding of visual information so that it can be used by computer
algorithms. Images can be represented in various formats, and understanding how these
formats work is crucial for anyone working with digital media.

This representation can cover anything from basic pixel values to more complex data
structures.

Image Representation: The method or format used to encode, store, and display visual
information in digital systems.
There are two primary types of image representation: raster and vector.

• In raster images, the picture is made up of pixels, which are small dots of color.

• This means that the quality of the image is tied to its resolution: higher resolution means
more pixels and finer detail.

• In contrast, vector images are composed of paths defined by mathematical formulas.

• These images can be scaled to any size without losing quality because they do not rely on
pixels.

• Understanding these fundamental types is vital for selecting the appropriate image format
for specific applications.
Example of Raster vs. Vector:
A photograph is typically represented as a raster image, while a logo that needs to be resized
frequently may be best represented as a vector image.
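
To illustrate why a vector description scales without loss, here is a minimal sketch in Python. The function name `rasterize_circle` and the circle's center and radius are hypothetical choices made for this example. The shape is stored as a formula in resolution-independent coordinates and is only turned into pixels when a grid size is chosen.

```python
def rasterize_circle(size: int) -> list[str]:
    """Render a circle, described in unit coordinates, onto a size x size pixel grid."""
    # Vector description (assumed values): center and radius, independent of resolution.
    cx, cy, r = 0.5, 0.5, 0.4
    rows = []
    for j in range(size):
        y = (j + 0.5) / size  # vertical center of pixel row j in unit coordinates
        row = ""
        for i in range(size):
            x = (i + 0.5) / size  # horizontal center of pixel column i
            inside = (x - cx) ** 2 + (y - cy) ** 2 <= r ** 2
            row += "#" if inside else "."
        rows.append(row)
    return rows


# The same formula rendered at two resolutions: the larger grid follows the
# circle's outline more closely, with no stored pixels being stretched.
for size in (8, 16):
    print("\n".join(rasterize_circle(size)))
    print()
```

Enlarging an already rasterized 8x8 grid would only duplicate its pixels, whereas re-evaluating the formula at 16x16 produces new detail.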

Image representation can also involve color depth, which refers to the number of bits used to
represent each pixel's color.

Common depths include:

• 1-bit: Black and white images

• 8-bit: 256 colors
• 24-bit: over 16 million colors (True Color)

Higher color depths can represent more distinct colors and smoother gradients but require more
storage space.
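
The relationship between color depth, number of colors, and storage can be checked with a few lines of Python. This is a minimal sketch: the function names and the 1920x1080 image size are assumptions chosen for illustration, and the size computed is for raw, uncompressed pixel data only.

```python
def colors_for_depth(bits_per_pixel: int) -> int:
    """Number of distinct colors representable with the given bit depth."""
    return 2 ** bits_per_pixel


def uncompressed_size_bytes(width: int, height: int, bits_per_pixel: int) -> int:
    """Size of the raw pixel data, rounded up to whole bytes."""
    total_bits = width * height * bits_per_pixel
    return (total_bits + 7) // 8


# Hypothetical 1920x1080 image at the three depths listed above.
for depth in (1, 8, 24):
    print(depth, colors_for_depth(depth), uncompressed_size_bytes(1920, 1080, depth))
```

Each extra bit doubles the number of representable colors, while storage grows only linearly with the bit depth.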
Different image formats are available, each with unique properties that affect usage. Some
common formats include:

Format  Type    Use Cases
JPEG    Raster  Photographs and web images, due to compression
PNG     Raster  Images requiring transparency
GIF     Raster  Simple animations and graphics
SVG     Vector  Logos and icons that need scaling
TIFF    Raster  High-quality images in printing

This detailed understanding allows for better decisions in digital projects, ensuring that the right
format is chosen for quality, performance, and intended use.
Image Representation in Computer Science
Image representation plays a critical role in Computer Science, particularly in areas
involving graphics, web development, and media. Understanding how images are
encoded allows for effective manipulation, storage, and display in various
applications.

Two main categories of image representation exist: raster and vector images.

Raster Image: An image represented as a grid of pixels, where each pixel has its
own color value.

Vector Image: An image defined by mathematical equations representing shapes,
allowing scaling without loss of quality.

For example, a photo taken with a camera is stored as a raster image, while a
company logo designed in a graphics editor may be saved as a vector image.
Another aspect of image representation is the concept of color depth, which
indicates the number of bits used to represent the color of a single pixel.
Common color depths include:

• 1-bit: Black and white images

• 8-bit: 256 colors
• 24-bit: True color (16.7 million colors)

Higher color depth results in more accurate color representation but also
increases file sizes.

Color Depth: The number of bits allocated to represent the color of a pixel in
an image.
The choice of image format is vital for performance and quality. Here are some popular formats and
their characteristics:

Format  Type    Uses
JPEG    Raster  Common for photographic images due to compression
PNG     Raster  Supports transparency and is widely used on the web
GIF     Raster  Used for simple animations and graphics
SVG     Vector  Ideal for scalable graphics like logos
TIFF    Raster  High-quality images for printing and archiving

Selecting the appropriate format depends on the desired balance between image quality, file
size, and specific use cases.
Image Representation Techniques in Machine Learning
• In the realm of machine learning, understanding image representation is essential for tasks such
as image classification, object detection, and generative image modeling. Images must be
converted into a format that algorithms can interpret, which often involves translating visual data
into numerical representations.

• This conversion is key to enabling effective analysis and manipulation of image data.

• One common technique for image representation is pixel-based representation.

• This method involves representing each image as a grid of pixels, with each pixel containing
values for color components.

• Consider the structure of a 24-bit RGB image, where each pixel has three color components
(red, green, blue), typically represented by 8 bits each. Thus, an image of size 100x100 pixels
consists of an array with dimensions 100x100x3.
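
The 100x100x3 layout described above can be sketched with NumPy, which is an assumption here since the slides do not name a library. The array is indexed as height x width x channel, and `uint8` gives each color component 8 bits.

```python
import numpy as np

# A 100x100 RGB image: height x width x 3 channels, one byte per channel.
image = np.zeros((100, 100, 3), dtype=np.uint8)

# Set the top-left pixel to pure red (R=255, G=0, B=0).
image[0, 0] = [255, 0, 0]

print(image.shape)   # dimensions of the array
print(image.nbytes)  # total storage for the raw pixel data, in bytes
```

With 3 bytes per pixel, the raw data for this image occupies 100 x 100 x 3 = 30,000 bytes.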
Pixel-based Representation:

• Pixel-based representation is the backbone of digital imaging, allowing computers to store
and manipulate visual data as discrete units of color and intensity. It's crucial for various
applications, from digital photography to computer vision, enabling the capture, processing,
and display of images in a format computers can understand.

• A method of representing an image by encoding each pixel's color information in a grid
format.

Example of Pixel Array: For a 2x2 pixel image in RGB format, the pixel representation could
look like the following:
[[[255, 0, 0], [0, 255, 0]], [[0, 0, 255], [255, 255, 0]]]
In this case, the first pixel is red, the second pixel is green, the third is blue, and the
fourth is yellow.
Another important technique is feature extraction, where key attributes of an image are
identified and used instead of raw pixel values.

This technique reduces the amount of data that needs to be processed and can significantly
improve model performance. Popular methods of feature extraction include:

• Histogram of Oriented Gradients (HOG)

• Scale-Invariant Feature Transform (SIFT)

• Principal Component Analysis (PCA)

Feature Extraction: The process of transforming raw image data into a set of relevant features
for analysis in machine learning.
Feature extraction methods play a crucial role in image representation. Here are some popular
techniques with their applications:

Method  Advantages                           Applications
HOG     Effective for recognizing objects    Pedestrian detection, face detection
SIFT    Robust against scaling and rotation  Image stitching, object recognition
PCA     Reduces dimensionality, denoising    Image compression, facial recognition

By leveraging these features, machine learning models can focus on the most significant
information in the image, leading to faster processing and better predictions.
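
As an illustration of the dimensionality reduction listed for PCA, the following sketch projects flattened images onto their first few principal components using NumPy's SVD. The data is random and purely hypothetical, and the choices of 50 images, 8x8 pixels, and `k = 10` components are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 50 grayscale images of 8x8 pixels, each flattened to 64 values.
images = rng.random((50, 64))

# Center every pixel position (column) on its mean over the data set.
mean = images.mean(axis=0)
centered = images - mean

# Rows of vt are the principal directions, ordered by decreasing variance.
_, singular_values, vt = np.linalg.svd(centered, full_matrices=False)

k = 10
components = vt[:k]                 # shape (10, 64)
features = centered @ components.T  # shape (50, 10): the reduced representation

# Approximate reconstruction of the images from only k features each.
reconstructed = features @ components + mean
print(features.shape, reconstructed.shape)
```

Each image is now described by 10 numbers instead of 64. How much detail survives depends on how much of the variance the kept components capture, which for real images is typically far more than for random data like this.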
Image Representation Examples and Explained Techniques
In computer science, various techniques are used to represent images effectively for processing
and analysis. Understanding these techniques can greatly enhance the performance of algorithms
that deal with visual data.

Two primary categories of image representation play a crucial role: pixel-based representation
and feature extraction.

Pixel-based Representation: The encoding of images as a grid of pixels, where each pixel
corresponds to a specific color value.

Example of a Pixel Array:

For a small 2x2 pixel image in RGB format, the pixel representation could look like this:
[[[255, 0, 0], [0, 255, 0]], [[0, 0, 255], [255, 255, 0]]]

Here, each innermost array holds the RGB values of one pixel. Reading left to right and top to
bottom, the pixels are red, green, blue, and yellow.
Normalizing pixel values to a range between 0 and 1 can improve model performance in
machine learning applications.
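
A minimal sketch of this normalization, assuming NumPy and 8-bit channel values (0 to 255), applied to the 2x2 example image:

```python
import numpy as np

# The 2x2 RGB example image: red, green, blue, yellow.
pixels = np.array(
    [[[255, 0, 0], [0, 255, 0]],
     [[0, 0, 255], [255, 255, 0]]],
    dtype=np.uint8,
)

# Convert to float before dividing so the values land in the range [0.0, 1.0].
normalized = pixels.astype(np.float32) / 255.0

print(normalized.shape)
print(normalized[1, 1])  # the yellow pixel after normalization
```

Dividing by 255 is one common convention. Some models instead expect inputs standardized by a per-channel mean and standard deviation.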
Feature extraction involves identifying and utilizing key attributes from images instead of relying
solely on raw pixel values.

This approach allows algorithms to operate more efficiently, focusing on the most relevant
characteristics of the image data. Common techniques for feature extraction include:

• Histogram of Oriented Gradients (HOG)

• Scale-Invariant Feature Transform (SIFT)

• Principal Component Analysis (PCA)

Feature Extraction: A technique that transforms raw image data into a set of relevant
features for analysis, improving data handling efficiency.
Feature extraction techniques are crucial for image representation efficiency and effectiveness.
Below are notable methods with their descriptions and applications:

Method  Advantages                                    Applications
HOG     Good for object recognition tasks             Pedestrian detection, face recognition
SIFT    Robust to scaling and rotation changes        Image stitching, 3D modeling
PCA     Reduces dimensionality, useful for denoising  Facial recognition, image compression

By employing these techniques, machine learning models can enhance their accuracy and
efficiency when interpreting image data.
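
To give a sense of what a gradient-based feature looks like, the sketch below computes a single histogram of gradient orientations for one image patch. It is a simplified illustration of the idea behind HOG, not the full method: a complete HOG descriptor also splits the image into cells and normalizes histograms over blocks. The function name `orientation_histogram`, the 9 bins, and the test patch are assumptions made for the example.

```python
import numpy as np


def orientation_histogram(gray: np.ndarray, bins: int = 9) -> np.ndarray:
    """Magnitude-weighted histogram of unsigned gradient orientations (0 to 180 degrees)."""
    # np.gradient returns the derivative along rows (y) first, then columns (x).
    gy, gx = np.gradient(gray.astype(np.float64))
    magnitude = np.hypot(gx, gy)
    # Fold directions into [0, 180) so opposite gradients share a bin.
    angle = np.degrees(np.arctan2(gy, gx)) % 180.0
    hist, _ = np.histogram(angle, bins=bins, range=(0.0, 180.0), weights=magnitude)
    total = hist.sum()
    return hist / total if total > 0 else hist


# Hypothetical 8x8 patch whose brightness increases from left to right,
# so every gradient points horizontally.
patch = np.tile(np.arange(8, dtype=np.float64), (8, 1))
hist = orientation_histogram(patch)
print(hist)
```

The 64 pixel values of the patch are summarized by 9 numbers that describe the dominant edge direction, which is the kind of compact, structure-focused description that feature extraction aims for.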
Image Representation: Key Takeaways
• Image representation is the method used to encode, store, and display visual information in
digital systems, essential for image processing in computer science.
• The two primary types of image representation are raster, which consists of pixel grids, and
vector, which is defined by mathematical formulas for scalability.
• Color depth refers to the number of bits used to represent pixel colors; higher color depth
allows for more vibrant images at the cost of increased storage.
• In machine learning, image representation techniques like pixel-based representation and
feature extraction are crucial for processing and analyzing visual data effectively.
• Pixel-based representation encodes images as grids of pixels, while feature extraction
identifies relevant attributes to improve algorithm performance.
• Examples of image representation techniques include: Histogram of Oriented Gradients
(HOG), Scale-Invariant Feature Transform (SIFT), and Principal Component Analysis (PCA),
which enhance the efficacy of image processing tasks.
Case Study: Image Representation for Object Recognition
Problem Statement
A self-driving car company wants to develop an object recognition
system to detect pedestrians, cars, and road signs. The system should
be able to represent images in a way that allows for efficient and
accurate object recognition.

Questions
1. What are the different image representation techniques?
2. How do these techniques affect object recognition accuracy?
3. Which technique is most suitable for the self-driving car's object
recognition system?
Solution
Image Representation Techniques

1. Pixel-based representation: Images are represented as a matrix of pixel values.

2. Feature-based representation: Images are represented as a set of features, such as
edges, lines, or shapes.

3. Frequency-based representation: Images are represented in the frequency domain
using techniques such as Fourier transform.
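
The frequency-based representation in item 3 can be sketched with NumPy's 2D fast Fourier transform. The 8x8 test image, a horizontal cosine pattern with two cycles across its width, is a hypothetical input chosen so that the result is easy to interpret.

```python
import numpy as np

# Hypothetical 8x8 grayscale image: a cosine pattern with 2 cycles across the width,
# repeated identically in every row.
x = np.arange(8)
row = np.cos(2 * np.pi * 2 * x / 8)
image = np.tile(row, (8, 1))

# Forward transform: each entry describes one (vertical, horizontal) frequency pair.
spectrum = np.fft.fft2(image)
magnitude = np.abs(spectrum)

# Location of the strongest frequency component.
peak = np.unravel_index(np.argmax(magnitude), magnitude.shape)
print(peak)

# The inverse transform recovers the original pixel values.
restored = np.fft.ifft2(spectrum).real
print(np.allclose(restored, image))
```

For this pattern the energy sits at vertical frequency 0 and horizontal frequency 2, together with its mirror at index 6, which is a consequence of the input being real-valued. No information is lost: the spectrum is simply a different description of the same image.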
Object Recognition Accuracy

1. Pixel-based representation: High accuracy for simple objects, but low accuracy for
complex objects or objects with varying lighting conditions.

2. Feature-based representation: High accuracy for objects with distinct features, but
low accuracy for objects with similar features.

3. Frequency-based representation: High accuracy for objects with distinct frequency
patterns, but low accuracy for objects with similar frequency patterns.
Suitable Technique
Based on the requirements of the self-driving car's object recognition system, a
feature-based representation technique is most suitable. This technique allows
for efficient and accurate object recognition, even in complex environments
with varying lighting conditions.

Tools and Techniques

1. Convolutional Neural Networks (CNNs): A type of neural network that is
particularly well-suited for image classification tasks and that learns feature
representations directly from pixel data.
2. Object Detection Algorithms: Such as YOLO (You Only Look Once) or SSD
(Single Shot Detector).
3. Image Processing Libraries: Such as OpenCV or Pillow.

By using a feature-based representation technique and leveraging tools and techniques such
as CNNs, object detection algorithms, and image processing libraries, the self-driving car
company can develop an efficient and accurate object recognition system.
