0% found this document useful (0 votes)

2 views54 pages

Week03 1 S

Uploaded by

Ayana Negera

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views54 pages

Week03 1 S

Uploaded by

Ayana Negera

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 54

VIC TO R IA U NIVER S ITY O F W ELLINGTO N

Te Whare Wananga o te Upoko o te Ika a Maui

VUW
School of Engineering and Computer Science

COMP 422 — Weeks 2-3

Computer Vision and Image Processing —

Introduction

Mengjie Zhang
[email protected]
COMP422 IP and Vision: 2

Outline

• Overview – Computer Imaging

• Computer Vision and its applications
• Image Processing and its applications
• Image acquisition and display
• Image representation
• Image file formats
• Tools
• Image implementation
COMP422 IP and Vision: 3

Announcements

• netpbm/pbmplus package:
/vol/courses/comp422/src/netpbm/

• jpeg/jpg package:
/vol/courses/comp422/src/jpeg-6b/

• An implementation of pgm image representation source code:

/vol/courses/comp422/src/mengjie/pgm-rep/

• Three programs making a PGM image, circles and squares in

an image: /vol/courses/comp422/bin/
COMP422 IP and Vision: 4

Questions
• In what kinds of situation computers are used to process/analyse
images/pictures? Give some examples.
• What are the differences between computer vision and image
processing? Are the images examined/acted on by people or
automatically by computers?
• How do you acquire/obtain an image? Give some examples of
image acquisition device.
• How can we display an image? Give some examples of image
display device.
• Have you used digital cameras and scanners before?
• What types can images be represented?
• What image formats have you seen before?
• What image packages have you used before?
• How do we use current packages to process images?
COMP422 IP and Vision: 5

Goals

The main objectives of this topic – Introduction to computer vision

and image processing are:

1. Give an overview of concepts and applications of computer vi-

sion and image processing.
2. Discuss basic algorithms and techniques for image analysis.
3. Understand image representations and image formats, and im-
plementation of images.
4. Understand and learn commonly used image feature types and
feature extraction methods.
5. Perform pattern classification on vision problems – data mining
in image data – object classification.
COMP422 IP and Vision: 6

Overview: Computer Imaging

• Computer imaging is needed: Send/receive complex data; A

picture is worth a thousand words (in some cases); WWW, Vi-
sual information, Multimedia.
• Computer imaging can be defined as the acquisition and pro-
cessing of visual information by computer. [Scott Umbaugh]
• Two categories:
1. Processed images are for use by a computer – Vision appli-
cations
2. Output images are for use by people – image processing ap-
plications.
COMP422 IP and Vision: 7

Computer Imaging (Continued)

Computer Imaging

Computer Image
Vision Processing
COMP422 IP and Vision: 8

Computer Imaging (Continued)

Note: The boundaries separating these two are not very clear, or
fuzzy.

• Historically, IP grew from electrical engineering as an exten-

sion of signal processing, while computer vision was largely
derived from computer science discipline.
• Recently, the two groups have come together to create modern
computer imaging. They share some basic processing algorithm
and methods.
• They are often called together: “Image processing and com-
puter vision”. For example, the international conference on im-
age processing and computer vision is annually held in different
places of the world.
COMP422 IP and Vision: 9

Computer Vision

• Computer vision deals with the processing of image data for use
by a computer.
• In other words, the images are examined and acted upon by a
computer and the application does not involve a human being
in the visual loop.
• Note: People can participate the development of a vision sys-
tem, however, a computer can directly apply the patterns/rules
(knowledge) extracted by the system (to the unseen data).
COMP422 IP and Vision: 10

Computer Vision (continued)

A major topic in computer vision is Image Analysis, which involves

the examination of the image data for solving a vision problem.
Typically, this includes:

• Edge Detection
• Segmentation
• Transformation
• Feature extraction
• Pattern (object) classification

These topics will be discussed in somewhat detail later.

COMP422 IP and Vision: 11

Vision Applications

• Mechanical and manufacturing Engineering

– Quality control (Robot)
– Machinery Fault diagnosing system
• Medical community
– Skin tumor diagnosis
– Brain surgery aided system
– Automatic clinic test system (ES)
– Tissue and cell analysis (identification)
COMP422 IP and Vision: 12

Vision Applications (Continued)

• Law and enforcement and security

– Fingerprint identification
– DNA analysis
– Highway speed monitoring
• Object Recognition/Detection
– Autonomous vehicle recognition
– Vehicle tracking and identification
– Cyclone finding in satellite images
– Mine/Mine-like target detection
• Weather prediction
COMP422 IP and Vision: 13

Image Processing

• Image processing involve the manipulation of image data for

viewing by people.
• In other words, the images are examined and acted on by people
in IP applications.
• Major topics:
– Image restoration
– Image enhancement
– Image compression
COMP422 IP and Vision: 14

Image Restoration

• Image restoration is the process of taking an image with some

known or estimated degradation, and restoring it to its original
appearance.
• It is often used in the situation that an image was somehow
degraded but needs to be improved before the image can be
reused. Publishing is such an example.
• To restore an image to its original appearance, the process of
the degradation (such as a model) is often necessary.
COMP422 IP and Vision: 15

Image Restoration Example

A typical example for image restoration in space exploration to

eliminate artifacts generated by mechanical jitter in a spacecraft is:

Image with distortion Restored image

COMP422 IP and Vision: 16

Image Enhancement

• Image enhancement involves taking an image and improving

it visually, typically by taking advantage of human visual sys-
tem’s response.
• One typical image enhancement technique is contrast stretch-
ing, where the contrast of an image is simply stretched.
• Image enhancement methods are often domain dependent.
• Image enhancement vs image restoration
COMP422 IP and Vision: 17

Image Enhancement Example

Image with poor contrast Enhanced image

COMP422 IP and Vision: 18

Image Compression

• Image compression involves reducing the typically massive

amount of data needed to represent an image.
• Compression procedure only (often) eliminates data that are vi-
sually unnecessary and takes advantage of the redundancy that
is inherent in most images.
• Some images can be reduced 50 times and some even more.
COMP422 IP and Vision: 19

IP Applications

• Medical community in Diagnostic imaging: Computerized To-

mograph (CT), Magnetic Resonance Imaging (MRI) scanning,
Positron Emission Tomography (PET).
• Biological research: Enhance microscopic images to get more
features.
• Entertainment industry: Editing, creating artificial scenes and
beings; Processing new haircut styles, eyeglasses.
• Computer-aided design
COMP422 IP and Vision: 20

Image Acquisition and Display

Image Acquisition Image Display

(Scanner (Monitor
Computer System Printer
Camera
Film
Video Player Video Recorder
... ...) ... ...)

• Image acquisition device: Scanners, Cameras, Vedio Players, ...

• Image storage: Films, files, ...
• Image display device: Monitors, printers, films, video
recorders, ...
• Computer imaging system: Process saved images.
COMP422 IP and Vision: 21

Scanners

• Different types: flat-bed or drum

• 600×300 to 2000×2000 per inch
• Used when source images are already in hard copy form.
– photos or radiographs
– drawing
COMP422 IP and Vision: 22

Cameras

• TV cameras: CCTV, Broadcast

• Digital cameras: from 200×200 to 4096×4096
– Digital still cameras: Limited resolution but avoid the use
of film for simple applications; downloading to computers
is slow
– High quality digital cameras: up to 4096×4096, can have 8,
10, 12 bits; require special interfacing
• Line-scan cameras: randomly addressable CCD
• Various spectral sensitivities: infrared, visible, ultra violet — X
rays.
COMP422 IP and Vision: 23

Issues in Image Output

• Resolution: Spatial (say 1280×1024 pixels on a monitor), color

(16, 256,...), time (speed, 25, 500, 107 frames per second on TV)
• Dynamic range: Black (better on paper than a monitor screen),
white (better on transparencies), Possible range of color, ...
• Accuracy
COMP422 IP and Vision: 24

From Analog Video Signal to Digital Image

• Frame Grabber: A special-purpose hardware which accepts a

standard video signal and produces an image in the form that a
computer can understand.
• Digital image: The form of an image which a computer can
understand is called a digital image.
• Video signal is continuous, while a digital image is discrete.
• Image Digitization: the process of transforming a standard
video signal into a digital image is called image digitization or
simply digitization.
COMP422 IP and Vision: 25

Digitization (Continued)

• Digitization is done by sampling the continuous signal at a fixed

rate.
• One line of a video signal is often digitized by instantaneously
measuring the voltage of the signal at fixed intervals in time.
• The value of the voltage at each instant is converted into a num-
ber corresponding the brightness of the image at that point.
• The brightness of the image at one point is often called the
“value” of that pixel.
• A pixel is a “point” in an image.
• An image is often accessed as a two dimensional array, in a
form of column×row. For example, 1024×768 pixels.
COMP422 IP and Vision: 26

Digitization (Continued)

Voltage

x x
x x
x x x x x x
x x x
x
One line of information

Time

One line

One pixel
COMP422 IP and Vision: 27

Hierarchical Image Pyramid

OPERATIONS IMAGE REPRESENTATION

High Level
Pattern Classification Object Classes/Labels

Feature Extraction Features (vectors)

Transformation Spectrum

Segmentation Segments

Edge Detection Edges/Lines

Neighborhood/
Preprocessing
Subimage
Low Level

Raw Image Data Pixels

COMP422 IP and Vision: 28

Image Representation

• From now on, without specific notice, images refer to digital

images.
• Images can be represented as an array, or a matrix. Each row
corresponds to a vector.
• A pixel at position (x, y) corresponds to the “brightness” of the
image at the position.
• There are four basic representations for images:
– Binary images
– Gray scale images
– color images
– multispectral
COMP422 IP and Vision: 29

Binary Images

• Binary is the simplest type. It refers to black and white image.

• In other words, there are only two values: 0 for black; 1 for
white, or one bit data, for each pixel in the image.
• These images are often used in the situation that only the gen-
eral shape, outline or position information are needed. For ex-
ample, optical character recognition, robotic gripper to grasp an
object.
• It can be obtained from other formats, particularly from grey
scale format by applying a single threshold.
COMP422 IP and Vision: 30

Sample Images
COMP422 IP and Vision: 31

Grey Scale Images

• Grey scale images only contain brightness information. Com-
pared with binary images, they contain richer information.
• Typically, grey scale images contain 8 bits data. The range of
pixel values is from 0 to 255. In other words, they have 256
grey levels.
• These images can provide some sorts of noise.
COMP422 IP and Vision: 32

Color Images

• Color images have three-band monochrome image data, each of

which corresponds to a different color.
• Typically, the three colors in color images are red, green and
Blue, or RGB.
• Each of the three colors contain 8 bit data. In total, RBG images
have 24 bit pixel data.
• At each position in a RGB image, a pixel corresponds to a color
pixel vector – R, G, B.
COMP422 IP and Vision: 33

Multispectral Images
• Miltispectral images typically contain information outside the
human perceptual range. They contain more (30) bands.
• These images are not directly visible to human, however, they
can be reduced into three bands RGB images, which human
being can see.
• These images might include infrared, ultraviolet data.
• Satellite images, underwater sonar images, radar images, in-
frared images, medical X-ray images are examples.
... ... ... ...

... ... red yellow blue violet

infrared orange green indigo ultravoilet ...

"Colors" of the spectrum

COMP422 IP and Vision: 34

Image File Formats

• Many image formats exists: At least 200 image formats in use

• Most image files contain header information and the raw pixel
data.
• Image header information usually contain some of the follow-
ing information:
– Number of rows (height)
– Number of columns (width)
– Number of bands
– Number of bits per pixel
– file type
• We only review those commonly used
COMP422 IP and Vision: 35

Image File Formats (Continued)

• BMP format (bitmap)

– often have the extensions .bmp, .dib, .vga, .bga, .rle, .rl4,
.rl8, ...
– 1, 4 and 8 bit indexed color, 24 bit RBG color
• PNM format (Portable anymap file format)
– pbm format (Portable bitmap file format, .pbm)
– pgm format (Portable graymap file format, .pgm)
– ppm format (portable pixmap file format, .ppm)
• TIFF format (Tagged Image File Format): 1, 4, 8, 32 bit indexed
color; 24 bit RGB color; 32 bit RGB + alpha. (Often have the
extension .tif)
COMP422 IP and Vision: 36

Image File Formats (Continued)

• GIF (Graphics Interchange Format): 1,8 bits per pixel indexed

color; always compressed using LZW (Lempel-Ziv-Welch,
lossless); have the extension .gif.
• JPEG (Joint Photographic Experts Groups): Compressed; com-
monly used in HTML and World wide web (WWW); have the
extension of .jpg or .jpeg.
• Others
– SGI (Silicon Graphics, Inc.)
– EPS (Encapsulated PostScript)
– VIP (Visualization in Image Processing)
COMP422 IP and Vision: 37

A Typical PGM Format Image

P2
# feep.pgm
24 7
15
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 3 3 3 3 0 0 7 7 7 7 0 0 11 11 11 11 0 0 15 15 15 15 0
0 3 0 0 0 0 0 7 0 0 0 0 0 11 0 0 0 0 0 15 0 0 15 0
0 3 3 3 0 0 0 7 7 7 0 0 0 11 11 11 0 0 0 15 15 15 15 0
0 3 0 0 0 0 0 7 0 0 0 0 0 11 0 0 0 0 0 15 0 0 0 0
0 3 0 0 0 0 0 7 7 7 7 0 0 11 11 11 11 0 0 15 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
COMP422 IP and Vision: 38

A Typical PGM Format Image (Continued)

• A magic number for identifying the file type. A pgm file’s

magic number is the two characters “P2”.
• A width, formatted as ASCII characters in decimal. (e.g. 24)
• A height, in ASCII decimal (e.g. 7)
• The maximum gray value, again in ASCII decimal (e.g. 15).
• Width×height gray values, each in ASCII decimal, between 0
and the specified maximum value, separated by whitespace. A
value of 0 means black, and the maximum value means white.
• Characters from a ”#” to the next end-of-line are ignored (com-
ments).
COMP422 IP and Vision: 39

P5 Format PGM images

P5 is a variation of P2 format. This format saves PGM images in

a RAWBITS or binary way. Ii is different from P2 format in the
following aspects:

• The magic number is P5 instead of P2.

• The gray values are stored as plain bytes, instead of ASCII dec-
imal.
• In the grays section, a newline character (typically) is set after
the maxval (each row).
• The files are smaller and many times faster to read and write.
• This format can only be used for maxvals less than or equal to
255. If maxval is larger, it will automatically fall back on the
plain format.
COMP422 IP and Vision: 40

Programming with Images

• Loading a image
• Creating an image
• Processing an image
• Outputting/saving an image
• Destroying an image
COMP422 IP and Vision: 41

Programming with Images (Continued)

#include <stdio.h>
#include <stdlib.h>

/* Macros and definitions */

#define TRUE 1
#define FALSE 0
#define OK 1

typedef unsigned char UCHAR;

typedef struct _PGMIMAGE /* Structure for containing a PGM format image */

{
int greylevel;
int xsize;
int ysize;
UCHAR *image;
} PGMIMAGE;
COMP422 IP and Vision: 42

Programming with Images (Continued)

/*
* CreatePGM() -- Creates a PGM image structure in memory
*/

PGMIMAGE *CreatePGM(int xsize, int ysize, int greylevel)

{
PGMIMAGE *pgm;

if ((pgm = (PGMIMAGE *) malloc(sizeof(PGMIMAGE))) == NULL) return NULL;

pgm->greylevel = greylevel;
pgm->xsize = xsize;
pgm->ysize = ysize;

if ((pgm->image = (UCHAR ) malloc(sizeof(UCHAR) xsize * ysize)) == NULL)

{
free(pgm);
return NULL;
}

memset(pgm->image, 0, xsize * ysize);

return pgm;
}
COMP422 IP and Vision: 43

Programming with Images (Continued)

/* LoadPGM() -- Loads a PGM format image file into memory */

PGMIMAGE LoadPGM(char filename)

{
FILE *fp;
PGMIMAGE *pgm;
int i;

if ((fp = fopen(filename, "rb")) == NULL) return NULL;

if ((pgm = (PGMIMAGE *) malloc(sizeof(PGMIMAGE))) == NULL)

{
fclose(fp);
return NULL;
}
fscanf(fp, "P5 %d %d %d\n", &pgm->xsize,
&pgm->ysize, &pgm->greylevel);

if ((pgm->image = (UCHAR ) malloc(sizeof(UCHAR) pgm->xsiz

pgm->ysize)) == NULL)
{
free(pgm);
fclose(fp);
return NULL;
}

for (i = 0; i < pgm->xsize * pgm->ysize; i++)

{
pgm->image[i] = (UCHAR) fgetc(fp);
}
return pgm;
}
COMP422 IP and Vision: 44

Programming with Images (Continued)

/* SavePGM() -- Save a PGM format image to a file */

int SavePGM(char *filename, PGMIMAGE *pgm)
{
FILE *fp;
int i;

if ((fp = fopen(filename, "wb")) == NULL) return (int) NULL;

fprintf(fp, "P5\n %d %d %d\n", pgm->xsize, pgm->ysize, pgm->greylevel);

for (i = 0; i < pgm->xsize * pgm->ysize; i++)

{
fputc(pgm->image[i], fp);
}

fclose(fp);

return OK;
}

/* DestroyPGM() -- Destroys a PGM format image in memory*/

void DestroyPGM(PGMIMAGE *pgm)
{
free(pgm->image);
free(pgm);
}
COMP422 IP and Vision: 45

Programming with Images(Continued)

An example: process.c

• This program shows how to process a PGM (P5) format image.

• It loads an original image (source), draws white lines on diag-
onals (image processing), then save the results to a new image
(target).
COMP422 IP and Vision: 46

Programming with Images (Continued)

#include "pgm_mengjie.h"
typedef UCHAR byte;

int main(int argc, char *argv[])

{
PGMIMAGE *source, *target;
int i, row, col, xsize, ysize, greylevel;
byte **temp; /* int **temp; is also ok. For temporary storage */

if (argc != 3)
{
fprintf(stderr, "\nUsage: %s [source-image] [target-image]\n", argv[0]);
exit(0);
}
if ((source = LoadPGM(argv[1])) == NULL)
{
fprintf(stderr, "Error on reading %s\n", argv[1]);
exit(0);
}

xsize = source->xsize;
ysize = source->ysize;
greylevel = source->greylevel;
COMP422 IP and Vision: 47

Programming with Images(Continued)

/* Create an image for output/save */

if ((target = CreatePGM(xsize, ysize, greylevel)) == NULL)
{
fprintf(stderr, "Not enough memory to create an image!\n");
exit(0);
};

/* allocate memory for the temporary array for processing image(s) */

temp = calloc(xsize, sizeof(byte *));
if (temp == NULL)
{
fprintf(stderr, "Not enough memory to create temp[]\n");
exit(0);
}
for (i = 0; i < ysize; i++)
{
temp[i] = calloc(xsize, sizeof(byte));
if (temp[i] == NULL)
{
fprintf(stderr, "Not enough memory to create temp[][]\n");
exit(0);
}
}
COMP422 IP and Vision: 48

Programming with Images (Continued)

/* Put the source image data to the temporary array */

for (row = 0; row < ysize; row++)
for (col = 0; col < xsize; col++)
temp[row][col] = (byte) source->image[xsize * row + col];

/* Image processing: draw white lines on the diagonals,

add 10 to other pixels */
for (row = 0; row < ysize; row++)
{
for (col = 0; col < xsize; col++)
{
if (col == row || col == xsize - row)
temp[row][col] = greylevel;
else
temp[row][col] += 10;
}
}

/* Assign the processing results to the target image */

for (row = 0; row < ysize; row++)
for (col = 0; col < xsize; col++)
target->image[xsize * row + col] = (byte) temp[row][col];
COMP422 IP and Vision: 49

Programming with Images (Continued)

/* save the target image */

SavePGM(argv[2], target);

fprintf(stderr, "The process is completed... \n");

DestroyPGM(source);
DestroyPGM(target);

return 0;
}
COMP422 IP and Vision: 50

Image Programs/Packages/Tools

• xv
• ImgStar
• pbmplus/netpbm
• jpeg-6b(jpegtran)
• man pnm, pgm, ppm, ...
COMP422 IP and Vision: 51

Image Programs/Packages/Tools (Continued)

For example, % man pnm:
pnm(5) pnm(5)

NAME
pnm - portable anymap file format

DESCRIPTION
The pnm programs operate on portable bitmaps, graymaps,
and pixmaps, produced by the pbm, pgm, and ppm segments.
There is no file format associated with pnm itself.

SEE ALSO
anytopnm(1), rasttopnm(1), tifftopnm(1), xwdtopnm(1), pnm-
tops(1), pnmtorast(1), pnmtotiff(1), pnmtoxwd(1),
pnmarith(1), pnmcat(1), pnmconvol(1), pnmcrop(1), pnm-
cut(1), pnmdepth(1), pnmenlarge(1), pnmfile(1), pnm-
flip(1), pnmgamma(1), pnmindex(1), pnminvert(1), pnmmar-
gin(1), pnmnoraw(1), pnmpaste(1), pnmrotate(1), pnm-
scale(1), pnmshear(1), pnmsmooth(1), pnmtile(1), ppm(5),
pgm(5), pbm(5)
COMP422 IP and Vision: 52

Summary

• Computer imaging = CV + IP
• CV vs IP
• CV main tasks and applications
• IP main topics and applications
• Commonly used image acquisition and display device/systems
• Image digitization
• Image representation/types: binary, gray scale/level, color, mul-
tispectral ...
• Image formats: .pnm, .jpg/jpeg, .gif, .bmp, ...
• PNM: P2/P5—gray level
• Processing implementation: Defining, loading, creating, pro-
cessing, saving, destroying, ...
• Tools: xv, pbmplus/netpbm, ImgStar, jpeg-6b, CVIP, ...
COMP422 IP and Vision: 53

Exercises

• Use cir make, squ make, pgmmake programs to generate some

pictures.
• Write a program that reverses a PGM image.
(Inew (r, c) = 255 - Iold(r, c))
• Write a program that adds two PGM images and produces a
third image as output. (using netpbm/pbmplus or the represen-
tation in .../public/src/mengjie/pgm-rep/ )
• Convert an image from PGM format to GIF format and JPG for-
mat using programs in netpbm/pbmplus package and/or jpeg-6b
package.
COMP422 IP and Vision: 54

Questions for Next Discussion

• What tasks/stages are commonly included/used in image anal-

ysis?
• What is preprocessing for? Give some examples for preprocess-
ing.
• What algebraic operations can be applied to images? Give some
examples.
• Give examples for edge detection and segmentation.
• Image features, feature types, feature extraction
• Object classification methods — Simple DM algorithms

Chapter 1 Introduction To Computer Vision and Image Processing
No ratings yet
Chapter 1 Introduction To Computer Vision and Image Processing
42 pages
Notes
No ratings yet
Notes
34 pages
IP Full
No ratings yet
IP Full
629 pages
DIP
No ratings yet
DIP
33 pages
Computer Vision
No ratings yet
Computer Vision
30 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Final 7 12 19 Personal-Development-Module-Career-Pathways
100% (3)
Final 7 12 19 Personal-Development-Module-Career-Pathways
28 pages
Lecture 2 Digital Image Processing
No ratings yet
Lecture 2 Digital Image Processing
28 pages
Unit 1
No ratings yet
Unit 1
186 pages
Lec 01
No ratings yet
Lec 01
50 pages
Lecture (1) - Chapter (1) - Introduction
No ratings yet
Lecture (1) - Chapter (1) - Introduction
44 pages
3-2 Fundamentals of Computer Vision
No ratings yet
3-2 Fundamentals of Computer Vision
43 pages
Computer Vision Al 701
No ratings yet
Computer Vision Al 701
38 pages
Unit I 29.1.24
No ratings yet
Unit I 29.1.24
138 pages
Digital Image Processing Seminar
80% (5)
Digital Image Processing Seminar
23 pages
1 - Unit Dip
No ratings yet
1 - Unit Dip
130 pages
CH 1 Introduction
No ratings yet
CH 1 Introduction
57 pages
Computer Vision and Image Computer Vision and Image Processing (CSEL Processing (CSEL - 393) 393) 2
No ratings yet
Computer Vision and Image Computer Vision and Image Processing (CSEL Processing (CSEL - 393) 393) 2
12 pages
Lecture 1
No ratings yet
Lecture 1
48 pages
Lec 01 CompVision N DIP Intro
No ratings yet
Lec 01 CompVision N DIP Intro
91 pages
Chapter 01 Introduction
No ratings yet
Chapter 01 Introduction
33 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
Image Processing
No ratings yet
Image Processing
10 pages
CV Lecture 1
No ratings yet
CV Lecture 1
65 pages
Chapter 11
No ratings yet
Chapter 11
50 pages
Lec 1 Introduction
No ratings yet
Lec 1 Introduction
34 pages
Image Processing Lecture 1
100% (1)
Image Processing Lecture 1
37 pages
Introduction Fundamentals
No ratings yet
Introduction Fundamentals
100 pages
Chapter One: Computer Vision Vs Image Processing
No ratings yet
Chapter One: Computer Vision Vs Image Processing
30 pages
Lec 1
No ratings yet
Lec 1
51 pages
Chapter 1 PDF
No ratings yet
Chapter 1 PDF
17 pages
C All
No ratings yet
C All
109 pages
Lec 1
No ratings yet
Lec 1
32 pages
S1 Art and Design
No ratings yet
S1 Art and Design
24 pages
CVIP Lecture For Stud
No ratings yet
CVIP Lecture For Stud
64 pages
Introduction To CVIP
No ratings yet
Introduction To CVIP
33 pages
CSE367 Lecture 1
No ratings yet
CSE367 Lecture 1
73 pages
Ch01 Introduction To Computer Vision and Image Processing 1
No ratings yet
Ch01 Introduction To Computer Vision and Image Processing 1
29 pages
Chapter 1 (CV & IP)
No ratings yet
Chapter 1 (CV & IP)
41 pages
Chapter1 CV
No ratings yet
Chapter1 CV
29 pages
AD8703 BCV Unit I 2023
No ratings yet
AD8703 BCV Unit I 2023
65 pages
Computer Vision Al 701
No ratings yet
Computer Vision Al 701
50 pages
Digital Image Processing (DIP) : Dr. Muhammad Nawaz Assistant Professor (Multimedia Technologies)
No ratings yet
Digital Image Processing (DIP) : Dr. Muhammad Nawaz Assistant Professor (Multimedia Technologies)
37 pages
Topic - 1 Introduction To Image and Vision
No ratings yet
Topic - 1 Introduction To Image and Vision
118 pages
Image Processing
No ratings yet
Image Processing
18 pages
CH 1
No ratings yet
CH 1
20 pages
Computer Vision
No ratings yet
Computer Vision
35 pages
Emp - Tech - Q2 - M15 - Research Content For Social Advocacy in Developing An ICT Project - FV
No ratings yet
Emp - Tech - Q2 - M15 - Research Content For Social Advocacy in Developing An ICT Project - FV
25 pages
Unit 1
No ratings yet
Unit 1
64 pages
Unit 1 Computer Vision Notes
No ratings yet
Unit 1 Computer Vision Notes
11 pages
Lect 1 Computervision Student PPT 16-9-2017
No ratings yet
Lect 1 Computervision Student PPT 16-9-2017
143 pages
CH 1
No ratings yet
CH 1
18 pages
Chapter One
No ratings yet
Chapter One
47 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
4 Ip
No ratings yet
4 Ip
91 pages
Dip Module 1 Notes
No ratings yet
Dip Module 1 Notes
33 pages
Artificial Intelligence (Computer Vision) : by Dr. Sehat Ullah Department of Computer Science & IT University of Malakand
No ratings yet
Artificial Intelligence (Computer Vision) : by Dr. Sehat Ullah Department of Computer Science & IT University of Malakand
35 pages
Image Processing and Computer Vision: Goals
No ratings yet
Image Processing and Computer Vision: Goals
14 pages
Adobe Photoshop CC Tutorial 1
No ratings yet
Adobe Photoshop CC Tutorial 1
23 pages
Digital Image Processing: Instructor: Namrata Vaswani
No ratings yet
Digital Image Processing: Instructor: Namrata Vaswani
27 pages
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
48 pages
ECE885 Computer Vision: Prof. Bhupinder Verma
No ratings yet
ECE885 Computer Vision: Prof. Bhupinder Verma
59 pages
Modul 2 Proses Bisnis
No ratings yet
Modul 2 Proses Bisnis
41 pages
Ebook Opengl Distilled
100% (3)
Ebook Opengl Distilled
250 pages
Advanced Word Processing Skills
No ratings yet
Advanced Word Processing Skills
50 pages
Introduction To MIL (Part 5) - Media Habits and Performance Task - Project
No ratings yet
Introduction To MIL (Part 5) - Media Habits and Performance Task - Project
23 pages
HP PPT Guidelines
No ratings yet
HP PPT Guidelines
60 pages
Design Exploration Tutorial For HDL Designer Series: Release v2019.4
No ratings yet
Design Exploration Tutorial For HDL Designer Series: Release v2019.4
40 pages
Module 3 Empowerment Technologies
No ratings yet
Module 3 Empowerment Technologies
8 pages
Desk Tidy Exercise: Extrude
No ratings yet
Desk Tidy Exercise: Extrude
9 pages
Melody House Prospectus May 2022 EA Students
No ratings yet
Melody House Prospectus May 2022 EA Students
10 pages
Chapter 3 - Image Enhancement
No ratings yet
Chapter 3 - Image Enhancement
79 pages
2 Diploma 2nd Sem Syl For Printing Photo MMT ID MOPM ARCH
No ratings yet
2 Diploma 2nd Sem Syl For Printing Photo MMT ID MOPM ARCH
63 pages
Advanced Maya Texturing and Lighting 2nd Edition Lee Lanier - Download The Ebook in PDF With All Chapters To Read Anytime
100% (1)
Advanced Maya Texturing and Lighting 2nd Edition Lee Lanier - Download The Ebook in PDF With All Chapters To Read Anytime
61 pages
Zanala Bangladesh
No ratings yet
Zanala Bangladesh
16 pages
Report
No ratings yet
Report
49 pages
Lecture 5 - Plotting Points and Lines
No ratings yet
Lecture 5 - Plotting Points and Lines
26 pages
Introduction To Media and Information Literacy
No ratings yet
Introduction To Media and Information Literacy
7 pages
Photoshop Intro
No ratings yet
Photoshop Intro
37 pages
Graphics Chapter Two
No ratings yet
Graphics Chapter Two
31 pages
Visual/Graphic Aids For The Technical Report: Robertcury
No ratings yet
Visual/Graphic Aids For The Technical Report: Robertcury
5 pages
CgFeb - 2023
No ratings yet
CgFeb - 2023
2 pages
Smartshader PDF
No ratings yet
Smartshader PDF
14 pages
Ch-7 WorkSheet
No ratings yet
Ch-7 WorkSheet
2 pages
Adobe Media Encoder Log-Last
No ratings yet
Adobe Media Encoder Log-Last
8 pages
Css Za Chat
No ratings yet
Css Za Chat
5 pages
Dip Lab 8 Muhammad Ali 18-Se-16: Example 1
No ratings yet
Dip Lab 8 Muhammad Ali 18-Se-16: Example 1
6 pages
Mohd Razswan Bin Rus: Contact Info
No ratings yet
Mohd Razswan Bin Rus: Contact Info
4 pages

Week03 1 S

Uploaded by

Week03 1 S

Uploaded by

VIC TO R IA U NIVER S ITY O F W ELLINGTO N

Te Whare Wananga o te Upoko o te Ika a Maui

COMP 422 — Weeks 2-3

Computer Vision and Image Processing —

• Overview – Computer Imaging

• An implementation of pgm image representation source code:

• Three programs making a PGM image, circles and squares in

The main objectives of this topic – Introduction to computer vision

1. Give an overview of concepts and applications of computer vi-

Overview: Computer Imaging

• Computer imaging is needed: Send/receive complex data; A

Computer Imaging (Continued)

Computer Imaging (Continued)

• Historically, IP grew from electrical engineering as an exten-

Computer Vision (continued)

A major topic in computer vision is Image Analysis, which involves

These topics will be discussed in somewhat detail later.

• Mechanical and manufacturing Engineering

Vision Applications (Continued)

• Law and enforcement and security

• Image processing involve the manipulation of image data for

• Image restoration is the process of taking an image with some

Image Restoration Example

A typical example for image restoration in space exploration to

Image with distortion Restored image

• Image enhancement involves taking an image and improving

Image Enhancement Example

Image with poor contrast Enhanced image

• Image compression involves reducing the typically massive

• Medical community in Diagnostic imaging: Computerized To-

Image Acquisition and Display

Image Acquisition Image Display

• Image acquisition device: Scanners, Cameras, Vedio Players, ...

• Different types: flat-bed or drum

• TV cameras: CCTV, Broadcast

Issues in Image Output

• Resolution: Spatial (say 1280×1024 pixels on a monitor), color

From Analog Video Signal to Digital Image

• Frame Grabber: A special-purpose hardware which accepts a

• Digitization is done by sampling the continuous signal at a fixed

Hierarchical Image Pyramid

OPERATIONS IMAGE REPRESENTATION

Feature Extraction Features (vectors)

Edge Detection Edges/Lines

Raw Image Data Pixels

• From now on, without specific notice, images refer to digital

• Binary is the simplest type. It refers to black and white image.

Grey Scale Images

• Color images have three-band monochrome image data, each of

... ... red yellow blue violet

"Colors" of the spectrum

Image File Formats

• Many image formats exists: At least 200 image formats in use

Image File Formats (Continued)

• BMP format (bitmap)

Image File Formats (Continued)

• GIF (Graphics Interchange Format): 1,8 bits per pixel indexed

A Typical PGM Format Image

A Typical PGM Format Image (Continued)

• A magic number for identifying the file type. A pgm file’s

P5 Format PGM images

P5 is a variation of P2 format. This format saves PGM images in

• The magic number is P5 instead of P2.

Programming with Images

Programming with Images (Continued)

/* Macros and definitions */

typedef unsigned char UCHAR;

typedef struct _PGMIMAGE /* Structure for containing a PGM format image */

Programming with Images (Continued)

PGMIMAGE *CreatePGM(int xsize, int ysize, int greylevel)

if ((pgm = (PGMIMAGE *) malloc(sizeof(PGMIMAGE))) == NULL) return NULL;

if ((pgm->image = (UCHAR *) malloc(sizeof(UCHAR) * xsize * ysize)) == NULL)

memset(pgm->image, 0, xsize * ysize);

Programming with Images (Continued)

/* LoadPGM() -- Loads a PGM format image file into memory */

PGMIMAGE *LoadPGM(char *filename)

if ((pgm->image = (UCHAR ) malloc(sizeof(UCHAR) xsize * ysize)) == NULL)

PGMIMAGE LoadPGM(char filename)

if ((pgm->image = (UCHAR ) malloc(sizeof(UCHAR) pgm->xsiz