Chapter 2
Chapter 2
LOW
Raw data
Low level digital image processing
• Low level computer vision ~ digital image processing
• Image Acquisition
– image captured by a sensor (TV camera) and digitized
• Pre-processing
– suppresses noise (image pre-processing)
– enhances some object features - relevant to understanding the
image
– edge extraction, smoothing, thresholding etc.
• Image segmentation
– separate objects from the image background
– colour segmentation, region growing, edge linking etc
• Object description and classification
– after segmentation
Signals and Functions
• What is an image
• Signal = function (variable with physical meaning)
– one-dimensional (e.g. dependent on time)
– two-dimensional (e.g. images dependent on two co-
ordinates in a plane)
– three-dimensional (e.g. describing an object in space)
– higher-dimensional
• Scalar functions
– sufficient to describe a monochromatic image - intensity
images
• Vector functions
– represent color images - three component colors
Image Functions
• Image - continuous function of a number of variables
• Co-ordinates x, y in a spatial plane
– for image sequences - variable (time) t
• Image function value = brightness at image points
– other physical quantities
– temperature, pressure distribution, distance from the observer
• Image on the human eye retina / TV camera sensor - intrinsically 2D
• 2D image using brightness points = intensity image
• Mapping 3D real world -> 2D image
– 2D intensity image = perspective projection of the 3D scene
– information lost - transformation is not one-to-one
– geometric problem - information recovery
– understanding brightness info
Image definition
Image definition:
A 2D function obtained by sensing a scene
F(x,y), F(x1,x2), F(x)
F - intensity, grey level
x,y - spatial co-ordinates N
B = no. of bits M
f(N-1,M-1)
B L Description
1 2 Binary Image (black and white)
6 54 64 levels, limit of human visual system
8 256 Typical grey level resolution
Image quality
• Quality of digital image proportional to:
– spatial resolution
• proximity of image samples in image plane
– spectral resolution
• bandwidth of light frequencies captured by sensor
– radiometric resolution
• number of distinguishable gray levels
– time resolution
• interval between time samples at which images captured
Digital Image Storage
• Stored in two parts
– header
• width, height … cookie.
– Cookie is an indicator of what type of image file
– data
• uncompressed, compressed, ascii, binary.
• File types
– JPEG, BMP, PPM.
DIP Course
Digital Image Fundamentals and Image Acquisition
(briefly)
Image Enhancement in Spatial Domain
– Pixel operations
– Histogram processing
– Filtering
Image Enhancement in Frequency Domain
– Transformation and reverse transformation
– Frequency domain filters
– Homomorphic filtering
Image Restoration
– Noise reduction techniques
– Geometric transformations
Cont…
Color Image Processing
– Color models
– Pseudocolor image processing
– Color transformations and color segmentation
Wavelets and Multi-Resolution
Processing
– Multi-resolution expansion
– Wavelet transforms, etc.
Image Compression
– Image compression models
– Error free compression
– Lossy compression, etc
DIP Course
Image Segmentation
– Edge, point and boundary detection
– Thresholding
– Region based segmentation, etc
Image Representation
• Image
– Two-dimensional function f(x,y)
– x, y : spatial coordinates
• Value of f : Intensity or gray level
Digital Image
The human eye responds more precisely to brightness information than it does to color, chroma subsampling
– In a 4:4:4 scheme, each 8×8 matrix of RGB pixels converts to three YCrCb 8×8 matrices: one for
luminance (Y) and one for each of the two chrominance bands (Cr and Cb).
– A 4:2:2 scheme also creates one 8×8 luminance matrix but decimates every two horizontal pixels to create
Ratios of 4:2:0 decimate chrominance both horizontally and vertically, resulting in four Y, one Cr, and one Cb
8×8 matrix for every four 8×8 pixel-matrix sources. This conversion creates half the data required in a 4:4:4
chroma ratio.
Chroma Subsampling
HDTV
Computer Video Format
Depends on the i/p and o/p devices (digitizers) for motion video medium.
Digitizers differ in frame resolution, quantization and frame rate
– IRIS video board VINO takes NTSC video signal and after digitization can
achieve frame resolution of 640x480 pixels, 8 bits/pixel and 4 fps.
– SunVideo digitizer captures NTSC video signal in the form of an RGB signal
with frame resolution of 320x240 pixels, 8 bits/pixel and 30 fps.
Computer video controller standards
– The Color Graphics Adapter (CGA): 320 x 240 pixels x 2 bits/pixel = 16,000
bytes (storage capacity per image)
– The Enhanced Graphics Adapter (EGA): 640 x 350 pixels x 4 bits/pixel =
112,000 bytes
– The Video Graphics Array (VGA): 640 x 480 pixels x 8 bits/pixel = 307,200
bytes
– The 8514/A Display Adapter Mode: 1024 x 768 pixels x 8 bits/pixel = 786,432
bytes
– The Extended Graphics Array (XGA): 1024x768 at 256 colors or 640x480 at
65,000 colors
– The Super VGA (SVGS): Upto 1024x768 pixels x 24 bits/pixel = 2,359,296
bytes