Lecture1 Overview
Lecture1 Overview
Processing
Lecture 1: Class Overview
Wade Trappe
Course Basics
Required Textbook:
– Y. Wang, J. Ostermann, Y-Q. Zhang: Digital Video Processing and
Communications, Prentice-Hall, 2001.
Reference Textbook:
– A.Bovik & J.Gibson: Handbook Of Image & Video Processing,
Academic Press, 2000.
The textbook will not be sufficient for this class (especially
early on).
– I will assign paper readings to supplement the text
– More often than not, these papers will come from IEEE or ACM
– Most likely, I will make the papers available on the course website
Other useful resource: MATLAB’s Image Processing Toolbox
– Goto www.mathworks.com
The Dirty Work…
This class is not a “cake-walk” graduate course:
– It should not be assumed that I will give out automatic A’s.
– In fact, just like other “Core” courses, there could be C’s in this
class.
Image and Video Processing requires a lot of work to
understand:
– Programming in MATLAB is a must
– Programming in C/C++ is highly desirable (Matlab is just too slow
for the massive optimizations you will need later)
– Seeing is believing in Image and Video Processing…
Telling me that the MSE/PSNR is good is nice, but I still want to see
the image!
Seeing images and videos play is half the fun!
Introduction to Image and Video
Processing
Thanks to Min Wu (UMD) for providing many of the slides that follow.
Images and Videos are Efficient and Effective
A doctored political ad
http://marsrovers.jpl.nasa.gov/gallery/press/opportunity/20040125a.html
JPL Mars’ Panorama captured by the Opportunity
Visual representations are often the most efficient way to represent information
Images are used in many scenarios:
– Politics, advertisements, scientific research, military, etc…
Video signals can effectively tell a temporally evolving story
– Arise in cinema (motion pictures)
– Arise in surveillance
– Arise in medical applications
Why Do We Process Images?
From Mathworks
Visual Mosaicing
– Stitch photos together without thread or scotch tape
– 1st & 30th Mpeg4.5Mbps frame of original, marked, and their luminance difference
– human visual model for imperceptibility: protect smooth areas and sharp edges
Error Concealment
(a) original lenna image (b) corrupted lenna image (c) concealed lenna
image
intensity
I: [0, a] [0, b] [0, inf ) 100
I(x,y)
50
Color image 0
100
80
– Can be represented by three functions, 50 60
R(x,y) for red, G(x,y) for green, and B(x,y) y 40
x
20
for blue. rows 0 0 columns
Sampling and Quantization
256x256
64x64
16x16
Examples of Quantizaion
8 bits / pixel
4 bits / pixel
2 bits / pixel