Lect1 Introduction
Image Processing
Slides modified from materials provided by Drs. Yiming Xiao & Tien Bui
COMP 478/6771
• Instructor: Yaser Esmaeili
• Email: [email protected]
• Office hours: Mon. and Fri., 12:30 - 13:00 ET (via Zoom, by
appointment arranged by email)
• Classroom: LS 210 SGW
• Lectures: Tues: 2:45 pm ~ 5:30 pm
• Tutorials/Labs/Marker:
Sepehr Ghamari Tutorial [email protected]
• Project: 30% of the final mark.
✓ Goal: gain a deep understanding of the work through a team project
and transfer that knowledge to others clearly (transparently)
✓ Students may complete the project in a team of 3~4 members
✓ Select an image processing topic (following the definition given in
class, e.g. segmentation, denoising, captioning, face detection, for
various objectives, etc.) from the publications in the next slide
✓ MATLAB programming deliverable (entire or partial)
✓ Final report should be no more than 10 pages long and contain a brief
paper review, details of the implementation, and all relevant codes
✓ In-class presentation
• Mid-term test: one mid-term test worth 25% of the final mark [Week 8].
• Final exam: 3-hr final exam at the end of term, worth 35% of the final mark.
• Note: A) The total mark for the course is 120 (= 100 + 20 bonus).
B) Submission of a paper with your own novelty/contributions
Rank 4: IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), h5-index 179, h5-median 318
h5-index is the h-index for articles published in the last 5 complete years. It is the largest number h such
that h articles published in 2018-2022 have at least h citations each.
h5-median for a publication is the median number of citations for the articles that make up its h5-index.
➢ MATLAB programming on alternative data sets, comparison of the results,
and final report (Week 13) [15%]
Imaging: Camera Obscura concept
https://owlcation.com/humanities/Leonardo-da-Vincis-Camera-Obscura
Imaging: History of photography
What is a digital image?
• Sensors: CCD (Charge-Coupled Device) & CMOS (Complementary
Metal-Oxide Semiconductor)
✓ Photon -> electron -> voltage
https://www.teledynedalsa.com https://silentpeakphoto.com
CCD from a Sony alpha DSLR-A300 (10.14 Megapixel)
CMOS sensor from Canon 5D Mark II
What is a digital image?
• Divided into array of light-sensitive cells called photosites or pixels
• Each pixel has a gray level or brightness (e.g., 0-255)
• A color pixel needs three channels or bands (RGB)
• Resolution or size: number of rows and columns in the image
1280 x 720 (720p) for HD video
For brain MRI images: ~256x256x120 (~1x1x1 mm^3)
www.javatpoint.com/concept-of-pixel
Color channels
What Is Digital Image Processing?
• A sample digital image: 662*640*256.
• f(x,y): A two-dimensional function,
where x and y are spatial coordinates,
and the amplitude of f at any pair of
coordinates (x,y) is called the intensity
or gray level of the image at that point.
• x size: 662, y size: 640, gray levels:
256
• Digital image: x, y, and the amplitude
values of f are all finite, discrete
quantities
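A minimal sketch of the definition above, written in plain Python rather than the course's MATLAB. The 3x4 array and its values are made up for illustration, and indices are 0-based here.

```python
# A tiny made-up grayscale image: 3 rows, 4 columns, 256 gray levels (0-255).
image = [
    [0,  64, 128, 255],
    [32, 96, 160, 224],
    [16, 80, 144, 208],
]


def f(x, y):
    """Return the intensity (gray level) at row x, column y (0-based)."""
    return image[x][y]


rows, cols = len(image), len(image[0])

# A digital image: coordinates and amplitudes are all finite, discrete values.
is_digital = all(
    isinstance(v, int) and 0 <= v <= 255 for row in image for v in row
)

print(rows, cols, f(1, 2), is_digital)
```

Nested lists keep the sketch dependency-free; an array library would be the usual choice for real images.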
• A sample color digital image: 800*600*24 bits
• Pixel: The elements of a digital image.
Pixels
The Origins of Digital Image Processing
Digital Image Representation
Image Sampling and Quantization
Camera lens
Image representation
• An MxN digital grayscale image (for a color image, f(x,y) would be
one colour component)
Origin at (1,1); x indexes rows, y indexes columns
Digital Image Fundamentals
The Grayscale and Its Perception
• We call the intensity L = f(x,y) the “gray level” value
Number of pixel intensity levels is 2^k (k bits per pixel)
Image size is NxN
Storage Needs for Images
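A hedged worked example of the storage arithmetic: an uncompressed image with k bits per pixel needs rows x columns x k bits. The helper names are hypothetical. The two image sizes are the sample images quoted earlier in these slides, and file headers and compression are ignored.

```python
def bits_for_levels(levels):
    """Smallest k such that 2**k >= levels (e.g. 256 levels -> 8 bits)."""
    return (levels - 1).bit_length()


def storage_bits(rows, cols, bits_per_pixel):
    """Uncompressed storage in bits: one k-bit value per pixel."""
    return rows * cols * bits_per_pixel


def storage_bytes(rows, cols, bits_per_pixel):
    """Uncompressed storage in bytes, rounded up to a whole byte."""
    return (storage_bits(rows, cols, bits_per_pixel) + 7) // 8


# Grayscale sample: 662 x 640 pixels, 256 gray levels -> k = 8.
k = bits_for_levels(256)
gray_bytes = storage_bytes(662, 640, k)

# Color sample: 800 x 600 pixels, 24 bits per pixel (3 channels x 8 bits).
color_bytes = storage_bytes(800, 600, 24)

print(k, gray_bytes, color_bytes)
```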
Number of Pixels and Resolution
Summary
• Definition: An image is a 2-dimensional light intensity
function, f(x,y), where x and y are spatial coordinates,
and f at (x,y) is related to the brightness of the image at
that point.
• Definition: A digital image is the representation of a
continuous image f(x,y) by a 2-D array of discrete
samples.
• The amplitude of each sample is quantized to be
represented by a finite number of bits.
• Definition: Each element of the 2-D array of samples is
called a pixel (Picture Element)
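The summary can be sketched as two steps, sampling and quantization. The continuous scene below is a made-up function with brightness in [0, 1], and the 4x4 grid and 3 bits per sample are arbitrary choices for illustration.

```python
import math


def continuous_image(x, y):
    """Hypothetical continuous scene; brightness lies in [0, 1]."""
    return 0.5 + 0.5 * math.sin(2 * math.pi * x) * math.cos(2 * math.pi * y)


def digitize(func, rows, cols, k):
    """Sample func on a rows x cols grid over [0, 1) x [0, 1), k bits each."""
    levels = 2 ** k
    digital = []
    for r in range(rows):
        row = []
        for c in range(cols):
            value = func(r / rows, c / cols)        # sampling
            q = int(value * levels)                 # quantization
            row.append(max(0, min(q, levels - 1)))  # clamp to 0 .. levels-1
        digital.append(row)
    return digital


digital = digitize(continuous_image, rows=4, cols=4, k=3)
for row in digital:
    print(row)
```

Each entry of `digital` is one pixel: a discrete sample whose amplitude fits in k bits.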
Revisit: What is Digital Image Processing?
Processing Vision
Examples: denoising, landmark labels, face recognition, image classification, segmentation
Why do we process images?
• Facilitate storage and transmission of pictures
✓ Compression of digital images
✓ Transmission of images
• Prepare an image for display
✓ Resizing images
✓ High-dynamic range (HDR) display
• Enhance and restore images
✓ Restoring old movies
✓ Better medical scans
• Information extraction from images
✓ Reading zip codes, bar codes
✓ Robotics: e.g., self-driving cars
General Areas/Techniques of Image Processing
• Compression
• Recognition/classification
• Denoising
• Enhancement
• Restoration
• Segmentation
• Inpainting
• Super-resolution
• Watermarking
• Others
Areas of Image Processing
Compression
- JPEG (1992): Joint Photographic Experts Group; works with color and
grayscale images.
- Compression ratio of lossless methods is not high enough.
- JPEG uses transform coding (DCT = discrete cosine transform)
- JPEG 2000 uses wavelet transforms
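A rough sketch of why transform coding helps, using a 1-D orthonormal DCT-II on one row of 8 pixels. This is a simplification: JPEG itself applies a 2-D DCT to 8x8 blocks, followed by quantization and entropy coding, none of which is shown. The pixel values and the choice to keep 3 coefficients are made up.

```python
import math


def _scale(k, n):
    """Orthonormal scaling factor for DCT coefficient k of an n-point signal."""
    return math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)


def dct_1d(signal):
    """Orthonormal DCT-II of a list of numbers."""
    n = len(signal)
    return [
        _scale(k, n)
        * sum(s * math.cos(math.pi * (i + 0.5) * k / n)
              for i, s in enumerate(signal))
        for k in range(n)
    ]


def idct_1d(coeffs):
    """Inverse of dct_1d (orthonormal DCT-III)."""
    n = len(coeffs)
    return [
        sum(_scale(k, n) * c * math.cos(math.pi * (i + 0.5) * k / n)
            for k, c in enumerate(coeffs))
        for i in range(n)
    ]


row = [52, 55, 61, 66, 70, 61, 64, 73]   # made-up pixel row
coeffs = dct_1d(row)

# Crude "compression": keep the 3 lowest-frequency coefficients, zero the rest.
kept = coeffs[:3] + [0.0] * (len(coeffs) - 3)
approx = idct_1d(kept)

print([round(c, 1) for c in coeffs])
print([round(v) for v in approx])
```

For smooth rows most of the energy sits in the first few coefficients, so discarding the rest changes the pixels only slightly.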
Image Compression
At 158:1 compression ratio
Courtesy of Minh Do
Areas of Image Processing
Recognition and classification
Applications: medical imaging, pattern recognition, video processing,
document processing, industrial applications
Recognition-Classification
Feature extraction (e.g. content-based image retrieval)
Images -> feature extraction -> signatures
Query -> feature extraction -> similarity measurement (against the signatures)
-> N best matched images
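The retrieval pipeline above can be sketched with very simple stand-ins: a coarse gray-level histogram as the signature and an L1 distance as the similarity measure. Both are assumptions chosen for brevity and are not given in the slides. The tiny 2x2 images and all names are hypothetical.

```python
def histogram_signature(image, bins=4, levels=256):
    """Feature extraction: normalized gray-level histogram with a few bins."""
    counts = [0] * bins
    total = 0
    for row in image:
        for v in row:
            counts[v * bins // levels] += 1
            total += 1
    return [c / total for c in counts]


def l1_distance(a, b):
    """Similarity measurement: smaller distance means more similar."""
    return sum(abs(x - y) for x, y in zip(a, b))


def best_matches(query, database, n=2):
    """Return the names of the n database images closest to the query."""
    q = histogram_signature(query)
    ranked = sorted(
        database,
        key=lambda name: l1_distance(q, histogram_signature(database[name])),
    )
    return ranked[:n]


database = {
    "dark": [[10, 20], [30, 40]],
    "bright": [[200, 220], [240, 250]],
    "mixed": [[10, 240], [20, 250]],
}
query = [[5, 15], [25, 60]]

result = best_matches(query, database, n=2)
print(result)
```

In practice the signatures would be computed once and stored, so only the query needs feature extraction at search time.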
Areas of Image Processing
Denoising
Areas of Image Processing
Restoration
support.zoom.us
https://www.sciencetimes.com/articles/39764/20220906/openai-introduces-outpainting-dall-e-expands-borders-picture.htm
What are the applications?
Face Detection (identity, sex, age, etc.)
towardsdatascience.com/an-intro-to-deep-learning-for-face-recognition-aa8dfbbc51fb
www.youtube.com/watch?v=Pc2aJxnmzh0&feature=emb_title
Digital Document Processing
User-entered search keywords: abonnement, résilier (French: "subscription", "cancel")
Scene Text Detection
Digital photography
squeeze
regular HDR
www.graphitech.net
https://becominghuman.ai/computer-vision-applications-in-self-driving-cars-610561e14118
Medical Imaging
Timeline of imaging modalities (years shown: 1895, 1953, 1972, 1973): X-ray, CT, MRI, PET, ultrasound
MRI contrasts: T1w, T2w
Special image characteristics & features
3D image: voxel instead of pixel
4D image: 3D image in time series
Medical Imaging
Examples: blood vessel enhancement; generating CT from MRI
• Distance Metrics
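The slide names only the topic. As an assumption about what is meant, the sketch below shows three distance measures between pixel coordinates that are commonly introduced in image processing: Euclidean, city-block (D4) and chessboard (D8). Function names are hypothetical.

```python
import math


def euclidean(p, q):
    """Straight-line distance between pixels p = (x, y) and q = (s, t)."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


def city_block(p, q):
    """D4 distance: steps allowed only along rows and columns."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def chessboard(p, q):
    """D8 distance: diagonal steps count the same as straight steps."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))


p, q = (0, 0), (3, 4)
print(euclidean(p, q), city_block(p, q), chessboard(p, q))
```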