Image Transforms Tutorial
Image Transforms Tutorial
announcements
HW#2 due today HW#3 out Midterm next week, class time+location
Monday March 9th (4:10-6:40, Mudd 1127) Open-book
YES: text book(s), class notes, calculator NO: computer/cellphone/matlab/internet
Grading breakdown
HW-Midterm-Final: 30%-30%-40%
outline
Recap of DFT and DCT Unitary transforms KLT Other unitary transforms Multi-resolution and wavelets Applications
Readings for today and last week: G&W Chap 4, 7, Jain 5.1-5.11
inverse transform
N=32
real(A)
imag(A)
Symmetric 2D separable transforms can be expressed with the notations of its corresponding 1D transform.
Exercise
How do we decompose this picture?
DCT2
1 A2 = 2
1 1 1 1
=-1 =1
1 1 1 1
=0 =1
0 0 0 1
DCT2
? ? ? ?
Orthonormal (Eq 5.5 in Jain) : no two basis represent the same information in the image
Completeness (Eq 5.6 in Jain) : all information in the image are represented in the set of basis functions
minimized when
Unitary Transforms
A linear transform:
The Hermitian of matrix A is: This transform is called unitary when A is a unitary matrix, orthogonal when A is unitary and real.
Exercise
Are these transform matrixes unitary/orthogonal?
2 3 1 2
1 2 1 2 1 2 1 2
cos sin sin cos
2 j
j 2
1 2
1 j j 1
Unitary/orthogonal checklist:
determinant equals 1, |A|=1 unit row/column vector orthogonal row/column vectors, AAH=I
rotation invariance
the angles between vectors are preserved
unitary transform: rotate a vector in Rn, i.e., rotate the basis coordinates
optimal should be defined in a statistical sense so that the transform works well with many images
optimal for each signal is ill-defined
constraints:
matrix identity
assume real input, write AT instead of AH denote the inverse transform matrix as A, AAT=I Rx is symmetric for real input, Hermitian for complex input i.e. RxT=Rx, RxH = Rx Rx nonnegative definite, i.e. has real non-negative eigen values Attributions
Kari Karhunen 1947, Michel Love 1948 a.k.a Hotelling transform (Harold Hotelling, discrete formulation 1933) a.k.a. Principle Component Analysis (PCA, estimate Rx from samples)
note: other matrices (unitary or nonunitary) may also de-correlate the transformed sequence [Jains example 5.5 and 5.7]
The ugly
Data dependent
Need a good estimate of the second-order statistics Increased computation complexity
data: linear transform: fast transform: estimate Rx: compute eig Rx:
Rx and 2 Rx-1 have the same eigen vectors 2 Rx-1 ~ Qc when is close to 1 DCT basis vectors are eigenvectors of a symmetric tri-diagonal matrix Qc
0.0136
0.1055
transform basis
0.1185
1.0000
DFT DCT
KLT
?
?
Implementation
Real-valued Separable Fast to compute w. butterfly-like structure Same implementation for forward and inverse transform
x x x x
Walsh-Hadamard Transform
slant transform
applications of transforms
enhancement (non-universal) compression feature extraction and representation pattern recognition, e.g., eigen faces dimensionality reduction
analyze the principal (dominating) components
Image Compression
Gabor filters
Gaussian windowed Fourier Transform
Make convolution kernels from product of Fourier basis images and Gaussians
Frequency
Filter bank
Input image
outline
Recap of DFT and DCT Unitary transforms KLT Other unitary transforms Multi-resolution and wavelets Applications
sampling (dirac)
FT
STFT
analysis
processing
synthesis
h
x
h
x
1.
3.
Then
The rows of T are the basis functions The rows of T are the reversed versions of the filters
5.
DWT
Iterate only on the lowpass channel
f
wavelet packet
Iterate on both the low pass and (selected) high-pass channels
wavelet packet
First stage: full decomposition
wavelet packet
Second stage: pruning
Cost(parent) < Cost(children)
FT
STFT
t f WT f WP
in a nutshell
if is if is = if is frame if is square and nonsingular, is a basis and is its dual basis unitary, that is, * = I, is an orthonormal basis and rectangular and full rank, is a frame and is its dual =
applications of wavelets
enhancement and denoising compression and MR approximation fingerprint representation with wavelet packets bio-medical image classification subdivision surfaces Geris Game, A Bugs Life, Toy Story 2
NIST 24 fingerprint database 10 people (5 male & 5 female), 2 fingers 20 classes, 100 images/class
Overviews
Books
Wavelets and Subband Coding, M. Vetterli and J. Kovacevic, Prentice Hall, 1995. A Wavelet Tour of Signal Processing, S. Mallat, Academic Press, 1999. Ten Lectures on Wavelets, I. Daubechies, SIAM, 1992. Wavelets and Filter Banks, G. Strang and T. Nguyen, Wells. Cambr. Press, 1996.
summary
unitary transforms
theory revisited the quest for optimal transform example transforms DFT, DCT, KLT, Hadamard, Slant, Haar,
10 yrs
1 yr