Super-Resolution Enhancement of Text Image Sequences.
Super-Resolution Enhancement of Text Image Sequences.
tially invariant, symmetric PSF. Hence the model becomes
W L} , %
for the -th image
image.
Figure 2. Examples of the synthetic projective
images created with Gaussian smoothing
down-sampling ratio (50 50 pixels).
and W
Knowledge of the PSF for any given image sequence is
usually unavailable, so here it is modelled as a Gaussian.
Comparisons with the measured PSF of several CCD based 3.1. Tests on synthetic images
imaging systems show this approximation to be quite rea-
In order to test various estimators under controlled con-
sonable and this is further verified by good super-resolution
ditions we use synthetic images. The ground-truth image
results obtained on real images. A procedure for estimating
(figure 1) is projectively warped (using bicubic interpo-
the PSF is described in [14].
lation), smoothed with an isotropic Gaussian, and down-
3. The ML estimator sampled (figure 2). Various levels of additive Gaussian
T W/X
Assuming the image noise to be Gaussian with mean noise are then applied.
zero, variance , the total probability of the observed im-
The starting point for all minimizations is the average
age
Y/Z given an estimate of the super-resolution image of the registered (warped) input frames. Such an initial esti-
t N T uwv1x T \IVKLv T N\IKLV X (4) poorly conditioned. Figure 3 shows the result of the ML es-
W %
have been simulated, the residual images (simulated mi- in which estimates are obtained from 5 images with addi-
nus observed) are convolved with a back-projection func- tive Gaussian noise grey-levels using two different
tion (BPF) and warped back into the super-resolution frame. BPF. The narrower BPF (left) produces a noisy result. The
The back-projected errors from all the observed images are wider BPF (right) reduces noise but increases smoothing.
averaged and used to directly update the estimate as follows
yT
If a prior probability distribution on the super-resolution
image is available then this information may be used to
¡
where is a constant and
¤¥¦ is the back-projection ker-
“regularize” the estimation. The maximum a posterior
ular estimate .
Figure 4 shows results of this algorithm applied to the same
² N)º X I»¼J¾½¿M
components that tend to dominate the unconstrained ML es- function,
timates. It is similar to a constrained minimization in which
the smallest (and most troublesome) eigenvectors of the lin-
ear system mentioned above are constrained to zero. The }M¹À À{vÁM X IVÂ}ÃVÄ?ÅaÆ»ÇVÅ
This penalty function encourages local smoothness, whilst Irani and Peleg
being more lenient toward step edges, thus encouraging a
piecewise constant solution.
6. The Total Variation estimator
The total variation norm is commonly used as a regular-
(9)
Ì ÈCÉ ¯v 0 ¸ ¸
The gradient of the TV term is
W
rather blotchy at high noise levels. The MAP and TV esti- in figure 7. The graph in figure 8 shows the corresponding
W ÔL}
mators maintain a more piece-wise constant solution. variation of reprojection error as the PSF varies. Note
7. The point-spread function that the minimum lies at the correct value of .
The point-spread function used in the imaging model can This was the value used to make the original simulated im-
W
have a pronounced effect on the super-resolution estimate. ages. Hence, in the case of affine or projective sequences
This is demonstrated in figure 7. The reason for this be- the value of the PSF may be optimized by gradient de-
haviour is easiest to imagine in the case where the input im- scent.
ages are related by only Euclidean transformations (transla- 7.1. A note on PSF implementation
tion/rotation) and the PSF is isotropic. In this case the op- When implementing the image synthesis process the
erations of warping the super-resolution estimate and con- super-resolution estimate must first be geometrically
volving with the PSF commute. This means that there is a warped, then blurred with the PSF and finally down-
family of PSF/super-resolution pairs that can give rise to the sampled. This gives rise to two alternative schemes which
were not made explicit in Irani and Peleg’s original paper,
Õ
same set of observed images. If the PSF is too “low-pass”
then the super-resolution image develops high-frequency, Either perform the warp onto a regular lattice using an
“ringing” artifacts to compensate. Similarly, if the PSF is interpolation operator (e.g. bicubic), followed by con-
too “high-pass” the estimate becomes smoother. So under volution with a discretized form of the PSF and down-
Euclidean transformations the reprojection error is insen- sampling.
sitive to the size of the PSF, only the cost of the prior or
regularizing term varies. This can confound methods which Õ Or warp the super-resolution image as point samples,
attempt to optimize the PSF along with the super-resolution and convolve with a continuous form of the PSF at the
estimate. required sampling positions.
The same effect is observed when dealing with affine or
projective transformations, although in these cases the re- The former is generally much easier to implement. The
projection error is sensitive to PSF variations, as illustrated warping and convolution are easily optimzed for speed with
1.6
1 1 1
0 0 0
0 50 100 0 50 100 0 50 100 0.6
W ¥ª«¦
0.6 0.6 0.6
Figure 8. The variation of reprojection error with
0.4 0.4 0.4 of the Gaussian point-spread function. Each
data point is the end-point of an estimation using
0.2 0.2 0.2 the MAP estimator applied to 10 synthetic, projec-
tively warped images. The minimum corresponds
0 0 0
0 50 100 0 50 100 0 50 100 to the correct PSF.
PSF sigma = 0.75 (correct) PSF sigma = 0.85 PSF sigma = 0.95
Figure 9. Super-resolution at 2 zoom from 20 im-
ages captured using a Pulnix CCD camera. (a) one
gramming, B(45):503–528, 1989.
[12] S. Mann and R. W. Picard. Virtual bellows: Constructing
high quality stills from video. In International Conference
of the original low-res images, (b) the initial esti- on Image Processing, 1994.