Lin - Fast Motion Compensation
Lin - Fast Motion Compensation
Abstract—This paper proposes a fast block-matching algorithm in one complete measurement of are absolute
that uses three fast matching error measures, besides the conven- values (or squarings when and additions.
tional mean-absolute error (MAE) or mean-square error (MSE). The best matched block corresponds to the candidate block
An incoming reference block in the current frame is compared
to candidate blocks within the search window using multiple of its upper left corner located at which has
matching criteria. These three fast matching error measures are the minimum matching error A straightforward
established on the integral projections, having the advantages method of BMA is the full-search BMA (FBMA) which
of being good block features and having simple complexity in requires to compute the ’s for all positions
measuring matching errors. Most of the candidate blocks can of candidate blocks in the search window; that is, the FBMA
be rejected only by calculating one or more of the three fast
matching error measures. The time-consuming computations of needs absolute values (or squarings),
MSE or MAE are performed on only a few candidate blocks that additions, and comparisons for
first pass all three fast matching criteria. Simulation results show each reference block; however, it is an intensive computation
that a reduction of over 86% in computations is achieved after process, limiting its practical applications. Many well-known
integrating the three fast matching criteria into the full-search fast algorithms [3]–[13] have been developed to reduce such
algorithm, while ensuring optimal accuracy.
highly computational complexity of the full-search BMA by
considering only a limited number of the motion vectors in
I. INTRODUCTION the search window at the expense of estimate accuracy. That
is, only suboptimal estimate accuracy is guaranteed by these
M OTION estimation using a block-matching algorithm
(BMA) is widely used in many motion-compensated
video coding systems, such as those recommended by the
algorithms. Concerning the VLSI implementation, most of
these fast algorithms, e.g., the three-step search (TSS) [3],
H.261 and MPEG standards [1], [2], to remove interframe have the drawbacks of irregular data flow and high control
redundancy and thus achieve high data compression. In a overhead, while the full-search BMA has the advantages of
typical BMA, the current frame of a video sequence is divided regular data flow and low control overhead [14], [15].
into nonoverlapping square blocks of pixels, say, of size Recently, a number of algorithms with regard to the pattern-
For each reference block in the current frame, BMA matching problems [16]–[19] make use of integral projec-
searches for the best matched block within a search window tions to simplify the computational complexity of the pattern-
of size in the previous frame, where matching operation. However, all of the previous research
stands for the maximum allowed displacement. Then the work on motion estimation using integral projections has never
relative position between the reference and its best matched provided any optimality-preserving ability like the FBMA.
block is represented as the motion vector of the reference Integral projections are good features describing the block
block. A nonnegative matching error function is mean intensity and the edge location and orientation in a
defined over all the positions to be searched, i.e., block of pixels, and are most likely to be different for different
blocks. In this letter, a fast full-search BMA (FFBMA), which
is also based on the uses of integral projections, is presented
to provide much faster motion estimation than that using
the traditional FBMA, while preserving the optimality of
estimate accuracy. In fact, there still exist similar ideas being
or and (1) realized by other techniques for fast vector quantization (VQ),
such as the partial distortion search (PDS) [20], the triangle-
where is the reference block of its upper left pixel at eliminating rule (TIE) [21], or VQ using mean pyramids of
the coordinate in the current frame, and vectors [22]. These fast VQ algorithms converge to a common
is a candidate block of its upper left pixel at the coordinate goal to reject most entries in the codebook that are not
in the previous frame. The computations incurred best matched to the target block using only the partial and
Paper approved by M. R. Civanlar, the Editor for Image Processing of simple information in the blocks. It is not straightforward or
the IEEE Communications Society. Manuscript received September 8, 1995; even difficult to extend directly these fast algorithms to the
revised May 25, 1996. This work was supported in part by the National
Science Council of Taiwan, R.O.C. under Contract NSC86-2221-E-006-057. motion estimation task. For example, in [22], Lee and Chen
This paper was presented in part at the 13th IEEE International Conference defined a sequence of fast matching criteria, each associated
on Pattern Recognition, Vienna, Austria, August 1996. with a different level of the mean pyramids of blocks, and
The authors are with the Institute of Electrical Engineering, National Cheng
Kung University, Tainan, Taiwan, Republic of China. employed these criteria in the “coarse-to-fine” manner to
Publisher Item Identifier S 0090-6798(97)03723-9. promote speed in searching for the nearest neighbor in a VQ
0090-6778/97$10.00 1997 IEEE
528 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 45, NO. 5, MAY 1997
(3)
3) massive projection:
(4)
(5)
(6)
IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 45, NO. 5, MAY 1997 529
TABLE I TABLE IV
COMPARISON OF THE COMPUTATIONAL COMPLEXITY FOR COMPARISON OF THE COMPUTATION COMPLEXITY FOR VARIOUS
VARIOUS BLOCK-MATCHING ALGORITHMS WITH THE MSE ALGORITHMS WITH MAE WORKING ON 60 Flower Garden FRAMES
CRITERION ACCORDING TO THE NUMBERS OF THE ARITHMETIC OF 720 2 480 SIZE ACCORDING TO THE NUMBERS OF ARITHMETIC
OPERATIONS REQUIRED FOR EACH 16 2 16 REFERENCE BLOCK OPERATIONS REQUIRED FOR EACH 16 2 16 REFERENCE BLOCK
of sequences like Salesman. As for the sequence of Flower [6] S. Kapagantula and K. R. Rao, “Motion predictive interframe coding,”
Garden, a smaller computation reduction is achieved by the IEEE Trans. Commun., vol. COM-33, pp. 1011–1015, Sept. 1985.
[7] A. Puri, H. M. Hang, and D. L. Schilling, “An efficient block-matching
FFBMA as compared to Fok and Au’s method. algorithm for motion-compensated coding,” in Proc. Int. Conf. Acoust.,
These results verify the efficiency of the proposed FFBMA Speech, Signal Processing, 1987, pp. 25.4.1–25.4.4.
with either MSE and MAE. With the experiments, it is [8] M. Ghanbari, “The cross-search algorithm for motion estimation,” IEEE
Trans. Commun., vol. 38, pp. 950–953, July 1990.
concluded that the FFBMA, which uses the three fast matching [9] C. H. Hsieh, P. C. Lu, J. S. Shyn, and E. H. Lu, “Motion estimation
criteria, can perform much faster than the FBMA and the algorithm using interblock correlation,” Electron. Lett., vol. 26, pp.
PDSBMA at the same estimate accuracy. 276–277, Mar. 1990.
[10] B. Liu and A. Zaccarin, “New fast algorithms for the estimation of block
motion vectors,” IEEE Trans. Circuits Syst. Video Technol., vol. 3, pp.
IV. CONCLUSIONS 148–157, Apr. 1993.
[11] L. W. Lee, J. F. Wang, J. Y. Lee, and J. D. Shie, “Dynamic search-
A new fast full-search block-matching algorithm is pre- window adjustment and interlaced search for block-matching algo-
rithm,” IEEE Trans. Circuits Syst. Video Technol., vol. 3, pp. 85–87,
sented in this paper. It runs much faster than the traditional
Feb. 1993.
full-search BMA, while the optimal accuracy of motion esti- [12] M. J. Chen, L. G. Chen, and T. D. Chiueh, “One-dimensional full search
mation is guaranteed. This improvement of speed is based on motion estimation algorithm for video coding,” IEEE Trans. Circuits
Syst. Video Technol., vol. 4, pp. 504–509, Oct. 1994.
the fact that multiple matching errors which have different lev- [13] F. Dufaux and F. Moscheni, “Motion estimation techniques for digital
els of computation complexity are available on each position to TV: A review and a new contribution,” Proc. IEEE, vol. 83, pp.
be searched. The relationships among the multiple matching 858–876, June 1995.
[14] K. M. Yang, M. T. Sun, and L. Wu, “A family of VLSI designs for the
errors of a candidate position are utilized to construct three motion compensation block-matching algorithm,” IEEE Trans. Circuits
test conditions which can be employed during block matching Syst., vol. 36, pp. 1317–1325, Oct. 1989.
to avoid the time-consuming computations of MSE or MAE [15] H. M. Jong, L. G. Chen, and T. D. Chiueh, “Parallel architectures for 3-
step hierachical search block-matching algorithm,” IEEE Trans. Circuits
measurements. With the experiments, the proposed method Syst. Video Technol.. vol. 4, pp. 407–416, Aug. 1994.
can give a great amount of savings of computations, and thus [16] J. S. Kim and R. H. Park, “A fast feature-based block matching
can be well suited for a wide range of applications, such as algorithm using integral projections,” IEEE J. Select. Areas Commun.,
vol. 10, pp. 968–971, June 1992.
videotelephony, videoconferencing, and HDTV. [17] Y. H. Fok and O. C. Au, “An improved fast feature-based block motion
estimation,” in Proc. IEEE 1994 Int. Conf. Image Processing, 1994, pp.
REFERENCES 741–745.
[18] H. B. Park and C. Wang, “Image compression by vector quantization of
[1] CCITT SG XV, “Recommendation H.261—Video codec for audiovisual projection data,” IEICE Trans. Inform. Syst., vol. E75-D, pp. 148–155,
services at p*64 kbits/s,” Tech. Rep. COM XV-R37-E, Aug. 1990. Jan. 1992.
[2] MPEG, “ISO CD 11172-2: Coding of moving pictures and associated [19] K. H. Jung and C. Wang, “Projective image representation and its
audio for digital storage media at up to about 1.5 M bits/s,” Nov. 1991. application to image compression,” IEICE Trans. Inform. Syst., vol.
[3] T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, “Motion E79-D, pp. 136–142, Feb. 1996.
compensated interframe coding for video conferencing,” in Proc. Nat. [20] C. Bei and R. M. Gray, “An improvement of the minimum distortion
Telecommun. Conf. New Orleans, LA, Nov. 1981, pp. G5.3.1–G5.3.5. encoding algorithm for vector quantization,” IEEE Trans. Commun., vol.
[4] J. R. Jain and A. K. Jain, “Displacement measurement and its application COM-33, pp. 1132–1133, Oct. 1985.
in interframe image coding,” IEEE Trans. Commun., vol. COM-29, pp. [21] S. H. Huang and S. H. Chen, “Fast encoding algorithm for VQ-based
1799–1808, Dec. 1981. image coding,” Electron. Lett., vol. 26, pp. 1618–1619, Sept. 1990.
[5] R. Srinivasan and K. R. Rao, “Predictive coding based on efficient [22] C. H. Lee and L. H. Chen, “A fast search algorithm for vector quantiza-
motion estimation,” IEEE Trans. Commun., vol. COM-33, pp. 888–896, tion using mean pyramids of codewords,” IEEE Trans. Commun., vol.
Aug. 1985. 43, pp. 1697–1702, Feb./Mar./Apr. 1995.