A Real-Time H.264/AVC Encoder & Decoder With Vertical Mode For Intra Frame and Three Step Search Algorithm For P-Frame
A Real-Time H.264/AVC Encoder & Decoder With Vertical Mode For Intra Frame and Three Step Search Algorithm For P-Frame
net/publication/261653610
A Real-Time H.264/AVC Encoder & Decoder with Vertical Mode for Intra Frame
and Three Step Search Algorithm for P-Frame
CITATIONS READS
0 1,263
2 authors:
Some of the authors of this publication are also working on these related projects:
PID Controller Configuration and Tuning Based on Genetic Algorithms View project
All content following this page was uploaded by Mohammed Al-Jammas on 16 April 2014.
ABSTRACT
The video coding standards are being developed to satisfy the requirements of applications for various
purposes, better picture quality, higher coding efficiency, and more error robustness. The new
international video coding standard H.264 /AVC aims at having significant improvements in coding
efficiency, and error robustness in comparison with the previous standards such as MPEG-2, H261,
H263,and H264. Video stream needs to be processed from several steps in order to encode and decode
the video such that it is compressed efficiently with available limited resources of hardware and
software. All advantages and disadvantages of available algorithms should be known to implement a
codec to accomplish final requirement. The purpose of this project is to implement all basic building
blocks of H.264 video encoder and decoder. The significance of the project is the inclusion of all
components required to encode and decode a video in MatLab .
KEYWORDS
H264/AVC , Intra frame (I-frame) , Inter frame (P-frame)
1. INTRODUCTION
A Digital video compression is an important techniques that enables efficient transmission
bandwidth and storage space of multimedia. The H.264/AVC is a standard video coding that
developed to achieve significant improvements, in the compression performance, over the
existing standards. In fact, the high compression performance comes mainly from the
prediction techniques that remove spatial and temporal redundancies. To remove spatial
redundancy, H.264/AVC intra prediction supports many prediction modes to make better
prediction. Inter prediction is enhanced by motion estimation (ME) to remove more temporal
redundancy. However, the H.264/AVC coding performance comes at the price of
computational complexity[1].
H.264/AVC intra encoding achieve higher compression ratio and picture quality compared
with the latest still image coding standard JPEG2000].intra prediction is the first process of
advanced video coding standard. It predicts a macro block by referring to its previous macro
blocks to reduce spatial redundancy. Intra prediction supports nine modes for 4x4 block and
four modes for 16x16 blocks[2].
H.264 is an open, licensed standard that supports the most efficient video compression
techniques available today. Without compromising image quality, an H.264 encoder can
reduce the size of a digital video file by more than 80% compared with the Motion JPEG
format and as much as 50% more than with the MPEG-4 Part 2 standard. This means that
much less network bandwidth and storage space are required for a video file. Or seen another
way, much higher video quality can be achieved for a given bit rate[3].
And also entered as an adjunct of the kind of evolutionary in public services such as video
storage on the Internet and telecommunications companies and surveillance cameras used in
industrial plants , and due to accept this kind of decoder over a wide range of frames during
the second (60/30/25 (fps)) has been expanding in applications control of highways, airports ,
Most of the controversy over the techniques used to process coded information is how fast
and accurate images and video after the code process it possible to re-information fully if it is
reduced to half?. Will be answered by this research, which includes the representation of the
coded H264 and decode of the file with a high level using the MATLAB simulation software
to achieve complete system of coded data and return it in the same efficient used for the
original system.
5. TYPES OF F RAMES
H.264 consists of several different types of frames, such as ( I-P-B), and can be used for
encryption to get the required efficiency below illustrate the theoretical formula for each
quality of frames.
(I-intra frame ) Is an autonomous framework which can encrypt and decrypt
independently without need for another picture as a source of information retrieval,
the first image of the video is for this type of frame, and the (I-frame) is the starting
point for the video display as well as his importance in information retrieval
synchronization if any damage in transport stream bit (bit stream), the flaw in this
window that consumes the largest possible number of bits for encryption because it
takes the window image full but on the other hand, the error rate is low. Encryption
method for this type of window has two properties, depending on the method of
dividing the cluster either ((16x16) or (4x4)) but in General is to convert the frame
version (RGB)format (YCbCr) and separated from the other components of the final
representation and is treated with a single image, so the representation of video
format ((4:2:0) YCbCr) is to reduce the sensitivity of the eye where the eye responds
to brightness by colors so the component (Y) represents the symbol of brightness
luminance while (CbCr) represents the color (chrominance) taken the element Y with
full size while the rest of the elements are reduced by deciding to half the amount of
action in the element size (Y) is (16x16) , the rest of the elements are the size of
(8x8), this means that embedded type of encryption key encryption process.
Encryption process as previously mentioned it is dependent on frame division,
divides the frame into multiple blocks of size (16x16) and has (4) types of encryption
as shown in Figures (eg, Figure 2).
Choose a style for the adoption application and competence required for the
encryption process and the admissibility of the error rate, in most cases the amount of
the error rate of the video is of a higher flexibility compared to the error rate in the
case of a single image.
(P-Inter Frame) Predictive Inter Frame: is derived from the current frame to the video
sequence frame by reducing the time between frames increase unlike previous quality
work only within the space of pixels, the principle of its work essentially compare the
block of the current window with the block of the previous frame and the centre of
block is search for match, this called (matching block), all theories have one and is the
best possible match and this is called motion estimation (ME), after finding the best
match, we put the block of the original block and the remaining known as compensation
(motion compensation), the link between the location of the current block with original
block is the transmission (motion vector (MV)) shown in Figures (eg, Figure 4).
Figure 4. The basic idea to represent the predictive inter frame (P-frame )
B-frames (Bi-predictive inter frame), this type of frame be intermediate between (I,
and B frames) used at high levels for perfect efficiency but complex where the
highest of qualities as follows based on the comparison between more than one
source for block, meaning most forecasts from original source and source is
expected as in Figures (eg, Figure 5).
When retrieving the information in the decryption process is (I-frame), the former to
decrypt followed by (B-P frames) if used, the decryption depend one upon the other in
the information retrieval of the original frame. H264 has several ways in the encryption
that uses encryption (I-frame) or use (I&P ,(I&B&P)) and each method has its qualities,
if you use the first method, the quantity of bits encoded be high compared with other
cases but the error rate low because all the encrypted individually without relying on the
previous window, this method is used in some applications that need high resolution
cameras also in prisons and banks to get clearer picture during the up seizing process as
in Figures (eg, Figure 6) .
But if you use the second method as shown in Figures (eg, Figure 7) they have
characteristics that they reduce the number of bits encoded and the error rate is
acceptable and this method is used in video compression in general and cameras , In the
third grade are more complex than both methods but with a reduced data encrypted and
contains a higher delay method because search matches more than one source.
Figure 7. The first image is a frame ( Frame-I) and encodes a single, the second and third
picture only encrypts mobile part
Motion vector is a simple way to move a lot of information as shown in Figures (eg, Figure
10), but not always give an exact match to give the best quality, taking output subtraction
between the original frame and block the cluster framework forecast output encrypted as
shown in Figures (eg, Figure 11), the encryption process for residual image is similar to the
encryption (I-frame), but the difference is in the process of rounding and through practical
results as in Figures (eg, Figure 12) and found that the compression ratio is 70%of the original
size.
Table 2. Time encode and decode for P-frame
In Figures (eg, Figure 13) shows the encryption and decryption of video sequences
Forman and note that in the case of image encryption and decryption have same
properties.
Figure 13. The Original Foreman Sequence and Return Within Three-Step algorithm
7. CONCLUSIONS
DisplaysH.264 A major step forward in the field of video compression technology. and
provides techniques which enable better compression efficiency, due to more accurate
forecasting capabilities, as well as improving the ability to minimize errors. It provides new
possibilities for creating video encoders that managed to get high quality video and high
frame rates at per second and higher resolution at bitrates (compared to the preceding
criteria), and through the practical results of the MATLAB simulation was found that data
compression during real time up to(70%) of the video size Original part time implementation
261.89 s and time clk (4ns) and highest value can be obtained for a reference to the noise up
to (45 db) also shown in Figures (eg, Figure 14).
H.264 It is expected to replace other compression standards and methods used today, and
form became H.264 more widely available species in network cameras, video encoders and
video management software, designers of systems at the present time, the network video
products support both H.264 and Motion JPEG is perfect for maximum flexibility and
possibilities of integration.
REFERENCE
[1] A. Ben Atitallah, H. Loukil , and N. Masmoudi, FPGA DESIGN FOR H.264/AVC
ENCODER, International Journal of Computer Science, Engineering and Applications
(IJCSEA) Vol.1, No.5, October 2011, pp 119-138.
[2] Manjanaik.N, Dr.Manjunatha.R, Development of Efficient Intra Frame Coding in Advanced
Video Standard Using Horizontal Prediction Mode, International Journal of Emerging
Technology and Advanced Engineering, Volume 3, Issue 2, February 2013, pp 192-196.
[3] H.264 video compression standard, New possibilities within video surveillance, Axis
Communications, White paper, 2008.
[4] Amruta Kiran Kulkarni, Implementation of Fast Inter-Prediction Mode Decision in
H.264/AVC Video Encoder, Master Thesis, May 2012.
Author
Mohammed H. AL-Jammas (Jun’02) born in
1966 in Mosul-Iraq. He awarded BSc in
Electronic and Communication Engineering
from the University of Mosul, Mosul-Iraq in
1988. Next, he awarded the MSc in
Communication from the University of Mosul,
Mosul-Iraq in 1994, and PhD in Computer
Engineering from the University of
Technology, Baghdad-Iraq in 2007. From
2002-2006, Dr. Mohammed worked with the
University of Technology in Baghdad. From
2007, he acts as an Assistance dean of the
College of Electronics Engineering at the
University of Mosul.
Through his academic life he published over 7
papers in field of computer engineering, and
information security.