HEVC 4 VideoSense

A quick overview of HEVC: current status
Prof. Dr. Touradj Ebrahimi
December 16th, 2011, VideoSense Skype meeting
http://www.videosense.eu
Multimedia Signal Processing Group, Swiss Federal Institute of Technology, Lausanne
H.264/AVC: Main Functionalities
Coding efficiency (about 50% bit-rate reduction compared to previous standards)
  Enhanced motion compensation, multiple reference frames, hierarchical B frames, improved de-blocking filter, smaller blocks for transform, enhanced entropy coding
Network friendliness
  Enabled through the Network Abstraction Layer (NAL) design
Error resilience
  Data partitioning, flexible macroblock ordering (FMO), slices, redundant slices, resynchronization markers, multiple reference pictures, parameter sets, etc.
Temporal scalability
  Enabled by flexible reference picture management via hierarchical prediction
Basic Coding Architecture

[Figure: block diagram of the H.264/AVC hybrid encoder. The input video signal is split into macroblocks of 16x16 pixels. Blocks shown: coder control, transform/scaling/quantization, scaling and inverse transform, intra-frame estimation and prediction, motion estimation and compensation, intra/inter MB selection, deblocking filter, and entropy coding of the control data, quantized transform coefficients, intra prediction data, and motion data. The reconstruction path produces the output video signal.]
Similar to Previous Standards
Macroblocks: 16x16 luma + two 8x8 chroma sample blocks
Input: association of luma and chroma, and conventional sub-sampling of chroma (4:2:0)
Block-based motion displacement
Motion vectors over picture boundaries
Variable block-size motion
Block transforms
Scalar quantization
I, P, and B coding types
MBs and sub-MBs for Motion Compensation
[Figure: macroblock partitions (16x16, 16x8, 8x16, 8x8) and sub-macroblock partitions (8x8, 8x4, 4x8, 4x4)]
Multi-frame Prediction
[Figure: the current picture N is predicted from several previously decoded reference pictures (N-1, N-2, N-3, N-4, ...)]
Transform Coding
The H.264/MPEG-4 AVC standard uses 3 transforms, depending on the type of prediction residual to be coded:
1. 4x4 Hadamard transform for the luminance DC coefficients in MBs coded with the Intra 16x16 mode
2. 2x2 Hadamard transform for the chrominance DC coefficients in any MB
3. 4x4 integer transform based on the DCT for all the other blocks
[Figure: numbering of the blocks within a macroblock: block -1 (luma DC of an Intra 16x16 MB), luma 4x4 blocks 0 to 15, 2x2 chroma DC blocks 16 (Cb) and 17 (Cr), chroma AC blocks 18 to 25]
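A minimal sketch of the third transform, assuming the well-known 4x4 forward core matrix of H.264/AVC (the slide does not list it). The scaling that makes the transform a DCT approximation is folded into quantization and is omitted here; the helper names are illustrative only.

```python
# Forward 4x4 core transform matrix of H.264/AVC (integer-only, no scaling).
CF = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]

def matmul(a, b):
    """Plain integer matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(row) for row in zip(*m)]

def forward_core_transform(block):
    """Return W = CF * X * CF^T for a 4x4 residual block X."""
    return matmul(matmul(CF, block), transpose(CF))

# A flat residual block puts all of its energy into the DC coefficient.
flat = [[3] * 4 for _ in range(4)]
coeffs = forward_core_transform(flat)
```

Only additions, subtractions, and shifts by one are needed, which is why encoder and decoder can stay bit-exact.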
4x4 Intra Prediction Directions
Mode 0 - Vertical
Mode 1 - Horizontal
Mode 2 - DC
Mode 3 - Diagonal Down/Left
Mode 4 - Diagonal Down/Right
Mode 5 - Vertical-Right
Mode 6 - Horizontal-Down
Mode 7 - Vertical-Left
Mode 8 - Horizontal-Up
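A small sketch of the three simplest modes (0, 1, and 2), assuming all eight neighbouring samples are available. The function name and argument layout are illustrative; the six diagonal modes are left out.

```python
def predict_4x4(mode, above, left):
    """Predict a 4x4 block from the 4 reconstructed samples above it
    and the 4 reconstructed samples to its left."""
    if mode == 0:   # Vertical: copy the row above into every row
        return [list(above) for _ in range(4)]
    if mode == 1:   # Horizontal: copy each left sample across its row
        return [[left[r]] * 4 for r in range(4)]
    if mode == 2:   # DC: rounded mean of the 8 neighbouring samples
        dc = (sum(above) + sum(left) + 4) >> 3
        return [[dc] * 4 for _ in range(4)]
    raise ValueError("only modes 0-2 are sketched here")

above = [10, 20, 30, 40]
left = [50, 60, 70, 80]
dc_block = predict_4x4(2, above, left)
```

The encoder would then code only the difference between the actual block and the chosen prediction.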
H.264/AVC Profiles
H.264/AVC: a Success Story

3GPP (recommended in rel 6)
3GPP2 (optional for streaming service)
ARIB (Japan mobile segment broadcast)
ATSC (preliminary adoption for robust-mode back-up channel)
Blu-ray Disc Association (mandatory for Video BD-ROM players)
DLNA (optional in first version)
DMB (Korea - mandatory)
DVB (specified in TS 102 005 and one of two in TS 101 154)
DVD Forum (mandatory for HD DVD players)
IETF AVT (RTP payload spec approved as RFC 3984)
ISMA (mandatory specified in near-final rel 2.0)
SCTE (under consideration)
US DoD MISB (US government preferred codec up to 1080p)
(and, of course, MPEG and the ITU-T)
Scalable Coding: Objectives

Scalability is a functionality regarding the useful decoding of parts of the encoded bitstream, ideally
1. while achieving an RD performance at any supported spatial, temporal, or SNR resolution that is comparable to single-layer coding at that particular resolution, and
2. without significantly increasing the decoding complexity.
Main SVC Requirements

Similar coding efficiency compared to single-layer coding for each subset of the scalable bit stream.
Little increase in decoding complexity compared to single-layer decoding that scales with the decoded spatio-temporal resolution and bit rate.
Support of temporal, spatial, and quality scalability.
Support of a backward compatible base layer (H.264/AVC in this case).
Support of simple bitstream adaptations after encoding.
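The last requirement can be pictured as plain packet filtering. The sketch below assumes each packet carries spatial, temporal, and quality layer identifiers; the `Packet` record and its field names are hypothetical and do not reproduce the actual SVC NAL unit header syntax.

```python
from collections import namedtuple

# Hypothetical packet record: layer identifiers plus an opaque payload.
Packet = namedtuple("Packet", "spatial temporal quality payload")

def extract_substream(packets, max_spatial, max_temporal, max_quality):
    """Keep only the packets needed for the requested operating point.
    No re-encoding is involved: packets above the target layers are dropped."""
    return [p for p in packets
            if p.spatial <= max_spatial
            and p.temporal <= max_temporal
            and p.quality <= max_quality]

stream = [
    Packet(0, 0, 0, b"base"),
    Packet(0, 1, 0, b"base, higher frame rate"),
    Packet(1, 0, 0, b"spatial enhancement"),
    Packet(1, 1, 1, b"spatial + quality enhancement"),
]
base_only = extract_substream(stream, 0, 1, 0)
```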
SVC Applications
Robust Video Delivery
  Adaptive delivery over error-prone networks and to devices with varying capability
  Combined with unequal error protection
  Internet/mobile transmission
Scalable Storage
  Scalable export of video content
  Graceful expiration or deletion
  Surveillance DVRs and Home PVRs
Enhancement Services
  Upgrade delivery from 1080i/720p to 1080p
  DTV broadcasting, optical storage devices
SVC Alternatives
Simulcast
  Simplest solution
  Code each layer as an independent stream
  Incurs increase of rate
Stream Switching
  Viable for some application scenarios
  Lacks flexibility within the network
  Requires more storage/complexity at server
Transcoding
  Low cost, designed for specific application needs
  Already deployed in many application domains
Spatio-Temporal-Quality Cube
[Figure: cube of operating points contained in the global bitstream. Axes: spatial resolution (QCIF, CIF, 4CIF), temporal resolution (7.5, 15, 30, 60), and bit rate (quality, SNR) from low to high]
SVC Coding Architecture

[Figure: SVC encoder with three spatial layers obtained by spatial decimation of the input. Each layer consists of hierarchical MCP and intra prediction, base layer coding of texture and motion, and progressive SNR refinement texture coding. The higher layers use inter-layer prediction of intra, motion, and residual data. The lowest layer is coded by an H.264/AVC compatible encoder and produces an H.264/AVC compatible base layer bit-stream; a multiplexer combines all layers into the scalable bit-stream. Slide annotation: "2007 !"]
Hierarchical Prediction Structures for Temporal Scalability
[Figure] (a) coding with hierarchical B pictures, (b) non-dyadic hierarchical prediction structure, (c) hierarchical prediction structure with a structural encoder/decoder delay of zero. The numbers below the pictures specify the coding order, and the symbols Tk specify the temporal layers, with k representing the corresponding temporal layer identifier.
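For the dyadic case (a), the temporal layer identifier k can be computed from the picture position alone. The sketch below assumes a GOP size that is a power of two and pictures indexed in display order; the function name is illustrative.

```python
def temporal_layer(index, gop_size):
    """Temporal layer id k of a picture in a dyadic hierarchy.
    Assumes gop_size is a power of two; index is the display-order position."""
    levels = gop_size.bit_length() - 1          # log2(gop_size)
    pos = index % gop_size
    if pos == 0:
        return 0                                # key pictures form layer T0
    trailing_zeros = (pos & -pos).bit_length() - 1
    return levels - trailing_zeros

layers = [temporal_layer(i, 8) for i in range(9)]

# Dropping the highest layer halves the frame rate.
half_rate = [i for i in range(9) if temporal_layer(i, 8) <= 2]
```

Because pictures in layer Tk reference only layers up to k, any set of top layers can be discarded without breaking the decoding of the rest.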
Trading Enhancement Layer Coding Efficiency and Drift for Packet-based Quality Scalable Coding
[Figure] (a) base layer only control, (b) enhancement layer only control, (c) two-loop control, (d) key picture concept of SVC for hierarchical prediction structures, where key pictures are marked by the hatched boxes.
SVC Scalability Types

Temporal scalability - Can typically be achieved without losses in rate-distortion performance.
Spatial scalability - When applying an optimized SVC encoder control, the bit rate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for dyadic spatial scalability. The results become worse as the spatial resolution of both layers decreases and improve as the spatial resolution increases.
SNR scalability - When applying an optimized encoder control, the bit rate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for all supported rate points when spanning a bit rate range with a factor of 2-3 between the lowest and highest supported rate point.
From IEEE Transactions on Circuits and Systems for Video Technology, September 2007.
SVC Performance: Spatial Scalability
10-15% gains over simulcasting
Performs within 10% of single layer coding
[Segall & Sullivan, T-CSVT, Sept. 2007]
SVC Profiles
Multiview Video Coding (MVC)
In addition to exploiting the temporal and spatial redundancy within each view to achieve coding gains, redundancy can also be exploited across the different views.
The JVT standardization goal is to reach 50% bitrate savings over independent coding of views at the same quality by defining another extension to the H.264/AVC standard.
Multiview Video Data
Redundancy reduction between camera views
  Need to cope with color/illumination mismatch problems
  Alignment may not always be perfect either
MVC Prediction Structures
Many prediction structures are possible to exploit inter-camera redundancy: trade-off in memory, delay, computation and coding efficiency.
[Figure: prediction structures over the time and view axes, comparing the MPEG-2 Video Multiview profile with (JVT) MVC]
MVC: Technical Solution
Current H.264/AVC multiview extension does not require any changes to lower-level syntax
  Very compatible with single-layer AVC hardware
Inter-view prediction
  Enabled through flexible design of decoded reference picture management
  Allows decoded pictures from other views to be inserted into and removed from the reference picture buffer
Small changes to high-level syntax
  Specify view dependency
Some MVC Performance Results
Towards ITU-T Recommendation H.265

H.265 is a long-term video coding standard, launched by ITU-T VCEG.
Not yet formalized, but VCEG keeps seeking proposals and information regarding the possibility of a major performance gain to justify the step from H.264 to H.265.
Though the necessary scope of H.265 is yet largely to be determined, it is agreed that among the goals will be:
  High coding efficiency, e.g., two times compared with H.264/AVC
  Computational efficiency, considering both encoder and decoder
  Loss/error robustness
  Network friendliness
So far, contributions to VCEG have mainly focused on improving coding efficiency.
To better evaluate these contributions and retain progress, the KTA (Key Technical Area) software platform has been developed, using JM11 as the baseline and continuously integrating promising tools.
KTA Inter Prediction Tools
Inter prediction
  Adaptive interpolation filter (AIF)
    2-D non-separable AIF (AD08, AE16)
    Separable AIF (COM16-C219, AG10)
    Directional AIF (DAIF) (AG21, AG22, AH17, AH18)
    Enhanced DAIF (E-DAIF) (AI12, COM16-C125, COM16-C126)
    Enhanced AIF (EAIF) (C464, AI38, AJ30)
    Switched interpolation filters with offsets (SIFO) (C463, AI35, AJ29, COM16-C126)
    High precision filter (HPF) (AI33)
    Single-pass encoding (AJ29, AK26)
  1/8-pel motion compensated prediction (MCP) (AD09)
  Extended MCP block size (COM16-C123)
  Competition-based MV prediction (AC06)
KTA Transform and Quantization Tools
Transform and Quantization
  Mode-dependent directional transform (MDDT) (AF15, AG11, AH20, AJ24, AI36)
  Very large block transform (COM16-C123)
  Adaptive prediction error coding (APEC) (AB06, AD07, AE15)
  Adaptive quantization matrix selection (AQMS) (AC07, AD06, AF08, AI19)
  Rate-distortion optimized quantization (RDO-Q) (AH21)
Other KTA Tools
Entropy coding
  Parallel CABAC (COM16-C405, AI32)
In-loop filter
  Block-based adaptive loop filter (BALF) (AI18, AJ13)
  Quadtree-based adaptive loop filter (QALF) (COM16-C181, AK22)
Post filter (AI34, COM16-C128)
Internal bit depth increasing (IBDI) (AE13, AF07)
JCT-VC roadmap toward HEVC standardization

First meeting: Dresden, Germany, 15-23 Apr. 2010
  188 people, 40 input documents
Second meeting: Geneva, Switzerland, 21-28 July 2010
Third meeting: Guangzhou, China, 7-15 Oct. 2010
Fourth meeting: Daegu, Korea, 20-28 Jan. 2011
Fifth meeting: Geneva, Switzerland, 16-23 Mar. 2011
Sixth meeting: Turin, Italy, 14-22 July 2011
Seventh meeting: Geneva, Switzerland, 21-30 Nov. 2011
Eighth meeting: San Jose, United States, 1-10 Feb. 2012 [ISO/IEC CD]
Ninth meeting: Geneva, Switzerland, 30 Apr. - 9 May 2012
Tenth meeting: Stockholm, Sweden, 11-20 July 2012 [ISO/IEC DIS]
Eleventh meeting: Shanghai, China, 10-19 Oct. 2012
Twelfth meeting: Geneva, Switzerland, 17-23 Jan. 2013 [ISO/IEC FDIS & ITU-T Consent]
Status after HEVC proposals submissions
Class A results were not tested subjectively
Classes B to E were tested subjectively
Proponents' performance results were anonymized
Proposal survey output documents:
  Decoder speed: JCTVC-A201
  Architectural outline: JCTVC-A202
  Table of design elements: JCTVC-A203
HEVC general architecture
All 27 proposals are conceptually similar to AVC (and prior standards)
  Block-based
  Variable block sizes
  Block motion compensation
  Fractional-pel motion vectors
  Spatial intra prediction
  Spatial transform of residual difference
  Integer-based transform designs
  Arithmetic or VLC-based entropy coding
  In-loop filtering to form final decoded picture
Lots of variations at the individual tool level
Key milestones
"Test Model under Consideration" (TMuC)
  July 2010 (Geneva): JCTVC-B204
First HEVC Test Model (HM1)
  October 2010 (Guangzhou): JCTVC-C402
Second HEVC Test Model (HM2)
  January 2011 (Daegu): JCTVC-D502
Third HEVC Test Model (HM3)
  March 2011 (Geneva): JCTVC-E602
Fourth HEVC Test Model (HM4)
  July 2011 (Torino): JCTVC-F802
Fifth HEVC Test Model (HM5)
  Expected early 2012
Steady simplification and efficiency improvement through these iterations
Moore's law of compression
between different paradigms
and even within the same paradigm
MPEG-2 video
and the story still goes on
Tools under consideration in HM4
High-level Structure:
  High-level support for frame rate temporal nesting and random access
  Rectangular tile-structured scanning
  Wavefront-structured processing dependencies for parallelism
  Slices with spatial granularity possibly smaller than largest coding unit: 16x16 or larger
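A rough sketch of what wavefront-structured dependencies buy in parallelism. It assumes the usual arrangement in which an LCU may start once its left neighbour and its above-right neighbour are finished, and that every LCU takes one time step; the slide states neither assumption.

```python
def wavefront_schedule(rows, cols):
    """Earliest start step of each LCU under the assumed dependencies:
    left neighbour in the same row, above-right neighbour in the row above."""
    start = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            deps = []
            if c > 0:
                deps.append(start[r][c - 1])
            if r > 0:
                # The last column has no above-right neighbour; use the one above.
                deps.append(start[r - 1][min(c + 1, cols - 1)])
            start[r][c] = 1 + max(deps) if deps else 0
    return start

schedule = wavefront_schedule(3, 4)
total_steps = schedule[-1][-1] + 1   # with enough threads, versus 12 steps serially
```

Each row trails the row above by two LCUs, so rows can be handed to separate threads.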
Coding units, Prediction units, and Transform units:
  Coding unit quadtree structure
    Square coding unit block sizes 2Nx2N, for N=4, 8, 16, 32
  Prediction units for coding unit size 2Nx2N:
    For Inter: 2Nx2N, 2NxN, Nx2N; for N>4, also 2Nx(N/2+3N/2) and (N/2+3N/2)x2N
    For Intra: only 2Nx2N; for N=4, also NxN
  Transform unit tree structure within coding unit
    Maximum of 3 levels
  Transform block size of 4x4 to 32x32 samples
    For Intra: always square
    For Inter: also non-square 4x16, 16x4, 8x32, 32x8
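The inter prediction unit shapes listed above can be enumerated for a given N. This is a sketch of the slide's rules only; the function name is illustrative, and only one ordering of each asymmetric split is shown.

```python
def inter_pu_partitions(n):
    """PU sizes (width, height) available for a 2Nx2N inter coding unit."""
    s = 2 * n
    modes = {
        "2Nx2N": [(s, s)],
        "2NxN": [(s, n), (s, n)],
        "Nx2N": [(n, s), (n, s)],
    }
    if n > 4:
        # Asymmetric splits: one side is divided into N/2 and 3N/2.
        q, t = n // 2, 3 * n // 2
        modes["2Nx(N/2+3N/2)"] = [(s, q), (s, t)]
        modes["(N/2+3N/2)x2N"] = [(q, s), (t, s)]
    return modes

modes_32x32 = inter_pu_partitions(16)   # CU of 32x32 luma samples
modes_8x8 = inter_pu_partitions(4)      # smallest CU: no asymmetric splits
```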
Spatial Signal Transformation and PCM Representation:
  DCT-like integer block transform
    For Intra: also a DST-based integer block transform, selected based on the intra prediction mode
  Transforms can cross prediction unit boundaries for Inter; not allowed for Intra
  PCM coding with worst-case bit usage limit
Intra-picture Prediction:
  Angular intra prediction
    17 directions for 4x4
    3 directions for 64x64
    34 directions for others
  Planar intra prediction
  Chroma intra prediction separate from or using luma samples
Inter-picture Prediction:
  Luma motion compensation interpolation
    1/4 sample precision, 8x8 separable with 6 bit tap values
  Chroma motion compensation interpolation
    1/8 sample precision, 4x4 separable with 6 bit tap values
  Advanced motion vector prediction with motion vector competition and merging
Entropy coding:
  In High Efficiency configuration: Context-adaptive binary arithmetic entropy coding
  In Low Complexity configuration: Context-adaptive variable-length entropy coding
Picture Storage and Output Precision:
  In High Efficiency configuration: 10 bit-per-sample picture storage and output
  In Low Complexity configuration: 8 bit-per-sample storage and output
In-Loop Filtering:
  Deblocking filter
  Sample-adaptive offset filter
  In High Efficiency configuration: Adaptive loop filter
  In Low Complexity configuration: No additional filter stage
Largest Coding Units (LCUs)
Consists of an NxN block of luma samples together with the two corresponding blocks of chroma samples for a picture
Broadly analogous to the macroblock in previous standards such as H.264/AVC
Maximum allowed size of the luma block in an LCU is 64x64
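As a quick illustration of the size difference, the number of blocks needed to cover a picture can be counted. The 1920x1080 picture size is only an example, and partially covered blocks at the picture edge are assumed to count as whole blocks.

```python
def block_grid(width, height, block_size):
    """Columns and rows of square blocks needed to cover a picture."""
    cols = (width + block_size - 1) // block_size    # integer ceiling
    rows = (height + block_size - 1) // block_size
    return cols, rows

lcu_cols, lcu_rows = block_grid(1920, 1080, 64)   # 64x64 LCUs
mb_cols, mb_rows = block_grid(1920, 1080, 16)     # 16x16 macroblocks
```

Here 510 LCUs cover the picture that takes 8160 macroblocks, so far less per-block signalling is needed in smooth regions.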
Slice and tile structures
Slice structure
  Unit of packetization for transmission purposes
  Independently decodable
  No prediction across slice borders
  Entropy coding restarted between slices
Tile structure
  Similar to JPEG and JPEG 2000
Coding Unit (CU) structure
Basic unit of region splitting used for inter/intra coding
Always square
From 8x8 luma samples up to the size of the LCU
Allows recursive splitting into four equally sized blocks, starting from the LCU
Skipped LCU
  No refresh
Non-skipped LCU
  Intra or inter prediction
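The recursive splitting can be sketched as a quadtree traversal. The split decision is passed in as a callback because a real encoder would decide by rate-distortion cost, which is not modelled; the rule used in the example is purely hypothetical.

```python
def split_cu(x, y, size, should_split, min_size=8):
    """Recursively split a square CU into four equal quadrants.
    Returns the leaf CUs as (x, y, size) tuples in z-scan order."""
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves = []
        for dy in (0, half):
            for dx in (0, half):
                leaves += split_cu(x + dx, y + dy, half, should_split, min_size)
        return leaves
    return [(x, y, size)]

# Hypothetical decision rule: keep splitting only the top-left quadrant.
leaves = split_cu(0, 0, 64, lambda x, y, size: x == 0 and y == 0)
```

The result mixes four 8x8, three 16x16, and three 32x32 coding units inside one 64x64 LCU.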
Prediction Unit (PU) structure
Basic unit for information on prediction processes
Not restricted to square
Each CU contains one or more PUs
[Figure: PU partitioning modes 2Nx2N, Nx2N, 2NxN, NxN]
Transform Unit (TU) structure
Basic unit for transform and quantization processes
Its shape depends on PU partitioning mode
3-level quadtree in High Efficiency configuration
Intra Prediction
Unified intra prediction coding tool provides up to 35 directional prediction modes, depending on the size of the PU
[Figure: 33 intra prediction directions; both axes are marked with offsets from -30 to 30 in steps of 5]
Motion Merge Technique
Used to find a (spatial or temporal) neighboring inter-coded PU such that its motion parameters (motion vector, reference picture index, and reference picture list usage flag) can be inferred as the ones for the current PU
AMVP
Advanced Motion Vector Prediction (AMVP) exploits the spatio-temporal correlation of the motion vector with those of neighboring PUs, and is used when the motion vector is transmitted explicitly
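A simplified sketch of motion vector competition: the encoder picks the candidate predictor closest to the actual motion vector and sends its index plus the difference. The candidate list, the cost measure, and the function names are assumptions for illustration and do not reproduce the HM candidate derivation.

```python
def encode_mv(mv, candidates):
    """Choose the predictor with the smallest difference to mv.
    Returns (candidate index, motion vector difference)."""
    def cost(i):
        px, py = candidates[i]
        return abs(mv[0] - px) + abs(mv[1] - py)
    idx = min(range(len(candidates)), key=cost)
    px, py = candidates[idx]
    return idx, (mv[0] - px, mv[1] - py)

def decode_mv(idx, mvd, candidates):
    """Rebuild the motion vector from the signalled index and difference."""
    px, py = candidates[idx]
    return (px + mvd[0], py + mvd[1])

# Hypothetical predictors taken from spatial and temporal neighbours.
candidates = [(4, 0), (10, -2), (0, 0)]
idx, mvd = encode_mv((9, -1), candidates)
```

Merge mode goes one step further: only the index is sent and the difference is inferred to be zero.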
Luma interpolation filters
8-tap separable DCT-based filter for 1/4-sample luma interpolation
Chroma interpolation filters
4-tap separable DCT-based filter for 1/8-sample chroma interpolation
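A one-dimensional sketch of 8-tap luma interpolation at the half-sample position. The tap values are those of the finalized H.265 half-sample luma filter and are an assumption here, since the slide does not list the HM4 coefficients; edge clamping and 8-bit clipping are also simplifications.

```python
# Assumed taps (finalized H.265 half-sample luma filter); they sum to 64,
# so the result is normalized with a 6-bit shift.
HALF_PEL_TAPS = (-1, 4, -11, 40, 40, -11, 4, -1)

def half_sample(row, i):
    """Interpolate the sample halfway between row[i] and row[i + 1].
    Positions outside the row are clamped to the nearest edge sample."""
    acc = 0
    for k, tap in enumerate(HALF_PEL_TAPS):
        j = min(max(i - 3 + k, 0), len(row) - 1)
        acc += tap * row[j]
    return min(max((acc + 32) >> 6, 0), 255)   # round, then clip to 8 bits

ramp = [0, 10, 20, 30, 40, 50, 60, 70]
mid = half_sample(ramp, 3)   # between the samples 30 and 40
```

A separable filter applies the same operation along rows and then along columns to reach two-dimensional fractional positions.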
Transform
nS = 4
{64, 64, 64, 64}
{83, 36,-36,-83}
{64,-64,-64, 64}
{36,-83, 83,-36}
nS = 8
{64, 64, 64, 64, 64, 64, 64, 64}
{89, 75, 50, 18,-18,-50,-75,-89}
{83, 36,-36,-83,-83,-36, 36, 83}
{75,-18,-89,-50, 50, 89, 18,-75}
{64,-64,-64, 64, 64,-64,-64, 64}
{50,-89, 18, 75,-75,-18, 89,-50}
{36,-83, 83,-36,-36, 83,-83, 36}
{18,-50, 75,-89, 89,-75, 50,-18}
nS = 16
{64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64}
{90 87 80 70 57 43 25 9 -9-25-43-57-70-80-87-90}
{89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89}
{87 57 9-43-80-90-70-25 25 70 90 80 43 -9-57-87}
{83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83}
{80 9-70-87-25 57 90 43-43-90-57 25 87 70 -9-80}
{75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75}
{70-43-87 9 90 25-80-57 57 80-25-90 -9 87 43-70}
{64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64}
{57-80-25 90 -9-87 43 70-70-43 87 9-90 25 80-57}
{50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50}
{43-90 57 25-87 70 9-80 80 -9-70 87-25-57 90-43}
{36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36}
{25-70 90-80 43 9-57 87-87 57 -9-43 80-90 70-25}
{18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18}
{ 9-25 43-57 70-80 87-90 90-87 80-70 57-43 25 -9}
nS = 32
{64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64}
{90 90 88 85 82 78 73 67 61 54 46 38 31 22 13 4 -4-13-22-31-38-46-54-61-67-73-78-82-85-88-90-90}
{90 87 80 70 57 43 25 9 -9-25-43-57-70-80-87-90-90-87-80-70-57-43-25 -9 9 25 43 57 70 80 87 90}
{90 82 67 46 22 -4-31-54-73-85-90-88-78-61-38-13 13 38 61 78 88 90 85 73 54 31 4-22-46-67-82-90}
{89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89 89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89}
{88 67 31-13-54-82-90-78-46 -4 38 73 90 85 61 22-22-61-85-90-73-38 4 46 78 90 82 54 13-31-67-88}
{87 57 9-43-80-90-70-25 25 70 90 80 43 -9-57-87-87-57 -9 43 80 90 70 25-25-70-90-80-43 9 57 87}
{85 46-13-67-90-73-22 38 82 88 54 -4-61-90-78-31 31 78 90 61 4-54-88-82-38 22 73 90 67 13-46-85}
{83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83}
{82 22-54-90-61 13 78 85 31-46-90-67 4 73 88 38-38-88-73 -4 67 90 46-31-85-78-13 61 90 54-22-82}
{80 9-70-87-25 57 90 43-43-90-57 25 87 70 -9-80-80 -9 70 87 25-57-90-43 43 90 57-25-87-70 9 80}
{78 -4-82-73 13 85 67-22-88-61 31 90 54-38-90-46 46 90 38-54-90-31 61 88 22-67-85-13 73 82 4-78}
{75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75 75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75}
{73-31-90-22 78 67-38-90-13 82 61-46-88 -4 85 54-54-85 4 88 46-61-82 13 90 38-67-78 22 90 31-73}
{70-43-87 9 90 25-80-57 57 80-25-90 -9 87 43-70-70 43 87 -9-90-25 80 57-57-80 25 90 9-87-43 70}
{67-54-78 38 85-22-90 4 90 13-88-31 82 46-73-61 61 73-46-82 31 88-13-90 -4 90 22-85-38 78 54-67}
{64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64}
{61-73-46 82 31-88-13 90 -4-90 22 85-38-78 54 67-67-54 78 38-85-22 90 4-90 13 88-31-82 46 73-61}
{57-80-25 90 -9-87 43 70-70-43 87 9-90 25 80-57-57 80 25-90 9 87-43-70 70 43-87 -9 90-25-80 57}
{54-85 -4 88-46-61 82 13-90 38 67-78-22 90-31-73 73 31-90 22 78-67-38 90-13-82 61 46-88 4 85-54}
{50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50 50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50}
{46-90 38 54-90 31 61-88 22 67-85 13 73-82 4 78-78 -4 82-73-13 85-67-22 88-61-31 90-54-38 90-46}
{43-90 57 25-87 70 9-80 80 -9-70 87-25-57 90-43-43 90-57-25 87-70 -9 80-80 9 70-87 25 57-90 43}
{38-88 73 -4-67 90-46-31 85-78 13 61-90 54 22-82 82-22-54 90-61-13 78-85 31 46-90 67 4-73 88-38}
{36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36}
{31-78 90-61 4 54-88 82-38-22 73-90 67-13-46 85-85 46 13-67 90-73 22 38-82 88-54 -4 61-90 78-31}
{25-70 90-80 43 9-57 87-87 57 -9-43 80-90 70-25-25 70-90 80-43 -9 57-87 87-57 9 43-80 90-70 25}
{22-61 85-90 73-38 -4 46-78 90-82 54-13-31 67-88 88-67 31 13-54 82-90 78-46 4 38-73 90-85 61-22}
{18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18 18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18}
{13-38 61-78 88-90 85-73 54-31 4 22-46 67-82 90-90 82-67 46-22 -4 31-54 73-85 90-88 78-61 38-13}
{ 9-25 43-57 70-80 87-90 90-87 80-70 57-43 25 -9 -9 25-43 57-70 80-87 90-90 87-80 70-57 43-25 9}
{ 4-13 22-31 38-46 54-61 67-73 78-82 85-88 90-90 90-90 88-85 82-78 73-67 61-54 46-38 31-22 13 -4}
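One way to read these tables: each row is an integer approximation of a DCT basis function scaled to a norm of about 128. The sketch below checks this for the nS = 4 matrix by computing the inner products between its rows; the helper name is illustrative.

```python
# nS = 4 matrix copied from the table above.
T4 = [
    [64,  64,  64,  64],
    [83,  36, -36, -83],
    [64, -64, -64,  64],
    [36, -83,  83, -36],
]

def gram(m):
    """Return M * M^T: inner products between all pairs of basis rows."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in m] for r1 in m]

g = gram(T4)
off_diagonal = [g[i][j] for i in range(4) for j in range(4) if i != j]
norms = [g[i][i] for i in range(4)]
```

The rows are exactly orthogonal, and their squared norms (16384 and 16370) differ by less than 0.1%, so the inverse transform is the transpose up to a near-constant scale.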
Loop filtering
Deblocking filter
  Same as H.264/AVC
  4x4 block boundaries not deblocked, to reduce the complexity
Sample Adaptive Offset (SAO)
Adaptive loop filter (ALF)
Temporal prediction structure
In core experiments, 3 non-normative temporal prediction structures are used
  Intra-only
  Low-delay
  Random-access
Intra-only configuration
[Figure: intra-only configuration along the time axis; pictures are intra coded (IDR picture) with quantization parameter QPI]
Low-delay configuration
[Figure: low-delay configuration along the time axis, with pictures numbered 0 to 8 in coding order. Picture 0 is an IDR or intra picture coded with QPI; the following pictures are GPB (Generalized P and B) pictures coded with QPBL1 = QPI+1, QPBL2 = QPI+2, or QPBL3 = QPI+3]
Random-access configuration
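[Figure: random-access configuration along the time axis, with pictures numbered 0 to 7 in coding order. Picture types shown: IDR or intra picture (QPI), GPB (Generalized P and B) picture, referenced B pictures, and non-referenced B pictures. The quantization parameter grows with the hierarchy level: QPBL1 = QPI+1, QPBL2 = QPI+2, QPBL3 = QPI+3, QPBL4 = QPI+4]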
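The QP labels in these configurations follow one rule: the intra QP plus the hierarchy level of the picture. A tiny sketch, with a hypothetical QPI of 32 chosen only for the example:

```python
def picture_qp(qp_i, level):
    """QP of a picture at hierarchy level `level` (0 = IDR or intra picture)."""
    return qp_i + level

QP_I = 32  # hypothetical intra QP, not taken from the slides
names = ["QPI", "QPBL1", "QPBL2", "QPBL3", "QPBL4"]
qps = {name: picture_qp(QP_I, level) for level, name in enumerate(names)}
```

Pictures higher in the hierarchy are referenced less often, so coarser quantization there costs less in overall quality.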
