Hevc 4 Videosense

A quick overview of HEVC: current status
Prof. Dr. Touradj Ebrahimi
December 16th, 2011
VideoSense Skype meeting
http://www.videosense.eu
Multimedia Signal Processing Group Swiss Federal Institute of Technology, Lausanne

H.264/AVC: Main Functionalities

Coding efficiency (about 50% bit-rate savings compared to previous standards)
Enhanced motion compensation, multiple reference frames, hierarchical B frames, improved deblocking filter, smaller blocks for transform, enhanced entropy coding

Network friendliness
Enabled through the Network Abstraction Layer (NAL) design

Error resilience
Data partitioning, flexible macroblock ordering (FMO), slices, redundant slices, resynchronization markers, multiple reference pictures, parameter sets, etc.

Temporal scalability
Enabled by flexible reference picture management via hierarchical predictions

Basic Coding Architecture


[Figure: H.264/AVC encoder block diagram. The input video signal is split into macroblocks of 16x16 pixels. Blocks: coder control, transform/scaling/quantization, scaling and inverse transform, intra-frame estimation, intra-frame prediction, motion estimation, motion compensation, intra/inter MB select, deblocking filter, entropy coding. Signals: control data, quantized transform coefficients, intra prediction data, motion data, output video signal.]

Similar to Previous Standards

Macroblocks: 16x16 luma + 2 x 8x8 chroma samples
Input: association of luma and chroma and conventional sub-sampling of chroma (4:2:0)
Block-based motion displacement
Motion vectors over picture boundaries
Variable block-size motion
Block transforms
Scalar quantization
I, P, and B coding types

MBs and sub-MBs for Motion Compensation

[Figure: macroblock partitions for motion compensation (16x16, 16x8, 8x16, 8x8) and sub-macroblock partitions (8x8, 8x4, 4x8, 4x4).]

Multi-frame Prediction

[Figure: picture N is predicted from several previously decoded pictures (..., N-4, N-3, N-2, N-1).]

Transform Coding
The H.264/MPEG-4 AVC standard uses 3 transforms, depending on the type of prediction residue to be coded:
1. 4x4 Hadamard transform for the luminance DC coefficients in MBs coded with the Intra 16x16 mode
2. 2x2 Hadamard transform for the chrominance DC coefficients in any MB
3. 4x4 integer transform based on the DCT for all the other blocks

[Figure: block order within a macroblock. Block -1: luma DC (Intra 16x16 mode only); blocks 0-15: luma 4x4 blocks; blocks 16 (Cb) and 17 (Cr): 2x2 chroma DC; blocks 18-25: chroma AC.]
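To make the third transform concrete, the sketch below applies the widely documented 4x4 integer core matrix of H.264/AVC to a residual block. The helper names and the flat test block are illustrative assumptions, and the scaling that the standard folds into quantization is deliberately left out.

```python
# Unscaled 4x4 forward core transform of H.264/AVC: Y = Cf * X * Cf^T.
# Normalization is merged into quantization in the standard and is NOT
# performed here. Helper names are illustrative, not from the slides.

CF = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def matmul(a, b):
    """Plain integer matrix product of two nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def forward_core_transform(block):
    """Return Cf * X * Cf^T for a 4x4 residual block X (integer arithmetic only)."""
    return matmul(matmul(CF, block), transpose(CF))


# A flat residual block puts all of its energy into the DC coefficient.
flat = [[3] * 4 for _ in range(4)]
coeffs = forward_core_transform(flat)
```

Because every entry of the matrix is 1 or 2 in magnitude, the transform needs only additions and shifts, which is the main reason an integer approximation is used instead of a floating-point DCT.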

4x4 Intra Prediction Directions

Mode 0 - Vertical
Mode 1 - Horizontal
Mode 2 - DC
Mode 3 - Diagonal Down/Left
Mode 4 - Diagonal Down/Right
Mode 5 - Vertical-Right
Mode 6 - Horizontal-Down
Mode 7 - Vertical-Left
Mode 8 - Horizontal-Up

H.264/AVC Profiles


H.264/AVC: a Success Story



3GPP (recommended in rel 6)
3GPP2 (optional for streaming service)
ARIB (Japan mobile segment broadcast)
ATSC (preliminary adoption for robust-mode back-up channel)
Blu-ray Disc Association (mandatory for Video BD-ROM players)
DLNA (optional in first version)
DMB (Korea - mandatory)
DVB (specified in TS 102 005 and one of two in TS 101 154)
DVD Forum (mandatory for HD DVD players)
IETF AVT (RTP payload spec approved as RFC 3984)
ISMA (mandatory specified in near-final rel 2.0)
SCTE (under consideration)
US DoD MISB (US government preferred codec up to 1080p)
(and, of course, MPEG and the ITU-T)

Scalable Coding: Objectives


Scalability is a functionality regarding the useful decoding of parts of the encoded bitstream, ideally
1. while achieving an RD performance at any supported spatial, temporal, or SNR resolution that is comparable to single-layer coding at that particular resolution, and
2. without significantly increasing the decoding complexity.

Main SVC Requirements


Similar coding efficiency compared to single-layer coding for each subset of the scalable bitstream.
Little increase in decoding complexity compared to single-layer decoding; complexity should scale with the decoded spatio-temporal resolution and bit rate.
Support of temporal, spatial, and quality scalability.
Support of a backward-compatible base layer (H.264/AVC in this case).
Support of simple bitstream adaptations after encoding.

SVC Applications

Robust Video Delivery
Adaptive delivery over error-prone networks and to devices with varying capability
Combined with unequal error protection
Internet/mobile transmission

Scalable Storage
Scalable export of video content
Graceful expiration or deletion
Surveillance DVRs and home PVRs

Enhancement Services
Upgrade delivery from 1080i/720p to 1080p
DTV broadcasting, optical storage devices

SVC Alternatives

Simulcast
Simplest solution
Code each layer as an independent stream
Incurs an increase in rate

Stream Switching
Viable for some application scenarios
Lacks flexibility within the network
Requires more storage/complexity at the server

Transcoding
Low cost, designed for specific application needs
Already deployed in many application domains

Spatio-Temporal-Quality Cube
[Figure: cube of operating points contained in the global bitstream. Axes: spatial resolution (QCIF, CIF, 4CIF), temporal resolution (7.5, 15, 30, 60), and bit rate (quality, SNR; low to high).]

SVC Coding Architecture


[Figure: SVC encoder structure, annotated "2007 !". The input is spatially decimated to feed the lower layers. Each layer applies hierarchical MCP and intra prediction, base layer coding of texture and motion, and progressive SNR refinement texture coding. Inter-layer prediction of intra, motion, and residual data connects each layer to the next higher one. A multiplexer combines all layers into the scalable bitstream; the lowest layer is produced by an H.264/AVC compatible encoder and yields an H.264/AVC compatible base layer bitstream.]

Hierarchical Prediction Structures for Temporal Scalability

(a) Coding with hierarchical B pictures, (b) non-dyadic hierarchical prediction structure, (c) hierarchical prediction structure with a structural encoder/decoder delay of zero. The numbers below the pictures specify the coding order, and the symbols Tk specify the temporal layers, with k representing the corresponding temporal layer identifier.
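For the dyadic case, the temporal layer identifier k of each picture follows directly from its position in the group of pictures. The sketch below assumes a GOP size that is a power of two and pictures indexed in display order; the function names are illustrative and are not part of the SVC specification.

```python
def temporal_layer(index, gop_size=8):
    """Temporal layer id k (as in Tk) of a picture in a dyadic hierarchy.

    Pictures at multiples of gop_size are key pictures (T0); each halving
    of the distance between pictures adds one temporal layer.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    levels = gop_size.bit_length() - 1          # log2(gop_size)
    pos = index % gop_size
    if pos == 0:
        return 0
    trailing_zeros = (pos & -pos).bit_length() - 1
    return levels - trailing_zeros


def extract(indices, max_layer, gop_size=8):
    """Keep only the pictures a decoder needs for layers T0..T(max_layer)."""
    return [i for i in indices if temporal_layer(i, gop_size) <= max_layer]


layers = [temporal_layer(i) for i in range(9)]
half_rate = extract(range(9), 2)   # dropping the highest layer halves the frame rate
```

Discarding all pictures above a chosen layer leaves a decodable sub-stream, because pictures in lower layers never reference pictures in higher ones.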

Trading Enhancement Layer Coding Efficiency and Drift for Packet-based Quality Scalable Coding

(a) Base layer only control, (b) enhancement layer only control, (c) two-loop control, (d) key picture concept of SVC for hierarchical prediction structures, where key pictures are marked by the hatched boxes.

SVC Scalability Types


Temporal scalability - Can typically be achieved without losses in rate-distortion performance.
Spatial scalability - When applying an optimized SVC encoder control, the bit rate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for dyadic spatial scalability. Results become worse as the spatial resolution of both layers decreases and improve as it increases.
SNR scalability - When applying an optimized encoder control, the bit rate increase relative to non-scalable H.264/AVC coding at the same fidelity can be as low as 10% for all supported rate points when spanning a bit rate range with a factor of 2-3 between the lowest and highest supported rate point.
From IEEE Transactions on Circuits and Systems for Video Technology, September 2007.

SVC Performance: Spatial Scalability

10-15% gains over simulcast
Performs within 10% of single-layer coding
[Segall & Sullivan, T-CSVT, Sept. 2007]

SVC Profiles


Multiview Video Coding (MVC)

In addition to exploiting the temporal and spatial redundancy within each view to achieve coding gains, redundancy can also be exploited across the different views.
The JVT standardization goal is to reach 50% bit-rate savings over independent coding of views at the same quality, by defining another extension to the H.264/AVC standard.

Multiview Video Data

Redundancy reduction between camera views
Need to cope with color/illumination mismatch problems
Alignment may not always be perfect either

MVC Prediction Structures

Many prediction structures are possible to exploit inter-camera redundancy: trade-off in memory, delay, computation, and coding efficiency.

[Figure: prediction structures over time and view axes, comparing the MPEG-2 Video Multiview profile with (JVT) MVC.]

MVC: Technical Solution

The current H.264/AVC multiview extension does not require any changes to lower-level syntax
Very compatible with single-layer AVC hardware

Inter-view prediction
Enabled through flexible design of decoded reference picture management
Allows decoded pictures from other views to be inserted into and removed from the reference picture buffer

Small changes to high-level syntax
Specify view dependency

Some MVC Performance Results


Towards ITU-T Recommendation H.265


H.265 is a long-term video coding standard, launched by ITU-T VCEG.
Not yet formalized, but VCEG keeps seeking proposals and information regarding the possibility of a major performance gain to justify the step from H.264 to H.265.
Though the necessary scope of H.265 is largely yet to be determined, it is agreed that the goals will include:
High coding efficiency, e.g., two times that of H.264/AVC
Computational efficiency, considering both encoder and decoder
Loss/error robustness
Network friendliness

So far, contributions to VCEG have mainly focused on improving coding efficiency. To better evaluate these contributions and retain progress, the KTA (Key Technical Area) software platform has been developed, using JM11 as the baseline and continuously integrating promising tools.

KTA Inter Prediction Tools

Inter prediction

Adaptive interpolation filter (AIF):
2-D non-separable AIF (AD08, AE16)
Separable AIF (COM16-C219, AG10)
Directional AIF (DAIF) (AG21, AG22, AH17, AH18)
Enhanced DAIF (E-DAIF) (AI12, COM16-C125, COM16-C126)
Enhanced AIF (EAIF) (C464, AI38, AJ30)
Switched interpolation filters with offsets (SIFO) (C463, AI35, AJ29, COM16-C126)
High precision filter (HPF) (AI33)
Single-pass encoding (AJ29, AK26)

1/8-pel motion compensated prediction (MCP) (AD09)
Extended MCP block size (COM16-C123)
Competition-based MV prediction (AC06)

KTA Transform and Quantization Tools

Transform and Quantization
Mode-dependent directional transform (MDDT) (AF15, AG11, AH20, AJ24, AI36)
Very large block transform (COM16-C123)
Adaptive prediction error coding (APEC) (AB06, AD07, AE15)
Adaptive quantization matrix selection (AQMS) (AC07, AD06, AF08, AI19)
Rate-distortion optimized quantization (RDO-Q) (AH21)

Other KTA Tools

Entropy coding
Parallel CABAC (COM16-C405, AI32)

In-loop filter
Block-based adaptive loop filter (BALF) (AI18, AJ13)
Quadtree-based adaptive loop filter (QALF) (COM16-C181, AK22)

Post filter (AI34, COM16-C128)
Internal bit depth increasing (IBDI) (AE13, AF07)

JCT-VC roadmap toward HEVC standardization



First meeting: Dresden, Germany, 15-23 Apr. 2010 (188 people, 40 input documents)
Second meeting: Geneva, Switzerland, 21-28 July 2010 (221 people, 120 input documents)
Third meeting: Guangzhou, China, 7-15 Oct. 2010 (244 people, 300 input documents)
Fourth meeting: Daegu, Korea, 20-28 Jan. 2011 (248 people, 400 input documents)
Fifth meeting: Geneva, Switzerland, 16-23 Mar. 2011 (226 people, 500 input documents)
Sixth meeting: Turin, Italy, 14-22 July 2011 (253 people, 700 input documents)
Seventh meeting: Geneva, Switzerland, 21-30 Nov. 2011 (285 people, 1000 input documents)
Eighth meeting: San Jose, United States, 1-10 Feb. 2012 [ISO/IEC CD]
Ninth meeting: Geneva, Switzerland, 30 Apr. to 9 May 2012
Tenth meeting: Stockholm, Sweden, 11-20 July 2012 [ISO/IEC DIS]
Eleventh meeting: Shanghai, China, 10-19 Oct. 2012
Twelfth meeting: Geneva, Switzerland, 17-23 Jan. 2013 [ISO/IEC FDIS & ITU-T Consent]

Status after HEVC proposals submissions

Class A results were not tested subjectively
Classes B to E were tested subjectively
Proponents' performance results were anonymized
Proposal survey output documents:
Decoder speed: JCTVC-A201
Architectural outline: JCTVC-A202
Table of design elements: JCTVC-A203

HEVC general architecture

All 27 proposals were conceptually similar to AVC (and prior standards):
Block-based
Variable block sizes
Block motion compensation
Fractional-pel motion vectors
Spatial intra prediction
Spatial transform of residual difference
Integer-based transform designs
Arithmetic or VLC-based entropy coding
In-loop filtering to form final decoded picture

Lots of variations at the individual tool level

Key milestones
"Test Model under Consideration" (TMuC)
July 2010 (Geneva): JCTVC-B204

First HEVC Test Model (HM1)
October 2010 (Guangzhou): JCTVC-C402

Second HEVC Test Model (HM2)
January 2011 (Daegu): JCTVC-D502

Third HEVC Test Model (HM3)
March 2011 (Geneva): JCTVC-E602

Fourth HEVC Test Model (HM4)
July 2011 (Torino): JCTVC-F802

Fifth HEVC Test Model (HM5)
Expected early 2012

Steady simplification and efficiency improvement through these iterations

Moore's law of compression

between different paradigms

Moore's law of compression

and even within the same paradigm
MPEG-2 video

Moore's law of compression

and the story still goes on

Tools under consideration in HM4

High-level Structure:
High-level support for frame rate temporal nesting and random access
Rectangular tile-structured scanning
Wavefront-structured processing dependencies for parallelism
Slices with spatial granularity possibly smaller than largest coding unit: 16x16 or larger

Tools under consideration in HM4

Coding units, Prediction units, and Transform units:

Coding unit quadtree structure
Square coding unit block sizes 2Nx2N, for N = 4, 8, 16, 32

Prediction units for coding unit size 2Nx2N:
For Inter: 2Nx2N, 2NxN, Nx2N; for N > 4, also 2Nx(N/2+3N/2) and (N/2+3N/2)x2N
For Intra: only 2Nx2N; for N = 4, also NxN

Transform unit tree structure within coding unit
Maximum of 3 levels

Transform block size of 4x4 to 32x32 samples
For Intra: always square
For Inter: also non-square 4x16, 16x4, 8x32, 32x8

Tools under consideration in HM4

Spatial Signal Transformation and PCM Representation:
DCT-like integer block transform
For Intra, also a DST-based integer block transform, selected based on the intra prediction mode
Transforms can cross prediction unit boundaries for Inter; not allowed for Intra
PCM coding with worst-case bit usage limit

Tools under consideration in HM4

Intra-picture Prediction:
Angular intra prediction: 17 directions for 4x4, 3 directions for 64x64, 34 directions for others
Planar intra prediction
Chroma intra prediction separate from or using luma samples

Tools under consideration in HM4

Inter-picture Prediction:
Luma motion compensation interpolation: 1/4-sample precision, 8x8 separable with 6-bit tap values
Chroma motion compensation interpolation: 1/8-sample precision, 4x4 separable with 6-bit tap values
Advanced motion vector prediction with motion vector competition and merging

Tools under consideration in HM4

Entropy coding:
In High Efficiency configuration: context-adaptive binary arithmetic entropy coding
In Low Complexity configuration: context-adaptive variable-length entropy coding

Picture Storage and Output Precision:
In High Efficiency configuration: 10 bit-per-sample picture storage and output
In Low Complexity configuration: 8 bit-per-sample storage and output

Tools under consideration in HM4

In-Loop Filtering:
Deblocking filter
Sample-adaptive offset filter
In High Efficiency configuration: adaptive loop filter
In Low Complexity configuration: no additional filter stage

Largest Coding Units (LCUs)

Consists of an NxN block of luma samples together with the two corresponding blocks of chroma samples of a picture
Broadly analogous to the macroblock in previous standards such as H.264/AVC
Maximum allowed size of the luma block in an LCU is 64x64
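A quick way to see the effect of the LCU size is to count how many LCUs cover a picture. The sketch below is only arithmetic; the 1920x1080 picture size is an assumed example, and partial LCUs at the right and bottom borders are counted as whole units.

```python
def lcu_grid(width, height, lcu_size=64):
    """Return (columns, rows, total) of LCUs needed to cover a picture.

    Border LCUs that only partly overlap the picture still count,
    hence the ceiling division.
    """
    cols = -(-width // lcu_size)    # ceil(width / lcu_size) with integers
    rows = -(-height // lcu_size)
    return cols, rows, cols * rows


hd_64 = lcu_grid(1920, 1080)        # 64x64 LCUs
hd_16 = lcu_grid(1920, 1080, 16)    # 16x16 units, the H.264/AVC macroblock size
```

For this assumed picture size, 64x64 LCUs need 16 times fewer top-level units than 16x16 macroblocks, which reduces per-block signalling for large uniform regions.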

Slice and tile structures

Slice structure
Unit of packetization for transmission purposes
Independently decodable
No prediction across slice borders
Entropy coding restarted between slices

Tile structure
Similar to JPEG and JPEG 2000

Coding Unit (CU) structure

Basic unit of region splitting used for inter/intra coding
Always square
From 8x8 luma samples up to the size of the LCU
Allows recursive splitting into four equally sized blocks, starting from the LCU

Skipped LCU
No refresh

Non-skipped LCU
Intra or inter prediction
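The recursive splitting of an LCU into CUs can be sketched as a quadtree walk. The split decision below is a toy rule chosen only for illustration; a real encoder would decide by rate-distortion cost, and the function names are assumptions rather than HM identifiers.

```python
def split_cu(x, y, size, should_split, min_size=8):
    """Recursively split a square CU and return its leaves as (x, y, size).

    A CU is split into four equally sized squares while the decision
    callback asks for it and the minimum CU size has not been reached.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves = []
        for dy in (0, half):            # visit the four quadrants in z-order
            for dx in (0, half):
                leaves += split_cu(x + dx, y + dy, half, should_split, min_size)
        return leaves
    return [(x, y, size)]


def toy_decision(x, y, size):
    """Toy rule: keep refining only the CU at the LCU origin."""
    return x == 0 and y == 0


leaves = split_cu(0, 0, 64, toy_decision)
sizes = sorted(size for _, _, size in leaves)
```

The leaves always tile the LCU exactly, so detail can be concentrated where it is needed while flat regions stay as large CUs.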

Prediction Unit (PU) structure

Basic unit for information on prediction processes
Not restricted to square
Each CU contains one or more PUs

[Figure: PU partitioning modes 2Nx2N, Nx2N, 2NxN, and NxN.]
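The four symmetric partitioning modes can be written down as a small lookup. This sketch covers only the modes named on the slide, uses the width x height convention (so 2NxN means two PUs of full width and half height), and uses a hypothetical function name.

```python
def pu_partitions(cu_size, mode):
    """Return the (width, height) of each PU for a CU of size 2Nx2N.

    Only the symmetric modes shown on the slide are covered; asymmetric
    partitions are intentionally left out of this sketch.
    """
    n = cu_size // 2
    table = {
        "2Nx2N": [(cu_size, cu_size)],   # one PU covering the whole CU
        "2NxN": [(cu_size, n)] * 2,      # upper and lower halves
        "Nx2N": [(n, cu_size)] * 2,      # left and right halves
        "NxN": [(n, n)] * 4,             # four quadrants
    }
    return table[mode]


modes = {m: pu_partitions(16, m) for m in ("2Nx2N", "2NxN", "Nx2N", "NxN")}
```

Whatever the mode, the PUs cover the CU exactly once, so each mode is just a different way to assign prediction parameters to the same area.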

Transform Unit (TU) structure

Basic unit for transform and quantization processes
Its shape depends on the PU partitioning mode
3-level quadtree in High Efficiency configuration

Intra Prediction

Unified intra prediction coding tool
Provides up to 35 prediction modes, depending on the size of the PU

33 intra prediction directions


[Figure: the 33 angular intra prediction directions; both axes are labeled from -30 to 30 in steps of 5.]

Motion Merge Technique

Used to find a (spatial or temporal) neighboring inter-coded PU whose motion parameters (motion vector, reference picture index, and reference picture list usage flag) can be inferred as the ones for the current PU
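The idea of inheriting motion parameters from a neighbor can be sketched as follows. This is a simplified illustration under stated assumptions: the candidate positions, their ordering, and the pruning rules of the HM software are not reproduced, and the `Motion` type and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Motion:
    mv: tuple        # motion vector (mvx, mvy)
    ref_idx: int     # reference picture index
    ref_list: int    # which reference picture list is used


def merge_candidates(neighbors):
    """Build the candidate list from neighboring PUs.

    Entries that are None stand for intra-coded or unavailable neighbors;
    duplicates are dropped so that every index refers to distinct motion.
    """
    candidates = []
    for motion in neighbors:
        if motion is not None and motion not in candidates:
            candidates.append(motion)
    return candidates


def decode_merge(neighbors, merge_index):
    """Decoder side: the current PU copies the selected candidate as-is."""
    return merge_candidates(neighbors)[merge_index]


neighbors = [None, Motion((4, -2), 0, 0), Motion((4, -2), 0, 0), Motion((0, 1), 1, 1)]
inherited = decode_merge(neighbors, 1)
```

Only the index into the candidate list has to be signalled; no motion vector difference or reference index is coded for a merged PU.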

AMVP

Advanced Motion Vector Prediction (AMVP) exploits the spatio-temporal correlation of the motion vector with those of neighboring PUs; it is used when the motion vector is transmitted explicitly
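Explicit motion vector transmission with predictor competition can be sketched as below: the encoder signals which candidate predictor it chose plus the difference to the actual vector, and the decoder adds them back. The candidate list and the L1 cost used to choose the predictor are assumptions for illustration, not the HM derivation process.

```python
def choose_predictor(mv, candidates):
    """Encoder side: pick the candidate that minimizes the size of the
    motion vector difference, and return (candidate index, difference)."""
    def cost(i):
        px, py = candidates[i]
        return abs(mv[0] - px) + abs(mv[1] - py)

    best = min(range(len(candidates)), key=cost)
    px, py = candidates[best]
    return best, (mv[0] - px, mv[1] - py)


def reconstruct_mv(index, mvd, candidates):
    """Decoder side: predictor selected by index plus transmitted difference."""
    px, py = candidates[index]
    return (px + mvd[0], py + mvd[1])


candidates = [(0, 0), (4, -2), (10, 10)]   # assumed spatial/temporal predictors
index, mvd = choose_predictor((5, -3), candidates)
decoded = reconstruct_mv(index, mvd, candidates)
```

Encoder and decoder must build the same candidate list, since only the index and the difference are transmitted.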

Luma interpolation filters

8-tap separable DCT-based filter for 1/4-sample luma interpolation
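A one-dimensional pass of such a filter can be sketched as follows. The tap values are those of the finalized H.265/HEVC luma filter and are used here as an assumption for illustration; the HM4 taps may differ. Each set sums to 64, which matches the 6-bit tap precision mentioned earlier. Intermediate bit depths and the second (vertical) pass of the separable filter are omitted.

```python
# 8-tap luma filters indexed by fractional position in quarter samples.
# ASSUMPTION: values from the final H.265/HEVC standard, not confirmed for HM4.
LUMA_TAPS = {
    1: (-1, 4, -10, 58, 17, -5, 1, 0),     # 1/4-sample position
    2: (-1, 4, -11, 40, 40, -11, 4, -1),   # 1/2-sample position
    3: (0, 1, -5, 17, 58, -10, 4, -1),     # 3/4-sample position
}


def interpolate(samples, i, frac):
    """Interpolate at position i + frac/4 along one row or column.

    Uses samples[i-3] .. samples[i+4]; the caller must make sure these
    exist (for example by padding the picture border).
    """
    taps = LUMA_TAPS[frac]
    acc = sum(t * samples[i - 3 + k] for k, t in enumerate(taps))
    return (acc + 32) >> 6      # round and divide by 64


flat = [100] * 8
ramp = [0, 10, 20, 30, 40, 50, 60, 70]
half_on_flat = interpolate(flat, 3, 2)
half_on_ramp = interpolate(ramp, 3, 2)   # halfway between 30 and 40
```

Applying the same routine first along rows and then along columns gives the separable two-dimensional interpolation.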

Chroma interpolation filters

4-tap separable DCT-based filter for 1/8-sample chroma interpolation

Transform
nS = 4
{64, 64, 64, 64}
{83, 36,-36,-83}
{64,-64,-64, 64}
{36,-83, 83,-36}

nS = 8
{64, 64, 64, 64, 64, 64, 64, 64}
{89, 75, 50, 18,-18,-50,-75,-89}
{83, 36,-36,-83,-83,-36, 36, 83}
{75,-18,-89,-50, 50, 89, 18,-75}
{64,-64,-64, 64, 64,-64,-64, 64}
{50,-89, 18, 75,-75,-18, 89,-50}
{36,-83, 83,-36,-36, 83,-83, 36}
{18,-50, 75,-89, 89,-75, 50,-18}

nS = 16
{64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64}
{90 87 80 70 57 43 25 9 -9-25-43-57-70-80-87-90}
{89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89}
{87 57 9-43-80-90-70-25 25 70 90 80 43 -9-57-87}
{83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83}
{80 9-70-87-25 57 90 43-43-90-57 25 87 70 -9-80}
{75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75}
{70-43-87 9 90 25-80-57 57 80-25-90 -9 87 43-70}
{64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64}
{57-80-25 90 -9-87 43 70-70-43 87 9-90 25 80-57}
{50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50}
{43-90 57 25-87 70 9-80 80 -9-70 87-25-57 90-43}
{36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36}
{25-70 90-80 43 9-57 87-87 57 -9-43 80-90 70-25}
{18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18}
{ 9-25 43-57 70-80 87-90 90-87 80-70 57-43 25 -9}

nS = 32
{64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64 64}
{90 90 88 85 82 78 73 67 61 54 46 38 31 22 13 4 -4-13-22-31-38-46-54-61-67-73-78-82-85-88-90-90}
{90 87 80 70 57 43 25 9 -9-25-43-57-70-80-87-90-90-87-80-70-57-43-25 -9 9 25 43 57 70 80 87 90}
{90 82 67 46 22 -4-31-54-73-85-90-88-78-61-38-13 13 38 61 78 88 90 85 73 54 31 4-22-46-67-82-90}
{89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89 89 75 50 18-18-50-75-89-89-75-50-18 18 50 75 89}
{88 67 31-13-54-82-90-78-46 -4 38 73 90 85 61 22-22-61-85-90-73-38 4 46 78 90 82 54 13-31-67-88}
{87 57 9-43-80-90-70-25 25 70 90 80 43 -9-57-87-87-57 -9 43 80 90 70 25-25-70-90-80-43 9 57 87}
{85 46-13-67-90-73-22 38 82 88 54 -4-61-90-78-31 31 78 90 61 4-54-88-82-38 22 73 90 67 13-46-85}
{83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83 83 36-36-83-83-36 36 83}
{82 22-54-90-61 13 78 85 31-46-90-67 4 73 88 38-38-88-73 -4 67 90 46-31-85-78-13 61 90 54-22-82}
{80 9-70-87-25 57 90 43-43-90-57 25 87 70 -9-80-80 -9 70 87 25-57-90-43 43 90 57-25-87-70 9 80}
{78 -4-82-73 13 85 67-22-88-61 31 90 54-38-90-46 46 90 38-54-90-31 61 88 22-67-85-13 73 82 4-78}
{75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75 75-18-89-50 50 89 18-75-75 18 89 50-50-89-18 75}
{73-31-90-22 78 67-38-90-13 82 61-46-88 -4 85 54-54-85 4 88 46-61-82 13 90 38-67-78 22 90 31-73}
{70-43-87 9 90 25-80-57 57 80-25-90 -9 87 43-70-70 43 87 -9-90-25 80 57-57-80 25 90 9-87-43 70}
{67-54-78 38 85-22-90 4 90 13-88-31 82 46-73-61 61 73-46-82 31 88-13-90 -4 90 22-85-38 78 54-67}
{64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64 64-64-64 64}
{61-73-46 82 31-88-13 90 -4-90 22 85-38-78 54 67-67-54 78 38-85-22 90 4-90 13 88-31-82 46 73-61}
{57-80-25 90 -9-87 43 70-70-43 87 9-90 25 80-57-57 80 25-90 9 87-43-70 70 43-87 -9 90-25-80 57}
{54-85 -4 88-46-61 82 13-90 38 67-78-22 90-31-73 73 31-90 22 78-67-38 90-13-82 61 46-88 4 85-54}
{50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50 50-89 18 75-75-18 89-50-50 89-18-75 75 18-89 50}
{46-90 38 54-90 31 61-88 22 67-85 13 73-82 4 78-78 -4 82-73-13 85-67-22 88-61-31 90-54-38 90-46}
{43-90 57 25-87 70 9-80 80 -9-70 87-25-57 90-43-43 90-57-25 87-70 -9 80-80 9 70-87 25 57-90 43}
{38-88 73 -4-67 90-46-31 85-78 13 61-90 54 22-82 82-22-54 90-61-13 78-85 31 46-90 67 4-73 88-38}
{36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36 36-83 83-36-36 83-83 36}
{31-78 90-61 4 54-88 82-38-22 73-90 67-13-46 85-85 46 13-67 90-73 22 38-82 88-54 -4 61-90 78-31}
{25-70 90-80 43 9-57 87-87 57 -9-43 80-90 70-25-25 70-90 80-43 -9 57-87 87-57 9 43-80 90-70 25}
{22-61 85-90 73-38 -4 46-78 90-82 54-13-31 67-88 88-67 31 13-54 82-90 78-46 4 38-73 90-85 61-22}
{18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18 18-50 75-89 89-75 50-18-18 50-75 89-89 75-50 18}
{13-38 61-78 88-90 85-73 54-31 4 22-46 67-82 90-90 82-67 46-22 -4 31-54 73-85 90-88 78-61 38-13}
{ 9-25 43-57 70-80 87-90 90-87 80-70 57-43 25 -9 -9 25-43 57-70 80-87 90-90 87-80 70-57 43-25 9}
{ 4-13 22-31 38-46 54-61 67-73 78-82 85-88 90-90 90-90 88-85 82-78 73-67 61-54 46-38 31-22 13 -4}
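Two properties of these matrices can be checked numerically with the 4-point and 8-point tables listed on the slide: the smaller matrix is embedded in the larger one (even rows, first half of the columns), and the rows are close to, but not exactly, orthogonal, since the entries are rounded integer approximations of scaled DCT basis functions. The variable names below are illustrative.

```python
T4 = [
    [64,  64,  64,  64],
    [83,  36, -36, -83],
    [64, -64, -64,  64],
    [36, -83,  83, -36],
]

T8 = [
    [64,  64,  64,  64,  64,  64,  64,  64],
    [89,  75,  50,  18, -18, -50, -75, -89],
    [83,  36, -36, -83, -83, -36,  36,  83],
    [75, -18, -89, -50,  50,  89,  18, -75],
    [64, -64, -64,  64,  64, -64, -64,  64],
    [50, -89,  18,  75, -75, -18,  89, -50],
    [36, -83,  83, -36, -36,  83, -83,  36],
    [18, -50,  75, -89,  89, -75,  50, -18],
]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# Even rows of the 8-point matrix, truncated to 4 columns, give the 4-point matrix.
embedded = [row[:4] for row in T8[::2]]

# Gram matrix: row norms on the diagonal, row correlations elsewhere.
gram = [[dot(r, s) for s in T8] for r in T8]
norms = [gram[i][i] for i in range(8)]
max_off_diagonal = max(abs(gram[i][j]) for i in range(8) for j in range(8) if i != j)
```

The off-diagonal terms are tiny compared with the row norms, so the integer matrix behaves almost like an orthogonal transform while remaining exactly reproducible in integer arithmetic.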

Loop filtering

Deblocking filter
Same as H.264/AVC
4x4 block boundaries are not deblocked, to reduce complexity
Sample Adaptive Offset (SAO)
Adaptive loop filter (ALF)

Temporal prediction structure

In core experiments, 3 non-normative temporal prediction structures are used:
Intra-only
Low-delay
Random-access

Intra-only configuration

[Figure: intra-only configuration. Pictures along the time axis are each labeled with the quantization parameter QPI; legend: IDR picture.]

Low-delay configuration

[Figure: low-delay configuration. Picture 0 is an IDR or intra picture coded with QPI; pictures 1 to 8 along the time axis are GPB (generalized P and B) pictures. QP labels in the figure: QPBL3 = QPI + 3 (four pictures), QPBL2 = QPI + 2 (two pictures), QPBL1 = QPI + 1 (two pictures).]
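The QP labels of the low-delay configuration can be turned into a small lookup. The per-picture assignment below (offsets 3, 2, 3, 1 repeating every four pictures) is an assumption: it is consistent with the number of QPBL1, QPBL2, and QPBL3 labels in the figure, but the exact mapping is not recoverable from the slide text. The function name is hypothetical.

```python
# ASSUMED offsets for pictures 1, 2, 3, 4 of each group of four GPB pictures.
LOW_DELAY_OFFSETS = (3, 2, 3, 1)


def low_delay_qp(qpi, picture_index):
    """QP of a picture in the low-delay structure.

    Picture 0 is the IDR/intra picture and uses QPI itself; every later
    picture uses QPI plus the offset of its position in the group of four.
    """
    if picture_index == 0:
        return qpi
    return qpi + LOW_DELAY_OFFSETS[(picture_index - 1) % 4]


qps = [low_delay_qp(32, i) for i in range(9)]
```

Pictures with a larger offset are quantized more coarsely; under the assumed mapping these are the pictures highest in the hierarchy.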

Random-access configuration

[Figure: random-access configuration with a hierarchical prediction structure. The numbers 0 to 7 in the figure give the coding order of the pictures along the time axis. Legend: IDR or intra picture, GPB (generalized P and B) picture, referenced B picture, non-referenced B picture. QP labels: QPI, QPBL1 = QPI + 1, QPBL2 = QPI + 2, QPBL3 = QPI + 3, QPBL4 = QPI + 4.]