5 F485 C53 D 01
5 F485 C53 D 01
A/53E
27 December 2005
Amendment No. 1 dated 18 April 2006
Note: An editorial correction to Figure D5.10 was made with publication of Amendment No. 1 to
this document.
2
ATSC A/53E Digital Television Standard 27 December 2005
Table of Contents
1. SCOPE AND DOCUMENTATION STRUCTURE.......................................................................................11
1.1 Documentation Structure 11
2. REFERENCES ...........................................................................................................................................11
2.1 Normative References 11
2.2 Informative References 11
3. DEFINITIONS .............................................................................................................................................11
3.1 Compliance Notation 12
3.2 Treatment of Syntactic Elements 12
3.3 Terms Employed 12
3.4 Symbols, Abbreviations, and Mathematical Operators 17
3.4.1 Arithmetic Operators 17
3.4.2 Logical Operators 18
3.4.3 Relational Operators 18
3.4.4 Bitwise Operators 19
3.4.5 Assignment 19
3.4.6 Mnemonics 19
3.4.7 Constants 19
3.4.8 Method of Describing Bit Stream Syntax 19
3.4.8.1 Definition of bytealigned Function 20
3.4.8.2 Definition of nextbits Function 20
3.4.8.3 Definition of next_start_code Function 20
4. SYSTEM OVERVIEW.................................................................................................................................21
4.1 System Block Diagram 21
1. SCOPE .......................................................................................................................................................25
2. REFERENCES ...........................................................................................................................................25
2.1 Normative References 25
2.2 Informative References 26
3
ATSC A/53E Digital Television Standard 27 December 2005
1. SCOPE .......................................................................................................................................................37
4. SYSTEM OVERVIEW.................................................................................................................................37
5. SPECIFICATION ........................................................................................................................................38
5.1 Constraints With Respect to ATSC Standard A/52B 38
5.2 Sampling Frequency 39
5.3 Bit Rate 39
5.4 Audio Coding Modes 39
5.5 Dialogue Level 39
5.6 Dynamic Range Compression 39
5.7 STD Audio Buffer Size 39
4
ATSC A/53E Digital Television Standard 27 December 2005
1. SCOPE .......................................................................................................................................................44
4. SYSTEM OVERVIEW.................................................................................................................................45
5. SPECIFICATION ........................................................................................................................................46
5.1 MPEG-2 Systems Standard 46
5.1.1 Video T-STD 46
5.1.2 Audio T-STD 46
5.2 Identification of MPEG-2 Private Ranges 46
5.2.1 MPEG-2 Registration Descriptor 46
5.2.1.1 Program Identifier 46
5.2.1.2 Audio Elementary Stream Identifier 47
5.2.1.3 Other Program Element Identifiers 47
5.3 Audio Constraints 47
5.4 Constraints on PSI 47
5.5 PES Constraints 48
5.5.1 Video PES Constraints 49
5.5.2 Audio PES Constraints 49
5.6 Services and Features 49
5.6.1 System Information and Program Guide 49
5.6.1.1 System Information and Program Guide PID 49
5.6.1.2 System Information and Program Guide STD Model 50
5.6.2 Specification of ATSC Private Data 50
5.7 Assignment of Identifiers 50
5.7.1 Audio Stream Type 50
5.7.2 Video Stream Type 50
5.7.3 Descriptors 50
5.7.3.1 AC-3 Audio Descriptor 50
5.7.3.2 Program Smoothing Buffer Descriptor 51
5.7.3.3 ISO-639 Language Descriptor 51
5.7.3.4 ATSC Private Information Descriptor 52
5.7.4 PID Value Assignments 53
5
ATSC A/53E Digital Television Standard 27 December 2005
1. SCOPE .......................................................................................................................................................55
2. REFERENCES ...........................................................................................................................................55
2.1 Normative References 55
2.2 Informative References 55
4. ABBREVIATIONS ......................................................................................................................................56
6
ATSC A/53E Digital Television Standard 27 December 2005
5.5 Synchronization 74
5.5.1 Data Segment Sync 74
5.5.2 Data Field Sync 74
5.5.2.1 Definition of the PN511 Sequence 75
5.5.2.2 Definition of the PN63 Sequence 75
5.5.2.3 VSB Mode Bits 76
5.5.2.4 Reserved 77
5.5.2.5 Precode 77
5.5.2.6 Enhancement Signaling 77
5.6 Enhanced Mode Data Rates 77
5.6.1 Enhanced Stream Rate Limits 78
5.7 Enhanced Mode Map Bits 82
5.7.1 Map to Frame Count to Frame Association 83
5.7.2 Generation of Length of 64 Kerdock Codeword 84
5.7.3 Encoding Procedure 85
5.8 Enhanced Data Packing Structure 85
5.8.1 Placement of Enhanced Mode Packets 86
5.8.2 Packing of Enhanced Mode Data Within Packets 87
5.8.3 E-8VSB Enhancement Signaling 88
5.9 Modulation 88
5.9.1 Bit-to-Symbol Mapping 88
5.9.2 Pilot Addition 88
5.9.3 8 VSB Modulation Method 88
6. TRANSMISSION CHARACTERISTICS FOR HIGH DATA RATE MODE..................................................88
6.1 Overview 88
6.2 Channel Error Protection and Synchronization 90
6.2.1 Data Randomizer 90
6.2.2 Reed-Solomon Encoder 90
6.2.3 Interleaving 90
6.2.4 Data Segment Sync 91
6.2.5 Data Field Sync 91
6.3 Modulation 91
6.3.1 Bit-to-Symbol Mapping 91
6.3.2 Pilot Addition 91
6.3.3 16 VSB Modulation Method 91
7
ATSC A/53E Digital Television Standard 27 December 2005
1. FOREWORD...............................................................................................................................................93
2. HISTORICAL BACKGROUND...................................................................................................................93
2.1 Advisory Committee on Advanced Television Service (ACATS) 94
2.2 Digital HDTV Grand Alliance (Grand Alliance) 94
2.3 Organization for Documenting the Digital Television Standard 95
2.4 Principles for Documenting the Digital Television Standard 95
1. SCOPE .......................................................................................................................................................97
4. SYSTEM OVERVIEW.................................................................................................................................97
5. SPECIFICATION ........................................................................................................................................98
5.1 Constraints With Respect to ATSC Standard A/52B Annex E 98
5.2 Sampling Frequency 99
5.3 Frame Size 99
5.4 Audio Coding Modes 99
5.5 Dialogue Level 99
5.6 Dynamic Range Compression - Artistic 99
5.7 Dynamic Range Compression - Heavy 100
8
ATSC A/53E Digital Television Standard 27 December 2005
9
ATSC A/53E Digital Television Standard 27 December 2005
10
ATSC A/53E Digital Television Standard 27 December 2005
2. REFERENCES
3. DEFINITIONS
With respect to definition of terms, abbreviations, and units, the practice of the Institute of
Electrical and Electronics Engineers (IEEE) as outlined in the Institute’s published standards
shall be used [1]. Where an abbreviation is not covered by IEEE practice, or industry practice
11
ATSC A/53E Digital Television Standard 27 December 2005
differs from IEEE practice, then the abbreviation in question will be described in Section 3.4 of
this document. Many of the definitions included therein are derived from definitions adopted by
MPEG.
12
ATSC A/53E Digital Television Standard 27 December 2005
bit rate – The rate at which the compressed bit stream is delivered from the channel to the input
of a decoder.
block – A block is an 8-by-8 array of pel values or DCT coefficients representing luminance or
chrominance information.
Bps – Bits per second.
byte-aligned – A bit in a coded bit stream is byte-aligned if its position is a multiple of 8-bits
from the first bit in the stream.
channel – A digital medium that stores or transports a digital television stream.
coded representation – A data element as represented in its encoded form.
compression – Reduction in the number of bits used to represent an item of data.
constant bit rate – Operation where the bit rate is constant from start to finish of the
compressed bit stream.
CRC – The cyclic redundancy check to verify the correctness of the data.
data element – An item of data as represented before encoding and after decoding.
DCT – See discrete cosine transform.
decoded stream – The decoded reconstruction of a compressed bit stream.
decoder – An embodiment of a decoding process.
decoding (process) – The process defined in the Digital Television Standard that reads an input
coded bit stream and outputs decoded pictures or audio samples.
decoding time-stamp (DTS) – A field that may be present in a PES packet header that indicates
the time that an access unit is decoded in the system target decoder.
discrete cosine transform – A mathematical transform that can be perfectly undone and which
is useful in image compression.
DTS – See decoding time-stamp.
editing – A process by which one or more compressed bit streams are manipulated to produce a
new compressed bit stream. Conforming edited bit streams are understood to meet the
requirements defined in the Digital Television Standard.
elementary stream (ES) –A generic term for one of the coded video, coded audio, or other
coded bit streams. One elementary stream is carried in a sequence of PES packets with one
and only one stream_id.
elementary stream clock reference (ESCR) – A time stamp in the PES stream from which
decoders of PES streams may derive timing.
encoder – An embodiment of an encoding process.
encoding (process) – A process that reads a stream of input pictures or audio samples and
produces a valid coded bit stream as defined in the Digital Television Standard.
entropy coding – Variable length lossless coding of the digital representation of a signal to
reduce redundancy.
entry point – Refers to a point in a coded bit stream after which a decoder can become properly
initialized and commence syntactically correct decoding. The first transmitted picture after
an entry point is either an I-picture or a P-picture. If the first transmitted picture is not an I-
picture, the decoder may produce one or more pictures during acquisition.
13
ATSC A/53E Digital Television Standard 27 December 2005
14
ATSC A/53E Digital Television Standard 27 December 2005
motion vector – A pair of numbers which represent the vertical and horizontal displacement of a
region of a reference picture for prediction.
MP@HL – Main profile at high level.
MP@ML – Main profile at main level.
MPEG – Refers to standards developed by the ISO/IEC JTC1/SC29 WG11, Moving Picture
Experts Group. MPEG may also refer to the Group.
MPEG-2 – Refers to ISO/IEC standards 13818-1 (systems), 13818-2 (video), 13818-3 (Audio),
13818-4 (Compliance).
pack – A pack consists of a pack header followed by zero or more packets. It is a layer in the
system coding syntax.
packet data – Contiguous bytes of data from an elementary data stream present in the packet.
packet identifier (PID) – A unique integer value used to associate elementary streams of a
program in a single or multi-program transport stream.
packet – A packet consists of a header followed by a number of contiguous bytes from an
elementary data stream. It is a layer in the system coding syntax.
padding – A method to adjust the average length of an audio frame in time to the duration of the
corresponding PCM samples, by continuously adding a slot to the audio frame.
payload – Payload refers to the bytes which follow the header byte in a packet. For example, the
payload of a transport stream packet includes the PES_packet_header and its
PES_packet_data_bytes or pointer_field and PSI sections, or private data. A PES_packet_payload,
however, consists only of PES_packet_data_bytes. The transport stream packet header and
adaptation fields are not payload.
PCR – See program clock reference.
pel – See pixel.
PES packet header – The leading fields in a PES packet up to but not including the
PES_packet_data_byte fields where the stream is not a padding stream. In the case of a padding
stream, the PES packet header is defined as the leading fields in a PES packet up to but not
including the padding_byte fields.
PES packet – The data structure used to carry elementary stream data. It consists of a packet
header followed by PES packet payload.
PES stream – A PES stream consists of PES packets, all of whose payloads consist of data from
a single elementary stream, and all of which have the same stream_id.
PES – An abbreviation for packetized elementary stream.
picture – Source, coded, or reconstructed image data. A source or reconstructed picture consists
of three rectangular matrices representing the luminance and two chrominance signals.
PID – See packet identifier.
pixel – “Picture element” or “pel.” A pixel is a digital sample of the color intensity values of a
picture at a single point.
PMT – Program Map Table. The collection of all the TS_program_map_section()s.
predicted pictures or P-pictures or P-frames – Pictures that are coded with respect to the
nearest previous I or P-picture. This technique is termed forward prediction. P-pictures
15
ATSC A/53E Digital Television Standard 27 December 2005
provide more compression than I-pictures and serve as a reference for future P-pictures or B-
pictures. P-pictures can propagate coding errors when P-pictures (or B-pictures) are
predicted from prior P-pictures where the prediction is flawed.
presentation time-stamp (PTS) – A field that may be present in a PES packet header that
indicates the time that a presentation unit is presented in the system target decoder.
presentation unit (PU) – A decoded audio access unit or a decoded picture.
profile – A defined subset of the syntax specified in the MPEG-2 video coding specification
program clock reference (PCR) – A time stamp in the transport stream from which decoder
timing is derived.
program element – A generic term for one of the elementary streams or other data streams that
may be included in the program.
program specific information (PSI) – PSI consists of normative data which is necessary for the
demultiplexing of transport streams and the successful regeneration of programs.
program – A program is a collection of program elements. Program elements may be
elementary streams. Program elements need not have any defined time base; those that do
have a common time base and are intended for synchronized presentation.
PSI – See program specific information.
PTS – See presentation time-stamp.
PU – See presentation unit.
quantizer – A processing step which intentionally reduces the precision of DCT coefficients.
random access – The process of beginning to read and decode the coded bit stream at an
arbitrary point.
reserved – This term, when used in clauses defining the coded bit stream, indicates that the
value may be used in the future for Digital Television Standard extensions. Unless otherwise
specified within this Standard, all reserved bits shall be set to “1”.
SCR – See system clock reference.
scrambling – The alteration of the characteristics of a video, audio, or coded data stream in
order to prevent unauthorized reception of the information in a clear form. This alteration is a
specified process under the control of a conditional access system.
SDTV – See standard definition television.
slice – A series of consecutive macroblocks.
SMPTE – Society of Motion Picture and Television Engineers.
source stream – A single, non-multiplexed stream of samples before compression coding.
splicing – The concatenation performed on the system level or two different elementary streams.
It is understood that the resulting stream must conform totally to the Digital Television
Standard.
standard definition television (SDTV) – This term is used to signify a digital television system
in which the quality is approximately equivalent to that of NTSC. This equivalent quality
may be achieved from pictures sourced at the 4:2:2 level of ITU-R Recommendation 601 and
subjected to processing as part of the bit rate compression. The results should be such that
when judged across a representative sample of program material, subjective equivalence with
16
ATSC A/53E Digital Television Standard 27 December 2005
NTSC is achieved. Also called standard digital television. See also conventional definition
television and ITU-R Recommendation 1125.
start codes – 32-bit codes embedded in the coded bit stream that are unique. They are used for
several purposes including identifying some of the layers in the coding syntax. Start codes
consist of a 24 bit prefix (0x000001) and an 8 bit stream_id.
STD input buffer – A first-in, first-out buffer at the input of a system target decoder for storage
of compressed data from elementary streams before decoding.
STD – See system target decoder.
still picture – A coded still picture consists of a video sequence containing exactly one coded
picture which is intra-coded. This picture has an associated PTS and the presentation time of
succeeding pictures, if any, is later than that of the still picture by at least two picture
periods.
system clock reference (SCR) – A time stamp in the program stream from which decoder
timing is derived.
system header – The system header is a data structure that carries information summarizing the
system characteristics of the Digital Television Standard multiplexed bit stream.
system target decoder (STD) – A hypothetical reference model of a decoding process used to
describe the semantics of the Digital Television Standard multiplexed bit stream.
time-stamp – A term that indicates the time of a specific action such as the arrival of a byte or
the presentation of a presentation unit.
transport stream packet header – The leading fields in a Transport Stream packet up to and
including the continuity_counter field.
variable bit rate – Operation where the bit rate varies with time during the decoding of a
compressed bit stream.
VBV – See video buffering verifier.
video buffering verifier (VBV) – A hypothetical decoder that is conceptually connected to the
output of an encoder. Its purpose is to provide a constraint on the variability of the data rate
that an encoder can produce.
video sequence – A video sequence is represented by a sequence header, one or more groups of
pictures, and an end_of_sequence code in the data stream.
8 VSB – Vestigial sideband modulation with 8 discrete amplitude levels.
16 VSB – Vestigial sideband modulation with 16 discrete amplitude levels.
17
ATSC A/53E Digital Television Standard 27 December 2005
18
ATSC A/53E Digital Television Standard 27 December 2005
bslbf Bit string, left bit first, where “left” is the order in which bit strings are written in the Standard. Bit
strings are written as a string of 1s and 0s within single quote marks, e.g. ‘1000 0001’. Blanks within
a bit string are for ease of reading and have no significance.
uimsbf Unsigned integer, most significant bit first.
while ( condition ) { If the condition is true, then the group of data elements occurs next in the data stream.
data_element This repeats until the condition is not true.
...
}
do { The data element always occurs at least once. The data element is repeated until the
data_element condition is not true.
...}
while ( condition )
if ( condition) { If the condition is true, then the first group of data elements occurs next in the data
data_element stream.
...
}
19
ATSC A/53E Digital Television Standard 27 December 2005
else { If the condition is not true, then the second group of data elements occurs next in the
data_element data stream.
...
}
for (i = 0;i<n;i++) { The group of data elements occurs n times. Conditional constructs within the group of
data_element data elements may depend on the value of the loop control variable i, which is set to
... zero for the first occurrence, incremented to 1 for the second occurrence, and so forth.
}
As noted, the group of data elements may contain nested conditional constructs. For
compactness, the {} are omitted when only one data element follows.
data_element [ ] data_element [ ] is an array of data. The number of data elements is indicated by the
context.
data_element [n] data_element [n] is the n+1th element of an array of data.
data_element [m] [n] data_element [m] [n] is the m+1,n+1 th element of a two-dimensional array of data.
data_element [l] [m] [n] data_element [l] [m] [n] is the l+1,m+1,n+1 th element of a three-dimensional array of
data.
data_element [m..n] data_element [m..n] is the inclusive range of bits between bit m and bit n in the
data_element.
Decoders must include a means to look for start codes and sync bytes (transport stream) in
order to begin decoding correctly, and to identify errors, erasures or insertions while decoding.
The methods to identify these situations, and the actions to be taken, are not standardized.
3.4.8.1 Definition of bytealigned Function
The function bytealigned( ) returns 1 if the current position is on a byte boundary; that is, the next
bit in the bit stream is the first bit in a byte. Otherwise it returns 0.
3.4.8.2 Definition of nextbits Function
The function nextbits( ) permits comparison of a bit string with the next bits to be decoded in the
bit stream.
3.4.8.3 Definition of next_start_code Function
The next_start_code( ) function removes any zero bit and zero byte stuffing and locates the next
start code.
This function checks whether the current position is byte-aligned. If it is not, 0 stuffing bits
are present. After that any number of 0 bytes may be present before the start-code. Therefore
start-codes are always byte-aligned and may be preceded by any number of 0 stuffing bits.
20
ATSC A/53E Digital Television Standard 27 December 2005
4. SYSTEM OVERVIEW
The Digital Television Standard describes a system designed to transmit high quality video and
audio and ancillary data over a single 6 MHz channel. The system can deliver reliably about 19
Mbps of throughput in a 6 MHz terrestrial broadcasting channel and about 38 Mbps of
throughput in a 6 MHz cable television channel. This means that encoding a video source whose
resolution can be five times that of conventional television (NTSC) resolution requires a bit rate
reduction by a factor of 50 or higher. To achieve this bit rate reduction, the system is designed to
be efficient in utilizing available channel capacity by exploiting complex video and audio
compression technology.
The objective is to maximize the information passed through the data channel by minimizing
the amount of data required to represent the video image sequence and its associated audio. The
objective is to represent the video, audio, and data sources with as few bits as possible while
preserving the level of quality required for the given application.
Although the RF/transmission subsystems described in this Standard are designed
specifically for terrestrial and cable applications, the objective is that the video, audio, and
service multiplex/transport subsystems be useful in other applications.
1
ITU-R Document TG11/3-2, “Outline of Work for Task Group 11/3, Digital Terrestrial
Television Broadcasting,” June 30, 1992.
21
ATSC A/53E Digital Television Standard 27 December 2005
Video
Video Source Coding
and Compression
Channel
Transport
Audio Subsystem Coding
Audio
Service Multiplex
Audio Source Coding
and Compression
Modulation
Ancillary Data
Control Data
Receiver Characteristics
“Source coding and compression” refers to the bit rate reduction methods, also known as data
compression, appropriate for application to the video, audio, and ancillary digital data streams.
The term “ancillary data” includes control data, conditional access control data, and data
associated with the program audio and video services, such as closed captioning. “Ancillary
data” can also refer to independent program services. The purpose of the coder is to minimize
the number of bits needed to represent the audio and video information. The digital television
system employs the MPEG-2 video stream syntax for the coding of video and the Digital Audio
Compression (AC-3) Standard for the coding of audio.
“Service multiplex and transport” refers to the means of dividing the digital data stream into
“packets” of information, the means of uniquely identifying each packet or packet type, and the
appropriate methods of multiplexing video data stream packets, audio data stream packets, and
ancillary data stream packets into a single data stream. In developing the transport mechanism,
interoperability among digital media, such as terrestrial broadcasting, cable distribution, satellite
distribution, recording media, and computer interfaces, was a prime consideration. The digital
television system employs the MPEG-2 transport stream syntax for the packetization and
multiplexing of video, audio, and data signals for digital broadcasting systems (ISO/IEC 13818-
1) [3]. The MPEG-2 transport stream syntax was developed for applications where channel
bandwidth or recording media capacity is limited and the requirement for an efficient transport
mechanism is paramount. It was designed also to facilitate interoperability with the ATM
transport mechanism.
“RF/transmission” refers to channel coding and modulation. The channel coder takes the data
bit stream and adds additional information that can be used by the receiver to reconstruct the
22
ATSC A/53E Digital Television Standard 27 December 2005
data from the received signal which, due to transmission impairments, may not accurately
represent the transmitted signal. The modulation (or physical layer) uses the digital data stream
information to modulate the transmitted signal. The modulation subsystem offers two modes: a
terrestrial broadcast mode (8 VSB), and a high data rate mode (16 VSB).
Figure 5.2 illustrates a high level view of encoding equipment. This view is not intended to
be complete, but is used to illustrate the relationship of various clock frequencies within the
encoder. There are two domains within the encoder where a set of frequencies are related, the
source coding domain and the channel coding domain.
Video In
Video fTP FEC and fsym RF Out
A/D Transport VSB
Encoder Encoder Sync
Modulator
Insertion
Audio In
Audio
A/D Encoder
The source coding domain, represented schematically by the video, audio, and transport
encoders, uses a family of frequencies which are based on a 27 MHz clock (f27MHz). This clock is
used to generate a 42-bit sample of the frequency which is partitioned into two parts defined by
the MPEG-2 specification. These are the 33-bit program_clock_reference_base and the 9-bit
program_clock_reference_extension. The former is equivalent to a sample of a 90 kHz clock which is
locked in frequency to the 27 MHz clock, and is used by the audio and video source encoders
when encoding the presentation time stamp (PTS) and the decode time stamp (DTS). The audio
and video sampling clocks, fa and fv respectively, must be frequency-locked to the 27 MHz clock.
This can be expressed as the requirement that there exist two pairs of integers, (na, ma) and (nv,
mv), such that:
na
fa = × 27 MHz
ma
and
nv
fv = × 27 MHz
mv
23
ATSC A/53E Digital Television Standard 27 December 2005
The channel coding domain is represented by the FEC/Sync Insertion subsystem and the
VSB modulator. The relevant frequencies in this domain are the VSB symbol frequency (fsym)and
the frequency of the transport stream (fTP) which is the frequency of transmission of the encoded
transport stream. These two frequencies must be locked, having the relation:
188 312
ftp = 2 × f
208 313 sym
The signals in the two domains are not required to be frequency-locked to each other, and in
many implementations will operate asynchronously. In such systems, the frequency drift can
necessitate the occasional insertion or deletion of a null packet from within the transport stream,
thereby accommodating the frequency disparity.
The annexes that follow consider the characteristics of the subsystems necessary to
accommodate the services envisioned.
24
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
Annex A:
Video System Characteristics (Normative)
1. SCOPE
This annex describes the characteristics of the video subsystem of the Digital Television
Standard. The input formats and bit stream characteristics are described in separate sections.2
2. REFERENCES
2
Note that there is a coordinated effort underway among ATSC, CEA, and SMPTE to revise and
clarify standards related to delivering closed captions, AFD, and bar data so that each
describes the aspects of the system for which they are primarily responsible without overlap.
This effort is expected to result in revisions of those sections in the ATSC Standards.
25
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
[A9] SMPTE 296M (2001): Standard for Television—1280 x 720 Progressive Image Sample
Structure, Analog and Digital Representation and Analog Interface, Society of Motion
Picture and Television Engineers, White Plains, N.Y.
[A10] ETSI TS 101 154 V1.7.1, Digital Video Broadcasting (DVB): Implementation
Guidelines for the use of MPEG-2 Systems, Video and Audio in Satellite, Cable and
Terrestrial Broadcasting Applications, Annex B, June 2005.
3. COMPLIANCE NOTATION
As used in this document, “shall” denotes a mandatory provision of the standard. “Should”
denotes a provision that is recommended but not mandatory. “May” denotes a feature whose
presence does not preclude compliance, that may or may not be present at the option of the
implementor.
The compression formats may be derived from one or more appropriate video input formats.
It may be anticipated that additional video production standards will be developed in the future
that extend the number of possible input formats.
26
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
Main Profile at High Level.3 Additionally, all bit streams shall meet the constraints and
specifications described in Sections 5.1 and 5.2.
The allowable values for the field bit_rate_value are application-dependent. In the primary
application of terrestrial broadcast, this field shall correspond to a bit rate which is less than or
equal to 19.4 Mbps. In the high data rate mode, the corresponding bit rate is less than or equal to
38.8 Mbps.
5.1.2 Compression Format Constraints
Table A3 lists the allowed compression formats.
3
See ISO/IEC 13818-2 [A3], Section 8 for more information regarding profiles and levels.
27
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
Note: The profile_and_level_indication field shall indicate the lowest profile and level defined in
ISO/IEC 13818-2 [A3], Section 8, that is consistent with the parameters of the video elementary
stream.
5.1.4 Sequence Display Extension Constraints
Table A5 identifies parameters in the sequence display extension part of a bit stream that shall be
constrained by the video subsystem and lists the allowed values for each.
4
Note that 1088 lines are actually coded in order to satisfy the MPEG-2 requirement that the
coded vertical size be a multiple of 16 (progressive scan) or 32 (interlaced scan). The bottom
8 lines are black, per MPEG rules.
28
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
5
Implementers should note that CEA-708-B [A1] describes the semantics for closed captions.
Additional information on implementation of captions may be found in CEA-CEB-10-A
[A15].
29
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
In accordance with the bit stream syntax in Table A6, more than one picture user data
construct may follow any given picture header. However, no more than one picture user data
construct using the same user_data_type_code shall follow any given picture header.
Note that picture user data with a 32-bit field following user_data_start_code having a value
other than ATSC_identifier may be present in an ATSC-compliant video bit stream. As an example,
the afd_identifier (value 0x44544731) is defined for use in ATSC video Elementary Streams (see
Section 5.2.4). Receiving devices are expected to process this field and use it to determine the
syntax and semantics of the user data construct to follow.
6
Shaded cells in this table indicate syntactic and semantic additions to the ISO/IEC 13818-2
[A3] Standard. Note: user_data_type_code values 0x04 and 0x05 are assigned in ANSI/SCTE
21 2001 [A14].
30
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
Receiving devices are expected to silently discard any unrecognized video user data
encountered in the video bit stream. For example, if an unrecognized 32-bit identifier is seen
following the user_data_start_code, or an unrecognized 8-bit user_data_type_code is seen following
the ATSC_identifier, data should be discarded until another start code is seen.
5.2.3 ATSC Picture User Data Semantics
user_data_start_code – This is set to 0x0000 01B2.
ATSC_identifier – This is a 32 bit code that indicates that the video user data conforms to this
specification. The value ATSC_identifier shall be 0x4741 3934.
user_data_type_code – An 8-bit value that identifies the type of ATSC user data to follow. Value
0x03 indicates cc_data(), value 0x06 indicates bar_data(), and other values are either in use in
other standards or are reserved for future use.
cc_data() – A data structure defined in Table A8.
bar_data() – A data structure indicating the sizes of letterbox or pillarbox areas within the coded
video frame .
ATSC_reserved_user_data – Reserved for use by ATSC or used by other standards.
31
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
process_cc_data_flag – This flag is set to indicate whether it is necessary to process the cc_data. If it
is set to ‘1’, the cc_data has to be parsed and its meaning has to be processed. When it is set to
‘0’, the cc_data can be discarded.
additional_data_flag – This flag is set to ‘1’ to indicate the presence of additional user data.
cc_count: This 5-bit integer indicates the number of closed caption constructs following this field.
It can have values 0 through 31. The value of cc_count shall be set according to the frame rate
and coded picture structure (field or frame) such that a fixed bandwidth of 9600 bits per
second is maintained for the closed caption payload data. Sixteen (16) bits of closed caption
payload data are carried in each pair of the fields cc_data_1 and cc_data_2.
cc_valid – This flag is set to ‘1’ to indicate that the two closed caption data bytes that follow are
valid. If set to ‘0’ the two data bytes are invalid, as defined in CEA-708-B [A1].
cc_type – Denotes the type of the two closed caption data bytes that follow, as defined in CEA-
708-B [A1].
cc_data_1 – The first byte of a closed caption data pair as defined in CEA-708-B [A1].
cc_data_2 – The second byte of a closed caption data pair as defined in CEA-708-B [A1].
additional_cc_data – Reserved for future ATSC definition.
7
In other words, the video is letterboxed (bars above and/or below video) or pillarboxed (bars
left and/or right of video).
32
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
top_bar_flag – This flag shall indicate, when set, that the top bar data is present.
33
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
bottom_bar_flag – This flag shall indicate, when set, that the bottom bar data is present.
left_bar_flag – This flag shall indicate, when set, that the left bar data is present.
right_bar_flag – This flag shall indicate, when set, that the right bar data is present.
line_number_end_of_top_bar – A 14-bit unsigned integer value representing the last line of a
horizontal letterbox bar area at the top of the reconstructed frame. Designation of line
numbers shall be as defined in Table A10.
line_number_start_of_bottom_bar – A 14-bit unsigned integer value representing the first line of a
horizontal letterbox bar area at the bottom of the reconstructed frame. Designation of line
numbers shall be as defined in Table A10.
pixel_number_end_of_left_bar – A 14-bit unsigned integer value representing the last horizontal
luminance sample of a vertical pillarbar area at the left side of the reconstructed frame.
Pixels shall be numbered from zero, starting with the leftmost pixel.
pixel_number_start_of_right_bar – A 14-bit unsigned integer value representing the first horizontal
luminance sample of a vertical pillarbar area at the right side of the reconstructed frame.
Pixels shall be numbered from zero, starting with the leftmost pixel.
additional_bar_data – Reserved for future ATSC definition.
34
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
The combination of source aspect ratio and active_format allows the decoder to identify
whether the “area of interest” is the whole of the frame (e.g. source aspect ratio 16:9, active_format
16:9 center), a letterbox within the frame (e.g. source aspect ratio 4:3, active_format 16:9 center),
or a “pillarbox” within the frame (e.g. source aspect ratio 16:9, active_format 4:3 center).
Table A12 defines the coding of the active_format field that shall be used.
35
ATSC A/53E Digital Television Standard, Annex A 27 December 2005
Use of ‘0010’, ‘0011’, or ‘0100’ is not recommended. Illustrations of the various values of
may be found in ETSI TS 101 154 V1.7.1 Table B.3 [A10]. Users are strongly
active_format
encouraged to consult this reference.
5.2.4.3 Recommended Receiver Response to AFD
Receiving device designers are strongly encouraged to study the suite of Consumer Electronics
Association (CEA) bulletins which contain the most recent recommendations regarding the
processing of AFD.
5.2.5 Relationship Between Bar Data and AFD (Informative)
Any combination of Active Format Description and bar data may be present in video user data
(either, neither, or both). Note that AFD data may not always exactly match bar data because
AFD only deals with 4:3, 14:9, and 16:9 aspect ratios while bar data can accurately represent
nearly any aspect ratio.
36
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
Annex B:
Audio System Characteristics (Normative)
1. SCOPE
This annex describes the audio system characteristics and normative specifications of the Digital
Television Standard.
2. NORMATIVE REFERENCES
The following documents contain provisions which in whole or part, through reference in this
text, constitute provisions of this standard. At the time of publication, the editions indicated were
valid. All standards are subject to revision and amendment, and parties to agreement based on
this standard are encouraged to investigate the possibility of applying the most recent editions of
the documents listed below.
[B1] AES3-2003, “AES Recommended practice for digital audio engineering—Serial
transmission format for two-channel linearly represented digital audio data,” (revision of
AES3-1992, including subsequent amendments).
[B2] ANSI S1.4-1983 (R 2001) with Amd.S1.4A-1995, “Specification for Sound Level
Meters.”
[B3] ATSC A/52B: “Digital Audio Compression (AC-3, E-AC-3),” Advanced Television
Systems Committee, Washington, D.C., 14 June 2005.
3. COMPLIANCE NOTATION
As used in this document, “shall” denotes a mandatory provision of the standard. “Should”
denotes a provision that is recommended but not mandatory. “May” denotes a feature whose
presence does not preclude compliance, that may or may not be present at the option of the
implementor.
4. SYSTEM OVERVIEW
As illustrated in Figure B1, the audio subsystem comprises the audio encoding/decoding
function and resides between the audio inputs/outputs and the transport subsystem. The audio
encoder(s) is (are) responsible for generating the audio elementary stream(s) which are encoded
representations of the baseband audio input signals. At the receiver, the audio subsystem is
responsible for decoding the audio elementary stream(s) back into baseband audio.
37
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
Audio
Elementary Transport VSB RF
Stream(s) Packets Transmission
Specified
in this Channel
Annex
Receiver Receiver
Reconstructed Audio
Transport Transmission
Audio Decoder(s)
Subsystem Subsystem
5. SPECIFICATION
This Section forms the normative specification of the audio system. The audio compression
system shall conform with the Digital Audio Compression (AC-3) Standard, subject to the
constraints outlined in this Section.
38
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
39
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
which may (individually) be coded into each elementary stream. Each AC-3 elementary stream
is tagged as to its service type using the bsmod bit field. There are two types of main service and
six types of associated service. Each associated service may be tagged (in the AC-3 audio
descriptor in the transport PSI data) as being associated with one or more main audio services.
Each AC-3 elementary stream may also be tagged with a language code.
Associated services may contain complete program mixes, or may contain only a single
program element. Associated services which are complete mixes may be decoded and used as is.
They are identified by the full_svc bit in the AC-3 descriptor (see A/52B, Annex A [B3]).
Associated services which contain only a single program element are intended to be combined
with the program elements from a main audio service.
This section specifies the meaning and use of each type of service. In general, a complete
audio program (what is presented to the listener over the set of loudspeakers) may consist of a
main audio service, an associated audio service that is a complete mix, or a main audio service
combined with an associated audio service. The capability to simultaneously decode one main
service and one associated service is required in order to form a complete audio program in
certain service combinations described in this section. This capability may not exist in some
receivers.
40
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
41
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
service with the left channel of the D service, and mixing the right channel of the ME service
with the right channel of the D service. The result will be a two channel stereo signal containing
music, effects, and dialogue.
Audio in multiple languages may be provided by supplying multiple D services (each in a
different language) along with a single ME service. This is more efficient than providing
multiple CM services, but, in the case of more than two audio channels in the ME service,
requires that dialogue be restricted to the center channel.
Some receivers may not have the capability to simultaneously decode an ME and a D
service.
42
ATSC A/53E Digital Television Standard, Annex B 27 December 2005
center channel of the main audio service. The dynamic range control signal in the VO service is
intended to be used by the audio decoder to modify the level of the main audio program. Thus
the level of the main audio service may be controlled by the broadcaster, and the broadcaster
may signal the decoder (by altering the dynamic range control words embedded in the VO audio
elementary stream) to reduce the level of the main audio service by up to 24 dB during the
voice-over.
Some receivers may not have the capability to simultaneously decode and reproduce a voice-
over service along with a program audio service.
43
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
Annex C:
Service Multiplex and Transport Subsystem Characteristics
(Normative)
1. SCOPE
This annex constitutes the normative specification for the transport subsystem of the Digital
Television Standard. The syntax and semantics of this specification conform to ISO/IEC 13818-
1 [C3], with additional constraints and conditions specified in this Standard. Within this context,
other ATSC Standards may further constrain and/or supplement the transport subsystem
specification.8
2. NORMATIVE REFERENCES
The following documents contain provisions which in whole or in part, through reference in this
text, constitute provisions of this Standard. At the time of publication, the editions indicated
were valid. For references not including a publication date, the most recent published version
shall apply. All standards are subject to revision and amendment, and parties to agreements
based on this Standard are encouraged to investigate the possibility of applying the most recent
editions of the documents listed below.
3. COMPLIANCE NOTATION
As used in this document, “shall” denotes a mandatory provision of the standard. “Should”
denotes a provision that is recommended but not mandatory. “May” denotes a feature whose
presence does not preclude compliance that may or may not be present at the option of the
implementer. Note: this compliance notation applies only to this annex.
8
Note that there is a coordinated effort underway among ATSC, CEA, and SMPTE to revise and
clarify standards related to delivering closed captions, AFD, and bar data so that each
describes the aspects of the system for which they are primarily responsible without overlap.
This effort is expected to result in revisions of those sections in the ATSC Standards.
44
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
4. SYSTEM OVERVIEW
The transport format and protocol for the Digital Television Standard is a compatible subset of
the MPEG-2 Systems specification defined in ISO/IEC 13818-1 [C3]. It is based on a fixed-
length packet transport stream approach which has been defined and optimized for digital
television delivery applications.
As illustrated in Figure C1, the transport subsystem resides between the application (e.g.,
audio or video) encoding and decoding functions and the transmission subsystem. The encoder’s
transport subsystem is responsible for formatting the coded elementary streams and multiplexing
the different components of the program for transmission. At the receiver, it is responsible for
recovering the elementary streams for the individual application decoders and for the
corresponding error signaling. The transport subsystem also incorporates other higher protocol
layer functionality related to synchronization of the receiver.
Transmitter
(video, audio, data, etc.)
Sources for encoding
packetization and
multiplexing
Application
Transport
Encoders
* Modu-
* lator
*
* Transport
Stream
elementary
streams, private
sections, or PES
Transmission
Clock
Format
Receiver
depacketization and
demultiplexing
Presentation
Application
Transport
Decoders
Demod-
*
* ulator
* Transport
*
elementary Stream with
streams, private error
sections, or PES signaling
with error
signaling
clock
control
Clock
45
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
layer, one or more single program transport streams are combined to form a service multiplex of
programs (also known as a multi-program transport stream in the MPEG-2 Systems standard,
and a Digital Television Standard multiplexed bit stream in this ATSC standard). Program
Specific Information (PSI), carried within TS packets, relates to the identification of programs
and the components of each program.
Not shown explicitly in Figure C1, but essential to the practical implementation of this
Standard, is a control system that manages the transfer and processing of the elementary streams
from the application encoders. The rules followed by this control system are not a part of this
Standard. The output of the control system implementation shall conform to the MPEG-2
Transport Stream coding as specified in ISO/IEC 13818-1 [C3] with the additional constraints
specified in this Standard.
5. SPECIFICATION
This section of the Standard describes the coding constraints that apply to the use of the MPEG-
2 systems specification in the digital television system.
46
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
9
The ISO/IEC-designated registration authority for the format_identifier is SMPTE Registration
Authority, LLC. See (http://www.smpte-ra.org/html).
47
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
48
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
49
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
50
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
2) The 6-bit field for Bit Rate Code shall have a value in the range ‘000000’ through
‘001111’ or ‘100000’ through ‘101111’; i.e. signaling a bit rate less than or equal to 448
kbps.
3) The num_channels field shall have a value in the range 1 to 13.
4) The langcod field is a reserved field. Audio language shall be indicated using an ISO-639
Language Descriptor (see Sec. 5.7.3.3 below).
5) The descriptor shall identify the type of the audio service in the bsmod field, which shall
be the same as the bsmod field in the elementary stream associated with this descriptor.
6) The descriptor may optionally carry a 3-byte language code that is represented per ISO-
639. If this language code is present in the AC-3 Descriptor, it shall match the language
code carried in the ISO-639 Language Descriptor, if present. The descriptor may
terminate at one of two places:
a) if describing a main audio service for which no associated services are available,
immediately following full_svc; otherwise
b) immediately prior to textlen.
Effective 1 March 2008, audio language shall be indicated by including the optional ISO-639
Language bytes within the AC-3_audio_stream_descriptor(), at which point the use of the ISO-639
Language Descriptor to indicate language shall be optional, but recommended to support legacy
devices requiring the ISO-639_language_descriptor().
Informative note: receiving devices are expected to use the bsmod (bit stream
mode) field in the AC-3_audio_stream_descriptor() to determine the type of each audio
stream rather than the audio_type field in the ISO_639_language_descriptor().
51
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
The audio_type field in any ISO_639_language_descriptor() used in this Standard shall be set to
0x00 (meaning “undefined”).
An ISO_639_language_descriptor() may be present in the TS_program_map_section() in other
positions as well, for example to indicate the language or languages of a textual data service
program element.
5.7.3.4 ATSC Private Information Descriptor
The ATSC_private_information_descriptor() provides a method to carry and unambiguously label
private information. More than one ATSC_private_information_descriptor() may appear within a single
descriptor loop. Table C1 defines the bit-stream syntax of the ATSC_private_information_descriptor().
52
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
Elementary Streams for which the transport_scrambling_control field does not exclusively have
the value of ‘00’ for the duration of the program must carry a CA_descriptor in accordance with
Section 2.6.16 of ISO/IEC 13818-1 [C3].
The implementation of a digital television delivery system that employs conditional access
will require the specification of additional data streams and system constraints.
53
ATSC A/53E Digital Television Standard, Annex C 27 December 2005
The symbol rate Sr in Msymbols per second for the transmission subsystem (see Section 5 of
Annex D) is:
⎛ 684 ⎞
Sr =⎜ ⎟×4.5=10.76...Msymbols per second
⎝ 286 ⎠
54
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Annex D:
RF/Transmission System Characteristics (Normative)
1. SCOPE
This annex describes the characteristics of the RF/Transmission subsystem, which is referred to
as the VSB subsystem, of the Digital Television Standard. The VSB subsystem offers two major
modulation methods: a terrestrial broadcast method (8 VSB), and a high data rate method (16
VSB). These are described in separate sections of this document.
The 8-VSB modulation method is further defined by a set of required elements and various
combinations of optional elements. The set of required elements is called the Main mode. Main
service data is protected by the ‘Main’ Forward Error Correction (FEC) system and is sent using
mandatory training sequences.
The optional enhancements add additional forward error correction coding layers to the data
before sending the data via a constrained version of 8-VSB called Enhanced 8-VSB (E8-VSB).
Various coding rate options are defined, and the payload assignment between the Enhanced 8-
VSB and the Main Mode data is selectable at discretely defined values.
These Modes shall only be used in the defined combinations.
Accordingly, the 8 VSB transmission system offers optional sub-modes of operation that
trade-off data rate for performance. The optional modes facilitate receiver operation in certain
propagation conditions, with the degree of additional enhancements selected by the broadcaster
with an acceptance of a reduction in the payload of the Main Service. The optional modes are
designed to avoid impact on any remaining payload in the Main Service.
2. REFERENCES
3. COMPLIANCE NOTATION
As used in this document, “shall” denotes a mandatory provision of the standard. “Should”
denotes a provision that is recommended but not mandatory. “May” denotes a feature whose
presence does not preclude compliance that may or may not be present at the option of the
implementer.
55
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
4. ABBREVIATIONS
FEC – forward error correction
MPEG – Moving Pictures Experts Group
MUX – multiplexer
PCR – Program Clock Reference
RS – Reed-Solomon (error correction coding)
VSB – vestigial sideband modulation
O p tio n a l
D a ta R eed D a ta P re - RF
Random - S o lo m o n T r e llis SYNC P i lo t VSB
In t e r - e q u a liz e r Up-
iz e r Encoder Encoder MUX In s e r t io n M o d u la t o r
le a v e r F il t e r C o n v e rte r
S eg m en t S yn c
F ie ld S y n c
10
Note that the optional pre-equalizer and RF up-converter blocks are implementation dependent
and not addressed in this Annex.
56
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Optional
8-VSB
Optional Reed - Data Trellis Pre - RF
Data Solomon Inter- Pilot VSB Up-
coder equalizer
Randomizer Encoder leaver Insertion Modulator Converter
(12-way Filter
interleave) SYNC
MUX
Segment Sync
NOTE
MPEG DATA
Field Sync
SYNC
Field Sync
carried throughout
Map
MPEG E8-VSB
streams Convolutional
coder Data R-S
Main and Enhanced Data RS Data (12-way Byte Bytes
Mux Packet Processor Randomizer Encoder Byte interleave / De - Delete
M/E Interleaver de-interleave) Interleaver
Bytes
Flag
57
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
encoder, and byte interleaver that are identical to the ones used for Main-data-only transmission.
The data shall then be processed by an E8-VSB convolutional trellis coder, which shall have two
modes of operation for the Enhanced and Main data respectively, controlled by the
Main/Enhanced (M/E) flag.
The Enhanced convolutional processing, as specified below, shall result in replacement of
the placeholder bits in the expanded Enhanced stream packets (including the additional RS FEC
data). The symbols at the output of the E8-VSB convolutional coder shall be de-interleaved at
the 12:1 trellis level and then at the byte level. The output of this stage is groups of bytes with
appended erroneous RS coded parity bytes, due to changes in the symbol data produced by the
E8-VSB convolutional trellis coder. The erroneous RS coded parity bytes shall be deleted and
replaced by correct bytes generated by the following RS encoder (top row of Figure D5.2). A
data de-randomizer /randomizer pair may be inserted at this point to reformulate packets that are
transmitted by a SMPTE 310 link.
In case the upper and lower rows of processing in Figure D5.2 are physically separated (as
by a SMPTE 310 link [D3]), means will be required to transmit the map data and the field sync
timing across that link from the multiplexer (lower left of Figure D5.2) to the insertion mux
(upper row of Figure D5.2). One such method is covered in ATSC Standard A/110A [D2]. If the
SMPTE 310 link is used, then the embedded map and synchronization information is decoded
and is sent to the transmitter functional blocks as shown in the upper row of Figure D5.2. Data to
signal the placement of Enhanced data segments within the data field (“map data”) shall be
transmitted during the data field sync segment, as described below (see Section 5.7).
11
Note that E8-VSB data includes additional RS coding and additional trellis coding. The packet
expansion by a ratio of 1:2 or 1:4 for Rate one-half or one-fourth respectively. Due to this
expansion, an MPEG-2 packet that is input to the enhanced stream extends over a non-integral
multiple of normal segments before being dispersed by interleaving. The Data Segment Sync is
MPEG-2 sync byte of the enhanced payload.
58
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Two-thirds rate trellis coding requires 3/2 x 1656 bits = 2484 bits.
The exact symbol rate is given by Equation 1 below:
(1) Sr (MHz) = 4.5/286 x 684 = 10.76... MHz
The frequency of a Data Segment is given in Equation 2 below:
(2) fseg= Sr / 832 = 12.94... X 103 Data Segments/s.
The Data Frame rate is given by Equation (3) below:
(3) fframe = fseg/626 = 20.66 ... frames/s.
The symbol rate Sr and the transport rate Tr (see Section 7.2 of Annex C) shall be locked to each
other in frequency.
4 828 Symbols
Field Sync #1
S
313 e Data + FEC 24.2
Segments g ms
m
e
n
t
Field Sync #2
S
y
n
c
313 Data + FEC
24.2
Segments ms
Test Segment
1 Segment
= 77.3 us
The 8-level symbols combined with the binary Data Segment Sync and Data Field Sync
signals shall be used to suppressed-carrier modulate a single carrier. Before transmission,
however, most of the lower sideband shall be removed. The resulting spectrum is flat, except for
the band edges where a nominal square root raised cosine response results in 620 kHz transition
regions. The nominal VSB transmission spectrum is shown in Figure D5.4.
At the suppressed-carrier frequency, 310 kHz from the nominal lower band edge, a small
pilot shall be added to the signal.
59
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
1.
.7
P il o t
0
S u p p re s s e d
C a r r ie r
.3 1 5 .3 8 M H z .3 1
6 .0 M H z
60
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
D0 D1 D2 D3 D4 D5 D6 D7
The generator is shifted with the Byte Clock and one 8 bit Byte
of data is extracted per cycle.
61
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
i = 2t- 1
i=0
62
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
1
M(=4Bytes)
2
From 2M To
3
Reed-Solomon Pre-Coder and
Encoder Trellis Encoder
(B-2)M
51
(B-1)M
(B=)52
63
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Interference Filter
Pre-coder Trellis Encoder 8-Level Symbol Mapper
X2 Y2 Z2 MAP
+
Z2 Z1 Z0 R
000 -7
D 001 -5
010 -3 R
X1 Y1 Z1 011 -1
100 +1
Z0 101 +3
D + D 110 +5
111 +7
(D = 12 Symbols Delay)
Figure D5.8 Main Service trellis encoder, precoder, and symbol mapper.
The output multiplexer shown (as the right circle) in Figure D5.9 shall advance by four
symbols on each segment boundary. However, the state of the trellis encoder shall not be
advanced. The data coming out of the multiplexer shall follow normal ordering from encoder 0
through 11 for the first segment of the frame, but on the second segment the order changes and
symbols are read from encoders 4 through 11, and then 0 through 3. The third segment reads
from encoder 8 through 11 and then 0 through 7. This three-segment pattern shall repeat through
the 312 Data Segments of the frame. Table D5.1 shows the interleaving sequence for the first
three Data Segments of the frame.
64
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Pre-Coder
& Trellis
Encoder
#0
Pre-Coder
& Trellis
Encoder
#1 Pre-Coded & Trellis Encoded
Data Out
Interleaved Pre-Coder
To Mapper
Data & Trellis
In Encoder
#2
Pre-Coder
& Trellis
Encoder
#9
Pre-Coder
& Trellis
Encoder
#10
Pre-Coder
& Trellis
Encoder
#11
After the Data Segment Sync is inserted, the ordering of the data symbols is such that
symbols from each encoder occur at a spacing of twelve symbols.
A complete conversion of parallel bytes to serial bits needs 828 bytes to produce 6624 bits.
Data symbols are created from 2 bits sent in MSB order, so a complete conversion operation
yields 3312 data symbols, which corresponds to 4 segments of 828 data symbols. 3312 data
symbols divided by 12 trellis encoders gives 276 symbols per trellis encoder. 276 symbols
divided by 4 symbols per byte gives 69 bytes per trellis encoder.
The conversion starts with the first segment of the field and proceeds with groups of 4
segments until the end of the field. 312 segments per field divided by 4 gives 78 conversion
operations per field.
65
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Allowing for segment sync the input to 4 encoders is skipped but the encoders cycle with no
input. The input is held until the next multiplex cycle and then fed to the correct encoder.
Table D5.2 details the byte to symbol conversion and the associated multiplexing of the
trellis encoders. Segment 0 is the first segment of the field. The pattern repeats every 12
segments; segments 5 through 11 are not shown.
66
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
96 0 24 7,6 ... ... ... ... ... ... ... ... ... ... ... ...
97 1 25 7,6 ... ... ... ... ... ... ... ... ... ... ... ...
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
767 11 191 1,0 ... ... ... ... ... ... ... ... ... ... ... ...
768 0 192 7,6 ... ... ... ... ... ... ... ... ... ... ... ...
769 1 193 7,6 ... ... ... ... ... ... ... ... ... ... ... ...
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
815 11 203 1,0 3 419 7,6 7 623 5,4 11 827 3,2 ... ... ...
816 0 204 7,6 4 408 5,4 8 612 3,2 0 816 1,0 ... ... ...
817 1 205 7,6 5 409 5,4 9 613 3,2 1 817 1,0 ... ... ...
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
827 11 215 7,6 3 419 5,4 7 623 3,2 11 827 1,0 ... ... ...
5.4.2 Main and Enhanced Service Data Error Detection and Correction Facilities
When Enhanced Services are enabled (see Figure D5.2), the processing steps for both Main
Service data and Enhanced Service data are somewhat different than for Main Service data
alone. The requirements for the RF transmission system with Enhanced data capability are
specified herein.
5.4.2.1 Enhanced Data Protection
Enhanced data is protected by a concatenated FEC comprised by an additional FEC (applied to
the Enhanced Service) and by the Main FEC. Enhanced channel error protection uses an
additional interleaver (Enhanced interleaver), an additional Reed Solomon encoding scheme
(Enhanced RS encoding), an additional convolutional interleaver (Enhanced convolutional
interleaver), and an additional 4-state convolutional encoder (Enhanced convolutional encoder).
The Enhanced data and Enhanced RS parity bytes are encapsulated within the transport level
payload portion of an MPEG-2 Packet. The Enhanced 4-state convolutional encoder is
concatenated and synchronized with the Main trellis 4-state encoding to produce an effective 16-
state trellis encoder for the Enhanced data. Enhanced FEC offers two encoding modes defined as
one-half rate and one-quarter rate E8-VSB modes. The following sections establish requirements
for each functional block of the Enhanced FEC shown in Figure D5. 2
The input to the preprocessor shall be 188-byte MPEG-2 packets.
The data intended to be encoded by the Main and Enhanced FEC is split into a maximum of
three parallel streams. Each stream is associated with a Main, an Enhanced one-half rate, or an
Enhanced one-quarter rate FEC mode status.
Appropriate buffering is inserted in both the Main and the Enhanced MPEG-2 streams, and
Enhanced data is sent through the E8-VSB Pre-Processor. Enhanced data shall be Reed Solomon
encoded, and each byte shall be expanded by the ratio of 1:2 (for one-half rate coding) or 1:4 (for
one-quarter rate coding) to form place-holder bits for the 4-state Enhanced convolutional coding.
The expanded bytes shall be formatted in MPEG-2 compliant packets, which shall be
multiplexed with packets from the Main MPEG-2 stream (Main Service).
67
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Normal 188-Byte
MPEG MUX
Packet Uncoded
Buffer MPEG
Data MPEG
Inserts Packets
188-Byte Enhanced Enhanced
Enhanced (RS Encoded)
Packets into
½ Rate MPEG MPEG Compatible
the MPEG M/E
Packet Packets
Multiplex at Byte
Buffer Uncoded E8-VSB Defined Flag
MPEG Pre-Processor Locations in
Enhanced Data Each VSB
¼ Rate MPEG 4/184
M/E Field
Packet
Buffer Byte
Flag
68
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
the buffer shall contain, consecutively, all of the one-half rate 164-byte packets, and the
second part of the buffer shall contain, consecutively, all of the one-fourth rate 164-byte
packets. The first packet processed by the RS encoder shall be the first one-half rate
packet (if one-half rate packets are present) incoming to the buffer multiplexer.
Each PCR [D1] shall be adjusted to accommodate the actual MPEG-2 Transport Stream
packet [D1] delivery time.
Additionally, the Enhanced multiplexer buffer shall carry an H/Q flag, which shall identify
the nature of each 164-byte packet. This H/Q flag shall have two states: “H” for one-half rate and
“Q” for one-fourth rate.
5.4.2.1.1.3 Enhanced RS
The first function after the segment multiplexer shall be an encoding of the 164-byte packets of
data to be sent via this enhanced means by an RS encoder with parameters t = 10 (184,164). The
Primitive Field Generating Polynomial shall be the same as that specified in Figure D5.6, which
is also used for the Main RS encoder. The RS encoder will pass through the H/Q flag generated
by the previous block and shall extend it to the parity bytes generated by the RS encoder.
5.4.2.1.1.4 Enhanced Data Interleaver
The interleaver is mainly intended to provide additional protection against burst noise and bursty
errors produced by channel degradations. The Enhanced RS encoder shall be followed by an
Enhanced data interleaver that shall perform a convolutional byte interleave with parameters
B=46, M=4, N=184. The Enhanced interleaver shall make multiple complete revolutions per
data field for all Enhanced data code rates. It shall start at the top (zero delay) row in the same
manner as the Main byte interleaver. The interleaver for all mix percentages of Enhanced data
shall introduce a constant delay to an MPEG-2 transport stream. The conceptual structure may
be seen in Figure D5.7. The data interleaver shall carry the H/Q flag generated during the
operation of buffer multiplexing of the 164-byte packets. The H/Q flag shall remain in time
synchronization with each byte it describes.
5.4.2.1.1.5 Enhanced Bytes Expansion and Addition of MPEG-2 Header
Each byte associated with the Enhanced one-half rate FEC mode is expanded into two bytes. The
expansion of the one-half rate byte is as specified below.
69
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Each byte associated with the Enhanced one-fourth rate FEC mode is expanded into four
bytes. The expansion of the one-fourth rate byte is as specified below.
The bits R7 through R0 of the original one-half rate and one-fourth rate bytes are
respectively mapped into two and four Enhanced bytes. The bits E7 through E0 are bit positions
in the subsequently expanded byte. The “X” in the expanded bytes denotes a placeholder and
may be coded by 0.
For the rate one-half, the expanded byte number “0” shall represent the LSB byte and the
byte number “1” shall represent the MSB byte. For the rate one-fourth, the expanded byte
number “0” shall represent the LSB byte of the 4 bytes word, and the expanded byte number “3”
shall represent the MSB byte. For both the one-half rate and one-fourth rate the original byte
replacement shall start with the MSB byte of respectively the two and four expanded bytes.
The expanded bytes are formatted into 184-byte segments. Each reconstructed 184-byte
segment packet may consist of both one-half rate and one-fourth rate expanded Enhanced bytes.
For the purpose for maintaining strict backward compatibility for existing receivers, 188-
byte packets that contain Enhanced encoded data are composed of a 0x47 Sync byte, followed by
3 bytes as defined by [D1] that contain the null packet designation (PID = 0x1FFF), and the 184-
byte enhanced segment
The encapsulated Enhanced RS encoding (184,164) in the E8-VSB Pre-Processor reduces the
effective payload of these packets by an additional 20 bytes to 164.
After the introduction of the header bytes, the H/Q flag carried through the previous blocks is
dropped. Another flag, called the M/E flag, is used and shall be in the Enhanced state when
Enhanced bytes are clocked out of the E8-VSB Pre-Processor and shall be in the Main state
when the 4-byte MPEG-2 header is clocked out of the E8-VSB Pre-Processor.
70
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
188-byte
MPEG 188-
Packets MPEG byte
Packet MPEG
164-byte Packets
Add
Converter Expand each 4-byte
Enhanced
M byte of the MPEG
Reed Enhanced 184 byte RS
188-byte U Solomon Data Headers
Block of To each
MPEG X (184, 164) Interleaver Robust Data 4/184
MPEG 184 byte
Packets H/Q Encoder H/Q into 2 or 4 RS
Packet Segment Byte bytes
164-byte Block M/E
Flag Flag Byte
Converter
Flag
71
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
2-bit 2-bit
nibbles nibbles
(x1, (x1',
Bytes x2) x2')
From To ATSC
12-way Enhanced Bytes
ATSC 12-way Data Byte
Symbol Symbol
Data Byte Symbol De- De-
Interleaver Processor
Interleaver M/E Interleaver interleaver
M/E
Byte Byte
flag flag
72
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
M/E
E D
M
M U X2'
+ X +
X2
X2 D
2 - bit
nibbles M/E M/E M/E
X1 E E E
M D + M D M X1'
M U M U U
X X M X
X1
73
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
5.5 Synchronization
This section defines synchronization for data segments and fields.
5.5.1 Data Segment Sync
The encoded trellis data shall be passed through a multiplexer that inserts the various
synchronization signals (Data Segment Sync and Data Field Sync).
A two-level (binary) 4-symbol Data Segment Sync shall be inserted into the 8-level digital
data stream at the beginning of each Data Segment. (The MPEG-2 sync byte shall be replaced by
Data Segment Sync.) The Data Segment Sync embedded in random data is illustrated in Figure
D5.14.
A complete segment shall consist of 832 symbols: 4 symbols for Data Segment Sync, and
828 data plus parity symbols. The Data Segment Sync is binary (2-level). The same sync pattern
occurs regularly at 77.3 μs intervals, and is the only signal repeating at this rate. Unlike the data,
the four symbols for Data Segment Sync are not Reed-Solomon or trellis encoded, nor are they
interleaved. The Data Segment Sync pattern shall be a ‘1001’ pattern, as shown in Figure D5.14.
Data Data
Segment Data + FEC Segment
+7
SYNC SYNC
+5
+3
+1
-1
-3
-5
-7
828 Symbols
Levels 4 207 Bytes 4
Before Symbols Symbols
Pilot
Data Segment
Addition 832 Symbols
(Pilot=1.25) 208 Bytes
74
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Precode *
832 Symbols
+7
+5
+3
+1
PN511 PN63 PN63 PN63 VSB Reserved **
-1 mode
-3
-5
-7 Sync
12
sym
63 63 63 24 bols
Levels 4 511 104
before sym sym sym sym
symbols symbols bols bols bols bols symbols
addition
of pilot
(Pilot=1.25)
* For trellis coded terrestrial 8 VSB, the last 12 symbols of the previous segments are duplicated in the last
12 reserved symbols of the field synch.
** For enhanced data transmission, the last 10 of the reserved symbols before the 12 precode symbols are
defined. The other 82 symbols may be defined for each enhancement, as needed.
‘0000 0001 0111 1111 1100 1010 1010 1110 0110 0110 1000 1000 1001 1110 0001 1101
0111 1101 0011 0101 0011 1011 0011 1010 0100 0101 1000 1111 0010 0001 0100 0111
1100 1111 0101 0001 0100 1100 0011 0001 0000 0100 0011 1111 0000 0101 0100 0000
1100 1111 1110 1110 1010 1001 0110 0110 0011 0111 0111 1011 0100 1010 0100 1110
0111 0001 0111 0100 0011 0100 1111 1011 0001 0101 1011 1100 1101 1010 1110 1101
1001 0110 1101 1100 1001 0010 1110 0011 1001 0111 1010 0011 0101 1000 0100 1101
1111 0001 0010 1011 1100 0110 0101 0000 1000 1100 0001 1110 1111 1101 0110 1010
1100 1001 1001 0001 1101 1100 0010 1101 0000 0110 1100 0000 1001 0000 0001 110’
‘1110 0100 1011 0111 0110 0110 1010 1111 1100 0001 0000 1100 0101 0011 1101 000’
The generators for the PN63 and PN511 sequences are shown in Figure D5.16.
75
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
X or
+
1 0 0 1 1 1 Output
Shift
63 PN Sequence, X6+X+1
Preload ‘100111’
X or X or X or X or X or
+ + + + +
0 1 0 0 0 0 0 0 0 Output
Shift
511 PN Sequence,
X9+X7+X6+X4+X3+X+1
Preload ‘010000000’
P A B C P A B C
where P is the even parity bit, the MSB of the byte, and A, B, C are the actual mode bits.
P A B C
0 0 0 0 Reserved
1 0 0 1 Reserved
1 0 1 0 Reserved
0 0 1 1 Reserved
1 1 0 0 16 VSB
0 1 0 1 8 VSB*
0 1 1 0 Reserved
1 1 1 1 Reserved
0 0 0 0 P A B C P A B C 1 1 1 1
76
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
5.5.2.4 Reserved
In the 8-VSB mode, 92 symbols of the last 104 symbols shall be reserved, and they shall be
followed by the 12 symbols defined below. To maintain a longer period with a flat spectrum, it is
recommended that these 92 symbols be filled with a continuation of the PN63 sequence when
only 8-VSB is present.
When one or more enhanced data transmission methods are used, the previously reserved
symbols (including the 12 precode symbols used for 8-VSB and E8-VSB) shall be numbered
from 1 to 104, in the order transmitted. The 12 precode symbols shall be preceded by 10 symbols
that shall be used to signal the presence of an enhancement or enhancements, as defined below.
The use of some or all of the remaining 82 symbols shall be defined by each enhancement.
5.5.2.5 Precode
In the 8 VSB mode, the last 12 symbols of the segment shall correspond to the last 12 symbols
of the previous segment. All sequences are pre-loaded before the beginning of the Data Field
Sync.
Like the Data Segment Sync, the Data Field Sync is not Reed-Solomon or trellis encoded,
nor is it interleaved.
5.5.2.6 Enhancement Signaling
Symbols 85 through 92 shall be used for indication of specific future enhancements. If no
enhancements are present, they may be set equal to –5 on odd data fields (positive PN63 in the
preceding structure). If one or more enhancements are present, all symbols except those
signaling one or more enhancements shall be set to ‘–5’ (positive PN63). The particular symbol
that shall be used to signal each enhancement by setting its value to ‘+5’ (positive PN63) is
specified hereinafter for each.
Symbols 83 and 84 are reserved for signaling future alternative definitions of symbols
numbered 85 through 92 and shall be set to ‘–5’ (positive PN63) unless such alternative
definitions are signaled as defined hereinafter.
On even data fields (negative PN63), the polarities of symbols 83 through 92 shall be
inverted from those in the odd data field.
77
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
each 9-bit address are found at the intersection of the three-bit column address and the 6-bit row
address.
Table D5.4a, “Rate Segments and Payload vs. Step Number – one-half Rate”, indicates the
number of data segments used by one-half rate data which shall be used for a given step number
(which is referenced from Table D5.3).
Table D5.4b, “Rate Segments and Payload vs. Step Number – one-fourth Rate”, indicates the
number of data segments used by one-fourth rate data which shall be used for a given step
number (which is referenced from Table D5.3).
Tables D5.4a and D5.4b also include informative columns. These columns show the
approximate percent of segments allocated for Enhanced data, the payload of the Main data for
the case when the Enhanced data consists of exclusively one-half rate or one-fourth rate data,
and the payload of the Enhanced one-fourth or one-half rate data (respectively) for a given step
number.
5.6.1 Enhanced Stream Rate Limits
Limits have been defined for the maximum bitrate in the Enhanced (E-VSB) service.
At all times, the quality of video in the main stream shall be equal to or better than the
equivalent program in the Enhanced stream. In the event that the programming is different in the
main and Enhanced streams, a higher resolution format is deemed to be higher quality.
Additionally, during premium programming times, the maximum bitrate in the enhanced (E-
VSB) stream shall be 3 Mbps out of the total 19.4 Mbps channel capacity.
Of the map addresses 0d to 511d (notated as IHGFEDCBA in binary in Table D5.3) the
decimal values of the only map addresses that shall be used when the 3 Mbps limit applies are:
0-70,72-77, 80-84, 88-90, 128-132, 136-139, 144-147, 152-154, 160-162, 168, 169, and 176.
78
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
79
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
101000 0 5 0 13 8 5 8 13 8 21 21 8 25 8 28 12
101001 1 5 1 13 9 5 9 13 9 20 20 9 24 9 31 0
101010 2 5 2 13 10 5 10 13 10 21 21 10 25 10 30 2
101011 3 5 3 13 11 5 11 13 11 20 20 11 24 11 30 5
101100 4 5 4 13 12 5 12 13 12 21 21 12 25 12 17 16
101101 5 5 5 13 13 5 13 13 13 20 20 13 24 13 18 17
101110 6 5 6 13 14 5 14 13 14 21 21 14 25 14 19 16
101111 7 5 7 13 15 5 15 13 15 20 20 15 24 15 20 17
110000 0 6 0 14 8 6 8 14 0 23 23 0 27 0 21 16
110001 1 6 1 14 9 6 9 14 1 22 22 1 26 1 22 17
110010 2 6 2 14 10 6 10 14 2 23 23 2 27 2 23 16
110011 3 6 3 14 11 6 11 14 3 22 22 3 26 3 24 17
110100 4 6 4 14 12 6 12 14 4 23 23 4 27 4 25 16
110101 5 6 5 14 13 6 13 14 5 22 22 5 26 5 26 16
110110 6 6 6 14 14 6 14 14 6 23 23 6 27 6 19 18
110111 7 6 7 14 15 6 15 14 7 22 22 7 26 7 20 19
111000 0 7 0 15 8 7 8 15 8 23 23 8 27 8 21 18
111001 1 7 1 15 9 7 9 15 9 22 22 9 26 9 22 19
111010 2 7 2 15 10 7 10 15 10 23 23 10 27 10 23 18
111011 3 7 3 15 11 7 11 15 11 22 22 11 26 11 24 19
111100 4 7 4 15 12 7 12 15 12 23 23 12 27 12 25 18
111101 5 7 5 15 13 7 13 15 13 22 22 13 26 13 21 20
111110 6 7 6 15 14 7 14 15 14 23 23 14 27 14 22 21
111111 7 7 7 15 15 7 15 15 15 22 22 15 26 15 23 20
80
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Table D5.4a Segments and Payload vs. Step Number- one-half Rate
Step Segments Percent of Segments Allocated for Main Data Rate (Mbps) ½ rate (Mbps)
# Used Enhanced (informative) (informative) (informative)
0 0 0 19.3927 0.0000
1 2 0.64 19.2683 0.0542
2 4 1.28 19.144 0.1084
3 6 1.92 19.0197 0.1627
4 8 2.56 18.8954 0.2169
5 12 3.85 18.6468 0.3253
6 16 5.13 18.3982 0.4338
7 20 6.41 18.1495 0.5422
8 24 7.69 17.9009 0.6507
9 28 8.97 17.6523 0.7591
10 32 10.26 17.4037 0.8675
11 40 12.82 16.9064 1.0844
12 48 15.38 16.4092 1.3013
13 56 17.95 15.9119 1.5182
14 64 20.51 15.4147 1.7351
15 72 23.08 14.9174 1.9520
16 80 25.64 14.4202 2.1688
17 88 28.21 13.9229 2.3857
18 96 30.77 13.4257 2.6026
19 112 35.9 12.4312 3.0364
20 128 41.03 11.4367 3.4702
21 144 46.15 10.4422 3.9039
22 160 51.28 9.4477 4.3377
23 176 56.41 8.4532 4.7715
24 192 61.54 7.4587 5.2052
25 208 67.95 6.2156 5.6390
26 224 73.08 5.2211 6.0728
27 240 76.92 4.4752 6.5065
28 256 82.05 3.4807 6.9403
29 272 87.18 2.4862 7.3741
30 288 92.31 1.4917 7.8078
31 312 100 0 8.4585
81
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Table D5.4b Segments and Payload vs. Step Number- one-fourth Rate
Step Segments Percent of Segments Allocated for Main Data Rate (Mbps) ¼ Rate (Mbps)
# Used Enhanced (informative) (informative) (informative)
0 0 0.00 19.3927 0.0000
1 4 1.28 19.1440 0.0542
2 8 2.56 18.8954 0.1084
3 12 3.85 18.6468 0.1627
4 16 5.13 18.3982 0.2169
5 20 6.41 18.1495 0.2711
6 24 7.69 17.9009 0.3253
7 28 8.97 17.6523 0.3795
8 32 10.26 17.4037 0.4338
9 36 11.54 17.1550 0.4880
10 40 12.82 16.9064 0.5422
11 44 14.10 16.6578 0.5964
12 52 16.67 16.1605 0.7049
13 60 19.23 15.6633 0.8133
14 68 21.79 15.1661 0.9218
15 76 24.36 14.6688 1.0302
16 84 26.92 14.1716 1.1386
17 92 29.49 13.6743 1.2471
18 100 32.05 13.1771 1.3555
19 116 37.18 12.1826 1.5724
20 132 42.31 11.1881 1.7893
21 148 47.44 10.1936 2.0062
22 164 52.56 9.1991 2.2231
23 180 57.69 8.2046 2.4400
24 196 62.82 7.2101 2.6568
25 212 67.95 6.2156 2.8737
26 228 73.08 5.2211 3.0906
27 244 78.21 4.2266 3.3075
28 260 83.33 3.2321 3.5244
29 276 88.46 2.2376 3.7413
30 292 93.59 1.2431 3.9581
31 312 100.00 0.0000 4.2292
82
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
• The map data shall be coded with a Kerdock (64, 12) code.
• The polarity of the Kerdock code word shall be inverted on even (negative PN63) data
fields.
• The twelve-bit payload shall contain the current map or the next map.
• In the odd (positive PN63) field, the leftmost 10 bits shall indicate the current map and
the rightmost 2 bits shall be the high order bits of the frame count.
• In the even (negative PN63) field the leftmost 10 bits shall indicate the next map and the
rightmost 2 bits shall be the low order bits of the frame count.
• Within each frame count bit pair, the higher order bit shall be leftmost.
5.7.1 Map to Frame Count to Frame Association
The map shall change at a maximum frequency of once per 16 data frames.
A data frame shall be defined as an odd (positive PN63) data field followed by an even
(negative PN63) data field.
The frame count shall be decremented linearly from 15 to 0 in the frames preceding a map
change.
Current map data shall point one frame ahead, i.e., to the second frame following a particular
transmission of the current map data.
Next map data shall point 16 frames ahead, i.e. to the 17th frame following a particular
transmission of the current map data.
An unchanging map shall be indicated by making the current map and next map equal and
holding the frame count at 15 (‘1111’).
An example of the sequence of map bits during a change is given in the Table D5.5. In this
example, the map is changed from map A, which has persisted for some time, to map B, which
then persists for some time into the future.
83
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
gα g0 g1 … g25 0 0 0 0 0
gα 0 g0 g1 … g25 0 0 0 0
gen_matrix = gα 0 0 g0 g1 … g25 0 0 0
gα 0 0 0 g0 g1 … g25 00
gα 0 0 0 0 g0 g1 … g25 0
gα 0 0 0 0 0 g0 g1 … g25
84
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
25
where g α =− ∑ gi mod 4 = 1.
i=0
85
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
The 10th bit of the 12-bit E-VSB map shall decide which packing structure is to be used in the
VSB frame. When the bit is set to ‘0’, the packing shall refer to and be per packing option 1
defined below. When the bit is set to ‘1’, the packing shall refer to and be per packing option 2
defined below.
5.8.1 Placement of Enhanced Mode Packets
Let 2P be the total number of Enhanced packets transmitted in a VSB frame, where 0<=P<= 156.
Let the index s of the Enhanced packets inserted in the normal VSB frame be an interger
between 0 and 311, Inclusive.
Option 1
The set EMAP of E8-VSB segment positions s shall be:
Where U is the union of sets, and {s | s =…criteria…} is the set of segment positions s that meet
the stated criteria.
Option 2
The index s shall be calculated by the following algorithm written in pseudo-code.
M=round(156/P);
for k=0:2P-1
s=k*M;
if (s>=312)
s=mod(s, 312)+1;
end if
end for
86
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
The function round() means “round up to the next integer value.” The function mod() represents
the operation modulo. For example, in case of P=6, the segment positions are given by s=(0, 26,
52, 78, 104, 130,156 , 182, 208, 234, 260, 286).
5.8.2 Packing of Enhanced Mode Data Within Packets
The Enhanced Reed-Solomon encoding block shall be 184 bytes long, of which 20 bytes are
parity. Refer to Table D5. 2.
For the case of a one-half rate Enhanced code, the Enhanced coder outputs 2 bits for each
input bit, and Enhanced mode data shall be packed as one Enhanced Reed-Solomon block to a
pair of data segments (1 bit per symbol).
For the case of a one-fourth rate Enhanced code, the Enhanced coder outputs 4 bits for each
input bit, and Enhanced mode data shall be packed as one Enhanced Reed-Solomon block for
every 4 data segments (one-half bit per symbol).
The packing of Enhanced mode Reed-Solomon blocks into data segments is shown in Table
D5.6.
Note: All below are shown conceptually pre-interleave; the interleaving process
will disperse the data in the transmitted output).
87
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
5.9 Modulation
R = .1152
1.0
.5
0
d d d = .31 MHz d d
5.38 MHz
6 MHz
Figure D5.17 Nominal VSB system channel response (linear phase raised cosine Nyquist filter).
6.1 Overview
The high data rate mode trades off transmission robustness (28.3 dB signal-to-noise threshold)
for payload data rate (38.57 Mbps). Most parts of the high data rate mode VSB system are
88
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
identical or similar to the terrestrial system. A pilot, Data Segment Sync, and Data Field Sync
are all used to provide enhanced operation. The pilot in the high data rate mode also is 11.3 dB
below the data signal power. The symbol, segment, and field signals and rates are all the same,
allowing either receiver to lock up on the other’s transmitted signal. Also, the data frame
definitions are identical. The primary difference is the number of transmitted levels (8 versus 16)
and the use of trellis coding and NTSC interference rejection filtering in the terrestrial system.
The RF spectrum of the high data rate modem transmitter looks identical to the terrestrial
system, as illustrated in Figure D5.4. Figure D6.1 illustrates a typical data segment, where the
number of data levels is seen to be 16 due to the doubled data rate. Each portion of 828 data
symbols represents 187 data bytes and 20 Reed-Solomon bytes followed by a second group of
187 data bytes and 20 Reed-Solomon bytes (before convolutional interleaving).
Data Data
segment Data + FEC
segment
sync sync
+15
+13
+11
+9
+7
+5
+3
+1
-1
-3
-5
-7
-9
-11
-13
-15
Figure D6.2 shows the block diagram of the transmitter. It is identical to the terrestrial VSB
system except the trellis coding shall be replaced with a mapper that converts data to multi-level
symbols. See Figure D6.3.
89
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
Reed- Data RF
Data Pilot VSB
Randomizer
Solomon Inter- Mapper MUX Insertion Modulator Up-
Encoder leaver Converter
Segment Sync
Field Sync
Xa Xb Xc Xd O
Byte to 1 1 1 1 +15
Symbol 1 1 1 0 +13
Conversion
1 1 0 1 +11
MSB 1 1 0 0 +9
7 Xa1 Xa
6 Xb1 1 0 1 1 +7
1st Nibble Xb 1 0 1 0 +5
5 Xc1
4 Xd1 1 0 0 1 +3
From 1 0 0 0 +1
3 Xa2 Xc To
Byte 0 1 1 1 -1
Interleaver 2 Xb2 MUX
2nd Nibble 0 1 1 0 -3
1 Xc2 Xd
0 1 0 1 -5
0 Xd2
LSB
0 1 0 0 -7
0 0 1 1 -9
0 0 1 0 -11
0 0 0 1 -13
0 0 0 0 -15
90
ATSC A/53E Digital Television Standard, Annex D 27 December 2005
6.3 Modulation
91
ATSC A/53E Digital Television Standard, Annex E 27 December 2005
Annex E:
Receiver Characteristics (Informative)
This annex has been withdrawn. Interested readers are referred to:
• ATSC Recommended Practice A/74, “Receiver Performance Guidelines, 18 June 2004.
• ATSC Technology Group Report T3-600, DTV Signal Reception and Processing
Considerations, 18 September 2003.
92
ATSC A/53E Digital Television Standard, Annex F 27 December 2005
Annex F:
Historical Background (Informative)
1. FOREWORD
This annex contains for historical reasons the background section that was originally published
in A/53 on 16 December 1995.
2. HISTORICAL BACKGROUND
The Advanced Television Systems Committee, chaired by James C. McKinney, was formed by
the member organizations of the Joint Committee on InterSociety Coordination (JCIC)12 for the
purpose of exploring the need for and, where appropriate, to coordinate development of the
documentation of Advanced Television Systems. Documentation is understood to include
voluntary technical standards, recommended practices, and engineering guidelines.
Proposed documentation may be developed by the ATSC, by member organizations of the
JCIC, or by existing standards committees. The ATSC was established recognizing that the
prompt, efficient and effective development of a coordinated set of national standards is essential
to the future development of domestic television services.
On June 5, 1992, ATSC provided information to the Federal Communications Commission
(FCC) outlining proposed industry actions to fully document the advanced television system
standard. The FCC has recognized the importance of prompt disclosure of the system technical
specifications to the mass production of advanced television system professional and consumer
equipment in a timely fashion. The FCC has further noted its appreciation of the diligence with
which the ATSC and the other groups participating in the standardization are pursuing these
matters.13
Supporting this activity, the ATSC Executive Committee requested that the T3/S1 Specialist
Group on Macro Systems Approach meet and suggest which portions of an advanced television
system broadcasting standard might require action by the FCC and which portions should be
voluntary.
Subsequently, T3/S1 held meetings and developed recommendations in two areas:
1) Principles upon which documentation of the advanced television system should be
based
2) A list of characteristics of an advanced television system that should be documented
The list tentatively identified the industry group(s) that would provide the documentation
information and the document where the information would likely appear.
12
The JCIC is presently composed of: the Consumer Electronics Association (CEA), the Institute
of Electrical and Electronics Engineers (IEEE), the National Association of Broadcasters
(NAB), the National Cable Television Association (NCTA), and the Society of Motion
Picture and Television Engineers (SMPTE).
13
FCC 92-438, MM Docket No. 87-268, “Memorandum Opinion and Order/Third Report and
Order/Third Further Notice of Proposed Rule Making,” Adopted: September 17, 1992, pp.
59–60.
93
ATSC A/53E Digital Television Standard, Annex F 27 December 2005
94
ATSC A/53E Digital Television Standard, Annex F 27 December 2005
95
ATSC A/53E Digital Television Standard, Annex F 27 December 2005
96
ATSC A/53E Digital Television Standard, Annex G 27 December 2005
Annex G:
High Efficiency Audio System Characteristics (Normative)
1. SCOPE
This annex describes the robust mode audio system characteristics and normative specifications
of the Digital Television Standard. Audio encoded per this annex may be transmitted over a TS-
E (see Annex C).
2. NORMATIVE REFERENCES
The following documents contain provisions which in whole or part, through reference in this
text, constitute provisions of this standard. At the time of publication, the editions indicated were
valid. All standards are subject to revision and amendment, and parties to agreement based on
this standard are encouraged to investigate the possibility of applying the most recent editions of
the documents listed below.
[G1] ATSC A/52B: “Digital Audio Compression (AC-3, E-AC-3) Standard,” Advanced
Television Systems Committee, Washington, D.C. 14 June 2005.
[G2] AES3-2003, “AES Recommended Practice for digital audio engineering “Serial
transmission format for two-channel linearly represented digital audio data (Revision of
AES3-1992)”
[G3] ANSI S1.4-1983 (R 2001) with Amd.S1.4A-1995, “Specification for Sound Level
Meters.”
3. COMPLIANCE NOTATION
As used in this document, “shall” or “will”, denotes a mandatory provision of the standard.
“Should” denotes a provision that is recommended but not mandatory. “May” denotes a feature
whose presence does not preclude compliance, and that may or may not be present at the option
of the implementer.
4. SYSTEM OVERVIEW
As illustrated in Figure G1, the audio subsystem comprises the audio encoding/decoding
function and resides between the audio inputs/outputs and the transport subsystem. The audio
encoder(s) is (are) responsible for generating the audio elementary stream(s) which are encoded
representations of the baseband audio input signals. At the receiver, the audio subsystem is
responsible for decoding the audio elementary stream(s) back into baseband audio.
97
ATSC A/53E Digital Television Standard, Annex G 27 December 2005
Audio
Elementary Transport VSB RF
Stream(s) Packets Transmission
Specified
in this
Channel
Annex
Receiver Receiver
Reconstructed Audio
Transport Transmission
Audio Decoder(s)
Subsystem Subsystem
5. SPECIFICATION
This Section forms the normative specification for the robust mode audio system that may be
transmitted as part of TS-E (see Annex C). The robust mode audio compression system
conforms to Annex E of the A/52B Digital Audio Compression (AC-3) Standard, subject to the
constraints outlined in this Section.
98
ATSC A/53E Digital Television Standard, Annex G 27 December 2005
99
ATSC A/53E Digital Television Standard, Annex G 27 December 2005
may have dynamic range control words inserted or modified without affecting the encoded
audio. When it is necessary to alter the dynamic range of audio programs that are broadcast, the
dynamic range control word should be used. In order to enable clean switching between main
and fallback audio services (that might have a different number of audio channels), linked audio
services shall have values of dynrng that result in matched audio levels when decoded by
compliant decoders.
100
ATSC A/53E Digital Television Standard, Annex G 27 December 2005
101
Doc. PS-661r4
18 April 2006
102
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
103
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
Annex C:
Service Multiplex and Transport Subsystem Characteristics
(Normative)
1. SCOPE
This Annex constitutes the normative specification for the transport subsystem of the Digital
Television Standard. The syntax and semantics of this specification conform to ISO/IEC 13818-
1 [C3], with additional constraints and conditions specified in this Standard. Within this context,
other ATSC Standards may further constrain and/or supplement the transport subsystem
specification.14
2. NORMATIVE REFERENCES
The following documents contain provisions that in whole or in part, through reference in this
text, constitute provisions of this Standard. At the time of publication, the editions indicated
were valid. For references not including a publication date, the most recent published version
shall apply. All standards are subject to revision and amendment, and parties to agreements
based on this Standard are encouraged to investigate the possibility of applying the most recent
editions of the documents listed below.
3. DEFINITION OF TERMS
PAT-E – A table with the same syntax as Program Association Table as defined by [C3]
transmitted using an enhanced VSB mode defined in Annex D.
14
Note that there is a coordinated effort underway among ATSC, CEA, and SMPTE to revise
and clarify standards related to delivering closed captions, AFD, and bar data so that each
describes the aspects of the system for which they are primarily responsible without overlap.
This effort is expected to result in revisions of those sections in the ATSC Standards.
104
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
PMT-E – A table with the same syntax as Program Map Table as defined by [C3] transmitted
using an enhanced VSB mode defined in Annex D.
PMT-E_PID – A PID that identifies the Transport Stream packets that carry
TS_program_map_section()s in a TS-E.
Program – Program shall mean the collection of all elements within the emission that have the
same program number, independent of the methods used to propagate the program elements.
Linked – Alternative elements of a Program are ‘Linked’ when they have identical values in the
linked_component_tag field of their Enhanced Signaling Descriptors.
TS-M – The portion of TS-R that contains only all Transport Stream packets transmitted by the
main mode (see Annex D) (see Figure C3).
TS-R – The recombined Transport Stream containing all Transport Stream packets delivered by
all transmission modes (main, one-half rate and one-quarter rate) (see Annex D) (see Figure
C3).
TS-E – The portion of TS-R that contains only all Transport Stream packets transmitted by one-
half rate and/or one-quarter rate modes (see Annex D) (see Figure C3).
TS-Ea – The portion of TS-E that contains only all Transport Stream packets transmitted by
one-half rate mode (see Annex D) (see Figure C3).
TS-Eb – The portion of TS-E that contains only all Transport Stream packets transmitted by
one-quarter rate mode (see Annex D) (see Figure C3).
4. COMPLIANCE NOTATION
As used in this document, “shall” denotes a mandatory provision of the standard. “Should”
denotes a provision that is recommended but not mandatory. “May” denotes a feature whose
presence does not preclude compliance that may or may not be present at the option of the
implementer. Note: this compliance notation applies explicitly to this annex.
5. SYSTEM OVERVIEW
The transport format and protocol for the Digital Television Standard is a compatible subset of
the MPEG-2 Systems specification defined in ISO/IEC 13818-1 [C3]. It is based on a fixed-
length packet transport stream approach that has been defined and optimized for digital
television delivery applications.
As illustrated in Figure C1, the transport subsystem resides between the application (e.g.,
audio or video) encoding and decoding functions and the transmission subsystem. The encoder’s
transport subsystem is responsible for formatting the coded elementary streams and multiplexing
the different components of the program for transmission. It also is responsible for delivering
packets intended for transmission using coding methods defined in Annex D. A receiver recovers
the elementary streams for the individual application decoders and for the corresponding error
signaling. The transport subsystem also incorporates other higher protocol layer functionality
related to properly timing the packets to enable receiver synchronization.
105
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
Transmitter
packetization and
multiplexing
Application
Transport
Encoders
* Modu-
* lator
*
* Transport
Stream
elementary
streams, private
sections, or PES
Transmission
Clock
Format
Receiver
depacketization and
demultiplexing
Presentation
Application
Transport
Decoders
Demod-
*
* ulator
* Transport
*
elementary Stream with
streams, private error
sections, or PES signaling
with error
signaling
clock
control
Clock
106
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
Receiver is defined such that it performs the functions described in the way that they are
described herein, which may or may not be the way in which actual receivers, built to this
Standard, are designed to operate. Some mandatory transmission requirements are stated herein
in terms of the resulting transmission stream elements that are required to be produced, through
use of the Reference Receiver functional definition.
6. SPECIFICATION
This section of the Standard describes the coding constraints that apply to the use of the MPEG-
2 systems specification [C3] in the digital television system including mandatory main and
optional enhanced services.
15
Section 7.2 also constrains video streams of stream_type 0x02 in TS-M.
107
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
16
Program is defined in Section 3.
17
This field is defined in A/65C [C2], Section 6.3.1 and Table 6.7.
18
The video in the TS-E may be a different aspect ratio, frame rate, and/or bitrate than that in
TS-M.
108
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
19
The ISO/IEC-designated registration authority for the format_identifier is SMPTE Registration
Authority, LLC. See (http://www.smpte-ra.org/mpegreg.html).
109
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
For each Program of service_type 0x03 delivered in TS-M, at least one audio component shall
be a complete main audio service (CM) as defined in Section 6 of Annex B.
For each Program of service_type 0x02 completely delivered in TS-E, if audio is present, at
least one audio component shall be a complete main audio service (CM) as defined in Section 6
of Annex B.
For each Program of service_type 0x03 completely delivered in TS-E, at least one audio
component shall be a complete main audio service (CM) as defined in Section 6 of Annex B.
Program elements within one Program with the same value of linked_component_tag (defined in
Section 6.8.5) shall be of the same audio type (for example CM, VI, or HI) and shall be one of
the services defined in section 6 of Annex B.
110
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
111
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
112
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
audio blocks) from each elementary stream and do the combining (mixing together) on a frame
or (block) basis. In order to have the audio from the two elementary streams reproduced in exact
sample synchronism, it is necessary for the original audio elementary stream encoders to have
encoded the two audio program elements frame synchronously; i.e., if audio program 1 has
sample 0 of frame n at time t0, then audio program 2 should also have frame n beginning with its
sample 0 at the identical time t0. If the encoding is done frame synchronously, then matching
audio frames should have identical values of PTS.
If PES packets from two AC-3 audio services that are to be decoded simultaneously contain
identical values of PTS then the corresponding encoded audio frames contained in the PES
packets should be presented to the audio decoder for simultaneous synchronous decoding. If the
PTS values do not match (indicating that the audio encoding was not frame synchronous) then the
audio frames that are closest in time may be presented to the audio decoder for simultaneous
decoding. In this case, the two services may be reproduced out of sync by as much as one-half of
a frame time (which is often satisfactory; e.g., a voice-over does not require precise timing).
The value of stream_id for AC-3 shall be ‘1011 1101’ (private_stream_1).
6.5.3 Audio PES Constraints for Enhanced AC-3 (stream_type = 0x87)
The value of stream_id for enhanced AC-3 shall be ‘1011 1101’ (private_stream_1).
If an audio service is delivered in TS-E, and is Linked to an audio service delivered in TS-M,
constraints are required on the content of the corresponding audio access units, and on the PTS
values in the corresponding PES packets, of the two Linked audio services in order to enable a
receiver to perform sample time synchronous switching between the two audio services (e.g.,
fallback audio).
The constraints are:
1) That the audio access units of AC-3 in TS-M, and the corresponding audio access
units of enhanced AC-3 in TS-E shall be constructed from sets of input audio samples
that are time synchronous; and
2) That the presentation times corresponding to the corresponding audio access units
shall be identical.
Note that not every audio access unit has an explicit value of PTS in a PES because there is
only one PTS per PES while there can be multiple audio access units per PES. However, there is
a unique presentation time for each audio access unit, and it is the presentation times that must
be identical to enable a receiver to switch between the two audio streams with exact sample
synchronism.
113
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
114
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
6.8 Descriptors
Unless explicitly stated to the contrary for a given descriptor, no more than one descriptor with a
given value of descriptor_tag shall appear in any descriptor loop.
6.8.1 AC-3 Audio Descriptor
When an Elementary Stream of stream_type 0x81 (AC-3 audio) or stream_type 0x87 (E-AC-3) is
present in the digital television Transport Stream, an AC-3 Audio Descriptor (AC-
3_audio_stream_descriptor()) shall be included in the descriptor loop immediately following the
ES_info_length field in the TS_program_map_section() describing that Elementary Stream. The syntax
shall be as given in Table A2 of Annex A of ATSC Standard A/52B [C1]. The following
constraints shall apply to the AC-3 Audio Descriptor:
1) The value of the descriptor_tag shall be 0x81.
2) The 6-bit field for Bit Rate Code shall have a value in the range ‘000000’ through
‘001111’ or ‘100000’ through ‘101111’; i.e., signaling a bit rate less than or equal to 448
kbps.
3) The num_channels field shall have a value in the range 1 to 13.
4) The langcod field is a reserved field. Audio language shall be indicated using an ISO-639
Language Descriptor (see Section 6.8.3).
5) The descriptor shall identify the type of the audio service in the bsmod field, which shall
be the same as the bsmod field in the elementary stream associated with this descriptor.
6) The descriptor may optionally carry a 3-byte language code that is represented per ISO-
639. If this language code is present in the AC-3_audio_stream_descriptor(), it shall match the
language code carried in the ISO_639_language_descriptor(), if present.
Effective 1 March 2008, audio language shall be indicated by including the optional ISO-639
Language bytes within the AC-3_audio_stream_descriptor(), at which point the use of the ISO-639
Language Descriptor to indicate language shall be optional, but recommended to support legacy
devices requiring the ISO-639_language_descriptor().
Informative note: Receiving devices are expected to use the bsmod (bit stream
mode) field in the AC-3_audio_stream_descriptor() to determine the type of each AC-3
or E-AC-3 audio stream rather than the audio_type field in the
ISO_639_language_descriptor().
115
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
The audio_type field in any ISO_639_language_descriptor() used in this Standard shall be set to
0x00 (meaning “undefined”).
An ISO_639_language_descriptor() may be present in the TS_program_map_section() in other
positions as well, for example to indicate the language or languages of a textual data service
program element.
6.8.4 ATSC Private Information Descriptor
The ATSC_private_information_descriptor() provides a method to carry and unambiguously label
private information. More than one ATSC_private_information_descriptor() may appear within a single
descriptor loop. Table C1 defines the bit-stream syntax of the ATSC_private_information_descriptor().
116
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
format_identifier – The format_identifier is a 32-bit field as defined in ISO/IEC 13818-1 [C3], Section
2.6.9 for the registration_descriptor(). Only format_identifier values registered and recognized by the
SMPTE Registration Authority, LLC shall be used (see http://www.smpte-
ra.org/mpegreg.html)20. Its use in this descriptor shall scope and identify only the private
information contained within this descriptor.
private_data_byte – The syntax and semantics of this field is defined by the assignee of the
format_identifier value.
20
SMPTE Registration Authority, LLC, 595 West Hartsdale Avenue, White Plains, NY 10607
USA
21
See Annex D for definitions.
117
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
• If the program element has the linkage_preference value of ‘11,’ then the Linked alternative
program element shall have the linkage_preference value of ‘10’ and the two shall have
identical values for program_number and shall have identical values for linked_component_tag.
If the Enhanced Signaling Descriptor is in both descriptor loops, the Enhanced Signaling
Descriptor in the descriptor loop immediately following the ES_info_length field shall take
precedence for the particular program element described therein.
An Enhanced Signaling Descriptor shall be present for each program element transmitted
using one-half rate or one-quarter rate coding in either descriptor loop in the
TS_program_map_section(), or both descriptor loops in the TS_program_map_section(), subject to the
above precedence rule. The bit stream syntax for the Enhanced Signaling Descriptor is shown in
Table C2.
descriptor_tag – This 8-bit unsigned integer shall have the value 0xB2, identifying this descriptor
as an enhanced_signaling_descriptor.
descriptor_length – This 8-bit unsigned integer that shall indicate the length (in bytes)
immediately following this field up to the end of this descriptor.
linkage_preference – This 2-bit bit field indicates whether or not the associated program element is
Linked to another program element in the transmission. If so, it identifies the broadcaster’s
preference for the program element associated with this descriptor. The linkage_preference field
shall be coded as defined in Table C3.
reserved – This 4 bit field shall contain all ‘1’s.
22
This descriptor may be extended in the future to provide additional capability by adding a byte
loop after the last defined byte. Proposals to do so are expected to be considered in the
future.
118
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
tx_method – This 2-bit field shall identify the VSB transmission method used to transmit the
associated program element.
linked_component_tag – A 4-bit unsigned integer that links the associated program element to an
equivalent, preferred, or less preferred alternative. The Linked program element is the
program element labeled with the same value of linked_component_tag in a
TS_program_map_section labeled with an equivalent value of program_number as the
TS_program_map_section that carries this descriptor. If the program element has the
linkage_preference value equal to ‘00’, then the program element is not Linked and the
linked_component_tag value shall be reserved. See Table C2.
119
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
Elementary Streams for which the transport_scrambling_control field does not exclusively have
the value of ‘00’ for the duration of the program must carry a CA_descriptor in accordance with
Section 2.6.16 of ISO/IEC 13818-1 [C3].
The implementation of a digital television delivery system that employs conditional access
will require the specification of additional data streams and system constraints.
120
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
IS-Ea IS-Ea’
Rate ½ Fixed
Rate ½ ½ Input
Delay Buffer
EVSB Exciter
Model
MPEG-2 IS-E IS-Eb (Annex D)
IS To be an IS-Eb’
Data Transport To be at Rate ¼ Fixed
Enhanced Rate ¼ ¼ Input
Sources Stream what Rate? Delay Buffer
Packet? Yes
Multiplexer
The above Head End Reference Model (Figure C2) models the data flow for Transport
Stream packets from the final MPEG-2 multiplexer through logic and buffers and into the three
exciter inputs (Annex D).
The decisions about whether a packet is enhanced or not, and, if enhanced, which of one-
quarter-rate or one-half-rate is to be used (as shown by diamond boxes in Figure C2), are made
121
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
based on the PID value in each Transport Stream Packet. Special rules for certain streams (like
PSIP-E data) may apply (see A/65C).
The three fixed delay buffers shown in Figure C2 are utilized to effect a delay on each stream
(IS-Ea, IS-Eb, IS-N) to compensate for the fixed variance of delivery time due to the additional
interleaving and error correction applied to streams in IS-E. The magnitude of the delay on each
stream (IS-Ea, IS-Eb, IS-N) that must be applied to the various streams varies by which MAP is
being utilized.
The stream labeled “IS” in Figure C2 may be a MPEG-2 Transport Stream, but need not be.
A Transport Stream has constraints and requirements on it as to timing that need not be met at
IS, but shall be met at TS-R in the Receiver Reference Model. It is, however, at least “similar” to
a MPEG-2 Transport Stream.
Emits 188-byte
packet
immediately when
available
The modeled Reference Receiver (as shown in Figure C3) has two 188-byte buffers that are
filled by rate-one-quarter and rate-one-half data as they are recovered from the rate-one-quarter
and rate-one-half E-VSB segments, respectively. The streams TS-Ea and TS-Eb contain the
series of bytes from the respective TS-E reassembly buffers for each packet after it is assembled.
At any time a buffer has an entire Transport Stream packet available for output (188 bytes), that
buffer is selected by the switches and output immediately. The Reference Receiver does not
reorder packets identified with a particular PID.
Note that due to the transmission system and the instantaneously modeled Reference
Receiver behavior, it is not possible for more than one data source (TS-Ea, TS-Eb, or TS-M) to
be emitting data at the same time. However, as it is possible for no data to be available at a given
122
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
packet time, the Reference Receiver selects a Null packet for insertion, so that the output is at a
constant packet rate23 to deliver 19.39… Mbps.
The emitted E-VSB stream shall be ordered and coded such that when processed by the
Reference Receiver, the output from the switch (TS-R) shall be an MPEG-2 Transport Stream
compliant with Sections 1 through 7 of this annex. The PCRs shall be adjusted as described in
Section 9.4.
TS-E is the portion of the E-VSB transmission that results in TS-E in the Reference Receiver
of Figure C3.
23
This models a system that produces a constant rate packet stream output, however this may not
be necessary in a real receiver.
123
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
124
ATSC A/53E Amendment No. 1 to A/53E 18 April 2006
overflowing after insertion of the E-VSB packets. Avoidance of this overflow condition may
require changes to the original allocation of packet rate to TS-M services or eschewing certain
combinations in the multiplex.
-- End of document --
125