Aes 10 2003
Aes 10 2003
Revision of AES10-1991
Published by
Audio Engineering Society, Inc.
Copyright ©2003 by the Audio Engineering Society
Abstract
This standard describes the data organization for a multichannel audio digital interface. It includes a bit-level
description, features in common with the AES3 two-channel format, and the data rates required for its
utilization. The specification provides for the serial digital transmission of 32, 56, or 64 channels of linearly
represented digital audio data at a common sampling frequency within the range 32 kHz to 96 kHz, having a
resolution of up to 24 bits per channel. The format makes possible the transmission and reception of the
complete 28-bit channel word (excluding preamble) as specified in AES3, providing for the validity, user,
channel status, and parity information allowable under that standard. The transmission format is of the
asynchronous simplex type and is specified for a single 75-Ω coaxial cable point-to-point interconnection or
the use of fibre-optic cables.
An AES standard implies a consensus of those directly and materially affected by its scope and provisions and
is intended as a guide to aid the manufacturer, the consumer, and the general public. The existence of an AES
standard does not in any respect preclude anyone, whether or not he or she has approved the document, from
manufacturing, marketing, purchasing, or using products, processes, or procedures not in agreement with the
standard. Prior to approval, all parties were provided opportunities to comment or object to any provision.
Attention is drawn to the possibility that some of the elements of this AES standard or information document
may be the subject of patent rights. AES shall not be held responsible for identifying any or all such patents.
Approval does not assume any liability to any patent owner, nor does it assume any obligation whatever to
parties adopting the standards document. Recipients of this document are invited to submit, with their
comments, notification of any relevant patent rights of which they are aware and to provide supporting
documentation. This document is subject to periodic review and users are cautioned to obtain the latest
printing.
The documentation produced by the group shall enter the public domain as soon as feasibility of the
interface is established.
The interface itself shall be simple to engineer and use.
The cost and simplicity of the interface shall be such that the benefits of its use shall be easily justifiable.
The interface shall not depend on the existence of hardware or software the rights to which are owned by
any one or more members of the group.
The following individuals have contributed to the document: P. Eastty, T. Fujisawa, C. Jenkins, A. Jubb, P.
Lidbetter, R. Salter, D. Ward, and J. Wilkinson.
R. A. Finger, chair
AES Standards Committee Working Group on Digital Input/Output Interfacing
1989 September
At the time of approval of this document for publication, the AES Standards Committee had the following
membership: Y. Abe, J. S. Brawley, R. C. Cabot, P. D’ Antonio, D. Eger (Chair), R. A. Finger, I. Joel,
W. Hogan, T. F. Holman, M. Klasco, D. L. Klepper, B. N. Locanthi, J. P. Nunn, D. Queen (Secretary),
T. Roseberry, W. T. Shelton, W. D. Storm, T. Telesky, H. Tendeloo, and F. E. Toole.
Foreword to second edition
This revision recognizes other uses to which the interface has been put, notably distributed routing and hence
the increase in channels to the maximum of 64 at 48 kHz, the introduction of 96-kHz sampling in digital audio
origination and the use of data transmission in the carrier system.
In this edition, figures 3 through 8 of AES10-1991 have been renumbered to figures 5 through 10. Clauses 2
though 6 of AES10-1991 have been renumbered to clauses 3 through 7, according to IEC guidelines.
This edition has been written by a writing group of SC-02-02. Contributors include R. Caine, C. Travis, R.
Silfvast, and others.
J. Dunn, chair
R. A. Finger, vice-chair
SC-02-02 Working Group on Digital Input/Output Interfacing
2002-09-23
NOTE: In AES standards documents, sentences containing the word “shall” are requirements for
compliance with the document. Sentences containing the verb “should” are strong suggestions
(recommendations). Sentences giving permission use the verb “may”. Sentences expressing a
possibility use the verb “can”.
0 Patents
The Audio Engineering Society draws attention to patents concerning the functionality described in annex B.
Those contemplating making use of this functionality should consider that any such use of sync symbols may
be subject to patents. Applicable patents include GB2276796, US5487067 and JP7015458.
The AES holds no position concerning the evidence, validity and scope of these patent rights.
Attention is drawn to the possibility that some of the elements of this AES standard may be the subject of
patent rights other than those identified herein. AES shall not be held responsible for identifying any or all such
patent rights.
1 Scope
This standard describes the data organization and electrical characteristics for a multichannel audio digital
interface (MADI). It includes a bit-level description, features in common with the two-channel format of the
AES3, AES Recommended Practice for Digital Audio Engineering — Serial Transmission Format for Linearly
Represented Digital Audio Data, and the data rates required for its utilization. The specification provides for
the serial digital transmission over coaxial or fibre-optic lines of 28, 56, or 64 channels of linearly represented
digital data at a common sampling frequency within the range of 32 kHz to 96 kHz having a resolution of up to
24 bits per channel. Only single-point to single-point interconnections from one transmitter to one receiver are
supported.
2 Normative references
AES3-1992 (r1997) AES Recommended Practice for Digital Audio Engineering -- Serial transmission format
for two-channel linearly represented digital audio data.
AES11-1997 AES Recommended practice for digital audio engineering – Synchronization of digital audio
equipment in studio operations.
ISO/IEC 9314-3 (1990-12) Information processing systems - Fibre Distributed Data Interface (FDDI) - Part 3:
Physical Layer Medium Dependent (PMD). Geneva CH: International Electrotechnical Commission.
IEC 60169-8 (1978-01) Radio-frequency connectors. Part 8: R.F. coaxial connectors with inner diameter of
outer conductor 6.5 mm (0.256 in) with bayonet lock - Characteristic impedance 50 ohms (Type BNC). Geneva
CH: International Electrotechnical Commission.
3 Definitions and abbreviations
3.1
audio sample data
audio signal that has been periodically sampled, quantized, and digitally represented in 2's complement form
3.2
channel
set of audio sample data related to one signal accompanied by other data bits transmitted in any one period of
the source sampling frequency
3.3
two-channel format
bit, block, and subframe structure (less preambles) of the AES3 serial transmission format for linearly
represented digital audio data
3.4
frame
sequence of 64 or less (typically 56 or 28) subframes designated using numbers 0 to 63, each carrying audio
sample and related data that are transmitted in one sample period, with the start of a frame beginning with the
first bit of subframe 0
3.5
link
connection between a single serial multichannel digital audio transmitter and a single multichannel digital
audio receiver
3.6
sync symbol
decoder synchronization symbol
3.7
MADI
multichannel audio digital interface
3.8
NRZI
Transmission code as described in ISO 9314. Note: the abbreviation NRZI has also been used for other related
encoding schemes.
Transmitter Receiver
Crystal Crystal
Synchronization Synchronization
regenerator regenerator
fs m
Master
synchronization
NOTE As sample rate changes non-return-to-zero inverted (NRZI) data rate stays constant;
transmitter and receiver are asynchronous. Sampling frequencies (fs) are 32 kHz to 96 kHz.
NOTE 2 The period of each pattern is shown for the 48 kHz sampling frequency. It can be longer for lower
frequencies and can vary with varispeed operation.
MADI subframe 0 1 2 3 4 54 55 0
Audio channel Ch 0 Ch 1 Ch 2 Ch 3 Ch 54 Ch 55 Ch 0
Sample number n n n n n n n+1
AES3 subframe A B A B A B A
20,8 µs
MADI subframe 0 1 2 3 4 52 53 54 55 0
Audio channel Ch 0 Ch 0 Ch 1 Ch 1 Ch 2 Ch 26 Ch 26 Ch 27 Ch 27 Ch 0
Sample number n n+1 n n+1 n n n+1 n n+1 n+2
AES3 subframe A B A B A A B A B A
20,8 µs
Figure 3 — 96 kHz with 28 channels working (legacy pattern, as formed by legacy encoder fed
by 28 single-channel double sampling-frequency mode signals per AES3)
MADI subframe 0 1 2 26 27 0 1 26 27 0
Audio channel Ch 0 Ch 1 Ch 2 Ch 26 Ch 27 Ch 0 Ch 1 Ch 26 Ch 27 Ch 0
Sample number n n n n n n+1 n+1 n+1 n+1 n+2
AES3 subframe A B A A B A B A B A
20,8 µs
Figure 4 — 96 kHz with 28 channels working (96 kHz frame pattern, using 10,4 µs framing, one
channel-zero flag per frame)
MSB
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
NOTE AES-10id recommends the use of the NotA/B flag in a multichannel function. It is possible
that a receiver can meet this realization, and should be able to default to alternate A/B operation if the
multichannel function is presented.
Each 4-bit word shall be encoded into a 5-bit word using the 4B5B coding scheme shown in table 4.
Each 5-bit encoded word shall be transmitted from the left, as defined in table 5.
This scheme enables a low direct-current (d.c.) bias to be maintained on the link. Although the link signal is
nearly d.c. free, the audio signal may contain d.c. Figure 6 shows the link transmission format for one channel.
Annex A illustrates the encoding process for a single-channel word.
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39
The default 4B5B sync symbol shall be 11000 10001. There are 32 synchronization symbols specified in
FDDI. Other symbols may be used in order to carry, for example, control data not associated with any audio
channel. Annex B outlines this function.
End of frame
Higher sampling frequencies may be accommodated (for example, 192 kHz) by using two or more channels
per audio sample on the link.
NOTE 2 The provision of 56 channels at 32 kHz ± 12,5 % results in a minimum used data rate of
50,176 Mbit/s.
5.2 Link transmission rate
The link transmission rate shall be 125 Mbit/s irrespective of the sampling frequency or number of active
channels. See annex A.
5.3 Data transfer rate
The data transfer rate shall be 100 Mbit/s. The difference between the data transfer rate and the link
transmission data rate is caused by the use of an encoding scheme. See 4.3.1.
6 Synchronization
This clause covers the sample synchronization of transmitters and receivers relative to a master synchronizing
signal. It does not apply in the case of a master-slave connection only.
6.1 Sampling frequency synchronizing signal (synchronization signal)
Each transmitter and receiver shall be provided with an independently distributed master synchronizing signal.
This signal shall be in accordance with AES11. Alternatively, a Society of Motion Picture and Television
A 96-kHz coded link shall be capable of being synchronized by either a 48-kHz or 96-kHz reference signal.
6.2 Sample timing
The link is not intended to carry sample timing information. The exact timing of connected equipment shall be
controlled by the independently distributed master synchronizing signal, not by the MADI.
6.3 Transmitted frame start time
In order to maintain constant latency, the frame start time output from a transmitter shall be within ± 5 % of a
sample period of the reference time defined by the transmitter's externally supplied master synchronizing
signal.
6.4 Received frame start time
A receiver shall be able to correctly interpret a signal of any phase relative to the sample period of the
externally supplied master synchronizing signal. Constant latency shall be maintained with a signal whose
frame start time is within ± 25 % of a sample period of the reference time defined by the receiver's externally
supplied master synchronizing signal.
7 Electrical characteristics
The transmission medium shall be either 75-Ω coaxial cable (see 7.1) or fibre-optic cable (see 7.2). For the
purposes of transmission characterization, the data input to the encoder shall be replaced with a pseudorandom
data generator having a sequence length of at least 216 – 1.
NOTE The random data are applied prior to the 4-bit to 5-bit encoder in order to represent accurately
those signals most likely to appear in normal transmission.
+5V
Transmitter
120R 120R
150n 68R
+ Output 1
10K
1N4148 1N4148
0V
150n 68R
Output 2
100R 100R 10K
1N4148 1N4148
0V
7.1.2 Receiver
7.1.2.1 Eye pattern
The eye pattern represented by the characteristics of figure 9 shows the range of signals at the input terminals
that should be decoded by a conformant receiver.
Vmax Vmin
tmin
tnom
Figure 9 — Eye pattern diagram for maximum and minimum input signals:
tnom = 8 ns; tmin = 6 ns; Vmax = 0,6 V; Vmin = 0,15 V.
7.1.3 Cable
The coaxial cable shall have a 75- Ω ± 2 Ω characteristic impedance.
7.1.4 Connectors
BNC connectors to IEC 60169-8 shall be used throughout.
68R 68R
SP9680
10n (7&8=OL) 1/16V +
1
2 5
Input +
3
75R 4 6
1N4148 1N4148 1/16V
10n
330R 330R 100R 100R
Receiver (s)
1N4148 1N4148
0V
7.1.6 Grounding
The coaxial cable shield shall be grounded at the transmitter. The coaxial cable shall be grounded to the
receiver chassis at radio frequencies above 30 MHz.
For the purpose of minimizing radio frequency emissions it is recommended that the connection be achieved by
direct bonding of the coaxial cable body to the equipment chassis. At the receiver this may be achieved by
capacitive bonding of the coaxial cable connector body to the receiver chassis. A suitable value of capacitor is
1000 pF. The capacitor should be a low-inductance type, having a sufficiently low impedance at all frequencies
from 30 MHz to 500 MHz. The lead bonding lengths shall be kept as small as practical. This method prevents
the possibility of audio frequency ground currents.
Note: Designers should note that specialised techniques, described in appropriate literature, are required in
order that the interface meet international regulations for electromagnetic compatibility (EMC). Bonding the
receiver coaxial outer to the enclosure at DC with a total 360 degree connection is preferred if other
considerations do not preclude it.
7.2 Fibre optic interfacing
7.2.1 Fibre type
A fibre interface should be used as specified according to ISO/IEC 9314-3. It should be a graded-index fibre
with a core diameter of 62,5 nm, nominal cladding diameter of 125 nm and a numerical aperture of 0,275, at a
wavelength of 1300 nm. This specification can provide a range of up to 2 km.
7.2.2 Connectors
The ST1 connector listed on the databases page of www.aes.org/standards/ should be used. It is designed to be
optically and mechanically compatible with the media interface connector (MIC) according to ISO/IEC 9314-3.
Annex A
(Informative)
0 1 2 3
Bit: 0123 4567 8901 2345 6789 0123 4567 8901
Data: 1100 1010 0101 1111 0000 1100 0011 0000
0 1 2 3
Bit: 01234 56789 01234 56789 01234 56789 01234 56789
4B5B code : 11010 10110 01011 11101 11110 11010 10101 11110
Transmission code: 01001 10010 00110 10100 10101 10110 01100 10101
NOTE 1: The tolerance of the link transmission rate of 125 Mbit/s as mentioned in 5.2 should be ±
100 ppm.
B.1 Introduction
This annex describes in outline a method of carrying control data in the MADI transport carrier independent of
any particular audio channel. The transport sync symbol words inserted between audio data words can carry
this control data by virtue of the fact that there are a number of forms of sync symbol, of which the default is
that used by MADI systems complying with AES10. Four-bit nibbles are associated with 16 of the sync symbol
forms, thus allowing data to be inserted in the available space. The default sync symbol described in 4.3.2 is
associated with the binary value 0000.
A legacy MADI stream at full occupancy and highest permitted varispeed rate uses 96,768 Mbit/s, and a 64-
channel 48 kHz stream uses 98,304 Mbit/s. Thus there will always be at least 1 Mbit/s for this data. This may
need to be reduced to ensure that bit-stream synchronization is maintained.
B.2.1 Ordering
Default sync symbol words shall be transmitted at least as often as required, in order to guarantee correct data-
recovery of the whole transport stream. Coded sync symbols shall be inserted as and when required, subject to
the needs of audio data and the provision above.
Informative references
References for the 4B5B scheme are on the databases page of www.aes.org/standards/.