0% found this document useful (0 votes)

312 views8 pages

Convention Paper 5553: Spectral Band Replication, A Novel Approach in Audio Coding

This convention paper presents Spectral Band Replication (SBR), a novel audio coding technology that improves compression efficiency. SBR reconstructs high frequency components on the receiver side, reducing the data needed to encode and transmit them. This allows higher audio quality at lower data rates. The paper provides background on SBR's development and describes how it works together with audio codecs like mp3 and MPEG AAC. SBR analyzes a decoded low frequency signal and generates the high frequencies, matching the masking threshold to improve quality at lower bitrates.

Uploaded by

Juan Andrés Mariscal Ramirez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

312 views8 pages

Convention Paper 5553: Spectral Band Replication, A Novel Approach in Audio Coding

Uploaded by

Juan Andrés Mariscal Ramirez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

_________________________________

Audio Engineering Society

Convention Paper 5553

Presented at the 112th Convention 2002 May 1013 Munich, Germany
This convention paper has been reproduced from the author's advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.

_________________________________
Spectral Band Replication, a novel approach in audio coding
Martin Dietz , Lars Liljeryd , Kristofer Kjrling and Oliver Kunz
1 2 Coding Technologies, Nuremberg,Germany Coding Technologies, Stockholm, Sweden Correspondence should be addressed to Martin Dietz ([email protected]) 1 2 2 1

ABSTRACT

Spectral Band Replication (SBR) is a novel technology which significantly improves the compression efficiency of perceptual audio codecs. SBR reconstructs the high frequency components of an audio signal on the receiver side. Thus, it takes the burden of encoding and transmitting high frequency components off the encoder, allowing for a much higher audio quality at low datarates. In December 2001, SBR has been chosen as Reference Model for the MPEG standardization process for bandwidth extension. The paper will highlight the underlying technical ideas and the achievable efficiency improvements. A second focus will be a description of current and future applications of SBR.

INTRODUCTION Back in 1997, Coding Technologies started research on a novel approach to improve the efficiency of digital audio coding algorithms. A team of Swedish researchers led by Lars Liljeryd had the idea to recreate (replicate) the high-frequencies of a decoded audio signal in an accurate way, mainly by the re-use of signal information obtained from the decoded base-band signal. Thus a new concept of redundancy coding in the frequency domain was developed. This concept led to what is today known as Spectral Band Replication technology (SBR). SBR is able to significantly improve the compression efficiency of audio codecs over a wide range of bitrates. As SBR is an enhancement technology, it always needs an audio codec to hook upon. Two obvious candidates were identified: mp3

[1,2,6], as the worlds most popular coding format, and MPEG AAC [3,7], so far the most powerful waveform audio coding algorithm. After a year of successful cooperation with the Fraunhofer Institute for Integrated Circuits (FhG-IIS), several mp3 and AAC experts from FhG joined Coding Technologies in autumn 2000 to work on the SBR enhanced audio coding systems. Since then, several applications have adopted SBR. mp3PRO is the result of combining mp3 with SBR in a back- and forwards compatible way. mp3PRO is available since June 2001 and has in the meantime gained significant market acceptance. In January 2001, AAC+SBR has been adopted as audio codec for the Digital Radio Mondiale System. In December 2001, SBR has been successfully submitted to MPEG and became reference model of the MPEG-4 version 3 audio standardization process. The following chapters will describe in more detail the

DIETZ ET AL.

SPECTRAL BAND REPLICATION

background of SBR, the technology behind SBR and applications for this novel technology.

BACKGROUND OF THE SBR DEVELOPMENT The use of means to recreate missing or reduced high-frequencies in audio is nothing new, and has been used more or less frequent over the years in the recording industry. One of the most famous products was the Aphex Aural Exciter invented by Kurt Knoppel in the early 70s. Although Knoppel was not the first (or last) in this field, similar technologies can be found in some audio editing software tools today as well as in some audio processing hardware units. The main aim with such technologies is to add some sparkle, warmth or high-end enrichment of the recorded vocals, speech, instruments, CD-master or broadcast feed, using typically nonlinearities or controlled forms of high-frequency distortion. However, such technologies are performance limited by the way the high-frequencies are recreated. This limits their scope of use to just touching-up existing programme material, and making them unable to replicate major frequency ranges of a high quality programme material. A related technology, Harmonizing, was first successfully pioneered by the Eventide Clockworks Harmonizer in the early 70s, where programme material can be transposed in a musically correct manner. This is quite opposite to the Bode Frequency Shifter developed for electronic music applications by Harald Bode where harmonic relations are non-musically shifted (scrambled). Finally, the Ring-Modulator, one of the oldest weird sound effects units, is sometimes used in electronic music. Spin-offs from Harmonizer-like algorithms are the popular pitch-shift and timestretch algorithms used in many audio workstation sound-editing software tools today. The above-mentioned technologies or methods are sufficient for their intended use, but unsuitable for high quality SBR-like HighFrequency-Reconstruction (HFR) methods as introduced by Coding Technologies for audio coding. Therefore the development of new (patented) and more suitable algorithms was a prerequisite for the successful deployment of SBR. Strict demand was set on basic algorithm performance due to the unusual need to transpose signal components from the available (coded) baseband, by at least a factor three, without creating annoying artefacts. Particular emphasis was put on recreating psychoacoustically accurate representations of the original high-band signal components without violating the well-known consonance-dissonance criteria. However compared to the original signal, the recreated high-band is not necessarily signal-coherent in a technical sense, but coherent in a mere psychoacoustic sense, a most important characteristic.

applied in order to achieve high quality sound across a large dynamic range. This also reveals the basic idea of perceptual audio coding: if it would be possible to shape the quantization noise in the frequency domain, compression of audio signals beyond the CD format must be achievable.

Figure 1: Spectrum and Masking Threshold Research started on how to calculate the masking threshold (psychoacoustic model) and how to process the audio signal in a way that only audible information resides in the signal. The ideal audio codec introduces quantization distortions that are exactly below the masking threshold (figure 2).

Figure 2: Ideal Perceptual Coding This research led to todays waveform based perceptual audio codecs, such as MPEG Layer 2, AC-3, mp3, Atrac, PAC and MPEG AAC. All these codecs are based on the same, well-known principle, as shown in figure 3. The audio signal is transformed into the frequency domain by means of a filterbank or transform on a block-by-block basis. The resulting short-time spectrum is quantized in a way that the masking threshold calculated by the psychoacoustic model is not violated. The quantized spectrum gets coded and packed into a bitstream. The decoder performs the reverse steps.
PCM audio data time/ frequency mapping quantizing psychoand acoustic coding model frame packing bitstream

WHY SBR IMPROVES AUDIO CODING Research on perceptual audio coders started about 20 years ago. Research on the human auditory system revealed that hearing is mainly based on a short-term spectral analysis of the audio signal. The so-called masking effect was observed: the human auditory system is not able to perceive distortions that are masked by a stronger signal in the spectral neighborhood. Thus, when looking at the short-term spectrum, the so-called masking threshold can be calculated. Distortions below this threshold are in the ideal case inaudible. Figure 1 illustrates in a very simplified way how the masking threshold can look like. Figure 1 also reveals that the distortion generated by the PCM format used on CDs is for all frequencies way below this threshold. However, since the PCM signal, representing the time domain, does not allow frequency dependent shaping of the quantization noise, a high SNR needs to be

Psychoacoustic model

Figure 3: Perceptual waveform encoder Although all the codecs mentioned follow the same principle, they still differ significantly in compression efficiency. The performance of such codecs depends mainly on two aspects: how precisely the masking threshold can be approached and how efficiently the quantized spectrum can be coded. AAC, for example, is, according to several tests, twice as efficient as Layer 2. Nevertheless, traditional waveform coding has its limits. Having reached a performance like AAC it gets very hard to further increase

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

DIETZ ET AL.

SPECTRAL BAND REPLICATION

compression. Figure 4 illustrates what happens if the bitrate is lowered significantly from the codecs reasonable operating point. The quantization noise significantly exceeds the masking threshold, thus generating audible artefacts.

bitstream and results in a correctly shaped high frequency part (figure 7).

Figure 6: High frequency generation based on the waveform coded low frequency part Figure 4: Waveform coding beyond its limits Two main methods have been used so far to overcome this problem in perceptual waveform codecs. The most important one is to limit the audio bandwidth of the signal in or prior to the coding process. As there is no high frequency energy to be coded, more information is available for the remainder of the spectrum, resulting in a clean, but hollow sounding signal (figure 5). The other method, called intensity stereo, can only be used for stereo signals. In intensity stereo, only one channel and a panning information is transmitted instead of a left and a right channel. However, this is only of rather limited use for increasing compression efficiency, as in many cases the stereo image of the audio signal gets destroyed. A third method used is insertion of artificial noise. This method, however, works only with a rather limited class of audio signals.

Figure 7: Spectrum after high frequency adjustment

Figure 5: Limiting the audio bandwidth At this point, the SBR technology comes on the scene. SBR deviates from the waveform coding principle towards a hybrid waveform/parametric method. It is based on the fact that in most cases there are large dependencies between the lower and higher frequency parts of an audio signal. Therefore, the high frequency part of an audio signal can be efficiently reconstructed from the low frequency part. Transmission of the high frequency part is therefore not necessary - only a small amount of SBR control data needs to be carried in the bitstream to guarantee an optimal reconstruction of the high frequencies. Figure 6 illustrates the first step in performing the SBR enhancement. The low frequency part is still coded by an ordinary waveform codec such as mp3 or AAC. The high frequency part, however, is generated by a high quality transposition algorithm. As can be seen in figure 6, the mere generation of high frequency content is not at all sufficient for accurate high frequency reconstruction, since the reconstructed part does not reflect the spectral envelope of the original. Therefore careful adjustment of the spectrum is essential for the performance of the system. The adjustment is controlled by the SBR information carried in the

To summarize, SBR enhanced codecs perform significantly better mainly because a) SBR allows the recreation of the high frequencies using only a very small amount of transmitted side information. The high frequencies, which normally consume a significant amount of bits, do not need to be waveform coded anymore, resulting in a significant coding gain b) the underlying waveform coder can run with a comparatively high SNR, as it is only responsible for the lower frequencies. It can even operate at the optimum sampling rate, which is usually different from the desired output sampling rate. SBR converts the waveform codec sampling rate into the desired output sampling rate. Obviously there are signals where the reconstruction method does not deliver the desired results, e.g. when there is little relationship between the low and high frequency part. Care has been taken to equip SBR with additional tools so that such situations can be handled very well without losing compression efficiency.

HOW SBR WORKS The SBR system is preferably used as a dual-rate system, with the underlying codec operating at half the original sampling-rate, while SBR operates at the original sampling rate. The following description will briefly explain the different parts in the encoder and decoder of the SBR system. The SBR encoder works in parallel with the underlying core codec, albeit at a higher sampling-rate. Although SBR is mainly a post process in the decoder, important parameters are extracted in the encoder in order to ensure the most accurate high frequency reconstruction in the decoder. The encoder process is depicted in figure 8. The encoder estimates the spectral envelope of the SBR range for a time and frequency range/resolution suitable for the current input signal segments characteristics. The spectral envelope

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

DIETZ ET AL.

SPECTRAL BAND REPLICATION

is estimated by a complex QMF analysis and subsequent energy calculation. The time and frequency resolutions of the spectral envelopes can be chosen with a high level of freedom, in order to ensure the best suited time frequency resolution for the given input segment. The envelope estimation needs to consider that a transient in the original, mainly situated in the high frequency region (for instance a high-hat), will be present to a minor extent in the SBR generated highband prior to envelope adjustment, since the highband in the decoder is based on the low band where the transient is much less pronounced compared to the highband. This aspect imposes different requirements for the time frequency resolution of the spectral envelope data, compared to ordinary spectral envelope estimation as used in other audio coding algorithms.
Control parameter extraction Envelope extraction Bitstream multiplex

Bitstream

Bitstream Demux

Core Codec

Bitstream Decoding

High Frequency Reconstruction

Bitstream

Additional High Frequency Components

Input Core Codec

Envelope Adjustment

Figure 8: Basic block-diagram of the SBR encoder.

Output

Apart from the spectral envelope, several additional parameters are extracted representing spectral characteristics of the input signal for different time and frequency regions. Since the encoder naturally has access to the original signal as well as information on how the SBRunit in the decoder will create the high-band, given the specific set of control parameters, it is possible for the system to handle situations where the lowband constitutes a strong harmonic series and the high band, to be recreated, mainly constitutes random signal components, as well as situations where strong tonal components are present in the original highband without counterparts in the lowband, upon which the highband region is based. Furthermore, the SBR encoder works in close relation to the underlying core codec to assess which frequency range should be covered by SBR at a given time. The SBR data is efficiently coded prior to transmission by exploiting entropy coding as well as channel dependencies of the control data, in the case of stereo signals. The control parameter extraction algorithms need to be carefully tuned to the underlying codec at a given bitrate and a given sampling rate. This is due to the fact that a lower bitrate, usually implies a larger SBR range compared to a high bitrate, and different sampling rates correspond to different time resolutions of the SBR frames. Different core codecs may also display different characteristics during bit constraints, i.e. more or less severe spectral band shutdowns etc. The decoder is constituted of several different parts, as outlined in figure 9. It comprises a bitstream decoding module, a high frequency reconstruction (HFR) module, an additional high frequency components module, and an envelope adjuster module. The system is based around a complex valued QMF filterbank [4]. In the bitstream extraction module the control data is read from the bitstream and decoded. The time frequency grid is obtained for the current frame, prior to reading the envelope data from the bitstream, and decoding of the same. If a stereo signal is processed, suitable stereo decoding is performed according to the control data. Appropriate CRC-checks and error resilience algorithms are also included.

Figure 9: Basic block-diagram of the SBR decoder. The underlying core decoder decodes the audio signal of the current frame (albeit at the lower sampling rate). The resulting frame of audio data is used for high frequency reconstruction by the HFR module. A possible output from the core decoder is displayed in figure 10. Here it is evident that the output is of low-pass character with a bandwidth of approx. 6kHz.

20 ] 0 B d [ l e v e -20 L

-40

-60 0

10 15 Frequency [kHz]

Figure 10: Spectrum of the output signal from the core decoder, for an input signal coded with AAC+SBR at 24kbps. The decoded low-band signal is analyzed using a QMF filterbank. The high frequency reconstruction and envelope adjustment is subsequently performed on the subband samples of the QMF filterbank. The high frequencies are reconstructed from the low-band in a flexible way, based on the given control parameters. Furthermore, the reconstructed highband is adaptively filtered on a subband channel basis according to the control data to ensure the appropriate spectral characteristics of the given time/frequency region. The HFR can be done in several ways dependent on the characteristics of the signal being processed. The output from such a transposition based high frequency reconstruction is depicted in figure 11. Here it is evident that the frequency range of the signal has been extended to approx. 15kHz. There are no algorithmic

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

DIETZ ET AL.

SPECTRAL BAND REPLICATION

constraints preventing the HFR module to extend the frequency range even higher. However, for this example 15kHz was deemed sufficient.

20 ] B 0 d [ l e v e -20 L

20 ] 0 B d [ l e v e L -20

-40

-60 0

10 15 Frequency [kHz]

-60 0

Figure 13: Spectrum of the original signal.

5 10 15 Frequency [kHz] 20

Figure 11: Spectrum of the output signal from a transposition based SBR module, given the input signal displayed in figure 11 The reconstructed highband is subsequently envelope adjusted according to the time/frequency resolution obtained from the encoder. The output signal is enhanced with additional signal components in order to ensure the correct spectral characteristics of the final output. The low band decoded by the underlying decoder is added to the reconstructed and envelope adjusted highband, giving the output signal. Since the lowband has a sampling rate of half of the highband it is up-sampled during the course of the SBR process. The final output, after envelope adjustment, is depicted in figure 12. It is evident that the SBR module successfully has recreated a high band very similar to that of the original, fig 13.

SBR APPLICATIONS SBR enhanced audio codecs are already in use in several applications, some of which are listed hereafter. mp3PRO In January 2001, Coding Technologies and its licensing partner Thomson Multimedia presented mp3PRO, the combination of mp3 and SBR, for the first time. In June 2001, a free demo software has been made available. Since then several software applications integrating mp3PRO have been released (e.g. from Ahead, Steinberg, Magix, .). First hardware products are expected in Q2/2002. mp3PRO offers significantly improved performance compared to mp3 and outperforms competitive codecs according to independent tests. mp3PRO is back- and forward compatible with mp3. Not only will any mp3PRO decoder decode mp3 content, all mp3 players will be able to decode mp3PRO bitstreams, although without the quality improvement achieved through SBR. Figure 14 shows the integration of mp3 and SBR . Basically, SBR can be seen as pre/postprocessing around the existing mp3 modules. Even the bitstream format remains unchanged, as the SBR data can be carried in the ancillary data field. More detailed information about mp3PRO can be found in [4]. Digital Radio Mondiale (DRM) The DRM consortium has been founded in 1998 with the goal to standardize a new digital system for medium, long- and shortwave transmissions. The system should deliver significantly better audio and reception quality than ordinary analog AM. At the same time the existing AM channel spacing should be used. In addition, reuse of existing transmitter equipment should be possible to guarantee a cost efficient introduction of the new system.

20 ] 0 B d [ l e v e -20 L

-40

-60 0

10 15 Frequency [kHz]

Figure 12: Spectrum of the output signal after envelope adjustment, coded with AAC+SBR at 24kbps.

ancillary data audio input SBR encoder filtered audio data mp3 PRO

ancillary data audio mp3 decoder data SBR decoder audio output

mp3 encoder

bit bit de mux bitstream mux

Figure 14: mp3PRO block diagram

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

Depending on the wavelength and the region, the channel spacing in the AM bands is either 9 kHz or 10 kHz. The propagation conditions, which depend very much on the wavelength, determine the spectral efficiency achievable. Extensive tests have resulted in an efficiency in the range of 1.5 to 3 bit/Hz. As a result, the bitrate available in one transmission channel is in the range of 1330 kbit/sec. Normal operation will use bitrates between 20 and 25 kbit/sec. Delivering high quality audio at such a low bitrate asks for the most powerful source coding scheme available. Consequently, AAC+SBR, the combination of MPEG-AAC and SBR has been chosen as audio coding method. Tests done within the DRM consortium clearly show the superiority of this coding system compared to previous state of the art. More details about DRM and the listening test can be found in [5,8,9] and at the DRM web site www.drm.org. DRM finalised the specification of the system in January 2001. It is recommended by the ITU [11] and standardised by ETSI [10]. First regular transmissions and receiver products are expected in 2003. MPEG Because of its outstanding compression performance, the combination of AAC and SBR is of high interest not only for broadcasting, but also for audio and audio/video applications like streaming over the internet, streaming and delivery over mobile networks and storage in portable devices. For a lot of these applications, open standards like MPEG play an important role. They usually provide state-of-the-art algorithms and guarantee interoperability and accessibility of the technology. As a consequence, MPEG issued a call for proposals in January 2001, asking for technologies that could further enhance the compression efficiency of the MPEG-4 audio coding algorithms (in particular AAC). SBR was submitted as a proposal; a competitive test was performed by MPEG. AAC+SBR did not only show an excellent performance, but already met the acceptance criteria set up for final acceptance of the technology in the standard. As a result, SBR became Reference Model 0 for version 3 of the MPEG-4 standard. Figure 15 shows the result of the test at 24 kbps mono at the T-Nova test site. The test was done with experienced listeners and the well proven MUSHRA test method. As test items, the usual MPEG test items were used (e.g. harpsichord, glockenspiel, pitchpipe, male german speech, castagnettes, a.s.o). Till today those items are known to be very critical for audio coding. AAC+SBR shows a vast improvement compared to AAC, even when compared to the higher bitrate AAC (30 kbps). Moreover AAC+SBR shows a very good absolute performance at this very low bitrate.
10 0 90 80 70 60 50 40 30 20 10 0 AAC 2 4 AAC 3 0 AAC+ S BR 2 4

Sc02 Sc03 Si01 Si02 Si03 Sm01 Sm02 Sm03

Meistersinger Spot Harpsichord Castagnettes Pitchpipe Bagpipe Glockenspiel Plucked Strings Table 1: The MPEG excerpts

The stereo test results are shown in figure 16 and Figure 18. Again, AAC+SBR shows a significant gain over AAC and a very good absolute grading of more than 85 at a bitrate of 48 kbit/sec. For both tests it should be noted that the comparison was done against MPEG4 AAC. A comparison to the yet much more popular MPEG-2 AAC, which is the AAC version available on the market today, would show an even higher improvement through SBR (which operates based on MPEG-2 AAC). Since AAC was so far deemed to be state-of-the-art audio compression, it can be concluded from the this test that AAC+SBR is the new performance leader in audio coding.
10 0 90 80 70 60 50 40 30 20 10 0 AAC 4 8 AAC 6 0 AAC+ S BR 4 8

Figure 16: MPEG test result at 48 kbps stereo (test site: T-Nova) The MPEG-4 version 3 standard will reach Committee Draft (CD) status in May 2002 and the Final Draft International Standard (FDIS) status in March 2003. The improved audio coding standard is expected to be adopted for several of the above mentioned applications. CONCLUSIONS Spectral Band Replication is a novel technology that combines traditional audio coding with the capabilities of high quality high frequency reconstruction methods. Through the use of SBR in mp3PRO, the compression efficiency of mp3 could be significantly improved while remaining compatible with the widespread mp3 format. mp3PRO will succeed mp3 in all application areas. The combination of AAC and SBR offers the most powerful audio compression available today. As such it is most suited for digital broadcasting and streaming/delivery over networks with limited resources. AAC+SBR is already used in the market place and is the Reference Model in the MPEG-4 standardization. Both mp3PRO and AAC+SBR are available today in soft- and hardware, several applications have already been launched. It is expected that the number of applications using SBR will increase significantly in the near future. REFERENCES [1] ISO/IEC JTC1/SC29/WG11 MPEG, International Standard ISO/IEC IS 11172-3 Information technology Coding of moving pictures and associated audio for digital storage media up to about 1,5 Mbit/s, Part 3: Audio, 1991.

Figure 15: MPEG test result at 24 kbps mono (test site: T-Nova) More detailed results are given in figure 17. Table 1 explains the acronyms for the excerpts. Es01 es02 es03 sc01 Suzanne Vega Male German Speech Female English Speech Haydn Trumpet Concert

DIETZ ET AL.

SPECTRAL BAND REPLICATION

[2]

ISO/IEC JTC1/SC29/WG11 MPEG, International Standard ISO/IEC IS 13818-3 Information technology Generic Coding of Moving Pictures and Associated Audio:, Part 3: Audio, 1994. ISO/IEC JTC1/SC29/WG11 MPEG, International Standard ISO/IEC IS 13818-7 Information technology Generic Coding of Moving Pictures and Associated Audio:, Part 7: Advanced Audio Coding (AAC), 1997. Per Ekstrand, Andreas Ehret, Manfred Lutzky and Thomas Ziegler. Enhancing mp3 with SBR: Features and Capabilities of the new mp3PRO Algorithm. In 112th AES Convention, Munich, May 10-May 13, 2002. Stefan Meltzner, Fredrik Henn and Reinhold Bhm. SBRenhanced audio codecs for digital broadcasting such as Digital Radio Mondial (DRM). In 112th AES Convention, Munich, May 10-May 13, 2002. K. Brandenburg, G. Stoll, Y.F. Dehery, J.D. Johnston, L.v.d. Kerkhof, E.F. Schrder: "The ISO/MPEG-Audio Codec: A Generic Standard for Coding of High Quality Digital Audio", JAES 42, 780-792 Bosi, Brandenburg, Quackenbush, Fielder, Akagiri, Fuchs, Dietz, Herre, Davidson, Oikawa, ISO/IEC MPEG-2 Advanced Audio Coding, 101st AES Convention, Los Angeles, November 1996 DRM Source Coding Group, 2001: Report on Subjective Listening Tests of SBR-LC, an AAC-based Audio Bandwidth Widening Tool Jonathan Stott, 2001. DRM Key Technical Features, Reprint from EBU Technical Review No. 286 (March 2001) ETSI TS 101980 v1.1.1 (2001-09), Digital Radio Mondiale (DRM); System Specification Recommendation ITU-R BS.1348-1 (02/01): Service requirements for digital sound broadcasting at frequencies below 30 MHz

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

DIETZ ET AL.

SPECTRAL BAND REPLICATION

Figure 17: MPEG test results per item, 24 kbps mono (test site: T-Nova)

Figure 18: MPEG test results per item, 48 kbps (test site: T-Nova)

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

Advanced Audio Coding (Aac)
100% (1)
Advanced Audio Coding (Aac)
33 pages
2022 - (Springer Topics in Signal Processing, 21) Aurelio Uncini - Digital Audio Processing Fundamentals-Springer (2023)
No ratings yet
2022 - (Springer Topics in Signal Processing, 21) Aurelio Uncini - Digital Audio Processing Fundamentals-Springer (2023)
726 pages
Speech Compression Techniques - Formant and CELP Vocoders
No ratings yet
Speech Compression Techniques - Formant and CELP Vocoders
41 pages
Dolby Surround Pro Logic Operation
100% (1)
Dolby Surround Pro Logic Operation
7 pages
Audio Coding: Basics and State of The Art
No ratings yet
Audio Coding: Basics and State of The Art
6 pages
HE-AAC v2 PDF
No ratings yet
HE-AAC v2 PDF
12 pages
Sanyo LCD-32XH7 Noblex 32LC820H Philco PLH3210 JVC LT32R50 Chasis UE6L
100% (4)
Sanyo LCD-32XH7 Noblex 32LC820H Philco PLH3210 JVC LT32R50 Chasis UE6L
50 pages
Audio Coding: Basics and State of The Art
No ratings yet
Audio Coding: Basics and State of The Art
6 pages
Integrating Analytical Elements Through Transpositional Combination in Two ... Crumb
100% (2)
Integrating Analytical Elements Through Transpositional Combination in Two ... Crumb
93 pages
2) .I) Identify The Importance of Resolution, Size and Compression of Images. Ii) Classify File Formats of Various Multimedia Files. I
100% (1)
2) .I) Identify The Importance of Resolution, Size and Compression of Images. Ii) Classify File Formats of Various Multimedia Files. I
3 pages
Regular Pulse Excitation
No ratings yet
Regular Pulse Excitation
10 pages
Blofeld Modifiers
100% (2)
Blofeld Modifiers
34 pages
Color Mixing: By: Beyene Dumecha
No ratings yet
Color Mixing: By: Beyene Dumecha
31 pages
DTMF Detection (10011P0405)
No ratings yet
DTMF Detection (10011P0405)
10 pages
Intermediate Frequency Filter
100% (1)
Intermediate Frequency Filter
9 pages
Compressed Sensing Based Synthetic Transmit Aperture in Ultrasound Imaging
No ratings yet
Compressed Sensing Based Synthetic Transmit Aperture in Ultrasound Imaging
29 pages
Physics of Sound or Basic Theory of Sound: Octave
100% (1)
Physics of Sound or Basic Theory of Sound: Octave
8 pages
Shruti-1 User Manual: Mutable Instruments
No ratings yet
Shruti-1 User Manual: Mutable Instruments
18 pages
Soundprobe2 Youcandoit
No ratings yet
Soundprobe2 Youcandoit
4 pages
Audio Compression Using Wavelet Techniques: Project Report
No ratings yet
Audio Compression Using Wavelet Techniques: Project Report
41 pages
Introduction To Sigma Delta Converters
100% (1)
Introduction To Sigma Delta Converters
72 pages
File Size, File Compression, and File Types of Digital Audio
No ratings yet
File Size, File Compression, and File Types of Digital Audio
38 pages
Small Signal Amplifier - BJT-1-25
No ratings yet
Small Signal Amplifier - BJT-1-25
25 pages
B.E Semester: 6 - IT (GTU) : 2161603 - Data Compression and Data Retrieval
No ratings yet
B.E Semester: 6 - IT (GTU) : 2161603 - Data Compression and Data Retrieval
17 pages
EE6425 Class Project: LPC 10 Speech Analysis and Synthesis Model
No ratings yet
EE6425 Class Project: LPC 10 Speech Analysis and Synthesis Model
23 pages
1.4 Exam Questions
No ratings yet
1.4 Exam Questions
3 pages
MP3 Format: Theory of The Standard
No ratings yet
MP3 Format: Theory of The Standard
15 pages
Vibrato Extraction and Parameterization PDF
No ratings yet
Vibrato Extraction and Parameterization PDF
5 pages
Data Compression
No ratings yet
Data Compression
29 pages
Bandwidth (Signal Processing)
No ratings yet
Bandwidth (Signal Processing)
4 pages
Image Compression: CS474/674 - Prof. Bebis
100% (1)
Image Compression: CS474/674 - Prof. Bebis
110 pages
Digital Audio Primer
No ratings yet
Digital Audio Primer
90 pages
Basics of Digital Communication
100% (1)
Basics of Digital Communication
41 pages
Audio Compression Standards
0% (1)
Audio Compression Standards
3 pages
openSMILE Book Latest PDF
No ratings yet
openSMILE Book Latest PDF
65 pages
TRUMPET 6-MPPA - Dec2016 - Wold20161011
100% (1)
TRUMPET 6-MPPA - Dec2016 - Wold20161011
13 pages
Unit 25 Sound Recording
No ratings yet
Unit 25 Sound Recording
21 pages
Mpeg-2 Basics
100% (1)
Mpeg-2 Basics
17 pages
OM - Chroma
No ratings yet
OM - Chroma
18 pages
Simplified Analogue Realization of The Digital Direct Synthesis (DDS) Technique For Signal Generation
No ratings yet
Simplified Analogue Realization of The Digital Direct Synthesis (DDS) Technique For Signal Generation
5 pages
FraunhoferIIS Application Bulletin AAC Transport Formats
No ratings yet
FraunhoferIIS Application Bulletin AAC Transport Formats
18 pages
Baseband Audio
100% (1)
Baseband Audio
48 pages
Predictive Coding I
No ratings yet
Predictive Coding I
14 pages
Waldorf Microwave Masterclass PDF
No ratings yet
Waldorf Microwave Masterclass PDF
10 pages
Texture and Entropic Processes in Electroacoustic Music
No ratings yet
Texture and Entropic Processes in Electroacoustic Music
14 pages
Making Programming Languages To Dance To: Live Coding With Tidal
100% (1)
Making Programming Languages To Dance To: Live Coding With Tidal
7 pages
Uses of Computer Networks
100% (4)
Uses of Computer Networks
5 pages
SDII Specification
No ratings yet
SDII Specification
8 pages
ARKEL Vision Setup and User Manual V14
100% (1)
ARKEL Vision Setup and User Manual V14
18 pages
A Filter Is A System
No ratings yet
A Filter Is A System
10 pages
Data Compression
No ratings yet
Data Compression
46 pages
Dialnorm - Broadcaster Compliance With The Calm Act
No ratings yet
Dialnorm - Broadcaster Compliance With The Calm Act
3 pages
Alias-Free Digital Synthesis of Classic Analog Waveforms
No ratings yet
Alias-Free Digital Synthesis of Classic Analog Waveforms
12 pages
Digital Signal Synthesis DDS - Tutorial - Rev12!2!99
No ratings yet
Digital Signal Synthesis DDS - Tutorial - Rev12!2!99
122 pages
A Paradigm For Physical Interaction With Sound in 3-D Audio Space
No ratings yet
A Paradigm For Physical Interaction With Sound in 3-D Audio Space
9 pages
Network Topology Note
No ratings yet
Network Topology Note
12 pages
Notch Filter Digital
No ratings yet
Notch Filter Digital
43 pages
Data Communication and Network Chapter 1
No ratings yet
Data Communication and Network Chapter 1
7 pages
Stimming: Production Tips
No ratings yet
Stimming: Production Tips
25 pages
09-Smartax Ma5620 and Ma5626-En
No ratings yet
09-Smartax Ma5620 and Ma5626-En
4 pages
Introduction To Wavelet
No ratings yet
Introduction To Wavelet
44 pages
Making Music With MATLAB
No ratings yet
Making Music With MATLAB
4 pages
VSX-RS320 - DWXCN.book 1 ページ２０１０年４月２日金曜日午後５時４６分
No ratings yet
VSX-RS320 - DWXCN.book 1 ページ２０１０年４月２日金曜日午後５時４６分
44 pages
Itc - Mpeg Case Study
No ratings yet
Itc - Mpeg Case Study
12 pages
Stilson-Smith - Alias-Free Digital Synthesis of Classic Analog Waveforms (BLIT)
No ratings yet
Stilson-Smith - Alias-Free Digital Synthesis of Classic Analog Waveforms (BLIT)
12 pages
Analysis of Energy Efficient Routing Protocols in Wireless Sensor Networks PDF
No ratings yet
Analysis of Energy Efficient Routing Protocols in Wireless Sensor Networks PDF
5 pages
Levels in Digital Audio
No ratings yet
Levels in Digital Audio
8 pages
MT5335PG MT5335PG MT5335PG MT5335PG (K) (K) (K) (K) U U U U
No ratings yet
MT5335PG MT5335PG MT5335PG MT5335PG (K) (K) (K) (K) U U U U
26 pages
HP Pro x2 612 G1 - Data Sheet
No ratings yet
HP Pro x2 612 G1 - Data Sheet
4 pages
Quiz in Data Communications and Networking ECE Board Exam
No ratings yet
Quiz in Data Communications and Networking ECE Board Exam
4 pages
Internet Terminologies
No ratings yet
Internet Terminologies
7 pages
Sony Mini Hi Fi Mhc-gzr77d
No ratings yet
Sony Mini Hi Fi Mhc-gzr77d
151 pages
Sony Sa-Ve505
No ratings yet
Sony Sa-Ve505
28 pages
Free Easy Burner
No ratings yet
Free Easy Burner
65 pages
Data Communication: Data Communications and Networking, 4rd Edition, Behrouz A. Forouzan
No ratings yet
Data Communication: Data Communications and Networking, 4rd Edition, Behrouz A. Forouzan
59 pages
Kanguru Duplicator User Guide
No ratings yet
Kanguru Duplicator User Guide
89 pages
Fixed Asset Register - June 2017 DTT Ndokwa
No ratings yet
Fixed Asset Register - June 2017 DTT Ndokwa
714 pages
HP Compaq 6710b Notebook PC
No ratings yet
HP Compaq 6710b Notebook PC
2 pages
Cable Access Technologies
No ratings yet
Cable Access Technologies
17 pages
Ili9320ds V0.55 PDF
No ratings yet
Ili9320ds V0.55 PDF
109 pages
Wa Ecg Brochure
No ratings yet
Wa Ecg Brochure
4 pages
Dexing
100% (1)
Dexing
7 pages
433 Tranciver
No ratings yet
433 Tranciver
5 pages
DX Diag
No ratings yet
DX Diag
11 pages
LCE-24C100F (K) LCE-24C100F (N) LCE-24C100F (R) LCE-24C100F (S) : Instruction Manual
No ratings yet
LCE-24C100F (K) LCE-24C100F (N) LCE-24C100F (R) LCE-24C100F (S) : Instruction Manual
20 pages
CCNA 2 Chapter 1 v5.0 Exam Answers 2013: Has A Lower Latency Appropriate For High-Performance Computing Applications
No ratings yet
CCNA 2 Chapter 1 v5.0 Exam Answers 2013: Has A Lower Latency Appropriate For High-Performance Computing Applications
7 pages
John Bryan Munoz Laura Yiset Vivas Diego Armando Palacios
No ratings yet
John Bryan Munoz Laura Yiset Vivas Diego Armando Palacios
2 pages
Nexia CS DSP Conference System: Features
No ratings yet
Nexia CS DSP Conference System: Features
2 pages
Error-Correction on Non-Standard Communication Channels
From Everand
Error-Correction on Non-Standard Communication Channels
Edward A. Ratzer
No ratings yet
Noise Reduction: Enhancing Clarity, Advanced Techniques for Noise Reduction in Computer Vision
From Everand
Noise Reduction: Enhancing Clarity, Advanced Techniques for Noise Reduction in Computer Vision
Fouad Sabry
No ratings yet

Convention Paper 5553: Spectral Band Replication, A Novel Approach in Audio Coding

Uploaded by

Convention Paper 5553: Spectral Band Replication, A Novel Approach in Audio Coding

Uploaded by

_________________________________

Audio Engineering Society

Convention Paper 5553

SPECTRAL BAND REPLICATION

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

SPECTRAL BAND REPLICATION

Figure 7: Spectrum after high frequency adjustment

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

SPECTRAL BAND REPLICATION

High Frequency Reconstruction

Additional High Frequency Components

Input Core Codec

Figure 8: Basic block-diagram of the SBR encoder.

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

SPECTRAL BAND REPLICATION

Figure 13: Spectrum of the original signal.

bit bit de mux bitstream mux

Figure 14: mp3PRO block diagram

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

Sc02 Sc03 Si01 Si02 Si03 Sm01 Sm02 Sm03

SPECTRAL BAND REPLICATION

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

SPECTRAL BAND REPLICATION

AES 112TH CONVENTION, MUNICH, GERMANY, 2002 MAY 1013

You might also like