David Griesinger
David Griesinger
David Griesinger
Lexicon
[email protected]
www.world.std.com/~griesngr
Major Goals
Reverb
backward
Note many syllables are detected (~30) Notice hardly ANY are detected (~2)
RASTI will give an identical value for both cases!!
We also perceive distance and space
Balloon burst in an
opera house.
Forestage to stalls
row 10.
Note the HUGE burst
of energy about 50ms
after the direct sound.
The 1000Hz 0ctave
band shows the
combined reflections
to be 6dB stronger
than the direct sound.
The result (in this case) is a decrease in intelligibility and an increase in distance
Human Perception – the background sound stream
Mix with
direct
increasing 6dB
Vocals
Downmix - No reverb
on the singers
We can decompose
the waveform into
1/3 octave bands
and look at level
and IACC as a
function of
frequency and
time.
Level ( x = time in ms y=1/3 octave bands 640Hz to 4kHz) IACC
Notice that there is NO information in the IACC below 1000Hz!
Position determination by IACC
We can make a histogram of
the time offset between the
ears during periods of high
IACC.
For the segment of natural
speech in the previous slide, it
is clear that localization is
possible – but somewhat
difficult.
Position determination by IACC 2
Level displayed in 1/3 octave bands (640Hz to 4kHz) IACC in 1/3 octave bands
We can duplicate the sound of the previous example by adding reverberation to dry
speech, and giving it a 5 sample time offset to localize it to the right.
As can be seen in the picture, the direct sound is stronger in the simulation than in the
original, and the IACCs - plotted as 10*log10(1-(1/IACC)) - are stronger.
Position determination by IACC 3
Not surprisingly, due to the higher direct sound level and the artificially
stable source the lateral direction of the synthetic example is extremely clear
and sharply defined.
The physics of two-channel panning
The pressure at each ear is the sum of the direct sound pressure from one speaker
and the diffracted sound pressure from the other.
These two signals interfere with each other, producing a highly frequency
dependent signal.
Consequences of panning physics
• A two channel pan is entirely different from the
localization process in natural hearing.
– Localization always depends on the interaural time delay (ITD)
and the interaural intensity difference (IID).
– In natural hearing the ITD and IID vary due to head shadowing
alone.
• Between 500Hz and 1500Hz the ear relies increasingly on IID rather
than ITD, and the precise phase of the ear signals becomes inaudible.
From: B. C. J. Moore, B. R.
Glasberg and T. Baer, “A model
for the prediction of thresholds,
loudness and partial loudness,” J.
Audio Eng. Soc., vol. 45, pp.
224-240 (1997).
If you record with three widely spaced A listener on the far right will hear the
microphones, an instrument on the left instrument on the left. Now the
will have high amplitude and time orchestra spreads out across the entire
differences in the output signals. loudspeaker basis, even when the
listener is not in the sweet spot.
Training to hear distance
» Play examples
Boston Cantata Singers in Jordan Hall
Major Characteristics
• Chorus is deep in an enclosing stage-house with
significant reverberation.
• Small distances between microphones results in
unwanted leakage.
• Microphones pointed into the stage house increase
the amount of undesirable reverberation.
– Thus the chorus mikes, which must face the chorus, are
supercardiod to minimize reverberation pick-up.
– And the orchestra mikes face the hall, not the stage
house.
• Microphones in front do not pick up enough direct
sound from the chorus to supply the sense of
distance without also getting considerable mud.
Jordan Hall Setup
Solutions
• Add distance to the chorus at the mixing
stage with controlled early reflections
• Minimize stage-house pickup wherever
possible
Audio Demos
• Early reflections
• Late reverberation
Training to hear MUD
• It is relatively easy to train yourself to hear mud, but it is
often very hard to avoid it.
• Mud occurs when the reverberant decay of the recording
venue has too much reflected energy in the 50-150ms
region of the decay curve.
– This is true of nearly all sound stages, small auditoria, and
churches.
• If you are recording in such a space with a relatively large
ensemble, you are in trouble.
Example: John Eargle at Skywalker
ranch
• John Eargle made a wonderful 5.1 channel DVD audio
recording at the Skywalker ranch in Los Angeles.
• Skywalker is a large sound stage with controllable
acoustics. It is not a concert hall.
• As a consequence the reverberation radius is relatively
short. By my estimate (without having seen it) the radius
is about 3.5 meters.
• It is very easy to record mud in such a space.
– Many instruments are beyond the reverb radius.
– Adding more microphones only increases the reverberant pickup.
Example: Revels Chorus in the Sonic Temple
Characteristics
• Main problem here was excessive reverberation level.
– Solution was to add blankets – a LOT of them. 648ft^2
– Here we list the measured reverberation times
– Hz blankets empty
– 8000 0.6 0.9
– 4000 0.8 1.2
– 2000 0.9 1.4
– 1000 0.9 1.4
– 500 1.0 1.3
– 250 0.9 1.3
– 125 1.1 1.4
– 63 1.0 1.5
– Reverb radius before the blankets: ~6 feet (2 meters)
– Reverb radius after the blankets: ~8 feet (2.7 meters)
After the blankets
• Reverberation time drops below 1 second, the
magic number for the early decay in Boston
Symphony Hall
– Recording the band is easy, as we can mike them all
quite closely.
– Recording the chorus is hard, as there are >20 singers,
and we cannot get the microphones close enough to
each.
• Adding more microphones simply results in picking up more
reverberation!
– With the blankets we can record with adequate clarity
using only four supercardioid microphones.
• Once again we augment the early reflections in all outer
channels using the Lexicon.
• Late reverberation is also created using Lexicon late
reverberation.
Reverberation Radius
• The reverberation radius changed from 6’ to 8’
when we added blankets. ~2dB.
– This is not a large enough change to account for the
perceived difference in sound.
• But the change in the total reflected energy in the
time range of 50-150ms (the undesirable time
range) is much larger: 4.5dB.
– This is a highly significant and desirable decrease!
• The decrease in the late reverberation (150ms and
greater) is 6dB.
– But we make this back up with the Lexicon.
Training to hear envelopment
• Here it is essential that you move around the room, and
that you face different directions!
• You must fill the WHOLE room with the sound of the
original recording, and it must work when you face all
directions.
With decorrelated
Mono sounds poor because it We need at least four speakers
reverberation a few spatial
does not reproduce the spatial to reproduce a two
properties come through, but
properties of the original dimensional spatial sensation
only if the listener faces
recording space. that is uniform through the
forward. And the sense of
room.
space is stronger in the front.
Boston Symphony Hall
Boston Symphony Hall
• 2631 seats, 662,000ft^3, 18700m^3, RT 1.9s
– It’s enormous!
– One of the greatest concert halls in the world – maybe the
best.
– Recording here is almost too easy!
– Working here is a rare privilege
• Sufficiently rare I do not do it. (It’s a union shop.)
– The recording in this talk is courtesy of Alan McClellan of
WGBH Boston. (Mixed from 16 tracks by the presenter)
– Reverb Radius is >20’ (>6.6m) even on stage.
– The stage house is enormous. With the orchestra in place,
stage house RT ~1 sec
Boston Symphony Hall, occupied, stage
to front of balcony, 1000Hz
Why is the impulse response
relevant?
• Because the early decay (from the stage) is
short enough to get out of the way before it
muddies he sound.
• And the late decay (from the hall) is long
enough to provide envelopment.
Boston Symphony Orchestra in Symphony Hall
Boston Cantata Singers in Symphony Hall. March 17, 2002
How can we reproduce
envelopment in a small room?
• The reverberant field of a LARGE room can be
reproduced in a SMALL room if:
– We can excite a fluctuating sound VELOCITY across
the listener’s head that mimics the fluctuating velocity
in the original space.
– To do this we MUST have at least two LF drivers on
opposite sides of the listener.
– If the listener is allowed to turn the head, we must have
at least 3 independent drivers, and four is better!
– All the LF drivers must be driven by independent
(uncorrelated) reverberation signals, derived from a
large, non-steady-state room.
Low frequencies are particularly
important!
• In our concert hall and opera work it is frequencies below
300Hz where the major benefit is achieved.
– The result is “inaudible” but highly effective in increasing the
emotional power of the music.
• It is commonly believed that because we “cannot localize”
low frequencies in a playback room we need only one LF
driver
– We can however easily hear the difference on reverberation.
• It is often the case that using a shelf filter on the rear
channels can greatly improve the surround impression.
Shelf filter for rear channels
X-Y plot of the omni front pair in Slavery Documents. Red trace
is Low pass filtered at 200Hz, Yellow trace LP filtered at 100Hz.
Goniometer with AKG pair
X-Y plot of Oriana front pair with The same data, filtered at 100Hz.
a 200Hz LP filter. Red is Note that now the supercardioid
Cardioid, Yellow is Supercardioid is behaving like an omni.
Measure the decorrelation in the playback room
We can make an x-y plot of the left ear and right ear
signals, after boosting the L-R component ~10dB
below 200Hz. These plots cover the frequency
range of 20-100Hz.
Same, with a signal correlation
The change in the ear correlation is quite audible. of 30%
Final Mix
Conclusions
• Recording is a lot of fun!!!
• It is a great pleasure, and is often useful, to
understand some of the science behind the
microphones.
• Although simple techniques using microphone
pairs or arrays can be seductive, a world-class
sound usually requires many microphones, a lot of
work, and artificial acoustic augmentation.
– Time delay panning is undemocratic. Avoid it.
• Make SURE your reverberation is decorrelated,
particularly at low frequencies.
THE END