Summary

This document proposes a framework for quantitatively evaluating any spatial sound reproduction method: (1) specify the listening space and speaker placement; (2) specify the virtual acoustic sources to be created; (3) compute the signals driving each loudspeaker; and (4) compare the reproduced sound field to the target virtual sources to assess performance. The goal is to make spatial audio system design more deterministic and less trial-and-error by incorporating models of human binaural hearing into analysis tools. Key metrics, such as perceived source location, spatial extent, and diffuseness, correspond to what listeners report hearing.


Evaluating Spatial Sound Systems

Mark F. Bocko

Audio & Music Engineering


Audio Engineers love specs …
• Predicting which speakers will sound good …

How many speakers are enough?

(Figure: the NHK 22.2 loudspeaker layout, each speaker marked with a dollar sign)
Framework

• Quantitatively evaluate any spatial sound reproduction method in any space
• Incorporate quantitative models of binaural hearing into audio system design tools
• Identify the computable quantities that correspond to what listeners report they hear (locations, spatial extent of sources, diffuseness)
• Make the design of systems for creating spatial audio more deterministic and less trial and error
  • Both for free-space sound reproduction
  • And for headphone-based reproduction

The evaluation pipeline (sketched in code below):
1. Specify the listening space & speaker placement
2. Specify the virtual acoustic sources to be created
3. Compute the signals driving each loudspeaker (your favorite method)
4. Compute the acoustic field at the listener (directional impulse response)
5. Compute the sound field-listener interaction (head model)
6. Compute percepts (binaural fusion model)
7. Infer the virtual acoustic source properties, then compare & assess against the specified sources
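The seven steps above can be read as a small computational pipeline. Below is a minimal sketch of that loop for a single free-field source, with a spherical-head ITD model standing in for steps 4-5 and a windowed cross-correlation standing in for the binaural fusion model of step 6; all names and parameter values (FS, HEAD_D, etc.) are illustrative assumptions, not taken from the slides.

import numpy as np

FS = 48_000        # sample rate (Hz); assumed
C = 343.0          # speed of sound (m/s)
HEAD_D = 0.175     # ear-to-ear distance (m); assumed typical value

def itd_for_azimuth(theta_deg):
    # Low-frequency spherical-head model: ITD = (3/2)(d/c) sin(theta).
    return 1.5 * (HEAD_D / C) * np.sin(np.radians(theta_deg))

def render_free_field(source, theta_deg):
    # Steps 4-5 stand-in: apply the interaural delay of a free-field source.
    n = int(round(itd_for_azimuth(theta_deg) * FS))
    return np.roll(source, max(n, 0)), np.roll(source, max(-n, 0))

def perceived_azimuth(left, right, max_lag=40):
    # Steps 6-7 stand-in: peak of the interaural cross-correlation -> azimuth.
    lags = np.arange(-max_lag, max_lag + 1)
    xcorr = [np.dot(left, np.roll(right, k)) for k in lags]
    itd = lags[int(np.argmax(xcorr))] / FS
    return np.degrees(np.arcsin(np.clip(itd * C / (1.5 * HEAD_D), -1.0, 1.0)))

rng = np.random.default_rng(0)
src = rng.standard_normal(FS // 10)            # step 2: a 100 ms noise source
left, right = render_free_field(src, 30.0)     # steps 3-5
print(f"perceived azimuth ~ {perceived_azimuth(left, right):.1f} deg")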
Outline

• How the ear works – very briefly


• Meddis hair cell model

• Cross-correlation model of directional hearing

• Audio coherence and spatial hearing

• Interaural time and level differences

• Spectral coloring from source elevation

• Correlograms

• Examples
Human Auditory System
(Figure: cross-section of the cochlea, labeling the Reissner membrane, scala vestibuli, tectorial membrane, organ of Corti, scala tympani, and basilar membrane)
(Figure: ©2013 by American Physiological Society)
Meddis Hair Cell Model

• Model output ~ firing probability
• Around 3000 inner hair cells lie along the length of the basilar membrane
• Neuron firing is irregular and clustered near signal peaks (see the sketch below)
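For readers who want to experiment, here is a minimal sketch of a Meddis-style transmitter-reservoir model (free transmitter q, cleft contents c, reprocessing store w), integrated with forward Euler. The equation structure follows Meddis (1986); the parameter values below are illustrative choices, not necessarily the published set.

import numpy as np

def meddis_hair_cell(stimulus, fs, A=5.0, B=300.0, g=2000.0, y=5.05,
                     l=2500.0, r=6580.0, x=66.31, M=1.0, h=50000.0):
    # Returns an instantaneous firing probability per sample.
    dt = 1.0 / fs
    # Start the reservoirs at their silence (s = 0) steady state.
    k0 = g * A / (A + B)
    c = M * y * k0 / (l * k0 + y * (l + r))
    q = c * (l + r) / k0
    w = c * r / x
    prob = np.empty(len(stimulus))
    for i, s in enumerate(stimulus):
        k = g * max(s + A, 0.0) / (s + A + B)    # membrane permeability
        dq = (y * (M - q) + x * w - k * q) * dt  # replenish + reuptake - release
        dc = (k * q - (l + r) * c) * dt          # release - loss - reprocessing
        dw = (r * c - x * w) * dt                # reprocessing store
        q, c, w = q + dq, c + dc, w + dw
        prob[i] = min(h * c * dt, 1.0)           # firing probability this sample
    return prob

fs = 20_000
t = np.arange(0, 0.1, 1 / fs)
tone = 30.0 * np.maximum(np.sin(2 * np.pi * 500 * t), 0)  # rectified drive
p = meddis_hair_cell(tone, fs)
print(f"spontaneous p: {p[0]:.2e}, onset peak p: {p.max():.2e}")

Running this shows the two behaviors the slides emphasize: a nonzero spontaneous probability before the tone, and an onset peak that adapts as the free-transmitter pool depletes, so firing clusters near signal peaks.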
Meddis Hair Cell Model (continued)

• Model output ~ firing probability
• The model also reproduces the spontaneous firing rate seen in the absence of a stimulus
Binaural Fusion Model

(Figure: low- and high-frequency channels from the left and right ears converge on the site of binaural fusion; outputs are tapped along the way)

Represent the site of binaural fusion as a bi-directional delay line, fed from the left cochlea on one side and the right cochlea on the other.
Binaural fusion mechanism → 2 msec windowed cross-correlation

(Figure: delay lines carrying xr(t) and xl(t) from the right and left ears; a sliding window W(T) of width TW ≈ 2 msec is applied at successive times t1, t2, t3, and the windowed signals are cross-correlated)

The lag τ at which the peak of the cross-correlation appears is the interaural time difference (ITD).

Jeffress, L. A. (1948). A place theory of sound localization. Journal of Comparative and Physiological Psychology, 41(1), 35.
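A minimal sketch of the 2 msec windowed cross-correlation, operating on plain sample streams rather than modeled nerve signals; the window length and the +/-0.8 ms lag range are assumptions.

import numpy as np

def windowed_itd(xl, xr, fs, win_s=0.002, max_itd_s=0.0008):
    # Slide a short window along both ear signals; in each window, take the
    # lag of the cross-correlation peak. Returns one ITD estimate per window.
    win = int(win_s * fs)
    max_lag = int(max_itd_s * fs)
    lags = np.arange(-max_lag, max_lag + 1)
    itds = []
    for start in range(max_lag, len(xl) - win - max_lag, win):
        seg_l = xl[start:start + win]
        xc = [np.dot(seg_l, xr[start + k:start + k + win]) for k in lags]
        itds.append(lags[int(np.argmax(xc))] / fs)
    return np.array(itds)

fs = 48_000
rng = np.random.default_rng(1)
src = rng.standard_normal(fs)              # 1 s of noise
delay = 24                                 # a 0.5 ms interaural delay
xl, xr = src[delay:], src[:-delay]
print(windowed_itd(xl, xr, fs)[:5])        # estimates cluster near +0.5 ms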
Interaural Time Difference and source direction (in the horizontal plane)

The perceived ITD (and hence the direction to the source) is determined by the location of the peak in the short-time cross-correlation function.

In the low-frequency limit of Rayleigh diffraction around a sphere:

ITD = (3/2) (d/c) sin θ

where d is the distance between the ears, θ is the source azimuth, and c is the speed of sound.

• ITD = 0 when θ = 0
• ITD = (3/2)(d/c) when θ = 90°

Note: the factor of 3/2 is due to diffraction around the listener's head.
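Plugging in typical numbers (d = 0.175 m is an assumed ear spacing, not a value given on the slide):

import numpy as np

C = 343.0    # speed of sound (m/s)
D = 0.175    # assumed distance between the ears (m)

def itd(theta_deg):
    # Low-frequency Rayleigh limit: ITD = (3/2)(d/c) sin(theta).
    return 1.5 * (D / C) * np.sin(np.radians(theta_deg))

for theta in (0, 30, 60, 90):
    print(f"azimuth {theta:2d} deg -> ITD = {itd(theta) * 1e6:5.1f} us")
# The 90 deg case gives the maximum: (3/2)(0.175/343) ~ 765 us.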
Role of coherence in binaural hearing

Demo: 3 sec white noise bursts from two speakers, S1 and S2 (quantified in the sketch below):
• S1 alone
• S2 alone
• S1 + S2, driven by the same noise signal
• S1 + S2, driven by different (uncorrelated) noise signals
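The same/different distinction can be quantified as interaural coherence: the maximum of the normalized cross-correlation of the two ear signals. A sketch under idealized conditions (each ear hears one speaker perfectly; all names are assumptions):

import numpy as np

def coherence(a, b, max_lag=100):
    # Max of the normalized cross-correlation over lags: 1 = fully coherent.
    a = (a - a.mean()) / a.std()
    b = (b - b.mean()) / b.std()
    vals = [np.dot(a[max_lag:-max_lag], b[max_lag + k:len(b) - max_lag + k])
            for k in range(-max_lag, max_lag + 1)]
    return max(vals) / (len(a) - 2 * max_lag)

rng = np.random.default_rng(2)
n = 48_000 * 3                          # 3 s bursts at 48 kHz
same = rng.standard_normal(n)           # S1 and S2 driven by the same noise
diff = rng.standard_normal(n)           # an independent noise for S2
print(f"same noise:      coherence ~ {coherence(same, same):.2f}")   # ~1.0
print(f"different noise: coherence ~ {coherence(same, diff):.2f}")   # ~0.0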
Demonstration of lateralization as a function of noise burst duration

• Play a series of uncorrelated stereo noise bursts of decreasing duration
  (2 sec, 1 sec, 0.5 sec, 0.2 sec, 0.1 sec, 50 msec, 20 msec, 10 msec, 5 msec, 2 msec, 1 msec)

(Audio: series of uncorrelated 2 msec stereo noise bursts)

• At about 2 msec and less, each burst is identified with a specific location
• The cross-correlation function always has a peak somewhere, but it is in a different place each time (simulated below)
• The auditory percept computed by the brain is updated about every 2 milliseconds
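The "a peak somewhere, but different each time" behavior is easy to reproduce: for independent left/right noise, the lag of the cross-correlation maximum is essentially random from one 2 msec burst to the next. A sketch (lag range assumed):

import numpy as np

fs = 48_000
rng = np.random.default_rng(3)
max_lag = 38                                   # ~0.8 ms, roughly the ITD range
lags = np.arange(-max_lag, max_lag + 1)

for trial in range(5):
    n = int(0.002 * fs)                        # one 2 ms uncorrelated burst
    xl, xr = rng.standard_normal(n), rng.standard_normal(n)
    xc = [np.dot(xl[max_lag:-max_lag], xr[max_lag + k:n - max_lag + k])
          for k in lags]
    itd_ms = lags[int(np.argmax(xc))] / fs * 1e3
    print(f"burst {trial}: peak at {itd_ms:+.2f} ms")   # a different lag each time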
Auditory “Sluggishness”

• How quickly can a listener follow time-varying binaural cues?
• Evidence points to a 200 - 300 msec threshold
• The distribution of 2 msec window ITDs has a “memory” of 100 - 300 msec

(Audio: an “L” click, then series of left-, center-, and right-located clicks alternating every 10 msec, 50 msec, 100 msec, 250 msec, and 500 msec)

Your brain averages over a hundred or more 2 msec windows and constructs a histogram of interaural time differences (a histogram of ITDs; sketched below).
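A sketch of the two-time-scale idea: take one ITD estimate per 2 msec window, keep the most recent ~100 of them (~200 msec of "memory"), and report the histogram mode as the percept. The window count and bin width are illustrative, and the input here is a synthetic ITD stream rather than estimates from real signals.

import numpy as np
from collections import deque

def itd_histogram_tracker(itd_stream_us, memory_windows=100, bin_us=50):
    # Keep the last ~100 window estimates (100 x 2 ms = 200 ms of "memory")
    # and report the histogram mode as the perceived ITD.
    recent = deque(maxlen=memory_windows)
    edges = np.arange(-800, 801, bin_us)        # ITD bins in microseconds
    for itd_us in itd_stream_us:
        recent.append(itd_us)
        counts, _ = np.histogram(recent, bins=edges)
        i = int(counts.argmax())
        yield 0.5 * (edges[i] + edges[i + 1])   # center of the modal bin

# A source that jumps from -400 us to +400 us halfway through, with jitter:
rng = np.random.default_rng(4)
stream = np.concatenate([rng.normal(-400, 80, 150), rng.normal(400, 80, 150)])
modes = list(itd_histogram_tracker(stream))
# Ten windows (20 ms) after the jump the mode still sits near -400 us:
print(f"before jump: {modes[149]:+.0f} us, 20 ms after: {modes[160]:+.0f} us")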
Correlograms – frequency-dependent interaural time differences

(Figure: correlogram surfaces plotted against frequency and delay; the example is a stereo speaker pair with center panning, under anechoic conditions)

• The 2-D (ITD × frequency) map encodes source location; the brain decodes these maps into source locations (sketched below)
• ITD → lateral position of the source
• Frequency dependence of the ITD → source elevation
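A correlogram can be sketched by splitting each ear signal into frequency bands and cross-correlating band by band. Butterworth bandpass filters stand in for a cochlear filterbank here, which is a deliberate simplification of the hair-cell front end described earlier; the band centers and widths are assumptions.

import numpy as np
from scipy.signal import butter, sosfiltfilt

CENTERS_HZ = (250, 500, 1000, 2000)    # assumed band centers

def correlogram(xl, xr, fs, max_itd_s=0.0008):
    # One normalized cross-correlation row per frequency band.
    max_lag = int(max_itd_s * fs)
    lags = np.arange(-max_lag, max_lag + 1)
    rows = []
    for fc in CENTERS_HZ:
        sos = butter(2, [fc / 1.3, fc * 1.3], btype="bandpass", fs=fs,
                     output="sos")
        bl, br = sosfiltfilt(sos, xl), sosfiltfilt(sos, xr)
        row = np.array([np.dot(bl[max_lag:-max_lag],
                               br[max_lag + k:len(br) - max_lag + k])
                        for k in lags])
        rows.append(row / np.max(np.abs(row)))
    return lags / fs, np.array(rows)    # (delay axis in s, freq x delay map)

fs = 48_000
rng = np.random.default_rng(5)
src = rng.standard_normal(fs // 2)
xl, xr = src[12:], src[:-12]            # a 0.25 ms interaural delay
delays, cmap = correlogram(xl, xr, fs)
for fc, row in zip(CENTERS_HZ, cmap):
    print(f"{fc:4d} Hz band: peak at {delays[row.argmax()] * 1e3:+.2f} ms")

Note that high-frequency bands show repeating peaks spaced by the band's period; this phase ambiguity is one reason the full 2-D map, rather than any single band, encodes source location.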
Procedure
• For a given head model …
• Compute the reference correlograms for all possible sound source directions
• Specify the multi-channel reproduction system, the influence of the room, and
the signals driving each speaker (for whatever method you choose)
• Compute the resulting correlogram
• Project the computed correlogram onto the reference set to infer the direction
• One may infer a superposition of source directions
• Specific methods
  • Decompose into spherical harmonics (orthogonality helps)
  • Error minimization (sketched below)
  • Machine learning

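Of the listed methods, error minimization is the simplest to sketch: stack the reference correlograms as columns and solve a non-negative least-squares problem for the direction weights. Everything below (grid size, names) is illustrative.

import numpy as np
from scipy.optimize import nnls

def infer_directions(computed, references):
    # computed:   flattened correlogram of the reproduced field, shape (m,)
    # references: one flattened reference correlogram per candidate direction,
    #             stacked as columns, shape (m, n_directions)
    # Returns non-negative weights: a superposition of source directions.
    weights, _residual = nnls(references, computed)
    return weights / weights.sum()

# Toy example: 5 candidate directions; the "reproduced" field is a blend of
# directions 1 and 3 (e.g., a phantom image between two speakers).
rng = np.random.default_rng(6)
refs = rng.random((200, 5))
target = 0.7 * refs[:, 1] + 0.3 * refs[:, 3]
print(np.round(infer_directions(target, refs), 2))   # ~[0, 0.7, 0, 0.3, 0]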
So how does the method work? … Assessing the effect of reverberation

(Example room: the Aula Carolina in Aachen)
Reverberation broadens the source image

Note: the random nature of the nerve impulse stream creates a spread of image width, even in a non-reverberant space.
Spatial Blur – experimental measurements

The model reproduces the observed angular acuity; the spread arises from the statistics of the neuronal pulses.

Blauert, J., Spatial Hearing: The Psychophysics of Human Sound Localization, MIT Press, 1983.
Spatial acuity with one ear!

If you don’t believe the cross-correlation model, look at this!

Blauert, J., Spatial Hearing: The Psychophysics of Human Sound Localization, MIT Press, 1983.
Modeling Stereo Reproduction (speaker signals Sl and Sr)

• f̃(ω) describes the frequency dependence of head diffraction

R_RL(t, τ) = R_c(t, τ − τ_d) + f̃(ω)² R_c(t, τ + τ_d)

where τ_d is the left-right ear delay and R_c(t, τ) is the cross-correlation of Sl and Sr.

(Figure: listener facing a symmetric stereo pair, L and R)
Stereo Sweet Spot calculation

• Compute the peak of the distribution of ITDs for a real source at the intended location
• Compute the peak of the distribution of ITDs for the stereo-rendered intended source
• Infer the apparent source direction from the peak of the ITD distribution (sketched below)
• This example is for coherent sources; the formalism can also be used with partially coherent sources, i.e., real signals in reverberant spaces
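A sketch of the last step: invert the ITD-distribution peak back to an apparent azimuth using the spherical-head formula from earlier. The two peak values below are made-up numbers for illustration, not measured results.

import numpy as np

C, D = 343.0, 0.175     # speed of sound (m/s), assumed ear spacing (m)

def apparent_azimuth(itd_peak_s):
    # Invert ITD = (3/2)(d/c) sin(theta) for the apparent source azimuth.
    s = np.clip(itd_peak_s * C / (1.5 * D), -1.0, 1.0)
    return np.degrees(np.arcsin(s))

# Suppose the ITD histogram peaks at 380 us for the real source but at
# 310 us for the stereo-rendered version (hypothetical numbers):
print(f"real source:     {apparent_azimuth(380e-6):.1f} deg")
print(f"stereo rendered: {apparent_azimuth(310e-6):.1f} deg")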
Main Points

• Integrated a quantitative neurological model into a spatial audio analysis tool
• The randomness of auditory nerve firing events is important; it predicts the measured angular acuity
• Two time scales are in play
  • A short (~2 msec) window for cross-correlation in the brainstem
  • A longer (~100 msec) histogram “memory” (higher-level processing)
• We can predict what a listener will report hearing: the location and spread of a sound source
• There’s a lot left to do …
  • Integrate with room modeling software for a complete analysis package
  • Create synthesis tools: find the designs and algorithms that best reproduce a desired spatial sound effect
  • Continue to refine the auditory models (e.g., distance cues)
END

Cochlea
Cross-correlation (similarity of two signals)

(Figure: the sequence [x1 x2 x3] slid past [y1 y2 y3] at lags −2, −1, 0, 1, 2)

• Signals that are correlated but delayed (delay = 0 vs. delay = 30 samples): the cross-correlation has a clear peak at the delay (demonstrated below)
• Uncorrelated signals: no dominant peak in the cross-correlation
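In numpy terms (np.correlate slides one sequence past the other exactly as in the lag diagram above):

import numpy as np

rng = np.random.default_rng(7)
x = rng.standard_normal(1000)

# Correlated but delayed: y is x delayed by 30 samples.
y = np.roll(x, 30)
xc = np.correlate(x, y, mode="full")            # lags -999 .. +999
lag = int(np.argmax(xc)) - (len(x) - 1)
print(f"correlated pair: peak at lag {lag}")     # -30: y lags x by 30 samples

# Uncorrelated: an independent noise sequence gives no dominant peak,
# just statistical fluctuations.
z = rng.standard_normal(1000)
xz = np.correlate(x, z, mode="full")
print(f"uncorrelated pair: peak/std ratio {np.max(np.abs(xz)) / np.std(xz):.1f}")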


Precedence effect

• Law of the first wave-front: direction is inferred from the first wave-front (up to about 30 - 40 msec)
• Haas effect: short delays enhance “spaciousness”

(Audio: click pairs with the delay swept from 0 - 2 msec, 0 - 40 msec, and 0 - 200 msec, in 20 steps each; stimuli sketched below)

Explained by the saturation and recovery time of the hair cell response.
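The demo stimuli are straightforward to generate: a leading click plus a lagging click, with the lag swept over 20 steps. A sketch (click shape, level, and spacing are assumptions):

import numpy as np

def click_pair_series(max_delay_s, fs=48_000, steps=20, gap_s=0.3):
    # Leading click + lagging click, lag swept from 0 to max_delay_s.
    out = []
    for step in range(steps):
        delay = int(max_delay_s * fs * step / (steps - 1))
        frame = np.zeros(int(gap_s * fs))
        frame[0] = 1.0                       # leading click (first wave-front)
        frame[delay] += 0.8                  # lagging "reflection"
        out.append(frame)
    return np.concatenate(out)

for max_ms in (2, 40, 200):                  # the three sweeps from the slide
    sig = click_pair_series(max_ms / 1000)
    print(f"0 - {max_ms} ms sweep: {len(sig) / 48_000:.1f} s of audio")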
Directional impulse responses

Track both the time of arrival and the direction of each room reflection.

(Matlab Demo: Imp_Resp_w_Angle_3.m)
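The slides demonstrate this with a Matlab script (Imp_Resp_w_Angle_3.m). The Python sketch below is an illustrative stand-in for the underlying data structure, not a port of that script: each arrival carries a time, a direction, and an amplitude.

import numpy as np
from dataclasses import dataclass

@dataclass
class Arrival:
    time_s: float        # time of arrival of this reflection
    azimuth_deg: float   # direction it arrives from (horizontal plane)
    elevation_deg: float
    amplitude: float

def omni_ir(arrivals, fs=48_000, length_s=0.5):
    # Collapse a directional IR to an ordinary (direction-blind) IR.
    ir = np.zeros(int(length_s * fs))
    for a in arrivals:
        ir[int(a.time_s * fs)] += a.amplitude
    return ir

room = [Arrival(0.000, 0.0, 0.0, 1.00),      # direct sound
        Arrival(0.012, 55.0, 0.0, 0.45),     # side-wall reflection
        Arrival(0.019, 0.0, 40.0, 0.38),     # ceiling reflection
        Arrival(0.031, -120.0, 0.0, 0.25)]   # rear-wall reflection
print(f"{len(room)} arrivals, IR energy = {np.sum(omni_ir(room) ** 2):.2f}")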
