Computational Perception. Sound localization 2

Computational Perception 15-485/785 January 22, 2008 Sound localization 2

Last lecture
- sound propagation: reflection, diffraction, shadowing
- sound intensity (dB)
- defining computational problems
- sound lateralization
- ITD and IIDs
- duplex theory
- localization acuity, minimum audible angle
- estimating ITD, cross correlation

Cross correlation of white noise
[Figure: waveforms x_R(t) and x_L(t) plotted over 0 to 2000 μsec, and Corr(x_R(t), x_L(t)) plotted over lags of -1000 to 1000 μsec.]
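As a minimal sketch of the cross-correlation estimate (not code from the lecture), the following Python example delays one channel of white noise and recovers the delay as the lag of the correlation peak. The function name `estimate_itd`, the 100 kHz sampling rate, the 20 ms noise burst, and the 300 μsec test delay are all assumptions chosen for illustration. The default search range of ±690 μsec is the natural ITD range quoted in the lecture.

```python
import numpy as np

def estimate_itd(x_left, x_right, fs, max_itd=690e-6):
    """Return (itd, lags, corr); itd > 0 means the right channel lags the left."""
    max_lag = int(round(max_itd * fs))
    lags = np.arange(-max_lag, max_lag + 1)
    # Correlate only the central samples so np.roll wrap-around never enters.
    core = slice(max_lag, len(x_left) - max_lag)
    corr = np.array([np.dot(x_left[core], np.roll(x_right, -lag)[core])
                     for lag in lags])
    return lags[np.argmax(corr)] / fs, lags / fs, corr

# Illustrative values (assumed, not from the lecture).
fs = 100_000                     # samples per second, so one sample = 10 usec
delay_samples = 30               # 300 usec delay applied to the right channel
rng = np.random.default_rng(0)
noise = rng.standard_normal(2000 + delay_samples)
x_left = noise[delay_samples:]   # left ear hears the noise first
x_right = noise[:2000]           # right ear hears the same noise 30 samples later

itd, lags, corr = estimate_itd(x_left, x_right, fs)
print(f"estimated ITD: {itd * 1e6:.0f} usec")
```

Because white noise is uncorrelated with shifted copies of itself, the correlation has a single sharp peak, which is why the broadband case is unambiguous.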

Cross correlation of a high frequency tone
[Figure: waveforms x_R(t) and x_L(t), freq = 1500 Hz, plotted over 0 to 2000 μsec, and Corr(x_R(t), x_L(t)) plotted over lags of -1000 to 1000 μsec.]
This is called phase ambiguity because there are multiple peaks within the natural range of ±690 μsec.
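To make the phase ambiguity concrete: for an idealized pure tone, the cross correlation is itself a sinusoid, so it peaks at the true ITD and at every whole period away from it. The sketch below lists the peaks that fall inside the ±690 μsec range. The helper name `ambiguous_lags` and the 200 μsec true ITD are assumptions for illustration only.

```python
import math

def ambiguous_lags(freq_hz, itd_s, max_itd_s=690e-6):
    """Lags within +/-max_itd_s where a pure tone's cross correlation peaks."""
    period = 1.0 / freq_hz
    # Peaks sit at itd_s + k * period; keep the integers k that stay in range.
    k_min = math.ceil((-max_itd_s - itd_s) / period)
    k_max = math.floor((max_itd_s - itd_s) / period)
    return [itd_s + k * period for k in range(k_min, k_max + 1)]

for f in (500, 1500):
    peaks = ambiguous_lags(f, 200e-6)   # 200 usec true ITD (assumed)
    print(f, "Hz:", [round(p * 1e6) for p in peaks], "usec")
```

At 500 Hz the period (2000 μsec) is longer than the whole natural range, so only the true peak survives. At 1500 Hz the period is about 667 μsec, so a second candidate appears.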

Testing the duplex theory
Pure tones above 1500 Hz are ineffective for lateralization.
- Does this mean all sounds above 1500 Hz are?
Consider bandpass noise: 3000-3300 Hz
- How would you perceive this sound?
Sound is correctly localized, but with greater error (60 μsec vs. 10 μsec).

Cross correlation of a high frequency tone
[Figure: waveforms x_R(t) and x_L(t), freq = 3150 Hz, plotted over 0 to 2000 μsec, and Corr(x_R(t), x_L(t)) plotted over lags of -1000 to 1000 μsec.]
Why might this sound not be correctly localized?

What does the auditory system do?
[Figure from Yost, 2000]

Frequency mapping of the basilar membrane
[Figure from Warren, 1999]
How do we lateralize narrowband sounds if the ear decomposes sound in terms of frequency?

Filtering and frequency space (on board)

Integrating across frequency: psychophysical models
Ensembles of coincidence-counting units (Stern and Trahiotis, 1995)
How is sound localized when the bandwidth is increased?
Note: the sound is still lateralized correctly even though the ITD is far outside its natural range.
Narrowband sound is lateralized to the right, broadband to the left.
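One way to see why bandwidth matters is a toy across-frequency summation. This is a deliberate simplification and not the Stern and Trahiotis model itself: each frequency channel is treated as an idealized tone whose cross correlation is a cosine centered on the true ITD, and the channels are simply averaged. The 500 Hz center frequency, the 300 to 700 Hz band, and the 1500 μsec ITD are assumed values for illustration, as are all the names in the code.

```python
import numpy as np

def summary_correlation(freqs_hz, itd_s, lags_s):
    """Average the per-channel correlations cos(2*pi*f*(lag - itd)) over frequency."""
    channels = np.cos(2 * np.pi * np.outer(freqs_hz, lags_s - itd_s))
    return channels.mean(axis=0)

lags = np.arange(-3000, 3001, 10) * 1e-6   # candidate lags, 10 usec steps
itd = 1500e-6                              # far outside the +/-690 usec range

# Narrowband: a single 500 Hz channel. All its peaks are equally tall,
# so a centrality rule (pick the peak nearest zero lag) is applied.
narrow = summary_correlation(np.array([500.0]), itd, lags)
narrow_peaks = lags[narrow > 0.9999]
central_peak = narrow_peaks[np.argmin(np.abs(narrow_peaks))]

# Broadband: channels from 300 to 700 Hz agree only at the true ITD.
broad = summary_correlation(np.linspace(300, 700, 41), itd, lags)
broad_peak = lags[np.argmax(broad)]

print(f"narrowband central peak: {central_peak * 1e6:.0f} usec")
print(f"broadband peak:          {broad_peak * 1e6:.0f} usec")
```

In this sketch the narrowband peak nearest zero has the opposite sign to the true ITD, while the broadband sum peaks at the true ITD, because that is the only lag at which every channel lines up. Which sign corresponds to left or right depends on the sign convention, which is not fixed here.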

Things are not as simple as they might seem
Delay a 3900 Hz tone modulated at 300 Hz. Can this ITD be detected?
Could ITD of low frequencies explain this?
- No: the beat frequency is 300 Hz, but the spectrum contains only 3900 Hz and 3900±300 Hz.
Time delay of the envelope predicts lateralization.
[Figure from Blauert, 1997]
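The claim that the modulated tone contains no low-frequency energy can be checked numerically. The sketch below builds a 3900 Hz carrier with 300 Hz sinusoidal amplitude modulation and lists the spectral components. The 100% modulation depth, the 48 kHz sampling rate, and the 0.1 s duration are assumptions made so that every component lands exactly on an FFT bin.

```python
import numpy as np

fs = 48_000                      # sampling rate in Hz (assumed)
n = 4_800                        # 0.1 s of signal, so FFT bins fall every 10 Hz
t = np.arange(n) / fs
carrier_hz, mod_hz = 3900, 300

# Sinusoidally amplitude-modulated tone, 100% modulation depth (assumed).
x = (1 + np.cos(2 * np.pi * mod_hz * t)) * np.cos(2 * np.pi * carrier_hz * t)

amp = 2 * np.abs(np.fft.rfft(x)) / n          # single-sided amplitude spectrum
freqs = np.fft.rfftfreq(n, d=1 / fs)

components = np.round(freqs[amp > 0.1]).astype(int).tolist()
amp_at_mod = amp[np.argmin(np.abs(freqs - mod_hz))]

print("spectral components (Hz):", components)
print("amplitude at 300 Hz:", amp_at_mod)
```

The envelope repeats every 1/300 s, yet there is no spectral component at 300 Hz, so any ITD sensitivity for this stimulus has to come from the envelope rather than from low-frequency fine structure.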

Limitations of the Duplex Theory
- limited to lateralization
- doesn't do front-back discrimination
- doesn't explain why sounds are heard outside your head

Can sound be localized with one ear?
- total deafness in left ear, normal in right
- 100 ms white noise pulses
- head immobilized
Localization ability improves with experience.
[Figure from Blauert, 1997]

The Function of the Pinna
- Older theories: sound gathering (1600s - even today)
- Darwin (1800s): vestigial form of animal ear, no role in sound localization
- Lord Rayleigh (1907): distinguish between front and back
[Figure from Warren, 1999]

Batteau's theory (1967, 1968)
Echoes produced by the pinnae provide lateralization and elevation cues.
- used microphones in pinna casts
- measured delays for azimuths and elevations:
  - azimuths: 2 to 80 μsec
  - elevations: 100 to 300 μsec
- then the key experiment: listening through casts caused externalization
- also observed that animals have pinnae of similar shapes
Freedman and Fisher (1968):
- Not necessary to use subject's own pinnae
- subjects can localize with other pinnae, but with less accuracy
- Only a single pinna (monaural) is needed for localization

Testing Batteau's theory
Do we perceive monaural echoes?
[Figure from Warren, 1999]
Combining noise with a delayed copy of itself results in spectral filtering.
[Figure from Warren, 1999]
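The spectral filtering produced by adding a delayed copy is a comb filter: a direct sound plus one echo with delay τ has gain |1 + g·exp(-j2πfτ)|, with notches at odd multiples of 1/(2τ). The sketch below evaluates this for a 100 μsec delay, the low end of the elevation delays reported by Batteau. The function names, the equal-amplitude echo (g = 1), and the 20 kHz upper limit are assumptions for illustration.

```python
import numpy as np

def comb_gain(freq_hz, delay_s, echo_gain=1.0):
    """Magnitude response of a direct sound plus a single delayed echo."""
    phase = -2j * np.pi * np.asarray(freq_hz, dtype=float) * delay_s
    return np.abs(1 + echo_gain * np.exp(phase))

def notch_frequencies(delay_s, f_max_hz):
    """Frequencies up to f_max_hz where the echo cancels the direct sound."""
    notches = []
    k = 0
    while (2 * k + 1) / (2 * delay_s) <= f_max_hz:
        notches.append((2 * k + 1) / (2 * delay_s))
        k += 1
    return notches

delay = 100e-6   # 100 usec echo delay
notches = notch_frequencies(delay, 20_000)
print("notches (Hz):", [round(f) for f in notches])
print("gain at 10 kHz:", float(comb_gain(10_000, delay)))
```

Because the notch positions move as the delay changes, a direction-dependent echo delay becomes a direction-dependent spectral shape, which is a cue that a single ear can use.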

Model proposed by Blauert to explain the effect of the pinna as a reflector.
[Figure from Blauert, 1997]

An improved analysis
Shaw and Teranishi (1968): investigate pinna behavior in the frequency domain using an external ear model.
[Figure from Blauert, 1997]

Acoustic resonance in the outer ear
Distribution of sound pressure for several natural resonances:
- confirmed first two resonances in natural ear
- others combine into a broad resonance
Distribution of sound pressure along model ear canal for 10 kHz:
- resonances are direction dependent
- pinna and ear canal form a system of acoustical resonators
[Figures from Blauert, 1997]

The general case
What limitations do the pinnae measurements have?
- Do not take into account the effect of the head and body.
How to characterize the filtering?
- Measure the transfer function: ratio of pressure at sound source to pressure of (ideally) sound reaching eardrum
- This is called the head-related transfer function (HRTF)
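A transfer function of this kind can be estimated by dividing the spectrum of the signal at the ear by the spectrum of the source signal. The sketch below does this on synthetic data. The "ear" signal is the source convolved with a made-up impulse response (a direct path plus one echo), which stands in for a real measurement and is not a real HRTF. The function name and the output-over-input convention (ear spectrum divided by source spectrum) are assumptions of this sketch.

```python
import numpy as np

def estimate_transfer_function(source, ear, n_fft):
    """Estimate H(f) = Ear(f) / Source(f) (output-over-input convention, assumed)."""
    source_spec = np.fft.rfft(source, n_fft)
    ear_spec = np.fft.rfft(ear, n_fft)
    return ear_spec / source_spec

rng = np.random.default_rng(1)
source = rng.standard_normal(256)              # white-noise probe signal

# Made-up impulse response: direct sound plus a weaker echo (hypothetical).
true_ir = np.array([0.0, 0.0, 1.0, 0.0, 0.5])
ear = np.convolve(source, true_ir)             # simulated signal at the eardrum

n_fft = 512                                    # longer than len(ear): no wrap-around
h_est = estimate_transfer_function(source, ear, n_fft)
ir_est = np.fft.irfft(h_est, n_fft)[:len(true_ir)]
print("recovered impulse response:", np.round(ir_est, 3))
```

Plain spectral division works here because the synthetic data are noise-free. Real measurements contain noise, so the division becomes unreliable wherever the source spectrum is weak.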

Measuring HRTFs
Different types of HRTFs
- monaural: pressure at source vs. eardrum
- binaural: pressure difference for two corresponding points in the two ear canals
[Figures: subject with probe mics; KEMAR, the sound dummy. From Blauert, 1997]

Measured monaural HRTF
[Figure from Blauert, 1997]

Measured binaural HRTF
[Figure from Blauert, 1997]

Problems in using HRTFs
- HRTFs vary across subjects
- can't easily get an average
- but can do structural averaging
[Figure from Blauert, 1997]

More than just direction: cues for sound distance
- Frequency-independent 1/r pressure attenuation
- works if you know some properties of the sound source
- HRTF depends on distance
- frequency-dependent attenuation (long distances)
- head movements (short distances)
[Figure from Blauert, 1997: curves have 1/r attenuation factored out]
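The 1/r pressure law translates directly into decibels: moving from distance r1 to r2 changes the sound pressure level by -20·log10(r2/r1) dB, which is about 6 dB per doubling of distance. A short sketch, with a hypothetical function name and example distances in metres:

```python
import math

def level_change_db(r_from_m, r_to_m):
    """Change in sound pressure level (dB) under 1/r pressure attenuation."""
    return -20.0 * math.log10(r_to_m / r_from_m)

for r in (2, 4, 10):
    print(f"1 m -> {r} m: {level_change_db(1, r):.1f} dB")
```

The level change alone only gives a ratio of distances. Turning it into an absolute distance requires knowing how loud the source is at some reference distance, which is why this cue works only if some properties of the source are known.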

Next time: the computational problem

Misconceptions still persist today...