Phase and Feedback in the Nonlinear Brain. Malcolm Slaney (IBM and Stanford) Hiroko Shiraiwa-Terasawa (Stanford) Regaip Sen (Stanford)

Size: px
Start display at page:

Download "Phase and Feedback in the Nonlinear Brain. Malcolm Slaney (IBM and Stanford) Hiroko Shiraiwa-Terasawa (Stanford) Regaip Sen (Stanford)"

Transcription

1 Phase and Feedback in the Nonlinear Brain Malcolm Slaney (IBM and Stanford) Hiroko Shiraiwa-Terasawa (Stanford) Regaip Sen (Stanford) Auditory processing pre-cosyne workshop March 23, 2004

2 Simplistic Models Dehydrated cats and the application of Fourier analysis to hearing problems become more and more a handicap for research in hearing. von Bekesy Overly simplistic models are bad Wrong path For example, critical band filters Pitch and phase

3 Tony s Goals Linearity Taylor says everything is linear To a point Temporal codes I don t know the answer. Attentional effects Big problem.. Not today Phase perception

4 A Critique of Pure Audition Marr Pure vision Churchland A critique of pure vision Slaney A critique of pure audition Where are efferents?

5 Motivation S. Shamma. Speech processing in the auditory system Good Shihab R. Carlyon, S. Shamma. An account of monaural phase perception Bad Shihab

6 Nonlinear Cochlear Mechanics BM Amplitude (db) Constant SPL 33nm 80dB 60dB 40dB 20dB Frequency (khz) Mössbauer Data SPL Q 3 CF 80dB 1 10k 60dB k 40dB k 20dB k dB 40dB 60dB 20 80dB Frequency (khz) From Basilar membrane measurements and the travelling wave, by B.M. Johnstone, R. Patuzzi, and G. K. Yates, Hearing Research, 22 (1986)

7 Cat Vowels (/da/)

8 Goal Test phase perception models with more sophisticated cochlear models Auditory models Gammatone Gammatone with hair cell model Lyon s passive Ear STRF

9 Nonlinearities Capture effect Dominant frequency is represented A few db matter Stochastic resonance Sub-threshold signals entrain firings Temporal coding Non-linear mapping

10 Phase Models Within channel Auditory nerve channel Phase matters Between channels Phase does matter! Single sinusoids 2 octave separation Time Time

11 Phase Perception Model Short-term Within channel Sensitive to small phase change STRF fails Today Long term Speech-rate perception Large phase changes (relative to CF) STRF works great

12 Models Gammatone Critical band filters Psychoacoustic data Linear bandpass filters Meddis Hair Cell Nonlinear reservoir model

13 Models Lyon s Passive Ear Lowpass filters Transmission line Cascade Nonlinear automatic gain control (AGC) Passive

14 Models STRF Spatial temporal response function Based on Shamma s cortical data Fit spectrogram data Two-dimensional sinusoids Image domain

15 Patterson Phase Results Stimulus Constant phase shift per critical band Shifted phase gradually Result Small local changes Large global change Couldn t prove channel effect!

16 Patterson Stimuli Harmonic signal 31 harmonics Phase Alignment Cosine phase Random phase Alternating phase (APH)

17 Our Model Sound A Cochleagram Correlogram Cochleagram Cochleagram Memory? Comparison Comparison Sound B Cochleagram Correlogram Cochleagram Cochleagram Memory? Integrated meansquared error

18 Auditory Dimensions One Dimensional Pressure Time Cochlear Processing Two Dimensional Cochlear Place Time Correlogram Processing Three Dimensional Cochlear Place Autocorrelation Lag Time

19 Correlogram Structure Pitch Period Cochlear Place Autocorrelators Cochlea

20 Duda Tones Simple tone Harmonic complex Tone build up (slow and fast) Tone removal (from top and bottom) Wiggle

21 Cocheagram Correlogram 2 GFB: F0=62.5 LH=3 H= correlogram GFB Results APH Alternating phases are shifted Cochleagram Auditory nerve Error grows Temporal pattern Error falls sum square error sum square error sum square error aph degree 2 x 108 cochleagram MH aph degree 1.5 x 10-3 cochleagram LPE aph degree sum square error sum square error sum square error aph degree 10 x 106 correlogram MH aph degree 5 x 10-3 correlogram LPE aph degree

22 APH Results sum square error Correlogram --- Gammatone FB LH = 3 LH = 7 LH = 15 Test Error wrt phase Lowest harmonic Gammatone Too much variance No one threshold Lyon s Ear sum square error sum square error aph degrees 4 x 108 GFB + Meddis Hair Cell LH = 3 LH = 7 LH = aph degrees Lyon Passive Ear LH = LH = 7 LH = aph degrees

23 10 2 scaled, f = 62.5, 125 reference Threshold Gammatone No one threshold APH threshold 10 1 experiment f = 62.5 simulation f = 62.5 experiment f = 125 simulation f = lowest harmonics Lyon s Passive Ear Consistent results Depends on loudness (scaling) APH threshold reference experiment f = 62.5 simulation f = 62.5 experiment f = 125 simulation f = 125 unscaled, f = 62.5, lowest harmonics

24 Craig and Jeffress Two components Reverse polarity Task Detect difference Identify phase

25 Craig & Jeffress Results 6 x sum square error btw/ original & inversed φ = 0 φ = 45 φ = 90 φ = sum square error Cb level

26 To Do Improve results Tune model bandwidths Better cochlear/hair cell integration Verify two distinct regimes Break in performance Not

27 Conclusions Phase perception model Two different models Short-term within channel Long-term STRF? Better auditory model Less about critical bands

AUDL 4007 Auditory Perception. Week 1. The cochlea & auditory nerve: Obligatory stages of auditory processing

AUDL 4007 Auditory Perception. Week 1. The cochlea & auditory nerve: Obligatory stages of auditory processing AUDL 4007 Auditory Perception Week 1 The cochlea & auditory nerve: Obligatory stages of auditory processing 1 Think of the ear as a collection of systems, transforming sounds to be sent to the brain 25

More information

Hearing and Deafness 2. Ear as a frequency analyzer. Chris Darwin

Hearing and Deafness 2. Ear as a frequency analyzer. Chris Darwin Hearing and Deafness 2. Ear as a analyzer Chris Darwin Frequency: -Hz Sine Wave. Spectrum Amplitude against -..5 Time (s) Waveform Amplitude against time amp Hz Frequency: 5-Hz Sine Wave. Spectrum Amplitude

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 MODELING SPECTRAL AND TEMPORAL MASKING IN THE HUMAN AUDITORY SYSTEM PACS: 43.66.Ba, 43.66.Dc Dau, Torsten; Jepsen, Morten L.; Ewert,

More information

Spectro-Temporal Methods in Primary Auditory Cortex David Klein Didier Depireux Jonathan Simon Shihab Shamma

Spectro-Temporal Methods in Primary Auditory Cortex David Klein Didier Depireux Jonathan Simon Shihab Shamma Spectro-Temporal Methods in Primary Auditory Cortex David Klein Didier Depireux Jonathan Simon Shihab Shamma & Department of Electrical Engineering Supported in part by a MURI grant from the Office of

More information

COM325 Computer Speech and Hearing

COM325 Computer Speech and Hearing COM325 Computer Speech and Hearing Part III : Theories and Models of Pitch Perception Dr. Guy Brown Room 145 Regent Court Department of Computer Science University of Sheffield Email: g.brown@dcs.shef.ac.uk

More information

Acoustics, signals & systems for audiology. Week 4. Signals through Systems

Acoustics, signals & systems for audiology. Week 4. Signals through Systems Acoustics, signals & systems for audiology Week 4 Signals through Systems Crucial ideas Any signal can be constructed as a sum of sine waves In a linear time-invariant (LTI) system, the response to a sinusoid

More information

A102 Signals and Systems for Hearing and Speech: Final exam answers

A102 Signals and Systems for Hearing and Speech: Final exam answers A12 Signals and Systems for Hearing and Speech: Final exam answers 1) Take two sinusoids of 4 khz, both with a phase of. One has a peak level of.8 Pa while the other has a peak level of. Pa. Draw the spectrum

More information

Signals & Systems for Speech & Hearing. Week 6. Practical spectral analysis. Bandpass filters & filterbanks. Try this out on an old friend

Signals & Systems for Speech & Hearing. Week 6. Practical spectral analysis. Bandpass filters & filterbanks. Try this out on an old friend Signals & Systems for Speech & Hearing Week 6 Bandpass filters & filterbanks Practical spectral analysis Most analogue signals of interest are not easily mathematically specified so applying a Fourier

More information

You know about adding up waves, e.g. from two loudspeakers. AUDL 4007 Auditory Perception. Week 2½. Mathematical prelude: Adding up levels

You know about adding up waves, e.g. from two loudspeakers. AUDL 4007 Auditory Perception. Week 2½. Mathematical prelude: Adding up levels AUDL 47 Auditory Perception You know about adding up waves, e.g. from two loudspeakers Week 2½ Mathematical prelude: Adding up levels 2 But how do you get the total rms from the rms values of two signals

More information

Spectral and temporal processing in the human auditory system

Spectral and temporal processing in the human auditory system Spectral and temporal processing in the human auditory system To r s t e n Da u 1, Mo rt e n L. Jepsen 1, a n d St e p h a n D. Ew e r t 2 1Centre for Applied Hearing Research, Ørsted DTU, Technical University

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,

More information

Imagine the cochlea unrolled

Imagine the cochlea unrolled 2 2 1 1 1 1 1 Cochlea & Auditory Nerve: obligatory stages of auditory processing Think of the auditory periphery as a processor of signals 2 2 1 1 1 1 1 Imagine the cochlea unrolled Basilar membrane motion

More information

Psycho-acoustics (Sound characteristics, Masking, and Loudness)

Psycho-acoustics (Sound characteristics, Masking, and Loudness) Psycho-acoustics (Sound characteristics, Masking, and Loudness) Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University Mar. 20, 2008 Pure tones Mathematics of the pure

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence

More information

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner. Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions

More information

I R UNDERGRADUATE REPORT. Stereausis: A Binaural Processing Model. by Samuel Jiawei Ng Advisor: P.S. Krishnaprasad UG

I R UNDERGRADUATE REPORT. Stereausis: A Binaural Processing Model. by Samuel Jiawei Ng Advisor: P.S. Krishnaprasad UG UNDERGRADUATE REPORT Stereausis: A Binaural Processing Model by Samuel Jiawei Ng Advisor: P.S. Krishnaprasad UG 2001-6 I R INSTITUTE FOR SYSTEMS RESEARCH ISR develops, applies and teaches advanced methodologies

More information

THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES

THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES J. Bouše, V. Vencovský Department of Radioelectronics, Faculty of Electrical

More information

Signals, Sound, and Sensation

Signals, Sound, and Sensation Signals, Sound, and Sensation William M. Hartmann Department of Physics and Astronomy Michigan State University East Lansing, Michigan Л1Р Contents Preface xv Chapter 1: Pure Tones 1 Mathematics of the

More information

Distortion products and the perceived pitch of harmonic complex tones

Distortion products and the perceived pitch of harmonic complex tones Distortion products and the perceived pitch of harmonic complex tones D. Pressnitzer and R.D. Patterson Centre for the Neural Basis of Hearing, Dept. of Physiology, Downing street, Cambridge CB2 3EG, U.K.

More information

Chapter 2 A Silicon Model of Auditory-Nerve Response

Chapter 2 A Silicon Model of Auditory-Nerve Response 5 Chapter 2 A Silicon Model of Auditory-Nerve Response Nonlinear signal processing is an integral part of sensory transduction in the nervous system. Sensory inputs are analog, continuous-time signals

More information

The EarSpring Model for the Loudness Response in Unimpaired Human Hearing

The EarSpring Model for the Loudness Response in Unimpaired Human Hearing The EarSpring Model for the Loudness Response in Unimpaired Human Hearing David McClain, Refined Audiometrics Laboratory, LLC December 2006 Abstract We describe a simple nonlinear differential equation

More information

Neuronal correlates of pitch in the Inferior Colliculus

Neuronal correlates of pitch in the Inferior Colliculus Neuronal correlates of pitch in the Inferior Colliculus Didier A. Depireux David J. Klein Jonathan Z. Simon Shihab A. Shamma Institute for Systems Research University of Maryland College Park, MD 20742-3311

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Power spectrum model of masking Assumptions: Only frequencies within the passband of the auditory filter contribute to masking. Detection is based

More information

Auditory Based Feature Vectors for Speech Recognition Systems

Auditory Based Feature Vectors for Speech Recognition Systems Auditory Based Feature Vectors for Speech Recognition Systems Dr. Waleed H. Abdulla Electrical & Computer Engineering Department The University of Auckland, New Zealand [w.abdulla@auckland.ac.nz] 1 Outlines

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Verona, Italy, December 7-9,2 AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Tapio Lokki Telecommunications

More information

An introduction to physics of Sound

An introduction to physics of Sound An introduction to physics of Sound Outlines Acoustics and psycho-acoustics Sound? Wave and waves types Cycle Basic parameters of sound wave period Amplitude Wavelength Frequency Outlines Phase Types of

More information

Pressure vs. decibel modulation in spectrotemporal representations: How nonlinear are auditory cortical stimuli?

Pressure vs. decibel modulation in spectrotemporal representations: How nonlinear are auditory cortical stimuli? Pressure vs. decibel modulation in spectrotemporal representations: How nonlinear are auditory cortical stimuli? 1 2 1 1 David Klein, Didier Depireux, Jonathan Simon, Shihab Shamma 1 Institute for Systems

More information

Results of Egan and Hake using a single sinusoidal masker [reprinted with permission from J. Acoust. Soc. Am. 22, 622 (1950)].

Results of Egan and Hake using a single sinusoidal masker [reprinted with permission from J. Acoust. Soc. Am. 22, 622 (1950)]. XVI. SIGNAL DETECTION BY HUMAN OBSERVERS Prof. J. A. Swets Prof. D. M. Green Linda E. Branneman P. D. Donahue Susan T. Sewall A. MASKING WITH TWO CONTINUOUS TONES One of the earliest studies in the modern

More information

6.551j/HST.714j Acoustics of Speech and Hearing: Exam 2

6.551j/HST.714j Acoustics of Speech and Hearing: Exam 2 Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science, and The Harvard-MIT Division of Health Science and Technology 6.551J/HST.714J: Acoustics of Speech and Hearing

More information

A Pole Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data

A Pole Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data A Pole Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data Richard F. Lyon Google, Inc. Abstract. A cascade of two-pole two-zero filters with level-dependent

More information

Spectro-Temporal Processing of Dynamic Broadband Sounds In Auditory Cortex

Spectro-Temporal Processing of Dynamic Broadband Sounds In Auditory Cortex Spectro-Temporal Processing of Dynamic Broadband Sounds In Auditory Cortex Shihab Shamma Jonathan Simon* Didier Depireux David Klein Institute for Systems Research & Department of Electrical Engineering

More information

Complex Sounds. Reading: Yost Ch. 4

Complex Sounds. Reading: Yost Ch. 4 Complex Sounds Reading: Yost Ch. 4 Natural Sounds Most sounds in our everyday lives are not simple sinusoidal sounds, but are complex sounds, consisting of a sum of many sinusoids. The amplitude and frequency

More information

AUDITORY MODEL INVERSION FOR SOUND SEPARATION. Malcolm Slaney, Daniel Naar, and Richard F. Lyon Apple Computer, Inc.

AUDITORY MODEL INVERSION FOR SOUND SEPARATION. Malcolm Slaney, Daniel Naar, and Richard F. Lyon Apple Computer, Inc. AUDITORY MODEL INVERSION FOR SOUND SEPARATION Malcolm Slaney, Daniel Naar, and Richard F. Lyon Apple Computer, Inc., Cupertino, CA Techniques to recreate sounds from perceptual displays known as cochleagrams

More information

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Loudness & Temporal resolution

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Loudness & Temporal resolution AUDL GS08/GAV1 Signals, systems, acoustics and the ear Loudness & Temporal resolution Absolute thresholds & Loudness Name some ways these concepts are crucial to audiologists Sivian & White (1933) JASA

More information

AUDL Final exam page 1/7 Please answer all of the following questions.

AUDL Final exam page 1/7 Please answer all of the following questions. AUDL 11 28 Final exam page 1/7 Please answer all of the following questions. 1) Consider 8 harmonics of a sawtooth wave which has a fundamental period of 1 ms and a fundamental component with a level of

More information

ABSTRACT. Title of Document: SPECTROTEMPORAL MODULATION LISTENERS. Professor, Dr.Shihab Shamma, Department of. Electrical Engineering

ABSTRACT. Title of Document: SPECTROTEMPORAL MODULATION LISTENERS. Professor, Dr.Shihab Shamma, Department of. Electrical Engineering ABSTRACT Title of Document: SPECTROTEMPORAL MODULATION SENSITIVITY IN HEARING-IMPAIRED LISTENERS Golbarg Mehraei, Master of Science, 29 Directed By: Professor, Dr.Shihab Shamma, Department of Electrical

More information

AUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS)

AUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS) AUDL GS08/GAV1 Auditory Perception Envelope and temporal fine structure (TFS) Envelope and TFS arise from a method of decomposing waveforms The classic decomposition of waveforms Spectral analysis... Decomposes

More information

John Lazzaro and Carver Mead Department of Computer Science California Institute of Technology Pasadena, California, 91125

John Lazzaro and Carver Mead Department of Computer Science California Institute of Technology Pasadena, California, 91125 Lazzaro and Mead Circuit Models of Sensory Transduction in the Cochlea CIRCUIT MODELS OF SENSORY TRANSDUCTION IN THE COCHLEA John Lazzaro and Carver Mead Department of Computer Science California Institute

More information

Mel- frequency cepstral coefficients (MFCCs) and gammatone filter banks

Mel- frequency cepstral coefficients (MFCCs) and gammatone filter banks SGN- 14006 Audio and Speech Processing Pasi PerQlä SGN- 14006 2015 Mel- frequency cepstral coefficients (MFCCs) and gammatone filter banks Slides for this lecture are based on those created by Katariina

More information

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics

More information

Applying Models of Auditory Processing to Automatic Speech Recognition: Promise and Progress!

Applying Models of Auditory Processing to Automatic Speech Recognition: Promise and Progress! Applying Models of Auditory Processing to Automatic Speech Recognition: Promise and Progress! Richard Stern (with Chanwoo Kim, Yu-Hsiang Chiu, and others) Department of Electrical and Computer Engineering

More information

Human Auditory Periphery (HAP)

Human Auditory Periphery (HAP) Human Auditory Periphery (HAP) Ray Meddis Department of Human Sciences, University of Essex Colchester, CO4 3SQ, UK. rmeddis@essex.ac.uk A demonstrator for a human auditory modelling approach. 23/11/2003

More information

Ian C. Bruce Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205

Ian C. Bruce Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205 A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression Xuedong Zhang Hearing Research Center and Department of Biomedical Engineering,

More information

A Silicon Model of an Auditory Neural Representation of Spectral Shape

A Silicon Model of an Auditory Neural Representation of Spectral Shape A Silicon Model of an Auditory Neural Representation of Spectral Shape John Lazzaro 1 California Institute of Technology Pasadena, California, USA Abstract The paper describes an analog integrated circuit

More information

Auditory Toolbox: A MATLAB Toolbox for Auditory Modeling Work

Auditory Toolbox: A MATLAB Toolbox for Auditory Modeling Work Auditory Toolbox Malcolm Slaney Apple Technical Report #45 Apple Computer, Inc. Advanced Technology Group malcolm@apple.com 2-2 -4-6 1 2 1 3 1 4 Auditory Toolbox 1 2 Auditory Toolbox: A MATLAB Toolbox

More information

Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts

Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts POSTER 25, PRAGUE MAY 4 Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts Bc. Martin Zalabák Department of Radioelectronics, Czech Technical University in Prague, Technická

More information

Using the Gammachirp Filter for Auditory Analysis of Speech

Using the Gammachirp Filter for Auditory Analysis of Speech Using the Gammachirp Filter for Auditory Analysis of Speech 18.327: Wavelets and Filterbanks Alex Park malex@sls.lcs.mit.edu May 14, 2003 Abstract Modern automatic speech recognition (ASR) systems typically

More information

Chapter 12. Preview. Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect. Section 1 Sound Waves

Chapter 12. Preview. Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect. Section 1 Sound Waves Section 1 Sound Waves Preview Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect Section 1 Sound Waves Objectives Explain how sound waves are produced. Relate frequency

More information

Pitch estimation using spiking neurons

Pitch estimation using spiking neurons Pitch estimation using spiking s K. Voutsas J. Adamy Research Assistant Head of Control Theory and Robotics Lab Institute of Automatic Control Control Theory and Robotics Lab Institute of Automatic Control

More information

Multiresolution Spectrotemporal Analysis of Complex Sounds

Multiresolution Spectrotemporal Analysis of Complex Sounds 1 Multiresolution Spectrotemporal Analysis of Complex Sounds Taishih Chi, Powen Ru and Shihab A. Shamma Center for Auditory and Acoustics Research, Institute for Systems Research Electrical and Computer

More information

Chapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals

Chapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals Chapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals 2.1. Announcements Be sure to completely read the syllabus Recording opportunities for small ensembles Due Wednesday, 15 February:

More information

Chapter 3. Meeting 3, Psychoacoustics, Hearing, and Reflections

Chapter 3. Meeting 3, Psychoacoustics, Hearing, and Reflections Chapter 3. Meeting 3, Psychoacoustics, Hearing, and Reflections 3.1. Announcements Need schlep crew for Tuesday (and other days) Due Today, 15 February: Mix Graph 1 Quiz next Tuesday (we meet Tuesday,

More information

Monaural and binaural processing of fluctuating sounds in the auditory system

Monaural and binaural processing of fluctuating sounds in the auditory system Monaural and binaural processing of fluctuating sounds in the auditory system Eric R. Thompson September 23, 2005 MSc Thesis Acoustic Technology Ørsted DTU Technical University of Denmark Supervisor: Torsten

More information

III. Publication III. c 2005 Toni Hirvonen.

III. Publication III. c 2005 Toni Hirvonen. III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on

More information

What is Sound? Part II

What is Sound? Part II What is Sound? Part II Timbre & Noise 1 Prayouandi (2010) - OneOhtrix Point Never PSYCHOACOUSTICS ACOUSTICS LOUDNESS AMPLITUDE PITCH FREQUENCY QUALITY TIMBRE 2 Timbre / Quality everything that is not frequency

More information

PERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES ABSTRACT

PERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES ABSTRACT Approved for public release; distribution is unlimited. PERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES September 1999 Tien Pham U.S. Army Research

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

Additive Versus Multiplicative Combination of Differences of Interaural Time and Intensity

Additive Versus Multiplicative Combination of Differences of Interaural Time and Intensity Additive Versus Multiplicative Combination of Differences of Interaural Time and Intensity Samuel H. Tao Submitted to the Department of Electrical and Computer Engineering in Partial Fulfillment of the

More information

Principles of Musical Acoustics

Principles of Musical Acoustics William M. Hartmann Principles of Musical Acoustics ^Spr inger Contents 1 Sound, Music, and Science 1 1.1 The Source 2 1.2 Transmission 3 1.3 Receiver 3 2 Vibrations 1 9 2.1 Mass and Spring 9 2.1.1 Definitions

More information

Sensation and Perception

Sensation and Perception Page 94 Check syllabus! We are starting with Section 6-7 in book. Sensation and Perception Our Link With the World Shorter wavelengths give us blue experience Longer wavelengths give us red experience

More information

IN a natural environment, speech often occurs simultaneously. Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation

IN a natural environment, speech often occurs simultaneously. Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 5, SEPTEMBER 2004 1135 Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation Guoning Hu and DeLiang Wang, Fellow, IEEE Abstract

More information

SOUND QUALITY EVALUATION OF FAN NOISE BASED ON HEARING-RELATED PARAMETERS SUMMARY INTRODUCTION

SOUND QUALITY EVALUATION OF FAN NOISE BASED ON HEARING-RELATED PARAMETERS SUMMARY INTRODUCTION SOUND QUALITY EVALUATION OF FAN NOISE BASED ON HEARING-RELATED PARAMETERS Roland SOTTEK, Klaus GENUIT HEAD acoustics GmbH, Ebertstr. 30a 52134 Herzogenrath, GERMANY SUMMARY Sound quality evaluation of

More information

AUDITORY ILLUSIONS & LAB REPORT FORM

AUDITORY ILLUSIONS & LAB REPORT FORM 01/02 Illusions - 1 AUDITORY ILLUSIONS & LAB REPORT FORM NAME: DATE: PARTNER(S): The objective of this experiment is: To understand concepts such as beats, localization, masking, and musical effects. APPARATUS:

More information

A unitary model of pitch perception Ray Meddis and Lowel O Mard Department of Psychology, Essex University, Colchester CO4 3SQ, United Kingdom

A unitary model of pitch perception Ray Meddis and Lowel O Mard Department of Psychology, Essex University, Colchester CO4 3SQ, United Kingdom A unitary model of pitch perception Ray Meddis and Lowel O Mard Department of Psychology, Essex University, Colchester CO4 3SQ, United Kingdom Received 15 March 1996; revised 22 April 1997; accepted 12

More information

Modeling auditory processing of amplitude modulation I. Detection and masking with narrow-band carriers Dau, T.; Kollmeier, B.; Kohlrausch, A.G.

Modeling auditory processing of amplitude modulation I. Detection and masking with narrow-band carriers Dau, T.; Kollmeier, B.; Kohlrausch, A.G. Modeling auditory processing of amplitude modulation I. Detection and masking with narrow-band carriers Dau, T.; Kollmeier, B.; Kohlrausch, A.G. Published in: Journal of the Acoustical Society of America

More information

Cascades of two-pole two-zero asymmetric resonators are good models of peripheral auditory function

Cascades of two-pole two-zero asymmetric resonators are good models of peripheral auditory function Cascades of two-pole two-zero asymmetric resonators are good models of peripheral auditory function Richard F. Lyon a) Google Inc., 1600 Amphitheatre Parkway, Mountain View, California 94043 (Received

More information

Across frequency processing with time varying spectra

Across frequency processing with time varying spectra Bachelor thesis Across frequency processing with time varying spectra Handed in by Hendrike Heidemann Study course: Engineering Physics First supervisor: Prof. Dr. Jesko Verhey Second supervisor: Prof.

More information

Imperfect pitch: Gabor s uncertainty principle and the pitch of extremely brief sounds

Imperfect pitch: Gabor s uncertainty principle and the pitch of extremely brief sounds Psychon Bull Rev (2016) 23:163 171 DOI 10.3758/s13423-015-0863-y BRIEF REPORT Imperfect pitch: Gabor s uncertainty principle and the pitch of extremely brief sounds I-Hui Hsieh 1 & Kourosh Saberi 2 Published

More information

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O.

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Tone-in-noise detection: Observed discrepancies in spectral integration Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Box 513, NL-5600 MB Eindhoven, The Netherlands Armin Kohlrausch b) and

More information

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 6.1 AUDIBILITY OF COMPLEX

More information

Machine recognition of speech trained on data from New Jersey Labs

Machine recognition of speech trained on data from New Jersey Labs Machine recognition of speech trained on data from New Jersey Labs Frequency response (peak around 5 Hz) Impulse response (effective length around 200 ms) 41 RASTA filter 10 attenuation [db] 40 1 10 modulation

More information

Physics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound

Physics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound Physics 101 Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound Quiz: Monday Oct. 18; Chaps. 16,17,18(as covered in class),19 CR/NC Deadline Oct.

More information

Predicting discrimination of formant frequencies in vowels with a computational model of the auditory midbrain

Predicting discrimination of formant frequencies in vowels with a computational model of the auditory midbrain F 1 Predicting discrimination of formant frequencies in vowels with a computational model of the auditory midbrain Laurel H. Carney and Joyce M. McDonough Abstract Neural information for encoding and processing

More information

Multipitch Analysis and Tracking for Automatic Music Transcription

Multipitch Analysis and Tracking for Automatic Music Transcription University of New Orleans ScholarWorks@UNO University of New Orleans Theses and Dissertations Dissertations and Theses 5-21-2004 Multipitch Analysis and Tracking for Automatic Music Transcription Richard

More information

ANALOG IMPLEMENTATIONS OF AUDITORY MODELS. Richard F. Lyon

ANALOG IMPLEMENTATIONS OF AUDITORY MODELS. Richard F. Lyon ANALOG IMPLEMENTATIONS OF AUDITORY MODELS Richard F. Lyon Apple Computer, Inc. Cupertino, CA 95014 and California Institute of Technology Pasadena, CA 91125 ABSTRACT The challenge of making cost-effective

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Psychological and Physiological Acoustics Session 1pPPb: Psychoacoustics

More information

Interaction of Object Binding Cues in Binaural Masking Pattern Experiments

Interaction of Object Binding Cues in Binaural Masking Pattern Experiments Interaction of Object Binding Cues in Binaural Masking Pattern Experiments Jesko L.Verhey, Björn Lübken and Steven van de Par Abstract Object binding cues such as binaural and across-frequency modulation

More information

Comparison of Spectral Analysis Methods for Automatic Speech Recognition

Comparison of Spectral Analysis Methods for Automatic Speech Recognition INTERSPEECH 2013 Comparison of Spectral Analysis Methods for Automatic Speech Recognition Venkata Neelima Parinam, Chandra Vootkuri, Stephen A. Zahorian Department of Electrical and Computer Engineering

More information

I. INTRODUCTION J. Acoust. Soc. Am. 110 (3), Pt. 1, Sep /2001/110(3)/1628/13/$ Acoustical Society of America

I. INTRODUCTION J. Acoust. Soc. Am. 110 (3), Pt. 1, Sep /2001/110(3)/1628/13/$ Acoustical Society of America On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception a) Oded Ghitza Media Signal Processing Research, Agere Systems, Murray Hill, New Jersey

More information

ALTERNATING CURRENT (AC)

ALTERNATING CURRENT (AC) ALL ABOUT NOISE ALTERNATING CURRENT (AC) Any type of electrical transmission where the current repeatedly changes direction, and the voltage varies between maxima and minima. Therefore, any electrical

More information

ROBUST SPEECH RECOGNITION BASED ON HUMAN BINAURAL PERCEPTION

ROBUST SPEECH RECOGNITION BASED ON HUMAN BINAURAL PERCEPTION ROBUST SPEECH RECOGNITION BASED ON HUMAN BINAURAL PERCEPTION Richard M. Stern and Thomas M. Sullivan Department of Electrical and Computer Engineering School of Computer Science Carnegie Mellon University

More information

Lecture 7: Superposition and Fourier Theorem

Lecture 7: Superposition and Fourier Theorem Lecture 7: Superposition and Fourier Theorem Sound is linear. What that means is, if several things are producing sounds at once, then the pressure of the air, due to the several things, will be and the

More information

A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL

A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL 9th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, -7 SEPTEMBER 7 A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL PACS: PACS:. Pn Nicolas Le Goff ; Armin Kohlrausch ; Jeroen

More information

Biomedical Engineering Evoked Responses

Biomedical Engineering Evoked Responses Biomedical Engineering Evoked Responses Dr. rer. nat. Andreas Neubauer andreas.neubauer@medma.uni-heidelberg.de Tel.: 0621 383 5126 Stimulation of biological systems and data acquisition 1. How can biological

More information

Temporal resolution AUDL Domain of temporal resolution. Fine structure and envelope. Modulating a sinusoid. Fine structure and envelope

Temporal resolution AUDL Domain of temporal resolution. Fine structure and envelope. Modulating a sinusoid. Fine structure and envelope Modulating a sinusoid can also work this backwards! Temporal resolution AUDL 4007 carrier (fine structure) x modulator (envelope) = amplitudemodulated wave 1 2 Domain of temporal resolution Fine structure

More information

MUSC 316 Sound & Digital Audio Basics Worksheet

MUSC 316 Sound & Digital Audio Basics Worksheet MUSC 316 Sound & Digital Audio Basics Worksheet updated September 2, 2011 Name: An Aggie does not lie, cheat, or steal, or tolerate those who do. By submitting responses for this test you verify, on your

More information

Estimating critical bandwidths of temporal sensitivity to low-frequency amplitude modulation

Estimating critical bandwidths of temporal sensitivity to low-frequency amplitude modulation Estimating critical bandwidths of temporal sensitivity to low-frequency amplitude modulation Allison I. Shim a) and Bruce G. Berg Department of Cognitive Sciences, University of California, Irvine, Irvine,

More information

FFT 1 /n octave analysis wavelet

FFT 1 /n octave analysis wavelet 06/16 For most acoustic examinations, a simple sound level analysis is insufficient, as not only the overall sound pressure level, but also the frequency-dependent distribution of the level has a significant

More information

A cat's cocktail party: Psychophysical, neurophysiological, and computational studies of spatial release from masking

A cat's cocktail party: Psychophysical, neurophysiological, and computational studies of spatial release from masking A cat's cocktail party: Psychophysical, neurophysiological, and computational studies of spatial release from masking Courtney C. Lane 1, Norbert Kopco 2, Bertrand Delgutte 1, Barbara G. Shinn- Cunningham

More information

Preface A detailed knowledge of the processes involved in hearing is an essential prerequisite for numerous medical and technical applications, such a

Preface A detailed knowledge of the processes involved in hearing is an essential prerequisite for numerous medical and technical applications, such a Modeling auditory processing of amplitude modulation Torsten Dau Preface A detailed knowledge of the processes involved in hearing is an essential prerequisite for numerous medical and technical applications,

More information

MUS 302 ENGINEERING SECTION

MUS 302 ENGINEERING SECTION MUS 302 ENGINEERING SECTION Wiley Ross: Recording Studio Coordinator Email =>ross@email.arizona.edu Twitter=> https://twitter.com/ssor Web page => http://www.arts.arizona.edu/studio Youtube Channel=>http://www.youtube.com/user/wileyross

More information

MOST MODERN automatic speech recognition (ASR)

MOST MODERN automatic speech recognition (ASR) IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 5, NO. 5, SEPTEMBER 1997 451 A Model of Dynamic Auditory Perception and Its Application to Robust Word Recognition Brian Strope and Abeer Alwan, Member,

More information

An unnatural test of a natural model of pitch perception: The tritone paradox and spectral dominance

An unnatural test of a natural model of pitch perception: The tritone paradox and spectral dominance An unnatural test of a natural model of pitch perception: The tritone paradox and spectral dominance Richard PARNCUTT, University of Graz Amos Ping TAN, Universal Music, Singapore Octave-complex tone (OCT)

More information

Monaural and Binaural Speech Separation

Monaural and Binaural Speech Separation Monaural and Binaural Speech Separation DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction CASA approach to sound separation Ideal binary mask as

More information

c 2014 Brantly A. Sturgeon

c 2014 Brantly A. Sturgeon c 2014 Brantly A. Sturgeon AUDITORY MODEL COMPARISON AND OPTIMIZATION USING DYNAMIC TIME WARPING BY BRANTLY A. STURGEON THESIS Submitted in partial fulfillment of the requirements for the degree of Master

More information

The role of intrinsic masker fluctuations on the spectral spread of masking

The role of intrinsic masker fluctuations on the spectral spread of masking The role of intrinsic masker fluctuations on the spectral spread of masking Steven van de Par Philips Research, Prof. Holstlaan 4, 5656 AA Eindhoven, The Netherlands, Steven.van.de.Par@philips.com, Armin

More information

An auditory model that can account for frequency selectivity and phase effects on masking

An auditory model that can account for frequency selectivity and phase effects on masking Acoust. Sci. & Tech. 2, (24) PAPER An auditory model that can account for frequency selectivity and phase effects on masking Akira Nishimura 1; 1 Department of Media and Cultural Studies, Faculty of Informatics,

More information

Music 171: Amplitude Modulation

Music 171: Amplitude Modulation Music 7: Amplitude Modulation Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) February 7, 9 Adding Sinusoids Recall that adding sinusoids of the same frequency

More information

A binaural auditory model and applications to spatial sound evaluation

A binaural auditory model and applications to spatial sound evaluation A binaural auditory model and applications to spatial sound evaluation Ma r k o Ta k a n e n 1, Ga ë ta n Lo r h o 2, a n d Mat t i Ka r ja l a i n e n 1 1 Helsinki University of Technology, Dept. of Signal

More information