Perceptual Distortion Maps for Room Reverberation

Size: px
Start display at page:

Download "Perceptual Distortion Maps for Room Reverberation"

Transcription

1 Perceptual Distortion Maps for oom everberation Thomas Zarouchas 1 John Mourjopoulos 1 1 Audio and Acoustic Technology Group Wire Communications aboratory Electrical Engineering and Computer Engineering Department University of Patras Greece thozar@wcl.ee.upatras.gr mourjop@wcl.ee.upatras.gr ABSTACT From reverberated audio signals and using as reference the input (anechoic) audio a number of distortion maps are etracted indicating how room reverberation distorts in time-frequency scales perceived features in the received signal. These maps are simplified to describe the monaural time-frequency / level distortions and the distortion of the spatial cues (i.e. inter-channel cues and coherence) which are very important for sound localization in a reverberant environment. Such maps here are studied as functions of room parameters (size acoustics distance etc) as well as due to input signal properties. Overall perceptual distortion ratings are produced and reverberationresilient signal features are etracted. 1. INTODUCTION oom acoustics introduce reverberation to audio signals which is usually formally described by the linear system response functions (e.g. convolutional input/output relationships using appropriate oom Impulse responses). Such approach helps to describe up to a certain degree features of reverberation important from a signal processing perspective [1 2 3]. However the perception of reverberation is a very comple phenomenon resulting from time-frequency delay level directional and signal-dependent cues [ ]. Currently there is a significant gap between the objective and the subjective approach for analyzing such phenomena. This wor etends earlier published results in signal processing-based methodology to deal with room reverberation [8 9] and also recent attempts to introduce perceptually motivated models for similar applications. Specifically in this wor a Computational Auditory Masing Model (CAMM) [1011] complemented by a novel Inter-channel Cue Mapping Module (ICMM) is used for the perceptual description of reverberation distortions in audio signals and the degradations of the stereo image in a typical audio reproduction setup. corresponding auditory model time/frequency regions with significant degradation due to reverberation. Furthermore the output of the inter-channel cue process module indicates the modification of the relevant spatial cues due to reverberation. In both cases the outputs of the CAMM and ICMM are presented in a form of timefrequency (2-D) maps [12]. The paper is organized as follow: In section 2 the analysis scheme for the etraction of the distortion maps of room reverberation is presented. In section 3 the utilization of the Computational Auditory Masing Model to derive the distortion maps due to reverberation is presented. In section 4 the Inter-channel Cue Mapping Model is presented. Simulation results are given in section 5. Finally some conclusions are drawn in section DISTOTION MAPS FO EVEBEATION The proposed structure for the evaluation of the distortion maps of room reverberation is shown in Fig. 1 for stereo reproduction. The concept can be etended to multi-channel audio signal reproduction. The proposed method requires as inputs the anechoic audio signal and the corresponding reverberant signal. According to this approach it is possible to locate from the evaluated internal representations of the

2 EVEBEATION DISTOTION MAPS audio signal generated in any real room as can be measured via an omni directional microphone and was described in detail in [14]. Alternatively a simulated reverberant signal may be obtained via convolution with a measured room impulse response. The CAMM derives the monaural internal representations of the audio signal in a number of frequency bands which are inputted to a Decision-Threshold Device (DTD). The output of the DTD represents time-frequency maps with significant reverberation distortions. The concept of the DTD is based on the Just Noticeable Intensity Difference of the internal signal representations. Figure 1: Analysis scheme for the assessment of the perceptual distortion cues The scheme shown in Fig. 1 employs a monaural Masing Model for estimation of perceived distortion due to reverberation complemented by a Inter-channel Cue Mapping Module (ICMM) for the evaluation of the alterations in the relevant spatial cues. Inputs to the proposed method are the source audio signals and estimates or measurements of the reverberant audio signals generated in any real room. For the analysis in the monaural (CAMM) and spatial cues (ICMM) the signals are transformed into the timefrequency domain. For the purpose of this processing a novel filterban is utilized with near-perfect reconstruction properties enabling fleible signal modification. This filterban provides non-uniform analysis bands with sufficient frequency resolution in order to capture the perceptually relevant cues at low frequencies following closely the critical band / EB scale. The sub-band domain signals s n are obtained as: M1 K m0 s n s n m h( m) cos (1 ) snis the input signal M is the length of a where prototype filter hn and can be considered as a function of the sub-band inde the number of subbands K and a phase parameter φ [15 16]. 3. DISTOTION MAPS BASED ON MONAUA MASKING MODE The Computational Auditory Masing Model used here can successfully emulate many aspects of the monaural signal processing of the auditory system. Input to the auditory model are a single channel of the original audio signal anechoic and the corresponding reverberant The detailed structure for the evaluation of the distortions due to reverberation is shown in Figure 2. Input to the filter ban is the original (source) single channel audio signal n and the corresponding n. The sub- reverberant single channel audio signal band signals see Eq. (1) n and n are then inputted to the CAMM producing the monaural internal representations z n and z n respectively. The Decision-Threshold Device (DTD) accompanied with a set of thresholds T n is utilized to etract the difference n according to n z n z n (2 ) and therefore to derive the parameter D n n T n (3 ) The D n parameter indicates the degree of the perceived distortions due to reverberation above the specified threshold when D n 0 in the timefrequency domain for single channel audio signals and generally 0 D n 1 (4 ) It is clear that the CAMM can easily be etended to evaluate separately the parameters D nfor the 2 channels of a stereo signal. However in such a case any binaural masing mechanisms are not considered. Page 2 of 8

3 EVEBEATION DISTOTION MAPS N p n n m m1 N p n n m m1 2 2 (5 ) and cross-power estimate between the two channels (left-right) is also performed according to N (6 ) m1 p n n m n m Figure 2: Analysis scheme for processing of reference and reverberant audio signals in order to derive the time-frequency map of parameter D n 4. DISTOTION MAPS BASED ON INTE- CHANNE CUE MAPPING Input to the Inter-channel Cue Mapping Module are stereo channels for both the source audio signal and the corresponding reverberant signal. The relevant spatial cues [413] eamined here are the inter-channel level difference (ID) inter-channel time difference (ITD) and inter-channel coherence (IC). These are derived for all signals and channels independently in each frequency band and as function of time (see Figure 3). As it is nown [4 13] Inter Channel evel Difference (ICD [db]) denotes the level/intensity differences between two (left right) channels: ICD n p 10log10 p with a typical level range of n (7 ) ICD (8 ) Inter Channel Time Difference (ICTD [samples]) describes the time difference between two channels and is the time instance at which the maimum value of a short-time estimate of the normalized cross-correlation function has occurred i.e.: n p p n p n (9 ) ICTD has a typical time range (samples) of n ICTD n (10 ) Figure 3: Analysis scheme of the Inter-channel Cue Mapping Module Short-time estimates of the power for each channel (i.e. left right for a typical stereo setup) and for each subband are considered for a window size of N samples according to: Inter Channel Coherence (ICC) defines the coherence between two channels ( and ) and can be epressed as: ICC n ma n (11 ) nn0 considering the maimum value of the instantaneous normalized cross-correlation. ICC has a range of: 0 ICC n 1 (12 ) 0 where 1 indicates that and are perfectly coherent. Page 3 of 8

4 EVEBEATION DISTOTION MAPS Based on the definitions of equations (7-12) the evaluation of the spatial cues for both the source and the reverberant signal (denoted by ~) is performed as: ICD and ICD n ICTD n and n ICTD n 0 ICC 1 and 0 ICC 1 1 n 1 2 n 2 1 n 1 2 n 2 (13 ) Here from these maps which correspond to source and received signals distortion maps will be evaluated (in the time-frequency domain) defined by the differences of these maps. Therefore the differential metric (distortion map) for each cue is introduced according to: ICD ICD t n ICTD ICTDn c n ICC ICCn (14 ) Based on equations ( ) the typical level time and coherence range for each differential metric will be: 1 2 n 1 2 t n n n n n 1 2 c (15 ) Hence the output of the ICMM according to equation (15) indicates the variation of the inter-channel cues in the time-frequency domain for both signals in a form of differential cue mapping. Typical differential maps for a number of test cases are shown in Figures 6 and 7. For c t the differential metrics ( n ) and for the distortion parameter ( D ) the mean values in a frame by frame basis can be also evaluated for each test case. Additional a logarithmic epression of the corresponded mean values (ecept for estimated according to: db 10 n ) can also i 20 log X i i=1...m (16 ) where Xn i is the mean value of the corresponding differential metric or the distortion parameter in frame i for a number of M frames and typical frame length of 1024 samples leading to a simplified interpretation of the overall perceived signal-dependent distortion which can be assigned to each map. 5. TESTS AND ESUTS Preliminary tests were conducted having as reference typical stereo 16 bit resolution signals at f s = 44100Hz. These tests were using input (reference) audio signals of different categories i.e. big band jazz (JAZZ) solo classical piano (PIANO) and male speech (SPEECH) and as second input the corresponding signals recorded under various reverberation conditions in different real enclosures (see Table 1) ranging from a acoustically treated laboratory to a large sports hall. From this set of distortion maps the local variations and the overall distortion metrics for each specific test case are evaluated for typical audio signals and reverberation conditions. oom Dimensions WH(m) T(sec) (freq avg.) Type aboratory Classroom Sports hall Table 1: Properties of rooms used for tests Table 2 indicates the variation of the monaural masing distortion parameter D (db) for different audio signals recorded in three different enclosures with varying acoustical properties. As it is shown room 3 (large sports hall) indicates a higher degree of perceived distortion for all types of signal. Signal oom JAZZ PIANO SPEECH Table 2: Monaural masing distortion parameter D (db) for different real enclosures and different audio signals The variation of the distortion parameter D (db) and the corresponding distortion map based on CAMM for a reverberant audio signal segment recorded in room 3 are shown in Figure 4. As can be observed in Fig. 4(c) the frequency averaged distortion metric Page 4 of 8

5 EVEBEATION DISTOTION MAPS D increases during the reverberant decay of the piano note (shown without reverberation in Fig. 4(a)) indicating the increase of the perceived distortion due to reverberant tail. The corresponding 2-D perceptually motivated map of Fig. 4(d) gives a more detailed illustration of the corresponding time-frequency distortions indicating in red signal regions with higher degree of perceived distortion. The effect of different room acoustics on the variation of distortion parameter D is shown in Figure 5. Note that the dashed line in each case indicates the frequency-averaged mean value of the perceived distortion for the corresponding audio signal segment. It is clear that the mean metric increases with reverberation time and the perceived effects of reverberation are more pronounced for the larger rooms (Fig. 5(a) and 5(b)) than for the acoustically treated room (Fig. 5(c)). Furthermore heavy reverberation seems to lower the frequency and depth of the modulations of the perceived distortion. higher dispersion can be observed for each differential inter-channel metric (Fig. 7). 6. CONCUSIONS The wor illustrates the efficiency of the proposed Computationally Auditory Masing Model and the novel Inter-channel Cue Mapping Module to describe with appropriate 2-D maps the perceived distortion and the general degradation of audio signals due to reverberation. As it was shown signal-dependent perceived distortions can be isolated into specific timefrequency regions. Table 3 shows the variation of the inter-channel differential metrics for the above enclosures using the PIANO as input signal. As it is shown the divergence in the inter-channel coherence differential c and the inter-channel time differential t metrics between rooms 1 and room 3 is close to 3 db. Differential oom Metric Coherence c evel Time t Table 3: Differential metrics for different real enclosures and PIANO as a test signal For the differential inter-channel metrics similar overall trends as with the previously described monaural metric can be observed in Figures 6 and 7 corresponding to the acoustically treated room (Fig. 6) and the large sports hall (Fig. 7). For low reverberation all differential interchannel metrics display low dispersion (distortion maps have large time-frequency regions close to green i.e. 0 db for metric). Furthermore deviations are low around this value. With heavy reverberation conditions Page 5 of 8

6 EVEBEATION DISTOTION MAPS Figure 4: (a) source audio signal segment (PIANO) (b) corresponding reverberant signal recorded in oom 3 (c) distortion parameter (db) (d) Distortion Map D based on CAMM for reverberant signal These distortion maps illustrate different aspects of perceived degradations from monaural masing due to reverberant decay to inter-channel level time and coherence variations in stereo signals. Figure 5: Distortion parameter for PIANO audio segment (a) room 3 (b) room 2 (c) room 1. Dashed line indicates overall mean value This detailed identification of the distortions can allow novel signal-processing methods to evolve so that such distortions can be suppressed without or in conjunction with the more traditional inverse filter based methods [14]. It is also promising that both short-term and long-term trends of the proposed distortion metrics seem to follow the trends in the established physical acoustical parameters of the recorded space. However unlie eisting acoustical measurements the proposed distortion maps are dynamically-varying with the signal evolution and are dependent on the specific audio signal. Hence such maps may help to reconsider the problem of reverberation from a signal-processing perspective that is closer to perception and the specific signal reproduced inside such an enclosure. Page 6 of 8

7 Zarouchas Mourjopoulos EVEBEATION DISTOTION MAPS Figure 6: Differential Cue Mapping (a) inter-channel coherence (b) inter-channel level difference (c) interchannel time difference for room 1 and JAZZ as test signal Future wor will eamine the variation and sensitivity of the differential metrics introduced here with respect to different source receiver positions leading to a hierarchical structure of the relative importance of each differential metric. Figure 7: Differential Cue Mapping (a) inter-channel coherence (b) inter-channel level difference (c) interchannel time difference for room 3 and JAZZ as test signal 7. EFEENCES [1] M.. Schroeder B. F. ogan Colorless Artificial everberation Journal of the Audio Engineering Society Vol. 9 p Page 7 of 8

8 EVEBEATION DISTOTION MAPS [2] S. T. Neely J. B. Allen Invertibility of a oom Impulse esponse Journal of the Acoustical Society of America Vol. 66 pp [3] J. N. Mourjopoulos Digital Equalization of oom Acoustics Journal of the Audio Engineering Society Vol. 42 No 11 pp [4] J. Blauert (1997). Spatial Hearing: The Psychophysics of Human ocalization (evised Edition) The MIT press USA-Cambridge. [5] J. M. Buchholz J. Mourjopoulos J. Blauert oom Masing: Understanding and Modeling the Masing of oom eflections 110 th AES Convention Amsterdam May 2001 preprint (5312). [6]. H. Bolt A. D. MacDonald Theory of Speech Masing by everberation Journal of the Acoustical Society of America Vol. 21(6) pp [7] F. E. Toole oudspeaers and ooms for Sound eproduction A Scientific eview Journal of the Audio Engineering Society Vol. 54 No. 6 June Based on Statistics of Binaural Interaction IEEE Transactions on Audio Speech and anguage Processing Vol. 14 No. 1 January [13] C. Faller J. Merimaa Source ocalization in Comple istening Situations: Selection of Binaural Cues Based on Interaural Coherence Journal of the Acoustical Society of America Vol. 116(5) pp November [14] T. Zarouchas J. Mourjopoulos J. Buchholz P. Hatziantoniou A Perceptual Measure for Assessing and emoving everberation from Audio Signals 120 th AES Convention Paris May 2006 preprint (6702). [15] ISO/IEC Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s Part 3: Audio. [16] J. Breebaart S. van de Par A. Kohlrausch E. Schuijers Parametric Coding of Stereo Audio EUASIP Journal on Applied Signal Processing Vol Issue 9 pages [8] J.. Flanagan. C. ummis Signal Processing to educe Multipath Distortions in Small ooms Journal of the Audio Engineering Society Vol. 47 pp [9] J. B. Allen D. A. Berley J. Blauert Multimicrophone Signal Processing Technique to emove oom everberation from Speech Signals Journal of the Acoustical Society of America Vol. 64(2) pp [10] J. M. Buchholz J. Mourjopoulos A Computational Auditory Masing Model Based on Signal-Dependent Compression. I. Model Description and Performance Analysis Acta Acustica United with Acustica Vol. 90 pp (2004). [11] J. M. Buchholz J. Mourjopoulos A Computational Auditory Masing Model Based on Signal-Dependent Compression. II. Model Simulations and Analytical Approimations Acta Acustica United with Acustica Vol. 90 pp (2004). [12] S. Harding J. Barer G. J. Brown Mas Estimation for Missing Data Speech ecognition Page 8 of 8

Subband Analysis of Time Delay Estimation in STFT Domain

Subband Analysis of Time Delay Estimation in STFT Domain PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,

More information

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Verona, Italy, December 7-9,2 AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Tapio Lokki Telecommunications

More information

A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL

A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL 9th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, -7 SEPTEMBER 7 A CLOSER LOOK AT THE REPRESENTATION OF INTERAURAL DIFFERENCES IN A BINAURAL MODEL PACS: PACS:. Pn Nicolas Le Goff ; Armin Kohlrausch ; Jeroen

More information

Analysis of room transfer function and reverberant signal statistics

Analysis of room transfer function and reverberant signal statistics Analysis of room transfer function and reverberant signal statistics E. Georganti a, J. Mourjopoulos b and F. Jacobsen a a Acoustic Technology Department, Technical University of Denmark, Ørsted Plads,

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Architectural Acoustics Session 2aAAa: Adapting, Enhancing, and Fictionalizing

More information

III. Publication III. c 2005 Toni Hirvonen.

III. Publication III. c 2005 Toni Hirvonen. III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on

More information

A spatial squeezing approach to ambisonic audio compression

A spatial squeezing approach to ambisonic audio compression University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 A spatial squeezing approach to ambisonic audio compression Bin Cheng

More information

Psychoacoustic Cues in Room Size Perception

Psychoacoustic Cues in Room Size Perception Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany 6084 This convention paper has been reproduced from the author s advance manuscript, without editing,

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

Binaural Cue Coding Part I: Psychoacoustic Fundamentals and Design Principles

Binaural Cue Coding Part I: Psychoacoustic Fundamentals and Design Principles IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 6, NOVEMBER 2003 509 Binaural Cue Coding Part I: Psychoacoustic Fundamentals and Design Principles Frank Baumgarte and Christof Faller Abstract

More information

A binaural auditory model and applications to spatial sound evaluation

A binaural auditory model and applications to spatial sound evaluation A binaural auditory model and applications to spatial sound evaluation Ma r k o Ta k a n e n 1, Ga ë ta n Lo r h o 2, a n d Mat t i Ka r ja l a i n e n 1 1 Helsinki University of Technology, Dept. of Signal

More information

Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation

Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation Sampo Vesa Master s Thesis presentation on 22nd of September, 24 21st September 24 HUT / Laboratory of Acoustics

More information

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations György Wersényi Széchenyi István University, Hungary. József Répás Széchenyi István University, Hungary. Summary

More information

IMPROVED COCKTAIL-PARTY PROCESSING

IMPROVED COCKTAIL-PARTY PROCESSING IMPROVED COCKTAIL-PARTY PROCESSING Alexis Favrot, Markus Erne Scopein Research Aarau, Switzerland postmaster@scopein.ch Christof Faller Audiovisual Communications Laboratory, LCAV Swiss Institute of Technology

More information

The Human Auditory System

The Human Auditory System medial geniculate nucleus primary auditory cortex inferior colliculus cochlea superior olivary complex The Human Auditory System Prominent Features of Binaural Hearing Localization Formation of positions

More information

From Binaural Technology to Virtual Reality

From Binaural Technology to Virtual Reality From Binaural Technology to Virtual Reality Jens Blauert, D-Bochum Prominent Prominent Features of of Binaural Binaural Hearing Hearing - Localization Formation of positions of the auditory events (azimuth,

More information

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York Audio Engineering Society Convention Paper Presented at the 115th Convention 2003 October 10 13 New York, New York This convention paper has been reproduced from the author's advance manuscript, without

More information

Binaural Hearing. Reading: Yost Ch. 12

Binaural Hearing. Reading: Yost Ch. 12 Binaural Hearing Reading: Yost Ch. 12 Binaural Advantages Sounds in our environment are usually complex, and occur either simultaneously or close together in time. Studies have shown that the ability to

More information

Digital Loudspeaker Arrays driven by 1-bit signals

Digital Loudspeaker Arrays driven by 1-bit signals Digital Loudspeaer Arrays driven by 1-bit signals Nicolas Alexander Tatlas and John Mourjopoulos Audiogroup, Electrical Engineering and Computer Engineering Department, University of Patras, Patras, 265

More information

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation

More information

MPEG-4 Structured Audio Systems

MPEG-4 Structured Audio Systems MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content

More information

QUALITY ASSESSMENT OF MULTI-CHANNEL AUDIO PROCESSING SCHEMES BASED ON A BINAURAL AUDITORY MODEL

QUALITY ASSESSMENT OF MULTI-CHANNEL AUDIO PROCESSING SCHEMES BASED ON A BINAURAL AUDITORY MODEL 214 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) QUALITY ASSESSMENT OF MULTI-CHANNEL AUDIO PROCESSING SCHEMES BASED ON A BINAURAL AUDITORY MODEL Jan-Hendrik Fleßner

More information

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS ' FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS Frédéric Abrard and Yannick Deville Laboratoire d Acoustique, de

More information

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks Mariam Yiwere 1 and Eun Joo Rhee 2 1 Department of Computer Engineering, Hanbat National University,

More information

Nonuniform multi level crossing for signal reconstruction

Nonuniform multi level crossing for signal reconstruction 6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven

More information

PRIMARY-AMBIENT SOURCE SEPARATION FOR UPMIXING TO SURROUND SOUND SYSTEMS

PRIMARY-AMBIENT SOURCE SEPARATION FOR UPMIXING TO SURROUND SOUND SYSTEMS PRIMARY-AMBIENT SOURCE SEPARATION FOR UPMIXING TO SURROUND SOUND SYSTEMS Karim M. Ibrahim National University of Singapore karim.ibrahim@comp.nus.edu.sg Mahmoud Allam Nile University mallam@nu.edu.eg ABSTRACT

More information

Listening with Headphones

Listening with Headphones Listening with Headphones Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back Substantial individual differences Most evident in elevation

More information

THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES

THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES THE MATLAB IMPLEMENTATION OF BINAURAL PROCESSING MODEL SIMULATING LATERAL POSITION OF TONES WITH INTERAURAL TIME DIFFERENCES J. Bouše, V. Vencovský Department of Radioelectronics, Faculty of Electrical

More information

This is an electronic reprint of the original article. This reprint may differ from the original in pagination and typographic detail.

This is an electronic reprint of the original article. This reprint may differ from the original in pagination and typographic detail. Powered by TCPDF (www.tcpdf.org) This is an electronic reprint of the original article. This reprint may differ from the original in pagination and typographic detail. Author(s): Title: Mikko-Ville Laitinen,

More information

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS PACS Reference: 43.66.Pn THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS Pauli Minnaar; Jan Plogsties; Søren Krarup Olesen; Flemming Christensen; Henrik Møller Department of Acoustics Aalborg

More information

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model Sebastian Merchel and Stephan Groth Chair of Communication Acoustics, Dresden University

More information

Spatial Audio Reproduction: Towards Individualized Binaural Sound

Spatial Audio Reproduction: Towards Individualized Binaural Sound Spatial Audio Reproduction: Towards Individualized Binaural Sound WILLIAM G. GARDNER Wave Arts, Inc. Arlington, Massachusetts INTRODUCTION The compact disc (CD) format records audio with 16-bit resolution

More information

A generalized framework for binaural spectral subtraction dereverberation

A generalized framework for binaural spectral subtraction dereverberation A generalized framework for binaural spectral subtraction dereverberation Alexandros Tsilfidis, Eleftheria Georganti, John Mourjopoulos Audio and Acoustic Technology Group, Department of Electrical and

More information

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Jie Huang, Katsunori Kume, Akira Saji, Masahiro Nishihashi, Teppei Watanabe and William L. Martens The University of Aizu Aizu-Wakamatsu,

More information

396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011

396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence

More information

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015 Final Exam Study Guide: 15-322 Introduction to Computer Music Course Staff April 24, 2015 This document is intended to help you identify and master the main concepts of 15-322, which is also what we intend

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array 2012 2nd International Conference on Computer Design and Engineering (ICCDE 2012) IPCSIT vol. 49 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V49.14 Simultaneous Recognition of Speech

More information

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O.

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Tone-in-noise detection: Observed discrepancies in spectral integration Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Box 513, NL-5600 MB Eindhoven, The Netherlands Armin Kohlrausch b) and

More information

Wavelet Speech Enhancement based on the Teager Energy Operator

Wavelet Speech Enhancement based on the Teager Energy Operator Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose

More information

Spatial audio is a field that

Spatial audio is a field that [applications CORNER] Ville Pulkki and Matti Karjalainen Multichannel Audio Rendering Using Amplitude Panning Spatial audio is a field that investigates techniques to reproduce spatial attributes of sound

More information

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES J. Rauhala, The beating equalizer and its application to the synthesis and modification of piano tones, in Proceedings of the 1th International Conference on Digital Audio Effects, Bordeaux, France, 27,

More information

Evaluation of Audio Compression Artifacts M. Herrera Martinez

Evaluation of Audio Compression Artifacts M. Herrera Martinez Evaluation of Audio Compression Artifacts M. Herrera Martinez This paper deals with subjective evaluation of audio-coding systems. From this evaluation, it is found that, depending on the type of signal

More information

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION T Spenceley B Wiggins University of Derby, Derby, UK University of Derby,

More information

Enhancing 3D Audio Using Blind Bandwidth Extension

Enhancing 3D Audio Using Blind Bandwidth Extension Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,

More information

PITCH-TRACKING OF REVERBERANT SOUNDS, APPLICATION TO SPATIAL DESCRIPTION OF SOUND SCENES

PITCH-TRACKING OF REVERBERANT SOUNDS, APPLICATION TO SPATIAL DESCRIPTION OF SOUND SCENES PITCH-TRACKING OF REVERBERANT SOUNDS, APPLICATION TO SPATIAL DESCRIPTION OF SOUND SCENES ALEXIS BASKIND AND ALAIN DE CHEVEIGNÉ IRCAM, 1 place Igor-Stravinsky, 74 Paris, France Alexis.Baskind@ircam.fr Alain.de.Cheveigne@ircam.fr

More information

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR BeBeC-2016-S9 BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR Clemens Nau Daimler AG Béla-Barényi-Straße 1, 71063 Sindelfingen, Germany ABSTRACT Physically the conventional beamforming method

More information

IMPLEMENTATION AND APPLICATION OF A BINAURAL HEARING MODEL TO THE OBJECTIVE EVALUATION OF SPATIAL IMPRESSION

IMPLEMENTATION AND APPLICATION OF A BINAURAL HEARING MODEL TO THE OBJECTIVE EVALUATION OF SPATIAL IMPRESSION IMPLEMENTATION AND APPLICATION OF A BINAURAL HEARING MODEL TO THE OBJECTIVE EVALUATION OF SPATIAL IMPRESSION RUSSELL MASON Institute of Sound Recording, University of Surrey, Guildford, UK r.mason@surrey.ac.uk

More information

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS 17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS Jürgen Freudenberger, Sebastian Stenzel, Benjamin Venditti

More information

Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis

Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis Hagen Wierstorf Assessment of IP-based Applications, T-Labs, Technische Universität Berlin, Berlin, Germany. Sascha Spors

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett 04 DAFx DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS Guillaume Potard, Ian Burnett School of Electrical, Computer and Telecommunications Engineering University

More information

Robust Speech Recognition Based on Binaural Auditory Processing

Robust Speech Recognition Based on Binaural Auditory Processing INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden Robust Speech Recognition Based on Binaural Auditory Processing Anjali Menon 1, Chanwoo Kim 2, Richard M. Stern 1 1 Department of Electrical and Computer

More information

Robust Speech Recognition Based on Binaural Auditory Processing

Robust Speech Recognition Based on Binaural Auditory Processing Robust Speech Recognition Based on Binaural Auditory Processing Anjali Menon 1, Chanwoo Kim 2, Richard M. Stern 1 1 Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh,

More information

From acoustic simulation to virtual auditory displays

From acoustic simulation to virtual auditory displays PROCEEDINGS of the 22 nd International Congress on Acoustics Plenary Lecture: Paper ICA2016-481 From acoustic simulation to virtual auditory displays Michael Vorländer Institute of Technical Acoustics,

More information

Assessing the contribution of binaural cues for apparent source width perception via a functional model

Assessing the contribution of binaural cues for apparent source width perception via a functional model Virtual Acoustics: Paper ICA06-768 Assessing the contribution of binaural cues for apparent source width perception via a functional model Johannes Käsbach (a), Manuel Hahmann (a), Tobias May (a) and Torsten

More information

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

University of Huddersfield Repository

University of Huddersfield Repository University of Huddersfield Repository Wankling, Matthew and Fazenda, Bruno The optimization of modal spacing within small rooms Original Citation Wankling, Matthew and Fazenda, Bruno (2008) The optimization

More information

Spatialisation accuracy of a Virtual Performance System

Spatialisation accuracy of a Virtual Performance System Spatialisation accuracy of a Virtual Performance System Iain Laird, Dr Paul Chapman, Digital Design Studio, Glasgow School of Art, Glasgow, UK, I.Laird1@gsa.ac.uk, p.chapman@gsa.ac.uk Dr Damian Murphy

More information

THE problem of acoustic echo cancellation (AEC) was

THE problem of acoustic echo cancellation (AEC) was IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 6, NOVEMBER 2005 1231 Acoustic Echo Cancellation and Doubletalk Detection Using Estimated Loudspeaker Impulse Responses Per Åhgren Abstract

More information

A classification-based cocktail-party processor

A classification-based cocktail-party processor A classification-based cocktail-party processor Nicoleta Roman, DeLiang Wang Department of Computer and Information Science and Center for Cognitive Science The Ohio State University Columbus, OH 43, USA

More information

Study on method of estimating direct arrival using monaural modulation sp. Author(s)Ando, Masaru; Morikawa, Daisuke; Uno

Study on method of estimating direct arrival using monaural modulation sp. Author(s)Ando, Masaru; Morikawa, Daisuke; Uno JAIST Reposi https://dspace.j Title Study on method of estimating direct arrival using monaural modulation sp Author(s)Ando, Masaru; Morikawa, Daisuke; Uno Citation Journal of Signal Processing, 18(4):

More information

LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION. Hans Knutsson Carl-Fredrik Westin Gösta Granlund

LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION. Hans Knutsson Carl-Fredrik Westin Gösta Granlund LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION Hans Knutsson Carl-Fredri Westin Gösta Granlund Department of Electrical Engineering, Computer Vision Laboratory Linöping University, S-58 83 Linöping,

More information

Live multi-track audio recording

Live multi-track audio recording Live multi-track audio recording Joao Luiz Azevedo de Carvalho EE522 Project - Spring 2007 - University of Southern California Abstract In live multi-track audio recording, each microphone perceives sound

More information

Optimal modal spacing and density for critical listening

Optimal modal spacing and density for critical listening Optimal modal spacing and density for critical listening Fazenda, BM and Wankling, M Title Authors Type URL Published Date 2008 Optimal modal spacing and density for critical listening Fazenda, BM and

More information

Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh

Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA Abstract Digital waveguide mesh has emerged

More information

Three-dimensional sound field simulation using the immersive auditory display system Sound Cask for stage acoustics

Three-dimensional sound field simulation using the immersive auditory display system Sound Cask for stage acoustics Stage acoustics: Paper ISMRA2016-34 Three-dimensional sound field simulation using the immersive auditory display system Sound Cask for stage acoustics Kanako Ueno (a), Maori Kobayashi (b), Haruhito Aso

More information

Speech Enhancement Based on Audible Noise Suppression

Speech Enhancement Based on Audible Noise Suppression IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 5, NO. 6, NOVEMBER 1997 497 Speech Enhancement Based on Audible Noise Suppression Dionysis E. Tsoukalas, John N. Mourjopoulos, Member, IEEE, and George

More information

ROOM IMPULSE RESPONSE SHORTENING BY CHANNEL SHORTENING CONCEPTS. Markus Kallinger and Alfred Mertins

ROOM IMPULSE RESPONSE SHORTENING BY CHANNEL SHORTENING CONCEPTS. Markus Kallinger and Alfred Mertins ROOM IMPULSE RESPONSE SHORTENING BY CHANNEL SHORTENING CONCEPTS Markus Kallinger and Alfred Mertins University of Oldenburg, Institute of Physics, Signal Processing Group D-26111 Oldenburg, Germany {markus.kallinger,

More information

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008 R E S E A R C H R E P O R T I D I A P Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath

More information

DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY

DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY Anastasios Alexandridis Anthony Griffin Athanasios Mouchtaris FORTH-ICS, Heraklion, Crete, Greece, GR-70013 University of Crete, Department

More information

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,

More information

Audio Engineering Society Convention Paper 5449

Audio Engineering Society Convention Paper 5449 Audio Engineering Society Convention Paper 5449 Presented at the 111th Convention 21 September 21 24 New York, NY, USA This convention paper has been reproduced from the author s advance manuscript, without

More information

Speaker Isolation in a Cocktail-Party Setting

Speaker Isolation in a Cocktail-Party Setting Speaker Isolation in a Cocktail-Party Setting M.K. Alisdairi Columbia University M.S. Candidate Electrical Engineering Spring Abstract the human auditory system is capable of performing many interesting

More information

Introduction. 1.1 Surround sound

Introduction. 1.1 Surround sound Introduction 1 This chapter introduces the project. First a brief description of surround sound is presented. A problem statement is defined which leads to the goal of the project. Finally the scope of

More information

Convention Paper Presented at the 120th Convention 2006 May Paris, France

Convention Paper Presented at the 120th Convention 2006 May Paris, France Audio Engineering Society Convention Paper Presented at the 12th Convention 26 May 2 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing, corrections,

More information

Auditory Localization

Auditory Localization Auditory Localization CMPT 468: Sound Localization Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University November 15, 2013 Auditory locatlization is the human perception

More information

ROBUST echo cancellation requires a method for adjusting

ROBUST echo cancellation requires a method for adjusting 1030 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 3, MARCH 2007 On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk Jean-Marc Valin, Member,

More information

Novel approaches towards more realistic listening environments for experiments in complex acoustic scenes

Novel approaches towards more realistic listening environments for experiments in complex acoustic scenes Novel approaches towards more realistic listening environments for experiments in complex acoustic scenes Janina Fels, Florian Pausch, Josefa Oberem, Ramona Bomhardt, Jan-Gerrit-Richter Teaching and Research

More information

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER A BINAURAL EARING AID SPEEC ENANCEMENT METOD MAINTAINING SPATIAL AWARENESS FOR TE USER Joachim Thiemann, Menno Müller and Steven van de Par Carl-von-Ossietzky University Oldenburg, Cluster of Excellence

More information

Convention Paper Presented at the 128th Convention 2010 May London, UK

Convention Paper Presented at the 128th Convention 2010 May London, UK Audio Engineering Society Convention Paper Presented at the 128th Convention 21 May 22 25 London, UK 879 The papers at this Convention have been selected on the basis of a submitted abstract and extended

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

Modeling Diffraction of an Edge Between Surfaces with Different Materials

Modeling Diffraction of an Edge Between Surfaces with Different Materials Modeling Diffraction of an Edge Between Surfaces with Different Materials Tapio Lokki, Ville Pulkki Helsinki University of Technology Telecommunications Software and Multimedia Laboratory P.O.Box 5400,

More information

Monaural and Binaural Speech Separation

Monaural and Binaural Speech Separation Monaural and Binaural Speech Separation DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction CASA approach to sound separation Ideal binary mask as

More information

3D sound image control by individualized parametric head-related transfer functions

3D sound image control by individualized parametric head-related transfer functions D sound image control by individualized parametric head-related transfer functions Kazuhiro IIDA 1 and Yohji ISHII 1 Chiba Institute of Technology 2-17-1 Tsudanuma, Narashino, Chiba 275-001 JAPAN ABSTRACT

More information

The relation between perceived apparent source width and interaural cross-correlation in sound reproduction spaces with low reverberation

The relation between perceived apparent source width and interaural cross-correlation in sound reproduction spaces with low reverberation Downloaded from orbit.dtu.dk on: Feb 05, 2018 The relation between perceived apparent source width and interaural cross-correlation in sound reproduction spaces with low reverberation Käsbach, Johannes;

More information

Acoustics Research Institute

Acoustics Research Institute Austrian Academy of Sciences Acoustics Research Institute Spatial SpatialHearing: Hearing: Single SingleSound SoundSource Sourcein infree FreeField Field Piotr PiotrMajdak Majdak&&Bernhard BernhardLaback

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information

Convention e-brief 310

Convention e-brief 310 Audio Engineering Society Convention e-brief 310 Presented at the 142nd Convention 2017 May 20 23 Berlin, Germany This Engineering Brief was selected on the basis of a submitted synopsis. The author is

More information

AUDITORY ILLUSIONS & LAB REPORT FORM

AUDITORY ILLUSIONS & LAB REPORT FORM 01/02 Illusions - 1 AUDITORY ILLUSIONS & LAB REPORT FORM NAME: DATE: PARTNER(S): The objective of this experiment is: To understand concepts such as beats, localization, masking, and musical effects. APPARATUS:

More information

ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES

ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES ROOM AND CONCERT HALL ACOUSTICS The perception of sound by human listeners in a listening space, such as a room or a concert hall is a complicated function of the type of source sound (speech, oration,

More information

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4 SOPA version 2 Revised July 7 2014 SOPA project September 21, 2014 Contents 1 Introduction 2 2 Basic concept 3 3 Capturing spatial audio 4 4 Sphere around your head 5 5 Reproduction 7 5.1 Binaural reproduction......................

More information

A METHOD FOR BINAURAL SOUND REPRODUCTION WITH WIDER LISTENING AREA USING TWO LOUDSPEAKERS

A METHOD FOR BINAURAL SOUND REPRODUCTION WITH WIDER LISTENING AREA USING TWO LOUDSPEAKERS 23 rd International ongress on Sound & Vibration Athens, Greece 0-4 July 206 ISV23 A METHOD FO BINAUA SOUND EPODUTION WITH WIDE ISTENING AEA USING TWO OUDSPEAKES Keiichiro Someda, Akihiko Enamito, Osamu

More information

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany Audio Engineering Society Convention Paper Presented at the 16th Convention 9 May 7 Munich, Germany The papers at this Convention have been selected on the basis of a submitted abstract and extended precis

More information

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991 RASTA-PLP SPEECH ANALYSIS Hynek Hermansky Nelson Morgan y Aruna Bayya Phil Kohn y TR-91-069 December 1991 Abstract Most speech parameter estimation techniques are easily inuenced by the frequency response

More information

Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig Wolfgang Klippel

Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig Wolfgang Klippel Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig (m.liebig@klippel.de) Wolfgang Klippel (wklippel@klippel.de) Abstract To reproduce an artist s performance, the loudspeakers

More information

Encoding higher order ambisonics with AAC

Encoding higher order ambisonics with AAC University of Wollongong Research Online Faculty of Engineering - Papers (Archive) Faculty of Engineering and Information Sciences 2008 Encoding higher order ambisonics with AAC Erik Hellerud Norwegian

More information

Multiple Sound Sources Localization Using Energetic Analysis Method

Multiple Sound Sources Localization Using Energetic Analysis Method VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova

More information

Multiple sound source localization using gammatone auditory filtering and direct sound componence detection

Multiple sound source localization using gammatone auditory filtering and direct sound componence detection IOP Conference Series: Earth and Environmental Science PAPER OPE ACCESS Multiple sound source localization using gammatone auditory filtering and direct sound componence detection To cite this article:

More information

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner. Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions

More information