WAVELET-BASED SPECTRAL SMOOTHING FOR HEAD-RELATED TRANSFER FUNCTION FILTER DESIGN

Size: px
Start display at page:

Download "WAVELET-BASED SPECTRAL SMOOTHING FOR HEAD-RELATED TRANSFER FUNCTION FILTER DESIGN"

Transcription

1 WAVELET-BASE SPECTRAL SMOOTHING FOR HEA-RELATE TRANSFER FUNCTION FILTER ESIGN HUSEYIN HACIHABIBOGLU, BANU GUNEL, AN FIONN MURTAGH Sonic Arts Research Centre (SARC), Queen s University Belfast, Belfast, BT 1NN, U.K. {h.hacihabiboglu, b.gunel, f.murtagh}@qub.ac.uk Three wavelet-based spectral smoothing techniques are presented in this paper as a pre-processing stage for headrelated transfer function (HRTF) filter design. These wavelet-based methods include wavelet denoising, wavelet approximation, and redundant wavelet transform. These methods are used with time-domain parametric filter design methods to reduce the order of the IIR filters which is useful for real-time implementation of immersive audio systems. Results of a subjective listening test are then presented in order to justify the perceptual validity of the investigated smoothing methods. Results show that wavelet based spectral smoothing methods are beneficial in reducing the filter order and increasing the perception of localization without introducing noticeable effect on timbre. INTROUCTION Spectral shaping of sound on its way from a fixed point in free field to an external ear is defined by a headrelated transfer function (HRTF). ata reduction of HRTFs becomes an important issue when dealing with virtual reality applications, spatialization of audio and binaural synthesis in real time. ifferent methods for data complexity reduction including downsampling, rectangular windowing [1], and principal component analysis [2] have been proposed. The data-reduced form of an HRTF is then used for filter design. An essential issue in HRTF filter design for perceptual validity is preservation of the binaural hearing cues. A better approach uses auditory criteria for smoothing [3]. Frequency warping [4] has been used as an auditory weighting method []. A review of auditory weighting is given in Huopaniemi et al. []. The smoothing methods presented in this paper depend on wavelet transforms. In Section 1, a summary of wavelet de-noising, wavelet approximation and redundant wavelet transform is presented. The application of these smoothing methods to individual HRTFs is discussed together with the filter design issues in Section 2. Results of the subjective listening test are presented in Section 3. 1 REVIEW OF WAVELET SMOOTHING The smoothing methods employed in this paper utilize the discrete wavelet transform (WT). A general review of wavelets can be found in Mallat []. WT can be used for smoothing a given signal in a number of different ways. Three possibilities are wavelet denoising, wavelet approximation and redundant wavelet transform. In the scope of this paper, the signals to be smoothed are the magnitude responses of the individual HRTFs. 1.1 Wavelet enoising with SURE Assuming that the magnitude response of an HRTF is a noisy version of a smoother function, we can use wavelet denoising to recover an approximate, smoother function. The wavelet denoising is defined as a three-step process [] [9]. If we assume that the magnitude response of the actual HRTF has additive white Gaussian noise such that h = f t ) + z i where ( (1) i done by following the steps below: i 2 z i = N(, σ ), then the wavelet denoising is 1) WT is performed on the noisy data. 2) A soft-threshold non-linearity is applied to the noisy wavelet coefficients to find the wavelet coefficient estimates. η ( w) = sgn( w)( w t) I( w t) (2) t > The threshold t in this equation is calculated with the SURE (Stein s Unbiased Risk Estimation) algorithm. AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 1

2 3) The denoised function estimate (t) is obtained by inverting the WT using the wavelet coefficient estimates found in the previous step. Validity of the assumption of noise characteristics for HRTF data will be discussed in Section Wavelet Approximation Another assumption regarding HRTF data is that small deviations are not critically important and can accordingly be discarded [2]. After the WT is performed on a signal of length 2 N, approximation coefficients at level L will have the length 2 N-L. The signal approximation is low-pass interpolated to get a smooth approximation of the actual HRTF magnitude response. Smoothness of the final representation also depends on the selection of the wavelet type and the depth of the tree. A scaling function with a higher attenuation in the first side-lobe captures better low frequency approximation. f The smoothing and filtering procedures described in the subsequent sections have been applied to the HRTFs for all the azimuth and elevation values for both types of the pinnae. 2.2 Smoothing with Wavelet enoising The wavelet denoising is performed with the SURE algorithm to select the soft-threshold. ifferent wavelet functions show different denoising performances with the same denoising scheme. Therefore, denoising was performed for different wavelets selected from a wellknown group consisting of Haar, discrete Meyer, aubechies 2-9, Coiflet 1- and Symmlet 1-1 wavelets. A 3-level WT was used in all of the cases. The results of the smoothing for magnitude response of the HRTF corresponding to the left ear at zero-azimuth (ϕ= ) and zero-elevation (δ= ) is given below (see Fig. 1). 1.3 Redundant Wavelet Transform If the sub-sampling steps are discarded from the WT structure, a better low-pass approximation can be obtained. In the case of the HRTFs, significant coarse details of the frequency notches and the low frequency parts of the magnitude response are successfully captured. Results of the redundant wavelet transform depend on the selection of the wavelet type and the depth of the wavelet tree as in the case of wavelet approximation. Selection of a scaling function with a high attenuation in the first side-lobe is also important for this type of smoothing. 2 HRTF SMOOTHING USING WAVELETS 2.1 HRTF ata The wavelet-based smoothing methods mentioned in the previous section are applied to the magnitude responses of the non-individualized HRTFs sampled at three different locations. The HRTF sets are selected from the free-field equalized HRTF measurements made on the KEMAR mannequin by Martin and Gardner [1]. The Martin-Gardner HRTF set includes the HRTF measurements for two types of pinnae. A property of these data sets is that they are already presented as 12 sample series, after an acceptable amount of data is discarded. Before the smoothing process, magnitude responses of the individual HRTFs were equalized to discard the effect of the loudspeaker used in the HRTF measurements Figure 1: Original and denoised magnitude responses with db4, sym2, and coif3 from top to bottom, respectively It may be noted that wavelet denoising with SURE does not provide a sufficiently smooth result. Overfitting of the original by the wavelet-denoised data prevents the reduction of the order of filters in the design phase. This result is a consequence of the false assumption that the irregularities in the magnitude responses may be modeled as additive white Gaussian noise. Therefore wavelet denoising with SURE was not included as a smoothing method to be investigated in the listening test. 2.3 Smoothing with Wavelet Approximation The equalized magnitude response can be smoothed by obtaining the low-frequency approximation using the WT. The wavelet approximation of the signal was computed using the Symmlet 1 filter bank. A 3-level WT was performed on the 12-sample magnitude response. The AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 2

3 wavelet approximation, which is 4 samples long, was low-pass interpolated to get a smoother version of the signal. An alternative to this approach may be sub-sampling the signal by and low-pass interpolating with samples between each obtained result. However, this alternative results in loss of crucial data and gives minor frequency peaks and notches (see the selected area in Fig. 2) which are not significant in the original data. This decreases the success of smoothing in reducing the filter order. 2. Filter esign The equalized and smoothed magnitude responses were first converted into the time-domain. A minimum-phase version of the signal was then obtained by the minimum-phase reconstruction using real cepstrum [11]. A minimum-phase IIR filter with the same number of poles and zeros was designed for each HRTF with Prony s method. Frequency dependence of the interaural time delay (IT) was assumed to be perceptually irrelevant [2]. Thus the IT was supplied by a constant delay line PERCEPTUAL VALIATION OF WAVELET- BASE SMOOTHING Perceptual validation of our results was carried out using a subjective listening test. We used intraural, earplug style Sony headphones, and carried out the tests in a recording studio in the Music epartment of Queen s University Belfast Figure 2: Original (middle), wavelet approximation smoothing (top), and subsampling/interpolation smoothing (bottom) (ϕ=, δ= ) 2.4 Smoothing with Redundant Transform The redundant wavelet transform described in the previous section was applied to the magnitude responses of the HRTFs. The filter-bank used was the Symmlet 1 filter bank. As the approximation obtained using the redundant wavelet transform consists of 12 samples, low-pass interpolation is no longer necessary. Thus, the results capture the frequency peaks and notches better than wavelet approximation (see Fig. 3) Figure 3: Original (top) and redundant wavelet transform smoothing (bottom) (ϕ=, δ= ) 3.1 Subjects Eleven subjects (3 female, male) aged 19-3 served as voluntary participants. All had normal hearing and no history of hearing problems. Five of the subjects had previous experience of specialized listening. 3.2 Stimulus We used a pink noise sample with a length of one second with ms ramps in the onset and the offset portions. 3.3 Procedure Three HRTF sets representing three spatial positions were used in the listening test. The HRTF pairs we used were selected to represent both types of pinnae for 3 varying positions of azimuth (ϕ) and elevation (δ): ϕ= and δ= for small pinnae. ϕ=3 and δ= for small pinnae. ϕ=24 and δ=-1 for large red pinnae. This selection was rather arbitrary but it was made sure that the positions were far enough from each other to prevent bias in the listening test due to the previous location of the sound. This means that the ripples, peaks and notches of the HRTFs are quite different from each other (see Fig. 4 and Fig. ). The subjects were seated in the acoustically insulated recording studio. Tests for each of the three HRTF sets were carried out separately for each subject. Test signals were presented in pairs. The first signal was always the reference signal obtained by the convolution of the stimulus signal with the original HRTF set. The second signal was one of the following three signals selected randomly: AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 3

4 Pink noise filtered with the IIR filter which was directly designed from the HRTF data in the timedomain without any smoothing (). Pink noise filtered with the IIR filter after smoothing with the wavelet approximation (). Pink noise filtered with the IIR filter after smoothing with the redundant wavelet transform () AE A3E A-1E24 Figure 4: Tested HRTFs for the right ear plotted with db difference between them. AE A3E A-1E24 The test program, designed with MATLAB was similar to that used by Huopaniemi et al. [12]. The subjects were verbally instructed on the usage of the test program beforehand. The program selected the sound pairs from the test set randomly. Each sound pair was played with. sec time interval between the first and the second sounds. Subjects evaluated the sound pairs by similarity of timbre and localization on a scale of 1- (very different - no difference) quantized to unit intervals of.1. Subjects were given the opportunity to replay the sound pairs. The test lasted for -3 minutes per subject. The positions at which the subjects actually heard the stimuli were not investigated due to the fact that the HRTFs used in the test were not individualized HRTFs of the subjects. 3.4 Results The results were analyzed using the SPSS software package. The independent variables were method, subject, filter order, and angle. The dependent variables were timbre and localization scores obtained from the subjective listening test. The redundant wavelet transform outperformed the wavelet approximation and the direct design in terms of the localization scores for different filter orders. It may be noted that even for a filter order as low as 1, the localization is satisfactorily high. The wavelet approximation improved the localization compared to the direct design as well (see Fig. ) Figure : Tested HRTFs for the left ear plotted with db difference between them. All of the tested filters were calculated for the orders of,,,, 9, 1, 12, 14, 1, 1,, 24, 2, 32 and 3 for the HRTFs sampled at three locations. Thus, each subject was presented a total of 13 test pairs, which were equalized for the headphone and the ear canal resonance. 9% CI Localisation Order Figure : Filter Order vs. Mean Localization graphs for the three methods (,, and ) The timbre scores follow an increasing trend for increasing filter orders as expected. However, it is not possible to comment on the type of method which is better in preserving the timbre (see Fig. ). AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 4

5 . wavelet transform was found to be satisfactory even for very low filter orders (see Fig. 9) % CI Timbre Order Figure : Filter Order vs. Mean Timbre graphs for the three methods (, and ) The redundant wavelet transform was the preferred choice for all of the subjects in terms of the mean localization scores. The wavelet approximation was found to be superior over the direct design. This result is another proof that the wavelet-based smoothing methods provide better localization cues (see Fig. ).. Count Order Figure 9: Filter Order vs. Count for the three methods for (Timbre 3.) and (Localization 4.) The localization accuracy depends on the location of the sound source [13]. The redundant wavelet transform smoothing performed well for all of the investigated HRTFs giving mean localization scores over 4. (see Fig. 1) % CI Localisation Subjects Figure : Subject vs. Mean Localization graphs for the three methods (, and ) The correlation coefficient between the timbre and the localization was calculated as.19 using Spearman s rho test. This low correlation can be due to the fact that the HRTFs include both the timbre and the localization information []. Overall mean values for the timbre score and the localization score were found to be 3.4 and 3.3 respectively. Therefore the cases where the timbre score is greater than 3. and the localization score is greater than 4. can be taken as acceptable. These cases were analyzed with respect to the filter orders for the three smoothing methods. The redundant 9% CI Localisation Angle AE A24E-1 A3E Figure 1: Angle vs. Mean Localization graphs for three methods (, and ) 4 CONCLUSIONS AN FUTURE WORK In this work, we analyzed the effects of three waveletbased methods for spectral smoothing for HRTF filter design. The results show that the wavelet-based smoothing methods can help in reducing the filter order for IIR filters. The wavelet approximation outperforms the direct design in most of the cases. The redundant wavelet transform increases the localization accuracy AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio

6 without substantially decreasing the timbre quality. Its robustness for different source locations provides evidence for its usefulness for HRTF filter design for immersive audio applications. The opportunities for future work include investigation of the different thresholding strategies to be used with wavelet denoising compatible with the human auditory system. The detail coefficients of the WT will be used to restore the details of the HRTFs within a real-time system utilizing a head tracker. Restoring the details of the HRTFs can be useful in smoothly and gradually increasing the level of timbral detail when the position of the listener s head is stable. ACKNOWLEGEMENTS The authors gratefully acknowledge the assistance of Mr. Chris Corrigan in the organization of the subjective listening test. We would also like to thank the participants of the test. REFERENCES [1]. R. Begault, 3- Sound for Virtual Reality and Multimedia (Academic Press, London, 1994), pp [2] F. L. Wightman and. J. Kistler, "A Model of Head-Related Transfer Function Based on Principal Components Analysis and Minimumphase Reconstruction," J. Acoust. Soc. Am., vol. 91, pp , (1992) []. L. onoho, and I. M. Johnstone, Adapting to Unknown Smoothness via Wavelet Shrinkage, J. Am. Statist. Ass., vol. 9, pp , (199) [9]. L. onoho, "enoising by Soft-thresholding," IEEE Trans. Inf. Theory, vol. 41, pp. 13-2, (199) [1] W. G. Gardner and K.. Martin. "HRTF Measurements of a KEMAR," J. Acoust. Soc. Am., vol. 9, pp , (199) [11] A. V. Oppenheim and R. W. Schafer, iscrete- Time Signal Processing (Prentice-Hall, N.J., 199) pp. 3- [12] J. Huopaniemi, N. Zacharov, and M. Karjalainen. "Objective and Subjective Evaluation of Head- Related Transfer Function Filter esign," J. Audio Eng. Soc., vol. 4, no. 4., pp , (1999) [13] J. Blauert, Spatial Hearing : The Psychophysics of Human Sound Localization (M.I.T. Press, Cambridge, MA, 193), pp [3] J. Huopaniemi and M. Karjalainen, "HRTF Filter esign Based on Auditory Criteria," in Proc. Nordic Acoustical Meeting (NAM'9), (Helsinki, Finland, 199, June 12-14), pp [4] J. Huopaniemi and M. Karjalainen. Comparison of igital Filter esign s for 3- Sound, in Proc. IEEE Nordic Signal Processing Symp. (NORSIG'9), (Espoo, Finland, 199), pp [] J. Smith and J. Abel, Bark and ERB Bilinear Transforms, IEEE Transactions on Speech and Audio Processing, vol., iss., pp. 9, (1999) [] J. Huopaniemi and M. Karjalainen. Review of igital Filter esign and Implementation s for 3- Sound, in Proc. of the 12 nd Convention of the Audio Engineering Society, Preprint 441, (Munich, Germany, 199) [] S. Mallat, A Wavelet Tour of Signal Processing, (Academic Press, London, 199) AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio

A binaural auditory model and applications to spatial sound evaluation

A binaural auditory model and applications to spatial sound evaluation A binaural auditory model and applications to spatial sound evaluation Ma r k o Ta k a n e n 1, Ga ë ta n Lo r h o 2, a n d Mat t i Ka r ja l a i n e n 1 1 Helsinki University of Technology, Dept. of Signal

More information

Enhancing 3D Audio Using Blind Bandwidth Extension

Enhancing 3D Audio Using Blind Bandwidth Extension Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,

More information

Psychoacoustic Cues in Room Size Perception

Psychoacoustic Cues in Room Size Perception Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany 6084 This convention paper has been reproduced from the author s advance manuscript, without editing,

More information

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS NORDIC ACOUSTICAL MEETING 12-14 JUNE 1996 HELSINKI WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS Helsinki University of Technology Laboratory of Acoustics and Audio

More information

The analysis of multi-channel sound reproduction algorithms using HRTF data

The analysis of multi-channel sound reproduction algorithms using HRTF data The analysis of multichannel sound reproduction algorithms using HRTF data B. Wiggins, I. PatersonStephens, P. Schillebeeckx Processing Applications Research Group University of Derby Derby, United Kingdom

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Psychological and Physiological Acoustics Session 2aPPa: Binaural Hearing

More information

A triangulation method for determining the perceptual center of the head for auditory stimuli

A triangulation method for determining the perceptual center of the head for auditory stimuli A triangulation method for determining the perceptual center of the head for auditory stimuli PACS REFERENCE: 43.66.Qp Brungart, Douglas 1 ; Neelon, Michael 2 ; Kordik, Alexander 3 ; Simpson, Brian 4 1

More information

III. Publication III. c 2005 Toni Hirvonen.

III. Publication III. c 2005 Toni Hirvonen. III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on

More information

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations György Wersényi Széchenyi István University, Hungary. József Répás Széchenyi István University, Hungary. Summary

More information

ORIENTATION IN SIMPLE VIRTUAL AUDITORY SPACE CREATED WITH MEASURED HRTF

ORIENTATION IN SIMPLE VIRTUAL AUDITORY SPACE CREATED WITH MEASURED HRTF ORIENTATION IN SIMPLE VIRTUAL AUDITORY SPACE CREATED WITH MEASURED HRTF F. Rund, D. Štorek, O. Glaser, M. Barda Faculty of Electrical Engineering Czech Technical University in Prague, Prague, Czech Republic

More information

Direction-Dependent Physical Modeling of Musical Instruments

Direction-Dependent Physical Modeling of Musical Instruments 15th International Congress on Acoustics (ICA 95), Trondheim, Norway, June 26-3, 1995 Title of the paper: Direction-Dependent Physical ing of Musical Instruments Authors: Matti Karjalainen 1,3, Jyri Huopaniemi

More information

Spatial Audio Reproduction: Towards Individualized Binaural Sound

Spatial Audio Reproduction: Towards Individualized Binaural Sound Spatial Audio Reproduction: Towards Individualized Binaural Sound WILLIAM G. GARDNER Wave Arts, Inc. Arlington, Massachusetts INTRODUCTION The compact disc (CD) format records audio with 16-bit resolution

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS

ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS PACS: 4.55 Br Gunel, Banu Sonic Arts Research Centre (SARC) School of Computer Science Queen s University Belfast Belfast,

More information

Sound source localization and its use in multimedia applications

Sound source localization and its use in multimedia applications Notes for lecture/ Zack Settel, McGill University Sound source localization and its use in multimedia applications Introduction With the arrival of real-time binaural or "3D" digital audio processing,

More information

BINAURAL RECORDING SYSTEM AND SOUND MAP OF MALAGA

BINAURAL RECORDING SYSTEM AND SOUND MAP OF MALAGA EUROPEAN SYMPOSIUM ON UNDERWATER BINAURAL RECORDING SYSTEM AND SOUND MAP OF MALAGA PACS: Rosas Pérez, Carmen; Luna Ramírez, Salvador Universidad de Málaga Campus de Teatinos, 29071 Málaga, España Tel:+34

More information

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand

More information

Audio Engineering Society. Convention Paper. Presented at the 131st Convention 2011 October New York, NY, USA

Audio Engineering Society. Convention Paper. Presented at the 131st Convention 2011 October New York, NY, USA Audio Engineering Society Convention Paper Presented at the 131st Convention 2011 October 20 23 New York, NY, USA This Convention paper was selected based on a submitted abstract and 750-word precis that

More information

Convention Paper Presented at the 125th Convention 2008 October 2 5 San Francisco, CA, USA

Convention Paper Presented at the 125th Convention 2008 October 2 5 San Francisco, CA, USA Audio Engineering Society Convention Paper Presented at the 125th Convention 2008 October 2 5 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

Aalborg Universitet. Audibility of time switching in dynamic binaural synthesis Hoffmann, Pablo Francisco F.; Møller, Henrik

Aalborg Universitet. Audibility of time switching in dynamic binaural synthesis Hoffmann, Pablo Francisco F.; Møller, Henrik Aalborg Universitet Audibility of time switching in dynamic binaural synthesis Hoffmann, Pablo Francisco F.; Møller, Henrik Published in: Journal of the Audio Engineering Society Publication date: 2005

More information

Multichannel level alignment, part I: Signals and methods

Multichannel level alignment, part I: Signals and methods Suokuisma, Zacharov & Bech AES 5th Convention - San Francisco Multichannel level alignment, part I: Signals and methods Pekka Suokuisma Nokia Research Center, Speech and Audio Systems Laboratory, Tampere,

More information

Acoustics Research Institute

Acoustics Research Institute Austrian Academy of Sciences Acoustics Research Institute Spatial SpatialHearing: Hearing: Single SingleSound SoundSource Sourcein infree FreeField Field Piotr PiotrMajdak Majdak&&Bernhard BernhardLaback

More information

Binaural Hearing. Reading: Yost Ch. 12

Binaural Hearing. Reading: Yost Ch. 12 Binaural Hearing Reading: Yost Ch. 12 Binaural Advantages Sounds in our environment are usually complex, and occur either simultaneously or close together in time. Studies have shown that the ability to

More information

HRTF adaptation and pattern learning

HRTF adaptation and pattern learning HRTF adaptation and pattern learning FLORIAN KLEIN * AND STEPHAN WERNER Electronic Media Technology Lab, Institute for Media Technology, Technische Universität Ilmenau, D-98693 Ilmenau, Germany The human

More information

Listening with Headphones

Listening with Headphones Listening with Headphones Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back Substantial individual differences Most evident in elevation

More information

HRIR Customization in the Median Plane via Principal Components Analysis

HRIR Customization in the Median Plane via Principal Components Analysis 한국소음진동공학회 27 년춘계학술대회논문집 KSNVE7S-6- HRIR Customization in the Median Plane via Principal Components Analysis 주성분분석을이용한 HRIR 맞춤기법 Sungmok Hwang and Youngjin Park* 황성목 박영진 Key Words : Head-Related Transfer

More information

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model Sebastian Merchel and Stephan Groth Chair of Communication Acoustics, Dresden University

More information

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds

More information

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Verona, Italy, December 7-9,2 AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Tapio Lokki Telecommunications

More information

Computational Perception /785

Computational Perception /785 Computational Perception 15-485/785 Assignment 1 Sound Localization due: Thursday, Jan. 31 Introduction This assignment focuses on sound localization. You will develop Matlab programs that synthesize sounds

More information

Convention Paper 9870 Presented at the 143 rd Convention 2017 October 18 21, New York, NY, USA

Convention Paper 9870 Presented at the 143 rd Convention 2017 October 18 21, New York, NY, USA Audio Engineering Society Convention Paper 987 Presented at the 143 rd Convention 217 October 18 21, New York, NY, USA This convention paper was selected based on a submitted abstract and 7-word precis

More information

Ivan Tashev Microsoft Research

Ivan Tashev Microsoft Research Hannes Gamper Microsoft Research David Johnston Microsoft Research Ivan Tashev Microsoft Research Mark R. P. Thomas Dolby Laboratories Jens Ahrens Chalmers University, Sweden Augmented and virtual reality,

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

Nonlinear Filtering in ECG Signal Denoising

Nonlinear Filtering in ECG Signal Denoising Acta Universitatis Sapientiae Electrical and Mechanical Engineering, 2 (2) 36-45 Nonlinear Filtering in ECG Signal Denoising Zoltán GERMÁN-SALLÓ Department of Electrical Engineering, Faculty of Engineering,

More information

Study on method of estimating direct arrival using monaural modulation sp. Author(s)Ando, Masaru; Morikawa, Daisuke; Uno

Study on method of estimating direct arrival using monaural modulation sp. Author(s)Ando, Masaru; Morikawa, Daisuke; Uno JAIST Reposi https://dspace.j Title Study on method of estimating direct arrival using monaural modulation sp Author(s)Ando, Masaru; Morikawa, Daisuke; Uno Citation Journal of Signal Processing, 18(4):

More information

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Jie Huang, Katsunori Kume, Akira Saji, Masahiro Nishihashi, Teppei Watanabe and William L. Martens The University of Aizu Aizu-Wakamatsu,

More information

Virtual Reality Presentation of Loudspeaker Stereo Recordings

Virtual Reality Presentation of Loudspeaker Stereo Recordings Virtual Reality Presentation of Loudspeaker Stereo Recordings by Ben Supper 21 March 2000 ACKNOWLEDGEMENTS Thanks to: Francis Rumsey, for obtaining a head tracker specifically for this Technical Project;

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ IA 213 Montreal Montreal, anada 2-7 June 213 Psychological and Physiological Acoustics Session 3pPP: Multimodal Influences

More information

Tu1.D II Current Approaches to 3-D Sound Reproduction. Elizabeth M. Wenzel

Tu1.D II Current Approaches to 3-D Sound Reproduction. Elizabeth M. Wenzel Current Approaches to 3-D Sound Reproduction Elizabeth M. Wenzel NASA Ames Research Center Moffett Field, CA 94035 Elizabeth.M.Wenzel@nasa.gov Abstract Current approaches to spatial sound synthesis are

More information

University of Huddersfield Repository

University of Huddersfield Repository University of Huddersfield Repository Moore, David J. and Wakefield, Jonathan P. Surround Sound for Large Audiences: What are the Problems? Original Citation Moore, David J. and Wakefield, Jonathan P.

More information

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Banu Gunel, Huseyin Hacihabiboglu and Ahmet Kondoz I-Lab Multimedia

More information

Image De-Noising Using a Fast Non-Local Averaging Algorithm

Image De-Noising Using a Fast Non-Local Averaging Algorithm Image De-Noising Using a Fast Non-Local Averaging Algorithm RADU CIPRIAN BILCU 1, MARKKU VEHVILAINEN 2 1,2 Multimedia Technologies Laboratory, Nokia Research Center Visiokatu 1, FIN-33720, Tampere FINLAND

More information

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS 20-21 September 2018, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2018) 20-21 September 2018, Bulgaria INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR

More information

From Binaural Technology to Virtual Reality

From Binaural Technology to Virtual Reality From Binaural Technology to Virtual Reality Jens Blauert, D-Bochum Prominent Prominent Features of of Binaural Binaural Hearing Hearing - Localization Formation of positions of the auditory events (azimuth,

More information

Computational Perception. Sound localization 2

Computational Perception. Sound localization 2 Computational Perception 15-485/785 January 22, 2008 Sound localization 2 Last lecture sound propagation: reflection, diffraction, shadowing sound intensity (db) defining computational problems sound lateralization

More information

Subband Analysis of Time Delay Estimation in STFT Domain

Subband Analysis of Time Delay Estimation in STFT Domain PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,

More information

VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION

VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION ARCHIVES OF ACOUSTICS 33, 4, 413 422 (2008) VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION Michael VORLÄNDER RWTH Aachen University Institute of Technical Acoustics 52056 Aachen,

More information

A spatial squeezing approach to ambisonic audio compression

A spatial squeezing approach to ambisonic audio compression University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 A spatial squeezing approach to ambisonic audio compression Bin Cheng

More information

SIMULATION OF SMALL HEAD-MOVEMENTS ON A VIRTUAL AUDIO DISPLAY USING HEADPHONE PLAYBACK AND HRTF SYNTHESIS. György Wersényi

SIMULATION OF SMALL HEAD-MOVEMENTS ON A VIRTUAL AUDIO DISPLAY USING HEADPHONE PLAYBACK AND HRTF SYNTHESIS. György Wersényi SIMULATION OF SMALL HEAD-MOVEMENTS ON A VIRTUAL AUDIO DISPLAY USING HEADPHONE PLAYBACK AND HRTF SYNTHESIS György Wersényi Széchenyi István University Department of Telecommunications Egyetem tér 1, H-9024,

More information

PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS

PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS Myung-Suk Song #1, Cha Zhang 2, Dinei Florencio 3, and Hong-Goo Kang #4 # Department of Electrical and Electronic, Yonsei University Microsoft Research 1 earth112@dsp.yonsei.ac.kr,

More information

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett 04 DAFx DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS Guillaume Potard, Ian Burnett School of Electrical, Computer and Telecommunications Engineering University

More information

Transcription of Piano Music

Transcription of Piano Music Transcription of Piano Music Rudolf BRISUDA Slovak University of Technology in Bratislava Faculty of Informatics and Information Technologies Ilkovičova 2, 842 16 Bratislava, Slovakia xbrisuda@is.stuba.sk

More information

WAVELET SIGNAL AND IMAGE DENOISING

WAVELET SIGNAL AND IMAGE DENOISING WAVELET SIGNAL AND IMAGE DENOISING E. Hošťálková, A. Procházka Institute of Chemical Technology Department of Computing and Control Engineering Abstract The paper deals with the use of wavelet transform

More information

Optimizing a High-Order Graphic Equalizer for Audio Processing

Optimizing a High-Order Graphic Equalizer for Audio Processing Powered by TCPDF (www.tcpdf.org) This is an electronic reprint of the original article. This reprint may differ from the original in pagination and typographic detail. Author(s): Rämö, J.; Välimäki, V.

More information

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS PACS Reference: 43.66.Pn THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS Pauli Minnaar; Jan Plogsties; Søren Krarup Olesen; Flemming Christensen; Henrik Møller Department of Acoustics Aalborg

More information

NAME STUDENT # ELEC 484 Audio Signal Processing. Midterm Exam July Listening test

NAME STUDENT # ELEC 484 Audio Signal Processing. Midterm Exam July Listening test NAME STUDENT # ELEC 484 Audio Signal Processing Midterm Exam July 2008 CLOSED BOOK EXAM Time 1 hour Listening test Choose one of the digital audio effects for each sound example. Put only ONE mark in each

More information

396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011

396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence

More information

ANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES. M. Shahnawaz, L. Bianchi, A. Sarti, S.

ANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES. M. Shahnawaz, L. Bianchi, A. Sarti, S. ANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES M. Shahnawaz, L. Bianchi, A. Sarti, S. Tubaro Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico

More information

Improving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis

Improving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis Improving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis John Garas and Piet C.W. Sommen Eindhoven University of Technology Ehoog 6.34, P.O.Box 513 5 MB Eindhoven,

More information

Wavelet Speech Enhancement based on the Teager Energy Operator

Wavelet Speech Enhancement based on the Teager Energy Operator Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose

More information

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

A Parametric Model for Spectral Sound Synthesis of Musical Sounds A Parametric Model for Spectral Sound Synthesis of Musical Sounds Cornelia Kreutzer University of Limerick ECE Department Limerick, Ireland cornelia.kreutzer@ul.ie Jacqueline Walker University of Limerick

More information

Convention Paper Presented at the 139th Convention 2015 October 29 November 1 New York, USA

Convention Paper Presented at the 139th Convention 2015 October 29 November 1 New York, USA Audio Engineering Society Convention Paper Presented at the 139th Convention 2015 October 29 November 1 New York, USA 9447 This Convention paper was selected based on a submitted abstract and 750-word

More information

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015 Final Exam Study Guide: 15-322 Introduction to Computer Music Course Staff April 24, 2015 This document is intended to help you identify and master the main concepts of 15-322, which is also what we intend

More information

MAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION

MAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, MAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION Federico Fontana University of Verona

More information

MPEG-4 Structured Audio Systems

MPEG-4 Structured Audio Systems MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter

Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter Korakoch Saengrattanakul Faculty of Engineering, Khon Kaen University Khon Kaen-40002, Thailand. ORCID: 0000-0001-8620-8782 Kittipitch Meesawat*

More information

From acoustic simulation to virtual auditory displays

From acoustic simulation to virtual auditory displays PROCEEDINGS of the 22 nd International Congress on Acoustics Plenary Lecture: Paper ICA2016-481 From acoustic simulation to virtual auditory displays Michael Vorländer Institute of Technical Acoustics,

More information

Sound Source Localization using HRTF database

Sound Source Localization using HRTF database ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,

More information

Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation

Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation Sampo Vesa Master s Thesis presentation on 22nd of September, 24 21st September 24 HUT / Laboratory of Acoustics

More information

Virtual Acoustic Space as Assistive Technology

Virtual Acoustic Space as Assistive Technology Multimedia Technology Group Virtual Acoustic Space as Assistive Technology Czech Technical University in Prague Faculty of Electrical Engineering Department of Radioelectronics Technická 2 166 27 Prague

More information

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Brochure More information from http://www.researchandmarkets.com/reports/569388/ Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Description: Multimedia Signal

More information

Externalization in binaural synthesis: effects of recording environment and measurement procedure

Externalization in binaural synthesis: effects of recording environment and measurement procedure Externalization in binaural synthesis: effects of recording environment and measurement procedure F. Völk, F. Heinemann and H. Fastl AG Technische Akustik, MMK, TU München, Arcisstr., 80 München, Germany

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Source Localisation Mapping using Weighted Interaural Cross-Correlation

Source Localisation Mapping using Weighted Interaural Cross-Correlation ISSC 27, Derry, Sept 3-4 Source Localisation Mapping using Weighted Interaural Cross-Correlation Gavin Kearney, Damien Kelly, Enda Bates, Frank Boland and Dermot Furlong. Department of Electronic and Electrical

More information

Nonuniform multi level crossing for signal reconstruction

Nonuniform multi level crossing for signal reconstruction 6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven

More information

Personalized 3D sound rendering for content creation, delivery, and presentation

Personalized 3D sound rendering for content creation, delivery, and presentation Personalized 3D sound rendering for content creation, delivery, and presentation Federico Avanzini 1, Luca Mion 2, Simone Spagnol 1 1 Dep. of Information Engineering, University of Padova, Italy; 2 TasLab

More information

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES J. Rauhala, The beating equalizer and its application to the synthesis and modification of piano tones, in Proceedings of the 1th International Conference on Digital Audio Effects, Bordeaux, France, 27,

More information

HTTP Compression for 1-D signal based on Multiresolution Analysis and Run length Encoding

HTTP Compression for 1-D signal based on Multiresolution Analysis and Run length Encoding 0 International Conference on Information and Electronics Engineering IPCSIT vol.6 (0) (0) IACSIT Press, Singapore HTTP for -D signal based on Multiresolution Analysis and Run length Encoding Raneet Kumar

More information

Feasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants

Feasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants Feasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants Zhi Zhu, Ryota Miyauchi, Yukiko Araki, and Masashi Unoki School of Information Science, Japan Advanced

More information

IMPROVED COCKTAIL-PARTY PROCESSING

IMPROVED COCKTAIL-PARTY PROCESSING IMPROVED COCKTAIL-PARTY PROCESSING Alexis Favrot, Markus Erne Scopein Research Aarau, Switzerland postmaster@scopein.ch Christof Faller Audiovisual Communications Laboratory, LCAV Swiss Institute of Technology

More information

Audio Enhancement Using Remez Exchange Algorithm with DWT

Audio Enhancement Using Remez Exchange Algorithm with DWT Audio Enhancement Using Remez Exchange Algorithm with DWT Abstract: Audio enhancement became important when noise in signals causes loss of actual information. Many filters have been developed and still

More information

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

SGN Audio and Speech Processing

SGN Audio and Speech Processing Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 1, 21 http://acousticalsociety.org/ ICA 21 Montreal Montreal, Canada 2 - June 21 Psychological and Physiological Acoustics Session appb: Binaural Hearing (Poster

More information

Adaptive Filters Application of Linear Prediction

Adaptive Filters Application of Linear Prediction Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing

More information

On distance dependence of pinna spectral patterns in head-related transfer functions

On distance dependence of pinna spectral patterns in head-related transfer functions On distance dependence of pinna spectral patterns in head-related transfer functions Simone Spagnol a) Department of Information Engineering, University of Padova, Padova 35131, Italy spagnols@dei.unipd.it

More information

THE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING

THE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING THE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING Ryan Stables [1], Dr. Jamie Bullock [2], Dr. Cham Athwal [3] [1] Institute of Digital Experience, Birmingham City University,

More information

Auditory Localization

Auditory Localization Auditory Localization CMPT 468: Sound Localization Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University November 15, 2013 Auditory locatlization is the human perception

More information

An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets

An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets Proceedings of the th WSEAS International Conference on Signal Processing, Istanbul, Turkey, May 7-9, 6 (pp4-44) An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets

More information

A classification-based cocktail-party processor

A classification-based cocktail-party processor A classification-based cocktail-party processor Nicoleta Roman, DeLiang Wang Department of Computer and Information Science and Center for Cognitive Science The Ohio State University Columbus, OH 43, USA

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, -7 SEPTEMBER 007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES PACS: 43.66.Qp, 43.66.Pn, 43.66Ba Iida, Kazuhiro 1 ; Itoh, Motokuni

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O.

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Tone-in-noise detection: Observed discrepancies in spectral integration Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Box 513, NL-5600 MB Eindhoven, The Netherlands Armin Kohlrausch b) and

More information

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Mikko Parviainen 1 and Tuomas Virtanen 2 Institute of Signal Processing Tampere University

More information

Convention Paper Presented at the 112th Convention 2002 May Munich, Germany

Convention Paper Presented at the 112th Convention 2002 May Munich, Germany Audio Engineering Society Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany 5627 This convention paper has been reproduced from the author s advance manuscript, without

More information

Perceptual Distortion Maps for Room Reverberation

Perceptual Distortion Maps for Room Reverberation Perceptual Distortion Maps for oom everberation Thomas Zarouchas 1 John Mourjopoulos 1 1 Audio and Acoustic Technology Group Wire Communications aboratory Electrical Engineering and Computer Engineering

More information