WAVELET-BASED SPECTRAL SMOOTHING FOR HEAD-RELATED TRANSFER FUNCTION FILTER DESIGN
|
|
- Nelson Gardner
- 5 years ago
- Views:
Transcription
1 WAVELET-BASE SPECTRAL SMOOTHING FOR HEA-RELATE TRANSFER FUNCTION FILTER ESIGN HUSEYIN HACIHABIBOGLU, BANU GUNEL, AN FIONN MURTAGH Sonic Arts Research Centre (SARC), Queen s University Belfast, Belfast, BT 1NN, U.K. {h.hacihabiboglu, b.gunel, f.murtagh}@qub.ac.uk Three wavelet-based spectral smoothing techniques are presented in this paper as a pre-processing stage for headrelated transfer function (HRTF) filter design. These wavelet-based methods include wavelet denoising, wavelet approximation, and redundant wavelet transform. These methods are used with time-domain parametric filter design methods to reduce the order of the IIR filters which is useful for real-time implementation of immersive audio systems. Results of a subjective listening test are then presented in order to justify the perceptual validity of the investigated smoothing methods. Results show that wavelet based spectral smoothing methods are beneficial in reducing the filter order and increasing the perception of localization without introducing noticeable effect on timbre. INTROUCTION Spectral shaping of sound on its way from a fixed point in free field to an external ear is defined by a headrelated transfer function (HRTF). ata reduction of HRTFs becomes an important issue when dealing with virtual reality applications, spatialization of audio and binaural synthesis in real time. ifferent methods for data complexity reduction including downsampling, rectangular windowing [1], and principal component analysis [2] have been proposed. The data-reduced form of an HRTF is then used for filter design. An essential issue in HRTF filter design for perceptual validity is preservation of the binaural hearing cues. A better approach uses auditory criteria for smoothing [3]. Frequency warping [4] has been used as an auditory weighting method []. A review of auditory weighting is given in Huopaniemi et al. []. The smoothing methods presented in this paper depend on wavelet transforms. In Section 1, a summary of wavelet de-noising, wavelet approximation and redundant wavelet transform is presented. The application of these smoothing methods to individual HRTFs is discussed together with the filter design issues in Section 2. Results of the subjective listening test are presented in Section 3. 1 REVIEW OF WAVELET SMOOTHING The smoothing methods employed in this paper utilize the discrete wavelet transform (WT). A general review of wavelets can be found in Mallat []. WT can be used for smoothing a given signal in a number of different ways. Three possibilities are wavelet denoising, wavelet approximation and redundant wavelet transform. In the scope of this paper, the signals to be smoothed are the magnitude responses of the individual HRTFs. 1.1 Wavelet enoising with SURE Assuming that the magnitude response of an HRTF is a noisy version of a smoother function, we can use wavelet denoising to recover an approximate, smoother function. The wavelet denoising is defined as a three-step process [] [9]. If we assume that the magnitude response of the actual HRTF has additive white Gaussian noise such that h = f t ) + z i where ( (1) i done by following the steps below: i 2 z i = N(, σ ), then the wavelet denoising is 1) WT is performed on the noisy data. 2) A soft-threshold non-linearity is applied to the noisy wavelet coefficients to find the wavelet coefficient estimates. η ( w) = sgn( w)( w t) I( w t) (2) t > The threshold t in this equation is calculated with the SURE (Stein s Unbiased Risk Estimation) algorithm. AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 1
2 3) The denoised function estimate (t) is obtained by inverting the WT using the wavelet coefficient estimates found in the previous step. Validity of the assumption of noise characteristics for HRTF data will be discussed in Section Wavelet Approximation Another assumption regarding HRTF data is that small deviations are not critically important and can accordingly be discarded [2]. After the WT is performed on a signal of length 2 N, approximation coefficients at level L will have the length 2 N-L. The signal approximation is low-pass interpolated to get a smooth approximation of the actual HRTF magnitude response. Smoothness of the final representation also depends on the selection of the wavelet type and the depth of the tree. A scaling function with a higher attenuation in the first side-lobe captures better low frequency approximation. f The smoothing and filtering procedures described in the subsequent sections have been applied to the HRTFs for all the azimuth and elevation values for both types of the pinnae. 2.2 Smoothing with Wavelet enoising The wavelet denoising is performed with the SURE algorithm to select the soft-threshold. ifferent wavelet functions show different denoising performances with the same denoising scheme. Therefore, denoising was performed for different wavelets selected from a wellknown group consisting of Haar, discrete Meyer, aubechies 2-9, Coiflet 1- and Symmlet 1-1 wavelets. A 3-level WT was used in all of the cases. The results of the smoothing for magnitude response of the HRTF corresponding to the left ear at zero-azimuth (ϕ= ) and zero-elevation (δ= ) is given below (see Fig. 1). 1.3 Redundant Wavelet Transform If the sub-sampling steps are discarded from the WT structure, a better low-pass approximation can be obtained. In the case of the HRTFs, significant coarse details of the frequency notches and the low frequency parts of the magnitude response are successfully captured. Results of the redundant wavelet transform depend on the selection of the wavelet type and the depth of the wavelet tree as in the case of wavelet approximation. Selection of a scaling function with a high attenuation in the first side-lobe is also important for this type of smoothing. 2 HRTF SMOOTHING USING WAVELETS 2.1 HRTF ata The wavelet-based smoothing methods mentioned in the previous section are applied to the magnitude responses of the non-individualized HRTFs sampled at three different locations. The HRTF sets are selected from the free-field equalized HRTF measurements made on the KEMAR mannequin by Martin and Gardner [1]. The Martin-Gardner HRTF set includes the HRTF measurements for two types of pinnae. A property of these data sets is that they are already presented as 12 sample series, after an acceptable amount of data is discarded. Before the smoothing process, magnitude responses of the individual HRTFs were equalized to discard the effect of the loudspeaker used in the HRTF measurements Figure 1: Original and denoised magnitude responses with db4, sym2, and coif3 from top to bottom, respectively It may be noted that wavelet denoising with SURE does not provide a sufficiently smooth result. Overfitting of the original by the wavelet-denoised data prevents the reduction of the order of filters in the design phase. This result is a consequence of the false assumption that the irregularities in the magnitude responses may be modeled as additive white Gaussian noise. Therefore wavelet denoising with SURE was not included as a smoothing method to be investigated in the listening test. 2.3 Smoothing with Wavelet Approximation The equalized magnitude response can be smoothed by obtaining the low-frequency approximation using the WT. The wavelet approximation of the signal was computed using the Symmlet 1 filter bank. A 3-level WT was performed on the 12-sample magnitude response. The AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 2
3 wavelet approximation, which is 4 samples long, was low-pass interpolated to get a smoother version of the signal. An alternative to this approach may be sub-sampling the signal by and low-pass interpolating with samples between each obtained result. However, this alternative results in loss of crucial data and gives minor frequency peaks and notches (see the selected area in Fig. 2) which are not significant in the original data. This decreases the success of smoothing in reducing the filter order. 2. Filter esign The equalized and smoothed magnitude responses were first converted into the time-domain. A minimum-phase version of the signal was then obtained by the minimum-phase reconstruction using real cepstrum [11]. A minimum-phase IIR filter with the same number of poles and zeros was designed for each HRTF with Prony s method. Frequency dependence of the interaural time delay (IT) was assumed to be perceptually irrelevant [2]. Thus the IT was supplied by a constant delay line PERCEPTUAL VALIATION OF WAVELET- BASE SMOOTHING Perceptual validation of our results was carried out using a subjective listening test. We used intraural, earplug style Sony headphones, and carried out the tests in a recording studio in the Music epartment of Queen s University Belfast Figure 2: Original (middle), wavelet approximation smoothing (top), and subsampling/interpolation smoothing (bottom) (ϕ=, δ= ) 2.4 Smoothing with Redundant Transform The redundant wavelet transform described in the previous section was applied to the magnitude responses of the HRTFs. The filter-bank used was the Symmlet 1 filter bank. As the approximation obtained using the redundant wavelet transform consists of 12 samples, low-pass interpolation is no longer necessary. Thus, the results capture the frequency peaks and notches better than wavelet approximation (see Fig. 3) Figure 3: Original (top) and redundant wavelet transform smoothing (bottom) (ϕ=, δ= ) 3.1 Subjects Eleven subjects (3 female, male) aged 19-3 served as voluntary participants. All had normal hearing and no history of hearing problems. Five of the subjects had previous experience of specialized listening. 3.2 Stimulus We used a pink noise sample with a length of one second with ms ramps in the onset and the offset portions. 3.3 Procedure Three HRTF sets representing three spatial positions were used in the listening test. The HRTF pairs we used were selected to represent both types of pinnae for 3 varying positions of azimuth (ϕ) and elevation (δ): ϕ= and δ= for small pinnae. ϕ=3 and δ= for small pinnae. ϕ=24 and δ=-1 for large red pinnae. This selection was rather arbitrary but it was made sure that the positions were far enough from each other to prevent bias in the listening test due to the previous location of the sound. This means that the ripples, peaks and notches of the HRTFs are quite different from each other (see Fig. 4 and Fig. ). The subjects were seated in the acoustically insulated recording studio. Tests for each of the three HRTF sets were carried out separately for each subject. Test signals were presented in pairs. The first signal was always the reference signal obtained by the convolution of the stimulus signal with the original HRTF set. The second signal was one of the following three signals selected randomly: AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 3
4 Pink noise filtered with the IIR filter which was directly designed from the HRTF data in the timedomain without any smoothing (). Pink noise filtered with the IIR filter after smoothing with the wavelet approximation (). Pink noise filtered with the IIR filter after smoothing with the redundant wavelet transform () AE A3E A-1E24 Figure 4: Tested HRTFs for the right ear plotted with db difference between them. AE A3E A-1E24 The test program, designed with MATLAB was similar to that used by Huopaniemi et al. [12]. The subjects were verbally instructed on the usage of the test program beforehand. The program selected the sound pairs from the test set randomly. Each sound pair was played with. sec time interval between the first and the second sounds. Subjects evaluated the sound pairs by similarity of timbre and localization on a scale of 1- (very different - no difference) quantized to unit intervals of.1. Subjects were given the opportunity to replay the sound pairs. The test lasted for -3 minutes per subject. The positions at which the subjects actually heard the stimuli were not investigated due to the fact that the HRTFs used in the test were not individualized HRTFs of the subjects. 3.4 Results The results were analyzed using the SPSS software package. The independent variables were method, subject, filter order, and angle. The dependent variables were timbre and localization scores obtained from the subjective listening test. The redundant wavelet transform outperformed the wavelet approximation and the direct design in terms of the localization scores for different filter orders. It may be noted that even for a filter order as low as 1, the localization is satisfactorily high. The wavelet approximation improved the localization compared to the direct design as well (see Fig. ) Figure : Tested HRTFs for the left ear plotted with db difference between them. All of the tested filters were calculated for the orders of,,,, 9, 1, 12, 14, 1, 1,, 24, 2, 32 and 3 for the HRTFs sampled at three locations. Thus, each subject was presented a total of 13 test pairs, which were equalized for the headphone and the ear canal resonance. 9% CI Localisation Order Figure : Filter Order vs. Mean Localization graphs for the three methods (,, and ) The timbre scores follow an increasing trend for increasing filter orders as expected. However, it is not possible to comment on the type of method which is better in preserving the timbre (see Fig. ). AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio 4
5 . wavelet transform was found to be satisfactory even for very low filter orders (see Fig. 9) % CI Timbre Order Figure : Filter Order vs. Mean Timbre graphs for the three methods (, and ) The redundant wavelet transform was the preferred choice for all of the subjects in terms of the mean localization scores. The wavelet approximation was found to be superior over the direct design. This result is another proof that the wavelet-based smoothing methods provide better localization cues (see Fig. ).. Count Order Figure 9: Filter Order vs. Count for the three methods for (Timbre 3.) and (Localization 4.) The localization accuracy depends on the location of the sound source [13]. The redundant wavelet transform smoothing performed well for all of the investigated HRTFs giving mean localization scores over 4. (see Fig. 1) % CI Localisation Subjects Figure : Subject vs. Mean Localization graphs for the three methods (, and ) The correlation coefficient between the timbre and the localization was calculated as.19 using Spearman s rho test. This low correlation can be due to the fact that the HRTFs include both the timbre and the localization information []. Overall mean values for the timbre score and the localization score were found to be 3.4 and 3.3 respectively. Therefore the cases where the timbre score is greater than 3. and the localization score is greater than 4. can be taken as acceptable. These cases were analyzed with respect to the filter orders for the three smoothing methods. The redundant 9% CI Localisation Angle AE A24E-1 A3E Figure 1: Angle vs. Mean Localization graphs for three methods (, and ) 4 CONCLUSIONS AN FUTURE WORK In this work, we analyzed the effects of three waveletbased methods for spectral smoothing for HRTF filter design. The results show that the wavelet-based smoothing methods can help in reducing the filter order for IIR filters. The wavelet approximation outperforms the direct design in most of the cases. The redundant wavelet transform increases the localization accuracy AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio
6 without substantially decreasing the timbre quality. Its robustness for different source locations provides evidence for its usefulness for HRTF filter design for immersive audio applications. The opportunities for future work include investigation of the different thresholding strategies to be used with wavelet denoising compatible with the human auditory system. The detail coefficients of the WT will be used to restore the details of the HRTFs within a real-time system utilizing a head tracker. Restoring the details of the HRTFs can be useful in smoothly and gradually increasing the level of timbral detail when the position of the listener s head is stable. ACKNOWLEGEMENTS The authors gratefully acknowledge the assistance of Mr. Chris Corrigan in the organization of the subjective listening test. We would also like to thank the participants of the test. REFERENCES [1]. R. Begault, 3- Sound for Virtual Reality and Multimedia (Academic Press, London, 1994), pp [2] F. L. Wightman and. J. Kistler, "A Model of Head-Related Transfer Function Based on Principal Components Analysis and Minimumphase Reconstruction," J. Acoust. Soc. Am., vol. 91, pp , (1992) []. L. onoho, and I. M. Johnstone, Adapting to Unknown Smoothness via Wavelet Shrinkage, J. Am. Statist. Ass., vol. 9, pp , (199) [9]. L. onoho, "enoising by Soft-thresholding," IEEE Trans. Inf. Theory, vol. 41, pp. 13-2, (199) [1] W. G. Gardner and K.. Martin. "HRTF Measurements of a KEMAR," J. Acoust. Soc. Am., vol. 9, pp , (199) [11] A. V. Oppenheim and R. W. Schafer, iscrete- Time Signal Processing (Prentice-Hall, N.J., 199) pp. 3- [12] J. Huopaniemi, N. Zacharov, and M. Karjalainen. "Objective and Subjective Evaluation of Head- Related Transfer Function Filter esign," J. Audio Eng. Soc., vol. 4, no. 4., pp , (1999) [13] J. Blauert, Spatial Hearing : The Psychophysics of Human Sound Localization (M.I.T. Press, Cambridge, MA, 193), pp [3] J. Huopaniemi and M. Karjalainen, "HRTF Filter esign Based on Auditory Criteria," in Proc. Nordic Acoustical Meeting (NAM'9), (Helsinki, Finland, 199, June 12-14), pp [4] J. Huopaniemi and M. Karjalainen. Comparison of igital Filter esign s for 3- Sound, in Proc. IEEE Nordic Signal Processing Symp. (NORSIG'9), (Espoo, Finland, 199), pp [] J. Smith and J. Abel, Bark and ERB Bilinear Transforms, IEEE Transactions on Speech and Audio Processing, vol., iss., pp. 9, (1999) [] J. Huopaniemi and M. Karjalainen. Review of igital Filter esign and Implementation s for 3- Sound, in Proc. of the 12 nd Convention of the Audio Engineering Society, Preprint 441, (Munich, Germany, 199) [] S. Mallat, A Wavelet Tour of Signal Processing, (Academic Press, London, 199) AES 22 nd International Conference on Virtual, Synthetic and Entertainment Audio
A binaural auditory model and applications to spatial sound evaluation
A binaural auditory model and applications to spatial sound evaluation Ma r k o Ta k a n e n 1, Ga ë ta n Lo r h o 2, a n d Mat t i Ka r ja l a i n e n 1 1 Helsinki University of Technology, Dept. of Signal
More informationEnhancing 3D Audio Using Blind Bandwidth Extension
Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,
More informationPsychoacoustic Cues in Room Size Perception
Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany 6084 This convention paper has been reproduced from the author s advance manuscript, without editing,
More informationWARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS
NORDIC ACOUSTICAL MEETING 12-14 JUNE 1996 HELSINKI WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS Helsinki University of Technology Laboratory of Acoustics and Audio
More informationThe analysis of multi-channel sound reproduction algorithms using HRTF data
The analysis of multichannel sound reproduction algorithms using HRTF data B. Wiggins, I. PatersonStephens, P. Schillebeeckx Processing Applications Research Group University of Derby Derby, United Kingdom
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Psychological and Physiological Acoustics Session 2aPPa: Binaural Hearing
More informationA triangulation method for determining the perceptual center of the head for auditory stimuli
A triangulation method for determining the perceptual center of the head for auditory stimuli PACS REFERENCE: 43.66.Qp Brungart, Douglas 1 ; Neelon, Michael 2 ; Kordik, Alexander 3 ; Simpson, Brian 4 1
More informationIII. Publication III. c 2005 Toni Hirvonen.
III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on
More informationA Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations
A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations György Wersényi Széchenyi István University, Hungary. József Répás Széchenyi István University, Hungary. Summary
More informationORIENTATION IN SIMPLE VIRTUAL AUDITORY SPACE CREATED WITH MEASURED HRTF
ORIENTATION IN SIMPLE VIRTUAL AUDITORY SPACE CREATED WITH MEASURED HRTF F. Rund, D. Štorek, O. Glaser, M. Barda Faculty of Electrical Engineering Czech Technical University in Prague, Prague, Czech Republic
More informationDirection-Dependent Physical Modeling of Musical Instruments
15th International Congress on Acoustics (ICA 95), Trondheim, Norway, June 26-3, 1995 Title of the paper: Direction-Dependent Physical ing of Musical Instruments Authors: Matti Karjalainen 1,3, Jyri Huopaniemi
More informationSpatial Audio Reproduction: Towards Individualized Binaural Sound
Spatial Audio Reproduction: Towards Individualized Binaural Sound WILLIAM G. GARDNER Wave Arts, Inc. Arlington, Massachusetts INTRODUCTION The compact disc (CD) format records audio with 16-bit resolution
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS
ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS PACS: 4.55 Br Gunel, Banu Sonic Arts Research Centre (SARC) School of Computer Science Queen s University Belfast Belfast,
More informationSound source localization and its use in multimedia applications
Notes for lecture/ Zack Settel, McGill University Sound source localization and its use in multimedia applications Introduction With the arrival of real-time binaural or "3D" digital audio processing,
More informationBINAURAL RECORDING SYSTEM AND SOUND MAP OF MALAGA
EUROPEAN SYMPOSIUM ON UNDERWATER BINAURAL RECORDING SYSTEM AND SOUND MAP OF MALAGA PACS: Rosas Pérez, Carmen; Luna Ramírez, Salvador Universidad de Málaga Campus de Teatinos, 29071 Málaga, España Tel:+34
More informationHIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM
HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand
More informationAudio Engineering Society. Convention Paper. Presented at the 131st Convention 2011 October New York, NY, USA
Audio Engineering Society Convention Paper Presented at the 131st Convention 2011 October 20 23 New York, NY, USA This Convention paper was selected based on a submitted abstract and 750-word precis that
More informationConvention Paper Presented at the 125th Convention 2008 October 2 5 San Francisco, CA, USA
Audio Engineering Society Convention Paper Presented at the 125th Convention 2008 October 2 5 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract
More informationAalborg Universitet. Audibility of time switching in dynamic binaural synthesis Hoffmann, Pablo Francisco F.; Møller, Henrik
Aalborg Universitet Audibility of time switching in dynamic binaural synthesis Hoffmann, Pablo Francisco F.; Møller, Henrik Published in: Journal of the Audio Engineering Society Publication date: 2005
More informationMultichannel level alignment, part I: Signals and methods
Suokuisma, Zacharov & Bech AES 5th Convention - San Francisco Multichannel level alignment, part I: Signals and methods Pekka Suokuisma Nokia Research Center, Speech and Audio Systems Laboratory, Tampere,
More informationAcoustics Research Institute
Austrian Academy of Sciences Acoustics Research Institute Spatial SpatialHearing: Hearing: Single SingleSound SoundSource Sourcein infree FreeField Field Piotr PiotrMajdak Majdak&&Bernhard BernhardLaback
More informationBinaural Hearing. Reading: Yost Ch. 12
Binaural Hearing Reading: Yost Ch. 12 Binaural Advantages Sounds in our environment are usually complex, and occur either simultaneously or close together in time. Studies have shown that the ability to
More informationHRTF adaptation and pattern learning
HRTF adaptation and pattern learning FLORIAN KLEIN * AND STEPHAN WERNER Electronic Media Technology Lab, Institute for Media Technology, Technische Universität Ilmenau, D-98693 Ilmenau, Germany The human
More informationListening with Headphones
Listening with Headphones Main Types of Errors Front-back reversals Angle error Some Experimental Results Most front-back errors are front-to-back Substantial individual differences Most evident in elevation
More informationHRIR Customization in the Median Plane via Principal Components Analysis
한국소음진동공학회 27 년춘계학술대회논문집 KSNVE7S-6- HRIR Customization in the Median Plane via Principal Components Analysis 주성분분석을이용한 HRIR 맞춤기법 Sungmok Hwang and Youngjin Park* 황성목 박영진 Key Words : Head-Related Transfer
More informationEvaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model
Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model Sebastian Merchel and Stephan Groth Chair of Communication Acoustics, Dresden University
More informationEnhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients
ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds
More informationAN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES
Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Verona, Italy, December 7-9,2 AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Tapio Lokki Telecommunications
More informationComputational Perception /785
Computational Perception 15-485/785 Assignment 1 Sound Localization due: Thursday, Jan. 31 Introduction This assignment focuses on sound localization. You will develop Matlab programs that synthesize sounds
More informationConvention Paper 9870 Presented at the 143 rd Convention 2017 October 18 21, New York, NY, USA
Audio Engineering Society Convention Paper 987 Presented at the 143 rd Convention 217 October 18 21, New York, NY, USA This convention paper was selected based on a submitted abstract and 7-word precis
More informationIvan Tashev Microsoft Research
Hannes Gamper Microsoft Research David Johnston Microsoft Research Ivan Tashev Microsoft Research Mark R. P. Thomas Dolby Laboratories Jens Ahrens Chalmers University, Sweden Augmented and virtual reality,
More informationThe psychoacoustics of reverberation
The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control
More informationNonlinear Filtering in ECG Signal Denoising
Acta Universitatis Sapientiae Electrical and Mechanical Engineering, 2 (2) 36-45 Nonlinear Filtering in ECG Signal Denoising Zoltán GERMÁN-SALLÓ Department of Electrical Engineering, Faculty of Engineering,
More informationStudy on method of estimating direct arrival using monaural modulation sp. Author(s)Ando, Masaru; Morikawa, Daisuke; Uno
JAIST Reposi https://dspace.j Title Study on method of estimating direct arrival using monaural modulation sp Author(s)Ando, Masaru; Morikawa, Daisuke; Uno Citation Journal of Signal Processing, 18(4):
More informationRobotic Spatial Sound Localization and Its 3-D Sound Human Interface
Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Jie Huang, Katsunori Kume, Akira Saji, Masahiro Nishihashi, Teppei Watanabe and William L. Martens The University of Aizu Aizu-Wakamatsu,
More informationVirtual Reality Presentation of Loudspeaker Stereo Recordings
Virtual Reality Presentation of Loudspeaker Stereo Recordings by Ben Supper 21 March 2000 ACKNOWLEDGEMENTS Thanks to: Francis Rumsey, for obtaining a head tracker specifically for this Technical Project;
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ IA 213 Montreal Montreal, anada 2-7 June 213 Psychological and Physiological Acoustics Session 3pPP: Multimodal Influences
More informationTu1.D II Current Approaches to 3-D Sound Reproduction. Elizabeth M. Wenzel
Current Approaches to 3-D Sound Reproduction Elizabeth M. Wenzel NASA Ames Research Center Moffett Field, CA 94035 Elizabeth.M.Wenzel@nasa.gov Abstract Current approaches to spatial sound synthesis are
More informationUniversity of Huddersfield Repository
University of Huddersfield Repository Moore, David J. and Wakefield, Jonathan P. Surround Sound for Large Audiences: What are the Problems? Original Citation Moore, David J. and Wakefield, Jonathan P.
More informationBlind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings
Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Banu Gunel, Huseyin Hacihabiboglu and Ahmet Kondoz I-Lab Multimedia
More informationImage De-Noising Using a Fast Non-Local Averaging Algorithm
Image De-Noising Using a Fast Non-Local Averaging Algorithm RADU CIPRIAN BILCU 1, MARKKU VEHVILAINEN 2 1,2 Multimedia Technologies Laboratory, Nokia Research Center Visiokatu 1, FIN-33720, Tampere FINLAND
More informationINVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS
20-21 September 2018, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2018) 20-21 September 2018, Bulgaria INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR
More informationFrom Binaural Technology to Virtual Reality
From Binaural Technology to Virtual Reality Jens Blauert, D-Bochum Prominent Prominent Features of of Binaural Binaural Hearing Hearing - Localization Formation of positions of the auditory events (azimuth,
More informationComputational Perception. Sound localization 2
Computational Perception 15-485/785 January 22, 2008 Sound localization 2 Last lecture sound propagation: reflection, diffraction, shadowing sound intensity (db) defining computational problems sound lateralization
More informationSubband Analysis of Time Delay Estimation in STFT Domain
PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,
More informationVIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION
ARCHIVES OF ACOUSTICS 33, 4, 413 422 (2008) VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION Michael VORLÄNDER RWTH Aachen University Institute of Technical Acoustics 52056 Aachen,
More informationA spatial squeezing approach to ambisonic audio compression
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 A spatial squeezing approach to ambisonic audio compression Bin Cheng
More informationSIMULATION OF SMALL HEAD-MOVEMENTS ON A VIRTUAL AUDIO DISPLAY USING HEADPHONE PLAYBACK AND HRTF SYNTHESIS. György Wersényi
SIMULATION OF SMALL HEAD-MOVEMENTS ON A VIRTUAL AUDIO DISPLAY USING HEADPHONE PLAYBACK AND HRTF SYNTHESIS György Wersényi Széchenyi István University Department of Telecommunications Egyetem tér 1, H-9024,
More informationPERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS
PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS Myung-Suk Song #1, Cha Zhang 2, Dinei Florencio 3, and Hong-Goo Kang #4 # Department of Electrical and Electronic, Yonsei University Microsoft Research 1 earth112@dsp.yonsei.ac.kr,
More informationDECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett
04 DAFx DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS Guillaume Potard, Ian Burnett School of Electrical, Computer and Telecommunications Engineering University
More informationTranscription of Piano Music
Transcription of Piano Music Rudolf BRISUDA Slovak University of Technology in Bratislava Faculty of Informatics and Information Technologies Ilkovičova 2, 842 16 Bratislava, Slovakia xbrisuda@is.stuba.sk
More informationWAVELET SIGNAL AND IMAGE DENOISING
WAVELET SIGNAL AND IMAGE DENOISING E. Hošťálková, A. Procházka Institute of Chemical Technology Department of Computing and Control Engineering Abstract The paper deals with the use of wavelet transform
More informationOptimizing a High-Order Graphic Equalizer for Audio Processing
Powered by TCPDF (www.tcpdf.org) This is an electronic reprint of the original article. This reprint may differ from the original in pagination and typographic detail. Author(s): Rämö, J.; Välimäki, V.
More informationTHE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS
PACS Reference: 43.66.Pn THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS Pauli Minnaar; Jan Plogsties; Søren Krarup Olesen; Flemming Christensen; Henrik Møller Department of Acoustics Aalborg
More informationNAME STUDENT # ELEC 484 Audio Signal Processing. Midterm Exam July Listening test
NAME STUDENT # ELEC 484 Audio Signal Processing Midterm Exam July 2008 CLOSED BOOK EXAM Time 1 hour Listening test Choose one of the digital audio effects for each sound example. Put only ONE mark in each
More information396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011
396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence
More informationANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES. M. Shahnawaz, L. Bianchi, A. Sarti, S.
ANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES M. Shahnawaz, L. Bianchi, A. Sarti, S. Tubaro Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico
More informationImproving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis
Improving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis John Garas and Piet C.W. Sommen Eindhoven University of Technology Ehoog 6.34, P.O.Box 513 5 MB Eindhoven,
More informationWavelet Speech Enhancement based on the Teager Energy Operator
Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose
More informationA Parametric Model for Spectral Sound Synthesis of Musical Sounds
A Parametric Model for Spectral Sound Synthesis of Musical Sounds Cornelia Kreutzer University of Limerick ECE Department Limerick, Ireland cornelia.kreutzer@ul.ie Jacqueline Walker University of Limerick
More informationConvention Paper Presented at the 139th Convention 2015 October 29 November 1 New York, USA
Audio Engineering Society Convention Paper Presented at the 139th Convention 2015 October 29 November 1 New York, USA 9447 This Convention paper was selected based on a submitted abstract and 750-word
More informationFinal Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015
Final Exam Study Guide: 15-322 Introduction to Computer Music Course Staff April 24, 2015 This document is intended to help you identify and master the main concepts of 15-322, which is also what we intend
More informationMAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION
Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Limerick, Ireland, December 6-8, MAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION Federico Fontana University of Verona
More informationMPEG-4 Structured Audio Systems
MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationKhlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter
Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter Korakoch Saengrattanakul Faculty of Engineering, Khon Kaen University Khon Kaen-40002, Thailand. ORCID: 0000-0001-8620-8782 Kittipitch Meesawat*
More informationFrom acoustic simulation to virtual auditory displays
PROCEEDINGS of the 22 nd International Congress on Acoustics Plenary Lecture: Paper ICA2016-481 From acoustic simulation to virtual auditory displays Michael Vorländer Institute of Technical Acoustics,
More informationSound Source Localization using HRTF database
ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,
More informationEstimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation
Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation Sampo Vesa Master s Thesis presentation on 22nd of September, 24 21st September 24 HUT / Laboratory of Acoustics
More informationVirtual Acoustic Space as Assistive Technology
Multimedia Technology Group Virtual Acoustic Space as Assistive Technology Czech Technical University in Prague Faculty of Electrical Engineering Department of Radioelectronics Technická 2 166 27 Prague
More informationMultimedia Signal Processing: Theory and Applications in Speech, Music and Communications
Brochure More information from http://www.researchandmarkets.com/reports/569388/ Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Description: Multimedia Signal
More informationExternalization in binaural synthesis: effects of recording environment and measurement procedure
Externalization in binaural synthesis: effects of recording environment and measurement procedure F. Völk, F. Heinemann and H. Fastl AG Technische Akustik, MMK, TU München, Arcisstr., 80 München, Germany
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationSource Localisation Mapping using Weighted Interaural Cross-Correlation
ISSC 27, Derry, Sept 3-4 Source Localisation Mapping using Weighted Interaural Cross-Correlation Gavin Kearney, Damien Kelly, Enda Bates, Frank Boland and Dermot Furlong. Department of Electronic and Electrical
More informationNonuniform multi level crossing for signal reconstruction
6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven
More informationPersonalized 3D sound rendering for content creation, delivery, and presentation
Personalized 3D sound rendering for content creation, delivery, and presentation Federico Avanzini 1, Luca Mion 2, Simone Spagnol 1 1 Dep. of Information Engineering, University of Padova, Italy; 2 TasLab
More informationTHE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES
J. Rauhala, The beating equalizer and its application to the synthesis and modification of piano tones, in Proceedings of the 1th International Conference on Digital Audio Effects, Bordeaux, France, 27,
More informationHTTP Compression for 1-D signal based on Multiresolution Analysis and Run length Encoding
0 International Conference on Information and Electronics Engineering IPCSIT vol.6 (0) (0) IACSIT Press, Singapore HTTP for -D signal based on Multiresolution Analysis and Run length Encoding Raneet Kumar
More informationFeasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants
Feasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants Zhi Zhu, Ryota Miyauchi, Yukiko Araki, and Masashi Unoki School of Information Science, Japan Advanced
More informationIMPROVED COCKTAIL-PARTY PROCESSING
IMPROVED COCKTAIL-PARTY PROCESSING Alexis Favrot, Markus Erne Scopein Research Aarau, Switzerland postmaster@scopein.ch Christof Faller Audiovisual Communications Laboratory, LCAV Swiss Institute of Technology
More informationAudio Enhancement Using Remez Exchange Algorithm with DWT
Audio Enhancement Using Remez Exchange Algorithm with DWT Abstract: Audio enhancement became important when noise in signals causes loss of actual information. Many filters have been developed and still
More informationPerceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter
Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationFrequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement
Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationSGN Audio and Speech Processing
Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 1, 21 http://acousticalsociety.org/ ICA 21 Montreal Montreal, Canada 2 - June 21 Psychological and Physiological Acoustics Session appb: Binaural Hearing (Poster
More informationAdaptive Filters Application of Linear Prediction
Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing
More informationOn distance dependence of pinna spectral patterns in head-related transfer functions
On distance dependence of pinna spectral patterns in head-related transfer functions Simone Spagnol a) Department of Information Engineering, University of Padova, Padova 35131, Italy spagnols@dei.unipd.it
More informationTHE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING
THE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING Ryan Stables [1], Dr. Jamie Bullock [2], Dr. Cham Athwal [3] [1] Institute of Digital Experience, Birmingham City University,
More informationAuditory Localization
Auditory Localization CMPT 468: Sound Localization Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University November 15, 2013 Auditory locatlization is the human perception
More informationAn Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
Proceedings of the th WSEAS International Conference on Signal Processing, Istanbul, Turkey, May 7-9, 6 (pp4-44) An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
More informationA classification-based cocktail-party processor
A classification-based cocktail-party processor Nicoleta Roman, DeLiang Wang Department of Computer and Information Science and Center for Cognitive Science The Ohio State University Columbus, OH 43, USA
More information19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES
19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, -7 SEPTEMBER 007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES PACS: 43.66.Qp, 43.66.Pn, 43.66Ba Iida, Kazuhiro 1 ; Itoh, Motokuni
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationTone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O.
Tone-in-noise detection: Observed discrepancies in spectral integration Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O. Box 513, NL-5600 MB Eindhoven, The Netherlands Armin Kohlrausch b) and
More informationTwo-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling
Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Mikko Parviainen 1 and Tuomas Virtanen 2 Institute of Signal Processing Tampere University
More informationConvention Paper Presented at the 112th Convention 2002 May Munich, Germany
Audio Engineering Society Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany 5627 This convention paper has been reproduced from the author s advance manuscript, without
More informationPerceptual Distortion Maps for Room Reverberation
Perceptual Distortion Maps for oom everberation Thomas Zarouchas 1 John Mourjopoulos 1 1 Audio and Acoustic Technology Group Wire Communications aboratory Electrical Engineering and Computer Engineering
More information