Convention Paper Presented at the 131st Convention 2011 October New York, USA

Size: px
Start display at page:

Download "Convention Paper Presented at the 131st Convention 2011 October New York, USA"

Transcription

1 Audio Engineering Society Convention Paper Presented at the 131st Convention 211 October 2 23 New York, USA This paper was peer-reviewed as a complete manuscript for presentation at this Convention. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 6 East 42 nd Street, New York, New York , USA; also see All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society. Head orientation tracking using binaural headset microphones Hannes Gamper, Sakari Tervo, and Tapio Lokki 1 1 Aalto University School of Science, Department of Media Technology, P.O. Box 154, FI-76, Finland Correspondence should be addressed to Hannes Gamper (first[dot]last[at]aalto.fi) ABSTRACT A head orientation tracking system using binaural headset microphones is proposed. Unlike previous approaches, the proposed method does not require anchor sources, but relies on speech signals of the wearers of the binaural headsets. From the binaural microphone signals, time of arrival (TOA) and time difference of arrival (TDOA) estimates are obtained. The tracking is performed using a particle filter integrated with a maximum likelihood estimation function. In a case study, the proposed method is used to track the head orientations of three conferees in a meeting scenario. With an accuracy of about 1 degrees, the proposed method is shown to outperform a reference method which achieves an accuracy of about 35 degrees. 1. INTRODUCTION Augmented reality applications aim at embedding virtual content into the perception of the real world. To produce realistic auditory augmentation, knowledge about the head orientation of the user is often necessary. Conventional head tracking systems use e.g. cameras to track visibly distinct markers or inertial sensors to detect head movements. Here we propose a head orientation tracking method for a teleconferencing scenario that employs microphone signals captured at the ears of the conferees. Binaural headsets with integrated microphones are used in audio augmented reality applications to enable the display of virtual audio content, whilst leaving the transducers acoustically transparent [1]. In a teleconference scenario, wearing such headsets allows the simultaneous perception of real and virtual participants. Head orientation tracking with binaural headset microphones has previously been implemented using anchor sources at known positions [2]. In the method presented here, the head orientation tracking is based on the speech signals of the conferees. Thus, anchor sources are not required. The locations of the conferees are assumed to be known. Alternatively, the positions of the conferees can be estimated for example via acoustic source localisation and tracking [3, 4].

2 The proposed tracking method is based on time of arrival (TOA) and time difference of arrival (TDOA) estimates of speech signals. TDOA estimates are obtained from the binaural microphone signals recorded at the ears of the active speaker S and one listener L (cf. Fig. 1). From these TDOAs, the TOAs at the ears of listener L can be estimated by assuming a constant propagation delay from the acoustic centre to the ears of speaker S. Particle filtering is applied to the maximum likelihood estimation function of the TOAs and TDOAs to track the head orientation ϕ H of listener L. As a case study, a conference scenario with three participants is presented. In the experiment, each participant wore a binaural headset with integrated microphones. The head orientations of the conferees were tracked using the speech signals recorded from each conferee. The results are compared to a reference method from the literature. 2. PROPOSED METHOD 2.1. Geometrical quantities and TDOA estimation This work concentrates on tracking the head orientation of the listener. The positions of the speaker S and the listener L are assumed to be known. A schematic view of the problem is shown in Fig. 1. The head orientation tracking is based on time of arrival (TOA) and time difference of arrival (TDOA) estimates. A TDOA is the difference between two TOAs: τ i,j (φ) = t i (φ) t j (φ). (1) The TOA between source position S and the right binaural microphone at listener position L r is expressed with respect to the angle ϕ (cf. Fig. 1) as t r (ϕ) = c 1 D r,m = c 1 L r S = c 1 (D L,M cos(ϕ) D H ) 2 + (D L,M sin(ϕ)) 2, (2) where D H is the head radius. The distance between the listener and the acoustic centre of the speaker (i.e., the mouth of the speaker) is approximated as D L,M L S. The speed of sound is denoted by c and assumed to be constant, c = 345 m/s. y L l L φ H φ φ L,S L r D L,M S r M D r,m S D M-E Fig. 1: Schematic view of the head orientation estimation problem. The head orientation of the listener L is denoted with ϕ H. The mouth M of the speaker S is assumed to be the acoustic centre. D M E denotes the mouth-to-ear distance. The distance of the mouth to L is given by D L,M, and the distance of the mouth to the right ear of L is denoted as D r,m. S l, S r, L l and L r indicate the left and right microphone signals of the speaker and listener. Similarly, the TOA for the left binaural microphone of the listener at location L l is calculated as t l (ϕ) = c 1 D l,m = c 1 L l S = S l c 1 (D L,M cos(ϕ) + D H ) 2 + (D L,M sin(ϕ)) 2. x (3) Eqs. (2) and (3) yield two solutions for the angle ϕ. One solution can be discarded by assuming that the listener is facing the speaker, hence the speaker is located in the frontal hemisphere of the listener. The head orientation of the listener ϕ H can be derived from the angle ϕ and the angle between the listener and speaker position ϕ L,S as ϕ H = ϕ L,S ϕ + π 2. (4) The TDOAs are estimated using the generalised correlation method with phase transform weighting Page 2 of 7

3 (GCC-PHAT) [5] ( R i,j (τ) = F 1 Xi (ω)x j (ω) ) X i (ω)x j (ω), (5) where X i (ω) is the Fourier transform of the microphone signal x i (t), ( ) denotes the complex conjugate, and F 1 ( ) denotes the inverse Fouriertransform. The maximum argument of the GCC- PHAT function is the TDOA estimate ˆτ i,j = arg max (R i,j (τ)). (6) τ The method proposed to derive TOA estimates from TDOA estimates is described in section Head orientation tracking using particle filtering Particle filtering is a technique that can be used to approximate the posterior probability density function underlying the Bayesian filtering problem [6]. Particle filtering is often used in tracking applications [3, 4], since it lowers the computational load in non-linear and non-gaussian filtering substantially. In the method presented here, particle filtering is applied to track the head orientation of the conferees. During initialisation, the particles are uniformly distributed from to 2π, for each conferee. Particle filtering consists of three steps: prediction, update, and resampling. The prediction step requires a model that predicts the movement of the source or the particles. Here, Brownian motion is assumed, i.e., the particle locations are propagated according to a random distribution [4]. To update the particles, a weight w k (n) is calculated for each of the K particles at time step n, using the likelihood model introduced in the next section. The resampling of the particles is done with stratified resampling according to the weights [7]. After the resampling, the weights are given a uniform value, w k (n) = 1/K. The head orientation estimate of each listener is the (weighted) circular mean of the particles of that listener { K } ˆϕ(n) = arg w k (n) exp(iϕ k (n)) (7) k=1 where i = 1. The reader is referred to [3] and [6] for detailed information on particle filtering Likelihood function To determine the weights of each particle, a likelihood estimation function is used. In this paper we consider the maximum likelihood estimation (MLE) function [3] P MLE (ϕ) = M {i,j}=1 p(τ i,j (ϕ) ˆτ i,j ), (8) where M is the number of binaural microphone pairs {i, j}, and ( 1 p(τ i,j (ϕ) ˆτ i,j ) = exp (τ i,j(ϕ) ˆτ i,j ) 2 ) 2πσ 2 2σ 2 is a normal distribution with variance σ = 1 and mean ˆτ i,j. Here, the errors of the TDOA estimates are assumed to be independent and identically distributed. Consider a speaker with binaural microphones S r and S l and a listener with binaural microphones L r and L l (cf. Fig. 1). The TDOA estimates between the binaural microphones are ˆτ Lr,L l, ˆτ Lr,S r, ˆτ Lr,S l, ˆτ Ll,S r, and ˆτ Ll,S l. The TDOA between the binaural microphones of the speaker is assumed to be zero, ˆτ Sr,S l. By assuming a constant TOA from the acoustic centre to the binaural microphones of the speaker t M E (mouth-to-ear TOA), we can extract the TOA at the binaural microphones of the listener from the TDOA estimates: ˆt Lr = ˆτ Lr,S r ˆt Lr = ˆτ Lr,S l ˆt Ll = ˆτ Ll,S r ˆt Ll = ˆτ Ll,S l (9) where t M E is derived from the mouth-to-ear distance D M E, which we estimated as.18 m. The MLE function for the head orientation tracking can now be written as P MLE (ϕ) = p(τ Lr,L l (ϕ) ˆτ Lr,L l ) p(t Lr (ϕ) ˆτ Lr,S r + t M E ) p(t Lr (ϕ) ˆτ Lr,S l + t M E ) p(t Ll (ϕ) ˆτ Ll,S r + t M E ) p(t Ll (ϕ) ˆτ L1,S l + t M E ) Speech and Speaker Activity Detection (1) To improve the reliability of the tracking results, tracking is only performed when speech activity is Page 3 of 7

4 P1 P3 P2 The position and head orientation of each conferee was determined using the ARToolkit [8], which allows tracking predefined markers in a video stream. The markers placed on top of each conferee were recorded using a Canon EOS 7D camera, mounted about 4 m above the scene, and tracked via the AR- Toolkit. Two methods for estimating the head orientation of the conferees were tested and compared against the ground truth data, i.e., the true head orientation of each conferee obtained from the video stream using the ARToolkit [8]: Reference method: the head orientation is determined from the TDOA estimates between the binaural microphones of the listener [2] (see Appendix for details). Fig. 2: The experimental setup of the case study. Three conferees are involved in a discussion. AR markers on the heads of the conferees are tracked using the ARToolkit [8] to obtain the true location and head orientation of each participant. detected in the binaural microphone signals. A simple measure for speech activity is the energy on the speech band (.2 8 khz). Speech activity is detected if the energy at one of the binaural headsets exceeds a certain threshold. The wearer of that headset is determined to be the currently active speaker. This simple activity detection works reasonably well if the conferees do not talk simultaneously. 3. EXPERIMENTAL SETUP In a case study, the head orientations of three conferees were tracked during a conversation in a meeting scenario. The experimental setup is depicted in Fig. 2. Each participant was wearing a binaural headset with integrated microphones of type Philips SHN25. The microphone signals were recorded at a sampling rate of 96 khz. For analysis, the signals were divided into frames of 5 ms with 45 ms overlap using a Hanning window. The recording was made in a multipurpose space with a reverberation time of about.3 s. The signal-to-noise ratio (SNR) was between 5 and 3 db during active speech frames. Proposed method: the head orientation is tracked using the TDOA estimates between the listener and speaker (Eq.(9)), maximum likelihood estimation function (Eq. (1)), and particle filtering. Speech activity and the currently active speaker were determined in each frame from the binaural microphone signals. For each conferee, the head orientation was determined or tracked in frames where speech activity from any of the other conferees was detected. 4. RESULTS AND DISCUSSION Figure 3 illustrates the tracking results for each conferee and the speech activity map. Speech activity was detected in 67% of the frames. Table 1 shows the root mean squared error (RMSE) of the head orientation tracking. It is calculated for each conferee over the frames during which tracking was performed. For all three conferees the proposed method clearly outperforms the reference method. This is partly due to the fact that the reference method estimates the head orientation based only on the TDOA estimate between the binaural microphone signals of the listener, whereas the proposed method uses also the TDOA estimates between the binaural microphones of the speaker and the listener. The fact that the particle filter takes into account the history Page 4 of 7

5 P1 P2 P3 Angle [ ] Angle [ ] Angle [ ] Grou n d tru th R e f e re n c e m e th o d Prop ose d m e th od Speech Activity P1 P2 P Ti me [s] Fig. 3: Head orientation tracking results for each of the three conferees P1, P2 and P3. The bottom graph indicates the frames were speech activity was detected. of each particle, i.e., its iterative, seems to improve the tracking performance. As can be seen in Fig. 3, P2 rotated the head the most during the meeting scenario. The RMSE is largest for P2, since a moving target generally suffers from a larger tracking error than a steady one. The tracking deteriorates in passages with large head movements or low speech activity, for instance around 15 s into the recording for P2. A key factor for the tracking performance is the signal-to-noise ratio (SNR). Fig. 4 shows the RMSE of both methods as a function of the SNR in the frames. The RMSE for each SNR value is obtained by averaging the RMSE of all three conferees over all frames with at least that SNR. As expected, the performance of both methods is better in frames with high SNR. Above 3 db SNR the performance of the reference method approaches the performance of RMSE [ ] Conferee Reference Proposed method P P P Table 1: Root mean squared error (RMSE) of the head orientation tracking for the reference and the proposed method. The results are calculated over the frames where tracking was performed. the proposed method. This implies that with high SNR a single TDOA estimate between the binaural microphones of the listener provides a reliable estimation of the head orientation, whereas the use of additional TDOA estimates in the proposed method Page 5 of 7

6 RMSE [ ] R e f e re n c e m e th o d Prop ose d m e th od the proposed method, a propagation model taking into account the particle velocity could be integrated, such as the Langevin model [9]. This would also enable blind tracking during frames without speech activity. The integration of acoustic source localisation and tracking [3, 4] to the system would allow tracking the positions of the conferees in addition to their head orientation SNR [db] Fig. 4: Root mean squared error (RMSE) versus signal-to-noise ratio (SNR). The RMSE is averaged over three conferees. yields only a minor improvement. In frames with low SNR, however, the proposed method clearly outperforms the reference method. Frames with low SNR provide weak evidence for tracking, hence in those frames the reference method fails, as it estimates the head orientation in each frame separately. The proposed method compensates for weak evidence in frames with low SNR by taking into account the tracking history, thus relying on strong tracking evidence found in frames with high SNR. Furthermore, the use of several TDOA estimates adds to the robustness of the proposed method. 5. CONCLUSION AND FUTURE WORK A method is proposed for tracking the head orientation of conferees in a meeting scenario using binaural headset microphones. In contrast to previous methods [2], the proposed method does not require anchor sources. The method is based on calculating the TOA and TDOA estimates of the speech signals between the binaural microphones of the speaker and the listeners. The tracking is implemented using a particle filter integrated with a maximum likelihood estimation function. As a proof of concept, head orientation tracking was applied to three conferees in a meeting scenario. The proposed method achieved an accuracy of about 1 degrees, while the reference method taken from the literature achieved an accuracy of about 35 degrees. To further improve the robustness and accuracy of 6. ACKNOWLEDGMENTS The research leading to these results has received funding from the Academy of Finland, project nos. [ and 14786], the European Research Council under the European Community s Seventh Framework Programme (FP7/27-213) / ERC grant agreement no. [23636], the Helsinki Graduate School in Computer Science and Engineering (HECSE), and [MIDE program] of Aalto University. 7. REFERENCES [1] A. Härmä, J. Jakka, M. Tikander, M. Karjalainen, T. Lokki, J. Hiipakka, and G. Lorho, Augmented reality audio for mobile and wearable appliances, Journal of the Audio Engineering Society, vol. 52, no. 6, pp , 24. [2] M. Tikander, A. Härmä, and M. Karjalainen, Acoustic positioning and head tracking based on binaural signals, in 116th Audio Engineering Society Convention, Berlin, Germany, 24, pp [3] E. Lehmann, Particle filtering methods for acoustic source localisation and tracking, Ph.D. dissertation, Australian National University, 24. [4] P. Pertilä, T. Korhonen, and A. Visa, Measurement combination for acoustic source localization in a room environment, EURASIP Journal on Audio, Speech, and Music Processing, vol. 28, pp. 1 14, 28. [5] C. Knapp and G. Carter, The generalized correlation method for estimation of time delay, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 24, no. 4, pp , Page 6 of 7

7 [6] M. Arulampalam, S. Maskell, N. Gordon, and T. Clapp, A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking, IEEE Transactions on Signal Processing, vol. 5, no. 2, pp , 22. [7] R. Douc and O. Cappé, Comparison of resampling schemes for particle filtering, in 4th International Symposium on Image and Signal Processing and Analysis, Zagreb, Croatia, 25, pp [8] H. Kato and M. Billinghurst, Marker tracking and hmd calibration for a video-based augmented reality conferencing system, in IEEE International Workshop on Augmented Reality, San Francisco, CA, USA, 1999, pp [9] E. Lehmann, A. Johansson, and S. Nordholm, Modeling of motion dynamics and its influence on the performance of a particle filter for acoustic speaker tracking, in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, 27, pp APPENDIX: REFERENCE METHOD The angle of incidence of a sound wave ϕ can be derived from the TDOA τ l,r measured between the microphones of a binaural headset via a TDOA model [2]: τ l,r = D head (ϕ + sin(ϕ)), (11) where D head corresponds to the head radius. Assuming the positions of both the listener and the sound source are known, the head orientation of the listener ϕ H can be obtained via ϕ H = ϕ L,S ϕ, (12) where ϕ L,S is the angle between the listener and the source position. This approach is proposed in [2] for head orientation tracking, and serves as a reference method in the case study presented in this paper. Page 7 of 7

Spatial analysis of concert hall impulse responses

Spatial analysis of concert hall impulse responses Toronto, Canada International Symposium on Room Acoustics 2013 June 9-11 Spatial analysis of concert hall impulse responses Sakari Tervo (sakari.tervo@aalto.fi) Jukka Pätynen (jukka.patynen@aalto.fi) Tapio

More information

Time-of-arrival estimation for blind beamforming

Time-of-arrival estimation for blind beamforming Time-of-arrival estimation for blind beamforming Pasi Pertilä, pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland

More information

Acoustic Source Tracking in Reverberant Environment Using Regional Steered Response Power Measurement

Acoustic Source Tracking in Reverberant Environment Using Regional Steered Response Power Measurement Acoustic Source Tracing in Reverberant Environment Using Regional Steered Response Power Measurement Kai Wu and Andy W. H. Khong School of Electrical and Electronic Engineering, Nanyang Technological University,

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

arxiv: v1 [cs.sd] 4 Dec 2018

arxiv: v1 [cs.sd] 4 Dec 2018 LOCALIZATION AND TRACKING OF AN ACOUSTIC SOURCE USING A DIAGONAL UNLOADING BEAMFORMING AND A KALMAN FILTER Daniele Salvati, Carlo Drioli, Gian Luca Foresti Department of Mathematics, Computer Science and

More information

SPATIALISATION IN AUDIO AUGMENTED REALITY USING FINGER SNAPS

SPATIALISATION IN AUDIO AUGMENTED REALITY USING FINGER SNAPS 1 SPATIALISATION IN AUDIO AUGMENTED REALITY USING FINGER SNAPS H. GAMPER and T. LOKKI Department of Media Technology, Aalto University, P.O.Box 15400, FI-00076 Aalto, FINLAND E-mail: [Hannes.Gamper,ktlokki]@tml.hut.fi

More information

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram Proceedings of APSIPA Annual Summit and Conference 5 6-9 December 5 Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram Yusuke SHIIKI and Kenji SUYAMA School of Engineering, Tokyo

More information

Automotive three-microphone voice activity detector and noise-canceller

Automotive three-microphone voice activity detector and noise-canceller Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR

More information

Time Delay Estimation: Applications and Algorithms

Time Delay Estimation: Applications and Algorithms Time Delay Estimation: Applications and Algorithms Hing Cheung So http://www.ee.cityu.edu.hk/~hcso Department of Electronic Engineering City University of Hong Kong H. C. So Page 1 Outline Introduction

More information

Localization of underwater moving sound source based on time delay estimation using hydrophone array

Localization of underwater moving sound source based on time delay estimation using hydrophone array Journal of Physics: Conference Series PAPER OPEN ACCESS Localization of underwater moving sound source based on time delay estimation using hydrophone array To cite this article: S. A. Rahman et al 2016

More information

Reducing comb filtering on different musical instruments using time delay estimation

Reducing comb filtering on different musical instruments using time delay estimation Reducing comb filtering on different musical instruments using time delay estimation Alice Clifford and Josh Reiss Queen Mary, University of London alice.clifford@eecs.qmul.ac.uk Abstract Comb filtering

More information

Auditory System For a Mobile Robot

Auditory System For a Mobile Robot Auditory System For a Mobile Robot PhD Thesis Jean-Marc Valin Department of Electrical Engineering and Computer Engineering Université de Sherbrooke, Québec, Canada Jean-Marc.Valin@USherbrooke.ca Motivations

More information

Multiple Sound Sources Localization Using Energetic Analysis Method

Multiple Sound Sources Localization Using Energetic Analysis Method VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova

More information

Robust Low-Resource Sound Localization in Correlated Noise

Robust Low-Resource Sound Localization in Correlated Noise INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem

More information

A Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios

A Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios A Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios Noha El Gemayel, Holger Jäkel, Friedrich K. Jondral Karlsruhe Institute of Technology, Germany, {noha.gemayel,holger.jaekel,friedrich.jondral}@kit.edu

More information

STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD

STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD Tapio Lokki (1), Alex Southern (1), Samuel Siltanen (1), Lauri Savioja (1), 1) Aalto University School of Science, Dept. of Media Technology,

More information

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

Nicholas Chong, Shanhung Wong, Sven Nordholm, Iain Murray

Nicholas Chong, Shanhung Wong, Sven Nordholm, Iain Murray MULTIPLE SOUND SOURCE TRACKING AND IDENTIFICATION VIA DEGENERATE UNMIXING ESTIMATION TECHNIQUE AND CARDINALITY BALANCED MULTI-TARGET MULTI-BERNOULLI FILTER (DUET-CBMEMBER) WITH TRACK MANAGEMENT Nicholas

More information

Determining Times of Arrival of Transponder Signals in a Sensor Network using GPS Time Synchronization

Determining Times of Arrival of Transponder Signals in a Sensor Network using GPS Time Synchronization Determining Times of Arrival of Transponder Signals in a Sensor Network using GPS Time Synchronization Christian Steffes, Regina Kaune and Sven Rau Fraunhofer FKIE, Dept. Sensor Data and Information Fusion

More information

AUDIO AUGMENTED REALITY IN TELECOMMUNICATION THROUGH VIRTUAL AUDITORY DISPLAY. Hannes Gamper and Tapio Lokki

AUDIO AUGMENTED REALITY IN TELECOMMUNICATION THROUGH VIRTUAL AUDITORY DISPLAY. Hannes Gamper and Tapio Lokki AUDIO AUGMENTED REALITY IN TELECOMMUNICATION THROUGH VIRTUAL AUDITORY DISPLAY Hannes Gamper and Tapio Lokki Aalto University School of Science and Technology Department of Media Technology P.O.Box 154,

More information

Introduction to Audio Watermarking Schemes

Introduction to Audio Watermarking Schemes Introduction to Audio Watermarking Schemes N. Lazic and P. Aarabi, Communication over an Acoustic Channel Using Data Hiding Techniques, IEEE Transactions on Multimedia, Vol. 8, No. 5, October 2006 Multimedia

More information

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method Udo Klein, Member, IEEE, and TrInh Qu6c VO School of Electrical Engineering, International University,

More information

Influence of artificial mouth s directivity in determining Speech Transmission Index

Influence of artificial mouth s directivity in determining Speech Transmission Index Audio Engineering Society Convention Paper Presented at the 119th Convention 2005 October 7 10 New York, New York USA This convention paper has been reproduced from the author's advance manuscript, without

More information

Psychoacoustic Cues in Room Size Perception

Psychoacoustic Cues in Room Size Perception Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany 6084 This convention paper has been reproduced from the author s advance manuscript, without editing,

More information

Subband Analysis of Time Delay Estimation in STFT Domain

Subband Analysis of Time Delay Estimation in STFT Domain PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,

More information

Hybrid WiFi/UWB, Cooperative Localization using Particle Filter

Hybrid WiFi/UWB, Cooperative Localization using Particle Filter Hybrid WiFi/UWB, Cooperative Localization using Particle Filter Nader Bargshady, Kaveh Pahlavan Center for Wireless Information Network Studies Worcester Polytechnic Institute Worcester, MA, 69, USA Email:

More information

The Simulated Location Accuracy of Integrated CCGA for TDOA Radio Spectrum Monitoring System in NLOS Environment

The Simulated Location Accuracy of Integrated CCGA for TDOA Radio Spectrum Monitoring System in NLOS Environment The Simulated Location Accuracy of Integrated CCGA for TDOA Radio Spectrum Monitoring System in NLOS Environment ao-tang Chang 1, Hsu-Chih Cheng 2 and Chi-Lin Wu 3 1 Department of Information Technology,

More information

III. Publication III. c 2005 Toni Hirvonen.

III. Publication III. c 2005 Toni Hirvonen. III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on

More information

Spatialized teleconferencing: recording and 'Squeezed' rendering of multiple distributed sites

Spatialized teleconferencing: recording and 'Squeezed' rendering of multiple distributed sites University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 Spatialized teleconferencing: recording and 'Squeezed' rendering

More information

Microphone Array Design and Beamforming

Microphone Array Design and Beamforming Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial

More information

Response spectrum Time history Power Spectral Density, PSD

Response spectrum Time history Power Spectral Density, PSD A description is given of one way to implement an earthquake test where the test severities are specified by time histories. The test is done by using a biaxial computer aided servohydraulic test rig.

More information

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York Audio Engineering Society Convention Paper Presented at the 115th Convention 2003 October 10 13 New York, New York This convention paper has been reproduced from the author's advance manuscript, without

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

Airo Interantional Research Journal September, 2013 Volume II, ISSN: Airo Interantional Research Journal September, 2013 Volume II, ISSN: 2320-3714 Name of author- Navin Kumar Research scholar Department of Electronics BR Ambedkar Bihar University Muzaffarpur ABSTRACT Direction

More information

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position Applying the Filtered Back-Projection Method to Extract Signal at Specific Position 1 Chia-Ming Chang and Chun-Hao Peng Department of Computer Science and Engineering, Tatung University, Taipei, Taiwan

More information

THE problem of acoustic echo cancellation (AEC) was

THE problem of acoustic echo cancellation (AEC) was IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 6, NOVEMBER 2005 1231 Acoustic Echo Cancellation and Doubletalk Detection Using Estimated Loudspeaker Impulse Responses Per Åhgren Abstract

More information

Nonuniform multi level crossing for signal reconstruction

Nonuniform multi level crossing for signal reconstruction 6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven

More information

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY INTER-NOISE 216 WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY Shumpei SAKAI 1 ; Tetsuro MURAKAMI 2 ; Naoto SAKATA 3 ; Hirohumi NAKAJIMA 4 ; Kazuhiro NAKADAI

More information

LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS

LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS ICSV14 Cairns Australia 9-12 July, 2007 LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS Abstract Alexej Swerdlow, Kristian Kroschel, Timo Machmer, Dirk

More information

Passive Emitter Geolocation using Agent-based Data Fusion of AOA, TDOA and FDOA Measurements

Passive Emitter Geolocation using Agent-based Data Fusion of AOA, TDOA and FDOA Measurements Passive Emitter Geolocation using Agent-based Data Fusion of AOA, TDOA and FDOA Measurements Alex Mikhalev and Richard Ormondroyd Department of Aerospace Power and Sensors Cranfield University The Defence

More information

SOUND SOURCE LOCATION METHOD

SOUND SOURCE LOCATION METHOD SOUND SOURCE LOCATION METHOD Michal Mandlik 1, Vladimír Brázda 2 Summary: This paper deals with received acoustic signals on microphone array. In this paper the localization system based on a speaker speech

More information

Source Localisation Mapping using Weighted Interaural Cross-Correlation

Source Localisation Mapping using Weighted Interaural Cross-Correlation ISSC 27, Derry, Sept 3-4 Source Localisation Mapping using Weighted Interaural Cross-Correlation Gavin Kearney, Damien Kelly, Enda Bates, Frank Boland and Dermot Furlong. Department of Electronic and Electrical

More information

Convention Paper 6274 Presented at the 117th Convention 2004 October San Francisco, CA, USA

Convention Paper 6274 Presented at the 117th Convention 2004 October San Francisco, CA, USA Audio Engineering Society Convention Paper 6274 Presented at the 117th Convention 2004 October 28 31 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript,

More information

A Hybrid TDOA/RSSD Geolocation System using the Unscented Kalman Filter

A Hybrid TDOA/RSSD Geolocation System using the Unscented Kalman Filter A Hybrid TDOA/RSSD Geolocation System using the Unscented Kalman Filter Noha El Gemayel, Holger Jäkel and Friedrich K. Jondral Communications Engineering Lab, Karlsruhe Institute of Technology (KIT, Germany

More information

Speaker Localization in Noisy Environments Using Steered Response Voice Power

Speaker Localization in Noisy Environments Using Steered Response Voice Power 112 IEEE Transactions on Consumer Electronics, Vol. 61, No. 1, February 2015 Speaker Localization in Noisy Environments Using Steered Response Voice Power Hyeontaek Lim, In-Chul Yoo, Youngkyu Cho, and

More information

A five-microphone method to measure the reflection coefficients of headsets

A five-microphone method to measure the reflection coefficients of headsets A five-microphone method to measure the reflection coefficients of headsets Jinlin Liu, Huiqun Deng, Peifeng Ji and Jun Yang Key Laboratory of Noise and Vibration Research Institute of Acoustics, Chinese

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST PACS: 43.25.Lj M.Jones, S.J.Elliott, T.Takeuchi, J.Beer Institute of Sound and Vibration Research;

More information

Toward an Augmented Reality System for Violin Learning Support

Toward an Augmented Reality System for Violin Learning Support Toward an Augmented Reality System for Violin Learning Support Hiroyuki Shiino, François de Sorbier, and Hideo Saito Graduate School of Science and Technology, Keio University, Yokohama, Japan {shiino,fdesorbi,saito}@hvrl.ics.keio.ac.jp

More information

Convention Paper 7480

Convention Paper 7480 Audio Engineering Society Convention Paper 7480 Presented at the 124th Convention 2008 May 17-20 Amsterdam, The Netherlands The papers at this Convention have been selected on the basis of a submitted

More information

Robust direction of arrival estimation

Robust direction of arrival estimation Tuomo Pirinen e-mail: tuomo.pirinen@tut.fi 26th February 2004 ICSI Speech Group Lunch Talk Outline Motivation, background and applications Basics Robustness Misc. results 2 Motivation Page1 3 Motivation

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Psychological and Physiological Acoustics Session 2aPPa: Binaural Hearing

More information

REAL TIME INDOOR TRACKING OF TAGGED OBJECTS WITH A NETWORK OF RFID READERS

REAL TIME INDOOR TRACKING OF TAGGED OBJECTS WITH A NETWORK OF RFID READERS th European Signal Processing Conference (EUSIPCO ) Bucharest, Romania, August 7 -, REAL TIME INDOOR TRACKING OF TAGGED OBJECTS WITH A NETWORK OF RFID READERS Li Geng, Mónica F. Bugallo, Akshay Athalye,

More information

IMPROVING AUDIO WATERMARK DETECTION USING NOISE MODELLING AND TURBO CODING

IMPROVING AUDIO WATERMARK DETECTION USING NOISE MODELLING AND TURBO CODING IMPROVING AUDIO WATERMARK DETECTION USING NOISE MODELLING AND TURBO CODING Nedeljko Cvejic, Tapio Seppänen MediaTeam Oulu, Information Processing Laboratory, University of Oulu P.O. Box 4500, 4STOINF,

More information

IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS

IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS 1 International Conference on Cyberworlds IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS Di Liu, Andy W. H. Khong School of Electrical

More information

Joint Position-Pitch Decomposition for Multi-Speaker Tracking

Joint Position-Pitch Decomposition for Multi-Speaker Tracking Joint Position-Pitch Decomposition for Multi-Speaker Tracking SPSC Laboratory, TU Graz 1 Contents: 1. Microphone Arrays SPSC circular array Beamforming 2. Source Localization Direction of Arrival (DoA)

More information

Calibration of Microphone Arrays for Improved Speech Recognition

Calibration of Microphone Arrays for Improved Speech Recognition MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Calibration of Microphone Arrays for Improved Speech Recognition Michael L. Seltzer, Bhiksha Raj TR-2001-43 December 2001 Abstract We present

More information

VOL. 3, NO.11 Nov, 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

VOL. 3, NO.11 Nov, 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved. Effect of Fading Correlation on the Performance of Spatial Multiplexed MIMO systems with circular antennas M. A. Mangoud Department of Electrical and Electronics Engineering, University of Bahrain P. O.

More information

arxiv: v1 [cs.sd] 17 Dec 2018

arxiv: v1 [cs.sd] 17 Dec 2018 CIRCULAR STATISTICS-BASED LOW COMPLEXITY DOA ESTIMATION FOR HEARING AID APPLICATION L. D. Mosgaard, D. Pelegrin-Garcia, T. B. Elmedyb, M. J. Pihl, P. Mowlaee Widex A/S, Nymøllevej 6, DK-3540 Lynge, Denmark

More information

PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS

PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS PERSONAL 3D AUDIO SYSTEM WITH LOUDSPEAKERS Myung-Suk Song #1, Cha Zhang 2, Dinei Florencio 3, and Hong-Goo Kang #4 # Department of Electrical and Electronic, Yonsei University Microsoft Research 1 earth112@dsp.yonsei.ac.kr,

More information

Blind Blur Estimation Using Low Rank Approximation of Cepstrum

Blind Blur Estimation Using Low Rank Approximation of Cepstrum Blind Blur Estimation Using Low Rank Approximation of Cepstrum Adeel A. Bhutta and Hassan Foroosh School of Electrical Engineering and Computer Science, University of Central Florida, 4 Central Florida

More information

Audio Engineering Society Convention Paper 5449

Audio Engineering Society Convention Paper 5449 Audio Engineering Society Convention Paper 5449 Presented at the 111th Convention 21 September 21 24 New York, NY, USA This convention paper has been reproduced from the author s advance manuscript, without

More information

Painting with Music. Weijian Zhou

Painting with Music. Weijian Zhou Painting with Music by Weijian Zhou A thesis submitted in conformity with the requirements for the degree of Master of Applied Science and Engineering Graduate Department of Electrical and Computer Engineering

More information

Localization in Wireless Sensor Networks

Localization in Wireless Sensor Networks Localization in Wireless Sensor Networks Part 2: Localization techniques Department of Informatics University of Oslo Cyber Physical Systems, 11.10.2011 Localization problem in WSN In a localization problem

More information

Perception of room size and the ability of self localization in a virtual environment. Loudspeaker experiment

Perception of room size and the ability of self localization in a virtual environment. Loudspeaker experiment Perception of room size and the ability of self localization in a virtual environment. Loudspeaker experiment Marko Horvat University of Zagreb Faculty of Electrical Engineering and Computing, Zagreb,

More information

Immersive Authoring of Tangible Augmented Reality Applications

Immersive Authoring of Tangible Augmented Reality Applications International Symposium on Mixed and Augmented Reality 2004 Immersive Authoring of Tangible Augmented Reality Applications Gun A. Lee α Gerard J. Kim α Claudia Nelles β Mark Billinghurst β α Virtual Reality

More information

ACOUSTIC feedback problems may occur in audio systems

ACOUSTIC feedback problems may occur in audio systems IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL 20, NO 9, NOVEMBER 2012 2549 Novel Acoustic Feedback Cancellation Approaches in Hearing Aid Applications Using Probe Noise and Probe Noise

More information

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks Mariam Yiwere 1 and Eun Joo Rhee 2 1 Department of Computer Engineering, Hanbat National University,

More information

TDE-ILD-HRTF-Based 2D Whole-Plane Sound Source Localization Using Only Two Microphones and Source Counting

TDE-ILD-HRTF-Based 2D Whole-Plane Sound Source Localization Using Only Two Microphones and Source Counting TDE-ILD-HRTF-Based 2D Whole-Plane Sound Source Localization Using Only Two Microphones Source Counting Ali Pourmohammad, Member, IACSIT Seyed Mohammad Ahadi Abstract In outdoor cases, TDOA-based methods

More information

Performance Analysis of a 1-bit Feedback Beamforming Algorithm

Performance Analysis of a 1-bit Feedback Beamforming Algorithm Performance Analysis of a 1-bit Feedback Beamforming Algorithm Sherman Ng Mark Johnson Electrical Engineering and Computer Sciences University of California at Berkeley Technical Report No. UCB/EECS-2009-161

More information

Speech Enhancement Using Microphone Arrays

Speech Enhancement Using Microphone Arrays Friedrich-Alexander-Universität Erlangen-Nürnberg Lab Course Speech Enhancement Using Microphone Arrays International Audio Laboratories Erlangen Prof. Dr. ir. Emanuël A. P. Habets Friedrich-Alexander

More information

EWGAE 2010 Vienna, 8th to 10th September

EWGAE 2010 Vienna, 8th to 10th September EWGAE 2010 Vienna, 8th to 10th September Frequencies and Amplitudes of AE Signals in a Plate as a Function of Source Rise Time M. A. HAMSTAD University of Denver, Department of Mechanical and Materials

More information

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany Audio Engineering Society Convention Paper Presented at the 16th Convention 9 May 7 Munich, Germany The papers at this Convention have been selected on the basis of a submitted abstract and extended precis

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

Waves Nx VIRTUAL REALITY AUDIO

Waves Nx VIRTUAL REALITY AUDIO Waves Nx VIRTUAL REALITY AUDIO WAVES VIRTUAL REALITY AUDIO THE FUTURE OF AUDIO REPRODUCTION AND CREATION Today s entertainment is on a mission to recreate the real world. Just as VR makes us feel like

More information

Sound Processing Technologies for Realistic Sensations in Teleworking

Sound Processing Technologies for Realistic Sensations in Teleworking Sound Processing Technologies for Realistic Sensations in Teleworking Takashi Yazu Makoto Morito In an office environment we usually acquire a large amount of information without any particular effort

More information

Convention Paper 6230

Convention Paper 6230 Audio Engineering Society Convention Paper 6230 Presented at the 117th Convention 2004 October 28 31 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript,

More information

Mikko Myllymäki and Tuomas Virtanen

Mikko Myllymäki and Tuomas Virtanen NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,

More information

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set S. Johansson, S. Nordebo, T. L. Lagö, P. Sjösten, I. Claesson I. U. Borchers, K. Renger University of

More information

ON WAVEFORM SELECTION IN A TIME VARYING SONAR ENVIRONMENT

ON WAVEFORM SELECTION IN A TIME VARYING SONAR ENVIRONMENT ON WAVEFORM SELECTION IN A TIME VARYING SONAR ENVIRONMENT Ashley I. Larsson 1* and Chris Gillard 1 (1) Maritime Operations Division, Defence Science and Technology Organisation, Edinburgh, Australia Abstract

More information

Tracking of UWB Multipath Components Using Probability Hypothesis Density Filters

Tracking of UWB Multipath Components Using Probability Hypothesis Density Filters Tracking of UWB Multipath Components Using Probability Hypothesis Density Filters Markus Froehle, Paul Meissner and Klaus Witrisal Graz University of Technology, Graz, Austria. Email: {froehle, paul.meissner,

More information

Error Analysis of a Low Cost TDoA Sensor Network

Error Analysis of a Low Cost TDoA Sensor Network Error Analysis of a Low Cost TDoA Sensor Network Noha El Gemayel, Holger Jäkel and Friedrich K. Jondral Communications Engineering Lab, Karlsruhe Institute of Technology (KIT), Germany {noha.gemayel, holger.jaekel,

More information

Sponsored by. Nisarg Kothari Carnegie Mellon University April 26, 2011

Sponsored by. Nisarg Kothari Carnegie Mellon University April 26, 2011 Sponsored by Nisarg Kothari Carnegie Mellon University April 26, 2011 Motivation Why indoor localization? Navigating malls, airports, office buildings Museum tours, context aware apps Augmented reality

More information

A MICROPHONE ARRAY INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE

A MICROPHONE ARRAY INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE A MICROPHONE ARRA INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE Daniele Salvati AVIRES lab Dep. of Mathematics and Computer Science, University of Udine, Italy daniele.salvati@uniud.it Sergio Canazza

More information

Convention Paper 7024 Presented at the 122th Convention 2007 May 5 8 Vienna, Austria

Convention Paper 7024 Presented at the 122th Convention 2007 May 5 8 Vienna, Austria Audio Engineering Society Convention Paper 7024 Presented at the 122th Convention 2007 May 5 8 Vienna, Austria This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION. Frank Kurth, Alessia Cornaggia-Urrigshardt and Sebastian Urrigshardt

ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION. Frank Kurth, Alessia Cornaggia-Urrigshardt and Sebastian Urrigshardt 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION Frank Kurth, Alessia Cornaggia-Urrigshardt

More information

Interfacing with the Machine

Interfacing with the Machine Interfacing with the Machine Jay Desloge SENS Corporation Sumit Basu Microsoft Research They (We) Are Better Than We Think! Machine source separation, localization, and recognition are not as distant as

More information

Digital Communications over Fading Channel s

Digital Communications over Fading Channel s over Fading Channel s Instructor: Prof. Dr. Noor M Khan Department of Electronic Engineering, Muhammad Ali Jinnah University, Islamabad Campus, Islamabad, PAKISTAN Ph: +9 (51) 111-878787, Ext. 19 (Office),

More information

Recording and analysis of head movements, interaural level and time differences in rooms and real-world listening scenarios

Recording and analysis of head movements, interaural level and time differences in rooms and real-world listening scenarios Toronto, Canada International Symposium on Room Acoustics 2013 June 9-11 ISRA 2013 Recording and analysis of head movements, interaural level and time differences in rooms and real-world listening scenarios

More information

Sound Source Localization using HRTF database

Sound Source Localization using HRTF database ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,

More information

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-), Verona, Italy, December 7-9,2 AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES Tapio Lokki Telecommunications

More information

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS NORDIC ACOUSTICAL MEETING 12-14 JUNE 1996 HELSINKI WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS Helsinki University of Technology Laboratory of Acoustics and Audio

More information

REAL-TIME SRP-PHAT SOURCE LOCATION IMPLEMENTATIONS ON A LARGE-APERTURE MICROPHONE ARRAY

REAL-TIME SRP-PHAT SOURCE LOCATION IMPLEMENTATIONS ON A LARGE-APERTURE MICROPHONE ARRAY REAL-TIME SRP-PHAT SOURCE LOCATION IMPLEMENTATIONS ON A LARGE-APERTURE MICROPHONE ARRAY by Hoang Tran Huy Do A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE

More information

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS ' FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS Frédéric Abrard and Yannick Deville Laboratoire d Acoustique, de

More information

Auditory Distance Perception. Yan-Chen Lu & Martin Cooke

Auditory Distance Perception. Yan-Chen Lu & Martin Cooke Auditory Distance Perception Yan-Chen Lu & Martin Cooke Human auditory distance perception Human performance data (21 studies, 84 data sets) can be modelled by a power function r =kr a (Zahorik et al.

More information

Optimized threshold calculation for blanking nonlinearity at OFDM receivers based on impulsive noise estimation

Optimized threshold calculation for blanking nonlinearity at OFDM receivers based on impulsive noise estimation Ali et al. EURASIP Journal on Wireless Communications and Networking (2015) 2015:191 DOI 10.1186/s13638-015-0416-0 RESEARCH Optimized threshold calculation for blanking nonlinearity at OFDM receivers based

More information

Active noise control at a moving virtual microphone using the SOTDF moving virtual sensing method

Active noise control at a moving virtual microphone using the SOTDF moving virtual sensing method Proceedings of ACOUSTICS 29 23 25 November 29, Adelaide, Australia Active noise control at a moving rophone using the SOTDF moving sensing method Danielle J. Moreau, Ben S. Cazzolato and Anthony C. Zander

More information

Robust Speaker Identification for Meetings: UPC CLEAR 07 Meeting Room Evaluation System

Robust Speaker Identification for Meetings: UPC CLEAR 07 Meeting Room Evaluation System Robust Speaker Identification for Meetings: UPC CLEAR 07 Meeting Room Evaluation System Jordi Luque and Javier Hernando Technical University of Catalonia (UPC) Jordi Girona, 1-3 D5, 08034 Barcelona, Spain

More information

Audio Watermark Detection Improvement by Using Noise Modelling

Audio Watermark Detection Improvement by Using Noise Modelling Audio Watermark Detection Improvement by Using Noise Modelling NEDELJKO CVEJIC, TAPIO SEPPÄNEN*, DAVID BULL Dept. of Electrical and Electronic Engineering University of Bristol Merchant Venturers Building,

More information

Optimization of Coded MIMO-Transmission with Antenna Selection

Optimization of Coded MIMO-Transmission with Antenna Selection Optimization of Coded MIMO-Transmission with Antenna Selection Biljana Badic, Paul Fuxjäger, Hans Weinrichter Institute of Communications and Radio Frequency Engineering Vienna University of Technology

More information

Paper VI. Non-synchronous resonators on leaky substrates. J. Meltaus, V. P. Plessky, and S. S. Hong. Copyright 2005 IEEE.

Paper VI. Non-synchronous resonators on leaky substrates. J. Meltaus, V. P. Plessky, and S. S. Hong. Copyright 2005 IEEE. Paper VI Non-synchronous resonators on leaky substrates J. Meltaus, V. P. Plessky, and S. S. Hong Copyright 5 IEEE. Reprinted from J. Meltaus, V. P. Plessky, and S. S. Hong, "Nonsynchronous resonators

More information