Multiple Sound Sources Localization Using Energetic Analysis Method
|
|
- Julius Gilbert
- 5 years ago
- Views:
Transcription
1 VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova 11, 1, Brno Czech Republic xkhadd@stud.feec.vutbr.cz, schimmel@feec.vutbr.cz Abstract In this article a method for multiple sound source localization is proposed. This method depends on energetic analysis of B-format signals. The number of sound sources localized by this method can exceed the number of the used microphones. The method was simulated in Matlab and tested in a real environment. Both experimental and simulation results show the efficiency of this method. 1 Introduction Sound source localization methods have been intensively investigated, several methods have been designed for one sound source localization; most of them are based on time delay estimation [1], and on the phase difference []. Some methods are able to localize a number of sound sources that is equal or less than the number of used sensors (microphones), such as MUSIC (Multiple Signal Classification) [3]. MUSIC can estimate the directional of arrivals (DOAs) based on relation between the noise subspace and the signal subspace [3]. Some new methods solved this problem and they are able to localize more sound sources. To achieve that, one method uses binary time frequency masks for blind separation of speech mixtures [4]. This method relies on a property of the Gabor expansions of speech signals, which is called W- disjoint orthogonality [4]. Other method use so called blind separation method (BSS) was presented in [5]. BSS method presents a new approach in order to estimate direction of arrivals (DOAs) depending on applying the Expectation-Maximization algorithm (EM) to a sparseness-based approach [5]. In our paper, a new method is presented for multiple sound source localization using B-format signals, in which the number of sound sources localized by this method can exceed the number of used sensors (in our method B-format signals in horizontal plane only are used). This method depends on energetic analysis of sound signals (B-format signals). This paper is organized as follows; Section presents the B-format signals principle. The energetic analysis method is introduced in Section 3. Section 4 presents the simulation results for this method in Matlab. The experimental results are presented in Section 5, and Section concludes the paper. mation about sound sources [5]. The signals, provide information about the sound source in the horizontal plane, they are recorded using two figure-of-eight microphones facing front-back ( ) and left-right ( )), while provides information about the vertical plane, and it is recorded using figure-of-eight microphone that faces up-down. The signal is recorded using omni-directional microphone, see Figure 1. The encoding equations for B-format signals are [] where represents the azimuth angle of the source, represents the elevation angle of the source, and s represents the sound signal. Left (y(t)) Front (x(t)) Up (z(t)) W(t) (1) B-Format Signals B-format signals are able to represent the sound sources in the three dimensional sphere, they contain four signals,, and, which can carry all of the directional infor- Figure 1: Polar patterns of B-format components. 5
2 time (sec) VOL.3, NO.4, DECEMBER 1 3 Energetic Analysis Method Energetic analysis method is based on the fact, that the sound source direction is the opposite direction of the intensity vector of the sound. This principle has been used in spatial sound reproduction methods such as directional audio coding (DirAC) [7]. However, it is used in this paper for multiple sound source localization using other criteria. The sound energies are distributed in time and frequency, the spectral density distribution for one of the signals is presented in Figure using spectrogram in Matlab where the length of hamming window was chosen to be 14 samples, the overlaps was chosen to be 5 points, the number of sampling points to calculate the discrete Fourier transform was 5 points, and the sampling frequnecy was 441 Hz. Assuming that there are several sound sources, the energy in some timefrequency points are generated from several sound sources simultaneously. Therefore, from one frequency bin it is not possible to determine all sound source positions. the instantaneous intensity vector and it is defined as [] [ ] (3) The instantaneous intensity vector points to the direction of the flow of sound energy, while the direction of arrival is supposed to be opposite to this direction. We can get the azimuth of the sound source as [] { and the elevation as [ [ ] [ ] ] (5) After calculating the angles for each frequency bin in each time frame, a statistical estimation for angle distribution should be done, see Figure (3) B-format signals Dividing signals in time Dividing signals into frequency bands Azimuth and elevation estimation Statistical calculation for angles for each time frame The estimated angles Frequency (Hz) x 1 4 Figure. Spectral density distribution for a sample of speech signal recorded by omni-directional microphone. In this method, the sound signals are divided in time and then in frequency using short time Fourier transforms (STFT), where the window was chosen to be hanning window with length of 51 samples, and the overlapping was chosen to be half of the window size. The input signals for this method are B-format signals, the intensity vector can be obtained using following equations for each time frame [] Re{ }, Re{ }, () Re{ }, where Z is the acoustic impedance of the air, t is time, f is frequency, * denotes complex conjugate,,, and are the Fourier transform for the B-format signals,, and respectively, and is Figure 3: A diagram for Energetic Analysis method. In each time frame, we assume that only one sound source is dominant for each frequency bin. This assumption can be hold since that each sound signal differs from others, and the signals have different intensity in time. In this case, each frequency bin has information about one sound source direction. We consider the direction from where the sound signals come from as the direction that is the most repeated in the frequency bins in each time frame. When we have several sound signals that are emitted simultaneously, the direction for each sound source signal is repeated several times in each time frame for different frequency bins. We can obtain the sound source direction as the angle that maximizes the summation of function on the whole frequency interval for each time frame. In case there is only one sound source the estimated direction could be written
3 frequency bins number of frequency bins time (sec) VOL.3, NO.4, DECEMBER 1 as where is the estimated sound source direction, K is the number of the frequency bins for S is the sound signal, t denotes the time frame index, f is the frequency bin, and is the probability that this signal comes from the direction α. The main difficulties that face this method come generally from background noise, reverberation and microphone noise. However, the sound intensity coming from the sound source is bigger than noise and reverberation intensity. For some time frame, the detection error is bigger when there is no active speaker 4 Simulation Results The method was simulated in Matlab. Figure 4 shows the simulation results for this method. In this scenario we assumed that there are four speakers in the horizontal plane, who speak simultaneously. B-format signals are generated from these sound signals according to (1). With no additional noise, the method was able to estimate the sound source positions perfectly. Peaks in Figure 4 denote the angle from where the sound is coming from. As can be seen from Figure 4, the four sound source positions are estimated correctly. Some frequency bins indicates that the sound signal is coming from other directions. This angle detection error comes from the fact that more than one signal has component in the same frequency bin in the same time frame. 7 5 () by Matlab. The two noise signals were added to each other and asumed to be located in different places arround the microphones. These places were assumed to be equidistantly separated (i.e. 3 degrees from each other). The signal-to-noise ratio (SNR) between the signal and the additional noise signal was about - db, and it is calculated using following equation ( ) ( ) where, and are the average power for and respectively, and is the additional noise signal. The method was also able to detect the sound sources positions for the all speakers, see Figure Frequency (Hz) x 1 4 Figure 5: Spectral density distribution for a fan noise sound signal. As can be seen from simulation results, adding the additive noise decreases the ability of localization the sound sources. The noise signal influences the intelligibility of the sound source signal and it changes the distribution of sound intensity. However, since the sound source intensity is bigger than the noise intensity, the method is still able to localize the sound sources correctly Figure 4: Simulation results with absence of noise Two different noise signals were added to each B-format signal. The first noise signal is a fan noise sound. Spectral energy distribution for this signal is presented in Figure 5 using the same spectrogram parameters as in Figure. The second noise signal is pseudo-random noise with a normal distribution with mean zero and standard deviation one which is generated Figure : Simulation result with the present of pseudorandom noise signal and a fan noise signal. 7
4 number of frequency bins VOL.3, NO.4, DECEMBER 1 5 Experimental results The measurements were carried out in the acoustic laboratory at Department of Telecommunications FEEC, Brno University of Technology where the experiment s conditions were same as in sound control rooms, listening rooms, or in living rooms with high quality listening environment; the laboratory provides semi-diffuse field with reverberation time RT <.3 s in all octave bands. The measurements were carried out only for sound source placed in horizontal plane of the B-format microphone setup. In the first part of the experiment, three people (two men and one woman) were talking simultaneously in forty different positions. The positions were selected arbitrary for each speaker in a circle around the microphones, see Figure 7. One sentence was chosen for all speakers to be said, the speech in each case lasted about seconds. The positions of the speakers for each case were registered and compared with the results of the method. localization less accurate. However, this method was able to estimate the sound source positions correctly in the real environment. Figure : The directional sensitivity of the eight-of- figure microphones [9]. B-format microphones The directions of arrival were estimated for each speakers positions, and the results were compared with the real speakers positions. The results are illustrated using box plots. The boxes have lines at lower quartile, median, and upper quartile values. The whiskers show the extent of the rest of the data. The outliers are presented by red cross outside of the whiskers Figure 7: Speakers positions around the microphones. Two figure-of-eight microphones were used to pick-up the signals,, and one omni-direction microphone was used to pick up the signal. The directional sensitivity of the figure-of-eight microphones is shown in Figure. The results show the ability of this method to estimate the sound source positions correctly. Figure 9 shows one case of the experiment where three speakers were in three different positions around the microphones. As can be seen, the positions of the three speakers are well estimated, where the peaks denote the positions of the speakers. The speaker who stood in position (+1 ) could be considered also to be in position (- 1 ). Since there are multiple sound sources (speakers), the sound intensity differs in time, and for some frequencies the sound energy is coming from more than one speaker. Furthermore, there are a background noise and noise coming from the microphones. These factors together make the sound sources Figure 9: Estimated speakers positions in real environment. As can be seen in Figure 1, the method was able to localize the sound source positions; the median error is between three and four degrees for all positions. The biggest error is about twelve degrees for the third speaker, where the first and the third speakers are two men and the second speaker is a woman.
5 absolute error (degree) absolute error (degree) VOL.3, NO.4, DECEMBER the target s movement. The error could happen when the target moves too fast. Acknowledgment The described research was performed in laboratories supported by the SIX project; the registration number CZ.1.5/.1./3.7, the operational program Research and Development for Innovation. References Figure 1: Absolute angle error for each speaker in case of three speakers. In the second part of our experiment, four people (two men and two women) talked simultaneously. The same sentence as in the first part of the experiment was chosen to be said. The results showed the ability of the method to localize the sound sources as can be seen in Figure 11. It should be noted that the first and second speaker are women. The median error in this case was about 4 degrees first speaker second speaker third speaker first speaker second speaker third speaker fourth speaker Figure 11: Absolute angle error in case of four speakers. [1] Carter, G.C.; "Coherence and time delay estimation," Proceedings of the IEEE, vol.75, no., pp. 3-55, Feb [] Taff, L.G.;, "Target localization from bearings-only observations," Aerospace and Electronic Systems, IEEE Transactions on, vol.33, no.1, pp.-1, Jan [3] Schmidt, R.;, "Multiple emitter location and signal parameter estimation," Antennas and Propagation, IEEE Transactions on, vol.34, no.3, pp. 7-, Mar 19. [4] Yilmaz, O.; Rickard, S.;, "Blind separation of speech mixtures via time-frequency masking," Signal Processing, IEEE Transactions on, vol.5, no.7, pp , July 4. [5] Izumi, Yosuke; Ono, Nobutaka; Sagayama, Shigeki;, "Sparseness-Based CH BSS using the EM Algorithm in Reverberant Environment," Applications of Signal Processing to Audio and Acoustics, 7 IEEE Workshop on, vol., no., pp , 1-4 Oct. 7. [] Benjamin, E.; Heller, A.; Lee,.; "Localization in horizontal-only ambisonic systems, " in Proc. 11st Convention of the Audio Engineering Society, San Francisco,. pp.1. [7] Pulkki, V.; Spatial Sound Reproduction with Directional audio coding J.Audio Eng.Soc.,vol.55,pp.53-51,Jun 7. [] E. Ahonen J.; Pulkki V., Kuech F.; Kallinger M.; Schultz- Amling R.; Directional analysis of sound field with linear microphone array and applications in sound reproduction. In Proc. AES 14th Convention, Amsterdam, The Netherlands, May. [9] AKG- 5 years of innovation [online]. [Citied.11.1]. Accessible from < 35,nodeid,,_language,EN,view,diagram.html>. Conclusion The energetic analysis method is a good method for multiple sound source localization. It achieved good results in both simulated and real environment. The angle detection errors come from the background noise and the reverberation signals. The method is able to localize more sound sources than the number of the used microphones. The method can be used for tracking mobile targets, when the duration of time frame is chosen to be suitable for the speed of 9
Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments
Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments Kouei Yamaoka, Shoji Makino, Nobutaka Ono, and Takeshi Yamada University of Tsukuba,
More informationRecent Advances in Acoustic Signal Extraction and Dereverberation
Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing
More informationAutomotive three-microphone voice activity detector and noise-canceller
Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationAdaptive Beamforming Applied for Signals Estimated with MUSIC Algorithm
Buletinul Ştiinţific al Universităţii "Politehnica" din Timişoara Seria ELECTRONICĂ şi TELECOMUNICAŢII TRANSACTIONS on ELECTRONICS and COMMUNICATIONS Tom 57(71), Fascicola 2, 2012 Adaptive Beamforming
More informationLocalization of underwater moving sound source based on time delay estimation using hydrophone array
Journal of Physics: Conference Series PAPER OPEN ACCESS Localization of underwater moving sound source based on time delay estimation using hydrophone array To cite this article: S. A. Rahman et al 2016
More informationStudy Of Sound Source Localization Using Music Method In Real Acoustic Environment
International Journal of Electronics Engineering Research. ISSN 975-645 Volume 9, Number 4 (27) pp. 545-556 Research India Publications http://www.ripublication.com Study Of Sound Source Localization Using
More informationThe Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals Maria G. Jafari and Mark D. Plumbley Centre for Digital Music, Queen Mary University of London, UK maria.jafari@elec.qmul.ac.uk,
More informationOmnidirectional Sound Source Tracking Based on Sequential Updating Histogram
Proceedings of APSIPA Annual Summit and Conference 5 6-9 December 5 Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram Yusuke SHIIKI and Kenji SUYAMA School of Engineering, Tokyo
More informationarxiv: v1 [cs.sd] 4 Dec 2018
LOCALIZATION AND TRACKING OF AN ACOUSTIC SOURCE USING A DIAGONAL UNLOADING BEAMFORMING AND A KALMAN FILTER Daniele Salvati, Carlo Drioli, Gian Luca Foresti Department of Mathematics, Computer Science and
More informationSound Source Localization using HRTF database
ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,
More informationIntroduction to Audio Watermarking Schemes
Introduction to Audio Watermarking Schemes N. Lazic and P. Aarabi, Communication over an Acoustic Channel Using Data Hiding Techniques, IEEE Transactions on Multimedia, Vol. 8, No. 5, October 2006 Multimedia
More informationBroadband Microphone Arrays for Speech Acquisition
Broadband Microphone Arrays for Speech Acquisition Darren B. Ward Acoustics and Speech Research Dept. Bell Labs, Lucent Technologies Murray Hill, NJ 07974, USA Robert C. Williamson Dept. of Engineering,
More informationBREAKING DOWN THE COCKTAIL PARTY: CAPTURING AND ISOLATING SOURCES IN A SOUNDSCAPE
BREAKING DOWN THE COCKTAIL PARTY: CAPTURING AND ISOLATING SOURCES IN A SOUNDSCAPE Anastasios Alexandridis, Anthony Griffin, and Athanasios Mouchtaris FORTH-ICS, Heraklion, Crete, Greece, GR-70013 University
More informationinter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE
Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 7.2 MICROPHONE ARRAY
More informationMeasuring impulse responses containing complete spatial information ABSTRACT
Measuring impulse responses containing complete spatial information Angelo Farina, Paolo Martignon, Andrea Capra, Simone Fontana University of Parma, Industrial Eng. Dept., via delle Scienze 181/A, 43100
More informationSmart antenna for doa using music and esprit
IOSR Journal of Electronics and Communication Engineering (IOSRJECE) ISSN : 2278-2834 Volume 1, Issue 1 (May-June 2012), PP 12-17 Smart antenna for doa using music and esprit SURAYA MUBEEN 1, DR.A.M.PRASAD
More informationIMPROVED COCKTAIL-PARTY PROCESSING
IMPROVED COCKTAIL-PARTY PROCESSING Alexis Favrot, Markus Erne Scopein Research Aarau, Switzerland postmaster@scopein.ch Christof Faller Audiovisual Communications Laboratory, LCAV Swiss Institute of Technology
More informationA Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios
A Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios Noha El Gemayel, Holger Jäkel, Friedrich K. Jondral Karlsruhe Institute of Technology, Germany, {noha.gemayel,holger.jaekel,friedrich.jondral}@kit.edu
More informationTwo-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling
Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Mikko Parviainen 1 and Tuomas Virtanen 2 Institute of Signal Processing Tampere University
More informationECMA-108. Measurement of Highfrequency. emitted by Information Technology and Telecommunications Equipment. 4 th Edition / December 2008
ECMA-108 4 th Edition / December 2008 Measurement of Highfrequency Noise emitted by Information Technology and Telecommunications Equipment COPYRIGHT PROTECTED DOCUMENT Ecma International 2008 Standard
More informationMINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE
MINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE Scott Rickard, Conor Fearon University College Dublin, Dublin, Ireland {scott.rickard,conor.fearon}@ee.ucd.ie Radu Balan, Justinian Rosca Siemens
More informationSUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES
SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES SF Minhas A Barton P Gaydecki School of Electrical and
More informationAnalysis of Frontal Localization in Double Layered Loudspeaker Array System
Proceedings of 20th International Congress on Acoustics, ICA 2010 23 27 August 2010, Sydney, Australia Analysis of Frontal Localization in Double Layered Loudspeaker Array System Hyunjoo Chung (1), Sang
More informationAn Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
Proceedings of the th WSEAS International Conference on Signal Processing, Istanbul, Turkey, May 7-9, 6 (pp4-44) An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
More informationImproving reverberant speech separation with binaural cues using temporal context and convolutional neural networks
Improving reverberant speech separation with binaural cues using temporal context and convolutional neural networks Alfredo Zermini, Qiuqiang Kong, Yong Xu, Mark D. Plumbley, Wenwu Wang Centre for Vision,
More informationDigital Modulation Recognition Based on Feature, Spectrum and Phase Analysis and its Testing with Disturbed Signals
Digital Modulation Recognition Based on Feature, Spectrum and Phase Analysis and its Testing with Disturbed Signals A. KUBANKOVA AND D. KUBANEK Department of Telecommunications Brno University of Technology
More informationEstimation of Non-stationary Noise Power Spectrum using DWT
Estimation of Non-stationary Noise Power Spectrum using DWT Haripriya.R.P. Department of Electronics & Communication Engineering Mar Baselios College of Engineering & Technology, Kerala, India Lani Rachel
More informationThe psychoacoustics of reverberation
The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control
More informationNonlinear postprocessing for blind speech separation
Nonlinear postprocessing for blind speech separation Dorothea Kolossa and Reinhold Orglmeister 1 TU Berlin, Berlin, Germany, D.Kolossa@ee.tu-berlin.de, WWW home page: http://ntife.ee.tu-berlin.de/personen/kolossa/home.html
More informationTARGET SPEECH EXTRACTION IN COCKTAIL PARTY BY COMBINING BEAMFORMING AND BLIND SOURCE SEPARATION
TARGET SPEECH EXTRACTION IN COCKTAIL PARTY BY COMBINING BEAMFORMING AND BLIND SOURCE SEPARATION Lin Wang 1,2, Heping Ding 2 and Fuliang Yin 1 1 School of Electronic and Information Engineering, Dalian
More informationDIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY
DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY Anastasios Alexandridis Anthony Griffin Athanasios Mouchtaris FORTH-ICS, Heraklion, Crete, Greece, GR-70013 University of Crete, Department
More informationRobust Low-Resource Sound Localization in Correlated Noise
INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem
More informationAiro Interantional Research Journal September, 2013 Volume II, ISSN:
Airo Interantional Research Journal September, 2013 Volume II, ISSN: 2320-3714 Name of author- Navin Kumar Research scholar Department of Electronics BR Ambedkar Bihar University Muzaffarpur ABSTRACT Direction
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationEnhancing 3D Audio Using Blind Bandwidth Extension
Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,
More informationBlind Pilot Decontamination
Blind Pilot Decontamination Ralf R. Müller Professor for Digital Communications Friedrich-Alexander University Erlangen-Nuremberg Adjunct Professor for Wireless Networks Norwegian University of Science
More informationAmbient Passive Seismic Imaging with Noise Analysis Aleksandar Jeremic, Michael Thornton, Peter Duncan, MicroSeismic Inc.
Aleksandar Jeremic, Michael Thornton, Peter Duncan, MicroSeismic Inc. SUMMARY The ambient passive seismic imaging technique is capable of imaging repetitive passive seismic events. Here we investigate
More informationEE 464 Short-Time Fourier Transform Fall and Spectrogram. Many signals of importance have spectral content that
EE 464 Short-Time Fourier Transform Fall 2018 Read Text, Chapter 4.9. and Spectrogram Many signals of importance have spectral content that changes with time. Let xx(nn), nn = 0, 1,, NN 1 1 be a discrete-time
More informationJoint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events
INTERSPEECH 2013 Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events Rupayan Chakraborty and Climent Nadeu TALP Research Centre, Department of Signal Theory
More informationEffect of the number of loudspeakers on sense of presence in 3D audio system based on multiple vertical panning
Effect of the number of loudspeakers on sense of presence in 3D audio system based on multiple vertical panning Toshiyuki Kimura and Hiroshi Ando Universal Communication Research Institute, National Institute
More informationSpatialisation accuracy of a Virtual Performance System
Spatialisation accuracy of a Virtual Performance System Iain Laird, Dr Paul Chapman, Digital Design Studio, Glasgow School of Art, Glasgow, UK, I.Laird1@gsa.ac.uk, p.chapman@gsa.ac.uk Dr Damian Murphy
More informationKeywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.
Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement
More informationAn improved direction of arrival (DOA) estimation algorithm and beam formation algorithm for smart antenna system in multipath environment
ISSN:2348-2079 Volume-6 Issue-1 International Journal of Intellectual Advancements and Research in Engineering Computations An improved direction of arrival (DOA) estimation algorithm and beam formation
More informationLecture 7 Frequency Modulation
Lecture 7 Frequency Modulation Fundamentals of Digital Signal Processing Spring, 2012 Wei-Ta Chu 2012/3/15 1 Time-Frequency Spectrum We have seen that a wide range of interesting waveforms can be synthesized
More informationPsychoacoustic Cues in Room Size Perception
Audio Engineering Society Convention Paper Presented at the 116th Convention 2004 May 8 11 Berlin, Germany 6084 This convention paper has been reproduced from the author s advance manuscript, without editing,
More informationSeparation of Multiple Speech Signals by Using Triangular Microphone Array
Separation of Multiple Speech Signals by Using Triangular Microphone Array 15 Separation of Multiple Speech Signals by Using Triangular Microphone Array Nozomu Hamada 1, Non-member ABSTRACT Speech source
More informationDirection-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method
Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method Udo Klein, Member, IEEE, and TrInh Qu6c VO School of Electrical Engineering, International University,
More informationAudio Imputation Using the Non-negative Hidden Markov Model
Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han 1,, Gautham J. Mysore 2, and Bryan Pardo 1 1 EECS Department, Northwestern University 2 Advanced Technology Labs, Adobe Systems Inc.
More informationCORRELATION BASED SNR ESTIMATION IN OFDM SYSTEM
CORRELATION BASED SNR ESTIMATION IN OFDM SYSTEM Suneetha Kokkirigadda 1 & Asst.Prof.K.Vasu Babu 2 1.ECE, Vasireddy Venkatadri Institute of Technology,Namburu,A.P,India 2.ECE, Vasireddy Venkatadri Institute
More informationONLINE REPET-SIM FOR REAL-TIME SPEECH ENHANCEMENT
ONLINE REPET-SIM FOR REAL-TIME SPEECH ENHANCEMENT Zafar Rafii Northwestern University EECS Department Evanston, IL, USA Bryan Pardo Northwestern University EECS Department Evanston, IL, USA ABSTRACT REPET-SIM
More informationBEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR
BeBeC-2016-S9 BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR Clemens Nau Daimler AG Béla-Barényi-Straße 1, 71063 Sindelfingen, Germany ABSTRACT Physically the conventional beamforming method
More informationChapter 4 SPEECH ENHANCEMENT
44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or
More informationSPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS
17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS Jürgen Freudenberger, Sebastian Stenzel, Benjamin Venditti
More informationROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES
ROOM AND CONCERT HALL ACOUSTICS The perception of sound by human listeners in a listening space, such as a room or a concert hall is a complicated function of the type of source sound (speech, oration,
More informationFROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS
' FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS Frédéric Abrard and Yannick Deville Laboratoire d Acoustique, de
More informationAntennas and Propagation. Chapter 5c: Array Signal Processing and Parametric Estimation Techniques
Antennas and Propagation : Array Signal Processing and Parametric Estimation Techniques Introduction Time-domain Signal Processing Fourier spectral analysis Identify important frequency-content of signal
More informationECMA-108. Measurement of Highfrequency. emitted by Information Technology and Telecommunications Equipment. 5 th Edition / December 2010
ECMA-108 5 th Edition / December 2010 Measurement of Highfrequency Noise emitted by Information Technology and Telecommunications Equipment Reference number ECMA-123:2009 Ecma International 2009 COPYRIGHT
More informationTime- frequency Masking
Time- Masking EECS 352: Machine Percep=on of Music & Audio Zafar Rafii, Winter 214 1 STFT The Short- Time Fourier Transform (STFT) is a succession of local Fourier Transforms (FT) Time signal Real spectrogram
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Architectural Acoustics Session 1pAAa: Advanced Analysis of Room Acoustics:
More informationAnalysis of room transfer function and reverberant signal statistics
Analysis of room transfer function and reverberant signal statistics E. Georganti a, J. Mourjopoulos b and F. Jacobsen a a Acoustic Technology Department, Technical University of Denmark, Ørsted Plads,
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb10.
More informationReverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function
Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function Xiaofei Li, Laurent Girin, Fabien Badeig, Radu Horaud PERCEPTION Team, INRIA Grenoble Rhone-Alpes October
More informationADAPTIVE ANTENNAS. TYPES OF BEAMFORMING
ADAPTIVE ANTENNAS TYPES OF BEAMFORMING 1 1- Outlines This chapter will introduce : Essential terminologies for beamforming; BF Demonstrating the function of the complex weights and how the phase and amplitude
More informationLocal Relative Transfer Function for Sound Source Localization
Local Relative Transfer Function for Sound Source Localization Xiaofei Li 1, Radu Horaud 1, Laurent Girin 1,2, Sharon Gannot 3 1 INRIA Grenoble Rhône-Alpes. {firstname.lastname@inria.fr} 2 GIPSA-Lab &
More informationStudents: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa
Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions
More informationAdaptive Fingerprint Binarization by Frequency Domain Analysis
Adaptive Fingerprint Binarization by Frequency Domain Analysis Josef Ström Bartůněk, Mikael Nilsson, Jörgen Nordberg, Ingvar Claesson Department of Signal Processing, School of Engineering, Blekinge Institute
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationSOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4
SOPA version 2 Revised July 7 2014 SOPA project September 21, 2014 Contents 1 Introduction 2 2 Basic concept 3 3 Capturing spatial audio 4 4 Sphere around your head 5 5 Reproduction 7 5.1 Binaural reproduction......................
More informationExperimental evaluation of massive MIMO at 20 GHz band in indoor environment
This article has been accepted and published on J-STAGE in advance of copyediting. Content is final as presented. IEICE Communications Express, Vol., 1 6 Experimental evaluation of massive MIMO at GHz
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More information396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011
396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011 Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationDISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION
DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION T Spenceley B Wiggins University of Derby, Derby, UK University of Derby,
More informationA Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation
A Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation SEPTIMIU MISCHIE Faculty of Electronics and Telecommunications Politehnica University of Timisoara Vasile
More informationTime Delay Estimation: Applications and Algorithms
Time Delay Estimation: Applications and Algorithms Hing Cheung So http://www.ee.cityu.edu.hk/~hcso Department of Electronic Engineering City University of Hong Kong H. C. So Page 1 Outline Introduction
More informationBase-station Antenna Pattern Design for Maximizing Average Channel Capacity in Indoor MIMO System
MIMO Capacity Expansion Antenna Pattern Base-station Antenna Pattern Design for Maximizing Average Channel Capacity in Indoor MIMO System We present an antenna-pattern design method for maximizing average
More informationMultiple sound source localization using gammatone auditory filtering and direct sound componence detection
IOP Conference Series: Earth and Environmental Science PAPER OPE ACCESS Multiple sound source localization using gammatone auditory filtering and direct sound componence detection To cite this article:
More informationInformed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 22, NO. 7, JULY 2014 1195 Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays Maja Taseska, Student
More informationSOUND FIELD MEASUREMENTS INSIDE A REVERBERANT ROOM BY MEANS OF A NEW 3D METHOD AND COMPARISON WITH FEM MODEL
SOUND FIELD MEASUREMENTS INSIDE A REVERBERANT ROOM BY MEANS OF A NEW 3D METHOD AND COMPARISON WITH FEM MODEL P. Guidorzi a, F. Pompoli b, P. Bonfiglio b, M. Garai a a Department of Industrial Engineering
More informationThree Element Beam forming Algorithm with Reduced Interference Effect in Signal Direction
Vol. 3, Issue. 5, Sep - Oct. 3 pp-749-753 ISSN: 49-6645 Three Element Beam forming Algorithm with Reduced Interference Effect in Signal Direction V. Manjula, M. Tech, K.Suresh Reddy, M.Tech, (Ph.D) Deparment
More information38123 Povo Trento (Italy), Via Sommarive 14
UNIVERSITY OF TRENTO DIPARTIMENTO DI INGEGNERIA E SCIENZA DELL INFORMAZIONE 38123 Povo Trento (Italy), Via Sommarive 14 http://www.disi.unitn.it AN INVESTIGATION ON UWB-MIMO COMMUNICATION SYSTEMS BASED
More informationDrum Transcription Based on Independent Subspace Analysis
Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford,
More informationRadar Signal Classification Based on Cascade of STFT, PCA and Naïve Bayes
216 7th International Conference on Intelligent Systems, Modelling and Simulation Radar Signal Classification Based on Cascade of STFT, PCA and Naïve Bayes Yuanyuan Guo Department of Electronic Engineering
More informationThe Estimation of the Directions of Arrival of the Spread-Spectrum Signals With Three Orthogonal Sensors
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 51, NO. 5, SEPTEMBER 2002 817 The Estimation of the Directions of Arrival of the Spread-Spectrum Signals With Three Orthogonal Sensors Xin Wang and Zong-xin
More information29th TONMEISTERTAGUNG VDT INTERNATIONAL CONVENTION, November 2016
Measurement and Visualization of Room Impulse Responses with Spherical Microphone Arrays (Messung und Visualisierung von Raumimpulsantworten mit kugelförmigen Mikrofonarrays) Michael Kerscher 1, Benjamin
More informationDetection of Multipath Propagation Effects in SAR-Tomography with MIMO Modes
Detection of Multipath Propagation Effects in SAR-Tomography with MIMO Modes Tobias Rommel, German Aerospace Centre (DLR), tobias.rommel@dlr.de, Germany Gerhard Krieger, German Aerospace Centre (DLR),
More informationSound source localization accuracy of ambisonic microphone in anechoic conditions
Sound source localization accuracy of ambisonic microphone in anechoic conditions Pawel MALECKI 1 ; 1 AGH University of Science and Technology in Krakow, Poland ABSTRACT The paper presents results of determination
More informationPERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES ABSTRACT
Approved for public release; distribution is unlimited. PERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES September 1999 Tien Pham U.S. Army Research
More informationDESIGN AND IMPLEMENTATION OF AN ALGORITHM FOR MODULATION IDENTIFICATION OF ANALOG AND DIGITAL SIGNALS
DESIGN AND IMPLEMENTATION OF AN ALGORITHM FOR MODULATION IDENTIFICATION OF ANALOG AND DIGITAL SIGNALS John Yong Jia Chen (Department of Electrical Engineering, San José State University, San José, California,
More informationBlind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model
Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial
More informationENHANCED PRECISION IN SOURCE LOCALIZATION BY USING 3D-INTENSITY ARRAY MODULE
BeBeC-2016-D11 ENHANCED PRECISION IN SOURCE LOCALIZATION BY USING 3D-INTENSITY ARRAY MODULE 1 Jung-Han Woo, In-Jee Jung, and Jeong-Guon Ih 1 Center for Noise and Vibration Control (NoViC), Department of
More informationAudio Fingerprinting using Fractional Fourier Transform
Audio Fingerprinting using Fractional Fourier Transform Swati V. Sutar 1, D. G. Bhalke 2 1 (Department of Electronics & Telecommunication, JSPM s RSCOE college of Engineering Pune, India) 2 (Department,
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationWhat applications is a cardioid subwoofer configuration appropriate for?
SETTING UP A CARDIOID SUBWOOFER SYSTEM Joan La Roda DAS Audio, Engineering Department. Introduction In general, we say that a speaker, or a group of speakers, radiates with a cardioid pattern when it radiates
More informationSpeech Enhancement Using Microphone Arrays
Friedrich-Alexander-Universität Erlangen-Nürnberg Lab Course Speech Enhancement Using Microphone Arrays International Audio Laboratories Erlangen Prof. Dr. ir. Emanuël A. P. Habets Friedrich-Alexander
More informationCase study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system
Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system Takayuki Watanabe Yamaha Commercial Audio Systems, Inc.
More informationProceedings Statistical Evaluation of the Positioning Error in Sequential Localization Techniques for Sensor Networks
Proceedings Statistical Evaluation of the Positioning Error in Sequential Localization Techniques for Sensor Networks Cesar Vargas-Rosales *, Yasuo Maidana, Rafaela Villalpando-Hernandez and Leyre Azpilicueta
More informationMeasuring procedures for the environmental parameters: Acoustic comfort
Measuring procedures for the environmental parameters: Acoustic comfort Abstract Measuring procedures for selected environmental parameters related to acoustic comfort are shown here. All protocols are
More informationDistance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks
Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks Mariam Yiwere 1 and Eun Joo Rhee 2 1 Department of Computer Engineering, Hanbat National University,
More information