PAPER Adaptive Microphone Array System with Two-Stage Adaptation Mode Controller

Size: px
Start display at page:

Download "PAPER Adaptive Microphone Array System with Two-Stage Adaptation Mode Controller"

Transcription

1 972 IEICE TRANS. FUNDAMENTALS, VOL.E88 A, NO.4 APRIL 2005 PAPER Adaptive Microphone Array System with Two-Stage Adaptation Mode Controller Yang-Won JUNG a), Student Member, Hong-Goo KANG, Chungyong LEE, Nonmembers, Dae-Hee YOUN, Member, Changkyu CHOI, and Jaywoo KIM, Nonmembers SUMMARY In this paper, an adaptive microphone array system with a two-stage adaptation mode controller (AMC) is proposed for high-quality speech acquisition in real environments. The proposed system includes an adaptive array algorithm, a time-delay estimator and a newly proposed AMC. To ensure proper adaptation of the adaptive array algorithm, the proposed AMC uses not only temporal information, but also spatial information. The proposed AMC is constructed with two processing stages: an initialization stage and a running stage. In the initialization stage, a sound source localization technique is adopted, and a signal correlation characteristic is used in the running stage. For the adaptive array algorithm, a generalized sidelobe canceller with an adaptive blocking matrix is used. The proposed algorithm is implemented as a real-time man-machine interface module of a home-agent robot. Simulation results show 13 db SINR improvement with the speaker sitting 2 m distance from the home-agent robot. The speech recognition rate is also enhanced by 32% when compared to the single channel acquisition system. key words: speech enhancement, microphone array, generalized sidelobe canceller, adaptation mode controller 1. Introduction Recently, much consideration has been paid to microphone array (MA) systems that can offer more comfortable speech interface. MA systems can provide accurate acquisition of a distant speaker s speech even in noisy environments. The current speech recognition systems suffer from distant speakers and strong interference environments, but with MA systems, the application of speech recognition will be greatly expanded [1] [3]. The generalized sidelobe canceller (GSC) is considered the most feasible algorithm for the MA system because of its simplicity and capability of interference reduction [4], [5]. In a real situation, however, reverberation causes an incomplete blocking of the target signal in the noise reference signal path, thus the system performance degrades significantly [4], [6]. To overcome this problem, methods to control the blocking matrix (BM) in an adaptive way were proposed [4], [6]. In these methods, an adaptation mode controller (AMC) that differentiates between the existence of the tar- Manuscript received May 19, Manuscript revised October 27, Final manuscript received December 27, The authors are with MCSP Lab., Dept. of Electrical and Electronic Eng., Yonsei University, Seoul, Korea. The authors are with Human & Computer Interaction Lab., Samsung Advanced Institute of Technology, Kyonggi-Do, Korea. This work was supported by the Research Fund of Samsung Advanced Institute of Technology under Project PR RR. a) ywjung@mcsp.yonsei.ac.kr DOI: /ietfec/e88 a get and the interfering signals is introduced to have proper adaptation in the adaptive part of the GSC. It is well known that the performance of the GSC-based MA system mainly depends on the accuracy of the AMC. When the AMC fails to determine proper adaptation modes, the target signal of the system output can be cancelled out entirely. Despite its importance to the overall performance of the GSC, few researches have concentrated on AMC methods yet. Hoshuyama et al proposed a power ratio method that compares the power of the output of a fixed beamformer (FBF) to that of the output of the BM [7]. Since it utilizes the output of the BM, this method is only applicable when the adaptive blocking matrix (ABM) of the system is initially converged. Moreover, the decision parameters of this method should be controlled by a signal to interference and noise ratio (SINR), which is difficult to set to an appropriate threshold. Much earlier, Greenberg et al proposed an AMC for an adaptive beamformer for hearing aids [8]. In their method, cross correlation between input sensors are of concern, and this method can be applicable only to uncorrelated background noise environments. In this paper, we propose a new AMC that can be used even in the situation when the system has not been initialized. The proposed AMC operates with two-stage modes: an initialization stage and a running stage. In the initialization stage, a sound source localization (SSL) technique is adopted. Using the SSL s estimate of the direction of the incoming signal, an ABM can be trained when the signal comes from the target direction only. The running stage is controlled by the cross correlation coefficient of the FBF output and the GSC output, and it is easy to set an appropriate threshold by its normalization property. The performance of the proposed system is evaluated as a real-time pre-processor of the man-machine-interface for a Home-Agent Robot (HAR) system. Experimental results verify that the proposed AMC method outperforms the power ratio method, and high-quality speech acquisition of distant speakers can be achieved in very low SINR environments. 2. Adaptive Microphone Array Algorithm 2.1 Generalized Sidelobe Canceller The GSC consists of three functional blocks: a FBF, a BM, and a multiple input canceller (MIC). Generally, the FBF Copyright c 2005 The Institute of Electronics, Information and Communication Engineers

2 JUNG et al.: ADAPTIVE MICROPHONE ARRAY SYSTEM WITH TWO-STAGE ADAPTATION MODE CONTROLLER 973 Fig. 2 Example of the power ratio method. ABM MIC Fig. 1 Generalized sidelobe canceller with an ABM. Table 1 Adaptation mode of the GSC. Target only Interference only Target + Interference Adaptation Filtering Filtering + Filtering only only Filtering Adaptation Filtering only + Filtering only is realized by a delay-and-sum beamformer, the BM by a fixed transform such as a delay-and-subtract or a Walsh transform, and the MIC by a traditional multi-channel adaptive noise canceller [9], [10]. In enclosures like room environments, there always exist multiple propagation paths, so called reverberation. Due to reverberation, the conventional BM fails to block the target signal in the noise reference signal path, and then the system performance degrades significantly. Constructing the BM as an adaptive filter is suggested for preventing target signal leakage. The filter coefficients of the ABM and the MIC are updated by an LMS algorithm [4], [10]. Figure 1 shows a block diagram of the GSC with an ABM [11]. 2.2 Adaptation Mode Controller Since the filter characteristics of the ABM and the MIC are totally inverse, adaptation of the ABM and the MIC should be performed alternatively [4], [7]. The filter coefficients of the ABM are updated when only the target signal exists, and those of the MIC are updated when only the interference signals exist. This kind of adaptation mode control is well known as a double-talk problem in the field of adaptive echo cancellation [12]. However, if the adaptation mode is wrongly controlled, i.e. if the filters of the ABM are updated when the interference signal exists, the output of the overall system will not be enhanced at all or will be totally cancelled out [4]. To prevent this, an efficient AMC that correctly detects signal existence should be developed. The adequate adaptation mode is summarized in Table 1. The AMC based on power ratio was proposed by Hoshuyama et al [7]. In their method, the power ratio of the FBF output and the blocked output was used as a criterion to distinguish the target from interferences as follows: P ratio (n) = P y FBF (n) P bi (n) P yfbf (n) = (1 λ)p yfbf (n 1) + λy 2 FBF (n) P bi (n) = (1 λ)p bi (n 1) + λb 2 i (n) where, P yfbf (n) is the estimated power of the FBF output, P bi (n) is the estimated power of the i-th blocked output, and λ indicates a forgetting factor for averaging. Then, the power ratio P ratio (n) is compared with the predetermined threshold T pwr. P ratio (n) tends to be a large value when the target signal exists. This method has several weak points. First, it cannot be used for training an ABM. When an ABM is not trained at all, i.e. in the initial state, the i-th output of the ABM, b i (n) isthesameasthei + 1-th input signal x i+1 (n). As a result, P ratio (n) tends to be close to 1 in any case when a target or an interference signal exists. Therefore, P ratio (n) cannot be a measure of the signals existence. The second problem is that it is hard to set the threshold to be suitable to both high and low SINR environments, especially when the interference signal is non-stationary, such as speech or music. Moreover, leakage from the ABM forces P ratio (n) tobe a small value even in target periods. Note that perfect blocking cannot be achievable in real environments, and there always exists some amount of the target signal leakage in the blocked signal paths. Figure 2 shows an example of P ratio (n) and the target signal when we use the power ratio method. In this case, we pre-trained the ABM because the power ratio method cannot be applied to train the ABM. The difficulty of setting the threshold and the misdetection of the target signal in the target free period is well illustrated in Fig. 2. As shown in the area (b), false alarms occur when the target signal is absent. False alarms cause the adaptive filters of the MIC to converge slower, but their effect is not so critical for performance. More serious problems are caused by the misdetection of the target signal as in the area (a). In this case, since P ratio (n)issmallerthant pwr, the MIC should be trained even though it is in the target signal period, which causes a wrong solution of the MIC, thus the output of the GSC will be seriously distorted, sometimes even cancelled entirely. 3. Proposed System In this paper, we propose a new AMC that can be used even (1)

3 974 IEICE TRANS. FUNDAMENTALS, VOL.E88 A, NO.4 APRIL 2005 in the situation when the system has not been initialized yet. Unlike the previous attempts, the proposed AMC utilizes not only temporal information but also spatial information of given input signals. In general, it is assumed that a target signal is located at spatially separated position with an interference signal, thus the spatial information of the signals will be greatly helpful to decide the proper adaptation modes. The goal of this paper is to design and to implement an AMC that can be applicable in real environments using the both spatial and temporal information. In the GSC, the ABM and the MIC should be trained sequentially. Since the change of the ABM also affects the relationship of the reference signal and the desired signal of the MIC, the filter coefficients of the MIC should be readjusted when the ABM changes. For this reason, our proposed AMC controls the adaptation modes of the ABM and the MIC in a sequential manner. The proposed AMC operates with two-stage modes: an initialization stage and a running stage, and the operation of the proposed AMC in each stage is described as follows. 3.1 Initialization Stage To develop the initialization stage of our AMC, we use two assumptions: 1. The rough region of the target speaker is known in advance: Signals coming from the target region are treated as the target signals, while signals coming from outside of the target region are treated as interfering signals. 2. When signals appear from the target region for the first time, no directional interferences are coming from outside of the target region. When the characteristics of the interference signals differ from those of the target signal, there exist a few methods which can classify each signal. But, when the interference signals have similar statistical characteristics as the target one, (i.e. both target and interference signal are speech) it is extremely hard to detect the desired signal without a prior knowledge, even though blind separation technique could be applied [13]. Using the first assumption, we set the target region to differentiate the target from the other signals. The range of the target region can be controlled arbitrary based on the type of applications used. The second assumption is made for the easier realization of real-time man-machine-interface systems. In other words, a single SSL technique is used for our application. It is well known that the performance of the single SSL methods degrades when there exist multiple sound sources [4]. If multiple SSL techniques are concerned, the second assumption needs not be mandatory. In the initialization stage, the incoming direction of the input signal is estimated by the SSL. Under the first assumption, we can detect the target-only duration when the signals come from the target region, and thus, the ABM can be trained properly. The time difference of arrival information of each microphone input is used for localizing the sound source [4]. With the knowledge of the microphone positions, estimated time differences between each sensor are used to generate hyperbolic curves which are then intersected to arrive at a source location estimate [4]. Due to its computational efficiency and statistically optimum property for noise environments, the generalized cross correlation phase transform (GCC-PHAT) [14] is employed in this work. No signal enhancement can be achieved because the MIC is not yet adapted and remains in an initial state. Therefore, the system output is the same as the FBF output. The initialization stage is performed as follow: Calculate R ij (n), the cross correlation of two filtered versions of x i (n), and x j (n) : R ij (n)= 1 ψ ij (ω)x i (ω)x j 2π (ω)e jωn dω (2) where X i (ω), X j (ω) denote Fourier transform of x i (n) and x j (n), and frequency dependent weighting function ψ ij (ω) is given as [14] 1 ψ ij (ω) =. (3) X i (ω)x j (ω) Find the argument ˆn ij of the maximum R ij (n): ˆn ij = arg max R ij (n). (4) n D Find the angle of ˆn ij If the estimated angle is within the target region, the ABM is adapted and the AMC procedure jumps to the running stage. If the estimated angle is outside of the target region, the overall step is repeated. 3.2 Running Stage After finishing the initial training step of the ABM, the proposed AMC controls the adaptation mode of the MIC. In the running stage, we adopt a double talk detection (DTD) technique. Based on the principle of orthogonality [10], the cross correlation between the FBF output and the system output becomes zero when only interference exists. Conversely, the cross correlation tends to have very large value when the target signal exists. With our previously developed cross correlation based DTD method [15], we can detect the existences of the target and the interference signals efficiently. Moreover, by its normalization property, it is easy to set the threshold [15]. The cross correlation coefficient, ρ FBF GSC, can be estimated recursively as follows: ρ FBF GSC (n) = P FBF GSC (n) PyFBF (n)p y GSC (n) (5) P FBF GSC (n) = λp FBF GSC (n 1) + (1 λ)y FBF (n)y GSC (n)

4 JUNG et al.: ADAPTIVE MICROPHONE ARRAY SYSTEM WITH TWO-STAGE ADAPTATION MODE CONTROLLER 975 Fig. 4 room. (a) Experimental environments. (a) Target HAR. (b) Experimental (b) Fig. 3 Flowchart of the proposed AMC. P ygsc (n) = (1 λ)p ygsc (n 1) + λy 2 GSC (n) where λ indicates a forgetting factor for averaging and P yfbf (n) is defined in (1). Note that the value of ρ FBF GSC is close to 1 when the target signal exists, and ρ FBF GSC becomes zero when interference only exists. In a real situation, ρ FBF GSC tends to be small value due to estimation errors and other real world problems. Using the property of ρ FBF GSC, we can update the MIC when ρ FBF GSC is less than the threshold T corr, i.e. when the interference signal is active. The existence of the interference signal can be detected by the signal energy level. A flowchart of the proposed twostage AMC procedure is summarized in Fig Experiment Results We implemented the PC-based real-time system to evaluate the performance of the proposed system as a pre-processor of the man-machine-interface for a HAR system. The target HAR, developed by Samsung Advance Institute of Technology, is 62 cm tall and the body is near circular with an approximately 24 cm radius [16]. For speech processing, we use 8 omni-directional microphones placed around the robot s body. The distance between adjacent microphones is approximately 9 cm. An automatic speech recognition module is implemented and is combined with the MA system. The recognized words are displayed on LCD display. A living room-like experimental room is constructed to develop and ensure the efficiency of the proposed speech processing system. The experimental room is (6.1 m 4.4 m 2.6 m) and the reflection time is about 300 msec. To mimic a general living room environment, sofa, TV and stereo system are placed in the room. The HAR is located approximately 2 m from the sofa. During Fig. 5 AMC parameters (running stage). experiments, human speakers are located on the sofa and speak 2 5 syllable words. Rock music is played from the loudspeaker as an interference and the SINR is set to about 0 10 db. The HAR and the experimental room are depicted in Fig. 4. We evaluate the performance of our proposed system in terms of the mode detection in the AMC, the amount of speech enhancement, and the recognition score. 4.1 AMC Results In this section, only the second stage of the proposed AMC is evaluated. The performance of the proposed AMC at the first stage entirely depends on the accuracy of the SSL. Since we are mainly concerned with the performance of the AMC, we will not analyze the performance of the SSL module in this paper. Note that, however, the GCC-PHAT method provides sufficient accuracy to perform the first stage of the AMC in our experiment. The performance of the second stage is compared with that of the power ratio method. As mentioned in Sect. 2.2, the power ratio method cannot be used to train the ABM. Therefore, we also apply the same processing step of the first stage to the power ratio method. The AMC parameters of the second stage are given in Fig. 5. As we discussed earlier, the power ratio method fails to detect target existence, and it is difficult to set the decision threshold. On the con-

5 976 IEICE TRANS. FUNDAMENTALS, VOL.E88 A, NO.4 APRIL 2005 (a) (b) Fig. 6 Waveforms and spectrograms of: (a) 1st microphone input and (b) system output. Table 2 SINR and recognition results. Single channel Proposed SINR 5dB 18 db (13 db improved) Recognition score 61% 93% trary, no detection failure occurs and it is easy to determine with the proposed AMC. One can notice that false alarms appear in both the power ratio method and the proposed method. Though the false alarms force the MIC to freeze in the target absent periods and they slow convergence of the MIC, small amounts of false alarms could be tolerable for algorithm stability. However, the misdetection of the target existence affects the MIC deteriorately. In the case of false alarms, the proposed AMC is much more reliable than the power ratio method. 4.2 System Outputs and Recognition Results To verify the performance of the proposed system when it is combined with the HAR, we measure the SINR of the system input and output as well as the speech recognition rate. The time domain waveforms and spectrograms of the 1st microphone input and the system output are depicted in Fig. 6. About 13 db SINR improvement is achieved in this experiment. We do not include the results of the power ratio method because its performance is very poor in these situations. For the recognition system, a command set consisting of 40 words for control of the home appliance is used. Table 2 shows the recognition score and the SINR enhancement of the single channel system and the proposed system. The proposed method shows 32% higher recognition rate than the single channel method. 5. Conclusion In this paper, we proposed a new adaptive MA system with a two-stage AMC and implemented it with a real-time system for the man-machine-interface of the HAR. The proposed AMC utilized both temporal and spatial information of given signals for more proper control of adaptation modes, which will eventually improve the performance of the MA systems where the proper information is available. The proposed AMC outperformed the conventional power ratio method, and the implemented system acquired high quality speech from a distant speaker in noisy environments while it requiring only PC computing power. The recognition score of the implemented system was enhanced more than 30% compared to the single channel system. The implemented MA system could be used by any kind of manmachine-interface system for speech acquisition and recognition in real environments. References [1] L.R. Rabiner and B.H. Juang, Fundamentals of speech recognitions, Prentice Hall, [2] M. Omologo, P. Svaizer, and M. Matassoni, Environmental conditions and acoustic transduction in hands-free speech recognition, Speech Commun., vol.25, pp.75 95, [3] J. Bitzer, K.U. Simmer, and K.D. Kammeyer, Multi-microphone noise reduction techniques for hands-free speech recognition A comparative study, Proc. Workshop on Robust Methods for Speech Recognition in Adverse Conditions, pp , Tampere, Finland, [4] M. Brandstein and D. Ward, Microphone Arrays, Springer, [5] S.Y. Low, N. Grbic, and S. Nordholm, Robust microphone array using subband adaptive beamformer and spectral subtraction, Proc. 8th International Conference on Communication Systems, vol.2, pp , [6] O. Hoshuyama, A. Sugiyama, and A. Hirano, A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters, IEEE Trans. Signal Process., vol.47, no.10, pp , Oct [7] O. Hoshuyama, B. Begasse, A. Sugiyama, and A. Hirano, A realtime robust adaptive microphone array controlled by an SNR estimarte, Proc. IEEE International Conference on Acoustics, Speech, Signal Processing, pp , [8] J.E. Greenberg and P.M. Zurek, Evaluation of an adaptive beamforming method for hearing aids, J. Acoust. Soc. Am., vol.91, no.3, pp , March [9] L.J. Griffiths and C.W. Jim, An alternative approach to linear constrained adaptive beamforming, IEEE Trans. Antennas Propag., vol.ap-30, no.1, pp.27 34, Jan [10] S. Haykin, Adaptive Filter Theory, Prentice Hall, [11] S. Gannot, D. Burshtein, and E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Trans. Signal Process., vol.49, no.8, pp , Aug [12] M.M. Sondhi, An adaptive echo canceler, Bell Syst. Tech. J., vol.46, pp , March [13] A. Hyvarinen and E. Oja, Independent component analysis: Algorithms and applications, Neural Netw., vol.13, no.4-5, pp , [14] C.H. Knapp and G.C. Carter, The generalized correlation method for estimation of time delay, IEEE Trans. Acoust. Speech Signal Process., vol.24, no.4, pp , Aug [15] S.J. Park, C.G. Cho, C. Lee, and D.H. Youn, Integrated echo and noise canceler for hands-free applications, IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol.49, no.3, pp , March [16] Y.W. Jung, J. Lee, D. Kong, J. Kim, and C. Lee, High-quality speech acquisition and recognition system for home-agent robot, Proc. IEEE International Conference on Consumer Electronics, pp , 2003.

6 JUNG et al.: ADAPTIVE MICROPHONE ARRAY SYSTEM WITH TWO-STAGE ADAPTATION MODE CONTROLLER 977 Hong-Goo Kang received the B.S., M.S., and Ph.D. degrees in electronic engineering from Yonsei University, Seoul Korea, in 1989, 1991 and 1995, respectively. He was a Senior Member of Technical Staff of AT&T, Labs- Research, from 1996 to In 2002, he joined the Department of Electrical and Electronic Engineering, Yonsei University, where he is currently an Assistant Professor. His research interests include speech signal processing, array signal processing and communication signal processing. Yang-Won Jung received his B.S. and M.S. degrees in Electronic Engineering from Yonsei University, Seoul, Korea in 1998 and 2000 respectively. He is currently a Ph.D. candidate in Department of Electrical and Electronic Engineering at Yonsei University, Seoul, Korea. His research interests include adaptive signal processing, echo and noise cancellation, 3D audio signal processing, and adaptive microphone array algorithm. Chungyong Lee received the B.S. and M.S. degrees in electronic engineering from Yonsei University, Seoul Korea, in 1987 and 1989, respectively, and the Ph.D. degree in electrical and computer engineering from the Georgia Institute of Technology, Atlanta, GA, in He was a senior engineer of Samsung Electronics Co., Ltd., Kiheung, Korea from 1996 to In 1997, he joined the faculty of the Department of Electrical and Electronic Engineering, Yonsei University, where he is currently an Associate Professor. His research interests include array signal processing and communication signal processing. Changkyu Choi received the B.E., M.E., and Ph.D. degrees in Electrical Engineering from Korea Advanced Institute of Science and Technology in 1991, 1994, and 1999, respectively. He joined Human and Computer Interaction Laboratory at Samsung Advanced Institute of Technology, Kyonggi-Do, Korea, in 1999, where he is currently a senior engineer. His research interests are in the areas of speech enhancement, speech feature extraction, sound source localization, blind source separation, and microphone array signal processing for robotic systems. Jaywoo Kim received the B.S. degree in electronics and computer engineering from Korea University, Seoul, in 1990, and the M.S. and Ph.D. degrees in electrical engineering from the Ohio State University, Columbus, in 1992 and 1995, respectively. From 1996 to 1999, he was a research engineer at Renault-Samsung Motors Technology Center, Kyonggi-Do, Korea. He joined Human and Computer Interaction Laboratory at Samsung Advanced Institute of Technology, Kyungki-Do, Korea, in 1999 where he is currently a senior researcher. His research interests include user interface, robotics, computer vision, speech recognition, and sound source localization/separation. Dae-Hee Youn received his B.S. degree in Electronic Engineering from Yonsei University, Seoul, Korea, in 1977, and the M.S. and Ph.D. degrees in Electrical Engineering from Kansas State University, Manhattan, Kansas, in 1979 and 1982, respectively. From 1982 to 1985, he was an assistant professor at the University of Iowa, Iowa City, Iowa. Since 1985, he has been with the Department of Electrical and Electronic Engineering at Yonsei University, Seoul, Korea, where he is currently a professor. His research interests include adaptive digital filter and its application, speech and audio signal processing, and real-time implementation of DSP algorithms.

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B. www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 4 April 2015, Page No. 11143-11147 Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya

More information

LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function

LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function IEICE TRANS. INF. & SYST., VOL.E97 D, NO.9 SEPTEMBER 2014 2533 LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function Jinsoo PARK, Wooil KIM,

More information

Michael Brandstein Darren Ward (Eds.) Microphone Arrays. Signal Processing Techniques and Applications. With 149 Figures. Springer

Michael Brandstein Darren Ward (Eds.) Microphone Arrays. Signal Processing Techniques and Applications. With 149 Figures. Springer Michael Brandstein Darren Ward (Eds.) Microphone Arrays Signal Processing Techniques and Applications With 149 Figures Springer Contents Part I. Speech Enhancement 1 Constant Directivity Beamforming Darren

More information

Speaker Localization in Noisy Environments Using Steered Response Voice Power

Speaker Localization in Noisy Environments Using Steered Response Voice Power 112 IEEE Transactions on Consumer Electronics, Vol. 61, No. 1, February 2015 Speaker Localization in Noisy Environments Using Steered Response Voice Power Hyeontaek Lim, In-Chul Yoo, Youngkyu Cho, and

More information

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Engineering

More information

THE problem of acoustic echo cancellation (AEC) was

THE problem of acoustic echo cancellation (AEC) was IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 6, NOVEMBER 2005 1231 Acoustic Echo Cancellation and Doubletalk Detection Using Estimated Loudspeaker Impulse Responses Per Åhgren Abstract

More information

Robust Low-Resource Sound Localization in Correlated Noise

Robust Low-Resource Sound Localization in Correlated Noise INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem

More information

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment International Journal of Electronics Engineering Research. ISSN 975-645 Volume 9, Number 4 (27) pp. 545-556 Research India Publications http://www.ripublication.com Study Of Sound Source Localization Using

More information

Automotive three-microphone voice activity detector and noise-canceller

Automotive three-microphone voice activity detector and noise-canceller Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR

More information

Sound Source Localization using HRTF database

Sound Source Localization using HRTF database ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,

More information

ROBUST echo cancellation requires a method for adjusting

ROBUST echo cancellation requires a method for adjusting 1030 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 3, MARCH 2007 On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk Jean-Marc Valin, Member,

More information

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION 1th European Signal Processing Conference (EUSIPCO ), Florence, Italy, September -,, copyright by EURASIP AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION Gerhard Doblinger Institute

More information

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 50, NO. 12, DECEMBER

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 50, NO. 12, DECEMBER IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 50, NO. 12, DECEMBER 2002 1865 Transactions Letters Fast Initialization of Nyquist Echo Cancelers Using Circular Convolution Technique Minho Cheong, Student Member,

More information

IN REVERBERANT and noisy environments, multi-channel

IN REVERBERANT and noisy environments, multi-channel 684 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 6, NOVEMBER 2003 Analysis of Two-Channel Generalized Sidelobe Canceller (GSC) With Post-Filtering Israel Cohen, Senior Member, IEEE Abstract

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION Gerhard Doblinger Institute of Communications and Radio-Frequency Engineering Vienna University of Technology Gusshausstr. 5/39,

More information

Adaptive beamforming using pipelined transform domain filters

Adaptive beamforming using pipelined transform domain filters Adaptive beamforming using pipelined transform domain filters GEORGE-OTHON GLENTIS Technological Education Institute of Crete, Branch at Chania, Department of Electronics, 3, Romanou Str, Chalepa, 73133

More information

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

Airo Interantional Research Journal September, 2013 Volume II, ISSN: Airo Interantional Research Journal September, 2013 Volume II, ISSN: 2320-3714 Name of author- Navin Kumar Research scholar Department of Electronics BR Ambedkar Bihar University Muzaffarpur ABSTRACT Direction

More information

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor Presented by Amir Kiperwas 1 M-element microphone array One desired source One undesired source Ambient noise field Signals: Broadband Mutually

More information

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS 18th European Signal Processing Conference (EUSIPCO-21) Aalborg, Denmark, August 23-27, 21 A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS Nima Yousefian, Kostas Kokkinakis

More information

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Vol., No. 6, 0 Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA chen.zhixin.mt@gmail.com Abstract This paper

More information

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial

More information

Speech Enhancement Based On Noise Reduction

Speech Enhancement Based On Noise Reduction Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion

More information

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi,

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi, JAIST Reposi https://dspace.j Title Towards an intelligent binaural spee enhancement system by integrating me signal extraction Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi, Citation 2011 International

More information

Speech Enhancement Using Robust Generalized Sidelobe Canceller with Multi-Channel Post-Filtering in Adverse Environments

Speech Enhancement Using Robust Generalized Sidelobe Canceller with Multi-Channel Post-Filtering in Adverse Environments Chinese Journal of Electronics Vol.21, No.1, Jan. 2012 Speech Enhancement Using Robust Generalized Sidelobe Canceller with Multi-Channel Post-Filtering in Adverse Environments LI Kai, FU Qiang and YAN

More information

MULTICHANNEL systems are often used for

MULTICHANNEL systems are often used for IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 52, NO. 5, MAY 2004 1149 Multichannel Post-Filtering in Nonstationary Noise Environments Israel Cohen, Senior Member, IEEE Abstract In this paper, we present

More information

High-speed Noise Cancellation with Microphone Array

High-speed Noise Cancellation with Microphone Array Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent

More information

Calibration of Microphone Arrays for Improved Speech Recognition

Calibration of Microphone Arrays for Improved Speech Recognition MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Calibration of Microphone Arrays for Improved Speech Recognition Michael L. Seltzer, Bhiksha Raj TR-2001-43 December 2001 Abstract We present

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY INTER-NOISE 216 WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY Shumpei SAKAI 1 ; Tetsuro MURAKAMI 2 ; Naoto SAKATA 3 ; Hirohumi NAKAJIMA 4 ; Kazuhiro NAKADAI

More information

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events INTERSPEECH 2013 Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events Rupayan Chakraborty and Climent Nadeu TALP Research Centre, Department of Signal Theory

More information

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research Improving Meetings with Microphone Array Algorithms Ivan Tashev Microsoft Research Why microphone arrays? They ensure better sound quality: less noises and reverberation Provide speaker position using

More information

Audio Restoration Based on DSP Tools

Audio Restoration Based on DSP Tools Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

A MICROPHONE ARRAY INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE

A MICROPHONE ARRAY INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE A MICROPHONE ARRA INTERFACE FOR REAL-TIME INTERACTIVE MUSIC PERFORMANCE Daniele Salvati AVIRES lab Dep. of Mathematics and Computer Science, University of Udine, Italy daniele.salvati@uniud.it Sergio Canazza

More information

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Single Channel Speaker Segregation using Sinusoidal Residual Modeling NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology

More information

DESIGN AND IMPLEMENTATION OF ADAPTIVE ECHO CANCELLER BASED LMS & NLMS ALGORITHM

DESIGN AND IMPLEMENTATION OF ADAPTIVE ECHO CANCELLER BASED LMS & NLMS ALGORITHM DESIGN AND IMPLEMENTATION OF ADAPTIVE ECHO CANCELLER BASED LMS & NLMS ALGORITHM Sandip A. Zade 1, Prof. Sameena Zafar 2 1 Mtech student,department of EC Engg., Patel college of Science and Technology Bhopal(India)

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Broadband Microphone Arrays for Speech Acquisition

Broadband Microphone Arrays for Speech Acquisition Broadband Microphone Arrays for Speech Acquisition Darren B. Ward Acoustics and Speech Research Dept. Bell Labs, Lucent Technologies Murray Hill, NJ 07974, USA Robert C. Williamson Dept. of Engineering,

More information

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram Proceedings of APSIPA Annual Summit and Conference 5 6-9 December 5 Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram Yusuke SHIIKI and Kenji SUYAMA School of Engineering, Tokyo

More information

INTERFERENCE REJECTION OF ADAPTIVE ARRAY ANTENNAS BY USING LMS AND SMI ALGORITHMS

INTERFERENCE REJECTION OF ADAPTIVE ARRAY ANTENNAS BY USING LMS AND SMI ALGORITHMS INTERFERENCE REJECTION OF ADAPTIVE ARRAY ANTENNAS BY USING LMS AND SMI ALGORITHMS Kerim Guney Bilal Babayigit Ali Akdagli e-mail: kguney@erciyes.edu.tr e-mail: bilalb@erciyes.edu.tr e-mail: akdagli@erciyes.edu.tr

More information

LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS

LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS ICSV14 Cairns Australia 9-12 July, 2007 LOCALIZATION AND IDENTIFICATION OF PERSONS AND AMBIENT NOISE SOURCES VIA ACOUSTIC SCENE ANALYSIS Abstract Alexej Swerdlow, Kristian Kroschel, Timo Machmer, Dirk

More information

Analysis of LMS and NLMS Adaptive Beamforming Algorithms

Analysis of LMS and NLMS Adaptive Beamforming Algorithms Analysis of LMS and NLMS Adaptive Beamforming Algorithms PG Student.Minal. A. Nemade Dept. of Electronics Engg. Asst. Professor D. G. Ganage Dept. of E&TC Engg. Professor & Head M. B. Mali Dept. of E&TC

More information

Optimum Rate Allocation for Two-Class Services in CDMA Smart Antenna Systems

Optimum Rate Allocation for Two-Class Services in CDMA Smart Antenna Systems 810 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 51, NO. 5, MAY 2003 Optimum Rate Allocation for Two-Class Services in CDMA Smart Antenna Systems Il-Min Kim, Member, IEEE, Hyung-Myung Kim, Senior Member,

More information

Auditory System For a Mobile Robot

Auditory System For a Mobile Robot Auditory System For a Mobile Robot PhD Thesis Jean-Marc Valin Department of Electrical Engineering and Computer Engineering Université de Sherbrooke, Québec, Canada Jean-Marc.Valin@USherbrooke.ca Motivations

More information

Comparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement

Comparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement Comparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement Mamun Ahmed, Nasimul Hyder Maruf Bhuyan Abstract In this paper, we have presented the design, implementation

More information

Chapter 4 SPEECH ENHANCEMENT

Chapter 4 SPEECH ENHANCEMENT 44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or

More information

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement Optimal Adaptive Filtering Technique for Tamil Speech Enhancement Vimala.C Project Fellow, Department of Computer Science Avinashilingam Institute for Home Science and Higher Education and Women Coimbatore,

More information

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions

More information

Localization of underwater moving sound source based on time delay estimation using hydrophone array

Localization of underwater moving sound source based on time delay estimation using hydrophone array Journal of Physics: Conference Series PAPER OPEN ACCESS Localization of underwater moving sound source based on time delay estimation using hydrophone array To cite this article: S. A. Rahman et al 2016

More information

Multiple Sound Sources Localization Using Energetic Analysis Method

Multiple Sound Sources Localization Using Energetic Analysis Method VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova

More information

A Robust Adaptive Beamformer with a Blocking Matrix Using Coefficient-Constrained Adaptive Filters

A Robust Adaptive Beamformer with a Blocking Matrix Using Coefficient-Constrained Adaptive Filters 640 IEICE TRANS. FUNDAMENTALS, VOL.E82 A, NO.4 APRIL 1999 PAPER A Robust Adaptive Beamformer with a Blocking Matrix Using Coefficient-Constrained Adaptive Filters Osamu HOSHUYAMA, Akihiko SUGIYAMA, and

More information

PAPER A Novel Adaptive Array Utilizing Frequency Characteristics of Multi-Carrier Signals

PAPER A Novel Adaptive Array Utilizing Frequency Characteristics of Multi-Carrier Signals IEICE TRANS. COMMUN., VOL.E83 B, NO.2 FEBRUARY 2000 371 PAPER A Novel Adaptive Array Utilizing Frequency Characteristics of Multi-Carrier Signals Mitoshi FUJIMOTO, Kunitoshi NISHIKAWA, Tsutayuki SHIBATA,

More information

REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION

REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION Ryo Mukai Hiroshi Sawada Shoko Araki Shoji Makino NTT Communication Science Laboratories, NTT

More information

A BROADBAND BEAMFORMER USING CONTROLLABLE CONSTRAINTS AND MINIMUM VARIANCE

A BROADBAND BEAMFORMER USING CONTROLLABLE CONSTRAINTS AND MINIMUM VARIANCE A BROADBAND BEAMFORMER USING CONTROLLABLE CONSTRAINTS AND MINIMUM VARIANCE Sam Karimian-Azari, Jacob Benesty,, Jesper Rindom Jensen, and Mads Græsbøll Christensen Audio Analysis Lab, AD:MT, Aalborg University,

More information

A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang, Dong-jun Seo, and Dong-seok Jung,

A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang, Dong-jun Seo, and Dong-seok Jung, IJCSNS International Journal of Computer Science and Network Security, VOL.11 No.9, September 2011 55 A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang,

More information

DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM. Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W.

DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM. Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W. DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W. Krueger Amazon Lab126, Sunnyvale, CA 94089, USA Email: {junyang, philmes,

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS 17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS Jürgen Freudenberger, Sebastian Stenzel, Benjamin Venditti

More information

Study of Different Adaptive Filter Algorithms for Noise Cancellation in Real-Time Environment

Study of Different Adaptive Filter Algorithms for Noise Cancellation in Real-Time Environment Study of Different Adaptive Filter Algorithms for Noise Cancellation in Real-Time Environment G.V.P.Chandra Sekhar Yadav Student, M.Tech, DECS Gudlavalleru Engineering College Gudlavalleru-521356, Krishna

More information

Advanced delay-and-sum beamformer with deep neural network

Advanced delay-and-sum beamformer with deep neural network PROCEEDINGS of the 22 nd International Congress on Acoustics Acoustic Array Systems: Paper ICA2016-686 Advanced delay-and-sum beamformer with deep neural network Mitsunori Mizumachi (a), Maya Origuchi

More information

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method Udo Klein, Member, IEEE, and TrInh Qu6c VO School of Electrical Engineering, International University,

More information

Noise Reduction for L-3 Nautronix Receivers

Noise Reduction for L-3 Nautronix Receivers Noise Reduction for L-3 Nautronix Receivers Jessica Manea School of Electrical, Electronic and Computer Engineering, University of Western Australia Roberto Togneri School of Electrical, Electronic and

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Microphone Array Feedback Suppression. for Indoor Room Acoustics

Microphone Array Feedback Suppression. for Indoor Room Acoustics Microphone Array Feedback Suppression for Indoor Room Acoustics by Tanmay Prakash Advisor: Dr. Jeffrey Krolik Department of Electrical and Computer Engineering Duke University 1 Abstract The objective

More information

Title. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information

Title. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information Title A Low-Distortion Noise Canceller with an SNR-Modifie Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir Proceedings : APSIPA ASC 9 : Asia-Pacific Signal Citationand Conference: -5 Issue

More information

On-Line Dead-Time Compensation Method Based on Time Delay Control

On-Line Dead-Time Compensation Method Based on Time Delay Control IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 11, NO. 2, MARCH 2003 279 On-Line Dead-Time Compensation Method Based on Time Delay Control Hyun-Soo Kim, Kyeong-Hwa Kim, and Myung-Joong Youn Abstract

More information

Smart antenna for doa using music and esprit

Smart antenna for doa using music and esprit IOSR Journal of Electronics and Communication Engineering (IOSRJECE) ISSN : 2278-2834 Volume 1, Issue 1 (May-June 2012), PP 12-17 Smart antenna for doa using music and esprit SURAYA MUBEEN 1, DR.A.M.PRASAD

More information

Local Relative Transfer Function for Sound Source Localization

Local Relative Transfer Function for Sound Source Localization Local Relative Transfer Function for Sound Source Localization Xiaofei Li 1, Radu Horaud 1, Laurent Girin 1,2, Sharon Gannot 3 1 INRIA Grenoble Rhône-Alpes. {firstname.lastname@inria.fr} 2 GIPSA-Lab &

More information

Narrow-Band Interference Rejection in DS/CDMA Systems Using Adaptive (QRD-LSL)-Based Nonlinear ACM Interpolators

Narrow-Band Interference Rejection in DS/CDMA Systems Using Adaptive (QRD-LSL)-Based Nonlinear ACM Interpolators 374 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 52, NO. 2, MARCH 2003 Narrow-Band Interference Rejection in DS/CDMA Systems Using Adaptive (QRD-LSL)-Based Nonlinear ACM Interpolators Jenq-Tay Yuan

More information

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation Gal Reuven Under supervision of Sharon Gannot 1 and Israel Cohen 2 1 School of Engineering, Bar-Ilan University,

More information

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position Applying the Filtered Back-Projection Method to Extract Signal at Specific Position 1 Chia-Ming Chang and Chun-Hao Peng Department of Computer Science and Engineering, Tatung University, Taipei, Taiwan

More information

Real time noise-speech discrimination in time domain for speech recognition application

Real time noise-speech discrimination in time domain for speech recognition application University of Malaya From the SelectedWorks of Mokhtar Norrima January 4, 2011 Real time noise-speech discrimination in time domain for speech recognition application Norrima Mokhtar, University of Malaya

More information

x ( Primary Path d( P (z) - e ( y ( Adaptive Filter W (z) y( S (z) Figure 1 Spectrum of motorcycle noise at 40 mph. modeling of the secondary path to

x ( Primary Path d( P (z) - e ( y ( Adaptive Filter W (z) y( S (z) Figure 1 Spectrum of motorcycle noise at 40 mph. modeling of the secondary path to Active Noise Control for Motorcycle Helmets Kishan P. Raghunathan and Sen M. Kuo Department of Electrical Engineering Northern Illinois University DeKalb, IL, USA Woon S. Gan School of Electrical and Electronic

More information

Published in: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control

Published in: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control Aalborg Universitet Variable Speech Distortion Weighted Multichannel Wiener Filter based on Soft Output Voice Activity Detection for Noise Reduction in Hearing Aids Ngo, Kim; Spriet, Ann; Moonen, Marc;

More information

VOL. 3, NO.11 Nov, 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

VOL. 3, NO.11 Nov, 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved. Effect of Fading Correlation on the Performance of Spatial Multiplexed MIMO systems with circular antennas M. A. Mangoud Department of Electrical and Electronics Engineering, University of Bahrain P. O.

More information

RECENTLY, there has been an increasing interest in noisy

RECENTLY, there has been an increasing interest in noisy IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In

More information

An HARQ scheme with antenna switching for V-BLAST system

An HARQ scheme with antenna switching for V-BLAST system An HARQ scheme with antenna switching for V-BLAST system Bonghoe Kim* and Donghee Shim* *Standardization & System Research Gr., Mobile Communication Technology Research LAB., LG Electronics Inc., 533,

More information

Time Delay Estimation: Applications and Algorithms

Time Delay Estimation: Applications and Algorithms Time Delay Estimation: Applications and Algorithms Hing Cheung So http://www.ee.cityu.edu.hk/~hcso Department of Electronic Engineering City University of Hong Kong H. C. So Page 1 Outline Introduction

More information

Microphone Array Design and Beamforming

Microphone Array Design and Beamforming Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial

More information

Acoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface

Acoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface MEE-2010-2012 Acoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface Master s Thesis S S V SUMANTH KOTTA BULLI KOTESWARARAO KOMMINENI This thesis is presented

More information

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco Research Journal of Applied Sciences, Engineering and Technology 8(9): 1132-1138, 2014 DOI:10.19026/raset.8.1077 ISSN: 2040-7459; e-issn: 2040-7467 2014 Maxwell Scientific Publication Corp. Submitted:

More information

ONE of the most common and robust beamforming algorithms

ONE of the most common and robust beamforming algorithms TECHNICAL NOTE 1 Beamforming algorithms - beamformers Jørgen Grythe, Norsonic AS, Oslo, Norway Abstract Beamforming is the name given to a wide variety of array processing algorithms that focus or steer

More information

FREQUENCY RESPONSE AND LATENCY OF MEMS MICROPHONES: THEORY AND PRACTICE

FREQUENCY RESPONSE AND LATENCY OF MEMS MICROPHONES: THEORY AND PRACTICE APPLICATION NOTE AN22 FREQUENCY RESPONSE AND LATENCY OF MEMS MICROPHONES: THEORY AND PRACTICE This application note covers engineering details behind the latency of MEMS microphones. Major components of

More information

SOUND SOURCE LOCATION METHOD

SOUND SOURCE LOCATION METHOD SOUND SOURCE LOCATION METHOD Michal Mandlik 1, Vladimír Brázda 2 Summary: This paper deals with received acoustic signals on microphone array. In this paper the localization system based on a speaker speech

More information

546 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 4, MAY /$ IEEE

546 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 4, MAY /$ IEEE 546 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL 17, NO 4, MAY 2009 Relative Transfer Function Identification Using Convolutive Transfer Function Approximation Ronen Talmon, Israel

More information

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation

More information

Sound Processing Technologies for Realistic Sensations in Teleworking

Sound Processing Technologies for Realistic Sensations in Teleworking Sound Processing Technologies for Realistic Sensations in Teleworking Takashi Yazu Makoto Morito In an office environment we usually acquire a large amount of information without any particular effort

More information

Impulsive Noise Reduction Method Based on Clipping and Adaptive Filters in AWGN Channel

Impulsive Noise Reduction Method Based on Clipping and Adaptive Filters in AWGN Channel Impulsive Noise Reduction Method Based on Clipping and Adaptive Filters in AWGN Channel Sumrin M. Kabir, Alina Mirza, and Shahzad A. Sheikh Abstract Impulsive noise is a man-made non-gaussian noise that

More information

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 5, MAY

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 5, MAY IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 5, MAY 2013 945 A Two-Stage Beamforming Approach for Noise Reduction Dereverberation Emanuël A. P. Habets, Senior Member, IEEE,

More information

IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS

IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS 1 International Conference on Cyberworlds IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS Di Liu, Andy W. H. Khong School of Electrical

More information

Implementation of decentralized active control of power transformer noise

Implementation of decentralized active control of power transformer noise Implementation of decentralized active control of power transformer noise P. Micheau, E. Leboucher, A. Berry G.A.U.S., Université de Sherbrooke, 25 boulevard de l Université,J1K 2R1, Québec, Canada Philippe.micheau@gme.usherb.ca

More information

Using GPS to Synthesize A Large Antenna Aperture When The Elements Are Mobile

Using GPS to Synthesize A Large Antenna Aperture When The Elements Are Mobile Using GPS to Synthesize A Large Antenna Aperture When The Elements Are Mobile Shau-Shiun Jan, Per Enge Department of Aeronautics and Astronautics Stanford University BIOGRAPHY Shau-Shiun Jan is a Ph.D.

More information

3rd International Conference on Machinery, Materials and Information Technology Applications (ICMMITA 2015)

3rd International Conference on Machinery, Materials and Information Technology Applications (ICMMITA 2015) 3rd International Conference on Machinery, Materials and Information echnology Applications (ICMMIA 015) he processing of background noise in secondary path identification of Power transformer ANC system

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR BeBeC-2016-S9 BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR Clemens Nau Daimler AG Béla-Barényi-Straße 1, 71063 Sindelfingen, Germany ABSTRACT Physically the conventional beamforming method

More information

A JOINT MODULATION IDENTIFICATION AND FREQUENCY OFFSET CORRECTION ALGORITHM FOR QAM SYSTEMS

A JOINT MODULATION IDENTIFICATION AND FREQUENCY OFFSET CORRECTION ALGORITHM FOR QAM SYSTEMS A JOINT MODULATION IDENTIFICATION AND FREQUENCY OFFSET CORRECTION ALGORITHM FOR QAM SYSTEMS Evren Terzi, Hasan B. Celebi, and Huseyin Arslan Department of Electrical Engineering, University of South Florida

More information

/$ IEEE

/$ IEEE IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 6, AUGUST 2009 1071 Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals

More information

SPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION. Changkyu Choi, Seungho Choi, and Sang-Ryong Kim

SPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION. Changkyu Choi, Seungho Choi, and Sang-Ryong Kim SPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION Changkyu Choi, Seungho Choi, and Sang-Ryong Kim Human & Computer Interaction Laboratory Samsung Advanced Institute of Technology

More information

Image De-Noising Using a Fast Non-Local Averaging Algorithm

Image De-Noising Using a Fast Non-Local Averaging Algorithm Image De-Noising Using a Fast Non-Local Averaging Algorithm RADU CIPRIAN BILCU 1, MARKKU VEHVILAINEN 2 1,2 Multimedia Technologies Laboratory, Nokia Research Center Visiokatu 1, FIN-33720, Tampere FINLAND

More information