ARTICLE IN PRESS. Signal Processing
Signal Processing 90 (2010)
Contents lists available at ScienceDirect
Signal Processing

Fast communication

Double-talk detection based on soft decision for acoustic echo suppression

Yun-Sik Park, Joon-Hyuk Chang
School of Electronic Engineering, Inha University, Incheon 402-751, Republic of Korea
Corresponding author. E-mail address: changjh@inha.ac.kr (J.-H. Chang).

Article info
Article history: Received 22 May 2009. Received in revised form 22 September 2009. Accepted 2 November 2009. Available online 2 November 2009.
Keywords: Double-talk detection; Speech presence probability; Voice activity detection

Abstract
In this paper, we propose a novel double-talk detection (DTD) technique based on a soft decision in the frequency domain. The proposed method detects the double-talk situation efficiently by using the global near-end speech presence probability (GNSPP) together with voice activity detection (VAD) of the near-end and far-end signals. Specifically, the GNSPP is derived from a statistical model of speech and is employed to determine the presence of double-talk in a given frame. The performance of our approach is evaluated by objective tests under different environments, and the suggested method is found to yield better results than the conventional scheme.
© 2009 Elsevier B.V. All rights reserved.

1. Introduction

In most hands-free mobile communication systems, acoustic echoes occur because the loudspeaker and microphone are acoustically coupled. To address this problem, numerous acoustic echo cancellation (AEC) techniques incorporating an adaptive filter, such as the least mean square (LMS) and normalized LMS (NLMS) filters, have been reported [1–3]. One of the major problems of AEC techniques, however, is that the performance degrades significantly during double-talk periods, in which signals from both the near-end and the far-end coexist, because the double-talk acts as very large interference to the adaptive filter.
The problem can be alleviated by freezing the adaptive filter coefficients through the use of a double-talk detection (DTD) algorithm [4]. In this regard, many studies have been dedicated to the problem of DTD. In practice, cross-correlation and coherence-based algorithms are relevant, as they present straightforward approaches. Adopting hard decisions [4–6], these schemes classify each frame into one of two cases (i.e., double-talk or not) by comparing decision statistics with given threshold values. However, they are sensitive to the tuned parameters and do not always provide reliable performance under various conditions.

In this paper, we propose a novel DTD algorithm based on a global soft decision [7], where the term "global" means that DTD is performed globally in a given frame and "soft decision" [8,9] denotes that the probability of double-talk is introduced as the decision and is applied to update the adaptive filter in the acoustic echo suppressor (AES) algorithm [10]. Specifically, the global near-end speech presence probability (GNSPP) based on a statistical model is computed in each frame, and the proposed DTD algorithm applies it in conjunction with the results of voice activity detection (VAD) of the near-end and far-end signals. It is worth noting that our approach provides, for the first time, an effective framework for DTD based on a soft decision by taking advantage of a statistical model, in contrast with the conventional hard decision-based method. The performance of the proposed algorithm is evaluated by echo return loss enhancement (ERLE) and speech attenuation (SA) tests during double-talk and is demonstrated to be better than that of the conventional method.

2. Global near-end speech presence probability

In this section, we consider how to derive the global near-end speech presence probability (GNSPP) in the frequency domain. To this end, we first assume that two hypotheses, $H_0$ and $H_1$, indicate near-end speech absence and presence as follows:

$$H_0:\ \text{near-end speech absent}:\quad \mathbf{Y}(i) = \mathbf{D}(i)$$
$$H_1:\ \text{near-end speech present}:\quad \mathbf{Y}(i) = \mathbf{D}(i) + \mathbf{S}(i) \tag{1}$$

where $\mathbf{D}(i) = [D(i,1), D(i,2), \ldots, D(i,M)]$, $\mathbf{S}(i) = [S(i,1), S(i,2), \ldots, S(i,M)]$ and $\mathbf{Y}(i) = [Y(i,1), Y(i,2), \ldots, Y(i,M)]$ represent the Fourier-domain spectra of the echo signal, the near-end speech and the microphone input signal, respectively, with frame index $i$. Also, $\mathbf{X}(i) = [X(i,1), X(i,2), \ldots, X(i,M)]$ denotes the Fourier spectrum of the far-end signal, as shown in Fig. 1. The background noise is not taken into account since we assume that near-end speech absence is not correlated with the background noise. Under the assumption that $D(i,k)$ and $S(i,k)$ are characterized by separate zero-mean complex Gaussian distributions, the following is obtained [7]:

$$p(Y(i,k) \mid H_0) = \frac{1}{\pi \lambda_d(i,k)} \exp\left\{ -\frac{|Y(i,k)|^2}{\lambda_d(i,k)} \right\} \tag{2}$$

$$p(Y(i,k) \mid H_1) = \frac{1}{\pi \left( \lambda_s(i,k) + \lambda_d(i,k) \right)} \exp\left\{ -\frac{|Y(i,k)|^2}{\lambda_s(i,k) + \lambda_d(i,k)} \right\} \tag{3}$$

where $\lambda_s(i,k)$ and $\lambda_d(i,k)$ are the variances of the near-end speech and the estimated echo, respectively. Accordingly, the GNSPP $p(H_1 \mid \mathbf{Y}(i))$ is derived from Bayes' rule, such that [7]

$$p(H_1 \mid \mathbf{Y}(i)) = \frac{p(\mathbf{Y}(i) \mid H_1)\, p(H_1)}{p(\mathbf{Y}(i) \mid H_0)\, p(H_0) + p(\mathbf{Y}(i) \mid H_1)\, p(H_1)} \tag{4}$$

where $p(H_0)\ (= 1 - p(H_1))$ represents the a priori probability of near-end speech absence.
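As a rough illustration of the hypotheses and Bayes' rule above, the following Python sketch evaluates the posterior of near-end speech presence for one frame. It is not the authors' implementation: the function name `gnspp`, the prior `p_h1 = 0.5`, and the use of a product over bins for the frame likelihood (i.e., treating bins as statistically independent) are assumptions made here for illustration. The computation is done in the log domain to avoid numerical underflow when many bins are multiplied.

```python
import numpy as np


def gnspp(Y, lambda_d, lambda_s, p_h1=0.5):
    """Posterior p(H1 | Y(i)) for one frame, following (2)-(4).

    Y        : complex microphone spectrum, shape (M,)
    lambda_d : echo variance per bin, shape (M,)
    lambda_s : near-end speech variance per bin, shape (M,)
    p_h1     : a priori probability of near-end speech presence
               (0.5 is an assumed, illustrative value)
    """
    power = np.abs(Y) ** 2

    # Per-bin log-likelihoods of the zero-mean complex Gaussian models.
    log_p0 = -np.log(np.pi * lambda_d) - power / lambda_d
    lam1 = lambda_s + lambda_d
    log_p1 = -np.log(np.pi * lam1) - power / lam1

    # Frame-level log joint terms; summing over bins assumes independence.
    ll0 = log_p0.sum() + np.log(1.0 - p_h1)
    ll1 = log_p1.sum() + np.log(p_h1)

    # Bayes' rule: p = 1 / (1 + exp(ll0 - ll1)), written with logaddexp
    # so that large log-odds do not overflow.
    return float(np.exp(-np.logaddexp(0.0, ll0 - ll1)))


# Usage with synthetic values (hypothetical numbers, not from the paper).
M = 64
lam_d = np.ones(M)
lam_s = 10.0 * np.ones(M)
echo_only = np.ones(M, dtype=complex)            # |Y|^2 matches lambda_d
with_speech = np.sqrt(11.0) * np.ones(M, dtype=complex)  # |Y|^2 matches lambda_s + lambda_d
print(gnspp(echo_only, lam_d, lam_s), gnspp(with_speech, lam_d, lam_s))
```

With these synthetic inputs the first frame is scored as almost certainly echo-only and the second as almost certainly containing near-end speech, which is the behavior the posterior in (4) is meant to capture.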
Since the spectral component in each frequency bin is assumed to be statistically independent, (4) can be rewritten as [7]

$$p(H_1 \mid \mathbf{Y}(i)) = \frac{p(H_1) \prod_{k=1}^{M} p(Y(i,k) \mid H_1)}{p(H_0) \prod_{k=1}^{M} p(Y(i,k) \mid H_0) + p(H_1) \prod_{k=1}^{M} p(Y(i,k) \mid H_1)} = \frac{q \prod_{k=1}^{M} \Lambda_k(Y(i,k))}{1 + q \prod_{k=1}^{M} \Lambda_k(Y(i,k))} \tag{5}$$

in which $q = p(H_1)/p(H_0)$ is determined by a rough estimate of the ratio of the absence and presence time durations of near-end speech, and $\Lambda_k(Y(i,k))$ is the likelihood ratio computed in the $k$th frequency bin, as given by [7]

$$\Lambda_k(Y(i,k)) = \frac{p(Y(i,k) \mid H_1)}{p(Y(i,k) \mid H_0)} = \frac{1}{1 + \xi(i,k)} \exp\left[ \frac{\gamma(i,k)\, \xi(i,k)}{1 + \xi(i,k)} \right] \tag{6}$$

where the a posteriori signal-to-echo ratio (SER) $\gamma(i,k)$ and the a priori SER $\xi(i,k)$ are defined by

$$\gamma(i,k) \equiv \frac{|Y(i,k)|^2}{\lambda_d(i,k)} \tag{7}$$

$$\xi(i,k) \equiv \frac{\lambda_s(i,k)}{\lambda_d(i,k)} \tag{8}$$

where $\lambda_d(i,k)$ is estimated by $\hat{\lambda}_d(i,k)$. The power spectrum of the echo signal is obtained in the absence of the near-end speech signal, as given by

$$\hat{\lambda}_d(i,k) = \zeta_D\, \hat{\lambda}_d(i-1,k) + (1 - \zeta_D)\, |\hat{Y}(i,k)|^2 \tag{9}$$

in which $\zeta_D\ (= 0.93)$ is the smoothing parameter. Also, in (8), $\xi(i,k)$ is estimated with the help of the well-known decision-directed approach with $\alpha_{DD} = 0.6$ [11]. Then,

$$\hat{\xi}(i,k) = \alpha_{DD} \frac{|\hat{S}(i-1,k)|^2}{\lambda_d(i-1,k)} + (1 - \alpha_{DD})\, P\{\gamma(i,k) - 1\} \tag{10}$$

where $P\{z\} = z$ if $z \ge 0$, and $P\{z\} = 0$ otherwise. As specified in (9), the robust estimation of the echo magnitude spectrum $\hat{Y}(i,k)$ plays an essential role in the performance. In our approach, we follow the parameter estimation procedure proposed in [10] as follows:

$$|\hat{Y}(i,k)| = \hat{H}(i,k)\, |X(i,k)| \tag{11}$$

where $\hat{H}(i,k)$ is the estimate of the echo path response, mimicking the actual echo path. Specifically, $\hat{H}_{opt}(i,k)$ is obtained based on the magnitude of the least squares estimator as follows [10]:

$$\hat{H}_{opt}(i,k) = \frac{E[X^{*}(i,k)\, Y(i,k)]}{E[X^{*}(i,k)\, X(i,k)]} \tag{12}$$

where $^{*}$ denotes the complex conjugate and $E[\cdot]$ represents the expected value.

Fig. 1. Block diagram of the proposed DTD algorithm. (Blocks: DFT and IDFT on the send and receive paths, microphone and loudspeaker, VAD decision for the near-end and far-end signals, SER estimation, GNSPP, DTD, echo path response estimation, and gain G.)

Note that there exists some delay between the far-end speech $X(i,k)$ and the microphone input signal $Y(i,k)$ (e.g., due to a digital amplifier). In our approach, it is assumed that the echo time delay is separately estimated and compensated (i.e., no delay) at the near-end. Since the echo path is time varying, the estimated echo path response $\hat{H}(i,k)$ is obtained using the iterative procedure such that [10]

$$\hat{H}(i,k) = \frac{C(i,k)}{R(i,k)} \tag{13}$$

where

$$C(i,k) = \zeta_C\, C(i-1,k) + (1 - \zeta_C)\, |X^{*}(i,k)\, Y(i,k)| \tag{14}$$

$$R(i,k) = \zeta_R\, R(i-1,k) + (1 - \zeta_R)\, |X^{*}(i,k)\, X(i,k)| \tag{15}$$

and $\zeta_C\ (= 0.998)$ and $\zeta_R\ (= 0.998)$ are smoothing parameters. Note that this recursive update tracks changes in the room.

3. Double-talk detection based AES

As noted earlier, the update of the echo path response must be frozen in the case of double-talk. For this, we propose a DTD technique that incorporates the newly derived GNSPP, $p(H_1 \mid \mathbf{Y}(i))$, with the help of the VAD results of the near-end and far-end signals, as shown in Fig. 1. We inherently consider near-end speech presence in the case of far-end signal presence, where the GNSPP substantially determines the double-talk situation and is used to update the echo path response based on (12). Note that the VAD has an impact on the near-end speech presence and the far-end speech presence only. Specifically, we derive a novel update routine for the echo path response by utilizing the soft decision as follows:

$$\hat{H}(i,k) = \begin{cases} p(H_1 \mid \mathbf{Y}(i))\, \hat{H}(i-1,k) + \left(1 - p(H_1 \mid \mathbf{Y}(i))\right) \hat{H}_{opt}(i,k), & \text{if } I(\mathbf{Y}(i)) = 1 \text{ and } I(\mathbf{X}(i)) = 1 \\ \hat{H}_{opt}(i,k), & \text{otherwise} \end{cases} \tag{16}$$

where $I(\cdot)$ denotes an indicator function of the VAD result provided by the IS-127 noise suppression algorithm, which is known to give robust performance under various noise environments [12]. Furthermore, we modified the VAD algorithm to reduce false decisions. For example, $I(\mathbf{Y}(i)) = 1$ if the near-end signal $\mathbf{Y}(i)$ exists at the $i$th frame and $I(\mathbf{Y}(i)) = 0$ otherwise.
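The recursive echo path estimate (13)-(15) and the soft-decision update (16) can be sketched in Python as follows. This is a minimal sketch under stated assumptions rather than the authors' code: the function names `track_echo_path` and `soft_update` are hypothetical, the recursion (13)-(15) is taken here as the practical realization of the optimal estimate in (12), the small constant `eps` is added only to avoid division by zero, and the smoothing constants are passed in as illustrative defaults.

```python
import numpy as np


def track_echo_path(X, Y, C_prev, R_prev, zeta_c=0.998, zeta_r=0.998, eps=1e-12):
    """One recursion of (13)-(15) for all frequency bins of a frame.

    X, Y           : complex far-end and microphone spectra, shape (M,)
    C_prev, R_prev : smoothed statistics from the previous frame, shape (M,)
    Returns (H_opt, C, R); H_opt is used here as the candidate echo path
    magnitude response (an assumption of this sketch).
    """
    C = zeta_c * C_prev + (1.0 - zeta_c) * np.abs(np.conj(X) * Y)
    R = zeta_r * R_prev + (1.0 - zeta_r) * np.abs(np.conj(X) * X)
    H_opt = C / np.maximum(R, eps)  # eps is a numerical guard, not in the source
    return H_opt, C, R


def soft_update(H_prev, H_opt, p_h1, vad_near, vad_far):
    """Soft-decision update (16).

    When both VAD flags are active, the previous response and the new
    estimate are mixed with weight p_h1 = p(H1 | Y(i)); otherwise the new
    estimate is used directly.
    """
    if vad_near and vad_far:
        return p_h1 * H_prev + (1.0 - p_h1) * H_opt
    return H_opt


# Usage with synthetic spectra (hypothetical numbers).
X = np.array([1 + 1j, 2.0, -3j])
Y = 0.5 * X                                # pure echo with gain 0.5
H_opt, C, R = track_echo_path(X, Y, np.zeros(3), np.zeros(3))
H_prev = np.full(3, 0.2)
print(soft_update(H_prev, H_opt, p_h1=0.9, vad_near=True, vad_far=True))
```

A posterior close to one keeps the response near its previous value (the filter is effectively frozen), while a posterior close to zero lets the new estimate through, so the mixing weight replaces a hard freeze/update switch.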
Therefore, the update of $\hat{H}(i,k)$ is finally arranged such that $\hat{H}(i,k)$ is set to $\hat{H}(i-1,k)$ (i.e., no update) within the double-talk regions in each frequency bin and to $\hat{H}_{opt}(i,k)$, as specified in (12), in the case of single-talk. In particular, in abrupt transient periods between double-talk and single-talk, as shown in Fig. 2, the GNSPP can take a soft value between 0 and 1. This accounts for why the soft decision scheme is less sensitive to detection errors than the conventional hard decision methods.

Fig. 2. DTD results for the acoustic echo signal under the vehicular noise condition (SNR = 20 dB). (Panels: far-end echo signal, near-end speech signal, microphone input signal, and $p(H_1 \mid \mathbf{Y}(i))$, with the far-end echo, double-talk and near-end speech regions marked; horizontal axis: time (sec).)

Finally, we apply the proposed DTD method to the AES algorithm proposed by Faller et al. [10] as follows:

$$\hat{S}(i,k) = G(i,k)\, Y(i,k) \tag{17}$$
where the Wiener filter gain $G(i,k)$ is given by [10]

$$G(i,k) = \left[ \frac{\max\left( |Y(i,k)| - |\hat{Y}(i,k)|,\ 0 \right)}{|Y(i,k)|} \right] \tag{18}$$

4. Experiments and results

In order to verify the performance of the proposed DTD algorithm, we conducted objective comparison experiments under various noise conditions. Twenty test phrases, spoken by seven speakers and sampled at 8 kHz, were used as the experimental data. To assess the performance of the proposed method, we artificially created 2 data files, where each file was produced by mixing the far-end signal with the near-end signal. Each frame of the windowed signal was transformed into its corresponding spectrum through a 128-point DFT after zero padding. We then constructed 16 frequency bands by combining subbands to cover the whole frequency range (0–4 kHz) of the narrow-band speech signal, which is analogous to the approach of the IS-127 noise suppression algorithm [12]. The far-end speech signal was passed through a filter simulating the acoustic echo path, modeled by a time-invariant FIR filter based on the analysis of room acoustics, before being mixed electrically [13,14]. The simulation environment was designed to fit a small office room with a size of 5 × 4 × 3 m³. The echo level measured at the input microphone was 3.5 dB lower than that of the input near-end speech on average. In order to create noisy conditions, white, babble and vehicular noises from the NOISEX-92 database were added to clean near-end speech signals at signal-to-noise ratios (SNRs) of 10, 20, and 30 dB.

For the purpose of an objective comparison, we evaluated the performance of the proposed scheme and that of the conventional DTD algorithm proposed by Park et al. [6], in which a double-talk detection method based on cross-correlation coefficients is used. The performance of the approaches was measured in terms of echo return loss enhancement (ERLE) and speech attenuation (SA) during double-talk, which are defined by [14]:

$$\mathrm{ERLE}(t) = 10 \log_{10} \frac{E[y^2(t)]}{E[e^2(t)]} \tag{19}$$

$$\mathrm{SA}(t) = \frac{1}{N} \sum^{N} 10 \log_{10} \frac{E[s^2(t)]}{E[\tilde{s}^2(t)]} \tag{20}$$

where $t$ is a sample index, $N$ is the number of samples during the double-talk periods and $\tilde{s}(t)$ denotes the near-end speech component in the output signal $e(t)$. Given the three types of noise environments, the ERLE and SA scores were averaged to give final mean scores, as presented in Tables 1 and 2. From Table 1, which gives the results in the single echo path case, it is evident that in most noisy conditions the proposed DTD algorithm based on a soft decision yielded a lower SA than the conventional hard decision-based technique while maintaining a similar ERLE during single-talk. Also, from Table 2, which shows the results in the case of echo path changes with different room sizes [13], we can observe that the SAs (measured during double-talk periods) of the proposed scheme based on a soft decision were better than those of the previous method [6] in all the tested conditions. It is noted that the performance gain of the proposed method becomes smaller as the SNR becomes lower. This is attributed to the imperfection of the GNSPP under adverse noise conditions. Summarizing the overall results, the proposed approach is found to be effective in the AES technique.

Table 1. ERLE during single-talk and SA during double-talk obtained from the proposed DTD algorithm based on a soft decision, compared with those yielded by the conventional hard decision method, with no changes of the echo path. (Columns: Environments, SNR (dB), ERLE (dB), SA (dB). Rows: White, Babble, Vehicle, Clean speech.)

Table 2. ERLE during single-talk and SA during double-talk obtained from the proposed DTD algorithm based on a soft decision, compared with those yielded by the conventional hard decision method, with changes of the echo path.

5. Conclusions
In this paper, we have proposed a novel DTD algorithm based on a soft decision scheme in the frequency domain. The GNSPP based on a statistical model of the near-end and far-end signals is applied to the DTD algorithm in conjunction with VAD decisions for effective echo suppression. The performance of the proposed algorithm has been found to be superior to that of the conventional technique through objective evaluation tests.

Acknowledgements

This research was supported by the MKE (The Ministry of Knowledge Economy), Korea, under the ITRC (Information Technology Research Center) support program supervised by the NIPA (National IT Industry Promotion Agency) (NIPA-29-C9-92-), and this work was supported by a National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MEST) (NRF ).

References

[1] P.S.R. Diniz, Adaptive Filtering: Algorithms and Practical Implementation, Kluwer, Norwell, MA, 1997.
[2] C. Avendano, Acoustic echo suppression in the STFT domain, in: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October 2001.
[3] H. Ye, B.X. Wu, A new double-talk detection algorithm based on the orthogonality theorem, IEEE Trans. Commun. 39 (November 1991).
[4] T. Gänsler, M. Hansson, C.J. Ivarsson, A double-talk detector based on coherence, IEEE Trans. Commun. 44 (November 1996).
[5] J. Benesty, D.R. Morgan, J.H. Cho, A new class of doubletalk detectors based on cross-correlation, IEEE Trans. Speech Audio Process. 8 (2) (March 2000).
[6] S.J. Park, C.G. Cho, C. Lee, D.H. Youn, Integrated echo and noise canceler for hands-free applications, IEEE Trans. Circuits Syst. II 49 (3) (March 2002).
[7] N.S. Kim, J.-H. Chang, Spectral enhancement based on global soft decision, IEEE Signal Process. Lett. 7 (5) (May 2000) 108–110.
[8] Y.-S. Park, J.-H. Chang, A novel approach to a robust a priori SNR estimator in speech enhancement, IEICE Trans. Commun. E90-B (8) (August 2007).
[9] Y.-S. Park, J.-H. Chang, A probabilistic combination method of minimum statistics and soft decision for robust noise power estimation in speech enhancement, IEEE Signal Process. Lett. 15 (January 2008).
[10] C. Faller, C. Tournery, Robust echo control using a simple echo path model, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 5, 2006, pp. V-281–V-284.
[11] Y. Ephraim, D. Malah, Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator, IEEE Trans. Acoust. Speech Signal Process. ASSP-32 (6) (December 1984) 1109–1121.
[12] TIA/EIA/IS-127, Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems, 1996.
[13] S. McGovern, A Model for Room Acoustics, 2003 [Online].
[14] S.Y. Lee, N.S. Kim, A statistical model based residual echo suppression, IEEE Signal Process. Lett. 14 (10) (October 2007).
OFDM Transmission Corrupted by Impulsive Noise Jiirgen Haring, Han Vinck University of Essen Institute for Experimental Mathematics Ellernstr. 29 45326 Essen, Germany,. e-mail: haering@exp-math.uni-essen.de
More informationIN REVERBERANT and noisy environments, multi-channel
684 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 6, NOVEMBER 2003 Analysis of Two-Channel Generalized Sidelobe Canceller (GSC) With Post-Filtering Israel Cohen, Senior Member, IEEE Abstract
More informationSpectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition
Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Author Shannon, Ben, Paliwal, Kuldip Published 25 Conference Title The 8th International Symposium
More informationEncoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking
The 7th International Conference on Signal Processing Applications & Technology, Boston MA, pp. 476-480, 7-10 October 1996. Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic
More informationSpeech Signal Enhancement Techniques
Speech Signal Enhancement Techniques Chouki Zegar 1, Abdelhakim Dahimene 2 1,2 Institute of Electrical and Electronic Engineering, University of Boumerdes, Algeria inelectr@yahoo.fr, dahimenehakim@yahoo.fr
More informationAnalysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model
Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model Harjeet Kaur Ph.D Research Scholar I.K.Gujral Punjab Technical University Jalandhar, Punjab, India Rajneesh Talwar Principal,Professor
More informationSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure
More informationTitle. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information
Title A Low-Distortion Noise Canceller with an SNR-Modifie Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir Proceedings : APSIPA ASC 9 : Asia-Pacific Signal Citationand Conference: -5 Issue
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech
More informationIMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS
1 International Conference on Cyberworlds IMPROVEMENT OF SPEECH SOURCE LOCALIZATION IN NOISY ENVIRONMENT USING OVERCOMPLETE RATIONAL-DILATION WAVELET TRANSFORMS Di Liu, Andy W. H. Khong School of Electrical
More informationAccurate Delay Measurement of Coded Speech Signals with Subsample Resolution
PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,
More informationLETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function
IEICE TRANS. INF. & SYST., VOL.E97 D, NO.9 SEPTEMBER 2014 2533 LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function Jinsoo PARK, Wooil KIM,
More informationCHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS
66 CHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS 4.1 INTRODUCTION New frontiers of speech technology are demanding increased levels of performance in many areas. In the advent of Wireless Communications
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationCancellation of Unwanted Audio to Support Interactive Computer Music
Jonghyun Lee, Roger B. Dannenberg, and Joohwan Chun. 24. Cancellation of Unwanted Audio to Support Interactive Computer Music. In The ICMC 24 Proceedings. San Francisco: The International Computer Music
More informationHigh dynamic range image compression with improved logarithmic transformation
High dynamic range image compression with improved logarithmic transformation Masahide Sumizawa a) and Xi Zhang b) Graduate School of Informatics and Engineering, The University of Electro- Communications,
More informationSpeech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech
Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Project Proposal Avner Halevy Department of Mathematics University of Maryland, College Park ahalevy at math.umd.edu
More informationBlind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model
Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial
More informationDeep Learning for Acoustic Echo Cancellation in Noisy and Double-Talk Scenarios
Interspeech 218 2-6 September 218, Hyderabad Deep Learning for Acoustic Echo Cancellation in Noisy and Double-Talk Scenarios Hao Zhang 1, DeLiang Wang 1,2,3 1 Department of Computer Science and Engineering,
More informationResidual noise Control for Coherence Based Dual Microphone Speech Enhancement
008 International Conference on Computer and Electrical Engineering Residual noise Control for Coherence Based Dual Microphone Speech Enhancement Behzad Zamani Mohsen Rahmani Ahmad Akbari Islamic Azad
More informationNoise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise Ratio in Nonstationary Noisy Environments
88 International Journal of Control, Automation, and Systems, vol. 6, no. 6, pp. 88-87, December 008 Noise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationworks must be obtained from the IEE
Title A filtered-x LMS algorithm for sinu Effects of frequency mismatch Author(s) Hinamoto, Y; Sakai, H Citation IEEE SIGNAL PROCESSING LETTERS (200 262 Issue Date 2007-04 URL http://hdl.hle.net/2433/50542
More informationProbability of Error Calculation of OFDM Systems With Frequency Offset
1884 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 49, NO. 11, NOVEMBER 2001 Probability of Error Calculation of OFDM Systems With Frequency Offset K. Sathananthan and C. Tellambura Abstract Orthogonal frequency-division
More informationAdaptive Noise Reduction Algorithm for Speech Enhancement
Adaptive Noise Reduction Algorithm for Speech Enhancement M. Kalamani, S. Valarmathy, M. Krishnamoorthi Abstract In this paper, Least Mean Square (LMS) adaptive noise reduction algorithm is proposed to
More informationSpeech and Audio Processing Recognition and Audio Effects Part 3: Beamforming
Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Engineering
More informationMULTICHANNEL systems are often used for
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 52, NO. 5, MAY 2004 1149 Multichannel Post-Filtering in Nonstationary Noise Environments Israel Cohen, Senior Member, IEEE Abstract In this paper, we present
More informationCalibration of Microphone Arrays for Improved Speech Recognition
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Calibration of Microphone Arrays for Improved Speech Recognition Michael L. Seltzer, Bhiksha Raj TR-2001-43 December 2001 Abstract We present
More informationMedian-Prefiltering-Based Robust Acquisition of Direct-Sequence Spread-Spectrum Signals in Wide-Band Pulse Jamming
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 51, NO. 1, JANUARY 2002 171 Median-Prefiltering-Based Robust Acquisition of Direct-Sequence Spread-Spectrum Signals in Wide-Band Pulse Jamming Yong-Hwan
More informationApplication of Affine Projection Algorithm in Adaptive Noise Cancellation
ISSN: 78-8 Vol. 3 Issue, January - Application of Affine Projection Algorithm in Adaptive Noise Cancellation Rajul Goyal Dr. Girish Parmar Pankaj Shukla EC Deptt.,DTE Jodhpur EC Deptt., RTU Kota EC Deptt.,
More informationRevision 1.1 May Front End DSP Audio Technologies for In-Car Applications ROADMAP 2016
Revision 1.1 May 2016 Front End DSP Audio Technologies for In-Car Applications ROADMAP 2016 PAGE 2 EXISTING PRODUCTS 1. Hands-free communication enhancement: Voice Communication Package (VCP-7) generation
More informationA Robust Acoustic Echo Canceller for Noisy Environment 1
A Robust Acoustic Echo Canceller for Noisy Environment 1 Shenghao Qin, Sha Meng, and Jia Liu Department of Electronic Engineering, Tsinghua University, Beijing 184 {qinsh99, mengs4}@mails.tsinghua.edu.cn,
More informationNonlinear Companding Transform Algorithm for Suppression of PAPR in OFDM Systems
Nonlinear Companding Transform Algorithm for Suppression of PAPR in OFDM Systems P. Guru Vamsikrishna Reddy 1, Dr. C. Subhas 2 1 Student, Department of ECE, Sree Vidyanikethan Engineering College, Andhra
More informationSpeech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 4 April 2015, Page No. 11143-11147 Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya
More informationECHO cancellers (ECs) have been used in networks for
4572 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 12, DECEMBER 2006 Echo Cancellation A Likelihood Ratio Test for Double-Talk Versus Channel Change Neil J. Bershad, Fellow, IEEE, and Jean-Yves
More informationDual-Microphone Speech Dereverberation in a Noisy Environment
Dual-Microphone Speech Dereverberation in a Noisy Environment Emanuël A. P. Habets Dept. of Electrical Engineering Technische Universiteit Eindhoven Eindhoven, The Netherlands Email: e.a.p.habets@tue.nl
More informationIEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1. Suppressing Acoustic Echo in a Spectral Envelope Space
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1 Suppressing Acoustic Echo in a Spectral Envelope Space Christof Faller and Jingdong Chen, Member, IEEE Abstract Full-duplex hands-free telecommunication
More informationPerformance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment
BABU et al: VOICE ACTIVITY DETECTION ALGORITHM FOR ROBUST SPEECH RECOGNITION SYSTEM Journal of Scientific & Industrial Research Vol. 69, July 2010, pp. 515-522 515 Performance analysis of voice activity
More informationModified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments
Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments G. Ramesh Babu 1 Department of E.C.E, Sri Sivani College of Engg., Chilakapalem,
More informationUnraveling Zero Crossing and Full Spectrum What does it all mean?
Unraveling Zero Crossing and Full Spectrum What does it all mean? Ian Agranat Wildlife Acoustics, Inc. 2 nd Symposium on Bat Echolocation Research, Tucson AZ March 29, 2017 Let s start with a sound wave
More informationCodebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B.
Codebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B. Published in: IEEE Transactions on Audio, Speech, and Language Processing DOI: 10.1109/TASL.2006.881696
More informationAdaptive Filters Wiener Filter
Adaptive Filters Wiener Filter Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory
More information