Signal Processing 91 (2011) Contents lists available at ScienceDirect. Signal Processing. journal homepage:
|
|
- Scott Robbins
- 5 years ago
- Views:
Transcription
1 Signal Processing 9 (2) 55 6 Contents lists available at ScienceDirect Signal Processing journal homepage: Fast communication Minima-controlled speech presence uncertainty tracking method for speech enhancement Woojung Lee, Ji-Hyun Song, Joon-Hyuk Chang School of Electronic Engineering, Inha University, Incheon 42-75, Republic of Korea article info Article history: Received 3 February 2 Received in revised form April 2 Accepted 8 June 2 Available online 25 June 2 Keywords: Soft decision Speech absence probability Minima-controlled recursive averaging abstract In speech enhancement, soft decision, in which the speech absence probability (SAP) is introduced to modify the spectral gain or update the noise power, is known to be efficient. In many previous works, a fixed a priori probability of speech absence (q) is assumed in estimating the SAP, which is not realistic since speech is quasi-stationary and may not be present in each frequency bin. To address this problem, Malah et al. devised a novel method to obtain distinct values of q for each frequency bin in many frames by comparing the a posteriori SNR to a threshold value [9]. In this regard, a novel algorithm is achieved by taking an advantage of a minima-controlled recursive averaging (MCRA) technique that allows for the robust tracking of speech absence in time. This leads to the improved tracking performance of speech absence in speech enhancement and better results in the objective and subjective evaluation tests. & 2 Elsevier B.V. All rights reserved.. Introduction In general, listening to speech becomes more difficult as the ambient noise level increases. To avoid this problem, speech enhancement techniques attempt to remove the effect of the additive noise [ 7]. Among them, a conventional strategy of applying soft decision has been considered effective because the probability of speech absence (or speech presence) is incorporated as a key parameter for modifying the spectral gain and updating the noise power [8]. From this viewpoint, in the literature, it can be seen that a fixed probability of q, which is the a priori probability of speech absence, is assumed for all frequency components in the analyzed input frames [8,9]. In[2], q was set to.5 to address the worst-case scenario in which speech and noise are equally likely to occur, while q was set to.2 based on the listening test in []. Several algorithms have been proposed for estimating and updating q [9,]. In Corresponding author. Tel.: ; fax: address: changjh@inha.ac.kr (J.-H. Chang). particular, Malah et al. proposed an algorithm to obtain distinct values of q for each frequency in each frame based on a simple hypothesis test by comparing the a posteriori SNR with a given threshold [9]. However, it can be seen that the a posteriori SNR is sensitive to outliers, especially for time-varying noise. On the other hand, Cohen proposed a novel technique for estimating noise by averaging past spectral power values with a smoothing parameter that is adjusted by the speech presence probability in subbands []. In particular, the presence of speech in subbands is determined by the ratio between the local energy of noisy speech and its minimum within a given time window. Note that Cohen s method is known to be insensitive to the type and intensity of ambient noise. Also, this method is computationally efficient and characterized by the capability to quickly adapt to sudden changes in the noise spectrum. In this paper, we develop a novel method to track the a priori probability of speech absence which is a dominant parameter in computing the speech absence probability from the observation. To do this, we devise a method to track the a priori probability of speech absence by comparing the local energy of the noisy speech and its /$ - see front matter & 2 Elsevier B.V. All rights reserved. doi:.6/j.sigpro.2.6.9
2 56 W. Lee et al. / Signal Processing 9 (2) 55 6 corresponding minimum value in each frequency bin. It is found that it enables a more robust estimate of q, which is analogous to the advantage of Cohen s method []. Based on this, we performed an objective and subjective quality test by incorporating the proposed approach into the speech enhancement, and produced better results. 2. Review of tracking speech presence uncertainty In this section, we first review the notion of the tracking speech uncertainty introduced in [9]. At first, let y(n) denote a noisy speech signal, which is the sum of a clean speech signal, x(n), and an uncorrelated additive noise signal, d(n); y(n)=x(n)+d(n). Applying a short-time Fourier transform (STFT), we then have in the time frequency domain Yðk,lÞ¼Xðk,lÞþDðk,lÞ, ðþ where k is the frequency bin and l is the frame index, respectively. Given two hypotheses, H (k,l) and H (k,l), which indicate speech absence and presence, respectively, it is assumed that H ðk,lþ : Yðk,lÞ¼Dðk,lÞ, H ðk,lþ : Yðk,lÞ¼Xðk,lÞþDðk,lÞ: Like a number of other speech enhancement algorithms [8], we also assume that X(k,l) and D(k,l) are characterized by separate zero-mean complex Gaussian distributions, and the following is obtained: jyðk,lþj2 pðyðk,lþjh Þ¼ exp, pl d ðk,lþ l d ðk,lþ pðyðk,lþjh Þ¼ p½l d ðk,lþþl x ðk,lþš exp jyðk,lþj 2, l d ðk,lþþl x ðk,lþ ð3þ in which l x ðk,lþ and l d ðk,lþ are variances of the clean speech and noise in the kth frequency bin and lth frame index, respectively. Conditioned on the current observation, Y(k,l), the speech absence probability (SAP), pðh jyðk,lþþ, is given by [8] pðh jyðk,lþþ ¼ pðyðk,lþjh ÞpðH Þ pðyðk,lþþ pðyðk,lþjh ÞpðH Þ ¼ pðyðk,lþjh ÞpðH ÞþpðYðk,lÞjH ÞpðH Þ ¼ þqlðyðk,lþþ, ð4þ in which LðYðk,lÞÞ is the likelihood ratio computed in the kth subband and lth frame index as follows: LðYðk,lÞÞ ¼ pðyðk,lþjh Þ pðyðk,lþjh Þ gðk,lþxðk,lþ ¼ exp, ð5þ þxðk,lþ þxðk,lþ where gðk,lþ and xðk,lþ are the a posteriori SNR and the a priori SNR [8], respectively, as follows: gðk,lþ jyðk,lþj2 l d ðk,lþ, ð2þ ð6þ xðk,lþ l xðk,lþ l d ðk,lþ, ð7þ and q (=p(h )/p(h )) is the ratio of the a priori probability for speech presence and speech absence []. Indeed, q is a rough estimate of the ratio of silence time intervals between speech activities and the time duration of speech. This ratio q is assumed to be fixed in many previous works [,5,8]. However, Malah et al. proposed the method to allow different q s in different frequency bins for each frame since this number varies in time due to the non-stationarity of speech. Specifically, in the method of Malah et al., (4) becomes pðh jyðk,lþþ ¼ þqðk,lþlðyðk,lþþ, ð8þ where qðk,lþ¼a q qðk,l Þþð a q ÞIðk,lÞ, ð9þ and a q ðoa q oþ is a smoothing parameter. In particular, I(k,l) is an index function denoting the following hypothesis test by incorporating the a posteriori SNR such that gðk,lþ _ H g TH, ðþ H where g TH is a given threshold (i.e., I(k,l)= if H is accepted, and I(k,l)= if H is accepted). Note that, in the method of Malah et al. [9], the availability of a separate estimate of q in each bin for each frame adaptively controls the update of the noise power in the case of speech presence. 3. Proposed minima-controlled speech presence uncertainty tracking method In the previous section, the estimation of pðh jyðk,lþþ given by (4) is controlled by distinct values of q s obtained by the a posteriori SNR-based hypothesis test, as in the previous approach [9]. However, we note that the a posteriori SNR cannot be relevant due to its high variation over successive short-time frames [2]. For this reason, we consider a monotonic hypothesis test denoting the ratio between the local energy of the noisy speech and its derived minimum, as in the MCRA method proposed by Cohen []. This method is clearly insensitive to the type and strength of noise, which are very desirable characteristics []. To illustrate these characteristics, we first introduce the smoothed local energy of the noisy speech by a first order recursive averaging Sðk,lÞ¼a s Sðk,l Þþð a s ÞS f ðk,lþ, ðþ where S f (k,l) is a local energy of a current frame and a s ðoa s oþ is a smoothing parameter. The minimum of the local energy S min (k,l) is searched for in a samplewise comparison manner such that S min ðk,lþ¼minfs min ðk,l Þ,Sðk,lÞg, S tmp ðk,lþ¼minfs tmp ðk,l Þ,Sðk,lÞg, ð2þ where the minimum value for the current frame is yielded by a comparison of the local energy of the noisy speech
3 W. Lee et al. / Signal Processing 9 (2) and the minimum value of the previous frame. Whenever L frames have been read, i.e., l is divisible by L, the temporary value should be employed and initialized by S min ðk,lþ¼minfs tmp ðk,l Þ,Sðk,lÞg, S tmp ðk,lþ¼sðk,lþ, ð3þ and (2) continues to search for the minimum values. The implementation of the minima tracking is summarized as follows: Initialize variables at the first frame (l=) for all frequency bin S(k,)=S f (k,) S min (k,)=s f (k,) For all time frames l ðl^þ For all frequency bins k compute S min =min {S min (k,l-, S(k,l)} using () and (2). save S tmpðk,lþ¼s min fs tmpðk,l Þ,Sðk,lÞg using (2) When l % L== compute S min ðk,lþ¼minfs tmpðk,lþ,sðk,lþg using (3) update S tmpðk,lþ¼sðk,lþ using (3) Using the obtained S min (k,l), we now consider the S r ðk,lþ9sðk,lþ=s min ðk,lþ which denotes the ratio between the local energy of the noisy speech and its derived minimum []. From this, we can derive the following: S r ðk,lþ _ H d, ð4þ H where d is a simple threshold. As an example, Fig. compares two statistics (a posteriori SNR vs. S r (k,l)) when the speech enhancement algorithm operates on noisy speech corrupted by the car noise. From the figure, it can be seen that the a posteriori SNR tends to fluctuate highly during noise intervals. In contrast, S r (k,l) does not exhibit large variation over successive frames during the noiseonly periods while S r (k,l) adapts the speech energy adequately during the speech. Using the decision rule of (4) in the MCRA scheme, we propose ^q, which has a different value of q as in the conventional tracking speech presence uncertainty scheme, such that ^qðk,lþ is given by ^qðk,lþ¼a p ^qðk,l Þþð a p ÞIðk,lÞ, ð5þ in which a p ðoa p oþ is a smoothing parameter and I(k,l) is an indicator function for the result of the decision rule of (4), i.e., I(k,l)= if S r ðk,lþ4d and I(k,l)= if S r ðk,lþod. Then, (8) implies pðh jyðk,lþþ ¼ þ ^qðk,lþlðyðk,lþþ : ð6þ It is not difficult to see from Fig. 2 that the SAP by the proposed method seems more accurate than the conventional method (a posteriori SNR-based). 4. Experiments and results The proposed minima-controlled speech presence uncertainty tracking method was adopted for softdecision-based speech enhancement, as in [8], and was evaluated with extensive objective and subjective tests. For these tests, phrases, spoken by four male and four Fig.. Comparison of two statistics (k=2, around 3 Hz) under street noise (SNR = 5 db). (a) Clean speech waveform, (b) noisy speech waveform, (c) gðk,lþ (dashed line) vs. S r (k,l) (solid line).
4 58 W. Lee et al. / Signal Processing 9 (2) Speech Presence Probability Fig. 2. Comparison of probability (k=2, around 3 Hz) under car noise (SNR = 5 db). (a) Clean speech waveform, (b) noisy speech waveform, (c) speech presence probability in short-time frames: probability using the a posteriori (dashed line), probability of the proposed algorithm (bold line). female speakers, were employed as the experimental data. Each phrase consists of two different meaningful sentences, and its duration was 8 s. For a real-time processing, the proposed method was conducted for each frame of ms with a sampling frequency of 8 khz. Four types of noise sources, such as white noise, car noise, street noise, and office noise, were digitally added to the clean speech waveform at SNRs of 5,, and 5 db. In all cases, speech enhancement was conducted with the experimentally optimized parameter values: a q ¼ :95, g TH ¼ :8, a p ¼ :2, d ¼ 5. At first, we carried out the perceptual evaluation of speech quality (PESQ) based on the ITU-T P.862 tests [3]. From Table, which shows the results of the PESQ, we can see that the proposed minima-controlled speech presence uncertainty tracking method outperformed three conventional methods proposed by McAulay [], Ephraim [2], Malah [9], and ideal q-based method under the given noise conditions. Specifically, the ideal q-based method has fixed values of q which are determined from the ratio of speech and noise in the each speech segment. Note that the performance gain becomes larger, especially for the non-stationary noise such as car and street noise. We also carried out a set of informal tests under the same noise conditions to evaluate the subjective quality of the proposed method. Subjective opinions were given by a group of 2 listeners; each listener gave a score for each test sentence: 5 (Excellent), 4 (Good), 3 (Fair), 2 (Poor), and (Bad). All listener scores were then averaged to Table PESQ scores of the conventional methods and the proposed method. Noise Method SNR (db) 5 5 White McAulay (q=.5) Ephraim (q=.2) Ideal Malah Proposed Street McAulay (q=.5) Ephraim (q=.2) Ideal Malah Proposed Car McAulay (q=.5) Ephraim (q=.2) Ideal Malah Proposed Office McAulay (q=.5) Ephraim (q=.2) Ideal Malah Proposed yield a mean opinion score (MOS). The MOS test results, with a 95% confidence interval, are summarized in Table 2, in which a higher value indicates preference. It is noted that performance was found to improve for most of the
5 W. Lee et al. / Signal Processing 9 (2) Table 2 MOS of the conventional methods and the proposed method (with 95% confidence interval). Noise Method SNR (db) 5 5 White McAulay Ephraim Ideal Malah Proposed Car McAulay Ephraim Ideal Malah Proposed Street McAulay Ephraim Ideal Malah Proposed Office McAulay Ephraim Ideal Malah Proposed Table 3 CCR test of the conventional method (Malah-based) and the proposed method (with 95% confidence interval). Noise SNR (db) Overall Speech Noise White Car Street Office noises at all SNRs. Indeed, it is observed that the performance differences in the MOS are more significant than the case of the PESQ in many cases. This phenomenon can be attributed to the fact that all parameters have been optimized for subjective quality enhancement. These results confirm that the proposed algorithm is consistently better than the conventional methods. We also conducted additional subjective tests via the ITU-T comparison category rating (CCR) to assess performance difference [4]. Ten listeners with normal hearing (six male and four female) participated in the experiment. The CCR test sheds light on perception quality of the signal of method A (proposed) over method B (Malah). The grades of the seven points scale range are as follows: 3 (much better), 2 (better), (slightly), (about), (slightly worse), 2 (worse), 3 (much worse). The results of CCR test between the proposed method and the conventional method based on Malah [9] are organized in Table 3. From the table, we confirm that the proposed method is found to improve the quality of speech, background noise, and overall speech. Finally, the speech spectrograms obtained with the conventional and proposed approach are presented in Fig. 3. From the figure, we can see that the proposed method effectively suppresses the background noise compared to the conventional method. 5. Conclusions In this paper, we have proposed a novel method to incorporate the minima-controlled technique into the
6 6 W. Lee et al. / Signal Processing 9 (2) time (s) Fig. 3. Speech spectrograms (car noise, SNR = 5 db). (a) Spectrogram of the clean speech (Original), (b) spectrogram of the noisy speech (Noisy Speech), (c) spectrogram of the output signal obtained by Malah [9] (Malah), (d) spectrogram of the output signal obtained by the proposed method (Proposed). tracking speech presence uncertainty for speech enhancement. The ratio between a local energy and its minimum, which is introduced from the MCRA, controls q s for different bins since it provides us with a robust tracking performance of speech presence. Compared to the conventional tracking speech presence uncertainty, the performance of the proposed technique under various noise environments was superior in both subjective and objective tests. Acknowledgements This research was supported by the MKE, Korea, under the ITRC support program supervised by the NIPA (NIPA- 2-C9-2-7). And this work was supported by the IT R&D program of MKE/KEIT. [29-S-36-, Development of New Virtual Machine Specification and Technology]. References [] Y. Ephraim, D. Malah, Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator, IEEE Transactions on Acoustics, Speech and Signal Processing ASSP-32 (6) (984) 9 2. [2] R.J. McAulay, M.L. Malpass, Speech enhancement using a softdecision noise suppression filter, IEEE Transactions on Acoustics, Speech and Signal Processing ASSP-28 (2) (98) [3] J.-H. Chang, Q.-H. Jo, D.K. Kim, N.S. Kim, Global soft decision employing support vector machine for speech enhancement, IEEE Signal Processing Letters 6 () (29) [4] R. Martin, Spectral subtraction based on minimum statistics, in: Proceedings of the EUSIPCO, Edinburgh, UK, September 994, pp [5] I. Cohen, B. Berdugo, Speech enhancement for non-stationary noise environments, Signal Processing 8 () (2) [6] G. Doblinger, Computationally efficient speech enhancement by spectral minima tracking in subbands, in: Proceedings of the Eurospeech, Madrid, Spain, September 995, pp [7] J. Meyer, K.U. Simmer, K.D. Kammeyer, Comparison of one- and two-channel noise-estimation techniques, in: Proceedings of the IWAENC, London, UK, September 997, pp [8] N.S. Kim, J.-H. Chang, Spectral enhancement based on global soft decision, IEEE Signal Processing Letters 7 (5) (2) 8. [9] D. Malah, R. Cox, A.J. Accardi, Tracking speech-presence uncertainty to improve speech enhancement in nonstationary noise environments. in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Pheonix, AZ, March 999, pp [] I. Soon, S. Koh, C. Yeo, Improved noise suppression filter using selfadaptive estimator of probability of speech absence, Signal Processing 75 (2) (999) 5 59.
7 W. Lee et al. / Signal Processing 9 (2) [] I. Cohen, B. Berdugo, Noise estimation by minima controlled recursive averaging for robust speech enhancement, IEEE Signal Processing Letters 9 () (22) 2 5. [2] O. Cappé, Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor, IEEE Transactions on Speech Audio Processing 2 (April) (994) [3] ITU-T P.862, Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs, February 2. [4] ITU-T P.8, Methods for subjective determination of transmission quality, August 996.
ARTICLE IN PRESS. Signal Processing
Signal Processing 9 (2) 737 74 Contents lists available at ScienceDirect Signal Processing journal homepage: www.elsevier.com/locate/sigpro Fast communication Double-talk detection based on soft decision
More informationSpeech Enhancement for Nonstationary Noise Environments
Signal & Image Processing : An International Journal (SIPIJ) Vol., No.4, December Speech Enhancement for Nonstationary Noise Environments Sandhya Hawaldar and Manasi Dixit Department of Electronics, KIT
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationNoise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise Ratio in Nonstationary Noisy Environments
88 International Journal of Control, Automation, and Systems, vol. 6, no. 6, pp. 88-87, December 008 Noise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise
More informationEffective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a
R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,
More informationStudents: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa
Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationRECENTLY, there has been an increasing interest in noisy
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In
More informationNoise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging
466 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 5, SEPTEMBER 2003 Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging Israel Cohen Abstract
More informationSpeech Signal Enhancement Techniques
Speech Signal Enhancement Techniques Chouki Zegar 1, Abdelhakim Dahimene 2 1,2 Institute of Electrical and Electronic Engineering, University of Boumerdes, Algeria inelectr@yahoo.fr, dahimenehakim@yahoo.fr
More informationDifferent Approaches of Spectral Subtraction Method for Speech Enhancement
ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches
More informationMMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2
MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 1 Electronics and Communication Department, Parul institute of engineering and technology, Vadodara,
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationOn using acoustic environment classification for statistical model-based speech enhancement
Available online at www.sciencedirect.com Speech Communication 54 (22) 477 49 www.elsevier.com/locate/specom On using acoustic environment classification for statistical model-based speech enhancement
More informationANUMBER of estimators of the signal magnitude spectrum
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 5, JULY 2011 1123 Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty Yang Lu and Philipos
More informationEstimation of Non-stationary Noise Power Spectrum using DWT
Estimation of Non-stationary Noise Power Spectrum using DWT Haripriya.R.P. Department of Electronics & Communication Engineering Mar Baselios College of Engineering & Technology, Kerala, India Lani Rachel
More informationInternational Journal of Advanced Research in Computer Science and Software Engineering
Volume 2, Issue 11, November 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Review of
More informationScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech
More informationIntegrated acoustic echo and background noise suppression technique based on soft decision
Park and Chang EURASIP Journal on Advances in Signal Processing, : http://asp.eurasipjournals.com/content/// RESEARCH Open Access Integrated acoustic echo and background noise suppression technique based
More informationSTATISTICAL METHODS FOR THE ENHANCEMENT OF NOISY SPEECH. Rainer Martin
STATISTICAL METHODS FOR THE ENHANCEMENT OF NOISY SPEECH Rainer Martin Institute of Communication Technology Technical University of Braunschweig, 38106 Braunschweig, Germany Phone: +49 531 391 2485, Fax:
More informationIN REVERBERANT and noisy environments, multi-channel
684 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 6, NOVEMBER 2003 Analysis of Two-Channel Generalized Sidelobe Canceller (GSC) With Post-Filtering Israel Cohen, Senior Member, IEEE Abstract
More informationCHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS
46 CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS 3.1 INTRODUCTION Personal communication of today is impaired by nearly ubiquitous noise. Speech communication becomes difficult under these conditions; speech
More informationFrequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement
Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation
More informationPhase estimation in speech enhancement unimportant, important, or impossible?
IEEE 7-th Convention of Electrical and Electronics Engineers in Israel Phase estimation in speech enhancement unimportant, important, or impossible? Timo Gerkmann, Martin Krawczyk, and Robert Rehr Speech
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationAccurate Delay Measurement of Coded Speech Signals with Subsample Resolution
PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,
More informationNoise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics
504 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 5, JULY 2001 Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics Rainer Martin, Senior Member, IEEE
More informationSingle channel noise reduction
Single channel noise reduction Basics and processing used for ETSI STF 94 ETSI Workshop on Speech and Noise in Wideband Communication Claude Marro France Telecom ETSI 007. All rights reserved Outline Scope
More informationREAL-TIME BROADBAND NOISE REDUCTION
REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time
More informationWavelet Speech Enhancement based on the Teager Energy Operator
Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose
More informationSPEECH ENHANCEMENT BASED ON A LOG-SPECTRAL AMPLITUDE ESTIMATOR AND A POSTFILTER DERIVED FROM CLEAN SPEECH CODEBOOK
18th European Signal Processing Conference (EUSIPCO-2010) Aalborg, Denmar, August 23-27, 2010 SPEECH ENHANCEMENT BASED ON A LOG-SPECTRAL AMPLITUDE ESTIMATOR AND A POSTFILTER DERIVED FROM CLEAN SPEECH CODEBOOK
More informationFundamental frequency estimation of speech signals using MUSIC algorithm
Acoust. Sci. & Tech. 22, 4 (2) TECHNICAL REPORT Fundamental frequency estimation of speech signals using MUSIC algorithm Takahiro Murakami and Yoshihisa Ishida School of Science and Technology, Meiji University,,
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationSPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION. Changkyu Choi, Seungho Choi, and Sang-Ryong Kim
SPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION Changkyu Choi, Seungho Choi, and Sang-Ryong Kim Human & Computer Interaction Laboratory Samsung Advanced Institute of Technology
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationModified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments
Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments G. Ramesh Babu 1 Department of E.C.E, Sri Sivani College of Engg., Chilakapalem,
More informationSpeech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech
Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Project Proposal Avner Halevy Department of Mathematics University of Maryland, College Park ahalevy at math.umd.edu
More informationINTERNATIONAL TELECOMMUNICATION UNION
INTERNATIONAL TELECOMMUNICATION UNION ITU-T P.835 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (11/2003) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods
More informationCodebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B.
Codebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B. Published in: IEEE Transactions on Audio, Speech, and Language Processing DOI: 10.1109/TASL.2006.881696
More informationTransient noise reduction in speech signal with a modified long-term predictor
RESEARCH Open Access Transient noise reduction in speech signal a modified long-term predictor Min-Seok Choi * and Hong-Goo Kang Abstract This article proposes an efficient median filter based algorithm
More informationAS DIGITAL speech communication devices, such as
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 4, MAY 2012 1383 Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay Timo Gerkmann, Member, IEEE,
More informationAudio Restoration Based on DSP Tools
Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract
More informationNoise Reduction: An Instructional Example
Noise Reduction: An Instructional Example VOCAL Technologies LTD July 1st, 2012 Abstract A discussion on general structure of noise reduction algorithms along with an illustrative example are contained
More informationSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationChapter 3. Speech Enhancement and Detection Techniques: Transform Domain
Speech Enhancement and Detection Techniques: Transform Domain 43 This chapter describes techniques for additive noise removal which are transform domain methods and based mostly on short time Fourier transform
More informationSpeech Enhancement Based on Non-stationary Noise-driven Geometric Spectral Subtraction and Phase Spectrum Compensation
Speech Enhancement Based on Non-stationary Noise-driven Geometric Spectral Subtraction and Phase Spectrum Compensation Md Tauhidul Islam a, Udoy Saha b, K.T. Shahid b, Ahmed Bin Hussain b, Celia Shahnaz
More informationNoise Tracking Algorithm for Speech Enhancement
Appl. Math. Inf. Sci. 9, No. 2, 691-698 (2015) 691 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.12785/amis/090217 Noise Tracking Algorithm for Speech Enhancement
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationPerceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter
Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School
More informationHUMAN speech is frequently encountered in several
1948 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 7, SEPTEMBER 2012 Enhancement of Single-Channel Periodic Signals in the Time-Domain Jesper Rindom Jensen, Student Member,
More informationAdvances in Applied and Pure Mathematics
Enhancement of speech signal based on application of the Maximum a Posterior Estimator of Magnitude-Squared Spectrum in Stationary Bionic Wavelet Domain MOURAD TALBI, ANIS BEN AICHA 1 mouradtalbi196@yahoo.fr,
More informationMODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS
MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS 1 S.PRASANNA VENKATESH, 2 NITIN NARAYAN, 3 K.SAILESH BHARATHWAAJ, 4 M.P.ACTLIN JEEVA, 5 P.VIJAYALAKSHMI 1,2,3,4,5 SSN College of Engineering,
More informationPerformance Evaluation of Noise Estimation Techniques for Blind Source Separation in Non Stationary Noise Environment
www.ijcsi.org 242 Performance Evaluation of Noise Estimation Techniques for Blind Source Separation in Non Stationary Noise Environment Ms. Mohini Avatade 1, Prof. Mr. S.L. Sahare 2 1,2 Electronics & Telecommunication
More informationPROSE: Perceptual Risk Optimization for Speech Enhancement
PROSE: Perceptual Ris Optimization for Speech Enhancement Jishnu Sadasivan and Chandra Sehar Seelamantula Department of Electrical Communication Engineering, Department of Electrical Engineering Indian
More informationTRANSIENT NOISE REDUCTION BASED ON SPEECH RECONSTRUCTION
TRANSIENT NOISE REDUCTION BASED ON SPEECH RECONSTRUCTION Jian Li 1,2, Shiwei Wang 1,2, Renhua Peng 1,2, Chengshi Zheng 1,2, Xiaodong Li 1,2 1. Communication Acoustics Laboratory, Institute of Acoustics,
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationAvailable online at ScienceDirect. Procedia Computer Science 89 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 666 676 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Comparison of Speech
More informationCOM 12 C 288 E October 2011 English only Original: English
Question(s): 9/12 Source: Title: INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2009-2012 Audience STUDY GROUP 12 CONTRIBUTION 288 P.ONRA Contribution Additional
More informationMeasuring the complexity of sound
PRAMANA c Indian Academy of Sciences Vol. 77, No. 5 journal of November 2011 physics pp. 811 816 Measuring the complexity of sound NANDINI CHATTERJEE SINGH National Brain Research Centre, NH-8, Nainwal
More informationMULTICHANNEL systems are often used for
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 52, NO. 5, MAY 2004 1149 Multichannel Post-Filtering in Nonstationary Noise Environments Israel Cohen, Senior Member, IEEE Abstract In this paper, we present
More informationNonuniform multi level crossing for signal reconstruction
6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven
More informationAutomotive three-microphone voice activity detector and noise-canceller
Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR
More informationIMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM
IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur
More informationQUANTIZATION NOISE ESTIMATION FOR LOG-PCM. Mohamed Konaté and Peter Kabal
QUANTIZATION NOISE ESTIMATION FOR OG-PCM Mohamed Konaté and Peter Kabal McGill University Department of Electrical and Computer Engineering Montreal, Quebec, Canada, H3A 2A7 e-mail: mohamed.konate2@mail.mcgill.ca,
More informationRASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991
RASTA-PLP SPEECH ANALYSIS Hynek Hermansky Nelson Morgan y Aruna Bayya Phil Kohn y TR-91-069 December 1991 Abstract Most speech parameter estimation techniques are easily inuenced by the frequency response
More informationResearch Article Subband DCT and EMD Based Hybrid Soft Thresholding for Speech Enhancement
Advances in Acoustics and Vibration, Article ID 755, 11 pages http://dx.doi.org/1.1155/1/755 Research Article Subband DCT and EMD Based Hybrid Soft Thresholding for Speech Enhancement Erhan Deger, 1 Md.
More informationEMD BASED FILTERING (EMDF) OF LOW FREQUENCY NOISE FOR SPEECH ENHANCEMENT
T-ASL-03274-2011 1 EMD BASED FILTERING (EMDF) OF LOW FREQUENCY NOISE FOR SPEECH ENHANCEMENT Navin Chatlani and John J. Soraghan Abstract An Empirical Mode Decomposition based filtering (EMDF) approach
More informationAnalysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model
Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model Harjeet Kaur Ph.D Research Scholar I.K.Gujral Punjab Technical University Jalandhar, Punjab, India Rajneesh Talwar Principal,Professor
More informationOnline Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation
1 Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation Zhangli Chen* and Volker Hohmann Abstract This paper describes an online algorithm for enhancing monaural
More informationModulation Domain Spectral Subtraction for Speech Enhancement
Modulation Domain Spectral Subtraction for Speech Enhancement Author Paliwal, Kuldip, Schwerin, Belinda, Wojcicki, Kamil Published 9 Conference Title Proceedings of Interspeech 9 Copyright Statement 9
More informationIN RECENT YEARS, there has been a great deal of interest
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL 12, NO 1, JANUARY 2004 9 Signal Modification for Robust Speech Coding Nam Soo Kim, Member, IEEE, and Joon-Hyuk Chang, Member, IEEE Abstract Usually,
More informationAvailable online at ScienceDirect. Procedia Computer Science 54 (2015 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 54 (2015 ) 574 584 Eleventh International Multi-Conference on Information Processing-2015 (IMCIP-2015) Speech Enhancement
More informationA GENERALIZED LOG-SPECTRAL AMPLITUDE ESTIMATOR FOR SINGLE-CHANNEL SPEECH ENHANCEMENT. Aleksej Chinaev, Reinhold Haeb-Umbach
A GENERALIZED LOG-SPECTRAL AMPLITUDE ESTIMATOR FOR SINGLE-CHANNEL SPEECH ENHANCEMENT Aleksej Chinaev, Reinhold Haeb-Umbach Department of Communications Engineering, Paderborn University, 98 Paderborn,
More informationSpeech Enhancement Using a Mixture-Maximum Model
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 6, SEPTEMBER 2002 341 Speech Enhancement Using a Mixture-Maximum Model David Burshtein, Senior Member, IEEE, and Sharon Gannot, Member, IEEE
More informationNOISE POWER SPECTRAL DENSITY MATRIX ESTIMATION BASED ON MODIFIED IMCRA. Qipeng Gong, Benoit Champagne and Peter Kabal
NOISE POWER SPECTRAL DENSITY MATRIX ESTIMATION BASED ON MODIFIED IMCRA Qipeng Gong, Benoit Champagne and Peter Kabal Department of Electrical & Computer Engineering, McGill University 3480 University St.,
More informationEvaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation
Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation Takahiro FUKUMORI ; Makoto HAYAKAWA ; Masato NAKAYAMA 2 ; Takanobu NISHIURA 2 ; Yoichi YAMASHITA 2 Graduate
More informationEnhanced Waveform Interpolative Coding at 4 kbps
Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression
More informationReliable A posteriori Signal-to-Noise Ratio features selection
Reliable A eriori Signal-to-Noise Ratio features selection Cyril Plapous, Claude Marro, Pascal Scalart To cite this version: Cyril Plapous, Claude Marro, Pascal Scalart. Reliable A eriori Signal-to-Noise
More informationReal time noise-speech discrimination in time domain for speech recognition application
University of Malaya From the SelectedWorks of Mokhtar Norrima January 4, 2011 Real time noise-speech discrimination in time domain for speech recognition application Norrima Mokhtar, University of Malaya
More informationA SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS. Pavlos Papadopoulos, Andreas Tsiartas, James Gibson, and Shrikanth Narayanan
IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) A SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS Pavlos Papadopoulos, Andreas Tsiartas, James Gibson, and
More informationJoint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W.
Joint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W. Published in: IEEE Transactions on Audio, Speech, and Language
More informationROBUST PITCH TRACKING USING LINEAR REGRESSION OF THE PHASE
- @ Ramon E Prieto et al Robust Pitch Tracking ROUST PITCH TRACKIN USIN LINEAR RERESSION OF THE PHASE Ramon E Prieto, Sora Kim 2 Electrical Engineering Department, Stanford University, rprieto@stanfordedu
More informationA HYBRID APPROACH TO COMBINING CONVENTIONAL AND DEEP LEARNING TECHNIQUES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION
A HYBRID APPROACH TO COMBINING CONVENTIONAL AND DEEP LEARNING TECHNIQUES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION Yan-Hui Tu 1, Ivan Tashev 2, Chin-Hui Lee 3, Shuayb Zarar 2 1 University of
More informationDual-Microphone Speech Dereverberation in a Noisy Environment
Dual-Microphone Speech Dereverberation in a Noisy Environment Emanuël A. P. Habets Dept. of Electrical Engineering Technische Universiteit Eindhoven Eindhoven, The Netherlands Email: e.a.p.habets@tue.nl
More informationImpact Noise Suppression Using Spectral Phase Estimation
Proceedings of APSIPA Annual Summit and Conference 2015 16-19 December 2015 Impact oise Suppression Using Spectral Phase Estimation Kohei FUJIKURA, Arata KAWAMURA, and Youji IIGUI Graduate School of Engineering
More informationReal Time Noise Suppression in Social Settings Comprising a Mixture of Non-stationary and Transient Noise
th European Signal Processing Conference (EUSIPCO) Real Noise Suppression in Social Settings Comprising a Mixture of Non-stationary and Transient Noise Pei Chee Yong, Sven Nordholm Department of Electrical
More informationOPTIMAL SPECTRAL SMOOTHING IN SHORT-TIME SPECTRAL ATTENUATION (STSA) ALGORITHMS: RESULTS OF OBJECTIVE MEASURES AND LISTENING TESTS
17th European Signal Processing Conference (EUSIPCO 9) Glasgow, Scotland, August -, 9 OPTIMAL SPECTRAL SMOOTHING IN SHORT-TIME SPECTRAL ATTENUATION (STSA) ALGORITHMS: RESULTS OF OBJECTIVE MEASURES AND
More informationRobust Low-Resource Sound Localization in Correlated Noise
INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem
More informationOPTIMUM POST-FILTER ESTIMATION FOR NOISE REDUCTION IN MULTICHANNEL SPEECH PROCESSING
14th European Signal Processing Conference (EUSIPCO 6), Florence, Italy, September 4-8, 6, copyright by EURASIP OPTIMUM POST-FILTER ESTIMATION FOR NOISE REDUCTION IN MULTICHANNEL SPEECH PROCESSING Stamatis
More informationTIME-FREQUENCY CONSTRAINTS FOR PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT. Pejman Mowlaee, Rahim Saeidi
th International Workshop on Acoustic Signal Enhancement (IWAENC) TIME-FREQUENCY CONSTRAINTS FOR PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT Pejman Mowlaee, Rahim Saeidi Signal Processing and
More informationOn a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
International Conference on Advanced Computer Science and Electronics Information (ICACSEI 03) On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition Jongkuk Kim, Hernsoo Hahn Department
More informationIntroduction of Audio and Music
1 Introduction of Audio and Music Wei-Ta Chu 2009/12/3 Outline 2 Introduction of Audio Signals Introduction of Music 3 Introduction of Audio Signals Wei-Ta Chu 2009/12/3 Li and Drew, Fundamentals of Multimedia,
More informationA Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification
A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification Wei Chu and Abeer Alwan Speech Processing and Auditory Perception Laboratory Department
More informationTitle. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information
Title A Low-Distortion Noise Canceller with an SNR-Modifie Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir Proceedings : APSIPA ASC 9 : Asia-Pacific Signal Citationand Conference: -5 Issue
More informationUse of linear predictive features and pattern recognition techniques to develop a vector quantization based blind SNR estimation system
Rowan University Rowan Digital Works Theses and Dissertations 12-31-2008 Use of linear predictive features and pattern recognition techniques to develop a vector quantization based blind SNR estimation
More informationA HYBRID APPROACH TO COMBINING CONVENTIONAL AND DEEP LEARNING TECHNIQUES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION
A HYBRID APPROACH TO COMBINING CONVENTIONAL AND DEEP LEARNING TECHNIQUES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION Yan-Hui Tu 1, Ivan Tashev 2, Shuayb Zarar 2, Chin-Hui Lee 3 1 University of
More informationNOISE PSD ESTIMATION BY LOGARITHMIC BASELINE TRACING. Florian Heese and Peter Vary
NOISE PSD ESTIMATION BY LOGARITHMIC BASELINE TRACING Florian Heese and Peter Vary Institute of Communication Systems and Data Processing RWTH Aachen University, Germany {heese,vary}@ind.rwth-aachen.de
More information