CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS
|
|
- Caroline Joella Hoover
- 5 years ago
- Views:
Transcription
1 46 CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS 3.1 INTRODUCTION Personal communication of today is impaired by nearly ubiquitous noise. Speech communication becomes difficult under these conditions; speech sounds are masked by the noise and speech features like quality and intelligibility may be degraded. Today, a great deal of our personal communication is performed using some sort of communication equipment, such as cell phones and inter-com devices. With the prosperity of such portable communication devices, speech enhancement has received a lot of attention. Noise corrupted speech does indeed force the user of the communication equipment to strain both hearing and voice. Altogether, acoustic noise directly affects the human communication, and also dramatically decreases the performance of speech coding and speech recognition algorithms. This urges for effective speech enhancement methods (Nils Westerlund et al 2004). Speech enhancement refers to the restoration of clean speech. The main objective of speech enhancement is to improve one or more perceptual aspects of speech, such as human or machine speech recognition or degree of listener fatigue. Here, the additive noise source may be in the form of wideband noise, which includes white or coloured noise, stationary or nonstationary noise, or a periodic signal.
2 47 A microphone is used to receive a desired audio signal (such as human speech) and ambient noise from other sources which often interferes with the desired signal. One conventional approach has been used to solve this problem is by Speech Enhancement, a signal processing technique, which utilizes differences in the statistical characteristics of speech and noise. In most Speech Enhancement Algorithms (SEA), it is assumed that an estimate of the noise spectrum is available. Such an estimate is critical for the performance of speech enhancement algorithms as it is needed, for instance, to evaluate the Wiener filter in the Wiener algorithms (Lim and Oppenheim, 1978) or to estimate the a priori SNR in the MMSE algorithms (Ephraim and Malah 1984) or to estimate the noise covariance matrix in the subspace algorithms (Ephraim and Van Trees 1993). A frequently used digital method for effective noise reduction in speech communication is spectral subtraction. This frequency domain method is based on Fast Fourier Transform and is a non linear, yet straight forward way of reducing unwanted broadband noise acoustically added to the signal. The noise bias is estimated in frequency domain during speech pauses and then subtracted from the noisy speech spectra (Boll 1979). The speech enhancement systems capitalize on the major importance of the Short Time Spectral Amplitude (STSA) of the speech signal in its perception. A system which utilizes a Minimum Mean Square Error (MMSE) STSA estimator was proposed by Yariv et al (1984). STSA based on modeling speech and noise spectral components as statistically independent Gaussian random variables. The noise estimate can have a major impact on the quality of the enhanced signal. If the noise estimate is too low, annoying residual noise will be audible, if it is too high, speech will be distorted resulting possibly in
3 48 intelligibility loss. The simplest approach is to estimate and update the noise spectrum during the silent (e.g., during pauses) segments of the signal using a voice-activity detection (VAD) algorithm (Sohn and Kim 1999). Although such an approach might work satisfactorily in stationary noise (e.g., white noise), it will not work well in more realistic environments (e.g., in a restaurant) where the spectral characteristics of the noise might be changing constantly. Hence there is a need to update the noise spectrum continuously over time and this can be done using noise estimation algorithms. Several noise-estimation algorithms have been proposed for speech enhancement applications (Benny sallberg et al 2005, Malah et al 1999, Martin 2001, Cohen 2002, Cohen 2003, Doblinger 1995, Hirsch and Ehrlicher 1995, Lin et al 2003, Stahl et al 2000, Rangachari et al 2004, Ris and Dupont 2001). Martin (2001) proposed a method for estimating the noise spectrum based on tracking the minimum of the noisy speech over a finite window. As the minimum is typically smaller than the mean, unbiased estimates of noise spectrum were computed by introducing a bias factor based on the statistics of the minimum estimates. The noisy speech feature vectors were modeled using a mixture of Gaussians, and the noise feature vectors were obtained by maximizing a conditional likelihood function based on a recursive EM algorithm. Stochastic approximations were made to sequentially update the noise feature vectors. Some of those noise updates resembled the time-recursive updates of the noise spectrum used in the above noise-estimation algorithms. In fact, some (Garcia et al 2008) proposed the use of optimum smoothing factors for the noise updates similar to (Martin 2001). Improvements to the EM-based methods were reported in (Huijun et al 2009) using sequential Monte Carlo techniques.
4 49 Most of the aforementioned noise estimation algorithms developed for speech enhancement algorithms do not adapt quickly to increasing noise levels. A noise-estimation algorithm proposed by Rangachari et al (2004) updates the noise estimate faster than the above methods and also avoids overestimation of the noise level. The noise estimate was updated in each frame based on Voice Activity Detection. If speech was absent in a specific frame, the noise estimate was updated with a constant smoothing factor. The speech-presence decision made in each speech frame was based on the ratio of noisy speech spectrum to its local minimum. Improving the quality of speech, hidden in noise, is of great importance in speech communication systems. These systems are used in common real acoustic environments. The nonlinear subtraction is used when a frequency dependent Signal to Noise Ratio (SNR) is obtained by Jiri poruba (2002). Westerlund et al (2003) have proposed the Adaptive Gain Equalizer method (AGE), where the input signal is divided into a number of subbands. These are individually weighed in time domain, in accordance to the short time Signal to Noise Ratio (SNR). A fractional bank gammatone filter for speech enhancement based on a short-term temporal Masking threshold to Noise Ratio (MNR) was proposed by Teddy Surya Gunawan et al (2004). Further research is required to fine tune the parameters for different speech and/or noise characteristics. Various speech enhancement algorithms have been proposed to improve the performance of modern communication devices in noisy environments. Yet, it still remains unclear as to which speech enhancement algorithm performs well in real-world listening situations where the background noise level and characteristics are constantly changing. Reliable and fair comparison between algorithms has been elusive for several reasons, including lack of common speech database for evaluation of new algorithms,
5 50 differences in the types of noise used and differences in the testing methodology (Yi Hu et al 2006). In this chapter, a Speech Enhancement Algorithm developed by Westerlund et al (2003) is presented along with Non linear Spectral Subtraction method as discussed in Jiri poruba (2002). 3.2 ADAPTIVE GAIN EQUALIZATION The Adaptive Gain Equalization (AGE) method for speech enhancement, introduced by Westerlund et al (2003) separates itself from the traditional methods of improving the SNR of a signal corrupted by noise, through moving away from noise suppression and focusing primarily on speech boosting. Noise suppression traditionally, like spectral subtraction, looks at subtracting an estimated noise bias from the signal corrupted by noise, whereas speech boosting aims to enhance the speech part of the signal by adding an estimate of the speech itself, thus boosting the speech part of the signal. The difference between noise suppression and speech boosting is presented in Figure 3.1. Figure 3.1 (a) shows a noise estimate being subtracted from a noise corrupted signal, while in Figure 3.1 (b), an estimate of the speech signal is used to boost the speech in the noise corrupted signal (Nils Westerlund et al 2004). (a) Noise Suppression (b) Speech Boosting Figure 3.1 Difference between noise suppression and speech boosting
6 51 An acoustical discrete time speech is signal denoted by s(n) and a discrete time noise signal is denoted by w(n). The noise corrupted speech signal x(n) can then be written as X(n) = s(n) + w(n) (3.1) By filtering the input signal x(n) using a bank of k bandpass filters, h (n), the signal is divided into K subbands, each denoted by x (n) where k is the subband index. This filtering operation can be written in time domain as X (n) = x(n) h (n) (3.2) where * is the convolution operator. The original signal can then described as X(n) = x (n) = s (n) + w (n) (3.3) where s (n) is the speech part subband k and w (n) is the noise part subband k. The output y(n) is defined by y(n) = G (n)x (n) (3.4) where ) is a weighting function that amplifies the band k during speech activity. The ) introduces a gain to each subband, the function will be denoted as gain function that weights the input signal subbands using the ratio between s (n) and w (n), a short term SNR estimate. A block scheme illustration the subband decomposition, weighting and final summation is shown in Figure 3.2. The input signal is divided into a number of frequency subbands, that are individually and adaptively weighted in time domain according to a short term Signal to Noise Ratio (SNR) estimate in each subband at every time instant. When dividing a signal into subbands, a number of different banks could be employed.
7 52 Gain Control ) ) ) Gain Control ) ) Gain Control ) ) ) Figure 3.2 Block diagram of adaptive gain equalization The gain function in each subband is found by using a ratio of a short term exponential magnitude average, A (n), is calculated as A (n) = A (n 1) + x (n) (3.5) where is a small positive constant controlling how sensitive the algorithm should be to rapid changes in input signal amplitude in subband k. Human speech can be considered approximately short time stationary and constant should be chosen with this in mind. A suitable value for can be estimated using the equation = (3.6) where F is the sampling frequency and T is a time constant.
8 53 A The slowing varying noise floor level estimate for each subband k, (n), is calculated according to A (n)= (1 + k)a ( 1) if Ax, k(n) > A ( 1) Ax, k(n) if Ax, k(n) A ( 1) (3.7) where is a small positive constant controlling how fast the noise floor level estimation in subband k will adapt to change in the noise environment. Note that A (n) A (n)and that <<. G (n) according to The variable A (n) and A (n) are used to form the gain function G (n)= ; p 0, A (n) > 0 (3.8) where p decides the gain raise individually applied to each of the subband signals. The resulting speech enhanced output signal y(n) is then calculated as in equation (3.4). The calculation of G (n) involves division, care must be taken to ensure that the quotient does not become excessively large due to a small A (n). In a situation with a fair SNR, G (n) will become considerable if no limit is imposed on this function, resulting in unacceptable high speech amplification. A limiter can be imposed on G (n) as G (n)= G (n) if G (n) L L if G (n) > L (3.9) where L is some positive constant, expressed in decibels.
9 54 The concept of obtaining a speech bias estimate to perform speech boosting may seem like a daunting task. But it does not need to be, the AGE method of speech enhancement relies on a few basic ideas. The first of which is that a speech signal which is corrupted by bandlimited noise can be divided into a number of subbands and each of these subbands can be individually and adaptively boosted according to a SNR estimate in that particular subband. In each subband, a short term average is calculated simultaneously with an estimate of a slowly varying noise floor level. By using the short term average and floor estimate, a gain function is calculated per subband through dividing the short term average by the floor estimate. This gain function is multiplied with the corresponding signal in each subband to form an output per subband. The sum of the outputs from each subband forms the final output signal, which should contain a higher SNR when compared to the original noisy signal. The AGE acts as a speech booster, which is adaptively looking for a subband speech signal to boost. Outlining that speech energy is a highly non-stationary input amplitude excursion, if there is no such excursions no alteration to the subband will be performed, the AGE will remain idle, as a result of the quotient between the short term magnitude average and the noise floor estimate being unity, with them being approximately the same. If speech is present the short term magnitude average will change with the noise floor level remaining approximately unchanged, thus amplifying the signal in the subband at hand due to the quotient becoming larger than unity. During periods of no speech activity, using the AGE provides distortion free background noise during speech activity due to masking effects. This results in increased speech quality with the output signal having a natural sound with minimum distortion and artifacts.
10 55 The AGE algorithm can be implemented either on digital or analogue circuits proving to be versatile and flexible. The speech enhancement is performed continuously in each subband, which means no voice activity detectors are required. The method is Stand-Alone; it works independently of different speech coding schemes or other adaptive algorithms. Using the AGE requires minimum amendment for good performance Drawbacks OF AGE When analysis was done under various noise conditions it was seen that the algorithm has an obvious failing point for a SNR of -5 db, with inadequate levels of noise suppression for SNR less than this point. This is due to the fact that short term average failing to track the speech spectra of a speech signal which is heavily corrupted by noise. 3.3 SPECTRAL SUBTRACTION METHOD The standard spectral subtraction method is described in the following equations. A short-term noise spectral magnitude is subtracted from a degraded speech signal by = (3.10) where = (3.11) ) = (3.12)
11 56 The is a short-term spectral estimation of speech (frame ), a short-term estimation of noise (frame i), a smoothed-out estimate of the corrupted magnitude at time i, a smoothed-out estimate of the noise magnitude at time i, and is a clean speech estimate. following equations The magnitude and can be computed from = + (1 ), (3.13) = + (1 ), (3.14) where the values of the memory factors are found in intervals and NON LINEAR SPECTRAL SUBSTRACTION The nonlinear subtraction (NSS) is used when a frequencydependent Signal to Noise Ratio (SNR) is obtained. An improved noise model in the nonlinear spectral subtraction scheme is determined by (, ) where ) is a frequencydependent overestimation factor that can be estimated during the speech pauses jointly with the noise magnitude. The factor ) is possible to determine from the following equation = ( ). (3.15) Provided the noise magnitude ) and the noise overestimation model ) are known, it is possible for the nonlinear subtraction to be performed through the equation
12 57 ) = ) ), ), ) ). (3.16) A biased estimate of SNR ) at frame is determined by ) = (3.17) is a nonlinear function which determines a subtraction measure on the basis of the Signal to Noise Ratio determination: ) and the following interval ) ), ), ) 3 ) (3.18) It is essentially an arbitrary function, which realizes the following idea: a minimal subtraction is used if the signal-to-noise ratio ) is high and on the contrary, more noise is subtracted when the SNR ) is low (here the subtraction factor is at maximum). For example it is possible to use the following function (, ) = (3.19) where is a weight factor depending on the variation range of SNR. 3.5 PROPOSED SPEECH ENHANCEMENT ALGORITHM AGE when coupled with Non linear Spectral Subtraction (AGE+ NSS) performs better than AGE when SNR drops below -5db. The first step requires the signal to be filtered into number of subbands. In this example, number of subbands is chosen to be eight. The signal which is sampled at 16kHz is filtered into eight subbands. Non linear spectral subtraction is applied to each of the subband. Short term exponential magnitude average and noise floor is taken simultaneously. Using the short term exponential
13 58 magnitude average and noise floor the gain is calculated and it is multiplied with the spectra. The block diagram of Adaptive Gain Equalization with Non linear Spectral Subtraction (AGE+NSS) is shown in Figure 3.3. Figure 3.3 Block diagram of proposed algorithm 3.6 RESULTS AND DISCUSSION Noise sources were taken from AURORA database, includes suburban train noise, babble, car, exhibition hall, restaurant, street, airport and train-station noise. In training phase, uttered words (100 samples each digits 0-9, both male and female) were recorded using 8-bit Pulse Code Modulation (PCM) with a sampling rate of 8 khz and saved as a wave file using sound recorder software. Automatic Speech Recognition systems work reasonably well under clean conditions but become fragile in practical applications involving realworld environments.
14 59 Analysis was carried out at different environmental noises for digits 0-9 at different SNR values. The proposed SEA system works better for different noises at different SNR values. From Tables 3.1 and 3.2 it is observed that proposed algorithm results in better RA compared to existing AGE algorithm for different noise conditions. The proposed algorithm yields an improvement of 20.89% in terms of RA for Exhibition noise compared to other noise sources for 0dB SNR. This is due to the fact the spectral components of Exhibition noise are distributed over the different frequencies and their amplitudes are not up to the level at which they disturb the existing audio frequency range of the isolated words from zero to nine. From Table 3.3 and 3.4 it can be inferred that at 5dB of SNR, an improvement of 13.25% in RA was observed for an ASR in presence of Station noise compared to other noise sources. The proposed ASR with SEA performs better as the Station noise signal is not of sufficient amplitude to disturb the signal strength of the isolated words from zero to nine. At 10dB level the proposed algorithm yields an improvement of 10.03% in RA in presence of Car noise compared to other noise sources as tabulated from Table 3.5 and Table 3.6. The strength of spectral components is at medium level for Car noise compared to other noise sources and it results in better RA. As the noise spectral components are defined clearly at 15dB level, it is easier for the proposed ASR-SEA algorithm to remove the noise components better that in presence the other noise sources at different levels as shown in the Tables 3.7 and 3.8.
15 60
16 61
17 62
18 63
19 64 Table 3.9 shows the performance of ASR, the proposed method performs better with maximum improvement of 20.89% RA for Exhibition noise and with a minimum improvement of 4.58% RA for Station noise. Proposed SEA works better than existing SEA algorithm for different noises at different SNR values. Better recognition occurred for Street noise and least recognition for Airport noise was observed in Figure 3.4. Table 3.9 Overall performance analysis of Proposed SEA algorithm in terms of % improvement in RA Percentage Improvement Better 0dB 5dB 10dB 15dB Exhibition Station Car Airport (20.89%) (13.25 %) (10.03 %) (7.59 %) Least Airport (13.0%) Airport (9.67%) Airport (7.13 %) Station (4.58%) Overall % of RA for proposed and existing Algorithm % of RA Proposed Method Existing Method Noise Sources Figure 3.4 Overall percentage RA of proposed and existing SEA for various noises
20 65 From Table 3.1 through 3.8 it can inferred that best results for improvement in RA was observed for isolated words in presence of Street noise followed by equally good performance in presence of Car and Train noise. The improvement in RA for proposed algorithm in presence of various noise sources at different levels is that, the NSS with adaptive gain equalizer works well for non-stationary signals compared to stationary signals. 3.7 CONCLUSION Several experiments were conducted commonly to evaluate SEA algorithm. Analysis mainly focused on error probabilities. Proposed SEA was evaluated in terms of ability to discriminate speech from non speech at different SNR s values. SEA avoid losing speech periods leading to an extremely conservative behavior in detecting speech pauses. Proposed framework uses a speech processing module including a noise estimation algorithm with HMM based classification and noise language modeling to achieve effective noise knowledge estimation. Analysis was taken at different environmental noises for digits 0-9 at different SNR values. Recognition Accuracy (RA) increases when noise is estimated for each frame than without noise is being estimated. The proposed method combining the Adaptive Gain Equalizer (AGE) and Non linear Spectral Subtraction (NSS) system works better for different noises at different SNR values.
Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System
Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System C.GANESH BABU 1, Dr.P..T.VANATHI 2 R.RAMACHANDRAN 3, M.SENTHIL RAJAA 3, R.VENGATESH 3 1 Research Scholar (PSGCT)
More informationDifferent Approaches of Spectral Subtraction Method for Speech Enhancement
ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches
More informationMMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2
MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 1 Electronics and Communication Department, Parul institute of engineering and technology, Vadodara,
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationChapter 4 SPEECH ENHANCEMENT
44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or
More informationNoise Estimation and Noise Removal Techniques for Speech Recognition in Adverse Environment
Noise Estimation and Noise Removal Techniques for Speech Recognition in Adverse Environment Urmila Shrawankar 1,3 and Vilas Thakare 2 1 IEEE Student Member & Research Scholar, (CSE), SGB Amravati University,
More informationCHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS
66 CHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS 4.1 INTRODUCTION New frontiers of speech technology are demanding increased levels of performance in many areas. In the advent of Wireless Communications
More informationEnhancement of Speech in Noisy Conditions
Enhancement of Speech in Noisy Conditions Anuprita P Pawar 1, Asst.Prof.Kirtimalini.B.Choudhari 2 PG Student, Dept. of Electronics and Telecommunication, AISSMS C.O.E., Pune University, India 1 Assistant
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationEffective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a
R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationRASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991
RASTA-PLP SPEECH ANALYSIS Hynek Hermansky Nelson Morgan y Aruna Bayya Phil Kohn y TR-91-069 December 1991 Abstract Most speech parameter estimation techniques are easily inuenced by the frequency response
More informationPerformance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment
BABU et al: VOICE ACTIVITY DETECTION ALGORITHM FOR ROBUST SPEECH RECOGNITION SYSTEM Journal of Scientific & Industrial Research Vol. 69, July 2010, pp. 515-522 515 Performance analysis of voice activity
More informationSpeech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech
Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Project Proposal Avner Halevy Department of Mathematics University of Maryland, College Park ahalevy at math.umd.edu
More informationFrequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement
Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationSingle channel noise reduction
Single channel noise reduction Basics and processing used for ETSI STF 94 ETSI Workshop on Speech and Noise in Wideband Communication Claude Marro France Telecom ETSI 007. All rights reserved Outline Scope
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationREAL-TIME BROADBAND NOISE REDUCTION
REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time
More informationSpeech Enhancement for Nonstationary Noise Environments
Signal & Image Processing : An International Journal (SIPIJ) Vol., No.4, December Speech Enhancement for Nonstationary Noise Environments Sandhya Hawaldar and Manasi Dixit Department of Electronics, KIT
More informationSpeech Signal Enhancement Techniques
Speech Signal Enhancement Techniques Chouki Zegar 1, Abdelhakim Dahimene 2 1,2 Institute of Electrical and Electronic Engineering, University of Boumerdes, Algeria inelectr@yahoo.fr, dahimenehakim@yahoo.fr
More informationModulator Domain Adaptive Gain Equalizer for Speech Enhancement
Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Ravindra d. Dhage, Prof. Pravinkumar R.Badadapure Abstract M.E Scholar, Professor. This paper presents a speech enhancement method for personal
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationModified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments
Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments G. Ramesh Babu 1 Department of E.C.E, Sri Sivani College of Engg., Chilakapalem,
More informationAnalysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model
Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model Harjeet Kaur Ph.D Research Scholar I.K.Gujral Punjab Technical University Jalandhar, Punjab, India Rajneesh Talwar Principal,Professor
More informationSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure
More informationMODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS
MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS 1 S.PRASANNA VENKATESH, 2 NITIN NARAYAN, 3 K.SAILESH BHARATHWAAJ, 4 M.P.ACTLIN JEEVA, 5 P.VIJAYALAKSHMI 1,2,3,4,5 SSN College of Engineering,
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationEnhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients
ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds
More informationSpeech Enhancement Using a Mixture-Maximum Model
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 6, SEPTEMBER 2002 341 Speech Enhancement Using a Mixture-Maximum Model David Burshtein, Senior Member, IEEE, and Sharon Gannot, Member, IEEE
More informationEXPERIMENTAL INVESTIGATION INTO THE OPTIMAL USE OF DITHER
EXPERIMENTAL INVESTIGATION INTO THE OPTIMAL USE OF DITHER PACS: 43.60.Cg Preben Kvist 1, Karsten Bo Rasmussen 2, Torben Poulsen 1 1 Acoustic Technology, Ørsted DTU, Technical University of Denmark DK-2800
More informationVoice Activity Detection
Voice Activity Detection Speech Processing Tom Bäckström Aalto University October 2015 Introduction Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class
More informationNoise Estimation and Noise Removal Techniques for Speech Recognition in Adverse Environment
Noise Estimation and Noise Removal Techniques for Speech Recognition in Adverse Environment Urmila Shrawankar, Vilas Thakare To cite this version: Urmila Shrawankar, Vilas Thakare. Noise Estimation and
More informationSpeech Enhancement Techniques using Wiener Filter and Subspace Filter
IJSTE - International Journal of Science Technology & Engineering Volume 3 Issue 05 November 2016 ISSN (online): 2349-784X Speech Enhancement Techniques using Wiener Filter and Subspace Filter Ankeeta
More informationEffects of Reverberation on Pitch, Onset/Offset, and Binaural Cues
Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation
More informationRobust Low-Resource Sound Localization in Correlated Noise
INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem
More informationRECENTLY, there has been an increasing interest in noisy
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In
More informationCodebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B.
Codebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B. Published in: IEEE Transactions on Audio, Speech, and Language Processing DOI: 10.1109/TASL.2006.881696
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationSpeech Enhancement using Wiener filtering
Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing
More informationPerceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter
Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School
More informationAvailable online at ScienceDirect. Procedia Computer Science 89 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 666 676 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Comparison of Speech
More information24 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 1, JANUARY /$ IEEE
24 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 1, JANUARY 2009 Speech Enhancement, Gain, and Noise Spectrum Adaptation Using Approximate Bayesian Estimation Jiucang Hao, Hagai
More informationModulation Domain Spectral Subtraction for Speech Enhancement
Modulation Domain Spectral Subtraction for Speech Enhancement Author Paliwal, Kuldip, Schwerin, Belinda, Wojcicki, Kamil Published 9 Conference Title Proceedings of Interspeech 9 Copyright Statement 9
More informationVQ Source Models: Perceptual & Phase Issues
VQ Source Models: Perceptual & Phase Issues Dan Ellis & Ron Weiss Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA {dpwe,ronw}@ee.columbia.edu
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationSpeech Compression for Better Audibility Using Wavelet Transformation with Adaptive Kalman Filtering
Speech Compression for Better Audibility Using Wavelet Transformation with Adaptive Kalman Filtering P. Sunitha 1, Satya Prasad Chitneedi 2 1 Assoc. Professor, Department of ECE, Pragathi Engineering College,
More informationNoise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise Ratio in Nonstationary Noisy Environments
88 International Journal of Control, Automation, and Systems, vol. 6, no. 6, pp. 88-87, December 008 Noise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise
More informationOnline Version Only. Book made by this file is ILLEGAL. 2. Mathematical Description
Vol.9, No.9, (216), pp.317-324 http://dx.doi.org/1.14257/ijsip.216.9.9.29 Speech Enhancement Using Iterative Kalman Filter with Time and Frequency Mask in Different Noisy Environment G. Manmadha Rao 1
More informationHUMAN speech is frequently encountered in several
1948 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 7, SEPTEMBER 2012 Enhancement of Single-Channel Periodic Signals in the Time-Domain Jesper Rindom Jensen, Student Member,
More informationSPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING
SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING K.Ramalakshmi Assistant Professor, Dept of CSE Sri Ramakrishna Institute of Technology, Coimbatore R.N.Devendra Kumar Assistant
More informationGUI Based Performance Analysis of Speech Enhancement Techniques
International Journal of Scientific and Research Publications, Volume 3, Issue 9, September 2013 1 GUI Based Performance Analysis of Speech Enhancement Techniques Shishir Banchhor*, Jimish Dodia**, Darshana
More informationAvailable online at ScienceDirect. Procedia Computer Science 54 (2015 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 54 (2015 ) 574 584 Eleventh International Multi-Conference on Information Processing-2015 (IMCIP-2015) Speech Enhancement
More informationIsolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques
Isolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques 81 Isolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques Noboru Hayasaka 1, Non-member ABSTRACT
More informationDifferent Approaches of Spectral Subtraction method for Enhancing the Speech Signal in Noisy Environments
International Journal of Scientific & Engineering Research, Volume 2, Issue 5, May-2011 1 Different Approaches of Spectral Subtraction method for Enhancing the Speech Signal in Noisy Environments Anuradha
More informationStudents: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa
Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions
More informationModulation Domain Improved Adaptive Gain Equalizer for Single Channel Speech Enhancement
Master Thesis Electrical Engineering Modulation Domain Improved Adaptive Gain Equalizer for Single Channel Speech Enhancement ADITHYA VALLI NETTEM SHAKIRA SHAHEEN This thesis is presented as part of Degree
More informationSpeech Enhancement in Noisy Environment using Kalman Filter
Speech Enhancement in Noisy Environment using Kalman Filter Erukonda Sravya 1, Rakesh Ranjan 2, Nitish J. Wadne 3 1, 2 Assistant professor, Dept. of ECE, CMR Engineering College, Hyderabad (India) 3 PG
More informationComparative Performance Analysis of Speech Enhancement Methods
International Journal of Innovative Research in Electronics and Communications (IJIREC) Volume 3, Issue 2, 2016, PP 15-23 ISSN 2349-4042 (Print) & ISSN 2349-4050 (Online) www.arcjournals.org Comparative
More informationDetection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio
>Bitzer and Rademacher (Paper Nr. 21)< 1 Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio Joerg Bitzer and Jan Rademacher Abstract One increasing problem for
More informationAdaptive Noise Reduction Algorithm for Speech Enhancement
Adaptive Noise Reduction Algorithm for Speech Enhancement M. Kalamani, S. Valarmathy, M. Krishnamoorthi Abstract In this paper, Least Mean Square (LMS) adaptive noise reduction algorithm is proposed to
More informationDigital Signal Processing of Speech for the Hearing Impaired
Digital Signal Processing of Speech for the Hearing Impaired N. Magotra, F. Livingston, S. Savadatti, S. Kamath Texas Instruments Incorporated 12203 Southwest Freeway Stafford TX 77477 Abstract This paper
More informationAdaptive Noise Reduction of Speech. Signals. Wenqing Jiang and Henrique Malvar. July Technical Report MSR-TR Microsoft Research
Adaptive Noise Reduction of Speech Signals Wenqing Jiang and Henrique Malvar July 2000 Technical Report MSR-TR-2000-86 Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052 http://www.research.microsoft.com
More informationReliable A posteriori Signal-to-Noise Ratio features selection
Reliable A eriori Signal-to-Noise Ratio features selection Cyril Plapous, Claude Marro, Pascal Scalart To cite this version: Cyril Plapous, Claude Marro, Pascal Scalart. Reliable A eriori Signal-to-Noise
More informationtwo computers. 2- Providing a channel between them for transmitting and receiving the signals through it.
1. Introduction: Communication is the process of transmitting the messages that carrying information, where the two computers can be communicated with each other if the two conditions are available: 1-
More informationPERFORMANCE ANALYSIS OF SPEECH SIGNAL ENHANCEMENT TECHNIQUES FOR NOISY TAMIL SPEECH RECOGNITION
Journal of Engineering Science and Technology Vol. 12, No. 4 (2017) 972-986 School of Engineering, Taylor s University PERFORMANCE ANALYSIS OF SPEECH SIGNAL ENHANCEMENT TECHNIQUES FOR NOISY TAMIL SPEECH
More informationSignal Processing 91 (2011) Contents lists available at ScienceDirect. Signal Processing. journal homepage:
Signal Processing 9 (2) 55 6 Contents lists available at ScienceDirect Signal Processing journal homepage: www.elsevier.com/locate/sigpro Fast communication Minima-controlled speech presence uncertainty
More informationOptimal Adaptive Filtering Technique for Tamil Speech Enhancement
Optimal Adaptive Filtering Technique for Tamil Speech Enhancement Vimala.C Project Fellow, Department of Computer Science Avinashilingam Institute for Home Science and Higher Education and Women Coimbatore,
More informationRecent Advances in Acoustic Signal Extraction and Dereverberation
Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing
More informationAudio Imputation Using the Non-negative Hidden Markov Model
Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han 1,, Gautham J. Mysore 2, and Bryan Pardo 1 1 EECS Department, Northwestern University 2 Advanced Technology Labs, Adobe Systems Inc.
More informationEnhancement of Speech Communication Technology Performance Using Adaptive-Control Factor Based Spectral Subtraction Method
Enhancement of Speech Communication Technology Performance Using Adaptive-Control Factor Based Spectral Subtraction Method Paper Isiaka A. Alimi a,b and Michael O. Kolawole a a Electrical and Electronics
More informationWavelet Speech Enhancement based on the Teager Energy Operator
Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose
More informationAudio Restoration Based on DSP Tools
Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract
More informationChapter 3. Speech Enhancement and Detection Techniques: Transform Domain
Speech Enhancement and Detection Techniques: Transform Domain 43 This chapter describes techniques for additive noise removal which are transform domain methods and based mostly on short time Fourier transform
More informationESE531 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing
University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing ESE531, Spring 2017 Final Project: Audio Equalization Wednesday, Apr. 5 Due: Tuesday, April 25th, 11:59pm
More informationPhase estimation in speech enhancement unimportant, important, or impossible?
IEEE 7-th Convention of Electrical and Electronics Engineers in Israel Phase estimation in speech enhancement unimportant, important, or impossible? Timo Gerkmann, Martin Krawczyk, and Robert Rehr Speech
More informationNoise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics
504 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 5, JULY 2001 Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics Rainer Martin, Senior Member, IEEE
More informationAS DIGITAL speech communication devices, such as
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 4, MAY 2012 1383 Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay Timo Gerkmann, Member, IEEE,
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationSingle Channel Speech Enhancement in Severe Noise Conditions
Single Channel Speech Enhancement in Severe Noise Conditions This thesis is presented for the degree of Doctor of Philosophy In the School of Electrical, Electronic and Computer Engineering The University
More informationKeywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.
Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement
More informationNoise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging
466 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 5, SEPTEMBER 2003 Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging Israel Cohen Abstract
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationA Survey and Evaluation of Voice Activity Detection Algorithms
A Survey and Evaluation of Voice Activity Detection Algorithms Seshashyama Sameeraj Meduri (ssme09@student.bth.se, 861003-7577) Rufus Ananth (anru09@student.bth.se, 861129-5018) Examiner: Dr. Sven Johansson
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationSingle-channel speech enhancement using spectral subtraction in the short-time modulation domain
Single-channel speech enhancement using spectral subtraction in the short-time modulation domain Kuldip Paliwal, Kamil Wójcicki and Belinda Schwerin Signal Processing Laboratory, Griffith School of Engineering,
More informationDenoising Of Speech Signal By Classification Into Voiced, Unvoiced And Silence Region
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 11, Issue 1, Ver. III (Jan. - Feb.216), PP 26-35 www.iosrjournals.org Denoising Of Speech
More informationEstimation of Non-stationary Noise Power Spectrum using DWT
Estimation of Non-stationary Noise Power Spectrum using DWT Haripriya.R.P. Department of Electronics & Communication Engineering Mar Baselios College of Engineering & Technology, Kerala, India Lani Rachel
More informationAdvanced Signal Processing and Digital Noise Reduction
Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK ~ W I lilteubner L E Y A Partnership between
More informationLecture 4 Biosignal Processing. Digital Signal Processing and Analysis in Biomedical Systems
Lecture 4 Biosignal Processing Digital Signal Processing and Analysis in Biomedical Systems Contents - Preprocessing as first step of signal analysis - Biosignal acquisition - ADC - Filtration (linear,
More informationAcoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface
MEE-2010-2012 Acoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface Master s Thesis S S V SUMANTH KOTTA BULLI KOTESWARARAO KOMMINENI This thesis is presented
More informationAnalysis Modification synthesis based Optimized Modulation Spectral Subtraction for speech enhancement
Analysis Modification synthesis based Optimized Modulation Spectral Subtraction for speech enhancement Pavan D. Paikrao *, Sanjay L. Nalbalwar, Abstract Traditional analysis modification synthesis (AMS
More informationSpeech Enhancement Based on Audible Noise Suppression
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 5, NO. 6, NOVEMBER 1997 497 Speech Enhancement Based on Audible Noise Suppression Dionysis E. Tsoukalas, John N. Mourjopoulos, Member, IEEE, and George
More informationSpeech Volume Monitor for Hearing Impaired
Speech Volume Monitor for Hearing Impaired R.DEEPA (Mphil Research scholar) PSGR Krishnnaml college for women. GRG School of Applied Technology Coimbatore,India Abstract Hearing impaired can be classified
More informationTesting of Objective Audio Quality Assessment Models on Archive Recordings Artifacts
POSTER 25, PRAGUE MAY 4 Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts Bc. Martin Zalabák Department of Radioelectronics, Czech Technical University in Prague, Technická
More informationSPEECH ENHANCEMENT BASED ON A LOG-SPECTRAL AMPLITUDE ESTIMATOR AND A POSTFILTER DERIVED FROM CLEAN SPEECH CODEBOOK
18th European Signal Processing Conference (EUSIPCO-2010) Aalborg, Denmar, August 23-27, 2010 SPEECH ENHANCEMENT BASED ON A LOG-SPECTRAL AMPLITUDE ESTIMATOR AND A POSTFILTER DERIVED FROM CLEAN SPEECH CODEBOOK
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationAutomotive three-microphone voice activity detector and noise-canceller
Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR
More informationIntegrated Speech Enhancement Technique for Hands-Free Mobile Phones
Master Thesis Electrical Engineering August 2012 Integrated Speech Enhancement Technique for Hands-Free Mobile Phones ANEESH KALUVA School of Engineering Department of Electrical Engineering Blekinge Institute
More information