Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Similar documents
Chapter 4 SPEECH ENHANCEMENT

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement

Comparative Performance Analysis of Speech Enhancement Methods

Enhancement of Speech in Noisy Conditions

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2

REAL-TIME BROADBAND NOISE REDUCTION

Speech Enhancement in Noisy Environment using Kalman Filter

CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS

Audio Restoration Based on DSP Tools

Speech Signal Enhancement Techniques

Online Version Only. Book made by this file is ILLEGAL. 2. Mathematical Description

Robust Low-Resource Sound Localization in Correlated Noise

Isolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Speech Synthesis using Mel-Cepstral Coefficient Feature

RECENTLY, there has been an increasing interest in noisy

Speech Enhancement Based On Noise Reduction

A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification

Mel Spectrum Analysis of Speech Recognition using Single Microphone

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS

Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Speech Enhancement Using a Mixture-Maximum Model

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Shweta Kumari, 2 Priyanka Jaiswal, 3 Dr. Manish Jain 1,2

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes

Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

24 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 1, JANUARY /$ IEEE

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter

NOISE ESTIMATION IN A SINGLE CHANNEL

Speech Enhancement using Wiener filtering

Advances in Applied and Pure Mathematics

Calibration of Microphone Arrays for Improved Speech Recognition

EE482: Digital Signal Processing Applications

Speech Enhancement for Nonstationary Noise Environments

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks

A Spectral Conversion Approach to Single- Channel Speech Enhancement

Adaptive Noise Reduction Algorithm for Speech Enhancement

SPEECH communication under noisy conditions is difficult

Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments

High-speed Noise Cancellation with Microphone Array

Recent Advances in Acoustic Signal Extraction and Dereverberation

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Single channel noise reduction

Available online at ScienceDirect. Procedia Computer Science 54 (2015 )

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation

Wavelet Speech Enhancement based on the Teager Energy Operator

Mikko Myllymäki and Tuomas Virtanen

ROBUST PITCH TRACKING USING LINEAR REGRESSION OF THE PHASE

REAL TIME DIGITAL SIGNAL PROCESSING

Modulation Domain Spectral Subtraction for Speech Enhancement

CHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS

Phase estimation in speech enhancement unimportant, important, or impossible?

SPEECH SIGNAL ENHANCEMENT USING FIREFLY OPTIMIZATION ALGORITHM

Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Implementation of SYMLET Wavelets to Removal of Gaussian Additive Noise from Speech Signal

Adaptive Noise Reduction of Speech. Signals. Wenqing Jiang and Henrique Malvar. July Technical Report MSR-TR Microsoft Research

Lecture 4 Biosignal Processing. Digital Signal Processing and Analysis in Biomedical Systems

Application of Affine Projection Algorithm in Adaptive Noise Cancellation

Chapter 3. Speech Enhancement and Detection Techniques: Transform Domain

SPEECH enhancement has many applications in voice

SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING

Noise Estimation based on Standard Deviation and Sigmoid Function Using a Posteriori Signal to Noise Ratio in Nonstationary Noisy Environments

Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise

Enhancement of Speech Communication Technology Performance Using Adaptive-Control Factor Based Spectral Subtraction Method

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

Estimation of Non-stationary Noise Power Spectrum using DWT

Overview of Code Excited Linear Predictive Coder

HUMAN speech is frequently encountered in several

GUI Based Performance Analysis of Speech Enhancement Techniques

Robust Speech Feature Extraction using RSF/DRA and Burst Noise Skipping

Speech Enhancement Based on Non-stationary Noise-driven Geometric Spectral Subtraction and Phase Spectrum Compensation

CG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003

AN AUTOREGRESSIVE BASED LFM REVERBERATION SUPPRESSION FOR RADAR AND SONAR APPLICATIONS

Auditory modelling for speech processing in the perceptual domain

Performance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment

Codebook-based Bayesian speech enhancement for nonstationary environments Srinivasan, S.; Samuelsson, J.; Kleijn, W.B.

Audio Imputation Using the Non-negative Hidden Markov Model

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

STFT Phase Reconstruction in Voiced Speech for an Improved Single-Channel Speech Enhancement

Speech Enhancement Based on Audible Noise Suppression

Adaptive Noise Canceling for Speech Signals

Bandwidth Extension for Speech Enhancement

Transcription:

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE, GMR Institute of Technology, Rajam, India. Abstract The process of suppressing the back ground noise in speech signals can be improved by subtraction of an estimate of the average noise spectrum from the noisy signal spectrum. The main objective of this paper is to implement and evaluate speech enhancement techniques based on spectral subtraction methods in presence of noise. The two techniques discussed in this paper are spectral subtraction filter and wiener filter. The simulation results reveal the superiority of the proposed methods for different back ground noises. Keywords- Spectral Subtraction, Additive white Gaussian Noise, Weiner Filter, Power spectrum. I. INTRODUCTION The main objective of the speech enhancement is to minimize the effects of noise on speech by improving the perceptual quality of noisy speech. In a real world environment most of the times the speech signal is accompanied by back ground noise. The presence of the additive back ground noise like car noise, babble noise, train noise affects the quality of the speech signal. A number of speech enhancement methods are proposed to enhance the quality of the degrade speech. A majority of the speech enhancement techniques can be grouped into temporal processing and spectral processing methods. In temporal processing techniques, the processing is done in the time domain, and in spectral processing techniques, the enhancement achieved by processing the degraded speech signal in frequency domain. The proper selection of the enhancement method depends on the type of degradation. Generally, the speech enhancement problem consists of a family of problems characterized by the type of noise source, the way noise is interfered with the clean signal, the number of microphone outputs and number of voice channels available for enhancement [1]. Time domain techniques are developed based on Infinite Impulse Response (IIR) filters, Finite Impulse Response (FIR) filters, Linear Predictive Coefficients (LPC), Hidden Markov Models (HMM) and Kalman filters. In transformation domain technique, transformation is performed over degraded speech before filtering. Followed by an inverse transformation to reinstate the original speech. The main advantage of performing noise reduction lies in the relative simplicity of distinguishing and removing noise from the speech signal [3]. Most of the research work carried out on speech enhancement uses the additive noise model to model background noise. In the time domain one of the simplest methods used to reduce noise is the comb filters, which exploits the periodicity of the voiced signal. The basic approach of this method is to use a FIR filter whose coefficients are separated by the pitch period. Linear predictive coding (LPC) is a widely used technique, which is a iterative scheme to estimate the LP parameters of the speech signal. This is based on autoregressive model of the speech signal. The LPC proven to be an effective and computationally efficient choice for speech enhancement. The kalman filtering is one of the important time domain method used for speech enhancement. The main objective for using kalman filters comes from the following advantages. (a). It can adapt to non stationary signals (e.g. Speech signal); (b). It operates on finite data set; (c) it makes use of speech and noise production model. Hidden Markov Model is one of the dominant techniques used for acoustic models. A general HMM model consists of several interconnected states, which are designed to capture time varying signal characteristics and considered to be the generalization of Gaussian Mixture Model (GMM) The model jumps form one state to another according to the signal and the state transition relationship between the states. The HMM based enhancement techniques yields reasonably well enhanced speech. Spectral subtraction can be categorized as non parametric speech enhancement method. This is used for enhancing speech degraded by additive stationary background noise. Weiner filtering is one of the alternative methods to spectral subtraction to enhance the degraded speech signal. In this paper we focused on spectral subtraction noise removal approach in speech processing along with Weiner filtering approach.. The spectral subtraction is a popular method that is used to enhance the speech in presence of back ground noise. Its main advantage Available Online@ 179

comes from its minimal complexity and low computational load. In this method the speech spectrum enhanced by subtracting an average noise spectrum from the noisy speech spectrum. Here it is assumed that, noise is uncorrelated and additive to the noise. II. Spectral Subtraction Method It is based on the basic principal of restoring the signal by subtracting an estimate of the noise spectrum from the noisy speech spectrum. The noise spectrum can be estimated from the pauses and quite periods in the speech signal. When there is no speech being said and only noise is present. Basic assumption is noise is additive. Spectral subtraction of the signal takes place in the frequency domain. The data from the signal are segmented and windowed by using Hamming window followed by FFT to transform the signal from time domain to frequency domain. Let p(k) and n(k) be represented by a windowed speech signal and noise signal respectively. The sum of the two is then denoted by x(k), x(k) = p(k) + n(k). (1) Where x(k), p(k) and n(k) are the noisy signal, the original signal and the noise respectively. By applying Fourier transform on both sides of equation (1). X(e jw )=P( e jw ) + N (e jw ) (2) X w (w) 2 = P w (w) 2 + N w (w) 2 + P w (w).n w (w) * + P w (w) *.N w (w) (5) If we assume the noise component n(k) is un- correlated to p(n). Then the terms P w (w).n w (w) * and P w (w) *.N w (w) are reduced to zero. From the above based assumptions the estimation of clean speech can be estimated as follows P w (w) 2 = X w (w) 2 - N w (w) 2 (6) A more general form can be derived by generalizing the exponent from 2 to b. P w (w) b = X w (w) b - α N w (w) b (7) Where b represents the power exponent, for magnitude spectral subtraction, the exponent b=1, and for power spectral subtraction, b=2. The parameter α controls the amount of noise subtracted from the noisy signal. For full noise subtraction α=1, and for over subtraction α>1. III. Algorithm The overview of the algorithm presented in Fig. 1. The input signals are digitized with a sampling rate of 8 KHz. The analysis window is a 128 point Hamming window and the overlap between two successive windows is set to be 50%. In order to avoid wrap around errors each frame is zero padded to 256 point. The frequency transformation of the signal is done by using FFT. Where X(e jw ), P(e jw ) and N(e jw ) are the fourier transforms of the noisy signal, the original signal and the additive noise. Further the incoming signal is x(k) is divided into segments of length N. and each segment is windowed by using a hamming window and then transformation is performed via FFT. The window signal is given as follows x w (m) = w(m) x(m) = w (m) [p(m) + n(m)] =p w (m) + n w (m) (3) The frequency domain representation for the windowed operation can be represented as X w (f)=p w (f) + N w (f) (4) In the frequency domain, with their respective Fourier Transforms, The power spectrum of the noisy signal is given by Fig. 1: Block Diagram of Spectral Subtraction Method Available Online@ 180

IV. Spectral Subtraction Via Weiner Filtration Weiner filtering was first proposed by Norbert Weiner in 1949. This type of filter generally is used to estimate or to predict the signal in presence of noise. For the implementation of the Weiner filter requires the power spectra of the signal and the noise process. The frequency domain representation of the noisy signal, the original signal and the additive white noise is represented by equation (5). The formulation for the Weiner filter is as follows. W ( f ) P XX Pxx ( f ) ( f ) P yy ( f ) (8) Where P xx (f) and Pyy(f) represents the estimated power spectra of the noise free signal and the background noise. Which are assumed to be uncorrelated and stationary. After calculating the transfer function W(f). Dividing equation (8) by P yy (f). SNR( f ) ( f ) SNR( f ) 1 W (9) (car noise, train noise and babble noise) with different signal to noise ratio (10, 5, 0dB). The amount of noise reduction is generally measured with the SNR improvement. Fig. 3. Clean Speech signal (blue), Clean speech signal added with 5 db AWGN (red), and filtered signal (green). Fig. 4. Speech signal with car noise of 0 db (red) and filtered signal (Green) Fig. 2 shows the block diagram of Weiner filter V. Results and Discussions For the evaluation of performance of spectral subtraction and wiener filter, a time spectrum of several utterances in presence of noise is analyzed and to show the improvement of the noisy speech signal, to conduct experiments we have used speech signals constituted by sentences pronounced in English language by male and female speakers. The speech signals are sampled with 8kHz and are corrupted by three types of additive noise Available Online@ 181

Fig. 5. Speech signal with train noise of 0 db (red) and Fig. 8. Speech signal with car noise of 5 db (red) and Fig. 6. Speech signals with babble noise of 0 db (red) and Fig. 9. Speech signal with car noise of 5 db (red) and Figure [3] to [6] shows the results of spectral subtraction working at SNR of 0 db for noisy speech signal, car noise, train noise and babble noise. From the figures it can be seen that the filter does indeed remove the noise. It shows the magnitude of the speech against time. Figure [7] to [9] shows the results from the wiener filter implementation. From these figures it is clear that the noisy signal has been filtered and up to certain extent noise has been removed. Fig. 7. Speech signal with car noise of 5 db (red) and VI. TESTING The testing method involves the listening of the filtered speech with listeners and examining the results of the filtered signal. The testing results appear to be relatively close in quality of filtered speech for both the methods. Available Online@ 182

CONCLUSION In this paper, the speech enhancement method by using spectral subtraction and wiener filter had been implemented and analyzed. The methods improve the speech quality by increasing the signal to noise ratio. This method provides a definite improvement compared to other traditional speech enhancement methods. The results from both simulation and evaluation suggest that, this method achieves better reduction of the noise for different noisy signals such as car noise, train noise and babble noise. REFERENCES 1. S. Boll, Suppression of acoustic noise in speech using spectral subtraction, IEEE Transactions on Acoustics, Speech, Signal Processing vol.27, pp. 113-120, Apr. 1979. 2.Y. Ephraim and D. Malah, Speech enhancement using a minimum mean-square error short-term spectral amplitude estimator, IEEE Transactions on Acoustics, Speech, Signal Processing vol.assp-32, No.6, pp. 1109-1121, Dec.1984. 3. Y. M. Cheng and D. O'Shaughnessy, Speech enhancement based conceptually on auditory evidence," IEEE Trans. Signal Processing, vol. 39, Sept. 1991. 4. Y. Ephraim and H. L. Van Trees, A signal subspace approach for speech enhancement," IEEE Trans. Speech and Audio Processing, vol. 3, July 1995. 5. Scalart, P., and Filho, J. Speech enhancement based on a priori signal to noise estimation, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 629 632. 1996. 6. Md. Kamrul Hasan, Sayeef Salahuddin, and M. Rezwan Khan, A Modified A Priori SNR for Speech Enhancement Using Spectral Subtraction Rules, IEEE Signal Processing Letters, vol. 11, no. 4, April 2004. 7. B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan, A parametric formulation of the generalized spectral subtraction method, IEEE Trans. Speech Audio Processing, vol. 6, pp. 328 337, July Available Online@ 183