Speech Enhancement in a Noisy Environment Using Sub-Band Processing

Similar documents
Ultra Low-Power Noise Reduction Strategies Using a Configurable Weighted Overlap-Add Coprocessor

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Mel Spectrum Analysis of Speech Recognition using Single Microphone

EE482: Digital Signal Processing Applications

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Online Version Only. Book made by this file is ILLEGAL. 2. Mathematical Description

Audio Restoration Based on DSP Tools

M.Tech Student, Asst Professor Department Of Eelectronics and Communications, SRKR Engineering College, Andhra Pradesh, India

Study of Different Adaptive Filter Algorithms for Noise Cancellation in Real-Time Environment

Speech Enhancement using Wiener filtering

Removal of High Density Salt and Pepper Noise through Modified Decision based Un Symmetric Trimmed Median Filter

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

FPGA implementation of DWT for Audio Watermarking Application

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Comparative Performance Analysis of Speech Enhancement Methods

Performance Analysis of gradient decent adaptive filters for noise cancellation in Signal Processing

Single channel noise reduction

CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS

Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech

C. Efficient Removal Of Impulse Noise In [7], a method used to remove the impulse noise (ERIN) is based on simple fuzzy impulse detection technique.

Hardware/Software Co-Simulation of BPSK Modulator and Demodulator using Xilinx System Generator

Introduction of Audio and Music

ECORFAN Journal-Mexico COMPUTING August 2013 Vol.4 No Study of linear behavior algorithms with fundamental tone detection

Frequency Domain Implementation of Advanced Speech Enhancement System on TMS320C6713DSK

Enhancement of Speech in Noisy Conditions

Experiment 6: Multirate Signal Processing

Denoising Of Speech Signal By Classification Into Voiced, Unvoiced And Silence Region

Efficient Sub-band Coder Implementation for Portable Low-power Applications

Overview of Code Excited Linear Predictive Coder

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM

Discrete Fourier Transform (DFT)

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Design and Testing of DWT based Image Fusion System using MATLAB Simulink

Auditory modelling for speech processing in the perceptual domain

CHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS

AhoTransf: A tool for Multiband Excitation based speech analysis and modification

Study the Behavioral Change in Adaptive Beamforming of Smart Antenna Array Using LMS and RLS Algorithms

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

Interband Alias-Free Subband Adaptive Filtering with Critical Sampling

AN EFFICIENT ALGORITHM FOR THE REMOVAL OF IMPULSE NOISE IN IMAGES USING BLACKFIN PROCESSOR

Carrier Frequency Offset Estimation in WCDMA Systems Using a Modified FFT-Based Algorithm

THE USE OF THE ADAPTIVE NOISE CANCELLATION FOR VOICE COMMUNICATION WITH THE CONTROL SYSTEM

Optimized BPSK and QAM Techniques for OFDM Systems

Noise Simulation and Reduction in AM-SSB Radio Systems Thiago D. Olson, Zi Ling, and Mohammad Naquiddin A. Razak

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

Linguistic Phonetics. Spectral Analysis

Integrated Speech Enhancement Technique for Hands-Free Mobile Phones

Reduction in sidelobe and SNR improves by using Digital Pulse Compression Technique

Ensemble Empirical Mode Decomposition: An adaptive method for noise reduction

Hardware/Software Co-Simulation of BPSK Modulator Using Xilinx System Generator

Comparative Study on DWT-OFDM and FFT- OFDM Simulation Using Matlab Simulink

Chapter 3. Speech Enhancement and Detection Techniques: Transform Domain

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

REAL-TIME BROADBAND NOISE REDUCTION

Innovative Communications Experiments Using an Integrated Design Laboratory

Image Denoising Using Statistical and Non Statistical Method

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Chapter 7. Frequency-Domain Representations 语音信号的频域表征

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

Available online at ScienceDirect. Anugerah Firdauzi*, Kiki Wirianto, Muhammad Arijal, Trio Adiono

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

Enhancement of Speech Communication Technology Performance Using Adaptive-Control Factor Based Spectral Subtraction Method

Speech Enhancement Based On Noise Reduction

Thumbnail Images Using Resampling Method

Analysis of Wavelet Denoising with Different Types of Noises

Multi Modulus Blind Equalizations for Quadrature Amplitude Modulation

Wavelet Speech Enhancement based on the Teager Energy Operator

Comparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement

RECENTLY, there has been an increasing interest in noisy

Analysis of LMS Algorithm in Wavelet Domain

Interpolation of CFA Color Images with Hybrid Image Denoising

Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation

Low Power R4SDC Pipelined FFT Processor Architecture

Image Denoising Using Complex Framelets

Vector Arithmetic Logic Unit Amit Kumar Dutta JIS College of Engineering, Kalyani, WB, India

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System

Chapter 4 SPEECH ENHANCEMENT

International Journal of Advanced Research in Computer Science and Software Engineering

Computer Science and Engineering

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

Modulation Domain Spectral Subtraction for Speech Enhancement

Communications Theory and Engineering

TRANSFORMS / WAVELETS

COMPARISON OF CHANNEL ESTIMATION AND EQUALIZATION TECHNIQUES FOR OFDM SYSTEMS

B.Tech III Year II Semester (R13) Regular & Supplementary Examinations May/June 2017 DIGITAL SIGNAL PROCESSING (Common to ECE and EIE)

FPGA IMPLEMENTATION OF RSEPD TECHNIQUE BASED IMPULSE NOISE REMOVAL

e-issn: p-issn: X Page 145

Performance Evaluation of Nonlinear Equalizer based on Multilayer Perceptron for OFDM Power- Line Communication

An Efficient Design of Parallel Pipelined FFT Architecture

Analysis of Interference & BER with Simulation Concept for MC-CDMA

Speech Enhancement in Noisy Environment using Kalman Filter

Transcription:

IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) ISSN: 239-42, ISBN No. : 239-497 Volume, Issue 2 (Nov. - Dec. 22), PP 47-52 Speech Enhancement in a Noisy Environment Using Sub-Band Processing K. Sravanthi, M. S. Anuradha 2 (M.Tech 2/2, ECE dept, Andhra University, INDIA) 2 (Sr. Asst. professor, ECE dept, Andhra University, INDIA) Abstract: One of the methods for recovering a signal corrupted by additive noise using Weiner filtering approach. This paper mainly deals with speech enhancement/recovering using Weiner filtering technique. It uses the Weighted Overlap Add filter bank implemented using digital signal processing techniques. The spectral enhancement is achieved by Voice Activity Detector in conjunction with Weiner filtering, and the results were discussed for different noise floor levels for a stationary recorded input signal. Keywords: speech enhancement, WOLA filter bank, VAD, weiner filtering. I. Introduction Speech enhancement is most important in speech signal processing. Several techniques have been proposed for this purpose. Weiner filtering is one of the speech processing methods which involve linear estimation of a desired signal sequence from previous sequence [5]. The main approach is to minimize the mean square error signal that is defined as the difference between desired response and the actual filter output [2]. WOLA (Weighted Overlap and Add) filter bank approach is utilized to obtain the objective which is shown in the below fig. It decomposes the original noisy speech signal into different frequency sub-bands [3]. Filter bank processing can be considered as a divide-and-conquer approach within signal processing, since larger problems are sub-divided into many smaller problems. As the original signal is decomposed into sub bands it achieves faster convergence rate and low mean square error. In speech processing systems usually the voice activity detector (VAD) is used, which identifies the time intervals with speech signal and those with only background noise. The outcome from the VAD allows discarding the silent frames from further processing, as they carry no speech information, therefore improving the performance of the rest of the system [4]. FIG: simplified block diagram for speech enhancement. II. Theoretical Aspect The WOLA (Weighted Overlap and Add) filter bank is an efficient method used to implement a uniformly distributed multi-channel filter bank. It decomposes the input signal into a series of separate frequency bands by this it minimizes the overlap between adjacent bands. This results in a pure tone input to the analysis filter bank. For WOLA filter bank with K/2 frequency bands and band outputs are decimated by K/OS (OS=2 was used above). []. The following three stages are represented in the following fig: 2 There are three main stages:. analysis filter bank 2. Weiner filtering 3. synthesis filter bank 47 Page

Speech Enhancement In A Noisy Environment Using Sub-Band Processing ANALYSIS FILTER BANK The operations are: Read D input block samples Shift the block samples into the input FIFO buffer u[n] Applying window and time-fold to K samples Applying circular shift by K/2 samples in order to produce a zero-phase signal for the FFT. Take K-point FFT (even or odd) Applying channel gains to (complex) frequency data III. Weiner Filtering The Wiener filtering is optimal in terms of the mean square error. It minimizes the overall mean square error in the process. They operate on the past and present of the input signal. A Wiener filter used is a linear filter.the linear filter assumes that the inputs are stationary. The goal of the Wiener filter is to filter out noise that has corrupted a signal. The received sub band signals comprise a mixture of clean speech plus degrading noise, i.e., xk[n] = sk[n] + vk[n], () Where sk[n] denotes clean sub band speech and vk[n] denote sub band noise. Here, the filter bank analysis stage produces a set of sub-band signals s k [n] from a time signal s[n]. The subband signals are subject to sub-band processors, denoted as gk, to compute sub-band output signals y k [n] = gk {x k [n]}. The sub-band processing implemented in this paper is Adaptive Wiener Filtering. The Wiener filter provides an optimal estimate of a signal embedded in additive noise by minimizing the Mean Squares Error (MSE), Where E { } is the expectation operator. The Wiener filtering subtracts the noise signal spectrum estimate from the input signal spectrum to acquire an approximate of the original speech signal spectrum. The speech signal which is approximated using wiener filtering is obtained by multiplying the input signal with the obtained gain in wiener filtering as shown below, Y=G*X (3) IV. Synthesis Filter Bank The Inverse WOLA (IWOLA) performs the opposite function to the WOLA filter bank. It takes the K channels, and re-synthesizes these to form a single time-domain sequence. The operations are Take inverse FFT(IFFT) of (complex) input (even or odd) Applying circular shift by K/2 samples, to counter-act the circular shift used in the analysis stage Applying synthesis window Accumulate into output FIFO, shift out L samples Where: D is the input block size L is the input window size K is the FFT size DF is the decimation factor OS is the over sampling factor. 48 Page

Speech Enhancement In A Noisy Environment Using Sub-Band Processing Fig: 2 block representation of WOLA filter bank III. Matlab Implementation The input signal which is used for the speech processing in this work is recorded through a mobile phone in the.amr format and this is converted into.wav file. The duration of the signal considered is seconds with 8 khz sampling frequency with a message containing the information Hello.. Hello. The VAD data is calculated with this recorded signal.vad= indicates when there is a speech signal and VAD decision = indicates where there is a noise is represented in fig.the SNR threshold is chosen to be for VAD decision. After Weiner filtering the noisy speech signal and the output signal are presented in fig4. Where: the input block size D =6 the input window size L=28 the FFT size K =32 the decimation factor DF = the over sampling factor OS =2.5 VAD decision (=speech, =noise).8.6.4.2.5 -.2 -.4 -.6 -.8 -.5 2 3 4 5 6 7 8 9 Fig: 3 input noisy speech signal and the VAD data. - 2 3 4 5 6 7 8 9 Fig: 4 speech signal after Weiner filtering without noise. 49 Page

Speech Enhancement In A Noisy Environment Using Sub-Band Processing The additional noise is added to the original recorded input signal with different noise floors and the results are presented in the following figures. Case-:.5 original speech signal noisy speech signal.8.6.4.2.5 -.2 -.4 -.6 -.5 -.8-2 3 4 5 6 7 8-2 3 4 5 6 7 8 x 4 x 4 Fig 5: original speech signal fig:6 noisy speech signal.5 VAD decision (=speech, =noise).8.6.4.2.5 -.2 -.4 -.6 -.8 -.5 2 3 4 5 6 7 8 9 Fig: 7 input noisy speech signal and the VAD data - 2 3 4 5 6 7 8 9 fig: 8 speech signal after Weiner filtering without noise Case 2:.5 original speech signal noisy speech signal.8.6.4.5.2 -.2 -.5 -.4 -.6 - -.8-2 3 4 5 6 7 8 x 4 -.5 2 3 4 5 6 7 8 x 4 Fig 9: original speech signal fig: noisy speech signal 5 Page

Speech Enhancement In A Noisy Environment Using Sub-Band Processing.5 VAD decision (=speech, =noise).8.6.4.2.5 -.2 -.4 -.6 -.8 -.5 2 3 4 5 6 7 8 9 Fig: input noisy speech signal and the VAD data - 2 3 4 5 6 7 8 9 fig: 2 speech signal after Weiner filtering without noise Case 3:.5 original speech signal noisy speech signal.8.6.4.5.2 -.2 -.5 -.4 -.6 - -.8-2 3 4 5 6 7 8 -.5 2 3 4 5 6 7 8 x 4 x 4 Fig: 3original speech signal fig: 4 noisy speech signal.5 VAD decision (=speech, =noise).8.6.5.4.2 -.5 -.2 -.4 - -.6 -.8 -.5 2 3 4 5 6 7 8 9 Fig: 5 input noisy speech signal and the VAD data - 2 3 4 5 6 7 8 9 fig: 6 speech signal after Weiner filtering without noise 5 Page

Speech Enhancement In A Noisy Environment Using Sub-Band Processing IV. Conclusion This paper discusses the Noise suppression in speech signal using WOLA filter bank to divide the signal into sub-bands. WOLA filter bank is implemented using Hamming windowing technique and the subband processing improves better efficiency under different noise conditions. Noise suppression is achieved through Voice Activity Detector and Weiner filter. This paper is limited to the stationary signals.for real time applications adaptive Weiner filtering approach may be applied and the efficiency can be improved with increased sampling rates and also by selecting different windowing techniques for sub-band processing. Reference [] A flexible filter bank structure for extensive signal manipulations in digital hearing aids by Robert Brennan Todd Schneider dspfactory, Waterloo Ontario, Canada, N2J 4V [2] Technion { Israel Institute of Technology, Department of Electrical Engineering Estimation and Identi cation in Dynamical Systems (48825) Lecture Notes, Fall 29, Prof. N. Shimkin [3] Wideband High-Resolution Filterbank Firmware: Acceptance Testing and Excision Filtering K. L. Harman and C. Potter [4] Modified LTSE-VAD algorithm for applications requiring reduced silence frame misclassification by Iker Luengo, Eva Navas, Igor Odriozola, Ibon Saratxaga, Inmaculada Hernaez, I naki Sainz, Daniel Erro [5] Speech Signal Processing School of Electronic Information, Wuhan University. BOOK [6] Adaptive Filtering Primer with MATLAB - Poularikas and Ramadan. 52 Page