Dual-Microphone Speech Dereverberation using a Reference Signal Habets, E.A.P.; Gannot, S.

Similar documents
Dual-Microphone Speech Dereverberation in a Noisy Environment

Joint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W.

Recent Advances in Acoustic Signal Extraction and Dereverberation

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

546 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 4, MAY /$ IEEE

Published in: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS

Clustered Multi-channel Dereverberation for Ad-hoc Microphone Arrays

MULTICHANNEL AUDIO DATABASE IN VARIOUS ACOUSTIC ENVIRONMENTS

Non resonant slots for wide band 1D scanning arrays

LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function

Calibration of current-steering D/A Converters

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2

Single-channel late reverberation power spectral density estimation using denoising autoencoders

NOISE ESTIMATION IN A SINGLE CHANNEL

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi,

Leaky-wave slot array antenna fed by a dual reflector system Ettorre, M.; Neto, A.; Gerini, G.; Maci, S.

Directional Sensing for Online PD Monitoring of MV Cables Wagenaars, P.; van der Wielen, P.C.J.M.; Wouters, P.A.A.F.; Steennis, E.F.

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

IN REVERBERANT and noisy environments, multi-channel

arxiv: v1 [cs.sd] 4 Dec 2018

/$ IEEE

Different Approaches of Spectral Subtraction Method for Speech Enhancement

DISTANT or hands-free audio acquisition is required in

SINGLE CHANNEL REVERBERATION SUPPRESSION BASED ON SPARSE LINEAR PREDICTION

A BROADBAND BEAMFORMER USING CONTROLLABLE CONSTRAINTS AND MINIMUM VARIANCE

Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

A generalized framework for binaural spectral subtraction dereverberation

DBR based passively mode-locked 1.5m semiconductor laser with 9 nm tuning range Moskalenko, V.; Williams, K.A.; Bente, E.A.J.M.

Microphone Array Feedback Suppression. for Indoor Room Acoustics

Characteristic mode based pattern reconfigurable antenna for mobile handset

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 5, MAY

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

REVERB Workshop 2014 A COMPUTATIONALLY RESTRAINED AND SINGLE-CHANNEL BLIND DEREVERBERATION METHOD UTILIZING ITERATIVE SPECTRAL MODIFICATIONS Kazunobu

Chapter 4 SPEECH ENHANCEMENT

Visualization of Compact Microphone Array Room Impulse Responses

Multiple Sound Sources Localization Using Energetic Analysis Method

Automotive three-microphone voice activity detector and noise-canceller

Single-Microphone Speech Dereverberation based on Multiple-Step Linear Predictive Inverse Filtering and Spectral Subtraction

Single channel noise reduction

Speech Enhancement Using Microphone Arrays

Voltage dip detection with half cycle window RMS values and aggregation of short events Qin, Y.; Ye, G.; Cuk, V.; Cobben, J.F.G.

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

A novel output transformer based highly linear RF-DAC architecture Bechthum, E.; Radulov, G.I.; Briaire, J.; Geelen, G.; van Roermund, A.H.M.

GROUP SPARSITY FOR MIMO SPEECH DEREVERBERATION. and the Cluster of Excellence Hearing4All, Oldenburg, Germany.

SUBJECTIVE SPEECH QUALITY AND SPEECH INTELLIGIBILITY EVALUATION OF SINGLE-CHANNEL DEREVERBERATION ALGORITHMS

Planar circularly symmetric EBG's to improve the isolation of array elements Llombart, N.; Neto, A.; Gerini, G.; de Maagt, P.J.I.

Speech Enhancement Based On Noise Reduction

Two octaves bandwidth passive balun for the eleven feed for reflector antennas Zamanifekri, A.; Yang, J.

Impact of the size of the hearing aid on the mobile phone near fields Bonev, Ivan Bonev; Franek, Ondrej; Pedersen, Gert F.

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER

On Single-Channel Speech Enhancement and On Non-Linear Modulation-Domain Kalman Filtering

Speech Signal Enhancement Techniques

Microphone Array Power Ratio for Speech Quality Assessment in Noisy Reverberant Environments 1

Speech Quality Assessment for Listening-Room Compensation

On-chip antenna integration for single-chip millimeterwave FMCW radars Adela, B.B.; Pual, P.T.M; Smolders, A.B.

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Residual noise Control for Coherence Based Dual Microphone Speech Enhancement

260 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 2, FEBRUARY /$ IEEE

REAL-TIME BROADBAND NOISE REDUCTION

Real-time Adaptive Concepts in Acoustics

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

MULTICHANNEL systems are often used for

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events

Enhancement of Speech in Noisy Conditions

ROBUST echo cancellation requires a method for adjusting

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR

Spectral Methods for Single and Multi Channel Speech Enhancement in Multi Source Environment

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 50, NO. 12, DECEMBER

Microphone Array Design and Beamforming

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

Low frequency sound reproduction in irregular rooms using CABS (Control Acoustic Bass System) Celestinos, Adrian; Nielsen, Sofus Birkedal

Enhancement of Speech Communication Technology Performance Using Adaptive-Control Factor Based Spectral Subtraction Method

Acoustic Echo Cancellation: Dual Architecture Implementation

Speech Enhancement Using Robust Generalized Sidelobe Canceller with Multi-Channel Post-Filtering in Adverse Environments

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Speech Enhancement for Nonstationary Noise Environments

A Computational Efficient Method for Assuring Full Duplex Feeling in Hands-free Communication

A Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation

Reducing comb filtering on different musical instruments using time delay estimation

Boldt, Jesper Bünsow; Kjems, Ulrik; Pedersen, Michael Syskind; Lunner, Thomas; Wang, DeLiang

ESTIMATION OF TIME-VARYING ROOM IMPULSE RESPONSES OF MULTIPLE SOUND SOURCES FROM OBSERVED MIXTURE AND ISOLATED SOURCE SIGNALS

Psychoacoustic Cues in Room Size Perception

Frequency Domain Implementation of Advanced Speech Enhancement System on TMS320C6713DSK

Nonlinear postprocessing for blind speech separation

RIR Estimation for Synthetic Data Acquisition

Published in: Proceedings of the 20th Annual Symposium of the IEEE Photonics Benelux Chapter, November 2015, Brussels, Belgium

EE482: Digital Signal Processing Applications

NOISE REDUCTION IN DUAL-MICROPHONE MOBILE PHONES USING A BANK OF PRE-MEASURED TARGET-CANCELLATION FILTERS. P.O.Box 18, Prague 8, Czech Republic

Transcription:

DualMicrophone Speech Dereverberation using a Reference Signal Habets, E.A.P.; Gannot, S. Published in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 7) 15 April 7, Honolulu, Hawaii, ISA DOI: 1.119/ICASSP.7.36716 Published: 1/1/7 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and volume numbers) Please check the document version of this publication: A submitted manuscript is the author's version of the article upon submission and before peerreview. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. The final author version and the galley proof are versions of the publication after peer review. The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication Citation for published version (APA): Habets, E. A. P., & Gannot, S. (7). DualMicrophone Speech Dereverberation using a Reference Signal. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 7) 15 April 7, Honolulu, Hawaii, ISA (pp. 9194). DOI: 1.119/ICASSP.7.36716 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profitmaking activity or commercial gain You may freely distribute the URL identifying the publication in the public portal? Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Download date: 16. Nov. 18

DUALMICROPHONE SPEECH DEREVERBERATION USING A REFERENCE SIGNAL EAR Habets, Student member, IEEE S. Gannot, Senior Member, IEEE Department of Electrical Engineering Technische Universiteit Eindhoven Eindhoven, The Netherlands Email: e. habets@ieee.org School of Engineering BarIlan University RamatGan, Israel Email: gannot@eng.biu.ac.il ABSTRACT Speech signals recorded with a distant microphone usually contain reverberation, which degrades the fidelity and intelligibility of speech, and the recognition performance of automatic speech recognition systems. In this paper we propose a speech dereverberation system which uses two microphones. A Generalized Sidelobe Canceller (GSC) type of structure is used to enhance the desired speech signal. The GSC structure is used to create two signals. The first signal is the output of a standard delay and sum beamformer, and the second signal is a reference signal which is constructed such that the direct speech signal is blocked. We propose to utilize the reverberation which is present in the reference signal to enhance the output of the delay and sum beamformer. The power envelope of the reference signal and the power envelope of the output of the delay and sum beamformer are used to estimate the residual reverberation in the output of the delay and sum beamformer. The output of the delay and sum beamformer is then enhanced using a spectral enhancement technique. The proposed method only requires an estimate of the direction of arrival of the desired speech source. Experiments using simulated room impulse responses are presented and show significant reverberation reduction while keeping the speech distortion low. Index Terms Speech dereverberation, speech enhancement. 1. INTRODUCTION Acoustic signals radiated within a room are linearly distorted by reflections from walls and other objects. These distortions degrade the fidelity and intelligibility of speech, and the recognition performance of automatic speech recognition systems. Early reflections mainly contribute to coloration, or spectral distortion, while late reflections, or late reverberation, contribute noiselike perceptions or tails to speech signals [1]. Spectral coloration and late reverberation cause users of hearing aids to complain of being unable to distinguish one voice from another in a crowded room. One of the reasons why reverberation degrades speech intelligibility is the effect of overlapmasking, in which segments of an acoustic signal are affected by reverberation components of previous segments. In this paper we have investigated the application of signal processing techniques to improve the quality of speech recorded in an acoustic environment. Dereverberation algorithms can be divided into two classes. The classification depends on whether the Room Impulse Responses (RIRs) need to be known or estimated beforehand. Until now blind Thanks to the Dutch Technology Foundation STW (project EEL 491), applied science division of NWO, for funding. 1444781/7/$. 7 IEEE estimation of the RIRs, in a practical scenario, remains an unsolved but challenging problem []. Even if the RIRs could be estimated, the inversion and tracking would be very difficult. While these techniques try to recover the anechoic speech signal we like to suppress the tail of the RIR by means of spectral enhancement. In the last decade many speech enhancement solutions have been proposed which do not require an estimate ofthe RIR. For example algorithms based on processing of the linear prediction (LP) residual signal [3, 4]. Other algorithms are based on spectral enhancement techniques and utilize a statistical reverberation model [5, 6, 7]. The later algorithms do not require detailed knowledge on the RIR structure, but require some a priori information about room characteristics, for example the reverberation time. In this paper we propose a dualmicrophone speech dereverberation system. A Generalized Sidelobe Canceller (GSC) [8] type of structure is used to enhance the desired speech signal. The GSC structure is used to create two signals. The first signal is the output of a standard delay and sum beamformer, and the second signal is a reference signal which is constructed such that the direct speech signal is blocked. We propose a novel method which utilizes the reverberation present in the reference signal to enhance the output of the delay and sum beamformer. The power envelope of the reference signal and the power envelope of the output of the delay and sum beamformer are used to estimate the residual reverberation in the output of the delay and sum beamformer. The signal is then enhanced using a spectral enhancement technique. An advantage of the proposed method is that it requires a minimum amount of a priori knowledge, since we only require an estimate of the Direction of Arrival (DOA) of the desired speech source. The outline of this paper is as follows, in Section the problem is described. In Section 3 we describe the proposed dereverberation algorithm. Evaluation using simulated RIRs are presented in Section 4. Discussion and conclusions can be found in Section 5.. PROBLEM STATEMENT The mth microphone signal (m C {1, }) is denoted by zm(n) and results from the convolution of the anechoic speech signal s(n) and the RIR between the source and the corresponding microphone. The RIR between the source and the mth microphone, at time n, is modelled as a finite impulse response of length L, and is denoted by am(n) [am,o(n),..., am,l 1(n)]T. The RIR is divided into two parts such that am,j (n) ad j (n) + a (n) (1) where j is the coefficient index, ad (n) consists of the direct path, and a' (n) consists of all echoes. In the sequel we assume that the IV 91 ICASSP 7 Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on March 9,1 at 3:4: EDT from IEEE Xplore. Restrictions apply.

Zi (k l) Q (k,l1) 3. PROPOSED METHOD Post Filter D (k,l1) (k, 1) ~~~~~~~~~~~G In this section we show how the residual reverberant energy can be estimated using the reference signal. Additionally, we design a post filter which uses this estimate to enhance the speech signal. Ar(k, I) U(k,)REE1) Z(k, 1) 3.1. Reverberant Energy Estimator Fig. 1. Dual Microphone Speech Dereverberation System (REE: Reverberant Energy Estimator). microphone array is steered towards the desired source using an estimate of the DOA of the direct signal, i.e., the direct speech signals in zi (n) and Z (n) are timealigned. The mth microphone signal is given by zm (n) am (n)) s(n) + (a' (n)) s(n) ' () First the power envelopes of the output of the delay and sum beamformer Q(k, 1) and the reference signal U(k, 1) are recursively estimated, using and Aq(k, 1) 3Aq(k, 1 1) + (1 ) Q(k, 1), (7) A,(k, 1) (8) 3Azu(k, 1 1) + (1 3) U(k, 1), respectively, where 3 ( < 3 < 1) is the forgetting factor. Let us assume that the estimated residual reverberant energy in frequency bin k at frame 1 can be estimated using W (k) A,,, (k, 1 I d,m(n) rm (n) where s(n) [s(n),... s(n L + 1)]T, dm(n) is the desired (direct) speech component, and rm (n) denotes the reverberant component which contains all reflections. Using the ShortTime Fourier Transform (STFT), we have in the timefrequency domain (3) Zm(k, I) Dm(k, 1) + Rm(k, 1) Vm E {1, }, where k represents the frequency bin index, and 1 the frame index. Figure 1 shows the proposed dualmicrophone speech dereverberation system. The timefrequency signal Q(k, 1) is the output of a delay and sum beamformer (in this case with zero delay), i.e., D(k, 1) + Rq(k, 1), where D(k, 1) denotes the direct speech, and Rq(k, 1) (RI (k, 1)+ R (k, 1)) denotes the residual reverberation ofthe speech in Q(k, 1). The reference signal U(k, 1) is constructed using the difference between the two microphone signals, i.e., U (k, 1) 1 (Zi (k, 1) Z (k, 1)) (4) In case there are no steering errors the direct signal is perfectly blocked, i.e., DI (k, 1) D(k, 1), such that U (k, 1) (RI (k, 1) R (k, 1)).(5) We can now see that U(k, 1) contains the (spatially filtered) reverberation. Note that the exact relation between U (k, 1) and Rq (k, 1) is very complex due to the spatial filtering, e.g., for low frequencies the delay and sum beamformer is omnidirectional, while the null beamformer, which is used to create the reference signal, will not only suppress the direct signal but also some reflections. However, using the statistical reverberation model used in [7] it can be shown that for frequencies above the Schroeder frequency S{ U(k, 1) 1 } S{ Rq(k, 1) }, where S{ } denotes the mathematical expectation. The spectral speech component D(k, 1) is obtained by applying a frame and frequency dependent spectral gain function G(k, 1) (see Section 3) to the spectral component Q(k, 1), i.e., D(k, l) (6) G(k, l) Q(k, 1). The dereverberated speech signal d(n) can be obtained using the A) I (9) where W(k) is a frequency dependent constant. The parameter A can be used to control the end point ofthe uncompensated part ofthe residual reverberation, e.g., by increasing A one can reduce only late reflections while leaving the early reflections intact. The end point is measured with respect to the arrival time of the direct speech signal. Note that A is a positive integer value. The time related to A is given by 'F, where fs denotes the sampling frequency and F denotes the frame rate in samples of the STFT. The frame rate F depends on the window length and the overlap of the STFT. We now define an error signal Ae(k, 1) as, Q(k, 1) 1 (Z1(k, 1) + Z (k, 1)) A q (k, 1) k (k, 1). (1) An adaptive algorithm is used to minimize the following quadratic cost function J such that W(k, 1 + 1) (Ae (k, l)), (1 1) W(k, 1) VJw, (1) where,u denotes the stepsize parameter, and VJw denotes the gradient with respect to W(k, 1), which is given by VJw Ae(k, l)a,u(k, Note that and 1. A). (13) Ae(k, 1) and A,(k, 1) are real and positive values for all k 3.. Post Filter Many spectral enhancement techniques are described in the literature. Spectral subtraction methods are the most widely used due to the simplicity of implementation and the low computational load, which makes them the primary choice for realtime applications. A common feature of this technique is that the interference reduction process can be related to the estimation of a shorttime spectral attenuation factor [9]. Since the spectral components are assumed to be statistically independent, this factor is adjusted individually as a function ofthe relative local aposteriori Signal to Interference Ratio (SIR) on each frequency. The a posteriori SIR is defined as inverse STFT and the weighted overlapadd method. ( k, l1) <A Q(k, 1) (14) IV 9 Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on March 9,1 at 3:4: EDT from IEEE Xplore. Restrictions apply.

Using informal listening tests we concluded that magnitude subtraction gives very good performance. The gain function related to the magnitude subtraction is given by [9] (15) s where Gmin is a lowerbound constraint for the spectral gain function which allows us to control the maximum amount of reverberation that is reduced. In the following experiments Gmin was set to.1, which corresponds to maximum attenuation of db. Irl G(k, 1) max { 1 'y(k igmin 1 N > W: ( C:u g. o 4. EVALUATION Proposed A 4( In this section we present evaluation results that were obtained using synthetically reverberated signals. One speech fragment which consists of a female voice of seconds and a male voice of seconds, sampled at 8 khz, was used in all experiments. The synthetic RIRs were generated using the image method [1], and the reflection coefficients were set such that the reverberation time, denoted by T6 was equal to approximately, 4 and 6 ms. Experiments were conducted using different distances between the source and the center of the array, denoted by d, ranging from 1 to m. The distance between the two microphones was set to 1 cm. The analysis window of the STFT was a 56 point Hamming window, and the overlap between two successive frames is set to 75%. Each frame is zero padded with 56 points to avoid wrap around errors. The forgetting factor 3 in (7) and (8) was set to.9, and the stepsize /, in (1) was set to.. We used the segmental Signal to Interference Ratio (), Bark Spectral Distortion (), and a recently proposed evaluation measure developed by Wen and Naylor called the Reverberation Decay Tail () [11] to evaluate the proposed algorithm. The jointly characterizes the relative energy in the tail of the RIR and the rate of decay. In [1] the measure was tested using three dereverberation methods, the results were compared to the subjective amount of reverberation indicated by 6 normal hearing subjects. The results showed a strong correlation between the values and the amount of reverberation perceived by the subjects. Note that higher values correspond to a higher amount of relative energy in the tail and/or a slower decay rate. The (properly delayed) anechoic speech signal was used as a reference signal for these speech quality measures. As a reference dereverberation method we show the quality measures calculated from the output of the delay and sum beamformer (DSB). In Table 1 the results are shown for d 1 m and d, and A 1. The quality measures are calculated using 4 seconds of speech data after the filter coefficients have converged. We can see that the is increased in almost all scenarios. The measure indicates that the average Bark spectral distance is slightly increased. The values are very consistent and indicate a clear improvement in all cases. In Figure the spectrogram of the anechoic signal, the microphone signal zi (n) and the output of the proposed algorithm for A and A 16 are depicted (d m and T6 4 ms). Note that the effect of overlapmasking is reduced and that the first reflections can be preserved by increasing A. In Figure 3 the microphone signal zi (n) and the output of the proposed algorithm, using d m, T6 4 ms and A 8, are depicted. In both figures is can be seen that the smearing caused by reverberation, is reduced. Proposed A 16 4 3 3.5 1 5 Fig.. Spectrograms of the anechoic, reverberant and proposed signalsusing A Oand A 16(d mandt6o 4 ms). Anechoic.5 1. 5TT 1.5 Reverberant, 5 $ 5 1 _.5 Proposed A, 5.5 Fig. 3. Anechoic, reverberant and proposed (A T6 4 ms) signals. 8, d m, and In case the DOA estimation is not perfect the direct speech signal will leak into the reference signal. To study the effects of steering errors due to errors in the DOA estimate we introduced a steering error of 5 degrees. The spectrogram of the processed signals, with and without steering error, are depicted in Figure 4. We can see that the proposed algorithm is still able to suppress a significant amount of reverberation. However, it can also be seen that some additional distortion was introduced by the proposed dereverberation algorithm. 'The results are available for listening on the following web page: http://www.sps. ele. tue.nlmembersle. a.p.habetslicasspo7. IV 93 Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on March 9,1 at 3:4: EDT from IEEE Xplore. Restrictions apply.

Table 1. Experimental results in terms of segmental Signal to Interference Ratio (), Bark Spectral Distortion () and Reverberation Decay Tail () for A. d Method Unprocessed 1 m DSB _ Proposed Unprocessed m DSB Proposed T6 8.4 db 9.3 db 6.83 db 3.5 db 4.41 db 4.5 db ms.5 db.4 db.6 db.15 db.1 db.18 db T6 53.13 db 4.37 db 3 [.3 db 89 4.17 db 74 3.35 db 66.1 db 4 ms.13 db.1 db.13 db.31 db.3 db.34 db T6 5 4.31 db 175 3.95 db 16 j.6 db 454 8.15 db 337 7.43 db.83 db 6 ms. db.17 db.18 db.41 db.33 db.45 db 568 463 18 939 766 96 7. REFERENCES [1] J. B. Allen, D. A. Berkley, and J. Blauert, "Multimicrophone signalprocessing technique to remove room reverberation from speech signals," Journal of the Acoustical Society ofamerica, vol. 6, no. 4, pp. 91915, 1977. [] Y. Huang, J. Benesty, and J. Chen, "Identification of acoustic MIMO systems: Challenges and opportunities," Signal Processing, vol. 6, no. 86, pp. 178195, 6. [3] B. Yegnanarayana and P. S. Murthy, "Enhancement of reverberant speech using LP residual signal," IEEE Transactions on Speech andaudio Processing, vol. 8, no. 3, pp. 6781,. [4] N. D. Gaubitch, P. A. Naylor, and D. Ward, "On the use of linear prediction for dereverberation of speech," in Proc. of the International Workshop on Acoutsic Echo and Noise Control (IWAENC'3), Kyoto, Japan, 3, pp. 991. [5] K. Lebart and J.M. Boucher, "A new method based on spectral subtraction for speech dereverberation," Acta Acoustica, vol. 87, pp. 359366, 1. [6] E.A.P. Habets, "MultiChannel Speech Dereverberation based on a Statistical Model of Late Reverberation," in Proc. of the 3th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 5), Philadelphia, USA, March 5, pp. 173176. [7] E.A.P. Habets, S. Gannot, and I. Cohen, "DualMicrophone Speech Dereverberation in a Noisy Environment," in Proc. of the IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 6), Vancouver, Canada, August 6, pp. 651655. [8] L.J. Griffiths and C.W. Jim, "An Alternate Approach to Linearly Constrained Adaptive Beamforming," IEEE Transaction on Antennas and Propagation, vol. 1, no. 3, pp. 734, 198. [9] S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. 7, no., pp. 111, April 1979. [1] J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small room acoustics," Journal of the Acoustical Society ofamerica, vol. 65, no. 4, pp. 94395, 1979. [11] J.Y.C. Wen and P. Naylor, "An evaluation measure for reverberant speech using tail decay modelling," in Proc. of the European Signal Processing Conference (EUSIPCO 6), Florence, Italy, 6, pp. 14. [1] J.Y.C. Wen, N.D. Gaubitch, E.A.P. Habets, T. Myatt, and P.A. Naylor, "Evaluation of Speech Dereverberation Algorithms using the MARDY Database," in Proc. of the 1th International Workshop ofacoutsic Echo and Noise Control (IWAENC 6), Paris, France, September 6, pp. 14. Proposed 4 'N 3 G 4;) " 1 3 4 Prono.sedi with 9 stecriniq err( 4 N' 3 lon u Fig. 4. Spectrograms of the processed signal with and without steering error of 5 degrees (A, d m, and T6o 4 ms). a 5. DISCUSSION AND CONCLUSIONS In this paper we proposed a dualmicrophone speech dereverberation algorithm. A GSC type of structure was used to enhance the desired speech signal. We proposed to use a reference signal to enhance the output of the delay and sum beamfomer. The advantage of the proposed solution is that we only require an estimate of the DOA. Although no additional interferences have been taken into account, i.e., coherent or noncoherent noise sources, we would like to point out that the power envelope of the reverberant component could also be estimated in a noisy environment (see for example [7]). Experimental results have shown that the proposed solution can be used to reduce the reverberation while keeping speech distortion low. Future research will focus on the extension to multimicrophones, which allows better estimation ofthe residual reverberant energy, and to more realistic situations where additional interferences are present. 6. ACKNOWLEDGEMENT The authors express there thanks to Jimi Wen from the Imperial College London, United Kingdom, for making the code for the measure available. IV 94 Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on March 9,1 at 3:4: EDT from IEEE Xplore. Restrictions apply.