REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION

Similar documents
SEPARATION AND DEREVERBERATION PERFORMANCE OF FREQUENCY DOMAIN BLIND SOURCE SEPARATION. Ryo Mukai Shoko Araki Shoji Makino

BLIND SOURCE separation (BSS) [1] is a technique for

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

High-speed Noise Cancellation with Microphone Array

Frequency-Domain Blind Source Separation of Many Speech Signals Using Near-Field and Far-Field Models

Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments

BLIND SOURCE SEPARATION BASED ON ACOUSTIC PRESSURE DISTRIBUTION AND NORMALIZED RELATIVE PHASE USING DODECAHEDRAL MICROPHONE ARRAY

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 4, MAY /$ IEEE

Source Separation and Echo Cancellation Using Independent Component Analysis and DWT

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

A Novel Hybrid Approach to the Permutation Problem of Frequency Domain Blind Source Separation

+ C(0)21 C(1)21 Z -1. S1(t) + - C21. E1(t) C(D)21 C(D)12 C12 C(1)12. E2(t) S2(t) (a) Original H-J Network C(0)12. (b) Extended H-J Network

516 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING

TARGET SPEECH EXTRACTION IN COCKTAIL PARTY BY COMBINING BEAMFORMING AND BLIND SOURCE SEPARATION

BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES USING SPATIALLY RESAMPLED OBSERVATIONS

MULTIMODAL BLIND SOURCE SEPARATION WITH A CIRCULAR MICROPHONE ARRAY AND ROBUST BEAMFORMING

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES

Electronic Research Archive of Blekinge Institute of Technology

Separation of Noise and Signals by Independent Component Analysis

Nonlinear postprocessing for blind speech separation

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation

WHITENING PROCESSING FOR BLIND SEPARATION OF SPEECH SIGNALS

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991

A Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation

MINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE

Automotive three-microphone voice activity detector and noise-canceller

Speech enhancement with ad-hoc microphone array using single source activity

Recent Advances in Acoustic Signal Extraction and Dereverberation

Separation of Multiple Speech Signals by Using Triangular Microphone Array

Improving reverberant speech separation with binaural cues using temporal context and convolutional neural networks

Multiple Sound Sources Localization Using Energetic Analysis Method

Underdetermined Convolutive Blind Source Separation via Frequency Bin-wise Clustering and Permutation Alignment

Audiovisual speech source separation: a regularization method based on visual voice activity detection

ICA for Musical Signal Separation

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Discrete Fourier Transform (DFT)

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco

Title. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information

Real-time Adaptive Concepts in Acoustics

The basic problem is simply described. Assume d s statistically independent sources s(t) =[s1(t) ::: s ds (t)] T. These sources are convolved and mixe

Multichannel Acoustic Signal Processing for Human/Machine Interfaces -

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

Deblending random seismic sources via independent component analysis

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

Convention Paper Presented at the 138th Convention 2015 May 7 10 Warsaw, Poland

On the Subcarrier Averaged Channel Estimation for Polarization Mode Dispersion CO-OFDM Systems

THE problem of acoustic echo cancellation (AEC) was

REAL TIME DIGITAL SIGNAL PROCESSING

Introduction to Blind Signal Processing: Problems and Applications

A Three-Microphone Adaptive Noise Canceller for Minimizing Reverberation and Signal Distortion

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

Blind Separation of Radio Signals Fading Channels

Eigenvalue equalization applied to the active minimization of engine noise in a mock cabin

Adaptive Array Beamforming using LMS Algorithm

Microphone Array Feedback Suppression. for Indoor Room Acoustics

A HYPOTHESIS TESTING APPROACH FOR REAL-TIME MULTICHANNEL SPEECH SEPARATION USING TIME-FREQUENCY MASKS. Ryan M. Corey and Andrew C.

NOISE ESTIMATION IN A SINGLE CHANNEL

DURING the past several years, independent component

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Microphone Array Design and Beamforming

BLIND SEPARATION OF LINEAR CONVOLUTIVE MIXTURES USING ORTHOGONAL FILTER BANKS. Milutin Stanacevic, Marc Cohen and Gert Cauwenberghs

Correlated postfiltering and mutual information in pseudoanechoic model based blind source separation

Speech Enhancement Based On Noise Reduction

Enhancement of Speech in Noisy Conditions

Adaptive Filters Application of Linear Prediction

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set

source signals seconds separateded signals seconds

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes

A wireless MIMO CPM system with blind signal separation for incoherent demodulation

Calibration of Microphone Arrays for Improved Speech Recognition

Permutation Correction in the Frequency Domain in Blind Separation of Speech Mixtures

Mikko Myllymäki and Tuomas Virtanen

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio

AMAIN cause of speech degradation in practically all listening

Adaptive Beamforming Approach with Robust Interference Suppression

Adaptive beamforming using pipelined transform domain filters

An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets

ESE531 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS

Speech Enhancement Using Microphone Arrays

Analysis of LMS and NLMS Adaptive Beamforming Algorithms

Computer exercise 3: Normalized Least Mean Square

Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system

DERIVATION OF TRAPS IN AUDITORY DOMAIN

Comprehensive Performance Analysis of Non Blind LMS Beamforming Algorithm using a Prefilter

Proceedings of Meetings on Acoustics

SPEECH ENHANCEMENT USING ADAPTIVE FILTERS AND INDEPENDENT COMPONENT ANALYSIS APPROACH

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi,

Robust Low-Resource Sound Localization in Correlated Noise

ESTIMATION OF TIME-VARYING ROOM IMPULSE RESPONSES OF MULTIPLE SOUND SOURCES FROM OBSERVED MIXTURE AND ISOLATED SOURCE SIGNALS

AUTOMATIC EQUALIZATION FOR IN-CAR COMMUNICATION SYSTEMS

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram

Suggested Solutions to Examination SSY130 Applied Signal Processing

Transcription:

REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION Ryo Mukai Hiroshi Sawada Shoko Araki Shoji Makino NTT Communication Science Laboratories, NTT Corporation 2 4 Hikaridai, Seika-cho, Soraku-gun, Kyoto 69 237, Japan fryo,sawada,shoko,makig@cslab.kecl.ntt.co.jp ABSTRACT This paper describes a real-time blind source separation (BSS) method for moving speech signals in a room. Our method employs frequency domain independent component analysis (ICA) using a blockwise batch algorithm in the first stage, and the separated signals are refined by postprocessing using crosstalk component estimation and nonstationary spectral subtraction in the second stage. The blockwise batch algorithm achieves better performance than an online algorithm when sources are fixed, and the postprocessing compensates for performance degradation caused by source movement. Experimental results using speech signals recorded in a real room show that the proposed method realizes robust real-time separation for moving sources. Our method is implemented on a standard PC and works in realtime.. INTRODUCTION Blind source separation (BSS) is a technique for estimating original source signals using only observed mixtures. The BSS of audio signals has a wide range of applications including noise robust speech recognition, hands-free telecommunication systems and high-quality hearing aids. In most realistic applications, the source signal location may change, and the mixing system is time-varying. Although a large number of studies have been undertaken on BSS based on ICA [, 2, 3], only few studies have been made on BSS for moving source signals [4,, 6, 7]. Indeed an online algorithm can track a time-varying system, however, in general, its performance is worse than a batch algorithm when the system becomes stationary. Although we are dealing with moving sources, we do not want to degrade the performance for fixed sources. In this paper, we propose a robust real-time BSS method that employs frequency domain ICA using a blockwise batch algorithm in the first stage, and the postprocessing of crosstalk component estimation and non-stationary spectral subtraction in the second stage. When we adopt a blockwise frequency domain ICA, we need to solve a permutation problem for every block, and this is a time consuming process especially when the block length is short. We use an algorithm based on analytical calculation of null directions to solve the permutation problem quickly. Another problem inherent to batch algorithms is an input-output delay. To reduce the delay, we use a technique for computing output signal without waiting for the calculation of the separating system to be completed. These techniques are useful for realizing low-delay real-time BSS. The blockwise batch algorithm achieves better separation performance than an online algorithm for fixed source signals, but the performance declines for moving sources. As we pointed out in [8], the solution of ICA works like an adaptive beamformer, which forms a spatial null towards a jammer signal. This characteristic means that BSS using ICA is fragile as regards a moving jammer signal but robust with respect to a moving target signal. Utilizing this nature, we can estimate residual crosstalk components even when a jammer signal moves. To compensate for the degradation when a jammer signal moves, we employ postprocessing in the second stage. Experimental results using speech signals recorded in a room show the effectiveness of the method in realizing robust real-time separation. 2. ICA BASED BSS OF CONVOLUTIVE MIXTURES In this section, we briefly review the BSS algorithm that uses frequency domain ICA and formulate a blockwise batch algorithm including an online algorithm as a special case. We also describe a fast algorithm for solving permutation problems, which is necessary for real-time processing. 2.. Frequency domain ICA When the source signals are s i (t)(i =;:::;N), the signals observed by microphone j are x j (t)(j =; :::; M ), and the separated signals are y k (t)(k = ; :::; N), the BSS model

can be described by the following equations: x j (t) = y k (t) = NX i= MX j= (hji Λ s i )(t) () (wkj Λ x j )(t) (2) (a) x(t) STDFT X (!; n) y(t) B m B m+ B m+2 ICA STIDFT W m block size Y m = W mxm T b STDFT frame length where hji is the impulse response from source i to microphone j, wkj is the coefficient when we assume that a separating system is used as an FIR filter, and Λ denotes the convolution operator. A convolutive mixture in the time domain corresponds to instantaneous mixtures in the frequency domain. Therefore, we can apply an ordinary ICA algorithm in the frequency domain to solve a BSS problem in a reverberant environment. Using a short-time discrete Fourier transform (STDFT) for (), the model is approximated as: X(!; n) =H(!)S(!; n); (3) where,! is the angular frequency, and n represents the frame index. The separating process can be formulated in each frequency bin as: Y (!; n) =W (!)X(!; n); (4) where S(!; n) = [S (!; n); :::; S N (!; n)] T is the source signal in frequency bin!, X(!; n) = [X (!; n);:::; X M (!; n)] T denotes the observed signals, Y (!; n) = [Y (!; n);:::;y N (!; n)] T is the estimated source signal, and W (!) represents the separating matrix. W (!) is determined so that Y i (!; n) and Y j (!; n) become mutually independent. To calculate the separating matrix W, we use an optimization algorithm based on the minimization of the mutual information of Y. The optimal W is obtained by the following iterative equation using the natural gradient approach [9] : W (i+) = W (i) + μ[i hφ(y )Y H i]w (i) ; () where i is an index for the iteration, I is an identity matrix, μ is a step size parameter, h i denotes the averaging operator, and Φ( ) is a nonlinear function. Because the signals have complex values in the frequency domain, we use a polar coordinate based nonlinear function, which is effective for fast convergence especially when the number of input data samples is small []: Φ(Y ) = tanh(g abs(y ))e arg(y ) ; (6) where g is a gain parameter that controls the nonlinearity. (b) x(t) X (!; n) STDFT y(t) input-output delay B m 2 B m B m W m 2 ICA STIDFT input-output delay Y m = W m 2Xm Fig.. Input-output delay of (a) BSS using ordinary blockwise batch algorithm, and (b) BSS without waiting for calculation of W m. 2.2. Blockwise batch algorithm In order to track the time-varying mixing system, we update the separating matrix for each time block B m = ft : (m )T b» t<mt b g, where T b is the block size, and m represents the block index (m ). Koutras et al. have proposed a similar method in the time domain []. When T b equals the STDFT frame length, this procedure can be considered an online algorithm in the frequency domain. We use the separating matrix of the previous block as the initial iteration value for a new block, i.e., W () m+ (!) = W (NI m ) (!), where N I is the number of iterations for (). We use a set of two null beamformers as the initial matrix W () (!) for the first block. The batch algorithm has an inherent delay, because the calculation of W needs to wait for the arrival of a data block. Moreover, the calculation itself also takes time (Fig. (a)). However, when the calculation is completed within T b and we use W m 2 for separation of the signals in B m, we can avoid the delay for waiting and calculation (Fig. (b)). This technique can reduce the input-output delay and is suitable for low-delay real-time applications. It seems that this method fails when a source signal moves, but it is actually robust for the moving target signal, which is shown in Sec. 4.3.

Source signals S S 2 H + Mixing system Observed signals X W + + + X 2 Separating system Separated signals Y Y 2 Separated signals obtained by ICA Y (!; n) Y (s) (!; n) + Y (c) (!; n) Y 2 (!; n) a(!; n) Spectral subtractor ^Y (c) (!; n) Refined output ^Y (s) (!; n) Fig. 2. Model of BSS system (N = M = 2). 2.3. Scaling and permutation Once we have completed the ICA for all frequencies, we need to solve the permutation and scaling problems. Since we are handling signals with complex values, the scaling factors are also complex values. Thus the scaling can be divided into phase scaling and amplitude scaling. We use a directivity pattern based method to solve the permutation and phase scaling problems. When we consider a separating system as a microphone array, we can write directivity patterns for every frequency bin. The permutation problem is solved so that the null directions are aligned. We can estimate the directions of the source signals from the aligned directivity patterns, and the phase scaling problem is solved so that the phase response of the estimated source direction becomes zero. In the following sections, we consider a two-input, twooutput convolutive BSS problem, i.e., N = M = 2 (Fig. 2). When M = 2 and the distance between the microphones is sufficiently small to avoid spatial aliasing, the null directions i (!) can be calculated analytically as: wi (!) c i (!) = arcsin arg ; (7) w i2 (!) d! where [w i (!)w i2 (!)] is an i-th row vector of W (!), d is the distance between microphones and c is the velocity of sound []. This method does not require the directivity pattern to be scanned, thus we can solve the permutation problem quickly. The amplitude scaling problem is solved by using a slightly modified version of the method described in [2]. We calculate the inverse of the separating matrices W (!), and decide the scaling factors so that the norms of each column of W (!) become uniform. 3. POSTPROCESSING FOR REFINING SEPARATED SIGNALS In this section, we briefly summarize the procedure for estimating and subtracting the residual crosstalk component. The algorithm is described in detail in [3]. Figure 3 shows a block diagram of the algorithm. update a only when jy (!; n)j < jy 2 (!; n)j Fig. 3. Postprocessing for removing crosstalk component Y (c) from Y. We consider that S is a target signal and S 2 is a jammer signal. The separated signal Y consists of a straight component Y (s) derived from S and a crosstalk component Y (c) derived from S 2.IfY (c) is removed from Y, the separation performance improves. We introduce FIR filters a(!; n) = [a (!; n); :::; a L (!; n)] in each frequency bin, where L is the length of the filter. We assume that the power of Y (c) (!; n) can be approximated as the output of the filter whose input is Y 2 (!; n). This is formulated as follows: X L (c) j ^Y (!; n)j 2 ß a k (!; n)jy 2 (!; n k)j 2 (8) k= The filters are updated by the following selectively normalized LMS algorithm. a(!; n +)= (9) 8 >< e(!; n)u(!; n) ffi + jju(!; n)jj2 (if jy >: (w; n)j < jy 2 (w; n)j) (otherwise) where u(!; n) = [jy 2 (!; n)j 2 ; jy 2 (!; n )j 2 ; :::; jy 2 (!; n L +)j 2 ] T is an input vector and e(!; n) = jy (!; n)j 2 a T (!; n)u(!; n) is an estimation error. Here, is a step size parameter and ffi is a positive constant to avoid numerical instability when u is very small. We estimate the power of the residual crosstalk component using (8) and (9), and finally, we obtain an estimation (s) of the straight component as ^Y by the following spectral subtraction procedure: ^Y (s) (!; n) = () 8 >< >: (jy (!; n)j 2 j^y (c) (!; n)j 2 ) =2 Y (!; n) jy (!; n)j (if jy (!; n)j 2 > j ^Y (c) (!; n)j 2 ) (otherwise)

4.4 m B Loudspeakers (height:.3 m) Target A signal Microphones 3. m (height:.39 m) -7-4 4cm 3. m 7 C Jammer D signal 2.2 m.7 m Room height: 2. m Fig. 4. Layout of room used in experiments. Table. Experimental conditions Common Sampling rate =8kHz Window = hanning Reverberation time T R=3 ms ICA part Frame length T ICA = 24 point (28 ms) Frame shift = 26 point (32ms) g =. Number of iterations N I = Block size T b =s Post Frame length T SS = 24 point (28 ms) processing Frame shift = 64 point (8 ms) part =:; ffi =: 4. EXPERIMENTS 4.. Experimental conditions To examine the effectiveness of the proposed method, we carried out experiments using speech signals recorded in a room. The reverberation time of the room was 3 ms. We used two omni-directional microphones with an interelement spacing of 4 cm. The layout of the room is shown in Fig 4. The target source signal was first located at A, and then moved to B at a speed of 3 deg/s. The jammer signal was located at C and moved to D at a speed of 4 deg/s. The step size parameter μ in () affects the separation performance of BSS when the block size changes. We chose μ to optimize the performance for each block size. Other conditions are summarized in Table. We assumed the straight component y (s) as a signal, and the difference between the output signal and the straight component as interference. We defined the output signalto-interference ratio (SIR O ) in the time domain as follows: SIR O log Pt jy(s) (t)j2 Pt jy (t) y (s) (t)j2 (db): () Similarly, the input SIR (SIR I ) is defined as, P P 2 t i= SIR I log j(h i Λ s )(t)j P P 2 2 (db): (2) i= j(h i2 Λ s 2 )(t)j2 t We use SIR = SIR O SIR I as a performance measure. This measurement is consistent with the performance evaluation of BSS in which the crosstalk component is assumed as interference. We measured SIRs with 3 combinations of source signals using three male and three female speakers, and averaged them. 4.2. Performance for fixed sources Although we are dealing with moving sources, we do not want the performance for fixed sources to deteriorate. First, 7 6 4 3 2 Online.28.2. 2 Block size Blockwise batch Fig.. Average and standard deviation of SIR for fixed sources T b (s) we measured the BSS performance using ICA without postprocessing. Figure shows the average and standard deviation of SIR for fixed sources (the target is at A and the jammer at C in Fig. 4). This indicates that the blockwise batch algorithm outperforms the online algorithm (in which μ is tuned to optimize the performance), when we use the update equation (). In addition, the deviation of the batch algorithm is smaller than that of the online algorithm. This is why we adopt the blockwise batch algorithm in the first stage. We used T b =. sec. in the following experiments. 4.3. Moving target and moving jammer Before considering the result obtained with the postprocessing method, we investigate the BSS performance for moving sources using the blockwise batch algorithm. Figure 6 shows the SIR for a moving target (solid line) and for a moving jammer (dotted line). We can see that the SIR is not degraded even when the target moves. By contrast, jammer movement causes a decline in the SIR. This can be explained by the directivity pattern of the separating system obtained by ICA. The solution of fre-

2 Moving target 2 Blockwise batch with postprocessing (proposed method) Moving jammer Move 2 4 6 8 2 4 6 8 2 Time (s) Fig. 6. SIR of blockwise batch algorithm without postprocessing. Target and jammer signals moved at sec. Blockwise batch Move without postprocessing 2 4 6 8 2 4 6 8 2 Time (s) Fig. 8. Effect of postprocessing. Jammer signal moved from C to D at sec. Gain (db) - - - -2-2 -3-8 -6-4 -2 Direction (deg) 2 4 6 8 2 23 3 4 Frequency (Hz) Fig. 7. Directivity pattern of separating system obtained by frequency domain ICA quency domain BSS works in the same way as an adaptive beamformer, which forms a spatial null towards a jammer signal (Fig. 7). Because of this characteristic, BSS using ICA is robust as regards a moving target signal but fragile with respect to a moving jammer signal. 4.4. Performance of blockwise batch algorithm with postprocessing The most important factor when estimating the crosstalk component Y (c) using (8) and (9) is Y 2, and Y 2 is estimated robustly even when S 2 moves, because S 2 is a target signal for Y 2. Therefore, postprocessing works robustly even when the jammer signal S 2 moves. Figure 8 shows the SIR of blockwise batch algorithm with postprocessing when the jammer signal moves (solid line). We can see that the SIR is improved by the postprocessing, and the drop of the SIR when the jammer moves is reduced. This result shows that our postprocessing method can compensate the fragility of the blockwise batch algorithm when a jammer signal moves. Although crosstalk components still remaining in the postprocessed output signal sometimes make a musical noise, the power is much 2 Online with postprocessing Online without postprocessing Move 2 4 6 8 2 4 6 8 2 Time (s) Fig. 9. Performance of online algorithm with and without postprocessing. Jammer signal moved from C to D at sec. smaller than ordinary spectral subtraction. 4.. Performance of online algorithm Figure 9 shows the SIR of online algorithm with and without postprocessing. The online algorithm is more stable than blockwise algorithm, however the performance is worse when the sources are stationary, as we described in Sec. 4.2. The postprocessing is also effective for this case, thus we may choose the algorithm in the first stage according to requirements of the application.. CONCLUSION We proposed a robust real-time BSS method for moving source signals. The combination of the blockwise batch and the postprocessing realizes a robust low-delay real-time BSS. We can solve a permutation problem quickly by using analytical calculation of null directions, and this technique is useful for solving convolutive BSS problems in

realtime. Postprocessing using crosstalk component estimation and non-stationary spectral subtraction improves the separation performance and reduces the performance deterioration when a jammer signal moves. Experimental results using speech signals recorded in a room showed the effectiveness of the proposed method. ACKNOWLEDGEMENT We thank Dr. Shigeru Katagiri for his continuous encouragement. 6. REFERENCES [] A. J. Bell and T. J. Sejnowski, An informationmaximization approach to blind separation and blind deconvolution, Neural Computation, vol. 7, no. 6, pp. 29 9, 99. [] H. Sawada, R. Mukai, S. Araki, and S. Makino, Polar coordinate based nonlinear function for frequencydomain blind source separation, in Proc. of ICASSP 2, 22, pp. 4. [] H. Sawada, R. Mukai, S. Araki, and S. Makino, A robust approach to the permutation problem of frequency-domain blind source separation, in Proc. of ICASSP 3, 23, submitted. [2] F. Asano and S. Ikeda, Evaluation and real-time implementation of blind source separation system using time-delayed decorrelation, in Proc. of Intl. Workshop on Independent Component Analysis and Blind Signal Separation (ICA ), 2, pp. 4 4. [3] R. Mukai, S. Araki, H. Sawada, and S. Makino, Removal of residual crosstalk components in blind source separation using LMS filters, in Proc. of NNSP 2, 22, pp. 43 444. [2] S. Haykin, Ed., Unsupervised Adaptive Filtering, John Wiley & Sons, 2. [3] A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis, John Wiley & Sons, 2. [4] J. Anemüller and T. Gramss, On-line blind separation of moving sound sources, in Proc. of Intl. Conf. on Independent Component Analysis and Blind Source Separation (ICA 99), 999, pp. 33 334. [] A. Koutras, E. Dermatas, and G. Kokkinakis, Blind speech separation of moving speakers in real reverberant environment, in Proc. of ICASSP, 2, pp. 33 36. [6] I. Kopriva, Z. Devcic, and H. Szu, An adaptive shorttime frequency domain algorithm for blind separation of non-stationary convolved mixtures, in Proc. of IJCNN, 2, pp. 424 429. [7] K. E. Hild II, D. Erdogmus, and J. C. Principe, Blind source separation of time-varying, instantaneous mixtures using an on-line algorithm, in Proc. of ICASSP 2, 22, pp. 993 996. [8] S. Araki, S. Makino, R. Mukai, and H. Saruwatari, Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers, in Proc. of Eurospeech, 2, pp. 29 298. [9] S. Amari, A. Cichocki, and H. H. Yang, A new learning algorithm for blind signal separation, in Advances in Neural Information Processing Systems 8, pp. 77 763. The MIT Press, 996.