Networks for the Separation of Sources that are Superimposed and Delayed

Similar documents
A nalog Circuits for Constrained Optimization

DURING the past several years, independent component

MINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE

Title. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information

+ C(0)21 C(1)21 Z -1. S1(t) + - C21. E1(t) C(D)21 C(D)12 C12 C(1)12. E2(t) S2(t) (a) Original H-J Network C(0)12. (b) Extended H-J Network

FPGA Implementation Of LMS Algorithm For Audio Applications

Impulse-Noise Cancelation using the Common Mode Signal

FROM BLIND SOURCE SEPARATION TO BLIND SOURCE CANCELLATION IN THE UNDERDETERMINED CASE: A NEW APPROACH BASED ON TIME-FREQUENCY ANALYSIS

Design and Evaluation of Modified Adaptive Block Normalized Algorithm for Acoustic Echo Cancellation in Hands-Free Communications

Speech Enhancement Based On Noise Reduction

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

BLIND SEPARATION OF LINEAR CONVOLUTIVE MIXTURES USING ORTHOGONAL FILTER BANKS. Milutin Stanacevic, Marc Cohen and Gert Cauwenberghs

works must be obtained from the IEE

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

Figure 1: A typical Multiuser Detection

int.,.noil. 1989December

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

Hardware Implementation of a PCA Learning Network by an Asynchronous PDM Digital Circuit

DESIGN AND IMPLEMENTATION OF ADAPTIVE ECHO CANCELLER BASED LMS & NLMS ALGORITHM

Acoustic Echo Cancellation using LMS Algorithm

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Comparative Study of Different Algorithms for the Design of Adaptive Filter for Noise Cancellation

Chapter 4 SPEECH ENHANCEMENT

Adaptive Noise Cancellation with Fast Tunable RBF Network

AN INSIGHT INTO ADAPTIVE NOISE CANCELLATION AND COMPARISON OF ALGORITHMS

Neural Blind Separation for Electromagnetic Source Localization and Assessment

SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES

EE301 Electronics I , Fall

Hardware Implementation of Adaptive Algorithms for Noise Cancellation

Nonlinear postprocessing for blind speech separation

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

An Adaptive Adjacent Channel Interference Cancellation Technique

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

A FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM FOR DUCTS USING A PASSIVE SILENCER TO REDUCE ACOUSTIC FEEDBACK

Multirate Algorithm for Acoustic Echo Cancellation

A Novel Adaptive Algorithm for

Nonuniform multi level crossing for signal reconstruction

Keywords: Adaptive filtering, LMS algorithm, Noise cancellation, VHDL Design, Signal to noise ratio (SNR), Convergence Speed.

NEURO-ACTIVE NOISE CONTROL USING A DECOUPLED LINEAIUNONLINEAR SYSTEM APPROACH

ICA for Musical Signal Separation

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Real- Time Computer Vision and Robotics Using Analog VLSI Circuits

Convolutional Coding Using Booth Algorithm For Application in Wireless Communication

x ( Primary Path d( P (z) - e ( y ( Adaptive Filter W (z) y( S (z) Figure 1 Spectrum of motorcycle noise at 40 mph. modeling of the secondary path to

PVT Insensitive Reference Current Generation

Adaptive Antennas in Wireless Communication Networks

Effect of the Audio Amplifier s Distortion on Feedforward Active Noise Control

Multiple Antenna Techniques

THE ABILITY to selectively enhance audio signals of

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set

Keywords : Simultaneous perturbation, Neural networks, Neuro-controller, Real-time, Flexible arm. w u. (a)learning by the back-propagation.

High-speed Noise Cancellation with Microphone Array

A Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation

Separation of Noise and Signals by Independent Component Analysis

A Simple Design and Implementation of Reconfigurable Neural Networks

An Effective Implementation of Noise Cancellation for Audio Enhancement using Adaptive Filtering Algorithm

A COMPACT, AGILE, LOW-PHASE-NOISE FREQUENCY SOURCE WITH AM, FM AND PULSE MODULATION CAPABILITIES

1. Motivation. 2. Periodic non-gaussian noise

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Analysis of LMS and NLMS Adaptive Beamforming Algorithms

Multiple Sound Sources Localization Using Energetic Analysis Method

John Lazzaro and John Wawrzynek Computer Science Division UC Berkeley Berkeley, CA, 94720

BECAUSE OF their low cost and high reliability, many

Performance Comparison of ZF, LMS and RLS Algorithms for Linear Adaptive Equalizer

Lab/Project Error Control Coding using LDPC Codes and HARQ

Feature Extraction of Acoustic Emission Signals from Low Carbon Steel. Pitting Based on Independent Component Analysis and Wavelet Transforming

PERFORMANCE ANALYSIS OF DIFFERENT M-ARY MODULATION TECHNIQUES IN FADING CHANNELS USING DIFFERENT DIVERSITY

Audio Restoration Based on DSP Tools

SGN Advanced Signal Processing

Analysis on Extraction of Modulated Signal Using Adaptive Filtering Algorithms against Ambient Noises in Underwater Communication

Auditory modelling for speech processing in the perceptual domain

A New Method For Active Noise Control Systems With Online Acoustic Feedback Path Modeling

Chapter 1 INTRODUCTION TO SOURCE CODING AND CHANNEL CODING. Whether a source is analog or digital, a digital communication

VLSI Circuit Design for Noise Cancellation in Ear Headphones

Adding Fractions with Different Denominators. Subtracting Fractions with Different Denominators

Adaptive Antennas. Randy L. Haupt

MthSc 103 Test #1 Spring 2011 Version A JIT , 1.8, , , , 8.1, 11.1 ANSWER KEY AND CUID: GRADING GUIDELINES

Initialisation improvement in engineering feedforward ANN models.

Adaptive Rate Transmission for Spectrum Sharing System with Quantized Channel State Information

VLSI Implementation of Separating Fetal ECG Using Adaptive Line Enhancer

A Bottom-Up Approach to on-chip Signal Integrity

Performance Evaluation of Adaptive Filters for Noise Cancellation

IN WIRELESS and wireline digital communications systems,

Analysis of LMS Algorithm in Wavelet Domain

4.5 Fractional Delay Operations with Allpass Filters

λ iso d 4 π watt (1) + L db (2)

Architecture design for Adaptive Noise Cancellation

A Comparative Study of Π and Split R-Π Model for the CMOS Driver Receiver Pair for Low Energy On-Chip Interconnects

Implementation of decentralized active control of power transformer noise

Performance Evaluation of different α value for OFDM System

Reducing comb filtering on different musical instruments using time delay estimation

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

Design Of A Comparator For Pipelined A/D Converter

Appendix. RF Transient Simulator. Page 1

Noise Reduction using Adaptive Filter Design with Power Optimization for DSP Applications

Acoustic Emission Source Location Based on Signal Features. Blahacek, M., Chlada, M. and Prevorovsky, Z.

3rd International Conference on Machinery, Materials and Information Technology Applications (ICMMITA 2015)

Revision of Channel Coding

MATLAB SIMULATOR FOR ADAPTIVE FILTERS

REAL-TIME BLIND SOURCE SEPARATION FOR MOVING SPEAKERS USING BLOCKWISE ICA AND RESIDUAL CROSSTALK SUBTRACTION

Transcription:

Networks for the Separation of Sources that are Superimposed and Delayed John C. Platt Federico Faggin Synaptics, Inc. 2860 Zanker Road, Suite 206 San Jose, CA 95134 ABSTRACT We have created new networks to unmix signals which have been mixed either with time delays or via filtering. We first show that a subset of the Herault-Jutten learning rules fulfills a principle of minimum output power. We then apply this principle to extensions of the Herault-Jutten network which have delays in the feedback path. Our networks perform well on real speech and music signals that have been mixed using time delays or filtering. 1 INTRODUCTION Recently, there has been much interest in neural architectures to solve the "blind separation of signals" problem (Herault & Jutten, 1986) (Vittoz & Arreguit, 1989). The separation is called "blind," because nothing is assumed known about the frequency or phase of the signals. A concrete example of blind separation of sources is when the pure signals are sounds generated in a room and the mixed signals are the output of some microphones. The mixture process would model the delay of the sound to each microphone, and the mixing of the sounds at each microphone. The inputs to the neural network would be the microphone outputs, and the neural network would try to produce the pure signals. The mixing process can take on different mathematical forms in different situations. To express these forms, we denote the pure signal i as Pi, the mixed signal i as Ii (which is the ith input to the network), and the output signal i as Oi. The simplest form to unmix is linear superposition: 730 lj(t) = Pi(t) + L Mjj(t)Pj(t). j# (1)

Networks for the Separation of Sources that are Superimposed and Delayed 731 A more realistic, but more difficult form to unmix is superposition with single delays: l i(f) = Pi(t) + L Mij(t)Pj(t - Djj(t)). j i-i (2) Finally, a rather general mixing process would be superposition with causal filtering: li(t) = Pi(t) + L L M ijk(t)pj (t - 15k). ji-i k (3) Blind separation is interesting for many different reasons. The network must adapt on-line and without a supervisor, which is a challenging type of learning. One could imagine using a blind separation network to clean up an input to a speech understanding system. (Juttell & Herault, 1991) uses a blind separation network to deskew images. Finally, researchers have implemented blind separation networks using analog VLSI to yield systems which are capable of performing the separation of sources in real time (Vittoz & Arreguit, 1990) (Cohen, et. al., 1992). 1.1 Previous Work Interest in adaptive systems which perform noise cancellation dates back to the 1960s and 1970s (Widrow, et. al., 1975). The first neural network to un mix on-line a linear superposition of sources was (Herault & Jutten, 1986). Further work on off-line blind separation was performed by (Cardoso, 1989). Recently, a network to unmix filtered signals was proposed in (Jutten, et. al., 1991), independently of this paper. 2 PRINCIPLE OF MINIMUM OUTPUT POWER In this section, we apply the mathematics of noise-cancelling networks (Widrow, et. al., 1975) to the network in (Herault & Jutten, 1986) in order to generalize to new networks that can handle delays in the mixing process. 2.1 Noise-cancellation Networks A noise-cancellation network tries to purify a signal which is corrupted by filtered noise (Widrow, et. al., 1975). The network has access to the isolated noise signal. The interference equation is 1(t) = P(t) + L MjN(t - 8j ). j (4) The adaptive filter inverts the interference equation, to yield an output: O(t) = 1(t) - L Cj N(t - 8j ). j (5) The adaptation of a noise-cancellation network relies on an elegant notion: if a signal is impure, it will have a higher power than a pure signal, because the noise power adds to the signal power. The true pure signal has the lowest power. This minimum output power principle is used to determine adaptation laws for noisecancellation networks. Specifically, at any time t, Cj is adjusted by taking a step that minimizes 0(t)2

732 Platt and Faggin Figure 1: The network described in (Herault & Jutten, 1986). The dashed arrows represent adaptation. 2.2 The Herault-Jutten Network The Herault-Jutten network (see Figure 1) uses a purely additive model of interference. The interference is modeled by Ii = Pi + LMijPj. j,#-i (6) Notice the Herault-Jutten network solves a more general problem than previous noise-cancellation networks: the Herault-Jutten network has no access to any pure signal. In (Herault & Jutten, 1986), the authors also propose inverting the interference model: (7) OJ = Ii - L: GijOj. j,#-i The Herault-Jutten network can be understood intuitively by assuming that the network has already adapted so that the outputs are the pure signals (OJ = P j ). Each connection Gij subtracts just the right amount of the pure signal P j from the input Ii to yield the pure signal Pi. So, the Herault-J utten network will produce pure signals if the Gij = M ij. In (Herault & Jutten, 1986), the authors propose a very general adaptation rule for the Gij: (8) for some non-linear functions f and g. (Sorouchyari, 1991) proves that the network converges for f(x) = x 3. In this paper, we propose that the same elegant minimization principle that governs the noise-cancellation networks can be used to justify a subset of Herault-Jutten

Networks for the Separation of Sources that are Superimposed and Delayed 733 learning algorithms. Let g(x) = x and f(x) be a derivative of some convex function h(x), with a minimum at x = O. In this case, each output of the Hcrault-Jutten network independently minimizes a function h(x). A Herault-Jutten network can be made by setting h(x) = x 2. Unfortunately, this network will not converge, because the update rules for two connections G ij and Gji are identical: (9) Under this condition, the two parameters Gij and Gji will track one another and not converge to the correct answer. Therefore, a non-linear adaptation rule is needed to break the symmetry between the outputs. The next two sections of the paper describe how the minimum output power principle can be applied to generalizations of the Herault-J utten architecture. 3 NETWORK FOR UNMIXING DELAYED SIGNALS Figure 2: Our network for unmixing signals mixed with single delays. The adjustable delay in the feedback path avoids the degeneracy in the learning rule. The dashed arrows represent adaptation: the source of the arrow is the source of the error used by gradient descent. Our new network is an extension of the Herault-Jutten network (see Figure 2). We assume that the interference is delayed by a certain amount: Ii(t) = Pi(t) + L: Mij Pj (t - Djj (t»). (10) i:j:.j Compare this to equation (6): our network can handle delayed interference, while the Herault-Jutten network cannot. We introduce an adjustable delay in the feedback path in order to cancel the delay of the interference: Oi(t) = I(t) - L: GijOj(t - djj(t)). i:j:.j (11)

734 Platt and Faggin We apply the minimum output power principle to adapt the mixing coefficients Gij and the delays dij : ~Gij(t) = aoi(t)oj(t - dij(t)), do ~dij(t) = -f3gij (t)oj(t) d/ (t - djj(t)). (12) By introducing a delay in the feedback, we prevent degeneracy in the learning rule, hence we can use a quadratic power to adjust the coefficients.... 0 1-0... <1)0 ~. 0 0 0.. -0... ~S ~o > <"<:t <1)b....... I tv) Ob 65... \0 b... 0 1 2 3 4 5 6 7 Time (sec) Figure 3: The results of the network applied to a speech/music superposition. These curves are short-time averages of the power of signals. The upper curve shows the power of the pure speech signal. The lower curve shows the power of the difference between the speech output of the network, and the pure speech signal. The gap between the curves is the amount that the network attenuates the interference between the music and speech: the adaptation of the network tries to drive the lower curve to zero. As you can see, the network quickly isolates the pure speech signal. For a test of our network, we took two signals, one speech and one music, and mixed them together via software to form two new signals: the first being speech plus a delayed, attenuated music; the second being music plus delayed, attenuated speech. Figure 3 shows the results of our network applied to these two signals: the interference was attenuated by approximately 22 db. One output of the network sounds like speech, with superimposed music which quickly fades away. The other output of the network sounds like music, with a superimposed speech signal which quickly fades away. Our network can also be extended to more than two sources, like the Herault-Jutten network. If the network tries to separate S sources, it requires S non-identical

Networks for the Separation of Sources that are Superimposed and Delayed 735 inputs. Each output connects to one input, and a delayed version of each of the other outputs, for a total of 28(S - 1) adaptive coefficients. 4 NETWORK FOR UNMIXING FILTERED SIGNALS Figure 4: A network to unrnix signals that have been mixed via filtering. The filters in the feedback path are adjusted to independently minimize the power h( Oi) of each output. For the mixing process that involves filtering, Ii(t) = Pi(t) + L L MijkPj(t - bk), j-:ti k (13) we put filters in the feedback path of each output: Oi(t) = li(t) - L L CjkOj(t - 15k), j -:ti k (14) (Jutten, et. al., 1991) also independently developed this architecture. We can use the principle of minimum output power to develop a learning rule for this architecture: for some convex function h. (Jutten, et. al., 1991) suggests using an adaptation rule that is equivalent to choosing h(x) = X4. Interestingly, neither the choice of h( x) = x 2 nor h( x) = X4 converges to the correct solution. For both h(x) = x 2 and h(x) = x4, if the coefficients start at the correct solution, they stay there. However, if the coefficients start at zero, they converge to a solution that is only roughly correct (see Figure 5). These experiments show (15)

736 Platt and Faggin... = Absolute Value o =SquSIe o = Fourth Power,...; 90~~1---+2--~3---4+-~5---+6--~7---+8--~9 coefficient number Figure 5: The coefficients for one filter in the feedback path of the network. The weights were initialized t.o zero. Two different speech/music mixtures were applied to the network. The solid line indicates the correct solution for the coefficients. When minimizing either h(x) = x2 or h(x) = x\ the network converges to an incorrect solution. Minimizing h(x) = Ixl seems to work well. that the learning algorithm has multiple stable states. Experimentally, the spurious stable states seem to perform roughly as well as the true answer. To account for these multiple stable states, we came up with a conjecture: that the different minimizations performed by each output fought against one another and created the multiple stable states. Optimization theory suggests using an exact penalty method to avoid fighting between multiple terms in a single optimization criteria (Gill, 1981). The exact penalty method minimizes a function h(x) that has a non-zero derivative for x close to O. We tried a simple exact penalty method of h(x) = Ix\' and it empirically converged to the correct solution (see Figure 5). The adaptation rule is then In this case, the non-linearity of the adaptation rule seems to be important for the network to converge to the true answer. For a speech/music mixture, we achieved a signal-to-noise ratio of 20 db using the update rule (16). 5 FUTURE WORK The networks described in the last two sections were found to converge empirically. In the future, proving conditions for convergence would be useful. There are some known pathological cases which cause these networks not to converge. For example, using white noise as the pure signals for the network in section 3 causes it to fail, because there is no sensible way for the network to change the delays. (16)

Networks for the Separation of Sources that are Superimposed and Delayed 737 More exploration of the choice of optimization function needs to be performed in the future. The work in section 4 is just a first step which illustrates the possible usefulness of the absolute value function. Another avenue of future work is to try to express the blind separation problem as a global optimization problem, perhaps by trying to minimize the mutual information between the outputs. (Feinstein, Becker, personal communication) 6 CONCLUSIONS We have found that the minimum output power principle can generate a subset of the Herault-Jutten network learning rules. We use this principle to adapt extensions of the Herault-Jutten network, which have delays in the feedback path. These new networks unmix signals which have been mixed with single delays or via filtering. Acknowledgements We would like to thank Kannan Parthasarathy for his assistance in some of the experiments. We would also like to thank David Feinstein, Sue Becker, and David Mackay for useful discussions. References Cardoso, J. F., (1989) "Blind Identification of Independent Components," Proceedings of the Workshop 011 Higher-Order Spectral Analysis, Vail, Colorado, pp. 157-160, (1989). Cohen, M. H., Pouliquen, P.O., Andreou, A. G., (1992) "Analog VLSI Implementation of an Auto-Adaptive Network for Real-Time Separation of Independent Signals," Advances in Neural Information Processing Systems 4, Morgan-Kaufmann, San Mateo, CA. Gill, P. E., Murray, W., Wright, M. H., (1981) Practical Optimization, Academic Press, London. Herault, J., J utten, C., (1986) "Space or Time Adaptive Signal Processing by Neural Network Models," Neural Networks for Computing, AlP Conference Proceedings 151, pp. 207-211, Snowbird, Utah. Jutten, C., Thi, L. N., Dijkstra, E., Vittoz, E., Caelen, J., (1991) "Blind Separation of Sources: an Algorithm for Separation of Convolutive Mixtures," Proc. Inti. Workshop on High Order Statistics, Chamrousse France, July 1991. Jutten, C., Herault, J., (1991) "Blind Separation of Sources, part I: An Adaptive Algorithm Based on Neuromimetic Architecture," Signal Processing, vol. 24, pp. 1-10. Sorouchyari, E., (1991) "Blind Separation of Sources, Part III: Stability analysis," Signal Processing, vol. 24, pp. 21-29. Vittoz, E. A., Arreguit, X., (1989) "CMOS Integration of Herault-Jutten Cells for Separation of Sources," Proc. Workshop on Analog VLSI and Neural Systems, Portland, Oregon, May 1989. Widrow, B., Glover, J., McCool, J., Kaunitz, J., Williams, C., Hearn, R., Zeidler, J., Dong, E., Goodlin, R., (1975) "Adaptive Noise Cancelling: Principles and Applications," Proc. IEEE, vol. 63, no. 12, pp. 1692-1716.

PART XI IMPLEMENTATION