A Computational Efficient Method for Assuring Full Duplex Feeling in Hands-free Communication

Similar documents
A Computational Efficient Method for Assuring Full Duplex Feeling in Hands- Free Communication

THE problem of acoustic echo cancellation (AEC) was

FOURIER analysis is a well-known method for nonparametric

ROBUST echo cancellation requires a method for adjusting

Digital Signal Processing Methods and Algorithms for Audio Conferencing Systems

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

A FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM FOR DUCTS USING A PASSIVE SILENCER TO REDUCE ACOUSTIC FEEDBACK


Low-Complexity Algorithms. Audio Conferencing Systems. Christian Schüldt. Blekinge Institute of Technology Doctoral Dissertation Series No.

Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008

Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking

An Efficient and Flexible Structure for Decimation and Sample Rate Adaptation in Software Radio Receivers

GSM Interference Cancellation For Forensic Audio

A hybrid acoustic echo canceller and suppressor

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 50, NO. 12, DECEMBER

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set

EE 6422 Adaptive Signal Processing

ADAPTIVE NOISE CANCELLING IN HEADSETS

A Low-Power Broad-Bandwidth Noise Cancellation VLSI Circuit Design for In-Ear Headphones

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco

EXPERIMENTAL INVESTIGATIONS OF DIFFERENT MICROPHONE INSTALLATIONS FOR ACTIVE NOISE CONTROL IN DUCTS

A SYSTEM IMPLEMENTATION OF AN ACTIVE NOISE CONTROL SYSTEM COMBINED WITH PASSIVE SILENCERS FOR IMPROVED NOISE REDUCTION IN DUCTS SUMMARY INTRODUCTION

ARTICLE IN PRESS. Signal Processing

SPEECH communication among passengers in large motor

ZLS38500 Firmware for Handsfree Car Kits

Digital Signal Processing of Speech for the Hearing Impaired

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

DESIGN AND IMPLEMENTATION OF ADAPTIVE ECHO CANCELLER BASED LMS & NLMS ALGORITHM

Implementation of Optimized Proportionate Adaptive Algorithm for Acoustic Echo Cancellation in Speech Signals

Title. Author(s)Sugiyama, Akihiko; Kato, Masanori; Serizawa, Masahir. Issue Date Doc URL. Type. Note. File Information

Implementation of decentralized active control of power transformer noise

Mel Spectrum Analysis of Speech Recognition using Single Microphone

COMPARATIVE STUDY OF VARIOUS FIXED AND VARIABLE ADAPTIVE FILTERS IN WIRELESS COMMUNICATION FOR ECHO CANCELLATION USING SIMULINK MODEL

Automotive three-microphone voice activity detector and noise-canceller

Isolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

AUTOMATIC EQUALIZATION FOR IN-CAR COMMUNICATION SYSTEMS

Optimal Adaptive Filtering Technique for Tamil Speech Enhancement

CHAPTER 2 FIR ARCHITECTURE FOR THE FILTER BANK OF SPEECH PROCESSOR

Acoustic echo cancellers for mobile devices

Architecture design for Adaptive Noise Cancellation

Non-coherent pulse compression - concept and waveforms Nadav Levanon and Uri Peer Tel Aviv University

APPLICATIONS OF ACOUSTIC ECHO CONTROL AN OVERVIEW

Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems

(51) Int Cl.: H04M 9/08 ( ) (56) References cited:

Design and Implementation of Adaptive Echo Canceller Based LMS & NLMS Algorithm

Acoustic Echo Cancellation: Dual Architecture Implementation

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

int.,.noil. 1989December

On The Achievable Amplification of the Low Order NLMS Based Adaptive Feedback Canceller for Public Address System

Broadband Microphone Arrays for Speech Acquisition

A Comparison of the Convolutive Model and Real Recording for Using in Acoustic Echo Cancellation

Keywords: Adaptive filtering, LMS algorithm, Noise cancellation, VHDL Design, Signal to noise ratio (SNR), Convergence Speed.

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Design and Performance Analysis of a Reconfigurable Fir Filter

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

An FPGA Based Architecture for Moving Target Indication (MTI) Processing Using IIR Filters

The Steering for Distance Perception with Reflective Audio Spot

Faculty of science, Ibn Tofail Kenitra University, Morocco Faculty of Science, Moulay Ismail University, Meknès, Morocco

Performance Enhancement of Adaptive Acoustic Echo Canceller Using a New Time Varying Step Size LMS Algorithm (NVSSLMS)

Available online at ScienceDirect. Anugerah Firdauzi*, Kiki Wirianto, Muhammad Arijal, Trio Adiono

SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES

Adaptive Filters Wiener Filter

Digitally controlled Active Noise Reduction with integrated Speech Communication

Adaptive Filters Application of Linear Prediction

Transmit Power Allocation for BER Performance Improvement in Multicarrier Systems

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Performance Analysis of Acoustic Echo Cancellation Techniques

ACTIVE NOISE CONTROL ON HIGH FREQUENCY NARROW BAND DENTAL DRILL NOISE: PRELIMINARY RESULTS

Audio Quality Terminology

REAL-TIME BROADBAND NOISE REDUCTION

works must be obtained from the IEE

Improving room acoustics at low frequencies with multiple loudspeakers and time based room correction

MINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE

Practical Limitations of Wideband Terminals

Wireless Sensing for Active Noise Control

16QAM Symbol Timing Recovery in the Upstream Transmission of DOCSIS Standard

Speech Enhancement using Wiener filtering

Low Power Approach for Fir Filter Using Modified Booth Multiprecision Multiplier

Continuously Variable Bandwidth Sharp FIR Filters with Low Complexity

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

Acoustic Echo Cancellation using LMS Algorithm

A Novel Hybrid Technique for Acoustic Echo Cancellation and Noise reduction Using LMS Filter and ANFIS Based Nonlinear Filter

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

Simple Impulse Noise Cancellation Based on Fuzzy Logic

XAP GWARE 119 M A T R I X. Acoustic Echo Canceller

Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt

Acoustic echo cancellers for mobile devices

Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System

Auditory modelling for speech processing in the perceptual domain

Technical features For internal use only / For internal use only Copy / right Copy Sieme A All rights re 06. All rights re se v r ed.

AIC3254 Acoustic Echo Cancellation (AEC)

MULTICHANNEL ACOUSTIC ECHO SUPPRESSION

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE

Noise Reduction using Adaptive Filter Design with Power Optimization for DSP Applications

II Year (04 Semester) EE6403 Discrete Time Systems and Signal Processing

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter

Acoustic Echo Reduction Using Adaptive Filter: A Literature Review

HOW TO CHOOSE AN ACOUSTIC ECHO CANCELLER

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation

Transcription:

A Computational Efficient Method for Assuring Full Duplex Feeling in Hands-free Communication FREDRIC LINDSTRÖM 1, MATTIAS DAHL, INGVAR CLAESSON Department of Signal Processing Blekinge Institute of Technology S-372 25 Ronneby Sweden http://www.bth.se/its Abstract: - This report proposes a new method for obtaining satisfying "full-duplex feeling" in hands-free communication units at low computational cost. The proposed method uses a combination of an acoustic echo cancellation unit and an adaptive gain unit. The core of the method is to divide the speech signal into two frequency bands and to process these in different manners. Acoustic echoes in the low frequency band of the signal are cancelled by means of an acoustic echo cancellation unit, while acoustic echoes in the high frequency part are suppressed by an adaptive gain unit. In order to evaluate the use of the proposed method to extend the bandwidth of an existing hands-free phone a real-time implementation of the proposed method is compared with a real-time implementation of a conventional hands-free phone. The evaluation indicates that the extended method can be used to increase the quality of a hands-free phone, with only a small increase in computational demand. Key-Words: - Hands-free, Full-Duplex, Half-Duplex, Sub-band, Acoustic Echo Cancellation, AEC, AEC-AG 1 Introduction Hands-free phones are preferred in many situations, e.g. in the car, at the office, at home, or in the conference room. A hands-free phone call takes part between the participants located in the same room/car as the hands-free phone, the near-end talkers, and participants at a remote location, the far-end talkers. Due to the strong acoustic coupling between the loudspeaker and the microphone in the hands-free phone acoustic echoes might arise. If no countermeasure is taken to suppress the acoustic echo, far-end talkers are presumed to hear an echo of their own voice, which is in general considered to be very annoying. One type of solutions to the acoustic echo problem are those based on the system identification scheme, [1]. A common solution, conforming to the system identification scheme, is often denoted an Acoustic Echo Canceler (AEC), [2]. Other acoustic echo cancellers using adaptive methods, such as microphone array based systems, [3], can also be categorized as acoustic echo cancellers. In the sequel an AEC will denote a solution conforming to a system identification scheme as defined in [1]. Many variations of the AEC are based on different algorithms for the adaptation of the system. Algorithms such as the Normalized Least Mean Squares(NLMS), the Recursive Least Squares(RLS), or the Affined Projection Algorithm(APA), [4]. The performance of an AEC is linked to the estimation of certain parameters, such as speech activity, acoustic coupling etc, [5]. The calculation of certain speech activity parameters is crucial for most AEC systems, in particular double-talk detection. Several double-talk detectors have been proposed, e.g. the classical Giegel detector, [6], detectors based on cross-correlation, [7], [8], cepstral distance, [5], or power measurements, [5]. Solutions to the acoustic echo problem can be categorized into Half-DupleX (HDX) and Full- DupleX (FDX) solutions. An objective definition of these two concept is to define an HDX solution as a system only allowing sound to stream in one direction at a time and an FDX solution as a system allowing talk to stream undamped in both directions simultaneously. Many real-time implementations claiming FDX does not fulfill this requirement, i.e. that the signal should stream undamped in both directions all the time. Further many real-time solutions characterized as HDX allows simultaneously streaming in both directions of more or less damped signals. A psychoacoustic approach to the HDX and FDX notation problem is to characterize implementations with "FDX-feeling" as implementations for which it is possible for the farend and near-end talkers to speak simultaneously and to interrupt each other without any perceptible 1 Also with Konftel Technology AB http://www.konfteltech.com

degradation of the perceived overall quality of the call. Implementations with "HDX-feeling" will thus be those that do not have "FDX-feeling". The concept of "FDX-feeling" and "HDX-feeling" is truly subjective. Still it is an attractive concept, since it provides a notation that can be used to separate handsfree systems with satisfying quality from those characterized by switching effects and interrupted communication. In this report we propose a method for implementing a hands-free phone, providing fullduplex feeling with a low computational demand, the low computational demand makes the method suitable for bandwidth extension of already existing systems. 2 The Environment The AEC environment can be described using the following notation, the far-end signal, x(t), the nearend signal, y(t), the acoustic echo, e(t), and near-end generated sound, v(t), see Fig.1. Near-end sound consists of the sum of the near-end talk, s(t), and nearend noise, n(t), e.g. fan noise, after they have been filtered through the room and the microphone. The near-end talk s(t) consists of a direct path part and a part that reaches the microphone via a multitude of reflections. This reflected part is referred to as the near-end room reverbation signal. Near-end room reverbation can also degenerate the perceived speech quality, however it is the remedy of the acoustic echo that is treated in this report. The acoustic echo e(t) is the result of the transformation of the far-end signal x(t) as it passes through the loudspeaker, the room and the microphone. The near-end signal is thus a combination of the acoustic echo and the near-end generated sound, i.e. y(t)=e(t)+v(t). The combined influence from the loudspeaker, the room, and the microphone on a signal that passes through all of these is denoted the Loudspeaker Enclosure Microphone system (LEM-system). The combined operation of a filtering with the LEM-system and the adding of the near-end generated sound is denoted as the LEMN-system, see Fig.1. The LEMN-system can thus be seen as a transform of the far-end signal x(t) into the acoustic echo e(t) and the adding of the nearend generated sound v(t), in order to produce the nearend signal y(t). Four different situations of speech activity apply to a hands-free telephony. These are idle, i.e. nobody is talking, near-end single talk, i.e. only the near-end talkers are talking, far-end single talk, i.e. only the far-end talkers are talking, and finally double talk, i.e. both sides are talking simulataneously. Fig.1 Signals and systems applying to the AEC, LEM-system (dashed), LEMN-system (dotted). Fig.2 The AEC solution 3 The AEC An AEC based on a system identification scheme is depicted in Fig.2. It consists of an Adaptive Processing Unit (APU), and a summation unit, Σ. The signals present in an AEC are: the far-end signal, x(.), the near-end signal, y(.), an estimated near-end signal, ŷ (.), i.e. the output from the APU, and finally an error signal, y e (.), with y e (.)=y(.)- ŷ (.). The purpose of such an AEC is to adapt the transfer characteristics of the APU to be similar to the LEM. Optimum is said to be achieved when the acoustic echo part of y e (.) is minimized in some given measure. Hence the signal y e (.) is often used as feedback in the adaptation algorithm of the APU. The NLMS algorithm is an often used algorithm for the implementation of the APU. The popularity of the NLMS algorithm is due to the low complexity and the high robustness to power variations in finite arithmetic precision, [4], [5].

Fig.3 The AEC-AG solution 4 The Proposed Method The basic idea of the method proposed in this report is to split the speech signals, (i.e. the far-end and the near-end signals), in two frequency bands, one that contains the lower frequencies and one that contains the higher frequencies, which are processed in different ways. The low frequency part is processed with a conventional full duplex AEC. Acoustic echoes in the low frequency band will therefore be cancelled and communication will not be interrupted in either direction. The high frequency part will be passed with a level dependent damping, i.e. high frequency echoes are suppressed with a damping factor. This implies that in a situation of acoustic echo the high frequencies of the near-end signal will be damped. The processing of the high frequency part can be done in a low computation consumption manner, since there is no need for full-band adaptive filtering. The method is depicted in Fig.3 where the far-end signal, x(.), is split up into a low frequency part, x L (.), and a high frequency part, x H (.) and likewise the near-end signal, y(.), is split up into y L (.) and y H (.). The low frequencies signals x L (.) and y L (.) are processed according to the NLMS AEC algorithm, other algorithms than the NLMS are of course possible for the implementation of the echo cancelling. The operation performed on the high frequency signals x H (.) and y H (.) will be an adaptive attenuation of y H (.) by an attenuation factor, D, resulting in a possibly damped signal, y Hdamp (.). The adaptation of D is processed by a Control Unit (CU), which sets the value of D dependent on the value of some given measure of the x H (.) signal, e.g. an estimation of the mean of x H (.) or x H (.) 2. The proposed method can thus be seen as a combination of the AEC method and an adaptive gain (AG) method, thus it is denoted the AEC-AG method. The only difference between the two models from a frequency aspect is the damping of the high frequency part of the near-end signal in situations of double talk. An AEC method implemented with the NLMS algorithm operating on a communication bandwidth of B Hz, will require a sampling frequency f s, with f s > 2B. With an echo cancelling duration of T seconds, the NLMS solution will require an adaptive FIR filter of length N T =Tf s. For every sample the echo canceler will require N T Digital Signal Processor (DSP) instructions 2 for the filtering with the adaptive filter and 2N T DSP instructions for the update of the coefficients of the adaptive filter. This gives that the total number of DSP instructions for the AEC method, I AEC, is I AEC = (Tf s )f s + (2Tf s )f s = 3Tf s 2 (1) Consider an AEC-AG based solution operating on the same communication bandwidth as the AEC solution, i.e. B Hz. Now we assume that for the AEC-AG solution we have a lower AEC band of size B/2 and an upper AG band of size B/2. This solution also uses the NLMS algorithm for echo cancellation in the lower AEC band. So the required sampling frequency for the lower band f' s can be set to f' s = f s /2. This implies that the echo cancelling processing in the AEC-AG will require I AEC-AG instructions with I AEC-AG = 3Tf' s 2 = 3Tf s 2 / 4 (2) There is approximately a factor of four less instructions needed in the proposed implementation. Real implementations of an AEC-AG solution would require an extra filtering to obtain the frequency split, this processing is however negligible in comparison with that of the acoustic echo cancellation. 5 Bandwidth Extension One of the most interesting applications of the AEC- AG method is to use it to extend the bandwidth of an existing AEC based hands-free phone. In order to explore the performance of such an extension two real-time implementations were tested. One, called solution S AEC, is an implementation of an AEC NLMS based hands-free phone with a communication 2 A DSP instruction is an instruction performed on a digital signal processor, which has the capability of multiplication with cumulative addition in one clock cycle, e.g. the AD-21xx, [9].

bandwidth of [300Hz, 3200Hz]. The other, solution S AEC-AG, is an AEC-AG NLMS based implementation with a communication bandwidth of [300Hz, 7000Hz], where the bandwidth of the AEC part is [300Hz, 3200Hz]. These limits for the communication bandwidths were chosen in accordance with the standards for PTSN and ISDN respectively, e.g. [10], [11]. The computational demand of solution S AEC and S AEC-AG are thus approximately equal. Note that in section 4 two systems of the same bandwidth were compared. Solution S AEC and S AEC-AG have different bandwidths, but approximately the same computational demand. Both solutions was implemented on an AD-2186 fix-point processor using an AD-73322 codec for D/A A/D conversions. Solution S AEC was implemented according to the scheme depicted in Fig.2, with a length of the adaptive FIR filter, of 1600 taps. The sample frequency was 8000Hz, thus corresponding to an echo cancelling length of 200ms. Solution S AEC-AG was implemented according to the scheme depicted in Fig.3 and has the exact same implementation for its low frequency band [300Hz, 3200Hz] as solution S AEC Thus the low frequency band of the S AEC-AG was working with a 8000Hz sample frequency while the upper band used a 16000Hz sample frequency. The high frequency near-end signal of solution S AEC-AG was damped according to the scheme described in section 4. In this particular implementation the attenuation D was chosen as D(k)=u(k), where k is the sample index and u(k) is an averaging function given by u(k)=(1-α 1 )u(k-1)+α 1 u(k) if u(k) > u(k-1) (3) u(k)=(1-α 2 )u(k-1)+α 2 u(k) if u(k) < u(k-1) (4) where α 1 > α 2, and u(k) is a function of x H (k), given by u(k)=lookup( x H (k) ). The function LOOKUP(.) is as depicted in Fig.4. The use different constants in the exponential averaging in equations (3) and (4) implies that the value of D(k) will increase rapidly in response to a sudden increase of the average of x H (k), while a sudden decrease will lead to a slowly decreasing value of D(k). This is done in order to make the value of D(k) correspond to the nature of an acoustic echo in a room, i.e. the acoustic echo is almost instant and has a duration. The function LOOKUP(.) is implemented as a look-up-table function, thereof its quantified appearance, see Fig.4. Note that even though u(k) consist of quantified values, D(k) is a more smooth function as a result of the averaging performed in equations(3) and (4). Fig.4 The function LOOKUP(.) 6 Setup, Evaluation, and Result The setup was done in a small office with an reverbation time of approximately 400ms in measures of RT60 3. Three loudspeakers were used to produce the near-end output signal of the hands-free phone, see Fig.5. An omnidirectional microphone was used to obtain the near-end input signal of the hands-free phone. The far-end input signal was obtained by an hand-held microphone situated in a neighboring room thus obtaining total acoustic isolation between the farend and near-end side. Two loudspeakers were placed in each room and connected to a two-channel DAT player, DAT1, see Fig.5. Channel 1 of DAT1 was connected to the loudspeaker in the far-end room, and channel 2 to the loudspeaker in the near-end room. Another two-channel DAT recorder, DAT2, was connected to the hands-free phone line-out and to the wire feeding the three hands-free phone loudspeakers. Channel 1 of DAT2 was connected to the speaker wire and channel 2 was connected to the line out. The phone was set to solution S AEC mode, and then a tape was played on DAT1 and at the same time DAT2 was set to record. The tape contained a session corresponding to the states: idle, near-end single talk, far-end single talk, and double talk, se Fig.6. The procedure was repeated with the phone set in solution S AEC-AG mode. Subjective real-time evaluation of the both methods was also performed. The subjective evaluation was performed in the following manner. One person placed her/himself at the near-end side, i.e. in the office, and another person placed her/himself at the far-end side, i.e. in the neighboring room. These person performed a natural conversation, containing sessions of double talk. Throughout the test repeated switches between solution S AEC and solution 3 RT60 defines the measure of reverbation time to be the time it takes for the sound level in a room to decrease with 60dB after an impulse.

S AEC-AG mode were performed. Fig.5 The setup Fig.7 and Fig.8 shows spectograms for the recorded phone loudspeaker and phone line out signals of the AEC and the AEC-AG solutions, respectively. In an optimal situation the recorded phone loudspeaker signal and phone line out signal should be identical to the far-end speech signal and near-end speech signal, respectively, i.e. the spectograms in Fig.7 and Fig.8 should be as similar as possible to that of Fig.6. A comparison of the spectral content of near-end and far-end signals of solution S AEC and solution S AEC- AG, in relation to the optimal solution, given in Fig.6, show that the AEC-AG solution gives more natural sounding signals than the AEC solution in that it also contains high frequency components. By observing the idle part of the spectograms in Fig.7 and Fig.8 we get a picture of the background noise present in the two solutions. For the idle part in Fig.7 and Fig.8 the channel 1 signal, i.e. the phone loudspeaker signal, contains noise in the region of 0 to 1000Hz, this noise originates from different noise sources in the far-end room, e.g. fan noise. The noise characteristic for the channel 2 signals, i.e. the phone microphone signal, is more white and spread over the entire spectra. This noise originates from internal electrical noise in the phone that is transferred by the phone loudspeaker and then received by the phone microphone. For the AEC-AG solution there is a difference in magnitude between the noise in the high and low frequencies. This is due to a noise reduction feature that has been implemented in combination with the echo cancellation performed on the low frequencies. So, the difference in background noise level between the high and low frequencies is not a result of the specific AEC-AG solution, but a result of that noise reduction is only implemented for the lower frequency band. In the far-end talk part of the spectograms we can get an inkling of that there is a small residual echo in both the AEC and the AEC-AG solutions, but these echoes have such low energy that they are not disturbing. In the channel 2 signal of the AEC-AG solution the function of the attenuation, D, described in section 5 can be observed, specially this can be seen from the characteristics of the background noise. When a speech signal occurs in channel 1 the background noise in the high frequency part of channel 2 is damped away, see Fig.8, e.g. compare the high frequency background noise in the channel 2 idle and far-end talk sections. During situations of double talk the AEC-AG solution will approaches that of the AEC solution for the near-end signal, i.e. during double talk the high frequencies of the near-end signal will be damped. This can be seen by comparing the near-end and double talk parts for channel 2 of the AEC-AG solution spectogram. The performed subjective evaluation clearly showed the advantages of the AEC-AG method. The 7kHz sound was perceived as of higher quality and the damping of the high frequency band was hardly noticeable. Note however that these tests were performed under rather modest near-end noise conditions and a full evaluation should also account for the performance of the systems in environments with high background noise. 7 Conclusion In this report a solution to the acoustic echo problem in hands-free communication was presented. With this method it is possible to construct hands-free phones with full-duplex feeling in a computational saving manner. Two real-time implementations of a handsfree phone were done with approximately the same demand of computational power, one based on a conventional AEC method and the other on the proposed method. These real-time implementations were done in order to evaluate the use of the proposed method to extend the bandwidth of an existing AEC conference phone. Frequency analysis of measured test signals as well as subjective evaluations showed that the new method can be used for bandwidth extension of an existing AEC phone at hardly no extra computational cost.

components. Fig.6 Spectogram for the input channel. Channel 1 is the far-end speech signal and channel 2 is the near-end speech signal. Fig.7 Spectogram for the recorded signals when the AEC solution is used. Channel 1 is the phone loudspeaker signal and channel 2 is the phone line-out signal. References: [1] B. Widrow, S. D. Stearns, Adaptive signal processing, Prentice-Hall, 1985 [2] M. M. Sondhi, "An adaptive echo canceler", Bell Syst. Tech. J., vol. 46, 1967, pp. 497-510. [3] M. Dahl, I. Claesson "Acoustic noise and echo canceling with microphone array", IEEE Trans. on Vehicular Technology, vol.48 no. 5, 1999, pp.1576-1581. [4] S. Haykin, Adaptive Filter Theory, 4ed., Prentice- Hall, 2002 [5] A. Mader, H. Puder, G. U. Schmidt, "Step-size control for acoustic cancellation filters an overview", Signal Processing, vol.80 2000, pp. 1697-1719. [6] D.L. Duttweiler, "A twelve-channel digital echo canceler", IEEE Trans. Commun., vol COM-26, 1978, pp. 647-653. [7] H. Ye, B. X. Wu, "A new double talk detection based on the orthognality theorem", IEEE Trans. Commun., vol. 39, 1991 pp. 1542-1545. [8] T. Gansler, M Hansson, C.-J. Ivarsson, G. Salomonsson, "A double-talk detector based on coherence", IEEE Trans. Commun., vol. 44, 1996, pp. 1421-1427. [9] ADSP-2100 Family User's Manual, 3ed., Analog Devices, 1995 [10] TBR21, European Telecommunications Standards Institute, 1998 [11] G.722, International Telecommunication Union, 1999 Fig.8 Spectogram for the recorded signals when the AEC-AG solution is used. Channel 1 is the phone loudspeaker signal and channel 2 is the phone line-out signal. By comparing the spectograms of Fig.7 and Fig.8 with the optimal spectogram given in Fig.6, it is clear that the AEC-AG solution gives a more natural sounding signal in that it also contains high frequency