Gerhard Schmidt / Tim Haulick: Recent Trends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008

Speech Communication Channels in a Vehicle
Into the vehicle
Within the vehicle
Out of the vehicle

Speech Enhancement for the Communication Channels
Receive side processing: gain control
In-car communication: feedback suppression, automatic gain adjustment
Send side processing: beamforming, echo and noise suppression
Diagram labels: Phone, Speech dialog system

Boundary Conditions for Automotive Speech Enhancement
Often the car is owned by the driver or the passenger: the communication channels within the vehicle and into the vehicle should be the focus for improving the perceived system quality.
Often a car is used by only a few (3 to 5) people: speaker-dependent speech processing might be a good choice.
Often the speech signals of the passengers who use the car can be recorded in periods with high SNR: speaker-dependent speech models might be extracted in periods of high SNR and used in periods of low SNR.

Extended Speech Enhancement
Receive side processing: gain control, bandwidth extension, adaptive equalization
Send side processing: beamforming, echo and noise suppression, speech reconstruction
Diagram labels: Phone, Speech dialog system, Speaker (in-)dependent speech knowledge, Speaker independent speech knowledge

Speech Enhancement in the Receiving Path (1)
Block diagram labels: Speaker independent speech knowledge, Phone (downlink), Receive side processing, To loudspeaker, Control for receive side processing, Analysis filter bank, Noise PSD, Echo PSD, Echo cancellation, Phone (uplink), Synthesis filter bank, Residual echo and noise suppression, Analysis filter bank, Microphone

Speech Enhancement in the Receiving Path (2)
Receive side processing blocks: Packet loss correction, Noise suppression, Bandwidth extension, Adaptive equalization, Adaptive limiter
Other diagram labels: Phone (downlink), To loudspeaker, Control for receive side processing, Speaker independent speech knowledge, Noise PSD, Echo PSD

Bandwidth Extension Basic Principle
Block diagram labels: Input signal, Low-frequency extension, Lowpass filter, High-frequency extension, Highpass filter, Speaker independent speech knowledge, Output signal
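The high-frequency branch of this structure can be illustrated with a minimal sketch. It uses spectral folding (mirroring the narrowband spectrum into the new upper band by zero insertion), which is a classical excitation-extension technique chosen here for illustration and not necessarily the authors' method. The function name `extend_high_band`, the fixed `high_gain`, and the brick-wall FFT-domain filters are assumptions; the slide's speech-knowledge-based control and the low-frequency branch are not modelled.

```python
import numpy as np

def extend_high_band(x, fs_in=8000, high_gain=0.25):
    """Double the sampling rate and fill the new upper band by spectral folding.

    Hypothetical sketch: a real system would shape the new band with an
    estimated spectral envelope instead of a fixed gain.
    """
    x = np.asarray(x, dtype=float)

    # Zero insertion doubles the rate and mirrors the spectrum around fs_in / 2.
    y = np.zeros(2 * len(x))
    y[::2] = x
    fs_out = 2 * fs_in

    # Factor 2 compensates the amplitude loss caused by the inserted zeros.
    spectrum = 2.0 * np.fft.rfft(y)
    freqs = np.fft.rfftfreq(len(y), d=1.0 / fs_out)

    # Below the old Nyquist frequency the input passes unchanged (lowpass branch);
    # above it the mirrored image is kept, scaled by high_gain (highpass branch).
    weights = np.where(freqs < fs_in / 2.0, 1.0, high_gain)
    return np.fft.irfft(spectrum * weights, n=len(y)), fs_out

# Usage: a 1 kHz tone sampled at 8 kHz gains a mirrored component at 7 kHz.
n = np.arange(800)
tone = np.cos(2 * np.pi * 1000 * n / 8000)
wideband, fs_out = extend_high_band(tone)
```

The fixed gain is the weakest part of this sketch: it keeps the example short, but it is exactly where speech knowledge would enter in a practical system.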

Bandwidth Extension Example
Narrowband connection (current standard): bandwidth extension for narrowband speech signals (bandwidth 3.4 to 3.8 kHz), with extension of the low-frequency components and extension of the high-frequency components up to 5.5 or 8 kHz.
Examples shown: narrowband input, narrowband output, wideband input.
Wideband connection: bandwidth extension for wideband speech signals (bandwidth 7 kHz, e.g. AMR wideband codec G.722.2), with extension of the high-frequency components up to 11 kHz.
Example shown: wideband input.

Adaptive Equalization Basic Principle
Block diagram labels: Input signal, Echo PSD, Noise PSD, Low-order FIR filter (shape part), Power normalization, Gain part, Computation of the gain and the shape, Output signal
Gain adjustment: The echo power and the background noise power are analyzed, and a gain correction is computed in order to achieve a predefined SNR in the passenger compartment.
Shape adjustment: In addition to the short-term powers, the short-term spectra of the noise and the echo are analyzed, and a correction filter is designed that boosts frequencies with low SNR while slightly attenuating those with good SNR. The filter design is recomputed 10 to 20 times per second.
Speech intelligibility can thus be improved while the loudness of the output signal is maintained.
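The gain and shape computation can be sketched per analysis frame as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the function name `adaptive_equalizer`, the target SNR, all limits, the "deviation from the mean band SNR" rule for the shape, and the use of signal power as a stand-in for loudness are hypothetical. The slide's low-order FIR filter design is not shown; the sketch stops at per-band power gains.

```python
import numpy as np

def adaptive_equalizer(speech_psd, noise_psd, echo_psd, target_snr_db=10.0,
                       max_gain_db=12.0, max_boost_db=6.0, max_cut_db=3.0):
    """Return (gain_db, shape) for one frame; all PSDs are per-band arrays."""
    eps = 1e-12
    speech = np.asarray(speech_psd, dtype=float)
    disturbance = (np.asarray(noise_psd, dtype=float)
                   + np.asarray(echo_psd, dtype=float))

    # Gain part: broadband correction that moves the short-term SNR to the target.
    # Assumption: the correction only amplifies and is limited to max_gain_db.
    snr_db = 10.0 * np.log10((speech.sum() + eps) / (disturbance.sum() + eps))
    gain_db = float(np.clip(target_snr_db - snr_db, 0.0, max_gain_db))

    # Shape part: boost bands whose SNR is below the mean band SNR,
    # cut bands above it (limits apply before normalization).
    band_snr_db = 10.0 * np.log10((speech + eps) / (disturbance + eps))
    shape_db = np.clip(band_snr_db.mean() - band_snr_db, -max_cut_db, max_boost_db)
    shape = 10.0 ** (shape_db / 10.0)

    # Power normalization: the shaped speech keeps the power of the unshaped speech.
    shape *= speech.sum() / ((speech * shape).sum() + eps)
    return gain_db, shape

# Usage: flat speech spectrum, noise concentrated in the lowest band.
gain_db, shape = adaptive_equalizer([1, 1, 1, 1], [1, 0.1, 0.01, 0.01], [0, 0, 0, 0])
```

Calling this function 10 to 20 times per second, with smoothed PSD estimates as input, would match the update rate stated on the slide.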

Adaptive Equalization Example
Measurement: The signal was recorded in an accelerating vehicle while entering a motorway.
Figure annotation: Intelligibility improvement

Speech Enhancement in the Sending Path (1)
Motivation: At medium and high speeds the SNR at low frequencies often drops below the 0 dB threshold. Standard noise suppression schemes therefore can only attenuate these frequencies. A reconstruction approach is an alternative for further improving the speech quality: speech reconstruction starts where conventional noise reduction fails.
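Why suppression can only attenuate at low SNR can be seen from a textbook gain rule. The sketch below uses the Wiener gain G = SNR / (1 + SNR) with a spectral floor; this rule and the floor value are assumptions chosen for illustration, since the slides do not state which suppression scheme is used.

```python
import numpy as np

def wiener_gain_db(snr_db, floor_db=-12.0):
    """Amplitude gain in dB of a Wiener-type suppression filter with a floor."""
    snr = 10.0 ** (np.asarray(snr_db, dtype=float) / 10.0)
    gain = snr / (1.0 + snr)
    # The floor limits the maximum attenuation (assumed value, not from the slides).
    return np.maximum(20.0 * np.log10(gain), floor_db)

# Gains for bands at -10 dB, 0 dB and 20 dB SNR.
print(wiener_gain_db([-10.0, 0.0, 20.0]))
```

At 0 dB SNR the gain is already about -6 dB, and below that the filter sits at its floor, so the speech component in those bands is attenuated together with the noise rather than recovered.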

Speech Enhancement in the Sending Path (2)
Block diagram labels: Receive side processing, Phone (downlink), To loudspeaker, Speaker (in-)dependent speech knowledge, Analysis filter bank, Echo cancellation, Speech reconstruction, Phone (uplink), Microphone, Synthesis filter bank, Mixer, Residual echo and noise suppression, Analysis filter bank
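The diagram shows a mixer that combines the reconstructed speech with the output of the residual echo and noise suppression, but the mixing rule is not given. One plausible sketch, assuming a per-band crossfade controlled by the estimated SNR, is shown below; the function name `mix_bands` and the thresholds `lo_db` and `hi_db` are hypothetical.

```python
import numpy as np

def mix_bands(suppressed, reconstructed, snr_db, lo_db=0.0, hi_db=6.0):
    """Crossfade per band between suppressed and reconstructed spectra.

    Assumed rule: bands at or below lo_db use the reconstruction only,
    bands at or above hi_db use the suppressed signal only,
    with a linear transition in between.
    """
    snr_db = np.asarray(snr_db, dtype=float)
    weight = np.clip((snr_db - lo_db) / (hi_db - lo_db), 0.0, 1.0)
    return weight * np.asarray(suppressed) + (1.0 - weight) * np.asarray(reconstructed)

# Usage: three bands at -5 dB, 3 dB and 10 dB SNR.
mixed = mix_bands([1.0, 1.0, 1.0], [5.0, 5.0, 5.0], [-5.0, 3.0, 10.0])
```

Because the weights are real-valued, the same function works on complex subband coefficients between the analysis and synthesis filter banks.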

Speech Reconstruction Audio Examples
Microphone signal
Conventional noise suppression
Mixed suppression and reconstruction

Final Remark
Additional information:
Harman/Becker Automotive Systems, Acoustic Signal Processing
Söflinger Str. 100, 89077 Ulm, Germany
Gerhard Schmidt, geschmidt@harmanbecker.com
Tim Haulick, thaulick@harmanbecker.com