Microphone Array Design and Beamforming

Similar documents
Microphone Array Signal Processing for Robot Audition

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

arxiv: v1 [cs.sd] 4 Dec 2018

Michael Brandstein Darren Ward (Eds.) Microphone Arrays. Signal Processing Techniques and Applications. With 149 Figures. Springer

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

BFGUI: AN INTERACTIVE TOOL FOR THE SYNTHESIS AND ANALYSIS OF MICROPHONE ARRAY BEAMFORMERS. M. R. P. Thomas, H. Gamper, I. J.

RIR Estimation for Synthetic Data Acquisition

MULTIMODAL BLIND SOURCE SEPARATION WITH A CIRCULAR MICROPHONE ARRAY AND ROBUST BEAMFORMING

Recent Advances in Acoustic Signal Extraction and Dereverberation

ROBUST SUPERDIRECTIVE BEAMFORMER WITH OPTIMAL REGULARIZATION

The LOCATA Challenge Data Corpus for Acoustic Source Localization and Tracking

Sound Source Localization using HRTF database

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi,

Real-time Adaptive Concepts in Acoustics

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR

Introduction to distributed speech enhancement algorithms for ad hoc microphone arrays and wireless acoustic sensor networks

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

A BROADBAND BEAMFORMER USING CONTROLLABLE CONSTRAINTS AND MINIMUM VARIANCE

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

The psychoacoustics of reverberation

Auditory System For a Mobile Robot

ONE of the most common and robust beamforming algorithms

Advanced delay-and-sum beamformer with deep neural network

LOCAL RELATIVE TRANSFER FUNCTION FOR SOUND SOURCE LOCALIZATION

Sound Parameter Estimation in a Security System

Automatic Text-Independent. Speaker. Recognition Approaches Using Binaural Inputs

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events

Audio Engineering Society. Convention Paper. Presented at the 131st Convention 2011 October New York, NY, USA

1856 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 7, SEPTEMBER /$ IEEE

SYNTHESIS OF DEVICE-INDEPENDENT NOISE CORPORA FOR SPEECH QUALITY ASSESSMENT. Hannes Gamper, Lyle Corbin, David Johnston, Ivan J.

Broadband Microphone Arrays for Speech Acquisition

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Advances in Direction-of-Arrival Estimation

PERFORMANCE COMPARISON BETWEEN STEREAUSIS AND INCOHERENT WIDEBAND MUSIC FOR LOCALIZATION OF GROUND VEHICLES ABSTRACT

Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments

Blind Blur Estimation Using Low Rank Approximation of Cepstrum

2112 J. Acoust. Soc. Am. 117 (4), Pt. 1, April /2005/117(4)/2112/10/$ Acoustical Society of America

ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES

OPTIMUM POST-FILTER ESTIMATION FOR NOISE REDUCTION IN MULTICHANNEL SPEECH PROCESSING

Design of Robust Differential Microphone Arrays

Audio Imputation Using the Non-negative Hidden Markov Model

MEASUREMENT-BASED MODAL BEAMFORMING USING PLANAR CIRCULAR MICROPHONE ARRAYS

TARGET SPEECH EXTRACTION IN COCKTAIL PARTY BY COMBINING BEAMFORMING AND BLIND SOURCE SEPARATION

Omnidirectional Sound Source Tracking Based on Sequential Updating Histogram

Using sound levels for location tracking

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings

Calibration of Microphone Arrays for Improved Speech Recognition

Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function

/$ IEEE

Direction of Arrival Algorithms for Mobile User Detection

Recurrent Timing Neural Networks for Joint F0-Localisation Estimation

Different Approaches of Spectral Subtraction Method for Speech Enhancement

From Monaural to Binaural Speaker Recognition for Humanoid Robots

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method

Sound Source Localization in Median Plane using Artificial Ear

Smart antenna for doa using music and esprit

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues

Robust Low-Resource Sound Localization in Correlated Noise

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Robust Near-Field Adaptive Beamforming with Distance Discrimination

MIMO CHANNEL OPTIMIZATION IN INDOOR LINE-OF-SIGHT (LOS) ENVIRONMENT

COMPARISON OF MICROPHONE ARRAY GEOMETRIES FOR MULTI-POINT SOUND FIELD REPRODUCTION

Research Article DOA Estimation with Local-Peak-Weighted CSP

Improving reverberant speech separation with binaural cues using temporal context and convolutional neural networks

A classification-based cocktail-party processor

Introduction to Audio Watermarking Schemes

FP6 IST

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION

Uplink and Downlink Beamforming for Fading Channels. Mats Bengtsson and Björn Ottersten

Book Chapters. Refereed Journal Publications J11

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Wave Field Analysis Using Virtual Circular Microphone Arrays

Selected Research Signal & Information Processing Group

AN ADAPTIVE MICROPHONE ARRAY FOR OPTIMUM BEAMFORMING AND NOISE REDUCTION

A Frequency-Invariant Fixed Beamformer for Speech Enhancement

Proceedings of Meetings on Acoustics

SPHERICAL MICROPHONE ARRAY BASED IMMERSIVE AUDIO SCENE RENDERING. Adam M. O Donovan, Dmitry N. Zotkin, Ramani Duraiswami

Subspace Noise Estimation and Gamma Distribution Based Microphone Array Post-filter Design

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Ivan Tashev Microsoft Research

Single channel noise reduction

Lecture 14: Source Separation

Spectral Methods for Single and Multi Channel Speech Enhancement in Multi Source Environment

Binaural Speaker Recognition for Humanoid Robots

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 5, MAY

3D sound in the telepresence project BEAMING Olesen, Søren Krarup; Markovic, Milos; Madsen, Esben; Hoffmann, Pablo Francisco F.; Hammershøi, Dorte

An analysis of blind signal separation for real time application

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks

HRIR Customization in the Median Plane via Principal Components Analysis

Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays

Transcription:

Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial on Embodied Audition for Robots August 31, 2015

Introduction Overview Microphone Array Design measure for array performance array design for Spherical Harmonics (SHs) new robot head design Beamforming robust least-squares beamformer design HRTF-based design for robots evaluation for Automatic Speech Recognition (ASR) Conclusions 2

Introduction Beamforming is used by robots to enhance the performance of the automatic speech recognition (ASR) Numerous publications about beamformers design and related approaches, e.g., [Brandstein & Ward, 2001, Van Trees 2002, Benesty et al, 2008] fixed beamformer designs adaptive beamforming blind source separation (BSS)... Which issues are special for a beamformer design in the context of robot audition? 3

Design Aspects for Robot Audition Construction of the robot (head) How many microphones are needed/feasible? What are the optimal microphone positions? Beamforming with head microphones influence of the head (no free-field) presence of ego-noise ( tutorial talk on signal enhancement) influence of robot movements, especially head rotations Localization and tracking of the desired speaker tutorial talks by C. Evers and R. Horaud 4

Optimal Microphone Positions Many robot systems use 2 microphones to mimic the human auditory system [Argentieri et al., 2013] Can we do better than with only two ears? How can we determine the optimal microphone placement? Measure for array performance in dependence of sensor positions presented in [Tourbabin & Rafaely, 2014] 5

Generalized HRTF Matrix Model for generalized head-related transfer function (HRTF) complex amplitude of far-field source complex pressure amplitudes transfer function between source j and sensor l 6

Generalized HRTF Matrix Representation for D sources, M sensors, K frequency points Compact matrix formulation 7

Generalized HRTF Matrix Singular Value Decomposition (SVD) of matrix with generalized head related transfer functions (GHRTFs) Observation information to construct from mainly contained by the most dominant eigenvectors high effective rank of GHRTF matrix indicates a good sensor placing 8

Optimal Sensor Placement The effective rank of the GHRTF matrix is given by Optimal microphone positions obtained by Solution can be found by genetic algorithm optimization 9

Optimal Sensor Placement Simulation example effective rank of GHRTF matrix for single positions on head surface Relation between effective rank and beamformer robustness as well as DOA estimation accuracy derived in [Tourbabin & Rafaely, 2014] 10

Array Design for Spherical Harmonics (SH) A nearly ball-shaped robot head with many microphones motivates array designs in the Spherical Harmonic (SH) domain An approach related to the previous method developed for this case optimal position founds by minimizing the aliasing level for different positions outline of this concept provided by the appendix, but a detailed treatment of SH exceeds this tutorial 11

New Head Array Design for Nao Robot Possible regions for microphone placement (green) determined by mechanical constraints of the manufacturer Positions considered for the optimization 327 positions on a simulated head 12

New Head Array Design for Nao Robot Simulation results (BG University) Layout for new Nao head (Aldebaran) slight deviations from optimal position due to mechanical constraints 13

New Head Array Design for Nao Robot First prototype head 14

Beamformer Design for Robot Audition Review: Filter-and-Sum Beamformer Beampattern 15

Robust Beamformer Design Robust Least-Squares Frequency-Invariant (RLSFI) Design [Mabande et al, 2009] 16

Design Example RLSFI beamformer design for different WNG thresholds beampattern for free-field response trade-off between spatial selectivity and robustness (WNG) 17

Design Example RLSFI beamformer design for beampattern for HRTF-based response reduced spatial selectivity and distortions in look direction HRTFs should be considered in the design! 18

HRTF-Based Beamformer Design HRTF-based RLSFI Design [Barfuss et al, 2015] 19

Design Example HRTF-based RLSFI design for different WNG thresholds HRTF-based design free-field design distortionless response in look direction similar spatial selectivity as for free-field design 20

Experimental Evaluation Setup ASR Pocket Sphinx trained on clean speech of GRID corpus test corpus contained 200 utterances signal quality evaluated by frequency weighted segmental SNR (fwsegsnr) [Hu & Loizou, 2008] interference at 45 o and desired speaker between 0 o and 180 o room impulse responses measured for a T60 of 190ms 21

Results scenario with speaker at 30 o and 60 o most challenging better performance for HRTF-based design in almost all cases Experimental Evaluation 22

Conclusions Beamformer design for robot audition requires tailored algorithms and designs Optimal microphone positions can be determined by the effective rank of GHRTF matrix can be found for a SH design by minimizing the aliasing level Beamformer design based on common free-field assumption leads to inferior results knowledge about the HRTF of the robot should be incorporated in the design 23

Appendix Array Design for Spherical Harmonics (SHs) How to find the best microphone positions for SHs Concept [Tourbabin et al., 2015] 24

Appendix Array Design for Spherical Harmonics Example for matrix aliasing level for sensor positions given by ratio of highest off-line element and corresponding diagonal element Optimal microphone positions obtained by 25

Appendix White Noise Gain for HRTF-based Design Maximal possible WNG f or free-field (delay-and-sum beamformer) lower maximal WNG for HRTF-based design 26

Appendix References [Argentieri et al., 2013] S. Argentieri, A. Portello, M. Bernard, P. Danés, and B. Gas, Binaural Systems in Robotics, in The Technology of Binaural Listening, J. Blauert, Ed., Modern Acoustics and Signal Processing, pp.225 253, Springer [Brandstein & Ward, 2001]: M. Brandstein and D. Ward (Eds.): Microphone Arrays, Springer, 2001 [Van Trees 2002]: H. L. Van Tress: Optimum Array Processing (Detection, Estimation and Modulation Theory, Part IV), Wiley Intersience [Benesty et al., 2008] J. Benesty, J. Chen, and Y. Huang: Microphone Array Signal Processing, Springer [Tourbabin & Rafaely, 2014] V. Tourbabin and B. Rafaely: Theoretical Framework for the Optimization of Microphone Array Configuration for Humanoid Robot Audition, IEEE Trans. on Audio, Speech, and Language Processing, vol. 22, no.12 [Mabande et al., 2009] E. Mabande, A. Schad, and W. Kellermann: Design of Robust Superdirective Beamformers as a Convex Optimization Problem, IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan [Barfuss et al., 2015] H. Barfuss, C. Huemmer, G. Lamani, A. Schwarz, and W. Kellermann: HRTF- Based Robust Least-Squares Frequency-Invariant Beamforming, Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA [Hu & Loizou, 2008] Y. Hu and P.C. Loizou: Evaluation of Objective Quality Measures for Speech Enhancement, IEEE Trans. on Audio, Speech, and Language Processing, vol.16, no.1, pp.229-238 27

Acknowledgment The research leading to these results has received funding from the European Union's Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 609465. 28