Time-of-arrival estimation for blind beamforming

Similar documents
Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Joint Position-Pitch Decomposition for Multi-Speaker Tracking

Lab S-3: Beamforming with Phasors. N r k. is the time shift applied to r k

Robust direction of arrival estimation

Self-localization of Dynamic User-Worn Microphones From Observed Speech

WPI Precision Personnel Locator: Inverse Synthetic Array Reconciliation Tomography Performance. Co-authors: M. Lowe, D. Cyganski, R. J.

Speech Enhancement Using Microphone Arrays

Convention Paper Presented at the 131st Convention 2011 October New York, USA

EE1.el3 (EEE1023): Electronics III. Acoustics lecture 20 Sound localisation. Dr Philip Jackson.

Approaches for Angle of Arrival Estimation. Wenguang Mao

Localization in Wireless Sensor Networks

arxiv: v1 [cs.sd] 4 Dec 2018

Live multi-track audio recording

Passive Temporal Offset Estimation of Multichannel Recordings of an Ad-Hoc Microphone Array

Microphone Array Feedback Suppression. for Indoor Room Acoustics

ONE of the most common and robust beamforming algorithms

Self Localization of acoustic sensors and actuators on Distributed platforms. Abstract. 1. Introduction and Motivation

Acoustic Beamforming for Speaker Diarization of Meetings

Recent Advances in Acoustic Signal Extraction and Dereverberation

Speech enhancement with ad-hoc microphone array using single source activity

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

Direction-of-Arrival Estimation Using a Microphone Array with the Multichannel Cross-Correlation Method

Antennas and Propagation. Chapter 5c: Array Signal Processing and Parametric Estimation Techniques

Microphone Array project in MSR: approach and results

Joint recognition and direction-of-arrival estimation of simultaneous meetingroom acoustic events

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Calibration of Microphone Arrays for Improved Speech Recognition

Robust Speaker Identification for Meetings: UPC CLEAR 07 Meeting Room Evaluation System

SPEAKER CHANGE DETECTION AND SPEAKER DIARIZATION USING SPATIAL INFORMATION.

Adaptive Systems Homework Assignment 3

Ad hoc and Sensor Networks Chapter 9: Localization & positioning

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Robust Low-Resource Sound Localization in Correlated Noise

EXPERIMENTAL EVALUATION OF MODIFIED PHASE TRANSFORM FOR SOUND SOURCE DETECTION

Adaptive Beamforming Applied for Signals Estimated with MUSIC Algorithm

Microphone Array Power Ratio for Speech Quality Assessment in Noisy Reverberant Environments 1

Mikko Myllymäki and Tuomas Virtanen

arxiv: v1 [cs.sd] 7 Oct 2016

ADAPTIVE ANTENNAS. TYPES OF BEAMFORMING

ESTIMATION OF TIME-VARYING ROOM IMPULSE RESPONSES OF MULTIPLE SOUND SOURCES FROM OBSERVED MIXTURE AND ISOLATED SOURCE SIGNALS

N. Garcia, A.M. Haimovich, J.A. Dabin and M. Coulon

Room Impulse Response Measurement and Analysis. Music 318, Winter 2010, Impulse Response Measurement

Time Delay Estimation: Applications and Algorithms

Ultrasound Beamforming and Image Formation. Jeremy J. Dahl

MIMO II: Physical Channel Modeling, Spatial Multiplexing. COS 463: Wireless Networks Lecture 17 Kyle Jamieson

Learning Human Context through Unobtrusive Methods

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

Nicholas Chong, Shanhung Wong, Sven Nordholm, Iain Murray

Indoor Positioning by the Fusion of Wireless Metrics and Sensors

AKI TINAKARI PHYSICAL SIZE OF MICROPHONE ARRAYS IN AD-HOC BEAMFORMING. Master s thesis

POSSIBLY the most noticeable difference when performing

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

METIS Second Training & Seminar. Smart antenna: Source localization and beamforming

ROBUST SUPERDIRECTIVE BEAMFORMER WITH OPTIMAL REGULARIZATION

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

Performance Study of A Non-Blind Algorithm for Smart Antenna System

Spatial analysis of concert hall impulse responses

Accurate Three-Step Algorithm for Joint Source Position and Propagation Speed Estimation

Autonomous Underwater Vehicle Navigation.

Training neural network acoustic models on (multichannel) waveforms

MARQUETTE UNIVERSITY

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition

Channel Modeling ETIN10. Wireless Positioning

Experimental Characterization of a Large Aperture Array Localization Technique using an SDR Testbench

Clustered Multi-channel Dereverberation for Ad-hoc Microphone Arrays

Excelsior Audio Design & Services, llc

LOCALIZATION WITH GPS UNAVAILABLE

Understanding Advanced Bluetooth Angle Estimation Techniques for Real-Time Locationing

TARGET SPEECH EXTRACTION IN COCKTAIL PARTY BY COMBINING BEAMFORMING AND BLIND SOURCE SEPARATION

Chapter 2 Channel Equalization

Ultrasound Bioinstrumentation. Topic 2 (lecture 3) Beamforming

Eigenvalues and Eigenvectors in Array Antennas. Optimization of Array Antennas for High Performance. Self-introduction

Robust Speaker Recognition using Microphone Arrays

Passive Emitter Geolocation using Agent-based Data Fusion of AOA, TDOA and FDOA Measurements

Research Article Localization of Directional Sound Sources Supported by A Priori Information of the Acoustic Environment

TIME delay of arrival (TDOA) estimation is an essential

High-speed Noise Cancellation with Microphone Array

Asymptotically Optimal Detection/ Localization of LPI Signals of Emitters using Distributed Sensors

Collection of re-transmitted data and impulse responses and remote ASR and speaker verification. Igor Szoke, Lada Mosner (et al.

BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES USING SPATIALLY RESAMPLED OBSERVATIONS

A Hybrid TDOA/RSSD Geolocation System using the Unscented Kalman Filter

Ocean Ambient Noise Studies for Shallow and Deep Water Environments

ABSTRACT POSITION CALIBRATION OF ACOUSTIC SENSORS AND ACTUATORS ON DISTRIBUTED GENERAL PURPOSE COMPUTING PLATFORMS

REAL-TIME SRP-PHAT SOURCE LOCATION IMPLEMENTATIONS ON A LARGE-APERTURE MICROPHONE ARRAY

Characterization of Multi-carrier Locator Performance. Daniel E. Breen, Jr.

Performance of a Precision Indoor Positioning System Using a Multi-Carrier Approach

ECE 476/ECE 501C/CS Wireless Communication Systems Winter Lecture 6: Fading

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Sensor Data Fusion Using a Probability Density Grid

ECE 476/ECE 501C/CS Wireless Communication Systems Winter Lecture 6: Fading

ADAPTIVE ANTENNAS. NARROW BAND AND WIDE BAND BEAMFORMING

Real-time Adaptive Concepts in Acoustics

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

EVALUATION OF BINARY PHASE CODED PULSE COMPRESSION SCHEMES USING AND TIME-SERIES WEATHER RADAR SIMULATOR

Online Simultaneous Localization and Mapping of Multiple Sound Sources and Asynchronous Microphone Arrays

GEOMETRY CALIBRATION OF DISTRIBUTED MICROPHONE ARRAYS EXPLOITING AUDIO-VISUAL CORRESPONDENCES. Axel Plinge and Gernot A. Fink

Automotive three-microphone voice activity detector and noise-canceller

Michael E. Lockwood, Satish Mohan, Douglas L. Jones. Quang Su, Ronald N. Miles

Time-of-flight PET with SiPM sensors on monolithic scintillation crystals Vinke, Ruud

Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System

Transcription:

Time-of-arrival estimation for blind beamforming Pasi Pertilä, pasi.pertila (at) tut.fi www.cs.tut.fi/~pertila/ Aki Tinakari, aki.tinakari (at) tut.fi Tampere University of Technology Tampere, Finland

Presentation outline 1) Traditional beamforming / beam steering 2) Ad-hoc microphone arrays 3) Three ad-hoc array beam steering methods Time-of-Arrival (TOA) based solutions 4) Simulation of TOA accuracy 5) Measurements with an array of smartphones Accuracy of TOA estimation Obtained beamforming quality 2

Traditional Beamforming Linear combination of microphone signals X i (ω), where i =1,,M Requirements for steering the beam: 1) Array shape is known (mic. position matrix M) 2) Sensors are synchronous (time offset is zero/known) 3) Direction/position to steer the array is known or can be scanned e.g. based on energy. Simple Delay-and-Sum Beamformer (DSB) M 1 Y (ω) = exp(iωτ ) X i(ω) i=0 i time-shifting 3

Signal observation (near field) Sound Time-of-Flight (TOF) is τ i = m i s c 1 Align signals by advancing x i (t) by τ i x 0 (t) = s(t τ 0 ) x 1 (t) = s(t τ 1 ) s(t) x M 1 (t) = s(t τ M 1 ) 4

Ad-Hoc microphone array Independent devices equipped with a microphone Traditional beamforming requirements unfulfilled 1. Array geometry is unknown (M is unknown) 2. Devices aren t synchronized (unknown time offsets Δ i ) 3. The space cannot be easily panned to find source direction θ to steer the beam into 5

Time of Arrival (TOA) Signal time-of-arrival (TOA) for and ad-hoc array τ i = c 1 s m i + Δ i propagation delay time offset Time-difference-of-Arrival (TDOA) for mics i, j τ i, j = τ i τ j TDOAs τ i,j can be measured using e.g. correlation Previously considered as source spatial information A. Brutti and F. Nesta, Tracking of multidimensional TDOA for multiple sources with distributed microphone pairs, Computer Speech & Language, vol. 27, TDOA and TOA vectors are written as P=M(M-1)/2 6

Time of Arrival (TOA) By defining an observation matrix E.g. for three microphones H = The linear model between TOA and TDOA is TOA proposed as source spatial representation 7

Time of Arrival (TOA) 1st Baseline method (TDOA subset): 1. Select a reference microphone (e.g. 1 st mic) 2. Use relative delays τ i,j between the reference (i =1) and rest (j =2,,M) as TOA - Does not utilize TDOA information between all sensors 8

Time of Arrival (TOA) 2nd Moore-Penrose inverse solution for TOA H 0 is H without the first column to account for one missing degree of freedom, i.e. the TOA is relative to 1 st sensor (which is set to zero). + Utilizes TDOA information between all sensors 9

Time of Arrival (TOA) 3rd Kalman filtering based TOA estimation (state eq.) (measurement eq.) x consists of TOA and TOA velocity, A is transition matrix, q, r are noise! x = # " Predict p(x t y t-1 ) and update p(x t y t ) steps. Outlier rejection based on projected measurement likelihood + Utilizes TDOA information between all pairs + Can track speaker during noise contaminated segments. 10 τ τ $ & %

TOA Estimation simulation 3 microphones 48kHz Source rotates around the array Gaussian noise added to TDOA observations τ ij, σ = 20 Gaussian noise in offset values Δ i, σ 2 =10 11

Simulation TOA accuracy Baseline (subset Moore-Penrose Kalman of TDOAs) Inverse filter TOA RMS error (samples@48k, 100 trials) Baseline Moore-Penrose Kalman filter 8.7 16.2 19.9 12

Measurements 10 smartphones were used to capture audio 9 and 12 second sentences were used Speaker walked around the array Reverberation time T60 ~ 370 ms Room size: 5.1m 6.6m TDOAs were manually annotated to obtain ground truth TOA. Reference signal was captured with headworn microphones. 13

Performance of TOA estimators in measurements RMS Error (samples @ 48kHz) 500 450 400 350 300 250 200 150 100 50 0 437 461 Baseline Rec 1 Rec 2 223 232 110 47 Moore-Penrose Kalman filter 14

Obtained beamforming quality We used estimated TOAs to steer DSB Output y(t) quality was evaluated with BSSmetric Signal-to-Artifacts-Ratio or SAR *) SAR= 20log 10 s target e artifacts ( ) y(t) = s target (t)+ e artifacts (t) Scored in segments due to speaker movement (gain variation) Only active segments considered (with VAD) Modified metric: Segmental Signal-to-Artifacts Ratio Arithmetic mean (SSARA) *) http://bass-db.gforge.inria.fr/bss eval/ 15

Objective speech quality SSARA (db) 8 7 6 5 4 3 2 1 0 Rec #1 Rec #2 Best Mic. TDOA Moore-Pensore inverse Kalman filter Ground Truth TOA 16

Conclusions Proposed TOA as the spatial source information of an ad-hoc microphone array Previous research only considered TDOA Dimension of TOA is M-1, for TDOA M(M-1)/2 Three TOA estimation solutions considered TDOA subset (baseline), pseudo-inverse, and Kalman filtering à most accurate TOA allows beam-steering towards source w/o mic. positions / synchronization: blindly Kalman filter based TOA provided best objective signal quality for beamforming 17