ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES

Similar documents
Proceedings of Meetings on Acoustics

Ivan Tashev Microsoft Research

29th TONMEISTERTAGUNG VDT INTERNATIONAL CONVENTION, November 2016

SPHERICAL MICROPHONE ARRAY BASED IMMERSIVE AUDIO SCENE RENDERING. Adam M. O Donovan, Dmitry N. Zotkin, Ramani Duraiswami

From Binaural Technology to Virtual Reality

Abstract. 1. Introduction and Motivation. 3. Methods. 2. Related Work Omni Directional Stereo Imaging

Sound Source Localization using HRTF database

Validation of lateral fraction results in room acoustic measurements

Recent Advances in Acoustic Signal Extraction and Dereverberation

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4

A Toolkit for Customizing the ambix Ambisonics-to- Binaural Renderer

Ambisonics Directional Room Impulse Response as a New SOFA Convention

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS

HRIR Customization in the Median Plane via Principal Components Analysis

Binaural Hearing. Reading: Yost Ch. 12

The psychoacoustics of reverberation

Sound source localization and its use in multimedia applications

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

Auditory System For a Mobile Robot

6-channel recording/reproduction system for 3-dimensional auralization of sound fields

Introduction. 1.1 Surround sound

Acoustics Research Institute

Measuring impulse responses containing complete spatial information ABSTRACT

On distance dependence of pinna spectral patterns in head-related transfer functions

RIR Estimation for Synthetic Data Acquisition

Visualization of Compact Microphone Array Room Impulse Responses

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface

Room impulse response measurement with a spherical microphone array, application to room and building acoustics

Listening with Headphones

Advanced techniques for the determination of sound spatialization in Italian Opera Theatres

Multiple Sound Sources Localization Using Energetic Analysis Method

Psychoacoustic Cues in Room Size Perception

Soundfield Navigation using an Array of Higher-Order Ambisonics Microphones

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

Auditory Localization

Spatialization and Timbre for Effective Auditory Graphing

SGN Audio and Speech Processing

Accurate sound reproduction from two loudspeakers in a living room

Spatialisation accuracy of a Virtual Performance System

Audio Engineering Society Convention Paper 5449

Waves Nx VIRTUAL REALITY AUDIO

From acoustic simulation to virtual auditory displays

Microphone Array Design and Beamforming

SIA Software Company, Inc.

REAL TIME WALKTHROUGH AURALIZATION - THE FIRST YEAR

SGN Audio and Speech Processing

Spatial Audio & The Vestibular System!

Speech Intelligibility Enhancement using Microphone Array via Intra-Vehicular Beamforming

Institute for Media Technology Electronic Media Technology (ELMT)

Proceedings of Meetings on Acoustics

University of Huddersfield Repository

396 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 19, NO. 2, FEBRUARY 2011

Audio Engineering Society. Convention Paper. Presented at the 131st Convention 2011 October New York, NY, USA

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA. Why Ambisonics Does Work

IMPLEMENTATION AND APPLICATION OF A BINAURAL HEARING MODEL TO THE OBJECTIVE EVALUATION OF SPATIAL IMPRESSION

Improving room acoustics at low frequencies with multiple loudspeakers and time based room correction

HRTF adaptation and pattern learning

CAPTURING ANCIENT THEATERS SOUND SIGNATURE USING BEAMFORMING

EFFECT OF STIMULUS SPEED ERROR ON MEASURED ROOM ACOUSTIC PARAMETERS

Binaural Sound Source Localization Based on Steered Beamformer with Spherical Scatterer

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings

Monaural and Binaural Speech Separation

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations

TDE-ILD-HRTF-Based 2D Whole-Plane Sound Source Localization Using Only Two Microphones and Source Counting

MPEG-4 Structured Audio Systems

Lecture 14: Source Separation

Enhancing 3D Audio Using Blind Bandwidth Extension

Direction-Dependent Physical Modeling of Musical Instruments

APPLICATION OF THE HEAD RELATED TRANSFER FUNCTIONS IN ROOM ACOUSTICS DESIGN USING BEAMFORMING

Proceedings of Meetings on Acoustics

Binaural room impulse response database acquired from a variable acoustics classroom

HRTF measurement on KEMAR manikin

Automatic Text-Independent. Speaker. Recognition Approaches Using Binaural Inputs

Introduction of Audio and Music

MANY emerging applications require the ability to render

SYNTHESIS OF DEVICE-INDEPENDENT NOISE CORPORA FOR SPEECH QUALITY ASSESSMENT. Hannes Gamper, Lyle Corbin, David Johnston, Ivan J.

Audio Engineering Society. Convention Paper. Presented at the 124th Convention 2008 May Amsterdam, The Netherlands

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

SOUND FIELD MEASUREMENTS INSIDE A REVERBERANT ROOM BY MEANS OF A NEW 3D METHOD AND COMPARISON WITH FEM MODEL

Quality Measure of Multicamera Image for Geometric Distortion

BEAMFORMING WITHIN THE MODAL SOUND FIELD OF A VEHICLE INTERIOR

Reducing comb filtering on different musical instruments using time delay estimation

AVAL: Audio-Visual Active Locator ECE-492/3 Senior Design Project Spring 2014

The Human Auditory System

Convention e-brief 310

Automatic Evaluation of Hindustani Learner s SARGAM Practice

ROOM ACOUSTICS MEASUREMENT AND ASSESSMENT METHODS. 1. Signal to Noise OBJECTIVE METHODS ASSESSMENT OF ROOM ACOUSTICS CAN BE DIVIDED INTO:

Sound rendering in Interactive Multimodal Systems. Federico Avanzini

New acoustical techniques for measuring spatial properties in concert halls

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett

VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION

Measuring procedures for the environmental parameters: Acoustic comfort

PITCH-TRACKING OF REVERBERANT SOUNDS, APPLICATION TO SPATIAL DESCRIPTION OF SOUND SCENES

Circumaural transducer arrays for binaural synthesis

Spatial Audio Reproduction: Towards Individualized Binaural Sound

ANALYZING NOTCH PATTERNS OF HEAD RELATED TRANSFER FUNCTIONS IN CIPIC AND SYMARE DATABASES. M. Shahnawaz, L. Bianchi, A. Sarti, S.

Binaural auralization based on spherical-harmonics beamforming

PRODUCT DATA. DIRAC Room Acoustics Software Type Photo courtesy of Muziekcentrum Frits Philips, Eindhoven, The Netherlands

Advanced Functions of Java-DSP for use in Electrical and Computer Engineering Senior Level Courses

Proceedings of Meetings on Acoustics

Transcription:

ROOM AND CONCERT HALL ACOUSTICS The perception of sound by human listeners in a listening space, such as a room or a concert hall is a complicated function of the type of source sound (speech, oration, concert, music and type), of the intended outcome (comprehension, comfort, enjoyment, localization), of the room (geometry, reflecting surfaces/absorbers, size) and the listener (HRTF, hearing acuity). Quite obviously, reducing this tremendous variability to a single standard is an extremely hard task. The ISO 3382 standard [1] specifies several performance space acoustic parameters (Part 1) and room acoustics parameters (Part 2) for this. These measurements include varying quantities (decay times, clarity, etc.) extracted in varying frequency bands, for both single microphone and binaural measurements. The binaural measurements attempt to account for the effects of the human head, torso and pinnae on the measurements, but do not account for inter-personal variations. To go beyond this standard, one needs to understand the impulse response for every source-receiver position combination of interest. In the case of binaural listening, this needs to be done in a way that particular individual head-related transfer functions can be employed. MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES To obtain a finer understanding of the listening characteristics, individual impulse responses must be measured for each direction and their structure understood from the geometry and materials used to construct the space. In particular for the early stage, the interaction of the source sound with each scattering surface, in each frequency band can be easily characterized using the VisiSonics AudioVisual Panoramic Camera. This device provides several microphones (64 or 128), and several high definition cameras (5 or 15), which provide an ability to record sound, and to co-register the sound. The device allows 24 bit recordings, at 3 user-selectable gain levels, and 48 khz sampling. All the cameras and microphones are mounted on a head-sized sphere (0.2 m diameter). The use of an advanced digital microphone architecture (Zotkin et al 2013) allows the audio measurements from all cameras and microphones be sent over a combination of IEEE1394 (video) and USB 2 (audio), or over USB 3 (audio and video). A NVIDIA GPU card equipped laptop allows both real time analysis of a scene, and of sophisticated on-scene computations. A panoramic video image is computed from the multiple video cameras by using a GPU based stitching algorithm. An overlaid audio image is computed by employing several thousand plane-wave or spherical beamformers on a grid of directions on the sphere (O Donovan et al, 2007). The magnitude of these beamformers in particular frequency bands are mapped to false color values and overlaid via alpha-blending as a transparent texture over the panoramic video. The raw audio and video data are streamed to disk, and can be used for more careful late analysis. Real-time analysis possible via the video stream includes identification of prominent reflections, and the geometric sources of their origin. A

sample screenshot with a speaking user in a room is shown below. SPATIAL, TEMPORAL AND SPECTRAL ANALYSES Various off-line analyses can be performed via the tool. These include the computation of directional impulse responses, the analysis of the reverberation in temporal windows, the analysis in spectral windows, the online computation of spectrograms with various weightings and frequency scales, and others. Some preliminary analysis results can be seen in following video posted online. The interface for performing spectro-temporal spatial analysis of the acoustic scene is shown above. Window 1 shows the time domain plot of the signal recorded. A sliding red box allows a particular time interval of interest to be selected.

Window 2 shows a spectrogram of the selected time region. The spectrogram is a useful tool for finding structure in the recorded audio. It can be used to select a frequency and time sub region to generate an acoustic image. In the following images, screenshots are shown from a recording made in a reverberant chamber. The image below shows the first frame of the acoustic image. It was generated by computing the acoustic images for all frequency bins within the user selected frequency range of a 256 sample frame of data. For each successive image the sample frame is advanced by 10 samples and the corresponding image is generated. This sequence can then be played to explore the dynamics of the instantaneous spatial distribution of the incident sound field. Notice the thin red line in the spectrogram. This line indicates the extent of the data used to generate the image in the current frame. The image above shows when the impulse onset has occurred; the spatial distribution of the acoustic field is very localized to the driver creating the signal, and a bulls-eye pattern is seen. Because the analysis frame time can be selected accurately, we can advance the frames, until the first reflection is distinctly seen. In this case the image above shows that it is on the leftmost wall of the reverberant chamber. The corresponding bulls-eye pattern of the reflection is seen clearly in the image below, as is the temporal pattern of the

corresponding reflection. In the image above, the frame has been advanced to show the second distinct reflection occurring on the center wall. Advancing 100 ms (the image above) shows just how diffuse the field has already gotten in this reverberant chamber. CONCLUSIONS AND ONGOING WORK The images above just indicate a small selection of analyses possible. Users of the device have used it to quickly identify problematic structures in rooms. It is also possible to compute integrated quantities over time and frequency. Further, using known sources we can compute impulse responses. In a version of the tool under development, we can compute impulse responses corresponding to any direction.

REFERENCES ISO-3382, Acoustics - Measurement of room acoustic parameters -- Part 1: Performance spaces (2009) ISO-3382, Acoustics - Measurement of room acoustic parameters -- Part 2: Reverberation time in ordinary rooms (2008) A. O Donovan, R. Duraiswami, Jan Neumann (2007). Microphone Arrays as Generalized Cameras for Integrated Audio Visual Processing. IEEE CVPR. A. O'Donovan, D. N. Zotkin and R. Duraiswami (2008). A spherical microphone array based system for immersive audio scene rendering. Proc. 14th International Conference on Auditory Displays. Paris, France. A. O'Donovan, R. Duraiswami and D.N. Zotkin. (2010). Automatic matched filter recovery via the audio camera. Proc. IEEE ICASSP, (pp. 2826-2829). Dallas, TX. A. O Donovan, R. Duraiswami and D.N. Zotkin. (2008). Imaging Concert Hall Acoustics using Audio and Visual Cameras. IEEE ICASSP, (pp. 5284-5287). Las Vegas, NV. A. O'Donovan, R. Duraiswami and Nail A. Gumerov. (2007). Real Time Capture of Audio Images and Their Use with Video. IEEE WASPAA, (pp. 1-8).