A3D Contiguous time-frequency energized sound-field: reflection-free listening space supports integration in audiology

Similar documents
Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Sonnet. we think differently!

O P S I. ( Optimised Phantom Source Imaging of the high frequency content of virtual sources in Wave Field Synthesis )

III. Publication III. c 2005 Toni Hirvonen.

Envelopment and Small Room Acoustics

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Pre- and Post Ringing Of Impulse Response

Surround: The Current Technological Situation. David Griesinger Lexicon 3 Oak Park Bedford, MA

Acoustics II: Kurt Heutschi recording technique. stereo recording. microphone positioning. surround sound recordings.

Force versus Frequency Figure 1.

What applications is a cardioid subwoofer configuration appropriate for?

ONLINE TUTORIALS. Log on using your username & password. (same as your ) Choose a category from menu. (ie: audio)

FFT 1 /n octave analysis wavelet

From time to time it is useful even for an expert to give a thought to the basics of sound reproduction. For instance, what the stereo is all about?

Binaural Hearing. Reading: Yost Ch. 12

Accurate sound reproduction from two loudspeakers in a living room

Added sounds for quiet vehicles

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS

Auditory Localization

Finding the Prototype for Stereo Loudspeakers

Sound recording & playback

Practical Applications of the Wavelet Analysis

Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig Wolfgang Klippel

Processor Setting Fundamentals -or- What Is the Crossover Point?

SOUND 1 -- ACOUSTICS 1

Perception of low frequencies in small rooms

Improving room acoustics at low frequencies with multiple loudspeakers and time based room correction

K L A N G W E R K ACTIVE TECHNOLOGY. Active versus Passive Technology. CPR (Compensated Phase Response)-System AOI (Adapted Output Impedance)-System

Psychoacoustic Cues in Room Size Perception

The analysis of multi-channel sound reproduction algorithms using HRTF data

The Naim Balanced Mode Radiator The Naim Ovator Bass Driver

ALTERNATING CURRENT (AC)

Earl R. Geddes, Ph.D. Audio Intelligence

AUDL GS08/GAV1 Signals, systems, acoustics and the ear. Loudness & Temporal resolution

Measuring procedures for the environmental parameters: Acoustic comfort

EBU UER. european broadcasting union. Listening conditions for the assessment of sound programme material. Supplement 1.

Quadra 10 Available in Black and White

Computational Perception. Sound localization 2

CONTENTS. Preface...vii. Acknowledgments...ix. Chapter 1: Behavior of Sound...1. Chapter 2: The Ear and Hearing...11

Reflection and absorption of sound (Item No.: P )

Reducing comb filtering on different musical instruments using time delay estimation

EECS 452, W.03 DSP Project Proposals: HW#5 James Glettler

The psychoacoustics of reverberation

2. The use of beam steering speakers in a Public Address system

MEASURING DIRECTIVITIES OF NATURAL SOUND SOURCES WITH A SPHERICAL MICROPHONE ARRAY

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio

29th TONMEISTERTAGUNG VDT INTERNATIONAL CONVENTION, November 2016

Technique for the Derivation of Wide Band Room Impulse Response

Introduction Unique Electronics

University of Huddersfield Repository

Learning Objectives:

ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS

Introduction to Audio Watermarking Schemes

Room Acoustics. March 27th 2015

SIA Software Company, Inc.

Introduction. 1.1 Surround sound

A triangulation method for determining the perceptual center of the head for auditory stimuli

COM325 Computer Speech and Hearing

Review Title. Secondary Title

Audio Quality Terminology

A White Paper on Danley Sound Labs Tapped Horn and Synergy Horn Technologies

Digital Signal Processing Audio Measurements Custom Designed Tools. Loudness measurement in sone (DIN ISO 532B)

AMERICAN UNIVERSITY EAST CAMPUS DEVELOPMENT WASHINGTON, D.C. Environmental Noise Study. Project Number

Excelsior Audio Design & Services, llc

SMALL WONDER. THE OMNISAT MICRO. OMNIPOLAR SOUND. COMPACT DESIGN. Z E S I A L T U A C

LOW FREQUENCY SOUND IN ROOMS

Perception of temporal response and resolution in the time domain

New transducer technology A.R.T. = Accelerated Ribbon Technology - evolution of the air motion transformer principle

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

Technical Guide. Installed Sound. Recommended Equalization Procedures. TA-6 Version 1.1 April, 2002

3D Distortion Measurement (DIS)

Lift-over crossings as a solution to tram-generated ground-borne vibration and re-radiated noise

datasheet TQ-445 QLIGHT SERIES ENGINEERING INFORMATION FEATURES APPLICATIONS Three-way, bi-amp design Exceptional audio clarity 60 x 40 dispersion

Chapter 12. Preview. Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect. Section 1 Sound Waves

6-channel recording/reproduction system for 3-dimensional auralization of sound fields

Since the advent of the sine wave oscillator

DESIGN OF ROOMS FOR MULTICHANNEL AUDIO MONITORING

INFLUENCE OF FREQUENCY DISTRIBUTION ON INTENSITY FLUCTUATIONS OF NOISE

An introduction to physics of Sound

Audio System Evaluation with Music Signals

Physics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound

Multichannel Audio In Cars (Tim Nind)

Photo Story The Time-Behaviour of Equalisers in dependancy of the Quality factor

Copyright 2009 Pearson Education, Inc.

Quadra 15 Available in Black and White

Week 1. Signals & Systems for Speech & Hearing. Sound is a SIGNAL 3. You may find this course demanding! How to get through it:

Monitor Setup Guide The right monitors. The correct setup. Proper sound.

Small Room and Loudspeaker Interaction

Technology Super Live Audio Technology (SLA)

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION

Loudspeaker Data Reliable, Comprehensive, Interpretable

Week I AUDL Signals & Systems for Speech & Hearing. Sound is a SIGNAL. You may find this course demanding! How to get through it: What is sound?

TEAK Sound and Music

SOUND QUALITY EVALUATION OF FAN NOISE BASED ON HEARING-RELATED PARAMETERS SUMMARY INTRODUCTION

EQ s & Frequency Processing

Sound Source Localization using HRTF database

9 A small tutorial. 9.1 Loudspeaker boxes

Multichannel Audio Technologies. More on Surround Sound Microphone Techniques:

Transcription:

A3D Contiguous time-frequency energized sound-field: reflection-free listening space supports integration in audiology Joe Hayes Chief Technology Officer Acoustic3D Holdings Ltd joe.hayes@acoustic3d.com Jon South Acoustician Acoustic3D Holdings Ltd jon.south@acoustic3d.com Malcolm Duffield CEO Vastigo Limited malcolm.duffield@vastigo.com Professor Dr Phil Graham Queensland University of Technology p.graham@qut.edu.au Abstract In this paper we present a model of treated sound (sound played back through loudspeakers) that diverges from conventional psychoacoustic understandings. We show that if sound is diffused into a listening space, using a mathematically pure diffusion, it will effectively prevent the creation of specular reflections that might carry false but meaningful (i.e. audible and functional) acoustic cues or what is commonly called room noise. Reducing the processing demand on the listeners audiology enables the listener to better identify and locate sources of sound in space. Because the acoustic energy presented to the ear has been simultaneously Acoustic Wavelet-encoded whilst being so diffused, it forms a contiguous phase-sound field that we argue better suits the integration period of human audiology, thereby enhancing sonic perception of recorded material. The nature of Acoustic Reflections In the following section, we analyse measurements obtained from both conventional and A3D loudspeakers. The speakers were energised with a unit impulse source (UIS) signal, and microphone measurements made of the resulting direct sound and early reflections from nearby surfaces. For the scope of this paper, these measurements will be referred to as Speaker Impulse Reflection Responses (SIRR). The reference listening room was a medium sized L shaped long room chosen to represent a typical home listening environment. The speakers were located adjacent to a wall near the apex of the L. A sample of 2 meters length* was captured by an instrument microphone (Audio Technica 4031) at tweeter height on both systems, at identical distance (of 1 meter), on-axis. *In an effort to relate measurements to spatial environments, time based measurements are converted to the corresponding distance sound travels at 343m/s. e.g. 5.8ms = 2m. Contrast Figures 1 through to 6, over the next two pages, compare and contrast a conventional loudspeaker (Rogers BBC Monitor) and an A3D loudspeaker (NewAudio AS8) in terms of their performance in this typical listening space. Figure 1 shows the SIRR for the conventional Rogers loudspeaker. The UIS response is seen localised at 0cm, with three clearly visible specular reflections (at approximately 80, 105 and 155 cm after the click). Whilst these would not necessarily be heard as echoes, our audiology would still have to process (and elect to ignore or incorporate) such sounds as cues in order to establish our spatial perception. Figure 1 - Rogers BBC Monitor loudspeaker SIRR as measured in the reference room

Figure 2 depicts a Wavelet Energy Transform (WET) of the above SIRR. Mathematically, we are performing a Continuous Wavelet Transform (CWT). It allows us to look at the wavelet decomposition of the signal, and the energy content in the wavelet domain. From it you will see the energy reflected back into the listening space as three stalactite-like structures, each having an apex at approximately 80, 105 and 155 cm respectively after the click. The click itself also has a stalactitelike structure that informs us of an approximately 10cm mis-alignment in the excitation force between the low frequencies (up to approximately 3,200Hz) and the high frequencies (above 3,200Hz). This is due to the offset between the woofer and the tweeter on a flat front panel in a classic, passive-crossover, design such as that of the Rogers. Figure 2 - Rogers BBC Monitor loudspeaker Wavelet Energy Transform (WET) measured in the reference room Figure 3 being a view of the WET slice through 11,142Hz, clearly indicates the reflected energy at approximately 80, 105 and 155 cm after the click. The 80cm reflection contains the most energy whilst the 105 and 155cm reflections contain about equal energy. The distinct reflections in the time domain are typical of the coloration added by a listening room. Figure 3 - Rogers BBC Monitor loudspeaker WET at scale 7 (11,142 Hz) as measured in the reference room Acoustic3D Holdings Ltd Commercial in confidence Page 2 of 6

Figures 4 through to 6 however show the same SIRR measurement for the A3D speaker, with its Diffusion at Source design. Early specular reflections, as seen with the Rogers, might be expected at 80cm, 105cm and 160cm after the incident arrival (0cm), but are almost completely absent. Thus the listener s audiology doesn t have to process them as cues (false or otherwise). This, and listeners stated opinions of A3D speakers as clearer, would appear to support both the first and second parts of our diffusion hypothesis. Figure 4 - A3D speaker SIRR measured in the same room Figure 5 depicts a WET that has a consistently darker, less-correlated, area after the original click representing the diffuse residual energy. This diffuse energy enables to room to remain live sounding. Figure 5 - A3D speaker IRR measured in same room Figure 6: The WET profile centred on 11,142Hz, shows the absence of reflected energy at approximately 80cm, 105cm, and 155cm. Thus the hypothesised psychoacoustic contamination effect of the listening room appears to have been successfully removed. There is a much smaller energy presence at approximately 30cm that likely correlates to a small amount of internal reflection coming back through the cone of the source itself, given the internal width of the Satellite used in the A3D system is of the order of 297mm. Acoustic3D Holdings Ltd Commercial in confidence Page 3 of 6

Figure 6 - A3D speaker IRR measured in same room In the above Figure 6 there are no obvious audible room reflections in the SIRR. The WET (Figure 5) also shows how diffuse (i.e. non-correlated) the residual energy is coming from the listening room (evidenced by the WET s much darker bars). A significant difference between the Rogers and A3D speakers is the form that the radiated click energy takes going in to the listening space. The Rogers speakers, and other conventional speakers, rely on the uncontrolled electro-mechanical response of its components in the presentation of sound to the acoustic environment. It is proposed that A3D speakers present a controlled, wavelet encoded response to the acoustic environment. As wavelets offer superior timefrequency localisation, they produce a complete and contiguous phase-spatial image that conventional speakers cannot. Morlet wavelets are particularly well suited for this application: as Gaussians they have a simple analytic form and they work well reproducing complex energy paths in the reproduced sound field. These complex energy paths in the sound field, along with the antispecular-reflection properties, contribute significantly to the perception of a real construction of an acoustic sound field when the audio is reproduced on A3D equipped speakers. Integration in human perception "A 1,000-Hz tone sounds like 1,000 Hz in a 1-second tone burst, but an extremely short burst sounds like a click. The duration of such a burst also influences the perceived loudness. Short bursts do not sound as loud as longer ones... A pulse 3 ms long must have a level about 15dB higher to sound as loud as a 0.5 second (500 millisecond) pulse. Tones and random noise follow roughly the same relationship in loudness vs. pulse length. The 100-ms region is significant... Only when the tones or noise bursts are shorter than this amount must the sound-pressure level be increased to produce loudness equal to that of long pulses or steady tones or noise. This 100 ms appears to be the integrating time, or the time constant, of the human ear. 1 In simple terms the A3D wavelet encoded sound appears to give acoustic energy the time to make its presence felt. We might deduce from Everest s observations - and those of others - that our audiology, possibly the Basilar membrane itself, needs some time to respond to the presence of patterns in sound. It would also appear likely that our audiological processes derive the zero time (or centre time) of such wave packets so as to be able to derive spatial information from such cues. 1 The Master Handbook of Acoustics - F. Alton Everest Acoustic3D Holdings Ltd Commercial in confidence Page 4 of 6

The above being the case would appear to further support listeners anecdotal evidence that A3D speakers provide greater spatial perception (through clearer and contiguous consistent cues?). But ultimately we don't listen to clicks, we listen to sound and music. Impulse responses are important though, because they allow us to characterise Linear Time Invariant Systems, like loudspeakers and acoustic environments (generally). This is clearly an area that would benefit from further research - but it is also clear that this kind of research is the domain of neuroscience NOT loudspeaker design. A3D Alignment Figure 7-11,142Hz Wavelet Energy Transform Plot Figure 7 above shows the detailed view of the 11,142Hz WET for the A3D Satellite. This response is typical of a zero-phase wavelet, also known as a wave packet. We can see that the regression of the Morlet Wavelet is a very good fit. This energy coherency is what provides the contiguous energy patterns within the reverberant sound field created by A3D. As the cues are coherent, both in time and in their position in space, human audiology can now integrate a longer passage of cues thus better matching our priori maps of what real sound is like. With the extension of the perception integration of a singular event, we suspect the listener will increase the attribution of reality to that sound. The effect of listening room reflections, we hypothesise, is to interrupt the contiguous listening window (100ms - Everest) effectively cutting short the ability of the human ear to completely integrate any passage of an event train of cues. Consequently the perception of loudness is reduced and the reality in a spatial context is compromised. Conclusion A3D is a technology platform for loudspeaker designers to re-engineer the way a loudspeaker energizes an acoustic space. The authors hypothesize that by using zero-phase wavelets two major transformations occur; A - The contamination effect created by the listening space reflections is significantly reduced, giving the listener the ability to decipher longer integrals of contiguous as recorded temporal cues. Acoustic3D Holdings Ltd Commercial in confidence Page 5 of 6

B - Because wavelets are localized in both time and frequency, they produce a contiguous spatial image over time and space, whereas conventional speakers do not. Applications are likely two fold: To provide a clearer listening experience for consumer audio devices placed in a listening space. Devices such as TVs, In-Car Audio, HiFi, Home theatre systems, and public address systems, can benefit from improved clarity. Application to more personal consumer electronics (smartphones, tablets, etc.) would produce a contiguous energy sound field that takes on the qualities of the audio signal, even when the listening space contribution may be insignificant. This is, of course, also applicable to the first group of products. It is further likely to have applications in VR and AR given their likely need for accurate spatial cues. Footnotes: A3D is a range of patented technologies created and owned by Acoustic3D Holdings Ltd. Academic enquiries should be directed to Joe Hayes. Vastigo Ltd is the Licensor of A3D technologies. Commercialisation enquiries should be directed to Malcolm Duffield. A3D equipped newaudio Emergence AS8 loudspeakers can be found online at http://www.newaudio.com.au Acoustic3D Holdings Ltd Commercial in confidence Page 6 of 6