Proceedings of Meetings on Acoustics


Volume 19, 2013
http://acousticalsociety.org/
ICA 2013 Montreal, Montreal, Canada, 2-7 June 2013
Signal Processing in Acoustics
Session 4SP: Sensor Array Beamforming and Its Applications

4SP2. Spatial sound pick-up with a low number of microphones

Julian D. Palacino* and Rozenn Nicol
*Corresponding author's address: SVQ//TPS, Orange Labs, 2 Av Pierre Marzin, Lannion, 22307, Brittany, France, julian.palacino@orange.com

For several decades spatial audio has been only used by movies, music composers and researchers in laboratories. Because of their complexity, people have always been away from 3D audio techniques. Dedicated devices such as microphone and loudspeaker arrays are expensive and cannot be used without some expertise of audio capturing and reproduction. Nowadays the main barrier preventing a consumer solution from capturing spatial audio is the big number of transducers needed to get an accurate 3D sound image. In order to break down this barrier we propose a new 3D audio recording set-up which is composed of a three-microphone array able to get the full 3D audio information. A 2D version, consisting of a two-microphone array, is also available. The sound localization is based on the transducer directivities and additional information to solve the angular ambiguity. This paper will describe firstly the microphone set-up and its associated algorithm. Secondly the performances of sound localization will be assessed.

Published by the Acoustical Society of America through the American Institute of Physics. 2013 Acoustical Society of America [DOI: 10.1121/1.4800844]. Received 22 Jan 2013; published 2 Jun 2013.
Proceedings of Meetings on Acoustics, Vol. 19, 055078 (2013)

INTRODUCTION

For several decades spatial audio has been used only by movies, music composers and researchers in laboratories [1]. Because of their complexity, people have always kept away from 3D audio techniques. Dedicated devices such as microphone and loudspeaker arrays are expensive and cannot be used without some expertise in audio capture and reproduction. Nowadays the main barrier preventing a consumer solution from capturing spatial audio is the large number of transducers needed to get an accurate 3D sound image [2]. In order to break down this barrier we propose a new 3D audio recording set-up composed of a three-microphone array able to capture the full 3D audio information. A 2D version, consisting of a two-microphone array, is also available. The sound localization is based on the transducer directivities and on additional information used to solve the angular ambiguity. This paper first describes the microphone set-up and its associated algorithm. The performance of sound localization is then assessed.

SOURCE LOCALIZATION USING MICROPHONE DIRECTIVITY PATTERN

Microphone Array Layout

The microphone array is composed of 3 cardioid microphones (see Figure 1): the first one pointing to the x axis (right), the second one to the opposite direction (left) and the third one to the z axis (top).

FIGURE 1 Layout of the microphone device. a) 3D array. b) 2D array.

Spatial Information Achieved from Microphone Directivity

The method operates in the time/frequency domain. It is assumed that only one sound source is present at each moment for a single frequency bin. Equations will be presented for a fixed frequency. Before applying the FFT, time samples are weighted by a soft-edge window to avoid oscillations in the frequency domain. In terms of signal processing, the choice of the window type and its length is important: the slope and frequency bounding of the window affect the results at neighbouring frequencies. To avoid instability of the source localization, results can be smoothed frequency-wise and time-wise.
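The windowing step described above can be illustrated with a minimal sketch. The paper does not name its window, frame length or smoothing rule, so the periodic Hann window, the 64-sample frame, the direct DFT sum and the one-pole smoother below are all assumptions chosen only for illustration.

```python
import cmath
import math


def hann(n_samples):
    """Periodic Hann window (assumed here as one possible soft-edge window)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / n_samples) for n in range(n_samples)]


def dft_bin(frame, k):
    """Magnitude of DFT bin k of a real frame (direct sum, fine for short frames)."""
    n_samples = len(frame)
    acc = sum(x * cmath.exp(-2j * math.pi * k * n / n_samples) for n, x in enumerate(frame))
    return abs(acc)


def smooth(values, alpha=0.8):
    """One-pole recursive smoothing of successive estimates (hypothetical time-wise smoother)."""
    out, state = [], values[0]
    for v in values:
        state = alpha * state + (1.0 - alpha) * v
        out.append(state)
    return out


N = 64
# A tone that falls between two bins (10.5 cycles per frame): worst case for leakage.
tone = [math.sin(2.0 * math.pi * 10.5 * n / N) for n in range(N)]
windowed = [w * x for w, x in zip(hann(N), tone)]

rect_leak = dft_bin(tone, 25)      # leakage far from the tone without any window
hann_leak = dft_bin(windowed, 25)  # same bin after soft-edge weighting
```

With these values the leakage into the distant bin is far smaller for the windowed frame than for the unwindowed one, which is why a bin is less likely to be contaminated by a source at a neighbouring frequency.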
The directivity of the n-th cardioid microphone is represented by

(1)

where

(2)

The vector defines the source direction and the vector refers to the pointing direction of the n-th microphone. In this case, the pointing direction can be expressed in the Cartesian basis for each microphone by

(3)

The source location direction can be expressed in the Spherical or Cartesian coordinate basis, respectively or, by

(4)

where the Spherical coordinates are defined by radius, azimuth angle θ, and elevation angle φ. The directivity functions of the three microphones are [3]:

(5).a, (5).b, (5).c

Since the direction is unchanged for any value of the radius, the radius is fixed in the following expressions. The sound source produces a signal at the basis origin. Assuming that each microphone is located at this point, their output signals are:

(6)

These signals give access to three quantities:

1) The monophonic signal of the sound source, by using relations (5).a, (5).b and (6)

(7)

2) The elevation angle of the sound source, by using equations (5).c and (6)

(8)

3) The azimuth angle of the sound source, by using relations (5).a, (5).b

(9)

Alternately it is possible to use microphones with other directivity patterns to reconstruct the cardioid directivity virtually (see Section: Synthesizing Virtual Cardioid Microphones). The same method can be used on a 2D version using only the two microphones corresponding to the horizontal plane. In this case an arbitrary elevation must be fixed. This introduces an error that increases with the angular mismatch between the chosen elevation and the real position. In this first step, source location is calculated using exclusively the microphone directivity pattern. However the azimuth is estimated with a sign ambiguity (front-rear) due to the cosine of Equation (9). This ambiguity can be solved by moving microphones perpendicularly to their pointing direction, which will be illustrated in the next section.

Front Rear Ambiguity Resolution Using Time Delay

If we consider now that the n-th microphone is located at the position defined by the vector:

(10)

its output signal becomes:

(11)

where is the delay induced by the distance between the n-th microphone and the basis origin given by

(12)

From equation (4) the delay becomes

(13)

In the frequency domain becomes (where is the angular frequency with the time frequency), by applying the Fourier Transform,

(14)

with

(15)
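Since the equations did not survive in this transcription, the three estimation steps (mono signal, elevation, azimuth) can be sketched as follows. This is a hedged illustration, not the authors' formulas: it assumes the standard first-order cardioid gain 0.5 * (1 + cos α), unit pointing vectors along +x, -x and +z, and real magnitudes for one time-frequency bin. The function names and the convention that a positive delay sign means "front" are hypothetical.

```python
import math


def cardioid_gain(source_dir, pointing_dir):
    """Assumed first-order cardioid: 0.5 * (1 + cos(angle between the unit vectors))."""
    dot = sum(s * p for s, p in zip(source_dir, pointing_dir))
    return 0.5 * (1.0 + dot)


def unit_vector(azimuth, elevation):
    """Spherical (radius 1) to Cartesian, angles in radians."""
    return (math.cos(elevation) * math.cos(azimuth),
            math.cos(elevation) * math.sin(azimuth),
            math.sin(elevation))


def localize(x_right, x_left, x_top, delay_sign=1.0):
    """Estimate (mono level, azimuth, elevation) for one time-frequency bin.

    x_right, x_left, x_top: magnitudes from the cardioids pointing to +x, -x, +z.
    delay_sign: sign of the inter-microphone delay, used only to pick front or
    rear (the +1 = front convention is an assumption of this sketch).
    """
    mono = x_right + x_left  # two opposed cardioids sum to the source level
    if mono <= 0.0:
        raise ValueError("no signal energy in this bin")
    sin_el = max(-1.0, min(1.0, 2.0 * x_top / mono - 1.0))
    elevation = math.asin(sin_el)
    cos_el = math.cos(elevation)
    if cos_el < 1e-9:  # source at a pole: azimuth is undefined
        return mono, 0.0, elevation
    cos_az = max(-1.0, min(1.0, (x_right - x_left) / (mono * cos_el)))
    # acos only returns 0..180 degrees; the delay sign selects the half-plane.
    azimuth = math.copysign(math.acos(cos_az), delay_sign)
    return mono, azimuth, elevation


# Simulate a source of level 2.0 at azimuth -50 degrees, elevation 30 degrees.
true_az, true_el, level = math.radians(-50.0), math.radians(30.0), 2.0
s = unit_vector(true_az, true_el)
signals = [level * cardioid_gain(s, p) for p in ((1, 0, 0), (-1, 0, 0), (0, 0, 1))]
mono, az, el = localize(*signals, delay_sign=-1.0)
```

Under these assumptions the azimuth is divided by the cosine of the elevation, which is consistent with the later remark that azimuth estimation degrades at high elevations.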

Equation (14) becomes

(16)

Consequently

(17)

and

(18)

The time delay between the two microphones is expressed by:

(19)

Since this result is used to solve the ambiguity in relation (9), only its sign is needed. It is inserted in the latter as:

(20)

SOUND LOCALIZATION USING COINCIDENT BI-DIRECTIONAL MICROPHONES

FIGURE 2 Layout of the coincident microphone device. a) 3D array. b) 2D array.

It will now be shown how to use equations (7), (8) and (9) with virtual cardioid microphones synthesized from bidirectional microphones. The method is described here for an array composed of two bidirectional microphones and a cardioid one.

Microphone Layout

The array is composed of two bidirectional microphones pointing to the x and y axes in the horizontal plane and a third cardioid microphone pointing to the z axis (see FIGURE 2). It should be noticed that a Soundfield microphone [2] could be used alternately, since the X and Y components of B-format are equivalent to the bidirectional signals described above.

Synthesizing Virtual Cardioid Microphones

The signals delivered by the three microphones are [3]:

(21).a, (21).b, (21).c

The virtual signals are obtained using the expressions:

(22)

where the pressure signal is estimated by:

(23)

Alternately, if a Soundfield B-format signal is used

(24)

and

(25)

where the signal Z(t) refers to the Z component of the B-format.

Sound Localization Using Acoustic Intensity

The signals of the virtual cardioid microphones (eq. (22)) allow the elevation and azimuth angles of the sound source to be estimated using relations (8) and (9), but with a front-rear ambiguity which cannot be solved by introducing a time delay since a coincident microphone array is used. Instead, spatial information will be obtained from acoustic intensity. The acoustic intensity vector is linked to the acoustic pressure and acoustic velocity by the relation [5]:

(26)

where is the complex conjugate of the acoustic pressure and, and are the x, y and z components of the acoustic velocity [5]. In the case of a progressive plane wave the acoustic pressure is expressed by:

(27)

where is the wave vector. Euler's equation gives the acoustic velocity as a function of the acoustic pressure:

(28)

where is the medium density and the speed of sound. Therefore the acoustic intensity components are

(29)

where the index represents x, y or z. Thus it is observed that the acoustic intensity vector has the same direction as the wave vector, and can therefore be used to estimate the direction of the sound source. A bidirectional coincident array delivers pressure gradient information which leads directly to the acoustic velocity. For instance, the horizontal plane projection of velocity is given by:

(30)

From Equation (26), the acoustic intensity components are:

(31)

Elevation and azimuth angles are then obtained from Equation (29) [6]:

(32)
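The intensity-based estimate can be sketched for a single frequency bin. Equations (26) to (32) are not legible in this transcription, so the sketch assumes the usual active-intensity form 0.5 * Re(conj(P) * V), velocity components taken as proportional to the bidirectional microphone outputs with the medium impedance normalised to 1, and an inverse tangent for the azimuth. All function names are hypothetical.

```python
import cmath
import math


def intensity(pressure, velocity):
    """Active intensity components: 0.5 * Re(conj(P) * V_i) for i in x, y, z."""
    return tuple(0.5 * (pressure.conjugate() * v).real for v in velocity)


def direction_from_intensity(i_x, i_y, i_z):
    """Azimuth from the inverse tangent of I_y / I_x, elevation from I_z.

    math.atan only returns values between -90 and 90 degrees, so the azimuth
    is known up to 180 degrees: this is the ambiguity attributed to the
    inverse tangent in the text.
    """
    if i_x != 0.0:
        azimuth = math.atan(i_y / i_x)
    else:
        azimuth = math.copysign(math.pi / 2.0, i_y)
    elevation = math.atan2(i_z, math.hypot(i_x, i_y))
    return azimuth, elevation


# Plane wave from azimuth 150 degrees, elevation 20 degrees (illustrative values).
az_true, el_true = math.radians(150.0), math.radians(20.0)
direction = (math.cos(el_true) * math.cos(az_true),
             math.cos(el_true) * math.sin(az_true),
             math.sin(el_true))
p = cmath.rect(0.7, 1.1)                   # arbitrary complex pressure for one bin
v = tuple(p * c for c in direction)        # velocity proxy, impedance normalised to 1
az_est, el_est = direction_from_intensity(*intensity(p, v))
```

Here the estimated azimuth comes out as -30 degrees for a source at 150 degrees, i.e. the true value minus 180 degrees, while the elevation is recovered directly.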

Solving Ambiguity Localization for Coincident Arrays

FIGURE 3 Angle estimation ambiguity: Cardioid directivity method (continuous blue line; see Equation (9)), Acoustic intensity method (dotted green line; see Equation (32)), Theoretical position (red dashed line).

TABLE 1 Ambiguity resolution by cross-checking the angular estimation from the directivity (Equation (9)) and intensity (Equation (32)) methods. Column headers: real; estimated (Directivity, Intensity); operation to solve ambiguity (Directivity, Intensity).

Localization based on the cardioid directivity yields the azimuth angle with a front-rear ambiguity due to the cosine relation involved in its estimation (eq. (9)). This ambiguity can be solved for non-coincident arrays by inserting a delay. For coincident arrays it is possible to use acoustic intensity to estimate the azimuth angle, but this time with a left-right ambiguity due to the inverse tangent in relation (32). As shown by FIGURE 3 and TABLE 1, the front-rear ambiguity is complementary to the left-right ambiguity. Four cases are pointed out, corresponding to the four combinations of the two ambiguous estimations. The real position can then be found using a conditional search. In theory, once the ambiguity is solved, both methods (i.e. Equation (9) or Equation (32)) give the same result. However they may be slightly different in practice. Depending on the sound scene, the sound stimulus or the noise level, one method can achieve better performance than the other.

LOCALIZATION ASSESSMENT

A computer program simulates the signals which would have been recorded by the two microphone array setups previously described. Various stimuli were used (music, random noise, noise band and harmonic tone).

Evaluation Criteria

The azimuth and elevation errors are calculated here as the angular distance between the real location and the estimated one. The total error is the angular distance between the real and the estimated location on the sphere (see eq. (33)). The angular distance is calculated using the scalar product of the real and estimated positions.

(33)

where

(34)

When or are calculated, and, or and component are fixed to 0 respectively. The error is expressed in degrees, where 0 indicates the best estimation and 180 the worst one. In addition a new criterion is proposed, computed as the error level obtained by at least 75% of the 1/3 octave spectrum. It will be referred to as.

Results

FIGURE 4 Source localization of a random noise moving around from bottom to the top of a 3 cardioid microphones array. Horizontal plane microphones are separated by 2 cm. a) Source location at 1036 Hz, azimuth (blue) and elevation (green). b) criterion, azimuth (blue), elevation (green), total (red). c) Elevation source location error. d) Azimuth localization error.

FIGURE 4 depicts the results obtained when localizing a random noise moving around and from bottom to the top. Localization accuracy is affected by the microphone spacing because the reconstruction of the omnidirectional pressure is altered. As shown in FIGURE 4c, high frequencies are more affected when the wavelength is closer to the microphone distance. For high elevations, variations of the cardioid pattern are slow, which results in poor angular discrimination. As azimuth localization uses elevation (cf. eq. (20)), azimuth estimation is consequently degraded, as shown by FIGURE 4d.

FIGURE 5 Source localization of a random noise in azimuth (blue) and elevation (green) picked up by a 2 bidirectional + 1 cardioid microphone array at 1036 Hz found with a) directivity only, b) intensity only, c) directivity and intensity.

When coincident arrays are used, estimation of the pressure signal is more accurate (see eq. (23)) and results are not affected by elevation over all frequencies (see FIGURE 5). However, in practice coincident microphone arrays are impossible to build. As a consequence some artifacts will always occur at high frequencies. As specified before, it is assumed that only one source is present at each moment and at each frequency. In practice, the acoustic field is complex and more than one source is present, which affects localization accuracy (see FIGURE 6 and FIGURE 7).
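Two steps described earlier in this section, the cross-check that resolves the azimuth ambiguity and the angular distance computed from a scalar product, can be sketched as below. The candidate sets are an interpretation, not the content of TABLE 1, which is not legible here: the cosine-based estimate is assumed to be known up to its sign and the inverse-tangent estimate up to 180 degrees. Function names are hypothetical.

```python
import math


def angular_gap(a, b):
    """Smallest absolute difference between two angles (radians)."""
    d = a - b
    return abs(math.atan2(math.sin(d), math.cos(d)))


def resolve_azimuth(az_directivity, az_intensity):
    """Cross-check the two ambiguous azimuth estimates.

    Candidates: +/- az_directivity (mirror of the cosine-based estimate) and
    az_intensity or az_intensity + pi (inverse-tangent estimate).
    The directivity candidate of the best-agreeing pair is returned.
    """
    directivity_candidates = (az_directivity, -az_directivity)
    intensity_candidates = (az_intensity, az_intensity + math.pi)
    pairs = ((d, i) for d in directivity_candidates for i in intensity_candidates)
    best = min(pairs, key=lambda pair: angular_gap(pair[0], pair[1]))
    return best[0]


def angular_error_deg(real, estimated):
    """Angle in degrees between two directions given as (azimuth, elevation)
    in radians, from the scalar product of the two unit vectors."""
    def unit(az, el):
        return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))
    dot = sum(r * e for r, e in zip(unit(*real), unit(*estimated)))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot))))


# Source at 150 degrees: the cosine method gives 150 (or -150),
# the inverse tangent gives -30 (or 150); only 150 is common to both.
resolved = resolve_azimuth(math.radians(150.0), math.radians(-30.0))
total_error = angular_error_deg((math.radians(150.0), 0.0), (resolved, 0.0))
```

One reading of the azimuth-only and elevation-only errors is to call `angular_error_deg` with the other angle set to 0 in both arguments; this is an assumption, since the corresponding sentence has lost its symbols.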
Low energy signals close to high energy signals are then localized in the direction of the higher one. When the target sound source level is more than 12 dB above the disturbing noise, the localization error is small: some front-rear confusions are introduced below 500 Hz since the phase information of the target source is altered by the disturbing second source. Obviously this is only observed in the case of the cardioid array. The influence of the disturbing source becomes insignificant for level differences higher than 20 dB.

FIGURE 6 Source localization evaluation of a random noise: Elevation error (1st row), Azimuth error (2nd row) and criterion (3rd row) picked up with a 3 cardioid microphone array in presence of a disturbing second noise source at θ = 0 and φ = 0 and level difference of a) 0 dB, b) -12 dB, c) -20 dB.

Contrasting with FIGURE 6, it is observed for the bidirectional array that the error increases unexpectedly for low elevations (see FIGURE 7-3c). Indeed the target source level is disadvantaged by the array directivity, which is deaf in those directions. This phenomenon can be turned into an advantage if the disturbing noise is placed in the deaf area of the array. On the contrary, for the cardioid array the target source is picked up homogeneously in all directions. As suggested by FIGURE 6, the azimuth error does not have the same impact on the total angular distance at all elevation angles. FIGURE 7 clearly shows that the azimuth error has less impact when the source is near the poles.

CONCLUSION AND FUTURE WORKS

In order to provide 3D audio tools for consumer devices, we propose a recording solution using a small number of microphones. The omnidirectional pressure signal is recomposed and compared with the directional microphone output. Sound source location is estimated in azimuth and elevation using microphone directivity. However the localization is front-rear ambiguous. For non-coincident arrays, this ambiguity is solved by the time difference between microphones, whereas for coincident arrays, the acoustic intensity vector is used. In the case of non-coincident arrays, the omnidirectional pressure component is not properly reconstructed at wavelengths close to the microphone spacing, which alters the localization.

The sound scene analysis presented in this paper can be used as the first step of an object-based spatial audio representation. Using the information of source position, it is possible to render the sound scene over any type of spatial audio system such as stereo, 5.1, 7.1, 22.2, Higher Order Ambisonics [7] or Wave Field Synthesis [8].

FIGURE 7 Source localization evaluation of a random noise: Elevation error (1st row), azimuth error (2nd row) and criterion (3rd row) picked up with a 2 bidirectional + 1 cardioid microphone array in presence of a disturbing second noise source at θ = 0 and φ = 0 and level difference of a) 0 dB, b) -12 dB, c) -20 dB.

REFERENCES

[1] J. Sunier, The story of stereo: 1881-. Gernsback Library, 1960.
[2] R. Nicol, «Représentation et perception des espaces auditifs virtuels», HDR, Université du Maine, Le Mans, France, 2010.
[3] J. Jouhaneau, Notions élémentaires d'acoustique: Électroacoustique. Tec & Doc Lavoisier, 1999.
[4] Michael A. Gerzon and Peter G. Craven, «Coincident microphone simulation covering three dimensional space and yielding various directional outputs», U.S. Patent 4,042,779, 16 August 1977.
[5] M. Bruneau, Manuel d'acoustique fondamentale. Hermès, 1998.
[6] V. Pulkki, «Directional audio coding in spatial sound reproduction and stereo upmixing», in Proc. of the AES 28th Int. Conf., Piteå, Sweden, 2006.
[7] J. Daniel, «Evolving views on HOA: From technological to pragmatic concerns», Ambisonics Symposium 2009, June 25-27, Graz, 2009.
[8] A.J. Berkhout, D. de Vries & P. Vogel, «Acoustic Control by Wave Field Synthesis», J. Acoust. Soc. Am., 1993, 93, pp. 2764-2778.