OPTIMIZATION TECHNIQUES FOR PARAMETRIC MODELING OF ACOUSTIC SYSTEMS AND MATERIALS

Similar documents
WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

Frequency-Zooming ARMA Modeling of Resonant and Reverberant Systems *

AN AUDITORILY MOTIVATED ANALYSIS METHOD FOR ROOM IMPULSE RESPONSES

Direction-Dependent Physical Modeling of Musical Instruments

Modeling Diffraction of an Edge Between Surfaces with Different Materials

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES

FOURIER analysis is a well-known method for nonparametric

MAGNITUDE-COMPLEMENTARY FILTERS FOR DYNAMIC EQUALIZATION

Digital Filtering: Realization

Karjalainen, Matti; Paatero, Tuomas Equalization of Loudspeaker and Room Responses Using Kautz Filters: Direct Least Squares Design

ROOM SHAPE AND SIZE ESTIMATION USING DIRECTIONAL IMPULSE RESPONSE MEASUREMENTS

Modal Equalization of Loudspeaker Room Responses at Low Frequencies *

Psychoacoustic Cues in Room Size Perception

EE 422G - Signals and Systems Laboratory

SIA Software Company, Inc.

Audio Engineering Society Convention Paper 5449

EE 470 Signals and Systems

A SIMPLE APPROACH TO DESIGN LINEAR PHASE IIR FILTERS

Frequency-Zooming ARMA Modeling for Analysis of Noisy String Instrument Tones

Improving room acoustics at low frequencies with multiple loudspeakers and time based room correction

Low frequency sound reproduction in irregular rooms using CABS (Control Acoustic Bass System) Celestinos, Adrian; Nielsen, Sofus Birkedal

Equalizers. Contents: IIR or FIR for audio filtering? Shelving equalizers Peak equalizers

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Convention Paper Presented at the 120th Convention 2006 May Paris, France

CG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003

Digital Signal Processing

Level I Signal Modeling and Adaptive Spectral Analysis

Non-linear guitar body models

Validation of lateral fraction results in room acoustic measurements

MPEG-4 Structured Audio Systems

APPLICATION NOTE MAKING GOOD MEASUREMENTS LEARNING TO RECOGNIZE AND AVOID DISTORTION SOUNDSCAPES. by Langston Holland -

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Adaptive Filters Application of Linear Prediction

Measuring procedures for the environmental parameters: Acoustic comfort

MUMT618 - Final Report Litterature Review on Guitar Body Modeling Techniques

Resonator Factoring. Julius Smith and Nelson Lee

Discrete-Time Signal Processing (DTSP) v14

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS

Pre- and Post Ringing Of Impulse Response

FIR Filter For Audio Practitioners

INHARMONIC DISPERSION TUNABLE COMB FILTER DESIGN USING MODIFIED IIR BAND PASS TRANSFER FUNCTION

Class Overview. tracking mixing mastering encoding. Figure 1: Audio Production Process

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Aalborg Universitet. Published in: Acustica United with Acta Acustica. Publication date: Document Version Early version, also known as pre-print

Narrow-Band Interference Rejection in DS/CDMA Systems Using Adaptive (QRD-LSL)-Based Nonlinear ACM Interpolators

Analysis of room transfer function and reverberant signal statistics

Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh

Room Response Equalization A Review. and Sascha Spors 3 ID

Tonehole Radiation Directivity: A Comparison Of Theory To Measurements

ROOM IMPULSE RESPONSE SHORTENING BY CHANNEL SHORTENING CONCEPTS. Markus Kallinger and Alfred Mertins

EE 351M Digital Signal Processing

Proceedings of Meetings on Acoustics

Optimizing a High-Order Graphic Equalizer for Audio Processing

OF HIGH QUALITY AUDIO SIGNALS

The University of Texas at Austin Dept. of Electrical and Computer Engineering Midterm #1

SGN Audio and Speech Processing

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

Technique for the Derivation of Wide Band Room Impulse Response

SGN Audio and Speech Processing

Estimation of Reverberation Time from Binaural Signals Without Using Controlled Excitation

Measuring impulse responses containing complete spatial information ABSTRACT

Practical Applications of the Wavelet Analysis

Design of IIR Digital Filters with Flat Passband and Equiripple Stopband Responses

Active Field Control (AFC) Reverberation Enhancement System Using Acoustical Feedback Control

NH 67, Karur Trichy Highways, Puliyur C.F, Karur District DEPARTMENT OF INFORMATION TECHNOLOGY DIGITAL SIGNAL PROCESSING UNIT 3

Khlui-Phiang-Aw Sound Synthesis Using A Warped FIR Filter

Speech Synthesis using Mel-Cepstral Coefficient Feature

4.5 Fractional Delay Operations with Allpass Filters

ESE531 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing

6-channel recording/reproduction system for 3-dimensional auralization of sound fields

ECE438 - Laboratory 7a: Digital Filter Design (Week 1) By Prof. Charles Bouman and Prof. Mireille Boutin Fall 2015

Design of 2-Dimensional Recursive Filters by using Neural Networks

Three-dimensional sound field simulation using the immersive auditory display system Sound Cask for stage acoustics

Scattering Parameters for the Keefe Clarinet Tonehole Model

Evaluation of a Multiple versus a Single Reference MIMO ANC Algorithm on Dornier 328 Test Data Set

FIR/Convolution. Visulalizing the convolution sum. Convolution

Reverberation time and structure loss factor

Implementation of decentralized active control of power transformer noise

Disturbance Rejection Using Self-Tuning ARMARKOV Adaptive Control with Simultaneous Identification

ECE Digital Signal Processing

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

REAL-TIME BROADBAND NOISE REDUCTION

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

Subband coring for image noise reduction. Edward H. Adelson Internal Report, RCA David Sarnoff Research Center, Nov

Department of Electronic Engineering NED University of Engineering & Technology. LABORATORY WORKBOOK For the Course SIGNALS & SYSTEMS (TC-202)

Corona noise on the 400 kv overhead power line - measurements and computer modeling

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco

AUTOMATIC EQUALIZATION FOR IN-CAR COMMUNICATION SYSTEMS

ELEC-C5230 Digitaalisen signaalinkäsittelyn perusteet

Copyright S. K. Mitra

29th TONMEISTERTAGUNG VDT INTERNATIONAL CONVENTION, November 2016

Analytical and Experimental Approach to Acoustic Package Design

ECMA TR/105. A Shaped Noise File Representative of Speech. 1 st Edition / December Reference number ECMA TR/12:2009

Exponential Time Decay Constants of Marimba Bars

ODEON APPLICATION NOTE ISO Open plan offices Part 2 Measurements

EE 6422 Adaptive Signal Processing

Transcription:

OPTIMIZATION TECHNIQUES FOR PARAMETRIC MODELING OF ACOUSTIC SYSTEMS AND MATERIALS PACS: 43.55.Ka Matti Karjalainen, Tuomas Paatero, and Miikka Tikander Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing P.O.Box 3, FIN-215, HUT Espoo Finland Tel: +358 9 451 249 Fax: +358 9 46 224 E-mail: matti.karjalainen@hut.fi ABSTRACT Linear and time-invariant signal and system models are useful in compact characterization of acoustic transfer functions. In addition to compact representations of responses, such models are efficient in simulating acoustic systems for sound synthesis, artificial reverberation, etc. In this paper we propose parametric modeling techniques for room impulse responses (RIRs), insitu acoustic material measurements, and musical instrument modeling, based on ARMA models including Kautz filter models which require nonlinear optimization of the parameters. Low order models are applied to surface impedance modeling, and high-order models are used for complex responses such as RIRs and musical instruments. INTRODUCTION Physical modeling of acoustic systems is in practice based on numerical simulation, for example by solving partial difference equations. Often the problem is to model the response from a point to another so that the spatial distribution of waves is not of primary interest. This leads to using transfer functions and signal modeling. The signal processing approach has the advantage of being computationally highly efficient, which is important especially when real-time simulation is needed. This is the case for example in model-based sound synthesis of musical instruments, audio effects such as artificial reverberators, auralization of room or concert hall acoustics in acoustics design software, or equalization of loudspeaker-room responses in audio reproduction. In all these cases the system to be modeled can be considered being linear and time-invariant (LTI), and in practice also stable and causal. Further cases where the signal processing methodology is highly useful can be found in acoustic and audio measurements. For LTI acoustic systems the impulse response or corresponding frequency response can be measured and the task is often to find a compact parametric model to capture the essential features of the target response. Digital filters, in the form of finite impulse response (FIR) and infinite impulse response (IIR) filters, are powerful means of modeling target responses, both from analysis and synthesis points of view. In analysis applications we may obtain essential information about the measured process, such as (eigen)mode parameters of a room or a musical instrument body. In synthesis applications a compact parametric model in the form of a digital filter makes it possible to simulate a given system in real time. Although LTI models are linear in input-output relationships, the solving for optimal model parameters is not necessarily so. There are a number of parameter estimation techniques for LTI systems [1,2]. MA (moving average) modeling leads to an FIR type filter model with a k

coefficients being zero in its z-transform, in Eq.(1) below. While MA models are easily estimated, for reverberant and resonating systems they are not compact, computationally efficient, or analytically interesting. AR (autoregressive) models, with only b being non-zero in the numerator of z-transform in Eq.(1), are able to describe recursive feedback in a system, and are also relatively straightforward due to linearity of normal equations used to estimate filter parameters for example in the linear predictive (LP) autocorrelation method [3]. AR models are however, due to their minimum-phase property, limited in modeling capabilities, e.g., in modeling responses exhibiting inherent latencies in their frequency components. ARMA (autoregressive moving average) modeling is the most general form of LTI modeling with both a k and b k non-zero coefficients in Eq.(1), but there is no way to solve the optimal model parameters in a closed form, and thus iterative techniques of nonlinear optimization are needed, which brings potential problems in convergence, e.g., trapping to a local minumum not close to the global optimum. H ( z) N k = + b z N k b + b1 z +... + bn z k = = P P 1+ + + k a1z... apz 1 a k = k z 1 Nonlinear optimization of a k and b k in a general case is realized in iterative techniques such as Prony s method [4] or Steiglitz-McBride method [1] (both are available in the Matlab Signal Processing Toolbox). In a more general case of (low-order) nonlinear model fitting, iterative methods (such as Matlab function curvefit ) are available also for constraints in model parameters, as is show in the surface impedance modeling case below. In this paper we will explore cases where nonlinear optimization techniques are applied to acoustical modeling problems. We will explore parametric modeling for room impulse responses (RIRs), in-situ acoustic material measurements, and musical instrument modeling, based on pole-zero models, particularly orthonormal Kautz filters. Low-order models are applied to surface impedance case and high-order models are used for complex responses, such as RIRs and musical instruments. (1) MODELING OF ROOM IMPULSE RESPONSES Room impulse responses can be measured easily with modern computerized means, e.g., by using maximum length sequences (MLS). In principle about 1.5 x T 6, where T 6 is reverberation time, should be measured to obtain a 9 db dynamic range, but in practice the achieved dynamic range in often limited to 4-6 db. Thus a small room response may contain 1-5 samples and a concert hall may contain even more than 1 samples for a sample rate of 48 Hz. The goal of AR or ARMA modeling of such a response is either to obtain acoustically interesting parametric information or a compact filter model for a practical application. It turns out that it is not possible to estimate a single ARMA model for the full audio range. This is due to problems in convergence, numerical precision, and related unstability of iterative estimation procedures. Even filter orders below 1 (P and/or N in Eq.(1)) may turn unstable and above 3 it is seldom possible to get useful results at all. Here we first introduce ARMA modeling based on Kautz filters instead of more traditional pole-zero modeling. Case studies of room responses start with an example of limiting the modeling to low frequencies only. Then we discuss frequency zooming techniques, finally carrying out a demanding concert hall case using frequency-zooming Kautz modeling. Kautz Functions and Models Kautz filters [5,6] are a special class of fixed-pole IIR filters organized structurally to produce orthonormal tap-output impulse responses. The lowest order such filters, corresponding to any given set of desired stable poles, constitute an efficient tapped transversal structure, depicted in Fig.1. A particular Kautz filter is defined by a set of poles z i in the unit circle and with a corresponding set of somehow assigned tap-output weights w i, Eq.(2). Because of the transversal appearance and tap-output orthonormality, Kautz filters can be seen as a generalizations of the FIR filter. For more details on Kautz filter formulations and aspects of audio applications, see [7].

Figure 1. The Kautz filter. For z i = in Eq.(2) it degenerates to an FIR filter, for z i = a, -1<a<1, it is a Laguerre filter where the tap filters can be replaced by a common pre-filter. N * i 1 zizi z z H ( z) = wi i= 1 1 ziz j= 1 1 z jz * j Here we address the approximation of a given target response h(n) (or H(z)) using Kautz filters. The filter parametrization, i.e., choosing of the filter weights w i, is done in the least-square sense. For solving optimal values for Kautz poles z i, we have adopted a method from [8], called here the BU-method [7]. In the following examples this estimation method is used due to its favorable properties originating from the orthonormality of Eq.(2), although other ARMA techniques can be used also. Modeling of Low-Frequency Response of a Room First we apply Kautz models through the BU-method in (low-pass filtered and down-sampled) frequency range of -2 Hz to an impulse response, measured in a room of size 5.5 x 6.5 x 2.7 m 3. Note the relatively long modal decay time (about 1.3 s) at low frequencies. Figure 2 depicts the modeling accuracy both by magnitude response and by temporal envelope. A Kautz order 8 already yields a good fit and with orders >1 the result is perfect for practical purposes. (2) Magnitude / db Level / db 3 4 5 6 7 8 2 4 6 8 1 12 14 16 18 2 22 Frequency / Hz 2 4 6 8.2.4.6.8 1 Time / s Figure 2. Low-frequency Kautz modeling (-2 Hz) for a room, (a) magnitude responses of 8th order model (top) and target (bottom), with lines indicating Kautz pole positions, and (b) the decay envelope (top) compared to the target response (bottom). (a) (b) Frequency-Zooming ARMA Modeling Particularly for large rooms, such as concert halls, the required model order to capture the whole time-frequency range of diffuse modal pattern is not within the capability of direct ARMA modeling. There is, however, a technique to partition a measured response in the frequency domain and do the modeling in subbands as discussed in [9]. We call this frequency-zooming

ARMA (FZ-ARMA) modeling. In FZ-ARMA we modulate a subband down in frequency, apply low-pass filtering and decimation, and finally do ARMA modeling in this decimated sample rate domain. This can be done band-by-band, covering the whole audio range. The obtained model can be used as a highly efficient multirate decimated filterfank, or alternatively a composite polezero filter for the original sample rate is constructed from the subband filters. The zooming factor Kzoom for decimation can be selected so that the filter order within each subband remains manageable. As a challenging case we have applied FZ-ARMA to a 7-seat concert hall, the room response measured by an omnidirectional sound source (1-1 Hz) at the stage and a microphone in the 7 th row on the main floor. The audio range was partitioned in frequency bands of 25 Hz, and Kautz models of order around 5-3 (experimentally hand-tuned for each band) were used. The modeling accuracy was tested visually by different time-frequency plots and by informal listening to the original and the modeled response. The required filter order was highest (about 3) around 5-1 Hz, while above 5-1 khz much lower orders are enough. Note that from an auditory point of view even lower filter orders should yield perfect late reverberation [1]. Figure 3 shows the waterfall plots of time-frequency behavior of the original and the frequency-zooming Kautz modeled impulse response. (b) -1 1-2 -3 2 3-4 1 1 frequency/hz 1.8.6 time/s.4.2 4 1 1 frequency/hz 1.8.6 time/s.4.2 Figure 3: 1/3-octave waterfall time-frequency plots of original (left) and frequency-zooming Kautz modeled concert hall response. MODELING OF MUSICAL INSTRUMENTS As a case of musical instrument sound, the analysis and modeling of bell sounds is explored. A characteristic feature of bell sounds is that they are composed of an inharmonic set of partials [11], such as the one described by magnitude spectrum in Fig.4(b). Each partial is a decaying sinusoid that, in a closer inspection, Fig.4(c), turns out to be a pair or a group of modes very closely located in frequency. This leads to perceptually noticeable beating. In this case the modal group consists primarily of two modes with a frequency difference of about 2.5 Hz. FZ-ARMA is an excellent method for analyzing the modal groups of bell sounds. Figure 4 shows the envelope match obtained with three different FZ-ARMA orders for the 131 Hz modal group. The zooming factor Kzoom is 2 in each case and Steiglitz-McBride iteration was used instead of Kautz modeling. In view (e) the orders are N = and P = 4 cf. Eq.(1). Two pole pairs should in principle be sufficient for a double mode, but this all-pole (AR) case with N = does not allow proper phase matching, and thus the overall match remains poor. For ARMA orders N = 4 and P = 4 in (f) the relatively high number of zeros allows for excellent match with just two pole pairs. The same can be achieved with orders N = 2 to P = 6, i.e., by adding an extra pole pair and keeping the number of zeros minimal. For all resonances up to 1 khz for this bell sound, filter orders of N = 2 4 and P = 4 6 are suffcient for good modal decay matching so that a parallel filter, composed of modal group filters with a total order of about N = 4 and P = 5, can implement effcient and high-quality synthesis for the bell sound at a sampling rate of 225 Hz. Other musical instruments (guitar, piano) have been analyzed and modeled using a similar approach, see [9].

1 (a) -1.5 1 1.5 2 time/s 2.5 1 (b) 8 6 4 1 2 3 4 5 6 7 8 9 freq/hz 1 (c) 8 6 2 (d) -2.5 1 1.5 2 time/s 2.5 6 (e) 5 4 3.5 1 1.5 2 time/s 2.5 6 (f) 5 4 4 127 128 129 13 131 132 133 134 135 freq/hz 3.5 1 1.5 2 time/s 2.5 Figure 4. Analysis and modeling of a small bell sound: (a) recorded time-domain signal, (b) magnitude spectrum, (c) magnitude spectrum in the modal region around 131 Hz, (d) decay envelope of the 131 Hz modal group, (e) FZ-ARMA(,4), and (f) FZ-ARMA(4,4). MODELING OF ACOUSTIC SURFACE IMPEDANCE As the last example, a modeling problem of different scope is discussed. Acoustic surface impedance can be measured by different techniques. Particularly in in-situ measurements the accuracy tends to remain poor at low frequencies and when absorption is low. By low-order model fitting the data can be smoothed or the material can be characterized with few parameters. Figure 5 (left) shows an in-situ measurement setup [11] to obtain reflection impulse response with a microphone close to a material surface, and by time windowing of the response. The same absorption material is also measured in impedance tube and reverberation room. Fig. 5 (right) shows a case by absorption coefficient data, along with in-situ measurement using freefield and hard surface calibration references. It can be found that there is relatively much variance in data, and in-situ results look unreliable at low frequencies (negative absorption!). Figure 5: Left: in-situ system setup with (a) direct sound, (b) reflection (c) parasitic reflections to be removed. Right: data from different absorption coefficient measurements, including in-situ results with free-field and hard-wall references, for mineral wool (2 mm thick, 4 kg/m 3 ). The same material data was modeled by fitting a first-order pole-zero filter H r (z) to the measured reflection magnitude response, which is constrained to value 1. (no loss) at zero frequency: Nonlinear optimization of parameters a and b was computed by Matlab function polyfit to yield a least squares match. Figure 6 (left) shows the corresponding absortion coefficient, compared to raw data from free-field calibrated in-situ measurement, and a curve by model filter fitting that is obviously more useful at low frequencies that the raw data. For simple homogeneous absorbents this seems to work well, although there is no clear physical basis for such modeling.

Figure 6 (right) depicts results of more physical model fitting in a case where there is an air gap between 2 mm of the same absorbent as above and a hard wall. The flow resistance vs. acoustic impedance models by Delany-Bazley [12] and Mechel [13] were least-squares fitted to measured impedance tube data. As can be seen, generally the models are in fairly good agreement with measurements except at high frequencies. Figure 6: Left: low-pass reflection filter model fitted to in-situ measurement data (same as in Fig. 5) plotted by absorption coefficient as a function of frequency. Right: Absorption by the Delany- Bazley [12] and Mechel [13]) impedance models based on flow resistance parameter, compared to impedance tube measurement data (same mineral wool as in Fig. 5 with 2 mm air gap). SUMMARY Nonlinear optimization techniques have been applied in this study to parametric modeling of various acoustic systems. This is primarily based on finding discrete-time pole-zero transfer function models using ARMA methods, except in the last case, acoustic impedance modeling based on flow resistance parameter. Frequency-zooming ARMA modeling is shown to be a powerful technique in dealing with highly complex room responses and fairly complex musical instrument sounds. The estimated models can be applied also in real-time simulation and synthesis purposes, for example in artificial reverberation or spatialization/auralization systems for virtual acoustic reality. ACKNOWLEDGMENTS Tuomas Paatero s work has been part of the Academy of Finland project Technology for Audio and Speech Processing. Miikka Tikander s work has been part of the Tekes project TAKU. REFERENCES [1] M.H. Hayes, Statistical Digital Signal Processing and Modeling. Prentice-Hall, 1996. [2] S.M. Kay, Fundamentals of Statistical Signal Processing: Vol. I: Estimation Theory. Prentice-Hall, 1993. [3] J.D. Markel and J.H.G. Gray, Linear Prediction of Speech Signals. Springer-Verlag, Berlin, 1976. [4] T.W. Parks and C.S. Burrus, Digital Filter Design. New York: John Wiley & Sons, 1987. [5] W.H. Kautz, Transient Synthesis in the Time Domain, IRE Trans. Circuit Theory, vol. CT1, pp. 29-39. 1954. [6] P.W. Broome, Discrete Orthonormal Sequences, J. Assoc. Comp. Mach., vol. 12, (2), pp. 151-168, 1965. [7] T. Paatero and M. Karjalainen, Kautz Filters and Generalized Frequency Resolution -- Theory and Audio Applications, Preprint 5378 AES 11th Convention, Amsterdam, 21. [8] H. Brandenstein and R. Unbehauen, Least-Squares Approximation of FIR by IIR Digital Filters, IEEE Trans. Sig. Proc., vol. 46, (1), pp. 21-3, 1998. [9] Karjalainen et al., AR/ARMA Analysis and Modeling of modes in resonant and reverberant systems, Preprint in AES 11th Convention, Munich, 22. [1] Karjalainen M. and Järveläinen H., More About this Reverberation Science: Perceptually Good Late Reverberation, Preprint 5415 AES 11th Convention, New York, 21. [11] Karjalainen, M., and Tikander, M., ''Reducing Artefacts of In-Situ Surface Impedance Measurements,'' in Proceedings of the 17th Int. Congr. on Acoust. (ICA), vol. 2, p. 393, Rome, Italy, September 2-7, 21. [12] Delaney, M.E. and Bazley, E.N., Acoustical properties of fibrous absorbent materials, Appl Acoust. 3(197), pp. 15-116 [13] Mechel, F.P and Ver, I., Sound absorbing materials and sound absorbers, in the book: Beranek L, Ver I (ed.), Noise and Vibration Control Engineering, John Wiley & Sons, New York, 1992.