Multizone Wideband Reproduction of Speech Soundfields

Size: px
Start display at page:

Download "Multizone Wideband Reproduction of Speech Soundfields"

Transcription

1 Multizone Wideband Reproduction of Speech Soundfields Associate Professor Christian Ritz School of Electrical, Computer and Telecommunications Engineering, University of Wollongong

2 Overview Brief introduction to the University of Wollongong, School of Electrical, Computer and Telecommunications Engineering (SECTE) Overview of spatial audio the University of Wollongong Mathematical description of soundfields Brief fundamentals Reproducing soundfields using loudspeakers Single zone approaches Multizone soundfield reproduction using loudspeakers Orthogonal basis expansion approach Conclusions and open research challenges

3 Where is Wollongong? About 1 hour south of Sydney on the East coast of NSW

4 The city of Wollongong 203,500 Population in Wollongong 292,500 Population of Illawarra area 22 C Average daily temperature (71.6 F) 27 C Average summer temperature (80.6 F) 17 Patrolled beaches View from the escarpment (small mountains)

5 University of Wollongong 2011: Celebrated their 60 th Anniversary Total number of students: 31,464 (as of 2014) 12,811 international students in Australia and abroad Rankings: Top 2% of research universities in the world (QS and Times Higher Education World University Rankings 2013/14) Recently improved ranking to 31 st in the world s top 100 younger universities in the 2015 Times Higher Education (THE) 100 Under 50 Also ranked by TE to be in the top 250 institutions in the world in the subject fields of Electrical and Electronic Engineering and Computer Science and Information Systems UOW Library UOW Innovation Campus

6 School of Electrical, Computer and Telecommunications Engineering (SECTE): Students and Staff Around 25 Academic Staff and many research students Degrees offered: Undergraduate : Bachelor of Engineering majoring in Electrical Engineering, Computer Engineering or Telecommunications Engineering Postgraduate: Masters by Coursework, Masters by Research and PhD ~300 Bachelor Students, ~200 Graduate students (including ~50 PhD students) ~50% of students are international

7 School of Electrical Computer and Telecommunications Engineering: Research Major research groups and labs ADVANCED MANUFACTURING TECHNOLOGIES (AMT) Australian Power Quality & Reliability Centre Centre for Intelligent Mechatronics Research (CIMR) INFORMATION AND COMMUNICATIONS TECHNOLOGY RESEARCH (ICTR) Emerging Networks & Applications (ENA) Optoelectronic Signal Processing Research Lab (OSPR) Visual & Audio Signal Processing (VASP) SUSTAINABLE BUILDINGS RESEARCH CENTRE

8 SPATIAL AUDIO THE UNIVERSITY OF WOLLONGONG Brief overview

9 Research led by A/Prof Ritz Current Flagship Projects: Microphone arrays for sound processing Spatial Audio Signal processing, compression, enhancement and reproduction Acoustic design of microphone arrays, loudspeaker arrays and customised musical instruments Quality of Experience (QoE) for multimedia Team: Research students (mostly PhD) selected projects: Ad-Hoc Microphone Arrays for Speech and audio signal enhancement and localisation Spatial Audio Coding and Enhancement Multizone audio reproduction and control Single and Multichannel speech signal enhancement based on dictionary learning and sparse coding Quality of Experience for Image Matching Applications Expertise: Digital Signal Processing for speech, audio, multimedia applications Sound source localisation using microphones 3D audio recording, analysis, synthesis and coding Human perception of multimedia and social media Multimedia annotation and semantics International standards for multimedia communication and processing Collaboration Highlights: RMIT University (spatial audio, semantics of multimedia) Smart Services CRC (New Media Services) collaborating with Fairfax Media to deliver AirLink University of Klagenfurt, Austria Peking University (Shenzhen Graduate School), China (Microphone arrays) Beijing University of Technology (speech coding and enhancement, spatial audio) Plus numerous other collaborators in the Faculty and University

10 Research Facilities UOW Anechoic Chamber Configurable Hemispheric Environment for Spatialised Sound (CHESS): 16 loudspeaker hemisphere for 3D sound reproduction Unique collaboration with staff from the Creative Arts discipline Audio hardware equipment and software Microphone arrays, loudspeaker arrays, amplifiers, pre-amps, custom hardware and software Technical workshop resources UOW Anechoic Chamber: CHESS Plus access to University research infrastructure and expertise including High Performance Computing, electronics and mechanical workshops, 3D printing services etc.

11 Spatial (3D) Audio Communication Multiple people, multiple sites communicating through a network Systems key stages: Microphone Array Recording Sound scene analysis and enhancement Compression Spatial Audio Rendering Spatial Audio Communication System

12 Microphone Array Recording A microphone array is required for Source location (Direction of Arrival (DOA)) estimation and separation into individual speech objects Multichannel speech enhancement in noisy environments We have researched miniaturised coincident microphone arrays Coincident: Microphones located very close together- increased performance and much smaller than alternative technologies Acoustic Vector Sensor (AVS): Uses specialist gradient microphones to record pressure gradient in x, y and z directions Miniature B-Format microphones: can also provide x, y and z directions but using standard pressure microphone 1 cm Example AVS designed by our lab 3D printed B- Format microphone

13 Enhancement based on Multichannel Linear Prediction (MC-LP) Apply MC-LP to the AVS channels o(n) x(n) y(n) z(n) MC-LP LP coefficients Residual signal Key idea: The speech signal can then be enhanced by further processing of both the spectrum represented by the LP coefficients and the residual signal Enhanced speech is then produced using the enhanced residual and LP coefficients Shujau, M., Ritz, C., Burnett, I., Speech Dereverberation Based On Linear Prediction: An Acoustic Vector Sensor Approach, Proc. ICASSP'2013, pp. 1-5, Vancouver, Canada, May

14 Multichannel 3D audio coding Spatially Squeezed Surround Audio Coding (S 3 AC) Converts 5 channel surround to 2 channel stereo Stereo signal can be compressed with existing technology (e.g. MP3) 5 channel recovered from analysis of the decoded stereo signal S 3 AC 3D: Generalisation to more than 5 channels E.g. 16 channel 3D audio coded at 128 kbps results in equivalent subjective quality to separate coding of each channel via AAC Bin Cheng; Ritz, C.; Burnett, I.; Xiguang Zheng, "A General Compression Approach to Multi-Channel Three-Dimensional Audio," Audio, Speech, and Language Processing, IEEE Transactions on, vol.21, no.8, pp.1676,1688, Aug. 2013

15 Compression of Multiple Soundfield Zones Example application: Spatial Audio teleconferencing Analysis of speakers using microphone array analysis Extract individual talkers as separate spatial audio objects Joint compression of spatial audio objects Spatial audio playback at each site, combining all streams from each other site Multiple, distributed sites One or more talkers at each site Xiguang Zheng; Ritz, C.; Jiangtao Xi, "Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach," Audio, Speech, and Language Processing, IEEE Transactions on, vol.21, no.1, pp.29,38, Jan

16 MATHEMATICAL DESCRIPTION OF SOUNDFIELDS Brief fundamentals

17 What is a sound field? Sound that has both amplitude and direction E.g. 2D spatial audio, 3D spatial audio, binaural audio In real environments, includes reverberation due to echo off walls, ceiling and floor Described mathematically as the solution of the acoustic wave equation: 2 p z, t 1 c 2 ρ 2 p(z,t) t 2 = 0 p z, t is the pressure at a point in space z = x, y, z and at time t 2 is the laplacian operator i.e. 2 = 2 p x p y p z 2 c is the speed of sound 340 m/s

18 Planewave solution Consider the 2D case and define x = (x, y) as a position in space (in cartesian coordinates), which is equivalent to r, θ in polar coordinates i.e. (x, y)= rcosθ, rsinθ A simple solution to the wave equation for 2D soundfields is a function of planewaves A sound wave of constant frequency travelling in a specific direction θ Source:

19 Cylindrical Harmonic Expansion Description of the soundfield as weighted sum of cylindrical harmonic functions + S x, k = α m (k)j m (kr)e jmθ where J m is the m th order Bessel function of the first kind, j = 1, and α m (k) are known as the Fourier-Bessel coefficients Can be solved to find α m (k) but not ideal approach zeros of J m (kr) cause problems

20 Green s Functions Green s functions describe the acoustic transfer function between two points in space For an impulse source arriving from location z s, s z s, k, the 3D Green s function in the free field (anechoic) is: 1 p z z s, k = e jk(z zs), where k = ω 4π z z s c wavenumber and j = 1 For 2D: p x x s, k = j 4 H 0 1 k x s x, is the where H 1 0 is the zeroth order Hankel function of the first kind (a function of Bessel functions - see mathematics literature for definition) Frequency domain soundfield at listening point, S x, k, is multiplication of source with the Green s function (or convolution in the time domain): S x, k = s x s, k j 4 H 0 1 k x s x Source, s x s, k Listening point, S x, k

21 Green s functions and Superposition Using Green s functions, we can derive the soundfield at a point x due to an impulse as the sum of the contribution of all sources arriving from all locations i.e., in 2D for l = 1 to L sources: p x, k = L j 4 H 0 1 k x l x l=1 Hence, total soundfield at x is : Source 2, s x 2, k Source L, s x L, k Source 1, s x 1, k Listening point, S x, k L S x, k = s x l, k l=1 j 4 H 0 1 k x l x

22 Broadband Soundfields Previous slide was for single frequency Reproducing broadband (i.e. multiple frequency) soundfields (e.g. speech) requires inverse Fourier transform of S x, k or alternatively a Fourier series representation Total (time-domain) broadband soundfield s b x, t assuming each source contains frequencies k =1 to K: K s b x, t = S x, k k=1

23 REPRODUCING SOUNDFIELDS USING LOUDSPEAKERS Single zone approaches

24 Reproducing sound fields using Aim: Find loudspeaker signals, D x l, ω, such that a soundfield within a restricted region (zone) is accurately reproduced i.e. we want P(x, ω) to match a desired (virtual) soundfield produced by source S x, ω Loudspeakers Loudspeakers D(x l, ω) Virtual Sound source, S x, ω x l D P(x, ω) x Surface surrounding volume D D

25 Existing Loudspeaker Approaches Simpler (panning-based) loudspeaker approaches: Stereo (2 channels) only (limited) 2D sound fields reproduced Surround sound (5.1 channels) 2D sound fields (but with limited accuracy in reproducing source direction) Ambisonics (basic approach) Vector Base Amplitude Panning (VBAP) More sophisticated (sound field synthesis) approaches: Higher Order Ambisonics Wave Field Synthesis (WFS) Least Squares

26 Panning-based approaches Sound source direction perceived based on level differences between loudspeaker signals Channel pairs: stereo, 5.1 surround and VBAP Channel triplets: 3D VBAP All channels: Ambisonics Relatively simpler signal processing compared to sound field synthesis approaches Good results for practical numbers of loudspeakers Ambisonics reproduction using UOW 16 channel hemisphere

27 Sound Field Synthesis (SFS) Consider an enclosed volume D of surface D Synthesis soundfield using multiple loudspeakers (as secondary sources) located at positions x l, l = 1 to L SFS synthesis equation: Loudspeakers D P(x, ω) x S x, ω = D x l, ω G x x l, ω da(x l ) D(x l, ω) x l G(x x l, ω) D D x l, ω are the loudspeaker signal weights G x x l, ω is the Green s function between x and x l da(x l ): surface area of the enclosure Virtual Sound source, S x, ω Surface surrounding volume D D

28 SFS Solutions Higher Order Ambisonics (HOA) Loudspeaker signals derived by decomposing the sound field into an orthogonal set of basis functions Often relies on planewave representations of sound fields Wave Field Synthesis (WFS) Loudspeaker signals derived by directly solving the SFS synthesis integral on the previous slide Comparing HOA and WFS for the same number of loudspeakers WFS produces accurate first wavefront across a larger area but with strong artefacts outside this area HOA produces accurate sound field but within a smaller area HOA and WFS generally require analytic solutions to the integral expressions Assume continuous functions Alternative Least Squares solutions: Minimise the least squared error between desired and achievable pressure values for a given set up (i.e. number of loudspeakers) and spatial sampling resolution (i.e. number of chosen pressure samples) within a chosen reproduction region

29 MULTIZONE SOUNDFIELD REPRODUCTION USING LOUDSPEAKERS Orthogonal basis expansion approach

30 Room Key Question Active zone 1: Listening to speech or music Active zone 2: Listening to speech or music Quiet zone: No external sound How can we create multiple independent listening zones within a room? Using a single set of loudspeakers

31 Multizone Soundfields Move from one zone to multiple zones Example: Region D contains three sub regions Bright zone, D b quiet zone D q unattended zone is all other space in the region D Direction of the desired planewave in D b is θ and is reproduced by loudspeakers positioned at x l (or equivalently φ l ) with first loudspeaker at φ Each zone has radius r within a total region D of size R surrounded by a loudspeaker array or radius R l

32 Existing solutions 2D multizone reproduction first introduced in 2008 by Polletti [5] and based on least squares approach to SFS A multizone approach using cylindrical harmonic expansion proposed by Wu and Abhayapala in 2011 [6] Previous approaches attempt to completely suppress any interzone interference Can result in impractically large loudspeaker signal amplitudes More recent work by Jin and Kleijn [7] uses an orthogonal basis expansion approach with weightings to control reproduction of each zone according to their importance Leads to more practical numbers of loudspeakers Conceptually similar approach also proposed by Chen, Abhayapala, and Zhang [8] Limited work on practical solutions for multiple frequency soundfields Multizone narrowband and wideband speech soundfields presented in 2013 by Radmanesh and Burnett [9]

33 Orthogonal Basis Expansion Similar to a planewave decomposition, a soundfield can be described more generally as a weighted sum of orthogonal basis functions: S x, k = C n G n x, k n Where G n x, k are the basis functions and C n are the coefficients, which can be derived using a inner product as: C n = D S x, k G n x, k d x

34 Weighted Orthogonal Basis Expansion Add a weighting term to the inner product i.e.: C n = w(x)s x, k G n x, k dx D, n = 1 to N Weights can be chosen in various ways e.g. assume constant weight values within each zonei.e. w(x b )=w b, w(x q )=w q, w(x u )=w u for the bright, quiet and unattended zones, respectively In our work, we explore alternative weighting schemes and efficient implementations Basic idea: Derive the coefficients for chosen set of basis functions that minimises reconstruction error between modelled and desired soundfield

35 Deriving the basis functions We begin with a set of P planewaves We factorise these into an orthogonal set of N basis functions or wavefields Via QR factorisation of a set of P planewaves, F p (x, k), arriving from angles 0 2π The resulting soundfield can then be described as: S x, k = P p (k, w)f p (x, k) p Where coefficients P p (k, w) are related to the desired C n values via the QR decomposition (see [7] for more details)

36 Deriving loudspeaker signal weights Recall from before, free field soundfield at a point x produced by multiple sources is sum of signals arriving from each loudspeaker For loudspeaker signal weights at position l derived using above approach with weighting w: L S x, k = d l k, w j l=1 H k x l x Loudspeaker signal weights derived based on the orthogonal decomposition previously desribed Basic idea: Find loudspeaker signal weights, d l k, w, to produce the desired soundfield described by the orthogonal set of basis wavefields

37 Deriving loudspeaker signal weights Loudspeaker signal weights derived as d l k, w below H m 1 is the m th order Hankel function of the first kind M = kr is the truncation length [10] (based on spatial frequency and size of the region) φ pw = p 1 φ, where φ = 2π P are angles of arrival of the planewaves φ l are the angular position of the L loudspeakers separated by an angular spacing of φ s M d l k, w = 2(jπH m 1 (kr l )) 1 m= M p P p (k, w)j m e jmφ pw e jmφ l φ s

38 Reproducing broadband speech soundfields d l k, w are the weights that must be applied to a desired source signal e.g. a speech signal to reproduce a desired soundfield E.g. assume we wish to synthesise a speech soundfield at an arbitrary location, x, in the reproduction area, D, using a set of L loudspeakers The value of the loudspeaker signals, Y l x, k, required to reproduce this soundfield at x is: Y l x, k = d l k, w Y k, where d l k, w is the free field loudspeaker signal weights as given on the earlier slide Y(k) is the complex Fourier coefficient of a desired time domain virtual source, y(n), e.g. a desired speech signal We can also derive the estimated soundfield reproduced by the array of loudspeakers such that we can minimise the mean squared error between the desired and actual soundfield And understand the impact of the weights used for the basis functions

39 Reproduction of multiple frequencies (i.e. broadband speech soundfields) The pressure generated by the loudspeakers at any point in the reproduced soundfield is given by, K k p x, w = S x, k, w where there are K different sinusoidal components. Multiple nested summations are required to derive this value: Here, a summation occurs for every sample in the reproduction region, D, for P planewaves, for N basis wavefields, for 2M+1 modes, for L loudspeakers and for K sinusoidal components This is computationally demanding E.g. a three second audio file sampled at 16 khz requires approximately independent reproductions due to the QR decomposition Hence, more efficient implementations are required

40 Examples Soundfields best visualised using animations of single frequencies Example desired soundfields generated based on weighted orthogonal basis expansion Orthogonal Basis Expansion with one frequency (2 khz) Independent Multizone Soundfields with Orthogonal Basis Expansion (2 khz and 1.25 khz) Example soundfields reproduced using loudspeakers Loudspeaker Reproduced Orthogonal Basis Expansion Loudspeaker Reproduced Occlusion Problem Example showing affect of varying the weightings used for the basis functions Loudspeaker Reproduced Multizone Soundfield with Varied Weighting

41 Conclusions and open research challenges Multizone soundfield reproduction is theoretical possible and simulations demonstrate it s practical feasibility Current approaches are computationally demanding e.g. a 3 s audio file sampled at 16 khz requires approximately independent derivations due to the QR decomposition and nested summations to derive multiple frequency components We are investigating codebook-based approaches to reduce complexity Techniques require many loudspeakers governed by desired bandwidth of the sound sources as well as size of the reproduction area We are working on alternative weighting approaches to reduce loudspeaker counts whilst maintaining perceptual quality of the reproduced soundfield Some initial results to be published at ChinaSIP 2015

42 References 1. Ahrens, J., Rabenstein, R., Spors, S., Sound Field Synthesis for Audio Presentation, Acoustics Today, pp , Vol. 10, Iss. 2, Spring Spors, S.; Wierstorf, H.; Raake, A.; Melchior, F.; Frank, M.; Zotter, F., "Spatial Sound With Loudspeakers and Its Perception: A Review of the Current State," Proceedings of the IEEE, vol.101, no.9, pp.1920,1938, Sept R. Rabenstein and S. Spors, Sound Field Reproduction, in Springer Handbook of Speech Processing, P. J. B. Dr, P. M. M. Sondhi, and P. Y. (Arden) H. Dr, Eds. Springer Berlin Heidelberg, 2008, pp M. Kolundzija, C. Faller, and M. Vetterli, Reproducing Sound Fields Using MIMO Acoustic Channel Inversion, JAES, vol. 59, no. 10, pp , Nov M. Poletti, An Investigation of 2-D Multizone Surround Sound Systems, presented at the Audio Engineering Society Convention 125, Y. J. Wu and T. D. Abhayapala, Spatial multizone soundfield reproduction: Theory and design, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 6, pp , W. Jin, W. B. Kleijn, and D. Virette, Multizone soundfield reproduction using orthogonal basis expansion, presented at the Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, 2013, pp H. Chen, T. D. Abhayapala, and W. Zhang, Enhanced sound field reproduction within prioritized control region, in INTER-NOISE and NOISE-CON Congress and Conference Proceedings, 2014, vol. 249, pp N. Radmanesh and I. S. Burnett, Generation of isolated wideband sound fields using a combined twostage lasso-ls algorithm, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 2, pp , Y. J. Wu and T. D. Abhayapala, Theory and design of soundfield reproduction using continuous loudspeaker concept, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, no. 1, pp , 2009.

43

A spatial squeezing approach to ambisonic audio compression

A spatial squeezing approach to ambisonic audio compression University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 A spatial squeezing approach to ambisonic audio compression Bin Cheng

More information

Wave Field Analysis Using Virtual Circular Microphone Arrays

Wave Field Analysis Using Virtual Circular Microphone Arrays **i Achim Kuntz таг] Ш 5 Wave Field Analysis Using Virtual Circular Microphone Arrays га [W] та Contents Abstract Zusammenfassung v vii 1 Introduction l 2 Multidimensional Signals and Wave Fields 9 2.1

More information

Soundfield Navigation using an Array of Higher-Order Ambisonics Microphones

Soundfield Navigation using an Array of Higher-Order Ambisonics Microphones Soundfield Navigation using an Array of Higher-Order Ambisonics Microphones AES International Conference on Audio for Virtual and Augmented Reality September 30th, 2016 Joseph G. Tylka (presenter) Edgar

More information

Measuring impulse responses containing complete spatial information ABSTRACT

Measuring impulse responses containing complete spatial information ABSTRACT Measuring impulse responses containing complete spatial information Angelo Farina, Paolo Martignon, Andrea Capra, Simone Fontana University of Parma, Industrial Eng. Dept., via delle Scienze 181/A, 43100

More information

Broadband Microphone Arrays for Speech Acquisition

Broadband Microphone Arrays for Speech Acquisition Broadband Microphone Arrays for Speech Acquisition Darren B. Ward Acoustics and Speech Research Dept. Bell Labs, Lucent Technologies Murray Hill, NJ 07974, USA Robert C. Williamson Dept. of Engineering,

More information

Encoding higher order ambisonics with AAC

Encoding higher order ambisonics with AAC University of Wollongong Research Online Faculty of Engineering - Papers (Archive) Faculty of Engineering and Information Sciences 2008 Encoding higher order ambisonics with AAC Erik Hellerud Norwegian

More information

Spatialized teleconferencing: recording and 'Squeezed' rendering of multiple distributed sites

Spatialized teleconferencing: recording and 'Squeezed' rendering of multiple distributed sites University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2008 Spatialized teleconferencing: recording and 'Squeezed' rendering

More information

Subband Analysis of Time Delay Estimation in STFT Domain

Subband Analysis of Time Delay Estimation in STFT Domain PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,

More information

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals Maria G. Jafari and Mark D. Plumbley Centre for Digital Music, Queen Mary University of London, UK maria.jafari@elec.qmul.ac.uk,

More information

SPATIAL SOUND REPRODUCTION WITH WAVE FIELD SYNTHESIS

SPATIAL SOUND REPRODUCTION WITH WAVE FIELD SYNTHESIS AES Italian Section Annual Meeting Como, November 3-5, 2005 ANNUAL MEETING 2005 Paper: 05005 Como, 3-5 November Politecnico di MILANO SPATIAL SOUND REPRODUCTION WITH WAVE FIELD SYNTHESIS RUDOLF RABENSTEIN,

More information

UNIVERSITÉ DE SHERBROOKE

UNIVERSITÉ DE SHERBROOKE Wave Field Synthesis, Adaptive Wave Field Synthesis and Ambisonics using decentralized transformed control: potential applications to sound field reproduction and active noise control P.-A. Gauthier, A.

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis

Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis Hagen Wierstorf Assessment of IP-based Applications, T-Labs, Technische Universität Berlin, Berlin, Germany. Sascha Spors

More information

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION T Spenceley B Wiggins University of Derby, Derby, UK University of Derby,

More information

Outline. Context. Aim of our projects. Framework

Outline. Context. Aim of our projects. Framework Cédric André, Marc Evrard, Jean-Jacques Embrechts, Jacques Verly Laboratory for Signal and Image Exploitation (INTELSIG), Department of Electrical Engineering and Computer Science, University of Liège,

More information

Audio Compression using the MLT and SPIHT

Audio Compression using the MLT and SPIHT Audio Compression using the MLT and SPIHT Mohammed Raad, Alfred Mertins and Ian Burnett School of Electrical, Computer and Telecommunications Engineering University Of Wollongong Northfields Ave Wollongong

More information

COMPARISON OF MICROPHONE ARRAY GEOMETRIES FOR MULTI-POINT SOUND FIELD REPRODUCTION

COMPARISON OF MICROPHONE ARRAY GEOMETRIES FOR MULTI-POINT SOUND FIELD REPRODUCTION COMPARISON OF MICROPHONE ARRAY GEOMETRIES FOR MULTI-POINT SOUND FIELD REPRODUCTION Philip Coleman, Miguel Blanco Galindo, Philip J. B. Jackson Centre for Vision, Speech and Signal Processing, University

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

Improving speech privacy in personal sound zones

Improving speech privacy in personal sound zones University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 26 Improving speech privacy in personal sound zones

More information

A Directional Loudspeaker Array for Surround Sound in Reverberant Rooms

A Directional Loudspeaker Array for Surround Sound in Reverberant Rooms Proceedings of 2th International Congress on Acoustics, ICA 21 23 27 August 21, Sydney, Australia A Directional Loudspeaker Array for Surround Sound in Reverberant Rooms T. Betlehem (1), C. Anderson (2)

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4 SOPA version 2 Revised July 7 2014 SOPA project September 21, 2014 Contents 1 Introduction 2 2 Basic concept 3 3 Capturing spatial audio 4 4 Sphere around your head 5 5 Reproduction 7 5.1 Binaural reproduction......................

More information

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Engineering

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

ROOM IMPULSE RESPONSES AS TEMPORAL AND SPATIAL FILTERS ABSTRACT INTRODUCTION

ROOM IMPULSE RESPONSES AS TEMPORAL AND SPATIAL FILTERS ABSTRACT INTRODUCTION ROOM IMPULSE RESPONSES AS TEMPORAL AND SPATIAL FILTERS Angelo Farina University of Parma Industrial Engineering Dept., Parco Area delle Scienze 181/A, 43100 Parma, ITALY E-mail: farina@unipr.it ABSTRACT

More information

SPHERICAL MICROPHONE ARRAY BASED IMMERSIVE AUDIO SCENE RENDERING. Adam M. O Donovan, Dmitry N. Zotkin, Ramani Duraiswami

SPHERICAL MICROPHONE ARRAY BASED IMMERSIVE AUDIO SCENE RENDERING. Adam M. O Donovan, Dmitry N. Zotkin, Ramani Duraiswami SPHERICAL MICROPHONE ARRAY BASED IMMERSIVE AUDIO SCENE RENDERING Adam M. O Donovan, Dmitry N. Zotkin, Ramani Duraiswami Perceptual Interfaces and Reality Laboratory, Computer Science & UMIACS, University

More information

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett 04 DAFx DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS Guillaume Potard, Ian Burnett School of Electrical, Computer and Telecommunications Engineering University

More information

COLOURATION IN 2.5D LOCAL WAVE FIELD SYNTHESIS USING SPATIAL BANDWIDTH-LIMITATION

COLOURATION IN 2.5D LOCAL WAVE FIELD SYNTHESIS USING SPATIAL BANDWIDTH-LIMITATION 27 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 5-8, 27, New Paltz, NY COLOURATION IN 2.5D LOCAL WAVE FIELD SYNTHESIS USING SPATIAL BANDWIDTH-LIMITATION Fiete Winter,

More information

Virtual Sound Source Positioning and Mixing in 5.1 Implementation on the Real-Time System Genesis

Virtual Sound Source Positioning and Mixing in 5.1 Implementation on the Real-Time System Genesis Virtual Sound Source Positioning and Mixing in 5 Implementation on the Real-Time System Genesis Jean-Marie Pernaux () Patrick Boussard () Jean-Marc Jot (3) () and () Steria/Digilog SA, Aix-en-Provence

More information

SYNTHESIS OF DEVICE-INDEPENDENT NOISE CORPORA FOR SPEECH QUALITY ASSESSMENT. Hannes Gamper, Lyle Corbin, David Johnston, Ivan J.

SYNTHESIS OF DEVICE-INDEPENDENT NOISE CORPORA FOR SPEECH QUALITY ASSESSMENT. Hannes Gamper, Lyle Corbin, David Johnston, Ivan J. SYNTHESIS OF DEVICE-INDEPENDENT NOISE CORPORA FOR SPEECH QUALITY ASSESSMENT Hannes Gamper, Lyle Corbin, David Johnston, Ivan J. Tashev Microsoft Corporation, One Microsoft Way, Redmond, WA 98, USA ABSTRACT

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

Reducing comb filtering on different musical instruments using time delay estimation

Reducing comb filtering on different musical instruments using time delay estimation Reducing comb filtering on different musical instruments using time delay estimation Alice Clifford and Josh Reiss Queen Mary, University of London alice.clifford@eecs.qmul.ac.uk Abstract Comb filtering

More information

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE Lifu Wu Nanjing University of Information Science and Technology, School of Electronic & Information Engineering, CICAEET, Nanjing, 210044,

More information

Holographic Measurement of the Acoustical 3D Output by Near Field Scanning by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch

Holographic Measurement of the Acoustical 3D Output by Near Field Scanning by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch Holographic Measurement of the Acoustical 3D Output by Near Field Scanning 2015 by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch LOGAN,NEAR FIELD SCANNING, 1 Introductions LOGAN,NEAR

More information

BREAKING DOWN THE COCKTAIL PARTY: CAPTURING AND ISOLATING SOURCES IN A SOUNDSCAPE

BREAKING DOWN THE COCKTAIL PARTY: CAPTURING AND ISOLATING SOURCES IN A SOUNDSCAPE BREAKING DOWN THE COCKTAIL PARTY: CAPTURING AND ISOLATING SOURCES IN A SOUNDSCAPE Anastasios Alexandridis, Anthony Griffin, and Athanasios Mouchtaris FORTH-ICS, Heraklion, Crete, Greece, GR-70013 University

More information

Book Chapters. Refereed Journal Publications J11

Book Chapters. Refereed Journal Publications J11 Book Chapters B2 B1 A. Mouchtaris and P. Tsakalides, Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications, in New Directions in Intelligent Interactive Multimedia,

More information

DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY

DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY DIRECTIONAL CODING OF AUDIO USING A CIRCULAR MICROPHONE ARRAY Anastasios Alexandridis Anthony Griffin Athanasios Mouchtaris FORTH-ICS, Heraklion, Crete, Greece, GR-70013 University of Crete, Department

More information

arxiv: v1 [cs.sd] 4 Dec 2018

arxiv: v1 [cs.sd] 4 Dec 2018 LOCALIZATION AND TRACKING OF AN ACOUSTIC SOURCE USING A DIAGONAL UNLOADING BEAMFORMING AND A KALMAN FILTER Daniele Salvati, Carlo Drioli, Gian Luca Foresti Department of Mathematics, Computer Science and

More information

In air acoustic vector sensors for capturing and processing of speech signals

In air acoustic vector sensors for capturing and processing of speech signals University of Wollongong Research Online University of Wollongong Thesis Collection University of Wollongong Thesis Collections 2011 In air acoustic vector sensors for capturing and processing of speech

More information

Analysis of Frontal Localization in Double Layered Loudspeaker Array System

Analysis of Frontal Localization in Double Layered Loudspeaker Array System Proceedings of 20th International Congress on Acoustics, ICA 2010 23 27 August 2010, Sydney, Australia Analysis of Frontal Localization in Double Layered Loudspeaker Array System Hyunjoo Chung (1), Sang

More information

SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES

SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES SF Minhas A Barton P Gaydecki School of Electrical and

More information

Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments

Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments Performance Evaluation of Nonlinear Speech Enhancement Based on Virtual Increase of Channels in Reverberant Environments Kouei Yamaoka, Shoji Makino, Nobutaka Ono, and Takeshi Yamada University of Tsukuba,

More information

Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources

Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources Sebastian Braun and Matthias Frank Universität für Musik und darstellende Kunst Graz, Austria Institut für Elektronische Musik und

More information

Airo Interantional Research Journal September, 2013 Volume II, ISSN:

Airo Interantional Research Journal September, 2013 Volume II, ISSN: Airo Interantional Research Journal September, 2013 Volume II, ISSN: 2320-3714 Name of author- Navin Kumar Research scholar Department of Electronics BR Ambedkar Bihar University Muzaffarpur ABSTRACT Direction

More information

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial

More information

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment

Study Of Sound Source Localization Using Music Method In Real Acoustic Environment International Journal of Electronics Engineering Research. ISSN 975-645 Volume 9, Number 4 (27) pp. 545-556 Research India Publications http://www.ripublication.com Study Of Sound Source Localization Using

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 MEASURING SPATIAL IMPULSE RESPONSES IN CONCERT HALLS AND OPERA HOUSES EMPLOYING A SPHERICAL MICROPHONE ARRAY PACS: 43.55.Cs Angelo,

More information

MPEG-4 Structured Audio Systems

MPEG-4 Structured Audio Systems MPEG-4 Structured Audio Systems Mihir Anandpara The University of Texas at Austin anandpar@ece.utexas.edu 1 Abstract The MPEG-4 standard has been proposed to provide high quality audio and video content

More information

The analysis of multi-channel sound reproduction algorithms using HRTF data

The analysis of multi-channel sound reproduction algorithms using HRTF data The analysis of multichannel sound reproduction algorithms using HRTF data B. Wiggins, I. PatersonStephens, P. Schillebeeckx Processing Applications Research Group University of Derby Derby, United Kingdom

More information

Spatial Audio & The Vestibular System!

Spatial Audio & The Vestibular System! ! Spatial Audio & The Vestibular System! Gordon Wetzstein! Stanford University! EE 267 Virtual Reality! Lecture 13! stanford.edu/class/ee267/!! Updates! lab this Friday will be released as a video! TAs

More information

A Study on Complexity Reduction of Binaural. Decoding in Multi-channel Audio Coding for. Realistic Audio Service

A Study on Complexity Reduction of Binaural. Decoding in Multi-channel Audio Coding for. Realistic Audio Service Contemporary Engineering Sciences, Vol. 9, 2016, no. 1, 11-19 IKARI Ltd, www.m-hiari.com http://dx.doi.org/10.12988/ces.2016.512315 A Study on Complexity Reduction of Binaural Decoding in Multi-channel

More information

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position Applying the Filtered Back-Projection Method to Extract Signal at Specific Position 1 Chia-Ming Chang and Chun-Hao Peng Department of Computer Science and Engineering, Tatung University, Taipei, Taiwan

More information

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings

Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Banu Gunel, Huseyin Hacihabiboglu and Ahmet Kondoz I-Lab Multimedia

More information

Multiple Sound Sources Localization Using Energetic Analysis Method

Multiple Sound Sources Localization Using Energetic Analysis Method VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova

More information

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor Presented by Amir Kiperwas 1 M-element microphone array One desired source One undesired source Ambient noise field Signals: Broadband Mutually

More information

University of Huddersfield Repository

University of Huddersfield Repository University of Huddersfield Repository Moore, David J. and Wakefield, Jonathan P. Surround Sound for Large Audiences: What are the Problems? Original Citation Moore, David J. and Wakefield, Jonathan P.

More information

Clustered Multi-channel Dereverberation for Ad-hoc Microphone Arrays

Clustered Multi-channel Dereverberation for Ad-hoc Microphone Arrays Clustered Multi-channel Dereverberation for Ad-hoc Microphone Arrays Shahab Pasha and Christian Ritz School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Wollongong,

More information

Overview of Code Excited Linear Predictive Coder

Overview of Code Excited Linear Predictive Coder Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances

More information

Real-time Adaptive Concepts in Acoustics

Real-time Adaptive Concepts in Acoustics Real-time Adaptive Concepts in Acoustics Real-time Adaptive Concepts in Acoustics Blind Signal Separation and Multichannel Echo Cancellation by Daniel W.E. Schobben, Ph. D. Philips Research Laboratories

More information

Digital Signal Processing. VO Embedded Systems Engineering Armin Wasicek WS 2009/10

Digital Signal Processing. VO Embedded Systems Engineering Armin Wasicek WS 2009/10 Digital Signal Processing VO Embedded Systems Engineering Armin Wasicek WS 2009/10 Overview Signals and Systems Processing of Signals Display of Signals Digital Signal Processors Common Signal Processing

More information

New acoustical techniques for measuring spatial properties in concert halls

New acoustical techniques for measuring spatial properties in concert halls New acoustical techniques for measuring spatial properties in concert halls LAMBERTO TRONCHIN and VALERIO TARABUSI DIENCA CIARM, University of Bologna, Italy http://www.ciarm.ing.unibo.it Abstract: - The

More information

University of Huddersfield Repository

University of Huddersfield Repository University of Huddersfield Repository Lee, Hyunkook Capturing and Rendering 360º VR Audio Using Cardioid Microphones Original Citation Lee, Hyunkook (2016) Capturing and Rendering 360º VR Audio Using Cardioid

More information

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA. Why Ambisonics Does Work

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA. Why Ambisonics Does Work Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

VLSI Circuit Design for Noise Cancellation in Ear Headphones

VLSI Circuit Design for Noise Cancellation in Ear Headphones VLSI Circuit Design for Noise Cancellation in Ear Headphones Jegadeesh.M 1, Karthi.R 2, Karthik.S 3, Mohan.N 4, R.Poovendran 5 UG Scholar, Department of ECE, Adhiyamaan College of Engineering, Hosur, Tamilnadu,

More information

RIR Estimation for Synthetic Data Acquisition

RIR Estimation for Synthetic Data Acquisition RIR Estimation for Synthetic Data Acquisition Kevin Venalainen, Philippe Moquin, Dinei Florencio Microsoft ABSTRACT - Automatic Speech Recognition (ASR) works best when the speech signal best matches the

More information

Spatialisation accuracy of a Virtual Performance System

Spatialisation accuracy of a Virtual Performance System Spatialisation accuracy of a Virtual Performance System Iain Laird, Dr Paul Chapman, Digital Design Studio, Glasgow School of Art, Glasgow, UK, I.Laird1@gsa.ac.uk, p.chapman@gsa.ac.uk Dr Damian Murphy

More information

I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes

I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes in Electrical Engineering (LNEE), Vol.345, pp.523-528.

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

Sound source localization and its use in multimedia applications

Sound source localization and its use in multimedia applications Notes for lecture/ Zack Settel, McGill University Sound source localization and its use in multimedia applications Introduction With the arrival of real-time binaural or "3D" digital audio processing,

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 VIRTUAL AUDIO REPRODUCED IN A HEADREST PACS: 43.25.Lj M.Jones, S.J.Elliott, T.Takeuchi, J.Beer Institute of Sound and Vibration Research;

More information

Multi-channel Active Control of Axial Cooling Fan Noise

Multi-channel Active Control of Axial Cooling Fan Noise The 2002 International Congress and Exposition on Noise Control Engineering Dearborn, MI, USA. August 19-21, 2002 Multi-channel Active Control of Axial Cooling Fan Noise Kent L. Gee and Scott D. Sommerfeldt

More information

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany

Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany Audio Engineering Society Convention Paper Presented at the th Convention 9 May 7 Munich, Germany The papers at this Convention have been selected on the basis of a submitted abstract and extended precis

More information

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Mikko Parviainen 1 and Tuomas Virtanen 2 Institute of Signal Processing Tampere University

More information

Localization of underwater moving sound source based on time delay estimation using hydrophone array

Localization of underwater moving sound source based on time delay estimation using hydrophone array Journal of Physics: Conference Series PAPER OPEN ACCESS Localization of underwater moving sound source based on time delay estimation using hydrophone array To cite this article: S. A. Rahman et al 2016

More information

Transcoding of Narrowband to Wideband Speech

Transcoding of Narrowband to Wideband Speech University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University

More information

Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh

Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh Room Impulse Response Modeling in the Sub-2kHz Band using 3-D Rectangular Digital Waveguide Mesh Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA Abstract Digital waveguide mesh has emerged

More information

VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION

VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION ARCHIVES OF ACOUSTICS 33, 4, 413 422 (2008) VIRTUAL ACOUSTICS: OPPORTUNITIES AND LIMITS OF SPATIAL SOUND REPRODUCTION Michael VORLÄNDER RWTH Aachen University Institute of Technical Acoustics 52056 Aachen,

More information

Sound pressure level calculation methodology investigation of corona noise in AC substations

Sound pressure level calculation methodology investigation of corona noise in AC substations International Conference on Advanced Electronic Science and Technology (AEST 06) Sound pressure level calculation methodology investigation of corona noise in AC substations,a Xiaowen Wu, Nianguang Zhou,

More information

Sound source localization accuracy of ambisonic microphone in anechoic conditions

Sound source localization accuracy of ambisonic microphone in anechoic conditions Sound source localization accuracy of ambisonic microphone in anechoic conditions Pawel MALECKI 1 ; 1 AGH University of Science and Technology in Krakow, Poland ABSTRACT The paper presents results of determination

More information

capsule quality matter? A comparison study between spherical microphone arrays using different

capsule quality matter? A comparison study between spherical microphone arrays using different Does capsule quality matter? A comparison study between spherical microphone arrays using different types of omnidirectional capsules Simeon Delikaris-Manias, Vincent Koehl, Mathieu Paquier, Rozenn Nicol,

More information

Synthesis Techniques. Juan P Bello

Synthesis Techniques. Juan P Bello Synthesis Techniques Juan P Bello Synthesis It implies the artificial construction of a complex body by combining its elements. Complex body: acoustic signal (sound) Elements: parameters and/or basic signals

More information

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

A Parametric Model for Spectral Sound Synthesis of Musical Sounds A Parametric Model for Spectral Sound Synthesis of Musical Sounds Cornelia Kreutzer University of Limerick ECE Department Limerick, Ireland cornelia.kreutzer@ul.ie Jacqueline Walker University of Limerick

More information

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008 R E S E A R C H R E P O R T I D I A P Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath

More information

Adaptive Filters Application of Linear Prediction

Adaptive Filters Application of Linear Prediction Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing

More information

Spanning the 4 kbps divide using pulse modeled residual

Spanning the 4 kbps divide using pulse modeled residual University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2002 Spanning the 4 kbps divide using pulse modeled residual J Lukasiak

More information

Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding

Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2000 Improved signal analysis and time-synchronous reconstruction in waveform

More information

Implementation and Comparative analysis of Orthogonal Frequency Division Multiplexing (OFDM) Signaling Rashmi Choudhary

Implementation and Comparative analysis of Orthogonal Frequency Division Multiplexing (OFDM) Signaling Rashmi Choudhary Implementation and Comparative analysis of Orthogonal Frequency Division Multiplexing (OFDM) Signaling Rashmi Choudhary M.Tech Scholar, ECE Department,SKIT, Jaipur, Abstract Orthogonal Frequency Division

More information

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface

Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Robotic Spatial Sound Localization and Its 3-D Sound Human Interface Jie Huang, Katsunori Kume, Akira Saji, Masahiro Nishihashi, Teppei Watanabe and William L. Martens The University of Aizu Aizu-Wakamatsu,

More information

Electric Audio Unit Un

Electric Audio Unit Un Electric Audio Unit Un VIRTUALMONIUM The world s first acousmonium emulated in in higher-order ambisonics Natasha Barrett 2017 User Manual The Virtualmonium User manual Natasha Barrett 2017 Electric Audio

More information

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA

Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA Audio Engineering Society Convention Paper Presented at the 129th Convention 21 November 4 7 San Francisco, CA The papers at this Convention have been selected on the basis of a submitted abstract and

More information

Spatial Audio with the SoundScape Renderer

Spatial Audio with the SoundScape Renderer Spatial Audio with the SoundScape Renderer Matthias Geier, Sascha Spors Institut für Nachrichtentechnik, Universität Rostock {Matthias.Geier,Sascha.Spors}@uni-rostock.de Abstract The SoundScape Renderer

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Architectural Acoustics Session 1pAAa: Advanced Analysis of Room Acoustics:

More information

SPARSE CHANNEL ESTIMATION BY PILOT ALLOCATION IN MIMO-OFDM SYSTEMS

SPARSE CHANNEL ESTIMATION BY PILOT ALLOCATION IN MIMO-OFDM SYSTEMS SPARSE CHANNEL ESTIMATION BY PILOT ALLOCATION IN MIMO-OFDM SYSTEMS Puneetha R 1, Dr.S.Akhila 2 1 M. Tech in Digital Communication B M S College Of Engineering Karnataka, India 2 Professor Department of

More information

GETTING MIXED UP WITH WFS, VBAP, HOA, TRM FROM ACRONYMIC CACOPHONY TO A GENERALIZED RENDERING TOOLBOX

GETTING MIXED UP WITH WFS, VBAP, HOA, TRM FROM ACRONYMIC CACOPHONY TO A GENERALIZED RENDERING TOOLBOX GETTING MIXED UP WITH WF, VBAP, HOA, TM FOM ACONYMIC CACOPHONY TO A GENEALIZED ENDEING TOOLBOX Alois ontacchi and obert Höldrich Institute of Electronic Music and Acoustics, University of Music and dramatic

More information

Holographic Measurement of the 3D Sound Field using Near-Field Scanning by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch

Holographic Measurement of the 3D Sound Field using Near-Field Scanning by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch Holographic Measurement of the 3D Sound Field using Near-Field Scanning 2015 by Dave Logan, Wolfgang Klippel, Christian Bellmann, Daniel Knobloch KLIPPEL, WARKWYN: Near field scanning, 1 AGENDA 1. Pros

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Signal Processing in Acoustics Session 2pSP: Acoustic Signal Processing

More information

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Brochure More information from http://www.researchandmarkets.com/reports/569388/ Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Description: Multimedia Signal

More information

Open Access Research of Dielectric Loss Measurement with Sparse Representation

Open Access Research of Dielectric Loss Measurement with Sparse Representation Send Orders for Reprints to reprints@benthamscience.ae 698 The Open Automation and Control Systems Journal, 2, 7, 698-73 Open Access Research of Dielectric Loss Measurement with Sparse Representation Zheng

More information

LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION. Hans Knutsson Carl-Fredrik Westin Gösta Granlund

LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION. Hans Knutsson Carl-Fredrik Westin Gösta Granlund LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION Hans Knutsson Carl-Fredri Westin Gösta Granlund Department of Electrical Engineering, Computer Vision Laboratory Linöping University, S-58 83 Linöping,

More information