The Why and How of With-Height Surround Sound

Similar documents
Sound source localization and its use in multimedia applications

Surround: The Current Technological Situation. David Griesinger Lexicon 3 Oak Park Bedford, MA

Virtual Sound Source Positioning and Mixing in 5.1 Implementation on the Real-Time System Genesis

Auditory Localization

Multichannel Audio Technologies. More on Surround Sound Microphone Techniques:

The Spatial Soundscape. James L. Barbour Swinburne University of Technology, Melbourne, Australia

University of Huddersfield Repository

Introduction. 1.1 Surround sound

Multi-Loudspeaker Reproduction: Surround Sound

A spatial squeezing approach to ambisonic audio compression

URBANA-CHAMPAIGN. CS 498PS Audio Computing Lab. 3D and Virtual Sound. Paris Smaragdis. paris.cs.illinois.

The analysis of multi-channel sound reproduction algorithms using HRTF data

Spatial audio is a field that

Immersive Audio Technology Available to Planetariums. Part I A paper pp presented at: II International Festival of Planetariums

Convention Paper Presented at the 128th Convention 2010 May London, UK

Psychoacoustics of 3D Sound Recording: Research and Practice

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model

SOUND 1 -- ACOUSTICS 1

Perceptual effects of visual images on out-of-head localization of sounds produced by binaural recording and reproduction.

University of Huddersfield Repository

Binaural Hearing. Reading: Yost Ch. 12

Audio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA. Why Ambisonics Does Work

Multichannel Audio In Cars (Tim Nind)

Capturing 360 Audio Using an Equal Segment Microphone Array (ESMA)

THE TEMPORAL and spectral structure of a sound signal

A Comparative Study of the Performance of Spatialization Techniques for a Distributed Audience in a Concert Hall Environment

EBU UER. european broadcasting union. Listening conditions for the assessment of sound programme material. Supplement 1.

Perceived cathedral ceiling height in a multichannel virtual acoustic rendering for Gregorian Chant

Acoustics II: Kurt Heutschi recording technique. stereo recording. microphone positioning. surround sound recordings.

Chapter 6: Room Acoustics and 3D Sound Processing

Sound localization with multi-loudspeakers by usage of a coincident microphone array

RECOMMENDATION ITU-R BR.1384 *, ** Parameters for international exchange of multi-channel sound recordings ***

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4

Convention Paper 7480

Accurate sound reproduction from two loudspeakers in a living room

Multichannel Audio Technologies: Lecture 3.A. Mixing in 5.1 Surround Sound. Setup

3D audio overview : from 2.0 to N.M (?)

Envelopment and Small Room Acoustics

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

EE1.el3 (EEE1023): Electronics III. Acoustics lecture 20 Sound localisation. Dr Philip Jackson.

WHITE PAPER AUROMAX. Next generation Immersive Sound system. Rev.: 2.0, Date: 2015 Nov AURO TECHNOLOGIES NV BARCO NV.

Speech Compression. Application Scenarios

B360 Ambisonics Encoder. User Guide

Sonnet. we think differently!

THE DEVELOPMENT OF A DESIGN TOOL FOR 5-SPEAKER SURROUND SOUND DECODERS

Ambisonic Auralizer Tools VST User Guide

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

MONOPHONIC SOURCE LOCALIZATION FOR A DISTRIBUTED AUDIENCE IN A SMALL CONCERT HALL

Analysis of Frontal Localization in Double Layered Loudspeaker Array System

Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis

Investigation on the Quality of 3D Sound Reproduction

The psychoacoustics of reverberation

New acoustical techniques for measuring spatial properties in concert halls

NEXT-GENERATION AUDIO NEW OPPORTUNITIES FOR TERRESTRIAL UHD BROADCASTING. Fraunhofer IIS

Principles of Musical Acoustics

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Reproduction of Surround Sound in Headphones

DISTANCE CODING AND PERFORMANCE OF THE MARK 5 AND ST350 SOUNDFIELD MICROPHONES AND THEIR SUITABILITY FOR AMBISONIC REPRODUCTION

Wave field synthesis: The future of spatial audio

MUS 302 ENGINEERING SECTION

Convention Paper 7057

Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources

HRTF adaptation and pattern learning

BASEBAND SIGNAL PROCESSING FM BROADCAST SIGNAL ECE 3101

MULTICHANNEL CONTROL OF SPATIAL EXTENT THROUGH SINUSOIDAL PARTIAL MODULATION (SPM)

Synthesised Surround Sound Department of Electronics and Computer Science University of Southampton, Southampton, SO17 2GQ

Ambisonics plug-in suite for production and performance usage

Presented at the 102nd Convention 1997 March Munich,Germany

DECORRELATION TECHNIQUES FOR THE RENDERING OF APPARENT SOUND SOURCE WIDTH IN 3D AUDIO DISPLAYS. Guillaume Potard, Ian Burnett

What is Sound? Part II

CS101 Lecture 18: Audio Encoding. What You ll Learn Today

Convention Paper Presented at the 137th Convention 2014 October 9 12 Los Angeles, USA

Binaural hearing. Prof. Dan Tollin on the Hearing Throne, Oldenburg Hearing Garden

Experimental Tetrahedral Recording: Part Two

Browser Application for Virtual Audio Walkthrough

Measuring impulse responses containing complete spatial information ABSTRACT

Proceedings of Meetings on Acoustics

Perception of room size and the ability of self localization in a virtual environment. Loudspeaker experiment

The future of illustrated sound in programme making

Convention Paper Presented at the 144 th Convention 2018 May 23 26, Milan, Italy

Novel approaches towards more realistic listening environments for experiments in complex acoustic scenes

Psychoacoustic Cues in Room Size Perception

Master MVA Analyse des signaux Audiofréquences Audio Signal Analysis, Indexing and Transformation

DESIGN OF ROOMS FOR MULTICHANNEL AUDIO MONITORING

IMPLEMENTATION AND APPLICATION OF A BINAURAL HEARING MODEL TO THE OBJECTIVE EVALUATION OF SPATIAL IMPRESSION

Development and application of a stereophonic multichannel recording technique for 3D Audio and VR

c 2014 Michael Friedman

Parameters for international exchange of multi-channel sound recordings with or without accompanying picture

The vertical precedence effect: Utilizing delay panning for height channel mixing in 3D audio

Perceptual Band Allocation (PBA) for the Rendering of Vertical Image Spread with a Vertical 2D Loudspeaker Array

Sound is the human ear s perceived effect of pressure changes in the ambient air. Sound can be modeled as a function of time.

Outline. Context. Aim of our projects. Framework

Monitor Setup Guide The right monitors. The correct setup. Proper sound.

Spatial Audio Reproduction: Towards Individualized Binaural Sound

Computational Perception. Sound localization 2

VAMBU SOUND: A MIXED TECHNIQUE 4-D REPRODUCTION SYSTEM WITH A HEIGHTENED FRONTAL LOCALISATION AREA

Improving 5.1 and Stereophonic Mastering/Monitoring by Using Ambiophonic Techniques

The Projection of Sound in Three-Dimensional Space

Click to edit Master title style

Audio Engineering Society. Convention Paper. Presented at the 124th Convention 2008 May Amsterdam, The Netherlands

Transcription:

The Why and How of With-Height Surround Sound Jörn Nettingsmeier <nettings@stackingdwarves.net> freelance audio engineer Essen, Germany 1

Your next 45 minutes on the graveyard shift this lovely Saturday morning: A bit of history How do we perceive elevated sound? Why include height at all? How do different methods (re-)produce height? A closer look at multichannel stereo techniques VBAP Ambisonics 2

A bit of History German Pavillon World Expo 1970: 50 speakers in a sphere, acoustically transparent grid floor. (both images Stockhausen Foundation for Music) 3

A bit of History Mostly discrete routing, a bit of amplitude panning. Lots of fun with acoustics. 4

A bit of History François Bayle's Acousmonium (Radio France): 80 different speakers spread around, for live diffusion of tape music (usually stereophonic). 5

A bit of History François Bayle's Acousmonium (Radio France): 80 different speakers spread around, for live diffusion of tape music (usually stereophonic). The music is being diffused in real-time, usually by the composer, sitting at a mixing desk designed for the purpose. 6

A bit of History François Bayle's Acousmonium (Radio France): 80 different speakers spread around, for live diffusion of tape music (usually stereophonic). The music is being diffused in real-time, usually by the composer, sitting at a mixing desk designed for the purpose. Speakers are chosen for their different characteristics. Localisation is part of the interpretation, but not independent of speaker positions. 7

A bit of History François Bayle's Acousmonium (Radio France): 80 different speakers spread around, for live diffusion of tape music (usually stereophonic). The music is being diffused in real-time, usually by the composer, sitting at a mixing desk designed for the purpose. Speakers are chosen for their different characteristics. Localisation is part of the interpretation, but not independent of speaker positions. A modern-day successor is the BEAST (Birmingham ElectroAcoustic Sound Theatre). 8

A bit of History Systems like these are not aiming at a systematic, portable approach to with-height surround. 9

A bit of History Systems like these are not aiming at a systematic, portable approach to with-height surround. They are part of the artwork, and of the creative process. 10

A bit of History Systems like these are not aiming at a systematic, portable approach to with-height surround. They are part of the artwork, and of the creative process. Their strengths can be exploited in depth. 11

A bit of History Systems like these are not aiming at a systematic, portable approach to with-height surround. They are part of the artwork, and of the creative process. Their strengths can be exploited in depth. Their deficiencies mark important artistic constraints, which are either fought against, or put to use. 12

A bit of History Systems like these are not aiming at a systematic, portable approach to with-height surround. They are part of the artwork, and of the creative process. Their strengths can be exploited in depth. Their deficiencies mark important artistic constraints, which are either fought against, or put to use. In any case, they are integral parts of the artwork, too. 13

A bit of History That is of course a brilliant excuse :-D 14

A bit of History That is of course a brilliant excuse :-D Let's look instead at systems that aim for widespread deployment in a wider potential market 15

A bit of History That is of course a brilliant excuse :-D Let's look instead at systems that aim for widespread deployment in a wider potential market aim to reproduce content by third parties 16

A bit of History That is of course a brilliant excuse :-D Let's look instead at systems that aim for widespread deployment in a wider potential market aim to reproduce content by third parties define clearly how the system should be implemented 17

A bit of History Michael Gerzon 1973, periphonic (i.e. withheight) surround sound using 4 channels: B-format. loudspeaker layout agnostic scalable In 1992, Gerzon proposed this as a candidate format for HDTV. Alas,... 18

A bit of History Tomlinson Holman, 1999: eight speakers on the horizontal plane (with heavy frontal bias), two subs left and right, and two elevated frontal speakers: 10.2 speaker feed mixing ( Twice as good as 5.1 ) 19

A bit of History Werner Dabringhaus, 1999: front left/right, rear left/right, elevated front left/right: 2+2+2 stereo-pairwise mixing using traditional miking techniques Designed to work on DVD-Audio, with the 5 plus 1 channels available. Some tricks to ensure a meaningful (although compromised) image when played back over an ITU 5.1 rig. 20

A bit of History Wilfried van Baelen (Galaxy Studios), 2005: an ITU 5.1 system with elevated speakers above L, R, Ls and Rs: Auro-3D same basic idea, yet more channels The proposal includes some neat encoding tricks to funnel 10 (or more) signals into 5.1 carriers, or into the 8 PCM streams of a Blu-ray disc. 21

A bit of History Kimio Hamasaki et. al, 2005 (NHK): ten horizontal channels, eight elevated channels, one voice of God, three front low channels, two subs: 22.2 Designed as a complement to the proposed Ultra-HDTV standard for total immersion. Again, more channels... 22

And back in the present... It seems there are many variations on the theme. Now let's all go pick an arbitrary pair {N.M} and stick our names on it. 23

My own humble claim to fame is: 24

My own humble claim to fame is: 25

My own humble claim to fame is: 44.4 26

My own humble claim to fame is: 44.4 Eat my dust, Kimio :-D 27

That is of course just a joke. The system was used for IOSONO playback, and higher-order Ambisonics. 28

Learning from History Except for Ambisonics, all proposals share the same paradigms/problems more and more channels without real up- and downwards compatibility frontal bias speaker-feed mixing underspecified signal relationships (correlation etc.) 29

Perception 30

How do we perceive direction? Left/right (horizontal) cues are interaural time difference ITD (at LF) no head shading (perfect diffraction) unambiguous phase (wavelength > 2x ear dist.) 31

How do we perceive direction? Left/right (horizontal) cues are interaural time difference ITD (at LF) no head shading (perfect diffraction) unambiguous phase (wavelength > 2x ear dist.) interaural level difference ILD (at HF) head shading ambiguous phase! 32

How do we perceive height? How about a source that moves up on the median plane (i.e. right in front of us)? constant ITD, no cue constant ILD, no cue -> All we have is a slight change of tone colour, due to ear flaps (pinnae) and head/torso effects. 33

How do we perceive height? If it's just tone colour, how do we perceive height when we don't know the uncoloured sound? 34

How do we perceive height? If it's just tone colour, how do we perceive height when we don't know the uncoloured sound? Short answer: we don't. 35

How do we perceive height? If it's just tone colour, how do we perceive height when we don't know the uncoloured sound? Short answer: we don't. Long answer: we do not. 36

How do we perceive height? If it's just tone colour, how do we perceive height when we don't know the uncoloured sound? Short answer: we don't. Long answer: we do not. But some narrowband signals suggest height regardless of the actual source elevation (Blauert, 1983). 37

How do we perceive height? Humans don't perceive height very well. Signal semantics dominate: Airplane? must be up. Birds, likewise. Footsteps? flowing water? down. And if you see a source, that's where you hear it, usually (multi-modal perception). 38

How do we perceive height? But: We can move our head to direct the more acute horizontal localisation mechanisms at any source. We can explore a sound field at leisure. 39

Then why bother? 40

Why include height? A few common claims, supported by personal experience (not research): 41

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment 42

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment increased robustness against listening room problems 43

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment increased robustness against listening room problems enlarged usable listening area 44

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment increased robustness against listening room problems enlarged usable listening area more natural timbre 45

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment increased robustness against listening room problems enlarged usable listening area more natural timbre height localisation 46

Why include height? A few common claims, supported by personal experience (not research): improved immersion/envelopment increased robustness against listening room problems enlarged usable listening area more natural timbre height localisation Nobody cares! 47

Uses for height localisation better audibility of complex structures due to vertical separation: e.g. organ music more precise reproduction of room acoustics: characteristic ceiling reflections use of location as a precisely audible musical parameter, like pitch and duration discrete sources at height: elevated choirs or solo instruments, opera scenes 48

Height reproduction in Stereo Stereo := using stereophonic techniques level differences in speaker pairs (=artificial ILD) time differences in speaker pairs (=artificial ITD) But: not used on the median plane. Tone colour for any given height is not the sum of upper speaker tone colour plus lower speaker tone colour weighted by relative amplitude. 49

Height reproduction in Stereo Hence: ILD/ITD not much use for height, steep localisation curve. Bottomline: it's either on the bottom speaker, or on the upper speaker. No stable auditory events in between (however, suggesting quick vertical movement is possible). 50

Height reproduction in Stereo Artificially delivered ITD/ILD fall apart when the listener's head is rotated away from the frontal upright orientation. Don't move! 51

Height reproduction in Ambisonics Ambi attempts to get the soundfield correct, to some degree. In a correct soundfield, you can move any way you like and collect useful cues. Once your brain has locked onto a cue, localisation remains stable even if you move. 52

Bottom line: Only Higher-order Ambisonics and VBAP can create meaningful and stable auditory events at continuously variable elevation. 53

Is with-height surround really worth the trouble? 54

Depends. 55

Thanks for your attention. I'm looking forward to your remarks and questions. 56