Effect of the number of loudspeakers on sense of presence in 3D audio system based on multiple vertical panning

Similar documents
Proceedings of Meetings on Acoustics

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

Analysis of Frontal Localization in Double Layered Loudspeaker Array System

XLA 3200 Line Array Loudspeakers

Multiple Sound Sources Localization Using Energetic Analysis Method

Multichannel Audio Technologies. More on Surround Sound Microphone Techniques:

A triangulation method for determining the perceptual center of the head for auditory stimuli

Perceptual effects of visual images on out-of-head localization of sounds produced by binaural recording and reproduction.

A Road Traffic Noise Evaluation System Considering A Stereoscopic Sound Field UsingVirtual Reality Technology

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 A MODEL OF THE HEAD-RELATED TRANSFER FUNCTION BASED ON SPECTRAL CUES

Psychoacoustic Cues in Room Size Perception

The Steering for Distance Perception with Reflective Audio Spot

DESIGN OF VOICE ALARM SYSTEMS FOR TRAFFIC TUNNELS: OPTIMISATION OF SPEECH INTELLIGIBILITY

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS

SOUND 1 -- ACOUSTICS 1

Multichannel level alignment, part III: The effects of loudspeaker directivity and reproduction bandwidth

THE TEMPORAL and spectral structure of a sound signal

Paper Body Vibration Effects on Perceived Reality with Multi-modal Contents

Proceedings of Meetings on Acoustics

EBU UER. european broadcasting union. Listening conditions for the assessment of sound programme material. Supplement 1.

Surround: The Current Technological Situation. David Griesinger Lexicon 3 Oak Park Bedford, MA

Capturing 360 Audio Using an Equal Segment Microphone Array (ESMA)

LINE ARRAY Q&A ABOUT LINE ARRAYS. Question: Why Line Arrays?

RECOMMENDATION ITU-R BS User requirements for audio coding systems for digital broadcasting

Auditory Localization

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation

Design of a Line Array Point Source Loudspeaker System

Convention Paper Presented at the 128th Convention 2010 May London, UK

HRTF adaptation and pattern learning

Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig Wolfgang Klippel

WITHIN GENERATOR APPLICATIONS

Discriminating direction of motion trajectories from angular speed and background information

EFFECT OF STIMULUS SPEED ERROR ON MEASURED ROOM ACOUSTIC PARAMETERS

Sound source localization and its use in multimedia applications

O P S I. ( Optimised Phantom Source Imaging of the high frequency content of virtual sources in Wave Field Synthesis )

DESIGN AND APPLICATION OF DDS-CONTROLLED, CARDIOID LOUDSPEAKER ARRAYS

Spatial audio is a field that

RECOMMENDATION ITU-R BR.1384 *, ** Parameters for international exchange of multi-channel sound recordings ***

Measuring impulse responses containing complete spatial information ABSTRACT

University of Huddersfield Repository

Measuring procedures for the environmental parameters: Acoustic comfort

A Virtual Audio Environment for Testing Dummy- Head HRTFs modeling Real Life Situations

6-channel recording/reproduction system for 3-dimensional auralization of sound fields

Series P Supplement 16 (11/88)

Proceedings of Meetings on Acoustics

Pre- and Post Ringing Of Impulse Response

Spatial Audio Reproduction: Towards Individualized Binaural Sound

REDUCING THE NEGATIVE EFFECTS OF EAR-CANAL OCCLUSION. Samuel S. Job

MULTICHANNEL REPRODUCTION OF LOW FREQUENCIES. Toni Hirvonen, Miikka Tikander, and Ville Pulkki

Vertical Localization Performance in a Practical 3-D WFS Formulation

Proceedings of Meetings on Acoustics

Rapid Formation of Robust Auditory Memories: Insights from Noise

APPLICATIONS OF A DIGITAL AUDIO-SIGNAL PROCESSOR IN T.V. SETS

Principles of Musical Acoustics

An evaluation on comfortable sound design of unpleasant sounds based on chord-forming with bandlimited sound

Tone-in-noise detection: Observed discrepancies in spectral integration. Nicolas Le Goff a) Technische Universiteit Eindhoven, P.O.

Development of multichannel single-unit microphone using shotgun microphone array

The effect of 3D audio and other audio techniques on virtual reality experience

Binaural auralization based on spherical-harmonics beamforming

Evaluation of a new stereophonic reproduction method with moving sweet spot using a binaural localization model

SPATIAL SOUND REPRODUCTION WITH WAVE FIELD SYNTHESIS

How To... Commission an Installed Sound Environment

LBC 3201/00 Line Array Indoor Loudspeaker

INTERNATIONAL TELECOMMUNICATION UNION

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array

Introduction to Audio Watermarking Schemes

Upper hemisphere sound localization using head-related transfer functions in the median plane and interaural differences

SA1521 SPEAKER SA1521 SA1521. Features. Applications Live Applications Playback Applications Portable PA Systems Corporate Events

A spatial squeezing approach to ambisonic audio compression

What you Need: Exel Acoustic Set with XL2 Analyzer M4260 Measurement Microphone Minirator MR-PRO

Installation Guide & User Manual Sound Plus Infrared System, Model WIR 950

SOUND FIELD MEASUREMENTS INSIDE A REVERBERANT ROOM BY MEANS OF A NEW 3D METHOD AND COMPARISON WITH FEM MODEL

DETERMINATION OF EQUAL-LOUDNESS RELATIONS AT HIGH FREQUENCIES

Airborne Sound Insulation

Sound localization with multi-loudspeakers by usage of a coincident microphone array

Virtual Sound Source Positioning and Mixing in 5.1 Implementation on the Real-Time System Genesis

UBL S119 LOUDSPEAKER SYSTEM

What applications is a cardioid subwoofer configuration appropriate for?

A Tactile Display using Ultrasound Linear Phased Array

Binaural Hearing. Reading: Yost Ch. 12

Source Localisation Mapping using Weighted Interaural Cross-Correlation

Spatial Judgments from Different Vantage Points: A Different Perspective

CONTENTS. Preface...vii. Acknowledgments...ix. Chapter 1: Behavior of Sound...1. Chapter 2: The Ear and Hearing...11

Analysis of retinal images for retinal projection type super multiview 3D head-mounted display

Assessing the contribution of binaural cues for apparent source width perception via a functional model

Proceedings of Meetings on Acoustics

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals

VLS 15. Technical Data Sheet. Product description

PERCEIVED ROOM SIZE AND SOURCE DISTANCE IN FIVE SIMULATED CONCERT AUDITORIA

Feasibility of Vocal Emotion Conversion on Modulation Spectrogram for Simulated Cochlear Implants

Low frequency sound reproduction in irregular rooms using CABS (Control Acoustic Bass System) Celestinos, Adrian; Nielsen, Sofus Birkedal

The Shape-Weight Illusion

Integral three-dimensional display with high image quality using multiple flat-panel displays

Optimization of loudspeaker and microphone configurations for sound reproduction system based on boundary surface control principle

GROUPING BASED ON PHENOMENAL PROXIMITY

Monitor Setup Guide The right monitors. The correct setup. Proper sound.

RoomMatch RM9060 TECHNICAL DATA SHEET. array module loudspeaker. Key Features. Product Overview. Technical Specifications

How Accurate is Your Directivity Data?

Proceedings of Meetings on Acoustics

ZX1i-90. Electro-Voice ZX1i-90. Integrated QuickSAM Heavy-Duty Strong-Arm Mounting Bracket Included

Transcription:

Effect of the number of loudspeakers on sense of presence in 3D audio system based on multiple vertical panning Toshiyuki Kimura and Hiroshi Ando Universal Communication Research Institute, National Institute of Information and Communications Technology, Kyoto, Japan. Summary NICT has developed a multiview 3D video display system with a 2-inch screen. In this system, several viewers can simultaneously view natural 3D objects from their respective viewing positions without special glasses. We have proposed a 3D audio system using multiple vertical panning, which is compatible with a large-screen multiview 3D video display system, that has following four features: ) Multiple viewers can simultaneously feel the sound images at the position of the 3D objects depicted by the 3D video display system regardless of the viewing position; 2) Viewers do not need to wear hearing devices; 3) There are no sound devices between the projector array and the viewing position; 4) No recording microphones need to be placed between the projector array and the viewing position. However, because the loudspeakers are densely placed at the horizontal direction of the screen in this proposed system, their number must be reduced for practical use. In this paper, we performed an audio-visual psychological experiment to develop a practical system. 3D videos were presented to viewers wearing glasses and six sound stimuli were randomly presented by 4, 6,, 2, 22, and 42 loudspeakers. Nine viewers compared the sound stimuli according to two evaluation criteria: the position and the movement of sound images. Our results show that viewers failed to discriminate the difference of the location and the movement of sound images with ten or more loudspeakers. We conclude that a practical system can be constructed with our proposed system with ten loudspeakers. PACS no. 43.6.Dh, 43.66.Qp. Introduction Ultra-realistic communication techniques have been investigated at NICT []. Their applications will enable more realistic forms of communication (e.g., 3D television and 3D teleconferencing) than those currently offered by conventional video and audio techniques (4K television and 5.-channel audio). At NICT, a glasses-free 3D video technique using a projector array has been proposed, and a multiview 3D video display system with a 2-inch screen has been developed [2]. Its basic configuration is shown in Figure. Parallax videos are projected to a Fresnel lens by projector units, which are components of the projector array. These parallax videos are only projected in the horizontal direction because of the diffusion characteristics of a diffuser screen placed in front of the Fresnel lens (a small diffusion angle in the horizontal direction and a wide diffusion angle in the (c) European Acoustics Association vertical direction). Since this system allows viewers to view parallax videos from a horizontal position, several viewers can simultaneously view natural 3D objects based on each particular viewing position without special glasses. However, the developed system only presents visual sensations. To achieve realistic auditory sensations, a 3D audio system must be developed that corresponds to a large-screen multiview 3D video display system. We have proposed a novel 3D audio system using the multiple vertical panning (MVP) method to develop a 3D audio system that matches our developed 3D video display system. The results of our audiovisual psychological experiment indicated that our proposed system was effective with such conventional audio systems as stereophonic audio [4]. However, because the loudspeakers are densely placed horizontally to its screen, their number is enormous. Thus, we must reduce them before our proposed system can be put into practical use. In this paper, we evaluated the effect of the number of loudspeakers on the sense of presence by performing an audio-visual psychologi-

Projector unit Plane for image display Reconstructed 3-D image Projector array Diffuser screen Observer Fresnel lens In the proposed system, viewers do not need to wear hearing devices. Because the loudspeakers are placed at the upper and lower sides of the screen, there are no sound devices between the projector array and the viewing position. Since this system only has to directly record the speech of the participants in teleconferences and does not restrict the position of the recording microphones, it does not need to place recording microphones between the projector array and the viewing position. 3. Audio-visual experiment Figure. Basic configuration of a large-screen multiview 3D video display system [3]. Sound Image Sound Image Sound Image 2 Figure 2. Basic configuration of multiple vertical panning (MVP) method [4]. cal experiment and developed a practical audio-visual system using our proposed system. 2. Diagram of proposed system [4] The basic configuration of our proposed system is shown in Figure 2. First, as shown on its left-hand side, two loudspeakers are placed at the upper and lower sides of the 3D object s position depicted in the screen by the developed 3D video display system. If a sound is played from two loudspeakers using the panning between them (vertical panning), viewers are expected to perceive a sound image between the two loudspeakers. If their sound pressure level difference is properly adjusted, because the sound-playing devices are only two loudspeakers placed at the upper and lower sides of the screen (vertically panned loudspeakers), we expect that multiple viewers can perceive a sound image at the position of the 3D object, regardless of the viewing position. Second, as shown on the right-hand side of Figure 2, the sound image positions are also expanded in the left-right direction by placing multiple vertically panned loudspeakers at the upper and lower sides of the screen. As a result, multiple viewers can simultaneously feel the sound images at the position of the 3D objects depicted by the 3D video display system, regardless of the viewing position. 3.. Environment and conditions Our experiment was performed in a conference room where a 2-inch rear-projection visual screen was set up. Two projectors for the 2D video of the left and right eyes were placed behind the screen. Because polarization plates are placed in front of the projectors, viewers can see 3D video by wearing polarization glasses. The room s reverberation time was 42 ms, and the background noise level had an A-weighted level of 38 db. We placed 42 loudspeakers in the room (Figure 3). They were placed in the forward position.275 m from the screen because they could not be placed over and under the screen, which was attached to the wall. The loudspeakers were made by mounting a loudspeaker unit (Fostex: FE3En) on a loudspeaker enclosure (width: cm, depth: 25 cm, height: cm). Considering the proper viewing distance in the developed large-screen multiview 3D video display system (5.5 m), three viewing positions (forward, central, and backward) were set at 3.5, 5.5, and 7.5 meters from the screen. The viewing width of the developed system is across, centered around the front viewing position of the screen when the viewing distance is 5.5 m. Thus, an additional viewing position (lateral) was set at a lateral position to the left of the central position. The height of all the viewing positions was set to.5 m at the ear position of the viewers. The sound pressure level was set to an A-weighted level of approximately 7 db in the central viewing position. The 3D video used in this experiment is shown in Figure 4. In it, the UFO (inside the yellow oval in Figure 4) that plays a sound is moving about the screen every five seconds. When it touches the stars and balls (inside the red circles in Figure 4), the sound of the stars and balls is played at their positions. The proper viewing distance and the parallax of the 3D video are 5.5 m and.625 m, respectively. Because the 3D viewing videos change based on the viewing positions in the developed 3D video display system, we also changed the presented 3D videos in this experiment based on the viewing positions. The sound conditions are shown in Figure 5. The gray loudspeakers denote the loudspeaker from which

Plane View 2.6 m (a) 4 Loudspeakers (b) 6 Loudspeakers.72 5.5 m 7. m 4.428 m 4.5 m Forward Central Backward 6.25 m (c) Loudspeakers (d) 2 Loudspeakers.275 m Lateral 4 m (e) 22 Loudspeakers (f) 42 Loudspeakers Cross-sectional View.275 m.39 m 2.6 m.29 m 2.49 m 5.5 m 7. m 4.4 m m.6 m Front View.39 m.5 m m 4.5 m.29 m Figure 5. Sound conditions used in audio-visual experiment. 4.4 m 2.49 m 4 m 4.428 m.5 m.6 m.72 m Figure 3. of viewers, screen, and loudspeaker array in audio-visual experiment. height of the sound images is the same as that of the ear position of the viewers if P V is -.3455. First, based on the horizontal position of the presented 3D object, P H, two loudspeakers placed at the upper and lower sides of the screen are selected: ( ) P H PH + 2.2 = d round 2.2, () d Figure 4. 3D video used in audio-visual experiment. a sound is not replayed in each condition. The sounds played at 3D object position (P H, P V ) at time T = m were synthesized by the following procedure. Note that (=3 fps) and m(=,...) denote the frame rate and the frame index of the video signals. P H (= 2.24 2.24) and P V (=.2455.2455) denote the horizontal and vertical positions of the presented 3D object. If P H is, the horizontal position corresponds to the screen s horizontal central position. The where P H (= 2.2,..., 2.2) denotes the horizontal position of the two selected loudspeakers. d denotes their right-and-left intervals. In this experiment, the d values are 4.4, 2.2,.,.88,.44 and.2 in an order corresponding to the sound conditions shown in Figure 5. Second, the sound calculated from the sound source signal, s(n), is replayed from two selected loudspeakers: x U (n) = a U w(n)s(n), (2) x D (n) = a D w(n)s(n), (3) (n = F s (m ),..., F s m + LF s ) where F s (=48 khz) and n(=,...) denote the sampling frequency and the sample time of the sound signals and x U (n) and x D (n) denote the sound signals replayed from the two loudspeakers of the upper and lower sides. w(n) denotes the window function of the

sound signals defined as follows: LF s {n Fs (m )} (n = Fs (m ),..., Fs (m ) + LF s ) w(n) = (n = Fs (m ) + LF s,..., Fs m), LF s (n Fs m) + (n = Fs m,..., Fs m + LF s ) (4) where L(= ms) denotes the crossfade time of the window function. a U and a D (the gain coefficients in each sound signal) are calculated from the level difference, A [db], as follows: a U = a D = A 2 A, (5) +. (6) A + In this experiment, level difference A was based on a previous study [4] as follows: A = αp V +.437. (7).65 The vertical interval of the loudspeakers is 2.7 m in this experiment, but it was 2.5 m in the previous study [4]. Thus, α(= 2.7 2.5 ) was set to compensate the differences of the vertical intervals of the loudspeakers. 3.2. Design and procedure Nine subjects (ages: 29 39, five males and four females) with normal audibility participated as viewers in this experiment. Scheffé s paired comparison [5] was applied as an evaluation method. This experiment s flowchart is shown in Figure 6. First, we set two evaluation criteria: the degree of the coincidence of the sound location and the sound movement. The sound location s degree of coincidence denotes whether viewers feel that the sound of the stars and balls (Figure 4) is always played at the position of the videos. The degree of the coincidence of the sound movement denotes whether viewers feel that the UFO s sound (Figure 4) is always moving in concert with the video. We divided our experiment into eight sessions for evaluation criteria and viewing positions and randomized their presented orders for all viewers. Six practice trials and thirty main trials were performed in each session. The six practice trials were permutations of the three sound conditions shown in Figure 5 ((a), (b), and (f)). The permutations of the six sound conditions shown in Figure 5 resulted in thirty main trials. The presentation orders of the trials were randomized for each viewer. The viewers graded the degree of the coincidence of stimulus B in reference to stimulus A using the 7- step scale shown in Table I. The viewers were allowed to freely move their heads and upper bodies while listening to the sounds. Test Evaluation Evaluation 2 Order...Randomized (Sound Location or Movement) Evaluation Session Session 2 Session 3 Session 4 Order...Randomized (4 Viewing s) Session Practice (6 trials) Trial Sign (. s) Break (.9 s) Main (3 trials) Stimulus ABreak Stimulus B (5 s) (2 s) (5 s) Figure 6. Flowchart of audio-visual experiment. Table I. Scale of Scheffé s paired comparison. Grade Judgment 3 Very good 2 Fairly good Little good The same - Little bad -2 Fairly bad -3 Very bad 3.3. Results and discussion Answer (4 s) An analysis of variance (ANOVA) of this experiment s result was performed based on Scheffé s paired comparison of eight sessions: evaluation criterion (2) viewing position (4). We found a significant main effect of the sound conditions at a.% level except for one session (sound location, backward viewing position). Thus, since there are significant differences among the number of loudspeakers, we evaluated their effect based on the average grades calculated in each session. In each evaluation criterion and viewing position, the average grades of all the sound conditions are shown in Figures 7-8. The error bars denote 95% confidence intervals based on a yardstick. In the sound condition (a), the average grades are significantly lower than other sound conditions except the lateral viewing position, since the position of the sound images is biased to the right-and-left sides of the screen and viewers can clearly perceive the position difference between the 3D object and the sound image. Viewers correctly discriminated the differences of the sense of presence. We evaluated the effect of the number of loudspeakers on the sense of presence on the basis of a sound condition in which the average grade is highest in all the sound conditions (basic sound condition). With four or six loudspeakers, sometimes the average grades were significantly lower (5% level) than the basic sound condition. On the other hand, we found no

- - - - Central 4 6 2 22 42 Backward 4 6 2 22 42 - - - - Forward 4 6 2 22 42 Lateral 4 6 2 22 42 Figure 7. Results of audio-visual experiment: sound location. the sense of presence. We found that viewers could not discriminate the differences of the sense of presence with ten or more loudspeakers. To construct a practical recording method for our proposed system, future work must perform an additional audio-visual psychological experiment and evaluate the effect of the number of discretized vertical positions of sound sources on the sense of presence. Acknowledgement The authors thank Dr. S. Iwasawa for constructing the environment of our audio-visual experiment and Dr. M. Makino for depicting the 3D videos in it. The audio-visual experiment in this paper was performed with the approval of the ethics committee of the National Institute of Information and Communications Technology (NICT), Japan. References - - - - Central 4 6 2 22 42 Backward 4 6 2 22 42 - - - - Forward 4 6 2 22 42 Lateral 4 6 2 22 42 [] K. Enami : Research on Ultra-realistic Communications. ECTI Transactions on Electrical Engineering, Electronics, and Communications 6 (28) 22-25. [2] S. Iwasawa and M. Kawakita: Quantifying Capabilities of the Prototype 2-inch Automultiscopic Display. Proc. International Conference on 3D Systems and Applications (22) 5-9. [3] M. Kawakita, S. Iwasawa, G. Sabri and N. Inoue: Development of Glasses-free 3D Video System. Proc. International Universal Communication Symposium, (2) 323-327. [4] T. Kimura and H.Ando: 3D Audio System Using Multiple Vertical Panning for Large-screen Multiview 3D Video Display. ITE Transactions on Media Technology and Applications 2 (24) 33-45. [5] H. Scheffé: An analysis of variance for paired comparisons. Journal of the American Statistical Association 47 (952) 38-4. Figure 8. Results of audio-visual experiment: sound movement. significant differences among the basic sound conditions in all the sessions with, 2, 22, or 42 loudspeakers. When the number of loudspeakers equals or exceeds ten, viewers cannot discriminate the differences of the location and the movement of sound images even with more loudspeakers. Thus, it is considered that the number of loudspeakers can be reduced to ten when an audio-visual system is based on our proposed system. 4. Conclusion In this paper, to reduce the required number of loudspeakers in our previously proposed system, we performed an audio-visual psychological experiment and evaluated the effect of the number of loudspeakers on