Proceedings of Meetings on Acoustics

Similar documents
DIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS

SPEECH AND SPECTRAL ANALYSIS

Converting Speaking Voice into Singing Voice

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 14 Timbre / Tone quality II

Barbershop Tuning By Ted Chamberlain for HCNW

INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006

COMP 546, Winter 2017 lecture 20 - sound 2

Lab 10 The Harmonic Series, Scales, Tuning, and Cents

An Experimentally Measured Source Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model

Lecture 5: Pitch and Chord (1) Chord Recognition. Li Su

CS 188: Artificial Intelligence Spring Speech in an Hour

Speech Synthesis using Mel-Cepstral Coefficient Feature

Linear Frequency Modulation (FM) Chirp Signal. Chirp Signal cont. CMPT 468: Lecture 7 Frequency Modulation (FM) Synthesis

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing

Experimental evaluation of inverse filtering using physical systems with known glottal flow and tract characteristics

Digital Signal Processing

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065

ACOUSTICS. Sounds are vibrations in the air, extremely small and fast fluctuations of airpressure.

Source-Filter Theory 1

THE HUMANISATION OF STOCHASTIC PROCESSES FOR THE MODELLING OF F0 DRIFT IN SINGING

INDIANA UNIVERSITY, DEPT. OF PHYSICS P105, Basic Physics of Sound, Spring 2010

Subtractive Synthesis & Formant Synthesis

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping

MUSC 316 Sound & Digital Audio Basics Worksheet

Experienced saxophonists learn to tune their vocal tracts

WaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels. Spectrogram. See Rogers chapter 7 8

Teaching the descriptive physics of string instruments at the undergraduate level

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 13 Timbre / Tone quality I

Acoustics and Fourier Transform Physics Advanced Physics Lab - Summer 2018 Don Heiman, Northeastern University, 1/12/2018

Speech Synthesis; Pitch Detection and Vocoders

Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

Foundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants

Proceedings of Meetings on Acoustics

Acoustic Phonetics. Chapter 8

Acoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13

RAM Analytical Skills Introductory Theory Primer Part 1: Intervals Part 2: Scales and Keys Part 3: Forming Chords Within Keys Part 4: Voice-leading

Vocal effort modification for singing synthesis

Singing Expression Transfer from One Voice to Another for a Given Song

Music 171: Amplitude Modulation

VIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering

Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz

Quarterly Progress and Status Report. Synthesis of selected VCV-syllables in singing

2. When is an overtone harmonic? a. never c. when it is an integer multiple of the fundamental frequency b. always d.

Recording and post-processing speech signals from magnetic resonance imaging experiments

CMPT 468: Frequency Modulation (FM) Synthesis

Quarterly Progress and Status Report. Acoustic properties of the Rothenberg mask

A-110 VCO. 1. Introduction. doepfer System A VCO A-110. Module A-110 (VCO) is a voltage-controlled oscillator.

the 99th Convention 1995 October 6-9 NewYork

AUDITORY ILLUSIONS & LAB REPORT FORM

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

Linguistic Phonetics. Spectral Analysis

Source-filter Analysis of Consonants: Nasals and Laterals

Block diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals.

TE 302 DISCRETE SIGNALS AND SYSTEMS. Chapter 1: INTRODUCTION

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

What is Sound? Part II

From Ladefoged EAP, p. 11

Complex Sounds. Reading: Yost Ch. 4

Spectral analysis of different harmonies Implemented by Equal temperament, Just, and Overtone ratio based tuning system

CHORD RECOGNITION USING INSTRUMENT VOICING CONSTRAINTS

Optimization of an Acoustic Waveguide for Professional Audio Applications

Music Theory I (MUT 1111) Prof. Nancy Rogers

Linguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review)

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics

A system for automatic detection and correction of detuned singing

The source-filter model of speech production"

Determination of instants of significant excitation in speech using Hilbert envelope and group delay function

Dept. of Computer Science, University of Copenhagen Universitetsparken 1, DK-2100 Copenhagen Ø, Denmark

Proceedings of Meetings on Acoustics

Sound Synthesis Methods

Musical Acoustics Lecture 17 Interval, Scales, Tuning and Temperament - II

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

Verse (Bars 5 20) The Contour of the Acoustic Guitar Riff

NCCF ACF. cepstrum coef. error signal > samples

Plaits. Macro-oscillator

Lecture 7: Superposition and Fourier Theorem

Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification. Daryush Mehta

Music and Engineering: Just and Equal Temperament

Dynamic Tuning: Implementation and Control

Simple Plucked and Blown Free Reeds from Southeast Asia

ZynAddSubFX an open-source software synthesizer

Glottal source model selection for stationary singing-voice by low-band envelope matching

Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives

A() I I X=t,~ X=XI, X=O

Physics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Recap the waveform. Complex waves (dạnh sóng phức tạp) and spectra. Recap the waveform

L19: Prosodic modification of speech

SOPA version 2. Revised July SOPA project. September 21, Introduction 2. 2 Basic concept 3. 3 Capturing spatial audio 4

Synthesis Algorithms and Validation

Chaos tool implementation for non-singer and singer voice comparison (preliminary study)

A Look at Un-Electronic Musical Instruments

Lab week 4: Harmonic Synthesis

Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction

ANALYSIS AND EVALUATION OF IRREGULARITY IN PITCH VIBRATO FOR STRING-INSTRUMENT TONES

Transcription:

Proceedings of Meetings on Acoustics Volume, http://acousticalsociety.org/ ICA Montreal Montreal, Canada - June Musical Acoustics Session amu: Aeroacoustics of Wind Instruments and Human Voice II amu. The spectral and acoustical impact of vowel changing in choral tuning Bertrand Delvaux* and David Howard *Corresponding author's address: University of York, NAMUR,, NAMUR, Belgium, bertrand.delvaux@gmail.com Mixed soft/solid models of the Vocal Tract were moulded with a D rapid prototyping technique based on MRI data obtained from two male singers during the phonation of five English vowels as in hard, stern, neap, port and food. The replicas are used to assess the interaction of several vocal tracts in different settings : twice the same singer or two different singers, singing on the same vowel or on different vowels, on a consonant or a dissonant interval. The spectral output is analysed and the acoustical output is submitted to a listening test to evaluate the spectral and acoustical grounds for an interval/chord to perceptually sound "in tune". Published by the Acoustical Society of America through the American Institute of Physics Acoustical Society of America [DOI:./.] Received Jan ; published Jun Proceedings of Meetings on Acoustics, Vol., () Page

INTRODUCTION There have been many studies relating to the singing voice over the last few decades ([], [], [], []) but little research relates to choir singing, particularly ensemble intonation. Ternström and Sundberg [] have studied the intonation of choir, Sacerdote [] have measured frequency fluctuations and vibrato in choirs and Howard [] has investigated pitch drift in a cappella (unaccompanied) quartet singing. In ensemble singing, we believe that the perceived intonation depends on both the vowels sung and the singers combination. We assess the perceived intonation between two experienced male singers (S and S, bass and tenor respectively) using moulded replicas of the Vocal Tract (VT). These are driven by an input sound signal representing the glottal flow. We analyse the sound output through spectrograms and a listening test. MATERIALS AND METHODS Moulding the Vocal Tracts The VT replicas were moulded based on volumetric MRI data collected while two profesionnal male singers were singing English vowels in a supine position [], by a D fast gradient echo sequence. The relaxation time was. ms and the excitation time was. ms. Acquisition is isotropic mm in a x matrix. Output is then interpolated to x using % slice overlap giving an effective anisotropic output of.x.xmm. A stack of images is produced in the midsagittal plane in approximately s. The MRI data were then segmented with the open source code ITK-Snap, to rebuild a D Vocal Tract, whose.stl file was then printed in the Electronics Department of the University of York, on Objet (a D printer made by Objet) using the material called "Vero White Plus FullCure " and a support material called "FullCure support". The VTs are composed of an insert to plug onto a. horn driver and lips of actual shape (see Fig. ). The thickness of the shell of the VT is mm. FIGURE : The VT replicas, with the insert and the actual shaped lips Pure Data Pure Data (Pd) is a graphical programming environment for audio, video, and graphical processing. It is used to generate the glottal waveform injected in the VTs, here a sawtooth signal of the form Proceedings of Meetings on Acoustics, Vol., () Page

sin(πkf t) x(t) = k k= () with f the fundamental frequency, and t the time. This infinite series is truncated and normalized. The code in pd allows to fix f, the vibrato rate and the vibrato depth. We fixed the value f = Hz (C) and chose a vibrato rate and depth of. Hz and +/- % of a semitone respectively, which are common values accepted in the literature. The scale was tuned according to "just intonation" [] (see Tab. ). TABLE : Scale used in the experiment, with the frequency (in Hz) and ratio to the root note (C = Hz) C D D E E F G G A A B B C....... Acoustic measurements The experiment took place in a -sided acoustic anechoic chamber. Both VTs were located cm appart, and a measuring mic MBC was placed cm far from each of them, occupying a vertex of a cm edge equilateral triangle. The recorded output was written on a USB type device with a khz sampling rate on a bits WAV file. The VTs were given an input signal (as described before) at the glottis by a.uk driver unit. RESULTS To assess the perceived intonation between two VTs, we chose four harmonic intervals. An interval between two tones is harmonic when the ratio between them can be expressed by two small integers m : n (m, n=integers where m > n). For this study, we chose:. a minor third, m rd (:). a major third, M rd (:). a perfect fifth, th (:). a minor seven, m th (:). We evaluated the spectral factors of each of these intervals for singer with two different vowels and vowel with two different singers. This gives sets in total:. S (bass) with i-a. S (tenor) with i-a. i with S-S. a with S-S Each set was given the intervals, with settings for each: S - S or S - S. This gave pairs of sound. For each pair, expert listeners were asked "Which sound do you prefer?". They were given the choice among the first setting, the second setting and no preference. Proceedings of Meetings on Acoustics, Vol., () Page

On Fig., we can see the evaluation given by the expert listeners, for the same singer (i.e., S or S) singing the intervals with a or i as a root (in blue and in red respectively). S exhibits a preference for i as the root note for the th and a as the root note for the m th (as illustrated on Fig. a, where coming the right part ( i as the root) is denser in harmonics). S shows a clear preference for i as the root (as can be seen on Fig. b). On Fig., we can see that for i, S was generally preferred as a root note whereas for a, S was preferred as the root. The former case is illustrated on Fig. a, where the spectrogram gets denser around. -. khz when it goes from S ro S as the root. The latter case is illustrated on the Fig. b, with denser harmonics on the part with S as a root. The listening test confirmed what could be observed on the spectrograms. The results can be summarized as follows: For S, i was preferred as a root For i, S was generally preferred as a root For a, S was preferred as the root. The m th is the interval which listeners rated the most unanimously DISCUSSION From the summary of results above, we could observe that S- i and S- a are strong combinations, reinforcing the partials of the other singer/vowel when they are taken as a root note. In other words, our bass singer exhibits a denser spectrum when singing the vowel a whereas our tenor singer exhibits a denser spectrum when singing the vowel i. The vowel i was found to generally be better as the root of a harmonic dyad. This is a preliminary study. Therefore, further refinements and a more extensive study including more vowels/singers are needed to find patterns in the spectral role of vowels and singers for ensemble intonation. REFERENCES [] J. Sundberg, The Science of the Singing Voice (Northern Illinois University Press) (). [] J. Sundberg, Articulatory interpretation of the singing formant, J. Acoust. Soc. Am., (). [] J. Sundberg, The acoustics of the singing voice, Sci. Am. (March ). [] J. Sundberg, Effects of the vibrato and the singing formant on pitch, Musicolog. Slovaca, (). [] S. Ternström and J. Sundberg, Intonation precision of choir singers, J. Acoust. Soc. Am., (July ). [] G. Sacerdote, Researches on the singing voice, J. Acoust. Soc. Am., (). [] D. Howard, Intonation drift in a capella soprano, alto, tenor, bass quartet singing with key modulation, J. Voice, (May ). [] M. Speed, Voice synthesis using the three-dimensional digital waveguide mesh, Ph.D. thesis, University of York (). [] J. Mason, Comparison of solo and ensemble performances with refererences to pythagorean, just, and equi-tempered intonations, J. Res. Mus. Edu., (Spring ). Proceedings of Meetings on Acoustics, Vol., () Page

S i-a i -a S i-a i -a m rd M rd th m th m rd M rd th m th FIGURE : S and S with a (in blue) or i (in red) being the root note of the interval. In yellow, no preference......... (A) S, m th. Root note from /a/ to /i/ (B) S, th. Root note from /a/ to /i/ FIGURE : Spectrograms Proceedings of Meetings on Acoustics, Vol., () Page

i S -S S - S a S -S S - S m rd M rd th m th m rd M rd th m th FIGURE : i and a with S (in blue) or S (in red) being the root note of the interval. In yellow, no preference......... (A) i, m th. Root note from S to S (B) a, th. Root note from S to S FIGURE : Spectrograms Proceedings of Meetings on Acoustics, Vol., () Page

(A) S, root note from /a/ to /i/ (B) S, root note from /a/ to /i/ FIGURE : Ascending scales (A) i, root note from S to S (B) a, root note from S to S FIGURE : Ascending scales Proceedings of Meetings on Acoustics, Vol., () Page