Quarterly Progress and Status Report. Acoustic properties of the Rothenberg mask
|
|
- Colin Alexander
- 5 years ago
- Views:
Transcription
1 Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Acoustic properties of the Rothenberg mask Hertegård, S. and Gauffin, J. journal: STL-QPSR volume: 33 number: 2-3 year: 1992 pages:
2
3 STL-QPSR 2-3 / 1992 ACOUSTIC PROPERTIES OF THE ROTHENBERG MASK Stellan Hertegdrd* G. Jan Gauffin Abstract The flow response and possible distortion from the Rothenberg mask system on radiated speech were studied by means of sweep-tone measurements. The flow frequency response was flat within 3 db up to 1.6 khz. This frequency range for the speech signal radiated through the mask normally includes the lowest two formants for open vowels. and is probably sufjicient to describe most aspects of the glottalflow waveform for untrained normal and pathological voices. A pronounced zero between 1.8 and 2 khz was found. This restricts the use of the mask systems tested here for air flow measurements at higher frequencies. It was shown that this zero could be moved up in frequency, increasing the useful frequency response to nearly 3 khz. We suggest that the zero is caused by acoustic shunting of the nasal part of the mask. A modified mask design by placing wire screen also in its nasal part might substantially improve the frequency response. INTRODUCTION Mean airflow at the lips is often measured in order to assess vocal function. If the subglottal pressure during phonation also is measured, the glottal resistance during phonation can be calculated as the quotient between pressure and flow (Isshiki, 1964; Schutte, 1980). This gives an estimate of the mean "closing" ability of the glottis. Pneumotachograph masks are frequently used for measuring mean airflow during phonation of a vowel. The disadvantage of calculating glottal resistance from mean airflow is that it is impossible to study each individual glottal cycle and measure the absolute airflow during the closed phase. Fig. 1 illustrates that a certain mean airflow can be produced by means of a high modulated transglottal airflow and zero airflow during the closed phase, indicating complete glottal closure. The same mean airflow could also result from a smaller modulated airflow and a constant airflow (waveform offset) even during the presumed closed phase, indicating insufficient glottal closure**. These two alternatives would not result in the same voice quality, but could not be separated from measurements of the mean airflow only. The circumferentially vented wire screen mask designed by Rothenberg (Fig. 2) has some special properties that differ from many other pneumotachograph masks that are used for the measurement of mean airflow at the lips (Rothenberg, 1973, 1977). In the Rothenberg mask a number of holes have been drilled around the circumference close to the mouth. These are covered with a fine meshed single or double layered steel wire screen that produce an acoustic resistance. A wide frequency differential pressure transducer is used as a microphone. It has one input on the inside of the wire screen and another input on the outside, which provides a measure of the pressure drop across the screen, thus also an estimation of flow. The purpose of the design is to reduce the effect of the mask on the acoustic resonances of the vo- *~e~t. of Logopedics and Phoniatrics, Huddinge University Hospital, Karolinska Institute, Stockholm. ** A small amount of waveform offset during the closed phase could be caused by vertical movements of the vocal folds as previously discussed in Hertegbd, Gauffin, & Karlsson, 1992.
4 cal tract and also to increase the frequency response of the mask system. If the airflow at the lips is properly inverse filtered, each glottal pulse during phonation can be studied in more detail and the absolute transglottal airflow during the open and closed phases can be measured after calibration. The response time and acoustic properties of the mask system have been studied by Rothenberg (1973; 1977). Ttansglottal airflow closed phase Time Fig. I. Mean airflow and transglottal airflow for two types of phonations: A with a high modulated flow amplitude and zero airflow during the closed phase, B with a small amount of modulated airflozu and a constant waveform offset. hfe~allic wire screen cove;ing the holes F compressible Results from studies of the upper frequency limit of the flow response of the mask system and assessment of the possible distortion of the radiated speech sound from the mask were presented in a previous paper (Badin, Hertegsrd, & Karlsson, 1990). In the present study complementary experiments are described and their implications for clinical use of the Rothenberg mask system are discussed. Fig. 2. Schematic drawing of the Rothenberg mask. METHODS The flow response of the mask system was tested by sweep-tone analysis. Fig. 3 shows the experimental set-up. The mask was connected via a ceramic adapter provided by Glottal Enterprise, Syracuse, USA and placed on a wooden plate with a central 5 cm hole'on top of a small loudspeaker driven by a Hewlett Packard Dynamic Signal Analyser, type 3562A. The Dynamic Signal Analyser produced a constant amplitude frequency sweep. The loudspeaker and adapter simulated a short vocal tract with its first formant at approximately 3 khz. The output of the mask
5 STL-QPSR 2-3/ 1992 electronics was recorded by the Dynamic Signal Analyser. The microphone in the mask system tested was a differential pressure transducer (type MTW). The mask electronics in the experiment was type MSIF-2, which has a built in 3 khz low-pass filter. The masks tested were a larger double layer mask (type MA-2N) with a resistance of approximately 0.5 cm H20/l/s, a smaller double layer mask (type MA-2s)) and a single layer wire screen (type MA-IN) mask with a resistance of about 0.25 cm H20/l/s. All these items were manufactured by Glottal Enterprises, Syracuse, USA, the maker of the Rothenberg mask systems. Rothenberg mask on ceramic adaptor 7 mask preamp1 ifier 3- wooden plate - - Sweep inpu ' Dyn alyze Loudspeaker Fig. 3. The experimental set-up for the sweq-tone measurements. The frequency sweep provided by the loudspeaker was first tested with a free field sweep. This was measured with a Briiel & Kjaer (BK) 1/2" microphone, with a flat frequency response high above 5 khz, and the Dynamic Signal Analyser. Fig. 4 shows that the sweep provided by the loudspeaker was linear within 3 db from 200 Hz to approximately 3 khz. Thus, it can be concluded that the loudspeaker provided a sinusoidal volume flow with an amplitude that decreased with a constant slope of 6 db/octave above 200 Hz to close to 3 khz. RESULTS Sweep-tone analysis was carried out for each of the three masks as described above. In Fig. 5, the result of the sweep for the larger double layer mask, type MA- 2N (with the MTW-transducer and the MSIF-2 preamplifier) is shown. A -6 db/octave line was drawn in the figure. The response of this mask system was linear within 3 db to 1.6 khz. At around 1.9 khz there was a pronounced dip of about 15 db from the -6 db line. Slightly below 3 khz there was a peak. The level fell for frequencies above 3 khz, probably due to the low-pass filter of the mask electronics. There was also a small dip around 400 to 500 Hz. Vibrations could be felt in the measuring equipment during the sweep around these frequencies, indicating that this dip was caused by a mechanical resonance in the measuring equipment itself. This small dip was also observed during the sweep tone measurements for the other masks tested. In Fig. 6 are shown the results of the sweep tone measurements for all three masks described above. All three mask systems had a linear frequency response within 3 db up to khz. They all had a zero around 2 khz and a peak near 3 khz.
6 STL-QPSR 2-3/1992 Fig. 4. A free field sweep registered with a BriieleKjaer 1/2" microphone held over the loudspeaker with no mask present. The ripple in the curve is due to noise in the experimental room I I I I I l l 1 I I I I ' 200 Log Hz 2 5k Fig. 5. The frequency response from a sweq with the equipment as described in Fig 3. A larger double layer mask (type MA-2N) was used. A -6 db /octave line was superimposed for comparison.
7
8 STL-QPSR 2-3/ 1992 Fig. 7 shows a sweep with the MTW transducer without a mask in free field approximately 2 cm above the ceramic adapter and the loudspeaker and another sweep with the transducer held 2 cm above the larger double layer MA-2N mask (which was held firmly to the ceramic adapter). The sound level was generally lower with the mask present. The free field recording without the mask did not show any dip near 2 khz, whereas a dip was present near 2 khz with the mask. There was a peak near 3 khz in both recordings, probably caused by the first "formant" in the acoustic system, as mentioned above. In Fig. 8 similar sweeps were made with a 1/2" BK microphone in free field above the loudspeaker and adapter without the mask and with the mask held firmly to the adapter. A dip was present near 2 khz with, but not without, the mask I I I I I I I I J 200 Log Hz 2 5k Fig. 8. The frequency response from a free field sweep registered with a 112 " BriielaKjaer microphone held above the MA-2N mask (solid line) and without the mask (dashed line). Further analysis of the zero near 2 khz In order to investigate if the zero near 2 khz was caused by a mechanical resonance in the mask or microphone, we excited the mask (type MA-2N) and microphone with a mechanical pulse and measured the mechanical vibrations at different points. The pulse was produced by a small hammer fitted with an accelerator (PCB Impulse force hammer type 086M37) connected to the HP Dynamic Analyzer. This revealed a mechanical resonance in the connector wire hold of the microphone at approximately 1.9 khz. This resonance was effectively dampened by a piece of plasticine. By repeating the sweep-tone measurements we could conclude that this mechanical resonance only had a marginal effect on the zero. Since there was no mechanical resonance causing the dip, we suspected a cross resonance or a shunt in the mask itself. By filling out the nasal part of the mask with
9 STL-QPSR 2-3/ 1992 plasticine we could move the zero at 2 khz upwards in frequency to over 3 khz (Fig. 9) Fig. 9. The frequency response from sweep tone measurements using the equipment as described in Fig. 3, including the MA-2N mask (solid line) and with plasticine dampening in the nasal part of the mask (dotted line). DISCUSSION The acoustic properties of the mask system have been previously studied by Rothenberg (1973; 1977). He reported the mask to be linear for a static airflow from zero up to well above 1 l/s, which is satisfactory for most clinical conditions. He also found that for open vowels, such as /a/ and /a/, the lowest two formants (which are most important for the shape of the waveform during inverse filtering) were lowered by Hz due to the increase in effective vocal tract length from the mask. The transmission characteristics of the mask were found to be linear within 6 db for speech range frequencies, except for a pronounced dip at khz. This dip probably corresponds to the dip found around 1.7 khz in our previous study (Badin, et al., 1990) and between 1.8 and 2 khz in the present report. Badin, et al. (1990) studied the effect of the mask on radiated speech sound pressure by means of LTAS analysis of speech samples with and without the mask. This analysis showed a reduced sound pressure level around 2 khz in recordings with the mask than without. This indicates that the zero near 2 khz exists both in the speech measurements and in the sweep-tone measurements with the mask. All free field measurements without the mask failed to show any zero. This indicates that the zero was caused by a cross resonance or a shunt in the mask itself. In our previous study it was shown that the frequency of the zero varied somewhat for different dampening foam settings in the mask microphone and the mask (Badin, et
10 STL-QPSR 2-3/ 1992 al., 1990), but the dip could not be eliminated. Pressing the mask with varying force to the adapter or face did not significantly affect the dip. As described in this paper, a marginal dampening of the zero resulted from dampening a small cavity in the transparent plastic microphone adapter. From the present experiments it is also apparent that the measurement equipment itself was not responsible for the dip. We have also made some tests of a prototype to a new pressure transducer provided by Glottal Enterprises (type PTW). Those sweep-tone measurements failed to give a better frequency response and the zero mentioned above was also present with that transducer. We conclude that the zero near 2 khz, which limits the response of the mask system tested here, is due to acoustic shunting in the mask. By filling out the nasal part of the mask the dip was moved up in frequency, resulting in a substantially improved frequency response. Implications for clinical use of the mask system tested In our previous study (Badin, et al., 1990) we concluded that the frequency response of the mask system tested was essentially flat up to 1 khz. However, after additional testing of different masks it seems that the response of the mask systems tested here was flat within 3 db to 1.6 khz. This includes harmonics for a male speaker with a normal fundamental frequency around 120 Hz. For females it would include 7 harmonics in the normal speaking range. The frequency range also includes the first and second formants for open vowels, such as /a/, used in speech samples by most researchers who published results from studies with the Rothenberg mask (Gauffin & Sundberg, 1989; HertegArd & Gauffin, 1991; HertegArd, Gauffin, & Karlsson, 1992; Holmberg, Hillman, & Perkell, 1988; Karlsson, 1992; Lofquist, 1992; Rothenberg, 1973; 1977). This frequency range is probably sufficient to describe the most important aspects of the glottal waveform. Normally closed vowels (such as /i/) and nasalized vowels are avoided in speech samples because of difficulties in performing a proper inverse filtering, regardless of whether a mask recording or an ordinary pressure microphone recording is used. The level of the higher harmonics are usually lower for pathological voices which often have insufficient vocal fold closure. The transglottal waveform does not often seem to be affected by higher harmonics in these cases. On the other hand, for trained voices, such as for a singer with a prominent so-called The shunting ca\ity singer's formant, harmonics near 3 khz may 4 influence the shape of the glottal velocity waveform. In these cases results from studies using the mask systems tested here must be evaluated with caution if details of the waveform are described. Glottal waveform parameters used in voice synthesis, such as in the so-called Liljencrants-Fant (LF) glottal model, are dependent on a correct frequency response for higher frequencies (Fant, Liljencrants, & Lin, 1985). Fig. 10. The Rothenberg mask. The arrow is pointing to the shunting cavity.
11 STL-QPSR 2-3/ If data are collected for such a model, an ordinary pressure microphone recording or special mask systems (Rothenberg, 1987) are probably preferable. The improved frequency response with the nasal part of the mask filled out by plasticine indicate that some modifications in mask design might substantially improve the response. The placement of holes with wire screen also in the nasal part of the mask might have the same effect (Fig. 10). CONCLUSION The present experiments indicate that the tested mask systems seem to be linear within 3 db from zero airflow to around 1.6 khz. For a male voice with a fundamental frequency of 120 Hz during phonation this includes harmonics and for an open vowel like /a/ this also includes both the first and seconds formants, which are the most important for the waveform. The same will be true for female voices with fundamental frequencies around 200 Hz. For pathological voices the level of the higher harmonics are often lowered due to insufficient vocal fold closure. This means that the frequency response of the mask seems sufficient for measurements on most patients with voice problems. However, if details of the waveform are studied from mask recordings made on subjects with trained voices and with more prominent higher harmonics, the results must be evaluated with caution. A zero in the frequency response near 2 khz seems to be caused mainly by the shunting of the nasal part of the mask. This dip could be moved up in frequency by filling out the nasal part of the mask, resulting in an increase in the response to around 3 khz. A modification in the mask design with wire screen placement in the nasal part might have a similar effect. ACKNOWLEDGEMENT We would like to express our thanks to Erik Jansson for assistance and valuable advice during the experiments. REFERENCES Badin, P., Hertegard, S., & Karlsson, I. (1990): "Notes on the Rothenberg mask," STL-QPSR No. 1, pp 1-7. Fant, G., Liljencrants, J., & Lin, Q. (1985): "A four parameter model of glottal flow," STL- QPSR NO. 4, pp Gauffin, J. & Sundberg, J. (1989): "Spectral correlates of glottal voice source waveform characteristics," J.Speech & Hear.Res. 32, pp Hertegard, S. & Gauffin, J. (1991): "Insufficient vocal fold closure as studied by inverse filtering," pp in (J. Gauffin & B. Hammarberg, eds.), Vocal Fold Physiology: Acoustic, Perceptual and Physiolo~ical Aspects of Voice Mechanism, Singular Publ. Group, Inc. San Diego, CA. Hertegard, S., Gauffin, J., & Karlsson, I. (1992): "Physiological correlates of the inverse filtered flow waveform," J. Voice 6:3, pp Holmberg, E., Hillman, R., & Perkell, J. (1988): "Glottal airflow and transglottal airpressure measurements for male and female speakers in soft, normal and loud voice," J.Acoust.Soc.Am. 84, pp Isshiki, N. (1964): "Regulatory mechanism of voice intensity variation," J.Speech & Hear.Res. 7, pp
12 STL-QPSR 2-3/ 1992 Karlsson, I. (1992): Analysis and Synthesis of Diferent Voices with Emphasis on Female Speech, Diss., Dept. of Speech Communication and Music Acoustics, KTH, Stockholm. Lofquist, A. (1991): "Inverse filtering as a tool in voice research and therapy," Scand. 1. Logopedics and Phoniatrics 16, pp Rothenberg, M. (1973): "A new inverse-filtering for deriving the glottal airflow waveform during voicing, " ].Acoust.Soc.Am. 53, pp Rothenberg, M. (1977): "Measurement of airflow in speech," ].Speech G. Hear.Res. 20, pp Rothenberg, M. (1987): "Cosi fan tutte and what it means or Nonlinear source-tract acoustic interaction in the soprano voice and some implications for the definition of vocal efficiency," pp in (T.B. Baer, C. Sasaki, & K.S. Harris, eds.), Laryngeal Function in Phonation and Respiration (Proc. Vocal Fold Physiology Conf. 1985), Singular Publ. Group, Inc., San Diego, CA. Schutte, H. (1980): "The efficiency of voice production," Groningen (issued by the author).
Quarterly Progress and Status Report. Notes on the Rothenberg mask
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Notes on the Rothenberg mask Badin, P. and Hertegård, S. and Karlsson, I. journal: STL-QPSR volume: 31 number: 1 year: 1990 pages:
More informationQuarterly Progress and Status Report. Vocal fold vibration and voice source aperiodicity in phonatorily distorted singing
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Vocal fold vibration and voice source aperiodicity in phonatorily distorted singing Zangger Borch, D. and Sundberg, J. and Lindestad,
More informationSignificance of analysis window size in maximum flow declination rate (MFDR)
Significance of analysis window size in maximum flow declination rate (MFDR) Linda M. Carroll, PhD Department of Otolaryngology, Mount Sinai School of Medicine Goal: 1. To determine whether a significant
More informationINTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006
1. Resonators and Filters INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006 Different vibrating objects are tuned to specific frequencies; these frequencies at which a particular
More informationVocal fold vibration and voice source aperiodicity in dist tones: a study of a timbral ornament in rock singing
æoriginal ARTICLE æ Vocal fold vibration and voice source aperiodicity in dist tones: a study of a timbral ornament in rock singing D. Zangger Borch 1, J. Sundberg 2, P.-Å. Lindestad 3 and M. Thalén 1
More informationQuarterly Progress and Status Report. A note on the vocal tract wall impedance
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report A note on the vocal tract wall impedance Fant, G. and Nord, L. and Branderud, P. journal: STL-QPSR volume: 17 number: 4 year: 1976
More informationCHAPTER 3. ACOUSTIC MEASURES OF GLOTTAL CHARACTERISTICS 39 and from periodic glottal sources (Shadle, 1985; Stevens, 1993). The ratio of the amplitude of the harmonics at 3 khz to the noise amplitude in
More informationParameterization of the glottal source with the phase plane plot
INTERSPEECH 2014 Parameterization of the glottal source with the phase plane plot Manu Airaksinen, Paavo Alku Department of Signal Processing and Acoustics, Aalto University, Finland manu.airaksinen@aalto.fi,
More informationSPEECH AND SPECTRAL ANALYSIS
SPEECH AND SPECTRAL ANALYSIS 1 Sound waves: production in general: acoustic interference vibration (carried by some propagation medium) variations in air pressure speech: actions of the articulatory organs
More informationAspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification. Daryush Mehta
Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification Daryush Mehta SHBT 03 Research Advisor: Thomas F. Quatieri Speech and Hearing Biosciences and Technology 1 Summary Studied
More information2007 Elsevier Science. Reprinted with permission from Elsevier.
Lehto L, Airas M, Björkner E, Sundberg J, Alku P, Comparison of two inverse filtering methods in parameterization of the glottal closing phase characteristics in different phonation types, Journal of Voice,
More informationDIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS
DIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS John Smith Joe Wolfe Nathalie Henrich Maëva Garnier Physics, University of New South Wales, Sydney j.wolfe@unsw.edu.au Physics, University of New South
More informationExperimental evaluation of inverse filtering using physical systems with known glottal flow and tract characteristics
Experimental evaluation of inverse filtering using physical systems with known glottal flow and tract characteristics Derek Tze Wei Chu and Kaiwen Li School of Physics, University of New South Wales, Sydney,
More informationLinguistic Phonetics. Spectral Analysis
24.963 Linguistic Phonetics Spectral Analysis 4 4 Frequency (Hz) 1 Reading for next week: Liljencrants & Lindblom 1972. Assignment: Lip-rounding assignment, due 1/15. 2 Spectral analysis techniques There
More informationQuantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation
Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Peter J. Murphy and Olatunji O. Akande, Department of Electronic and Computer Engineering University
More informationQuarterly Progress and Status Report. Synthesis of selected VCV-syllables in singing
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Synthesis of selected VCV-syllables in singing Zera, J. and Gauffin, J. and Sundberg, J. journal: STL-QPSR volume: 25 number: 2-3
More informationQuarterly Progress and Status Report. Speech waveform perturbation analysis
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Speech waveform perturbation analysis Askenfelt, A. and Hammarberg, B. journal: STL-QPSR volume: 21 number: 4 year: 1980 pages:
More informationThe source-filter model of speech production"
24.915/24.963! Linguistic Phonetics! The source-filter model of speech production" Glottal airflow Output from lips 400 200 0.1 0.2 0.3 Time (in secs) 30 20 10 0 0 1000 2000 3000 Frequency (Hz) Source
More informationAirflow visualization in a model of human glottis near the self-oscillating vocal folds model
Applied and Computational Mechanics 5 (2011) 21 28 Airflow visualization in a model of human glottis near the self-oscillating vocal folds model J. Horáček a,, V. Uruba a,v.radolf a, J. Veselý a,v.bula
More informationQuarterly Progress and Status Report. On the body resonance C3 and its relation to top and back plate stiffness
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report On the body resonance C3 and its relation to top and back plate stiffness Jansson, E. V. and Niewczyk, B. K. and Frydén, L. journal:
More informationA() I I X=t,~ X=XI, X=O
6 541J Handout T l - Pert r tt Ofl 11 (fo 2/19/4 A() al -FA ' AF2 \ / +\ X=t,~ X=X, X=O, AF3 n +\ A V V V x=-l x=o Figure 3.19 Curves showing the relative magnitude and direction of the shift AFn in formant
More informationASPIRATION NOISE DURING PHONATION: SYNTHESIS, ANALYSIS, AND PITCH-SCALE MODIFICATION DARYUSH MEHTA
ASPIRATION NOISE DURING PHONATION: SYNTHESIS, ANALYSIS, AND PITCH-SCALE MODIFICATION by DARYUSH MEHTA B.S., Electrical Engineering (23) University of Florida SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING
More informationMask-Based Nasometry A New Method for the Measurement of Nasalance
Publications of Dr. Martin Rothenberg: Mask-Based Nasometry A New Method for the Measurement of Nasalance ABSTRACT The term nasalance has been proposed by Fletcher and his associates (Fletcher and Frost,
More informationAalto Aparat A Freely Available Tool for Glottal Inverse Filtering and Voice Source Parameterization
[LOGO] Aalto Aparat A Freely Available Tool for Glottal Inverse Filtering and Voice Source Parameterization Paavo Alku, Hilla Pohjalainen, Manu Airaksinen Aalto University, Department of Signal Processing
More informationAn Experimentally Measured Source Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model
Acoust Aust (2016) 44:187 191 DOI 10.1007/s40857-016-0046-7 TUTORIAL PAPER An Experimentally Measured Source Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model Joe Wolfe
More informationQuarterly Progress and Status Report. Computing formant frequencies for VT configurations with abruptly changing area functions
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Computing formant frequencies for VT configurations with abruptly changing area functions Sundberg, J. and Lindblom, B. journal:
More informationSubtractive Synthesis & Formant Synthesis
Subtractive Synthesis & Formant Synthesis Prof Eduardo R Miranda Varèse-Gastprofessor eduardo.miranda@btinternet.com Electronic Music Studio TU Berlin Institute of Communications Research http://www.kgw.tu-berlin.de/
More informationLab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels
Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels A complex sound with particular frequency can be analyzed and quantified by its Fourier spectrum: the relative amplitudes
More informationOn the function of the violin - vibration excitation and sound radiation.
TMH-QPSR 4/1996 On the function of the violin - vibration excitation and sound radiation. Erik V Jansson Abstract The bow-string interaction results in slip-stick motions of the bowed string. The slip
More informationOn the glottal flow derivative waveform and its properties
COMPUTER SCIENCE DEPARTMENT UNIVERSITY OF CRETE On the glottal flow derivative waveform and its properties A time/frequency study George P. Kafentzis Bachelor s Dissertation 29/2/2008 Supervisor: Yannis
More informationCOMP 546, Winter 2017 lecture 20 - sound 2
Today we will examine two types of sounds that are of great interest: music and speech. We will see how a frequency domain analysis is fundamental to both. Musical sounds Let s begin by briefly considering
More informationSubglottal coupling and its influence on vowel formants
Subglottal coupling and its influence on vowel formants Xuemin Chi a and Morgan Sonderegger b Speech Communication Group, RLE, MIT, Cambridge, Massachusetts 02139 Received 25 September 2006; revised 14
More informationQuarterly Progress and Status Report. Electroglottograph and contact microphone for measuring vocal pitch
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Electroglottograph and contact microphone for measuring vocal pitch Askenfelt, A. and Gauffin, J. and Kitzing, P. and Sundberg,
More informationResonance and resonators
Resonance and resonators Dr. Christian DiCanio cdicanio@buffalo.edu University at Buffalo 10/13/15 DiCanio (UB) Resonance 10/13/15 1 / 27 Harmonics Harmonics and Resonance An example... Suppose you are
More informationX. SPEECH ANALYSIS. Prof. M. Halle G. W. Hughes H. J. Jacobsen A. I. Engel F. Poza A. VOWEL IDENTIFIER
X. SPEECH ANALYSIS Prof. M. Halle G. W. Hughes H. J. Jacobsen A. I. Engel F. Poza A. VOWEL IDENTIFIER Most vowel identifiers constructed in the past were designed on the principle of "pattern matching";
More informationAnalysis and Synthesis of Pathological Voice Quality
Second Edition Revised November, 2016 33 Analysis and Synthesis of Pathological Voice Quality by Jody Kreiman Bruce R. Gerratt Norma Antoñanzas-Barroso Bureau of Glottal Affairs Department of Head/Neck
More informationWaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels. Spectrogram. See Rogers chapter 7 8
WaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels See Rogers chapter 7 8 Allows us to see Waveform Spectrogram (color or gray) Spectral section short-time spectrum = spectrum of a brief
More informationThe Correlogram: a visual display of periodicity
The Correlogram: a visual display of periodicity Svante Granqvist* and Britta Hammarberg** * Dept of Speech, Music and Hearing, KTH, Stockholm; Electronic mail: svante.granqvist@speech.kth.se ** Dept of
More informationA perceptually and physiologically motivated voice source model
INTERSPEECH 23 A perceptually and physiologically motivated voice source model Gang Chen, Marc Garellek 2,3, Jody Kreiman 3, Bruce R. Gerratt 3, Abeer Alwan Department of Electrical Engineering, University
More informationA Review of Glottal Waveform Analysis
A Review of Glottal Waveform Analysis Jacqueline Walker and Peter Murphy Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland jacqueline.walker@ul.ie,peter.murphy@ul.ie
More informationDigitally controlled Active Noise Reduction with integrated Speech Communication
Digitally controlled Active Noise Reduction with integrated Speech Communication Herman J.M. Steeneken and Jan Verhave TNO Human Factors, Soesterberg, The Netherlands herman@steeneken.com ABSTRACT Active
More informationAutomatic estimation of the lip radiation effect in glottal inverse filtering
INTERSPEECH 24 Automatic estimation of the lip radiation effect in glottal inverse filtering Manu Airaksinen, Tom Bäckström 2, Paavo Alku Department of Signal Processing and Acoustics, Aalto University,
More informationCOMPARING ACOUSTIC GLOTTAL FEATURE EXTRACTION METHODS WITH SIMULTANEOUSLY RECORDED HIGH- SPEED VIDEO FEATURES FOR CLINICALLY OBTAINED DATA
University of Kentucky UKnowledge Theses and Dissertations--Electrical and Computer Engineering Electrical and Computer Engineering 2012 COMPARING ACOUSTIC GLOTTAL FEATURE EXTRACTION METHODS WITH SIMULTANEOUSLY
More informationBlock diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals.
XIV. SPEECH COMMUNICATION Prof. M. Halle G. W. Hughes J. M. Heinz Prof. K. N. Stevens Jane B. Arnold C. I. Malme Dr. T. T. Sandel P. T. Brady F. Poza C. G. Bell O. Fujimura G. Rosen A. AUTOMATIC RESOLUTION
More informationStructure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping
Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics
More information3D Intermodulation Distortion Measurement AN 8
3D Intermodulation Distortion Measurement AN 8 Application Note to the R&D SYSTEM The modulation of a high frequency tone f (voice tone and a low frequency tone f (bass tone is measured by using the 3D
More informationThe effect of whisper and creak vocal mechanisms on vocal tract resonances
The effect of whisper and creak vocal mechanisms on vocal tract resonances Yoni Swerdlin, John Smith, a and Joe Wolfe School of Physics, University of New South Wales, Sydney, New South Wales 5, Australia
More informationQuarterly Progress and Status Report. Observations on the transient components of the piano tone
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Observations on the transient components of the piano tone Askenfelt, A. journal: STL-QPSR volume: 34 number: 4 year: 1993 pages:
More informationSource-filter Analysis of Consonants: Nasals and Laterals
L105/205 Phonetics Scarborough Handout 11 Nov. 3, 2005 reading: Johnson Ch. 9 (today); Pickett Ch. 5 (Tues.) Source-filter Analysis of Consonants: Nasals and Laterals 1. Both nasals and laterals have voicing
More informationAcoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13
Acoustic Phonetics How speech sounds are physically represented Chapters 12 and 13 1 Sound Energy Travels through a medium to reach the ear Compression waves 2 Information from Phonetics for Dummies. William
More informationFoundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants
Foundations of Language Science and Technology Acoustic Phonetics 1: Resonances and formants Jan 19, 2015 Bernd Möbius FR 4.7, Phonetics Saarland University Speech waveforms and spectrograms A f t Formants
More informationGlottal source model selection for stationary singing-voice by low-band envelope matching
Glottal source model selection for stationary singing-voice by low-band envelope matching Fernando Villavicencio Yamaha Corporation, Corporate Research & Development Center, 3 Matsunokijima, Iwata, Shizuoka,
More informationMeasurement of Weighted Harmonic Distortion HI-2
Measurement of Weighted Harmonic Distortion HI-2 Application Note for the R&D and QC SYSTEM (Document Revision 1.2) AN 7 DESCRIPTION The weighted harmonic distortion HI-2 can be measured by using the DIS-Pro
More informationReading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.
L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are
More informationQuarterly Progress and Status Report. On certain irregularities of voiced-speech waveforms
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report On certain irregularities of voiced-speech waveforms Dolansky, L. and Tjernlund, P. journal: STL-QPSR volume: 8 number: 2-3 year:
More informationSteady state phonation is never perfectly steady. Phonation is characterized
Perception of Vocal Tremor Jody Kreiman Brian Gabelman Bruce R. Gerratt The David Geffen School of Medicine at UCLA Los Angeles, CA Vocal tremors characterize many pathological voices, but acoustic-perceptual
More informationPDF hosted at the Radboud Repository of the Radboud University Nijmegen
PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/76252
More informationRD75, RD50, RD40, RD28.1 Planar magnetic transducers with true line source characteristics
RD75, RD50, RD40, RD28.1 Planar magnetic transducers true line source characteristics The RD line of planar-magnetic ribbon drivers represents the ultimate thin film diaphragm technology. The RD drivers
More informationSource-filter analysis of fricatives
24.915/24.963 Linguistic Phonetics Source-filter analysis of fricatives Figure removed due to copyright restrictions. Readings: Johnson chapter 5 (speech perception) 24.963: Fujimura et al (1978) Noise
More informationThe purpose of this study was to establish the relation
JSLHR Article Relation of Structural and Vibratory Kinematics of the Vocal Folds to Two Acoustic Measures of Breathy Voice Based on Computational Modeling Robin A. Samlan a and Brad H. Story a Purpose:
More informationAN ANALYSIS OF ITERATIVE ALGORITHM FOR ESTIMATION OF HARMONICS-TO-NOISE RATIO IN SPEECH
AN ANALYSIS OF ITERATIVE ALGORITHM FOR ESTIMATION OF HARMONICS-TO-NOISE RATIO IN SPEECH A. Stráník, R. Čmejla Department of Circuit Theory, Faculty of Electrical Engineering, CTU in Prague Abstract Acoustic
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,
More informationRespiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R.
Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R. Titze Director, National Center for Voice and Speech, University of Utah
More informationPerceptual evaluation of voice source models a)
Perceptual evaluation of voice source models a) Jody Kreiman, 1,b) Marc Garellek, 2 Gang Chen, 3,c) Abeer Alwan, 3 and Bruce R. Gerratt 1 1 Department of Head and Neck Surgery, University of California
More informationQuarterly Progress and Status Report. Frequency domain interpretation and derivation of glottal flow parameters
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Frequency domain interpretation and derivation of glottal flow parameters Fant, G. and Lin, Q. journal: STL-QPSR volume: 29 number:
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume, http://acousticalsociety.org/ ICA Montreal Montreal, Canada - June Musical Acoustics Session amu: Aeroacoustics of Wind Instruments and Human Voice II amu.
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationSpeech Synthesis; Pitch Detection and Vocoders
Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech
More informationinter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE
Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 6.1 AUDIBILITY OF COMPLEX
More informationSpeech Perception Speech Analysis Project. Record 3 tokens of each of the 15 vowels of American English in bvd or hvd context.
Speech Perception Map your vowel space. Record tokens of the 15 vowels of English. Using LPC and measurements on the waveform and spectrum, determine F0, F1, F2, F3, and F4 at 3 points in each token plus
More informationQuarterly Progress and Status Report. Formant amplitude measurements
Dept. for Speech, Music and Hearing Quarterly rogress and Status Report Formant amplitude measurements Fant, G. and Mártony, J. journal: STL-QSR volume: 4 number: 1 year: 1963 pages: 001-005 http://www.speech.kth.se/qpsr
More informationTechnique for the Derivation of Wide Band Room Impulse Response
Technique for the Derivation of Wide Band Room Impulse Response PACS Reference: 43.55 Behler, Gottfried K.; Müller, Swen Institute on Technical Acoustics, RWTH, Technical University of Aachen Templergraben
More informationAn introduction to physics of Sound
An introduction to physics of Sound Outlines Acoustics and psycho-acoustics Sound? Wave and waves types Cycle Basic parameters of sound wave period Amplitude Wavelength Frequency Outlines Phase Types of
More informationVocal effort modification for singing synthesis
INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Vocal effort modification for singing synthesis Olivier Perrotin, Christophe d Alessandro LIMSI, CNRS, Université Paris-Saclay, France olivier.perrotin@limsi.fr
More informationMeasurement of Equivalent Input Distortion. Wolfgang Klippel. Klippel GmbH,Dresden, 01277, Germany, Fellow
Wolfgang Klippel Klippel GmbH,Dresden, 01277, Germany, Fellow ABSTRACT A new technique for measuring nonlinear distortion in transducers is presented which considers a priori information from transducer
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence
More informationSpeech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065
Speech Processing Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 All course materials and handouts are the same for both versions. Differences: credits (20 for UG, 10 for PG);
More informationAdvanced Methods for Glottal Wave Extraction
Advanced Methods for Glottal Wave Extraction Jacqueline Walker and Peter Murphy Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland, jacqueline.walker@ul.ie, peter.murphy@ul.ie
More informationHCS 7367 Speech Perception
HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Power spectrum model of masking Assumptions: Only frequencies within the passband of the auditory filter contribute to masking. Detection is based
More informationDynamic Generation of DC Displacement AN 13
Dynamic Generation of DC Displacement AN 13 Application Note to the R&D SYSTEM Nonlinearities inherent in the transducer produce a DC component in the voice coil displacement by rectifying the AC signal.
More informationSource-Filter Theory 1
Source-Filter Theory 1 Vocal tract as sound production device Sound production by the vocal tract can be understood by analogy to a wind or brass instrument. sound generation sound shaping (or filtering)
More informationHi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz
Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz S. O Ternström Kungliga Tekniska Högskolan, Dept. of Speech, Music & Hearing, Lindstedtsvägen 24, SE-100
More informationPerception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.
Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions
More informationTransforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction
Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction by Karl Ingram Nordstrom B.Eng., University of Victoria, 1995 M.A.Sc., University of Victoria, 2000 A Dissertation
More informationEE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley
University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Speech Synthesis Spring,1999 Lecture 23 N.MORGAN
More informationLinguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review)
Linguistics 401 LECTURE #2 BASIC ACOUSTIC CONCEPTS (A review) Unit of wave: CYCLE one complete wave (=one complete crest and trough) The number of cycles per second: FREQUENCY cycles per second (cps) =
More informationSpeech Signal Analysis
Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for
More informationVOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL
VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL Narsimh Kamath Vishweshwara Rao Preeti Rao NIT Karnataka EE Dept, IIT-Bombay EE Dept, IIT-Bombay narsimh@gmail.com vishu@ee.iitb.ac.in
More informationCauses for Amplitude Compression AN 12
Causes for Amplitude AN 2 Application Note to the R&D SYSTEM Both thermal and nonlinear effects limit the amplitude of the fundamental component in the state variables and in the sound pressure output.
More informationA Multichannel Electroglottograph
Publications of Dr. Martin Rothenberg: A Multichannel Electroglottograph Published in the Journal of Voice, Vol. 6., No. 1, pp. 36-43, 1992 Raven Press, Ltd., New York Summary: It is shown that a practical
More informationQuarterly Progress and Status Report. A look at violin bows
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report A look at violin bows Askenfelt, A. journal: STL-QPSR volume: 34 number: 2-3 year: 1993 pages: 041-048 http://www.speech.kth.se/qpsr
More informationLoudspeaker Distortion Measurement and Perception Part 2: Irregular distortion caused by defects
Loudspeaker Distortion Measurement and Perception Part 2: Irregular distortion caused by defects Wolfgang Klippel, Klippel GmbH, wklippel@klippel.de Robert Werner, Klippel GmbH, r.werner@klippel.de ABSTRACT
More informationAn Implementation of the Klatt Speech Synthesiser*
REVISTA DO DETUA, VOL. 2, Nº 1, SETEMBRO 1997 1 An Implementation of the Klatt Speech Synthesiser* Luis Miguel Teixeira de Jesus, Francisco Vaz, José Carlos Principe Resumo - Neste trabalho descreve-se
More informationMeasurement of weighted harmonic distortion HI-2
Measurement of weighted harmonic distortion HI-2 Software of the KLIPPEL R&D and QC SYSTEM ( Document Revision 1.0) AN 7 DESCRIPTION The weighted harmonic distortion HI-2 is measured by using the DIS-Pro
More informationAP Homework (Q2) Does the sound intensity level obey the inverse-square law? Why?
AP Homework 11.1 Loudness & Intensity (Q1) Which has a more direct influence on the loudness of a sound wave: the displacement amplitude or the pressure amplitude? Explain your reasoning. (Q2) Does the
More informationSynthesis Algorithms and Validation
Chapter 5 Synthesis Algorithms and Validation An essential step in the study of pathological voices is re-synthesis; clear and immediate evidence of the success and accuracy of modeling efforts is provided
More informationPerceived Pitch of Synthesized Voice with Alternate Cycles
Journal of Voice Vol. 16, No. 4, pp. 443 459 2002 The Voice Foundation Perceived Pitch of Synthesized Voice with Alternate Cycles Xuejing Sun and Yi Xu Department of Communication Sciences and Disorders,
More informationReverberation time and structure loss factor
Reverberation time and structure loss factor CHRISTER HEED SD2165 Stockholm October 2008 Marcus Wallenberg Laboratoriet för Ljud- och Vibrationsforskning Reverberation time and structure loss factor Christer
More informationINTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)
INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) Proceedings of the 2 nd International Conference on Current Trends in Engineering and Management ICCTEM -214 ISSN
More informationA R T A - A P P L I C A T I O N N O T E
Introduction A R T A - A P P L I C A T I O N N O T E The AES-Recommendation 2-1984 (r2003) [01] defines the estimation of linear displacement of a loudspeaker as follows: Voice-coil peak displacement at
More information