Quarterly Progress and Status Report. Acoustic properties of the Rothenberg mask

Dept. for Speech, Music and Hearing
Quarterly Progress and Status Report
Acoustic properties of the Rothenberg mask
Hertegård, S. and Gauffin, J.
journal: STL-QPSR
volume: 33
number: 2-3
year: 1992
pages:

STL-QPSR 2-3/1992

ACOUSTIC PROPERTIES OF THE ROTHENBERG MASK

Stellan Hertegård* & Jan Gauffin

Abstract
The flow response of the Rothenberg mask system, and the possible distortion it imposes on radiated speech, were studied by means of sweep-tone measurements. The flow frequency response was flat within 3 dB up to 1.6 kHz. For the speech signal radiated through the mask, this frequency range normally includes the lowest two formants of open vowels and is probably sufficient to describe most aspects of the glottal flow waveform for untrained normal and pathological voices. A pronounced zero was found between 1.8 and 2 kHz. This restricts the use of the mask systems tested here for airflow measurements at higher frequencies. It was shown that this zero could be moved up in frequency, extending the useful frequency response to nearly 3 kHz. We suggest that the zero is caused by acoustic shunting of the nasal part of the mask. A modified mask design, with wire screen placed also in the nasal part, might substantially improve the frequency response.

INTRODUCTION
Mean airflow at the lips is often measured in order to assess vocal function. If the subglottal pressure during phonation is also measured, the glottal resistance during phonation can be calculated as the quotient of pressure and flow (Isshiki, 1964; Schutte, 1980). This gives an estimate of the mean "closing" ability of the glottis. Pneumotachograph masks are frequently used for measuring mean airflow during phonation of a vowel.

The disadvantage of calculating glottal resistance from mean airflow is that it is impossible to study each individual glottal cycle and to measure the absolute airflow during the closed phase. Fig. 1 illustrates that a certain mean airflow can be produced by a highly modulated transglottal airflow with zero airflow during the closed phase, indicating complete glottal closure.
The same mean airflow could also result from a smaller modulated airflow combined with a constant airflow (waveform offset) that persists even during the presumed closed phase, indicating insufficient glottal closure**. These two alternatives would not result in the same voice quality, but they cannot be separated from measurements of the mean airflow alone.

The circumferentially vented wire screen mask designed by Rothenberg (Fig. 2) has some properties that distinguish it from many other pneumotachograph masks used for measuring mean airflow at the lips (Rothenberg, 1973, 1977). In the Rothenberg mask a number of holes have been drilled around the circumference close to the mouth. These are covered with a fine-meshed single- or double-layered steel wire screen that produces an acoustic resistance. A differential pressure transducer with a wide frequency range is used as a microphone. It has one input on the inside of the wire screen and another on the outside, which provides a measure of the pressure drop across the screen and thus an estimate of the flow. The purpose of the design is to reduce the effect of the mask on the acoustic resonances of the vocal tract and also to extend the frequency response of the mask system. If the airflow at the lips is properly inverse filtered, each glottal pulse during phonation can be studied in more detail, and the absolute transglottal airflow during the open and closed phases can be measured after calibration. The response time and acoustic properties of the mask system have been studied by Rothenberg (1973; 1977).

* Dept. of Logopedics and Phoniatrics, Huddinge University Hospital, Karolinska Institute, Stockholm.
** A small amount of waveform offset during the closed phase could be caused by vertical movements of the vocal folds, as previously discussed in Hertegård, Gauffin, & Karlsson, 1992.

Fig. 1. Mean airflow and transglottal airflow for two types of phonation: A, with a high modulated flow amplitude and zero airflow during the closed phase; B, with a small amount of modulated airflow and a constant waveform offset. (Axes: transglottal airflow against time, with the closed phase marked.)

Fig. 2. Schematic drawing of the Rothenberg mask. (Legible label: metallic wire screen covering the holes.)

Results from studies of the upper frequency limit of the flow response of the mask system, and an assessment of the possible distortion of the radiated speech sound caused by the mask, were presented in a previous paper (Badin, Hertegård, & Karlsson, 1990). In the present study complementary experiments are described and their implications for clinical use of the Rothenberg mask system are discussed.

METHODS
The flow response of the mask system was tested by sweep-tone analysis. Fig. 3 shows the experimental set-up. The mask was connected via a ceramic adapter provided by Glottal Enterprises, Syracuse, USA, and placed on a wooden plate with a central 5 cm hole on top of a small loudspeaker driven by a Hewlett Packard Dynamic Signal Analyser, type 3562A. The Dynamic Signal Analyser produced a constant-amplitude frequency sweep. The loudspeaker and adapter simulated a short vocal tract with its first formant at approximately 3 kHz. The output of the mask electronics was recorded by the Dynamic Signal Analyser.

The microphone in the mask system tested was a differential pressure transducer (type MTW). The mask electronics in the experiment were of type MSIF-2, which has a built-in 3 kHz low-pass filter. The masks tested were a larger double layer mask (type MA-2N) with a resistance of approximately 0.5 cm H2O/l/s, a smaller double layer mask (type MA-2S), and a single layer wire screen mask (type MA-1N) with a resistance of about 0.25 cm H2O/l/s. All these items were manufactured by Glottal Enterprises, Syracuse, USA, the maker of the Rothenberg mask systems.

Fig. 3. The experimental set-up for the sweep-tone measurements. (Labels: Rothenberg mask on ceramic adaptor, mask preamplifier, wooden plate, sweep input, Dynamic Signal Analyser, loudspeaker.)

The frequency sweep provided by the loudspeaker was first tested with a free field sweep. This was measured with a Brüel & Kjær (BK) 1/2" microphone, which has a flat frequency response to well above 5 kHz, and the Dynamic Signal Analyser. Fig. 4 shows that the sweep provided by the loudspeaker was linear within 3 dB from 200 Hz to approximately 3 kHz. Thus, it can be concluded that the loudspeaker provided a sinusoidal volume flow with an amplitude that decreased with a constant slope of 6 dB/octave from 200 Hz to close to 3 kHz.

RESULTS
Sweep-tone analysis was carried out for each of the three masks as described above. Fig. 5 shows the result of the sweep for the larger double layer mask, type MA-2N (with the MTW transducer and the MSIF-2 preamplifier). A -6 dB/octave line was drawn in the figure. The response of this mask system was linear within 3 dB up to 1.6 kHz. At around 1.9 kHz there was a pronounced dip of about 15 dB below the -6 dB/octave line. Slightly below 3 kHz there was a peak. The level fell for frequencies above 3 kHz, probably due to the low-pass filter of the mask electronics. There was also a small dip around 400 to 500 Hz. Vibrations could be felt in the measuring equipment during the sweep around these frequencies, indicating that this dip was caused by a mechanical resonance in the measuring equipment itself. This small dip was also observed during the sweep-tone measurements for the other masks tested.

Fig. 6 shows the results of the sweep-tone measurements for all three masks described above. All three mask systems had a linear frequency response within 3 dB up to kHz. They all had a zero around 2 kHz and a peak near 3 kHz.
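
The flatness criterion used in these results (the measured level staying within 3 dB of a -6 dB/octave reference line) can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the procedure carried out on the Dynamic Signal Analyser: the function names, the choice to anchor the reference line at the first measured point, and the synthetic response with a 15 dB dip at 1.9 kHz are all hypothetical.

```python
import math


def reference_line_db(freq_hz, anchor_hz, anchor_db, slope_db_per_octave=-6.0):
    """Level of a straight line on a log-frequency axis that passes
    through (anchor_hz, anchor_db) with the given slope per octave."""
    return anchor_db + slope_db_per_octave * math.log2(freq_hz / anchor_hz)


def flat_upper_limit(freqs_hz, levels_db, tolerance_db=3.0,
                     slope_db_per_octave=-6.0):
    """Return the highest frequency up to which every measured point
    stays within tolerance_db of the reference line.

    Assumption for this sketch: the reference line is anchored at the
    first measured point. Returns None if no point qualifies.
    """
    anchor_hz, anchor_db = freqs_hz[0], levels_db[0]
    limit = None
    for freq, level in zip(freqs_hz, levels_db):
        expected = reference_line_db(freq, anchor_hz, anchor_db,
                                     slope_db_per_octave)
        if abs(level - expected) > tolerance_db:
            break  # first point outside the tolerance band
        limit = freq
    return limit


# Hypothetical data: an ideal -6 dB/octave response with a 15 dB dip
# inserted at 1900 Hz. These are not measured values.
freqs = [200, 400, 800, 1600, 1900, 3000]
measured = [reference_line_db(f, 200, 0.0) for f in freqs]
measured[4] -= 15.0

print(flat_upper_limit(freqs, measured))
```

With real sweep data, the result depends on where the reference line is anchored, so the anchoring rule would need to be chosen to match how the line was drawn in the figure.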

Fig. 4. A free field sweep registered with a Brüel & Kjær 1/2" microphone held over the loudspeaker with no mask present. The ripple in the curve is due to noise in the experimental room. (Horizontal axis: logarithmic frequency, Hz.)

Fig. 5. The frequency response from a sweep with the equipment as described in Fig. 3. A larger double layer mask (type MA-2N) was used. A -6 dB/octave line was superimposed for comparison.
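
As a brief worked note on the -6 dB/octave reference line: if the amplitude of the sinusoidal volume flow is assumed to fall in inverse proportion to frequency (an assumption consistent with the constant slope reported for the loudspeaker; the symbols U_ref and f_ref are arbitrary reference values introduced here), the level change for each doubling of frequency is:

```latex
U(f) = U_{\mathrm{ref}} \, \frac{f_{\mathrm{ref}}}{f}
\quad\Longrightarrow\quad
\Delta L = 20 \log_{10} \frac{U(2f)}{U(f)}
         = 20 \log_{10} \tfrac{1}{2}
         \approx -6.02\ \text{dB per octave}
```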

Fig. 7 shows a sweep with the MTW transducer in free field without a mask, approximately 2 cm above the ceramic adapter and the loudspeaker, and another sweep with the transducer held 2 cm above the larger double layer MA-2N mask (which was held firmly to the ceramic adapter). The sound level was generally lower with the mask present. The free field recording without the mask did not show any dip near 2 kHz, whereas a dip was present near 2 kHz with the mask. There was a peak near 3 kHz in both recordings, probably caused by the first "formant" of the acoustic system, as mentioned above. For Fig. 8, similar sweeps were made with a 1/2" BK microphone in free field above the loudspeaker and adapter, without the mask and with the mask held firmly to the adapter. A dip was present near 2 kHz with, but not without, the mask.

Fig. 8. The frequency response from a free field sweep registered with a 1/2" Brüel & Kjær microphone held above the MA-2N mask (solid line) and without the mask (dashed line). (Horizontal axis: logarithmic frequency, Hz.)

Further analysis of the zero near 2 kHz
In order to investigate whether the zero near 2 kHz was caused by a mechanical resonance in the mask or microphone, we excited the mask (type MA-2N) and microphone with a mechanical pulse and measured the mechanical vibrations at different points. The pulse was produced by a small hammer fitted with an accelerometer (PCB Impulse force hammer type 086M37) connected to the HP Dynamic Analyzer. This revealed a mechanical resonance in the connector wire hold of the microphone at approximately 1.9 kHz. This resonance was effectively damped by a piece of plasticine. By repeating the sweep-tone measurements we could conclude that this mechanical resonance had only a marginal effect on the zero.

Since no mechanical resonance was causing the dip, we suspected a cross resonance or a shunt in the mask itself. By filling out the nasal part of the mask with plasticine we could move the zero at 2 kHz upwards in frequency to over 3 kHz (Fig. 9).

Fig. 9. The frequency response from sweep-tone measurements using the equipment as described in Fig. 3, including the MA-2N mask (solid line) and with plasticine damping in the nasal part of the mask (dotted line).

DISCUSSION
The acoustic properties of the mask system have previously been studied by Rothenberg (1973; 1977). He reported the mask to be linear for a static airflow from zero up to well above 1 l/s, which is satisfactory for most clinical conditions. He also found that for open vowels, such as /a/ and /a/, the lowest two formants (which are the most important for the shape of the waveform during inverse filtering) were lowered by Hz due to the increase in effective vocal tract length caused by the mask. The transmission characteristics of the mask were found to be linear within 6 dB for speech range frequencies, except for a pronounced dip at kHz. This dip probably corresponds to the dip found around 1.7 kHz in our previous study (Badin, et al., 1990) and between 1.8 and 2 kHz in the present report.

Badin, et al. (1990) studied the effect of the mask on radiated speech sound pressure by means of LTAS analysis of speech samples with and without the mask. This analysis showed a lower sound pressure level around 2 kHz in recordings with the mask than in recordings without it. This indicates that the zero near 2 kHz exists both in the speech measurements and in the sweep-tone measurements with the mask. All free field measurements without the mask failed to show any zero. This indicates that the zero was caused by a cross resonance or a shunt in the mask itself.

In our previous study it was shown that the frequency of the zero varied somewhat for different damping foam settings in the mask microphone and the mask (Badin, et al., 1990), but the dip could not be eliminated. Pressing the mask with varying force against the adapter or face did not significantly affect the dip. As described in this paper, a marginal damping of the zero resulted from damping a small cavity in the transparent plastic microphone adapter. From the present experiments it is also apparent that the measurement equipment itself was not responsible for the dip. We have also made some tests of a prototype of a new pressure transducer provided by Glottal Enterprises (type PTW). Those sweep-tone measurements failed to give a better frequency response, and the zero mentioned above was also present with that transducer.

We conclude that the zero near 2 kHz, which limits the response of the mask system tested here, is due to acoustic shunting in the mask. By filling out the nasal part of the mask the dip was moved up in frequency, resulting in a substantially improved frequency response.

Implications for clinical use of the mask system tested
In our previous study (Badin, et al., 1990) we concluded that the frequency response of the mask system tested was essentially flat up to 1 kHz. However, after additional testing of different masks it seems that the response of the mask systems tested here was flat within 3 dB up to 1.6 kHz. This includes harmonics for a male speaker with a normal fundamental frequency around 120 Hz. For females it would include 7 harmonics in the normal speaking range. The frequency range also includes the first and second formants for open vowels, such as /a/, used in speech samples by most researchers who have published results from studies with the Rothenberg mask (Gauffin & Sundberg, 1989; Hertegård & Gauffin, 1991; Hertegård, Gauffin, & Karlsson, 1992; Holmberg, Hillman, & Perkell, 1988; Karlsson, 1992; Löfqvist, 1991; Rothenberg, 1973; 1977). This frequency range is probably sufficient to describe the most important aspects of the glottal waveform.
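
The relation between fundamental frequency and the number of harmonics that fall inside the flat part of the response can be sketched in a few lines of Python. This is a hedged illustration only: the function name is hypothetical, the 1.6 kHz limit is the value reported in this paper, and the two fundamental frequencies are example inputs rather than measured data.

```python
import math


def harmonics_within(f0_hz, upper_limit_hz=1600.0):
    """Count the harmonics k * f0 (k = 1, 2, ...) that lie at or below
    upper_limit_hz. The default limit is the 1.6 kHz flat range reported
    for the mask systems tested; f0_hz is supplied by the caller."""
    if f0_hz <= 0:
        raise ValueError("f0_hz must be positive")
    return math.floor(upper_limit_hz / f0_hz)


# Example fundamental frequencies (illustrative values only).
for f0 in (120.0, 200.0):
    print(f0, harmonics_within(f0))
```

The count falls as the fundamental frequency rises, so the same upper limit covers fewer harmonics for a higher-pitched voice.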

Normally, closed vowels (such as /i/) and nasalized vowels are avoided in speech samples because of difficulties in performing a proper inverse filtering, regardless of whether a mask recording or an ordinary pressure microphone recording is used. The level of the higher harmonics is usually lower for pathological voices, which often have insufficient vocal fold closure. The transglottal waveform does not often seem to be affected by higher harmonics in these cases. On the other hand, for trained voices, such as a singer with a prominent so-called singer's formant, harmonics near 3 kHz may influence the shape of the glottal velocity waveform. In these cases, results from studies using the mask systems tested here must be evaluated with caution if details of the waveform are described. Glottal waveform parameters used in voice synthesis, such as in the so-called Liljencrants-Fant (LF) glottal model, depend on a correct frequency response at higher frequencies (Fant, Liljencrants, & Lin, 1985).

Fig. 10. The Rothenberg mask. The arrow is pointing to the shunting cavity.

If data are collected for such a model, an ordinary pressure microphone recording or special mask systems (Rothenberg, 1987) are probably preferable.

The improved frequency response obtained with the nasal part of the mask filled out by plasticine indicates that some modifications in mask design might substantially improve the response. Placing holes with wire screen also in the nasal part of the mask might have the same effect (Fig. 10).

CONCLUSION
The present experiments indicate that the tested mask systems appear to be linear within 3 dB from static airflow up to around 1.6 kHz. For a male voice with a fundamental frequency of 120 Hz during phonation this includes harmonics, and for an open vowel like /a/ it also includes both the first and second formants, which are the most important for the waveform. The same will be true for female voices with fundamental frequencies around 200 Hz. For pathological voices the level of the higher harmonics is often lowered due to insufficient vocal fold closure. This means that the frequency response of the mask seems sufficient for measurements on most patients with voice problems. However, if details of the waveform are studied from mask recordings made on subjects with trained voices and more prominent higher harmonics, the results must be evaluated with caution.

A zero in the frequency response near 2 kHz seems to be caused mainly by the shunting of the nasal part of the mask. This dip could be moved up in frequency by filling out the nasal part of the mask, extending the response to around 3 kHz. A modification of the mask design with wire screen placed in the nasal part might have a similar effect.

ACKNOWLEDGEMENT
We would like to express our thanks to Erik Jansson for assistance and valuable advice during the experiments.

REFERENCES
Badin, P., Hertegård, S., & Karlsson, I. (1990): "Notes on the Rothenberg mask," STL-QPSR No. 1, pp. 1-7.
Fant, G., Liljencrants, J., & Lin, Q. (1985): "A four parameter model of glottal flow," STL-QPSR No. 4, pp.
Gauffin, J. & Sundberg, J. (1989): "Spectral correlates of glottal voice source waveform characteristics," J.Speech & Hear.Res. 32, pp.
Hertegård, S. & Gauffin, J. (1991): "Insufficient vocal fold closure as studied by inverse filtering," pp. in (J. Gauffin & B. Hammarberg, eds.), Vocal Fold Physiology: Acoustic, Perceptual and Physiological Aspects of Voice Mechanism, Singular Publ. Group, Inc., San Diego, CA.
Hertegård, S., Gauffin, J., & Karlsson, I. (1992): "Physiological correlates of the inverse filtered flow waveform," J. Voice 6:3, pp.
Holmberg, E., Hillman, R., & Perkell, J. (1988): "Glottal airflow and transglottal airpressure measurements for male and female speakers in soft, normal and loud voice," J.Acoust.Soc.Am. 84, pp.
Isshiki, N. (1964): "Regulatory mechanism of voice intensity variation," J.Speech & Hear.Res. 7, pp.
Karlsson, I. (1992): Analysis and Synthesis of Different Voices with Emphasis on Female Speech, Diss., Dept. of Speech Communication and Music Acoustics, KTH, Stockholm.
Löfqvist, A. (1991): "Inverse filtering as a tool in voice research and therapy," Scand. J. Logopedics and Phoniatrics 16, pp.
Rothenberg, M. (1973): "A new inverse-filtering technique for deriving the glottal airflow waveform during voicing," J.Acoust.Soc.Am. 53, pp.
Rothenberg, M. (1977): "Measurement of airflow in speech," J.Speech & Hear.Res. 20, pp.
Rothenberg, M. (1987): "Cosi fan tutte and what it means or Nonlinear source-tract acoustic interaction in the soprano voice and some implications for the definition of vocal efficiency," pp. in (T.B. Baer, C. Sasaki, & K.S. Harris, eds.), Laryngeal Function in Phonation and Respiration (Proc. Vocal Fold Physiology Conf. 1985), Singular Publ. Group, Inc., San Diego, CA.
Schutte, H. (1980): "The efficiency of voice production," Groningen (issued by the author).


More information

Digitally controlled Active Noise Reduction with integrated Speech Communication

Digitally controlled Active Noise Reduction with integrated Speech Communication Digitally controlled Active Noise Reduction with integrated Speech Communication Herman J.M. Steeneken and Jan Verhave TNO Human Factors, Soesterberg, The Netherlands herman@steeneken.com ABSTRACT Active

More information

Automatic estimation of the lip radiation effect in glottal inverse filtering

Automatic estimation of the lip radiation effect in glottal inverse filtering INTERSPEECH 24 Automatic estimation of the lip radiation effect in glottal inverse filtering Manu Airaksinen, Tom Bäckström 2, Paavo Alku Department of Signal Processing and Acoustics, Aalto University,

More information

COMPARING ACOUSTIC GLOTTAL FEATURE EXTRACTION METHODS WITH SIMULTANEOUSLY RECORDED HIGH- SPEED VIDEO FEATURES FOR CLINICALLY OBTAINED DATA

COMPARING ACOUSTIC GLOTTAL FEATURE EXTRACTION METHODS WITH SIMULTANEOUSLY RECORDED HIGH- SPEED VIDEO FEATURES FOR CLINICALLY OBTAINED DATA University of Kentucky UKnowledge Theses and Dissertations--Electrical and Computer Engineering Electrical and Computer Engineering 2012 COMPARING ACOUSTIC GLOTTAL FEATURE EXTRACTION METHODS WITH SIMULTANEOUSLY

More information

Block diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals.

Block diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals. XIV. SPEECH COMMUNICATION Prof. M. Halle G. W. Hughes J. M. Heinz Prof. K. N. Stevens Jane B. Arnold C. I. Malme Dr. T. T. Sandel P. T. Brady F. Poza C. G. Bell O. Fujimura G. Rosen A. AUTOMATIC RESOLUTION

More information

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics

More information

3D Intermodulation Distortion Measurement AN 8

3D Intermodulation Distortion Measurement AN 8 3D Intermodulation Distortion Measurement AN 8 Application Note to the R&D SYSTEM The modulation of a high frequency tone f (voice tone and a low frequency tone f (bass tone is measured by using the 3D

More information

The effect of whisper and creak vocal mechanisms on vocal tract resonances

The effect of whisper and creak vocal mechanisms on vocal tract resonances The effect of whisper and creak vocal mechanisms on vocal tract resonances Yoni Swerdlin, John Smith, a and Joe Wolfe School of Physics, University of New South Wales, Sydney, New South Wales 5, Australia

More information

Quarterly Progress and Status Report. Observations on the transient components of the piano tone

Quarterly Progress and Status Report. Observations on the transient components of the piano tone Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Observations on the transient components of the piano tone Askenfelt, A. journal: STL-QPSR volume: 34 number: 4 year: 1993 pages:

More information

Source-filter Analysis of Consonants: Nasals and Laterals

Source-filter Analysis of Consonants: Nasals and Laterals L105/205 Phonetics Scarborough Handout 11 Nov. 3, 2005 reading: Johnson Ch. 9 (today); Pickett Ch. 5 (Tues.) Source-filter Analysis of Consonants: Nasals and Laterals 1. Both nasals and laterals have voicing

More information

Acoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13

Acoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13 Acoustic Phonetics How speech sounds are physically represented Chapters 12 and 13 1 Sound Energy Travels through a medium to reach the ear Compression waves 2 Information from Phonetics for Dummies. William

More information

Foundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants

Foundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants Foundations of Language Science and Technology Acoustic Phonetics 1: Resonances and formants Jan 19, 2015 Bernd Möbius FR 4.7, Phonetics Saarland University Speech waveforms and spectrograms A f t Formants

More information

Glottal source model selection for stationary singing-voice by low-band envelope matching

Glottal source model selection for stationary singing-voice by low-band envelope matching Glottal source model selection for stationary singing-voice by low-band envelope matching Fernando Villavicencio Yamaha Corporation, Corporate Research & Development Center, 3 Matsunokijima, Iwata, Shizuoka,

More information

Measurement of Weighted Harmonic Distortion HI-2

Measurement of Weighted Harmonic Distortion HI-2 Measurement of Weighted Harmonic Distortion HI-2 Application Note for the R&D and QC SYSTEM (Document Revision 1.2) AN 7 DESCRIPTION The weighted harmonic distortion HI-2 can be measured by using the DIS-Pro

More information

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday. L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are

More information

Quarterly Progress and Status Report. On certain irregularities of voiced-speech waveforms

Quarterly Progress and Status Report. On certain irregularities of voiced-speech waveforms Dept. for Speech, Music and Hearing Quarterly Progress and Status Report On certain irregularities of voiced-speech waveforms Dolansky, L. and Tjernlund, P. journal: STL-QPSR volume: 8 number: 2-3 year:

More information

Steady state phonation is never perfectly steady. Phonation is characterized

Steady state phonation is never perfectly steady. Phonation is characterized Perception of Vocal Tremor Jody Kreiman Brian Gabelman Bruce R. Gerratt The David Geffen School of Medicine at UCLA Los Angeles, CA Vocal tremors characterize many pathological voices, but acoustic-perceptual

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/76252

More information

RD75, RD50, RD40, RD28.1 Planar magnetic transducers with true line source characteristics

RD75, RD50, RD40, RD28.1 Planar magnetic transducers with true line source characteristics RD75, RD50, RD40, RD28.1 Planar magnetic transducers true line source characteristics The RD line of planar-magnetic ribbon drivers represents the ultimate thin film diaphragm technology. The RD drivers

More information

Source-filter analysis of fricatives

Source-filter analysis of fricatives 24.915/24.963 Linguistic Phonetics Source-filter analysis of fricatives Figure removed due to copyright restrictions. Readings: Johnson chapter 5 (speech perception) 24.963: Fujimura et al (1978) Noise

More information

The purpose of this study was to establish the relation

The purpose of this study was to establish the relation JSLHR Article Relation of Structural and Vibratory Kinematics of the Vocal Folds to Two Acoustic Measures of Breathy Voice Based on Computational Modeling Robin A. Samlan a and Brad H. Story a Purpose:

More information

AN ANALYSIS OF ITERATIVE ALGORITHM FOR ESTIMATION OF HARMONICS-TO-NOISE RATIO IN SPEECH

AN ANALYSIS OF ITERATIVE ALGORITHM FOR ESTIMATION OF HARMONICS-TO-NOISE RATIO IN SPEECH AN ANALYSIS OF ITERATIVE ALGORITHM FOR ESTIMATION OF HARMONICS-TO-NOISE RATIO IN SPEECH A. Stráník, R. Čmejla Department of Circuit Theory, Faculty of Electrical Engineering, CTU in Prague Abstract Acoustic

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,

More information

Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R.

Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R. Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R. Titze Director, National Center for Voice and Speech, University of Utah

More information

Perceptual evaluation of voice source models a)

Perceptual evaluation of voice source models a) Perceptual evaluation of voice source models a) Jody Kreiman, 1,b) Marc Garellek, 2 Gang Chen, 3,c) Abeer Alwan, 3 and Bruce R. Gerratt 1 1 Department of Head and Neck Surgery, University of California

More information

Quarterly Progress and Status Report. Frequency domain interpretation and derivation of glottal flow parameters

Quarterly Progress and Status Report. Frequency domain interpretation and derivation of glottal flow parameters Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Frequency domain interpretation and derivation of glottal flow parameters Fant, G. and Lin, Q. journal: STL-QPSR volume: 29 number:

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume, http://acousticalsociety.org/ ICA Montreal Montreal, Canada - June Musical Acoustics Session amu: Aeroacoustics of Wind Instruments and Human Voice II amu.

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Speech Synthesis; Pitch Detection and Vocoders

Speech Synthesis; Pitch Detection and Vocoders Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech

More information

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 6.1 AUDIBILITY OF COMPLEX

More information

Speech Perception Speech Analysis Project. Record 3 tokens of each of the 15 vowels of American English in bvd or hvd context.

Speech Perception Speech Analysis Project. Record 3 tokens of each of the 15 vowels of American English in bvd or hvd context. Speech Perception Map your vowel space. Record tokens of the 15 vowels of English. Using LPC and measurements on the waveform and spectrum, determine F0, F1, F2, F3, and F4 at 3 points in each token plus

More information

Quarterly Progress and Status Report. Formant amplitude measurements

Quarterly Progress and Status Report. Formant amplitude measurements Dept. for Speech, Music and Hearing Quarterly rogress and Status Report Formant amplitude measurements Fant, G. and Mártony, J. journal: STL-QSR volume: 4 number: 1 year: 1963 pages: 001-005 http://www.speech.kth.se/qpsr

More information

Technique for the Derivation of Wide Band Room Impulse Response

Technique for the Derivation of Wide Band Room Impulse Response Technique for the Derivation of Wide Band Room Impulse Response PACS Reference: 43.55 Behler, Gottfried K.; Müller, Swen Institute on Technical Acoustics, RWTH, Technical University of Aachen Templergraben

More information

An introduction to physics of Sound

An introduction to physics of Sound An introduction to physics of Sound Outlines Acoustics and psycho-acoustics Sound? Wave and waves types Cycle Basic parameters of sound wave period Amplitude Wavelength Frequency Outlines Phase Types of

More information

Vocal effort modification for singing synthesis

Vocal effort modification for singing synthesis INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Vocal effort modification for singing synthesis Olivier Perrotin, Christophe d Alessandro LIMSI, CNRS, Université Paris-Saclay, France olivier.perrotin@limsi.fr

More information

Measurement of Equivalent Input Distortion. Wolfgang Klippel. Klippel GmbH,Dresden, 01277, Germany, Fellow

Measurement of Equivalent Input Distortion. Wolfgang Klippel. Klippel GmbH,Dresden, 01277, Germany, Fellow Wolfgang Klippel Klippel GmbH,Dresden, 01277, Germany, Fellow ABSTRACT A new technique for measuring nonlinear distortion in transducers is presented which considers a priori information from transducer

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence

More information

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 Speech Processing Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 All course materials and handouts are the same for both versions. Differences: credits (20 for UG, 10 for PG);

More information

Advanced Methods for Glottal Wave Extraction

Advanced Methods for Glottal Wave Extraction Advanced Methods for Glottal Wave Extraction Jacqueline Walker and Peter Murphy Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland, jacqueline.walker@ul.ie, peter.murphy@ul.ie

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Power spectrum model of masking Assumptions: Only frequencies within the passband of the auditory filter contribute to masking. Detection is based

More information

Dynamic Generation of DC Displacement AN 13

Dynamic Generation of DC Displacement AN 13 Dynamic Generation of DC Displacement AN 13 Application Note to the R&D SYSTEM Nonlinearities inherent in the transducer produce a DC component in the voice coil displacement by rectifying the AC signal.

More information

Source-Filter Theory 1

Source-Filter Theory 1 Source-Filter Theory 1 Vocal tract as sound production device Sound production by the vocal tract can be understood by analogy to a wind or brass instrument. sound generation sound shaping (or filtering)

More information

Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz

Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 khz S. O Ternström Kungliga Tekniska Högskolan, Dept. of Speech, Music & Hearing, Lindstedtsvägen 24, SE-100

More information

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner. Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions

More information

Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction

Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction by Karl Ingram Nordstrom B.Eng., University of Victoria, 1995 M.A.Sc., University of Victoria, 2000 A Dissertation

More information

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Speech Synthesis Spring,1999 Lecture 23 N.MORGAN

More information

Linguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review)

Linguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review) Linguistics 401 LECTURE #2 BASIC ACOUSTIC CONCEPTS (A review) Unit of wave: CYCLE one complete wave (=one complete crest and trough) The number of cycles per second: FREQUENCY cycles per second (cps) =

More information

Speech Signal Analysis

Speech Signal Analysis Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for

More information

VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL

VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL Narsimh Kamath Vishweshwara Rao Preeti Rao NIT Karnataka EE Dept, IIT-Bombay EE Dept, IIT-Bombay narsimh@gmail.com vishu@ee.iitb.ac.in

More information

Causes for Amplitude Compression AN 12

Causes for Amplitude Compression AN 12 Causes for Amplitude AN 2 Application Note to the R&D SYSTEM Both thermal and nonlinear effects limit the amplitude of the fundamental component in the state variables and in the sound pressure output.

More information

A Multichannel Electroglottograph

A Multichannel Electroglottograph Publications of Dr. Martin Rothenberg: A Multichannel Electroglottograph Published in the Journal of Voice, Vol. 6., No. 1, pp. 36-43, 1992 Raven Press, Ltd., New York Summary: It is shown that a practical

More information

Quarterly Progress and Status Report. A look at violin bows

Quarterly Progress and Status Report. A look at violin bows Dept. for Speech, Music and Hearing Quarterly Progress and Status Report A look at violin bows Askenfelt, A. journal: STL-QPSR volume: 34 number: 2-3 year: 1993 pages: 041-048 http://www.speech.kth.se/qpsr

More information

Loudspeaker Distortion Measurement and Perception Part 2: Irregular distortion caused by defects

Loudspeaker Distortion Measurement and Perception Part 2: Irregular distortion caused by defects Loudspeaker Distortion Measurement and Perception Part 2: Irregular distortion caused by defects Wolfgang Klippel, Klippel GmbH, wklippel@klippel.de Robert Werner, Klippel GmbH, r.werner@klippel.de ABSTRACT

More information

An Implementation of the Klatt Speech Synthesiser*

An Implementation of the Klatt Speech Synthesiser* REVISTA DO DETUA, VOL. 2, Nº 1, SETEMBRO 1997 1 An Implementation of the Klatt Speech Synthesiser* Luis Miguel Teixeira de Jesus, Francisco Vaz, José Carlos Principe Resumo - Neste trabalho descreve-se

More information

Measurement of weighted harmonic distortion HI-2

Measurement of weighted harmonic distortion HI-2 Measurement of weighted harmonic distortion HI-2 Software of the KLIPPEL R&D and QC SYSTEM ( Document Revision 1.0) AN 7 DESCRIPTION The weighted harmonic distortion HI-2 is measured by using the DIS-Pro

More information

AP Homework (Q2) Does the sound intensity level obey the inverse-square law? Why?

AP Homework (Q2) Does the sound intensity level obey the inverse-square law? Why? AP Homework 11.1 Loudness & Intensity (Q1) Which has a more direct influence on the loudness of a sound wave: the displacement amplitude or the pressure amplitude? Explain your reasoning. (Q2) Does the

More information

Synthesis Algorithms and Validation

Synthesis Algorithms and Validation Chapter 5 Synthesis Algorithms and Validation An essential step in the study of pathological voices is re-synthesis; clear and immediate evidence of the success and accuracy of modeling efforts is provided

More information

Perceived Pitch of Synthesized Voice with Alternate Cycles

Perceived Pitch of Synthesized Voice with Alternate Cycles Journal of Voice Vol. 16, No. 4, pp. 443 459 2002 The Voice Foundation Perceived Pitch of Synthesized Voice with Alternate Cycles Xuejing Sun and Yi Xu Department of Communication Sciences and Disorders,

More information

Reverberation time and structure loss factor

Reverberation time and structure loss factor Reverberation time and structure loss factor CHRISTER HEED SD2165 Stockholm October 2008 Marcus Wallenberg Laboratoriet för Ljud- och Vibrationsforskning Reverberation time and structure loss factor Christer

More information

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) Proceedings of the 2 nd International Conference on Current Trends in Engineering and Management ICCTEM -214 ISSN

More information

A R T A - A P P L I C A T I O N N O T E

A R T A - A P P L I C A T I O N N O T E Introduction A R T A - A P P L I C A T I O N N O T E The AES-Recommendation 2-1984 (r2003) [01] defines the estimation of linear displacement of a loudspeaker as follows: Voice-coil peak displacement at

More information