Acoustic Tremor Measurement: Comparing Two Systems

Markus Brückl, Elvira Ibragimova, Silke Bögelein
Institute for Language and Communication, Technische Universität Berlin
10th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications

Outline
1. Introduction
2. Methods
   a) Acoustic synthesis of the test sounds
   b) The tremor measurement systems
   c) Statistical methods
3. Results
4. Discussion
5. Conclusion

Introduction: tremor as a symptom
Ascertaining tremor (and its severity) has high potential for the early diagnosis of several, mostly neurodegenerative diseases such as Parkinson's disease, Alzheimer's disease, and multiple sclerosis. Tremor is often defined as an involuntary cyclic movement (deviation) of the limbs, but

Introduction: vocal tremor
if it is caused by deficits of the central nervous system, speech production is most likely affected too, since producing speech involves the coordinated processing of about 1,400 motor commands per second. Each of the more than 80 muscles of the vocal apparatus may show tremor, so vocal tremor may have many sources.

Introduction: vocal tremor
But once the acoustic output is investigated, all of these organic modulation sources combine into only two types of tremor: subsonic, quasi-cyclic modulations of the frequency and of the amplitude.

Introduction: aim of the study
Despite the potential of (auditory or) acoustic vocal tremor assessment, its reliability, and with it its validity, still leave considerable room for improvement. Hence, the aim of this study is to compare two acoustic tremor measurement systems with respect to their criterion validity, defined here as how well they measure synthetically generated and thus known tremor.

Acoustic synthesis of the test sounds
- a completely synthetic sustained vowel with known tremor properties is created by formant synthesis
- the glottal source signal is modelled according to [1] with
  - 3 s duration
  - 200 Hz mean fundamental frequency
- and then filtered by a time-invariant, female /a/-shaped filter function
- this /a/ sound serves as the carrier for the frequency and amplitude modulations
[1] Rosenberg, A. E., Effect of glottal pulse shape on the quality of natural vowels, Journal of the Acoustical Society of America, 49, 583-590, 1971.

Acoustic synthesis: the modulation carrier
[Figure: the modulation carrier]

Acoustic synthesis of the test sounds
- modulations are done by re-synthesis according to the overlap-and-add method [2]
- both modulation types are modelled with a sinusoidal shape that is varied in frequency and amplitude, resulting in 4 synthesis arguments:
  - frequency tremor frequency (FTrF [Hz])
  - amplitude tremor frequency (ATrF [Hz])
  - (relative) frequency tremor intensity (FTrI [%])
  - (relative) amplitude tremor intensity (ATrI [%])
- each argument is varied in 4 equally spaced steps across its range of naturally occurring values
- both a frequency and an intensity decline are synthesized
- 4^6 = 4,096 test sounds
[2] Moulines, E., Charpentier, F., Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Communication, 9, 453-467, 1990.
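The count of 4,096 test sounds can be reproduced with a small Python sketch of a full factorial grid. It assumes that the six varied arguments are the four tremor arguments plus the two declines; the slide does not spell this out, and it does not give the value ranges, so every `(min, max)` pair below is a placeholder and `build_grid` is a hypothetical helper name.

```python
from itertools import product

import numpy as np

# Placeholder (min, max) ranges: the slides do not state the real values.
ARGUMENT_RANGES = {
    "FTrF_Hz": (2.0, 14.0),
    "ATrF_Hz": (2.0, 14.0),
    "FTrI_percent": (1.0, 10.0),
    "ATrI_percent": (5.0, 50.0),
    "frequency_decline": (0.0, 1.0),   # assumed to be the 5th varied argument
    "intensity_decline": (0.0, 1.0),   # assumed to be the 6th varied argument
}


def build_grid(ranges, steps=4):
    """Return every combination of `steps` equally spaced values per argument."""
    names = list(ranges)
    axes = [np.linspace(lo, hi, steps) for lo, hi in ranges.values()]
    return [dict(zip(names, combo)) for combo in product(*axes)]


grid = build_grid(ARGUMENT_RANGES)
print(len(grid))  # 4**6 = 4096
```

Each dictionary in `grid` would correspond to one synthesized test sound.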

Acoustic synthesis: frequency modulation
$F_{0M}(t) = F_{0,s} + \mathrm{FTrI} \cdot \bar{F}_0 \cdot \sin\left(\mathrm{FTrF} \cdot 2\pi \cdot t \cdot dec_f(t)\right)$

Acoustic synthesis: amplitude modulation
$AM(t) = A_s + \mathrm{ATrI} \cdot \bar{A} \cdot \sin\left(\mathrm{ATrF} \cdot 2\pi \cdot t \cdot dec_a(t)\right)$
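A minimal Python sketch of the two sinusoidal modulations, generating only the target F0 and amplitude contours rather than audio. It assumes that the decline terms are left out (treated as 1) and that the intensities are given as fractions rather than percent. The function name, the contour sampling rate, and the default tremor values are illustrative and not taken from the slides; only the 3 s duration and 200 Hz mean F0 come from the synthesis description.

```python
import numpy as np


def modulated_contours(duration_s=3.0, rate_hz=1000.0,
                       f0_mean=200.0, a_mean=1.0,
                       ftrf=5.0, ftri=0.05,
                       atrf=4.0, atri=0.10):
    """Return time axis, F0 contour and amplitude contour with sinusoidal tremor.

    ftri and atri are relative intensities (0.05 means 5 %).
    Decline terms are omitted in this sketch (assumption).
    """
    n = int(round(duration_s * rate_hz))
    t = np.arange(n) / rate_hz
    f0 = f0_mean + ftri * f0_mean * np.sin(2.0 * np.pi * ftrf * t)
    amp = a_mean + atri * a_mean * np.sin(2.0 * np.pi * atrf * t)
    return t, f0, amp


t, f0, amp = modulated_contours()
print(f0.min(), f0.max())
```

With these defaults the F0 contour swings by 5 % around 200 Hz, so a measurement system with perfect validity would report a frequency tremor intensity of 5 % and a frequency tremor frequency of 5 Hz.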

Acoustic synthesis: both modulations
[Figure: both modulations]

Measurement systems: MDVP's measures
MDVP [3] extracts 4 parameters of vocal tremor:
- 2 measures of frequency tremor
  - frequency of the strongest low-frequency modulation of the fundamental frequency (Fftr [Hz])
  - mean magnitude of the strongest low-frequency modulation of the fundamental frequency (FTRI [%])
- 2 measures of amplitude tremor
  - frequency of the strongest low-frequency modulation of the amplitude (Fatr [Hz])
  - mean magnitude of the strongest low-frequency modulation of the amplitude (ATRI [%])
[3] Kay Elemetrics Corp. / PENTAX Medical, Multi-Dimensional Voice Program (MDVP), Model 5105 (Version 2.6.2) [Computer program], 1993/2003.

Measurement systems: TREMOR.PRAAT
- TREMOR.PRAAT 3.01 extracts 14 parameters of vocal tremor
- 4 of these 14 meet the above definitions:
  - 2 measures of frequency tremor
    - frequency tremor frequency (FTrF)
    - frequency tremor intensity index (FTrI)
  - 2 measures of amplitude tremor
    - amplitude tremor frequency (ATrF)
    - amplitude tremor intensity index (ATrI)
- TREMOR.PRAAT is open-source software, implemented as a Praat [4] script
[4] P. Boersma, D. Weenink, Praat: doing phonetics by computer (Version 6.0.29) [Computer program], University of Amsterdam.

Acoustic measurement of vocal tremor with tremor.praat
- tremor.praat's algorithm is based on autocorrelation of the F0 contour and of the amplitude contour, and it corrects for the declination that is naturally found in both contours
- it is implemented in the scripting language of the speech-processing program PRAAT
- tremor.praat (version 3.01) can be downloaded from http://brykl.de/tremor3.01.zip

Methods: TREMOR.PRAAT's tremor measures
[Figure: TREMOR.PRAAT's tremor measures]

Methods: extracting the tremor frequencies
- autocorrelate the (windowed) signal to estimate the F0 contour
- use PRAAT's To Amplitude function to extract amplitudes per period
- resample these amplitudes, which vary in time and duration, at a constant rate to derive an amplitude contour
- remove the linear declination of both contours by subtracting the linear regression estimates
- autocorrelate the contours
- FTrF is the frequency of the strongest low-frequency modulation of F0
- ATrF is the frequency of the strongest low-frequency modulation of the amplitude (A)
- strength is determined by the contours' autocorrelation coefficients
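The last steps (removing the linear declination, autocorrelating the contour, picking the strongest low-frequency modulation) can be sketched in Python as follows. This is a simplified stand-in, not tremor.praat's implementation: it works on an already extracted, evenly sampled contour, it picks the highest autocorrelation peak directly, and the search range `f_min` to `f_max` is a hypothetical choice.

```python
import numpy as np


def tremor_frequency(contour, rate_hz, f_min=1.5, f_max=15.0):
    """Estimate the dominant modulation frequency (Hz) of a sampled contour."""
    x = np.asarray(contour, dtype=float)
    t = np.arange(x.size) / rate_hz

    # Remove the linear declination by subtracting the regression line.
    slope, intercept = np.polyfit(t, x, 1)
    x = x - (slope * t + intercept)

    # Autocorrelation for non-negative lags, normalized to 1 at lag 0.
    acf = np.correlate(x, x, mode="full")[x.size - 1:]
    acf = acf / acf[0]

    # Search only lags whose frequencies lie inside [f_min, f_max].
    lag_min = int(np.ceil(rate_hz / f_max))
    lag_max = min(int(np.floor(rate_hz / f_min)), x.size - 1)
    best_lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return rate_hz / best_lag


# Example: a 200 Hz F0 contour with a 5 Hz tremor and a falling trend,
# sampled at 100 contour points per second for 3 s.
rate = 100.0
time = np.arange(300) / rate
f0_contour = 200.0 + 10.0 * np.sin(2.0 * np.pi * 5.0 * time) - 2.0 * time
estimate = tremor_frequency(f0_contour, rate)
print(estimate)
```

The frequency resolution of such an estimate is limited by the contour's sampling rate, because only whole-sample lags are tested.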

Methods: determining the tremor intensity indices
- normalize/relativize the (de-declined) contours by
  $\mathrm{rel.}\,F_0(t) = \frac{F_0(t) - \bar{F}_0}{\bar{F}_0}$ and $\mathrm{rel.}\,A(t) = \frac{A(t) - \bar{A}}{\bar{A}}$
- once the tremor frequencies are known, the time marks of the contours' extrema are found with PRAAT's built-in function To PointProcess (peaks)
- intensity indices are then determined by
  $(F,A)\mathrm{TrI} = \frac{1}{2}\left(\frac{\sum_{i=1}^{m} |max_i|}{m} + \frac{\sum_{j=1}^{n} |min_j|}{n}\right)$
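A minimal Python sketch of such an intensity index, under stated assumptions: the contour is evenly sampled and already de-declined, extrema are found as plain local maxima and minima instead of with PRAAT's To PointProcess (peaks) at the known tremor frequency, and the magnitudes of the extrema are averaged so that maxima and minima do not cancel. The function name is hypothetical.

```python
import numpy as np


def tremor_intensity_index(contour):
    """Mean magnitude of the relative contour's maxima and minima (as a fraction)."""
    x = np.asarray(contour, dtype=float)
    rel = (x - x.mean()) / x.mean()          # relative deviation from the mean

    inner = rel[1:-1]
    is_max = (inner > rel[:-2]) & (inner > rel[2:])   # strict local maxima
    is_min = (inner < rel[:-2]) & (inner < rel[2:])   # strict local minima
    maxima, minima = inner[is_max], inner[is_min]
    if maxima.size == 0 or minima.size == 0:
        return 0.0                            # no cyclic modulation found

    return 0.5 * (np.abs(maxima).mean() + np.abs(minima).mean())


# Example: a 200 Hz contour with a 5 Hz tremor of 5 % relative intensity.
time = np.arange(3000) / 1000.0
contour = 200.0 * (1.0 + 0.05 * np.sin(2.0 * np.pi * 5.0 * time))
index = tremor_intensity_index(contour)
print(round(100.0 * index, 2), "%")
```

On noisy real contours, plain local extrema would pick up every small ripple, which is one reason to restrict the extrema to those matching the known tremor frequency.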

Comparison statistic: the regressions' determination coefficients
- 8 simple linear regressions are computed to assess the dependence of the 8 measured values on the 4 synthesized values
- their determination coefficients (R²) denote the proportion of variance in the measured values that can be explained by the variance of the set values
- they may serve as validity coefficients of the 2 measurement instruments
- 99.99% confidence intervals (CIs) around these coefficients are calculated to indicate whether the populations of corresponding coefficients differ from one another
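For illustration, the determination coefficient of one such simple linear regression can be computed as in the Python sketch below. The data are made up, the function name is hypothetical, and the confidence intervals are not computed here because the slides do not say how they were derived.

```python
import numpy as np


def r_squared(set_values, measured_values):
    """R² of the simple linear regression of measured values on set values."""
    x = np.asarray(set_values, dtype=float)
    y = np.asarray(measured_values, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)
    residuals = y - (slope * x + intercept)
    ss_res = np.sum(residuals ** 2)           # unexplained variation
    ss_tot = np.sum((y - y.mean()) ** 2)      # total variation
    return 1.0 - ss_res / ss_tot


# Made-up example: synthesized tremor frequencies vs. measured ones.
set_hz = [2.0, 6.0, 10.0, 14.0]
measured_hz = [2.1, 5.8, 10.3, 13.9]
print(r_squared(set_hz, measured_hz))
```

An R² close to 1 means that the measured values follow the set values almost perfectly, which is the sense in which R² is used as a validity coefficient.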

Results
- MDVP fails to extract amplitude tremor measures in 513 cases and frequency tremor measures in 256 cases
- TREMOR.PRAAT succeeds in extracting all measures from all sounds
- TREMOR.PRAAT's measurement errors are highly significantly smaller, i.e. its measures are highly significantly more valid than those of the MDVP

Results: R² values and their CIs
[Figure: R² values and their CIs]

Results: scatterplots
[Figure: scatterplots]

Discussion: errors of TREMOR.PRAAT
- the tremor intensity measures (FTrI and ATrI) show greater underestimation at greater synthetically set values
- if ATrF is extracted incorrectly, it is exactly one or two octaves too low; this can be avoided by raising the tremor octave cost
- since both error types are due to the (mandatory) quantization of the tremor contours, all errors in TREMOR.PRAAT's measurements may be reduced by shortening the analysis time step

Discussion: errors of the MDVP
- errors in the MDVP's extractions seem to be far less systematic
- their sources must remain unrevealed, since the MDVP's algorithm is proprietary and thus unknown

Conclusion
- TREMOR.PRAAT is still under development, but it is far more valid in measuring vocal tremor than the standard program MDVP
- use TREMOR.PRAAT for acoustic tremor measurement
- re-measure results formerly obtained with the MDVP

Questions?
- send an email to: markus.brueckl@tu-berlin.de
- download tremor.praat: http://brykl.de/tremor3.01.zip