WaveSurfer. Basic acoustics part 2: Spectrograms, resonance, vowels. See Rogers, chapters 7 and 8.


WaveSurfer
- Allows us to see:
- Waveform
- Spectrogram (color or gray)
- Spectral section: short-time spectrum = spectrum of a brief stretch of speech
- Demonstration: spectrograms of a whistle and of speech

Spectrogram
- Represents the spectrum varying over time
- X-axis (horizontal): time (like a waveform)
- Y-axis (vertical): frequency (like a spectrum)
- Third dimension: pseudo-color or gray scale representing amplitude

[Figure: narrow band spectrogram of [aaa] with pitch change]

[Figure: narrow band spectrogram of [AAA] on varying pitches, also shown with expanded frequency scale. Harmonics appear as narrow stripes running left to right. Spectral sections at two different times have the 10th harmonic highlighted at 930 Hz and 1410 Hz, so F0 is about 93 Hz and 141 Hz at the arrows.]

Narrow band spectrogram
- Looks at a fairly long stretch of time, 40 ms or so
- Sees several glottal pulses at once
- Each glottal pulse is about 10 ms long or less, so several are blurred together
- Varying harmonic structure is clear

Measuring F0 from narrow band spectrogram
- Measure F0 from the k-th harmonic: if Hk = x Hz, then F0 = x/k Hz
- The 10th harmonic is convenient
- Expanding the frequency scale makes this easier
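The x/k rule above is simple enough to check by hand, but a short sketch makes the arithmetic explicit. The function name `f0_from_harmonic` is made up for illustration; the two readings are the ones marked on the slide. One reason a high harmonic is convenient: any error in reading its frequency off the scale is also divided by k.

```python
def f0_from_harmonic(harmonic_hz, k=10):
    """Estimate F0 (Hz) from the measured frequency of the k-th harmonic."""
    if k < 1:
        raise ValueError("harmonic number k must be at least 1")
    return harmonic_hz / k


# The two readings on the slide: 10th harmonic at 930 Hz and at 1410 Hz.
for reading_hz in (930.0, 1410.0):
    print(reading_hz, "Hz at H10 gives F0 =", f0_from_harmonic(reading_hz), "Hz")

# A reading that is off by 10 Hz at the 10th harmonic shifts F0 by only 1 Hz.
print(f0_from_harmonic(940.0) - f0_from_harmonic(930.0))
```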

[Figure: wide band spectrogram of [AAA] on varying pitches, with formants F1 to F4 labeled]

Wide band spectrogram: changing pitch of [AAA]
- Looks at a fairly short stretch of time, 2 to 3 ms only
- Sees less than one full glottal period
- Each glottal pulse is about 10 ms long
- Varying harmonic structure is no longer clear
- Dark bars show the approximate location of formant peaks
- Formants don't change much with pitch changes
- They change a lot with VOWEL changes

[Figure: wide band spectrogram of [AAiiAA] on the SAME pitch, with formants F1 to F4 labeled]

Wide band spectrogram: [AAiiAA], no pitch change
- Looks at a fairly short stretch of time, 2 to 3 ms only
- Sees less than one full glottal period
- Each glottal pulse is about 10 ms long
- Harmonic structure is no longer clear
- Dark bars show the approximate location of formant peaks
- Formants change a lot with VOWEL changes
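Why does a 40 ms window show harmonics while a 2 to 3 ms window does not? A rough rule of thumb (not stated on the slides, and only approximate because the exact factor depends on the window shape) is that the analysis bandwidth is about 1 divided by the window duration. The sketch below applies that rule to the window lengths quoted above; the function names are illustrative only.

```python
def approx_bandwidth_hz(window_ms):
    """Rough analysis bandwidth (Hz) for a window of this length: about 1 / duration."""
    return 1000.0 / window_ms


def resolves_harmonics(window_ms, f0_hz):
    """True if harmonics spaced f0_hz apart are wider apart than the bandwidth."""
    return f0_hz > approx_bandwidth_hz(window_ms)


f0_hz = 100.0                      # glottal period of 10 ms, as on the slides
period_ms = 1000.0 / f0_hz
for label, window_ms in (("narrow band", 40.0), ("wide band", 3.0)):
    print(label,
          "| bandwidth about", round(approx_bandwidth_hz(window_ms)), "Hz",
          "| glottal periods in window:", window_ms / period_ms,
          "| harmonics resolved:", resolves_harmonics(window_ms, f0_hz))
```

With a 100 Hz voice, the 40 ms window spans four glottal periods and its bandwidth is well below the harmonic spacing, so individual harmonics show up. The 3 ms window covers less than one period and smears neighboring harmonics together, which leaves only the broad formant bars.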

Wide and narrow band spectrograms
- Narrow band spectrogram makes harmonic structure clear
- Harmonic structure is associated with the glottal source
- Wide band spectrogram makes formant structure clearer (dark formant bands that change with vowel, not with pitch)
- Formants are associated with the filter properties of the vocal tract above the larynx

Principle of source + filter: Glottal source
[Figure: glottal source waveform and spectrum. Labels: instant of glottal closure; period = 10 ms; f0 = 100 Hz; 10th harmonic at 1000 Hz; harmonics (peaks).]

Source + Filter theory of speech
- Consider vowel-like sounds first
- Source = voicing in the glottis
- Filter = tube-resonator system of the SLVT
- SLVT = supra-laryngeal vocal tract

Principle of source + filter: Vowel
- Source + Filter = Vowel

Source + Filter = Vowel [i]
[Figure: vowel waveform and spectrum, with resonance (formant) peaks F1, F2, F3 labeled]

Compare last two slides
- Waveform and spectrum of the glottal source are relatively simple compared to the vowel
- SLVT filter imparts extra structure on the vowel waveform
- Oscillation between glottal pulses
- Enhances (boosts) certain frequency regions

Source + Filter = Vowel [Q]
[Figure: vowel waveform and spectrum, with resonance (formant) peaks F1, F2, F3 labeled]

Artificial glottal source
- Transformer robot voice
- Replace the glottal source with a simple buzz
- Use my SLVT as the filter
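The source + filter idea can be imitated in a few lines: take a buzz (one impulse per glottal period) and pass it through a chain of resonators. This is only a sketch under stated assumptions, not the method used in the lecture. The two-pole resonator is a standard digital filter form; the formant values (300, 2300, 3000 Hz), the 100 Hz bandwidths, and the 10 kHz sampling rate are illustrative guesses for an [i]-like vowel, and all function names are made up here.

```python
import cmath
import math


def impulse_train(f0_hz, duration_s, fs):
    """Simple buzz source: one unit impulse per glottal period."""
    n_samples = int(round(duration_s * fs))
    period = int(round(fs / f0_hz))
    return [1.0 if n % period == 0 else 0.0 for n in range(n_samples)]


def resonator(signal, center_hz, bandwidth_hz, fs):
    """Two-pole resonator: y[n] = a*x[n] + b*y[n-1] + c*y[n-2]."""
    c = -math.exp(-2.0 * math.pi * bandwidth_hz / fs)
    b = (2.0 * math.exp(-math.pi * bandwidth_hz / fs)
         * math.cos(2.0 * math.pi * center_hz / fs))
    a = 1.0 - b - c                      # gain of 1 at 0 Hz
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def magnitude_at(signal, freq_hz, fs):
    """Spectrum magnitude at one frequency, by direct summation."""
    w = -2j * math.pi * freq_hz / fs
    return abs(sum(x * cmath.exp(w * n) for n, x in enumerate(signal)))


FS = 10000                               # assumed sampling rate (Hz)
source = impulse_train(100.0, 0.2, FS)   # f0 = 100 Hz, period = 10 ms
vowel = source
for formant_hz in (300.0, 2300.0, 3000.0):   # assumed [i]-like formants
    vowel = resonator(vowel, formant_hz, 100.0, FS)

for f in (300.0, 1000.0, 1500.0, 2300.0):
    print(f, "Hz  source:", round(magnitude_at(source, f, FS), 2),
          " vowel:", round(magnitude_at(vowel, f, FS), 2))
```

The source has the same level at every harmonic, while the filtered signal is boosted near the assumed formant frequencies and weakened between them, which is the "extra structure" the filter imparts.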

Robot vowels stage 1: Spectrum of the robot source
- Robot source: lots of harmonics across the frequencies
- Ideally each harmonic would be near the same amplitude
- Note we see little pointed pickets in the spectral section, not narrow lines
- Real time-limited spectra look like this
- As we increase the time for a steady signal we get more line-like harmonic peaks

What about the filter?
- We've seen the robot source that can be filtered by a real vocal tract
- Can we make a robot filter?
- Yes: plastic tubes
- Slap them with the palm of the hand and get an impulse response of the filter

Robot vowels stage 2
- WaveSurfer analysis: slapped tubes of different lengths (ThreeTappedTubes)
- WaveSurfer: rapidly tapped mid-size tube (TappedTubeEmpty.wav)
- WaveSurfer: tapped tube with partial block (TappedTubeBlock.wav)
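The remark about "pointed pickets" versus line-like peaks can be checked numerically. The sketch below builds an idealized buzz (a pure impulse train, which is an assumption; the real robot source is not specified) and compares the level 10 Hz away from a harmonic with the level at the harmonic itself, for a short and a long stretch of signal. Names and parameter values are illustrative.

```python
import cmath
import math


def buzz(f0_hz, n_pulses, fs):
    """Idealized robot source: n_pulses unit impulses, one per period."""
    period = int(round(fs / f0_hz))
    return [1.0 if n % period == 0 else 0.0 for n in range(n_pulses * period)]


def magnitude_at(signal, freq_hz, fs):
    """Spectrum magnitude at one frequency, by direct summation."""
    w = -2j * math.pi * freq_hz / fs
    return abs(sum(x * cmath.exp(w * n) for n, x in enumerate(signal)))


def relative_level_db(signal, peak_hz, offset_hz, fs):
    """Level at peak_hz + offset_hz relative to the level at peak_hz, in dB."""
    peak = magnitude_at(signal, peak_hz, fs)
    off = magnitude_at(signal, peak_hz + offset_hz, fs)
    return 20.0 * math.log10(off / peak)


FS = 10000
short_buzz = buzz(100.0, 3, FS)     # 30 ms of signal
long_buzz = buzz(100.0, 25, FS)     # 250 ms of signal

# Every harmonic of the ideal buzz has the same magnitude.
print([round(magnitude_at(long_buzz, h, FS), 3) for h in (100.0, 500.0, 2000.0)])

# How far the level has dropped 10 Hz away from the 5th harmonic (500 Hz).
print("short:", round(relative_level_db(short_buzz, 500.0, 10.0, FS), 1), "dB")
print("long: ", round(relative_level_db(long_buzz, 500.0, 10.0, FS), 1), "dB")
```

For the short stretch the level has barely dropped 10 Hz from the harmonic (a broad picket); for the long stretch it has dropped much further (a peak closer to a line).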

Robot vowels stage 3
- Add the robot source to the tube
- Move the robot tongue to change the shape
- WaveSurfer: robot /aaaiiiaaa/

Waveform
- Waveforms: time x amplitude
- Good for measuring durations of some events (especially when displayed with a spectrogram)
- Period of a repetitive waveform (e.g. glottal pulse duration of voiced speech)
- VOT

Review: Displays
- Waveform: time x amplitude
- Spectrum or spectral section: frequency by amplitude (dB)
- Spectrogram: time (horizontal) by frequency (vertical) by amplitude (color or darkness)

Spectral section
- Spectral section (spectrum): frequency by amplitude in a brief interval of time (a section of a longer signal)
- Narrow band spectra look at moderately long chunks of speech (30-40 ms) and show harmonics for voiced speech
- Broad band spectra look at shorter chunks of speech (less than a glottal period) and can show formant structure
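A spectral section, as described above, is the spectrum of one brief windowed chunk, with amplitude expressed in dB. Here is a minimal sketch of that computation in plain Python. The Hann window, the 8 kHz sampling rate, and the 500 Hz test tone are assumptions chosen for illustration; WaveSurfer's own settings may differ, and the dB values here are relative rather than calibrated.

```python
import cmath
import math


def spectral_section(samples, fs):
    """Return (frequencies in Hz, levels in dB) for one Hann-windowed chunk."""
    n = len(samples)
    windowed = [s * 0.5 * (1.0 - math.cos(2.0 * math.pi * i / (n - 1)))
                for i, s in enumerate(samples)]
    freqs, levels = [], []
    for k in range(n // 2 + 1):          # bins from 0 Hz up to fs / 2
        w = -2j * math.pi * k / n
        mag = abs(sum(x * cmath.exp(w * i) for i, x in enumerate(windowed)))
        freqs.append(k * fs / n)
        levels.append(20.0 * math.log10(max(mag, 1e-12)))  # floor avoids log(0)
    return freqs, levels


FS = 8000
# A 40 ms chunk (320 samples), i.e. a narrow band section, of a 500 Hz tone.
chunk = [math.sin(2.0 * math.pi * 500.0 * i / FS) for i in range(320)]
freqs, levels = spectral_section(chunk, FS)
peak_hz = freqs[levels.index(max(levels))]
print("frequency spacing:", freqs[1], "Hz; strongest component at", peak_hz, "Hz")
```

A spectrogram repeats this for chunks centered on successive time points and shows each level as darkness or color.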

Narrow band spectrogram
- A narrow band spectrogram is a way to display many narrow band spectral sections at once
- At each point in time, look at a moderately long chunk of speech (30-40 ms) centered on that time point (windowed sections)
- Represent amplitude at each frequency for that center time by darkness or color coding
- Shows harmonics as horizontal bands that bend as fundamental frequency changes
- Formant patterns are visible only indirectly, by which harmonics are strong

Measuring F0 from wide band spectrogram
- Find the duration of one period: the distance between vertical striations (stripes)
- Proceed as with a waveform
- Ballpark method for average F0: count the number of striations in 100 ms and multiply by 10

Measuring F0 from waveforms
- Find the duration of one period
- Convert period duration to frequency
- 1 period in 0.005 seconds (= 5 ms) means how many periods in 1 second? Answer: 1/0.005 = 200 Hz
- Alternate method: count several periods (k)
- k periods in x sec means a frequency of k/x Hz
- That is, k/x periods occur in one second

Measuring F0 from narrow band spectrogram or spectral section
- Count up to the 10th harmonic
- Measure its frequency against the frequency scale
- Divide by 10
- Can be very accurate
- Can use harmonic number k (instead of 10) if that's easier to find, then divide by k
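The time-domain recipes above all reduce to "periods per second". The sketch below writes them out as three small functions; the names are made up for illustration, and the 14-striation example is an invented reading, not a measurement from the slides.

```python
def f0_from_period(period_s):
    """One period lasts period_s seconds, so F0 = 1 / period."""
    return 1.0 / period_s


def f0_from_count(k_periods, span_s):
    """k periods counted over span_s seconds give F0 = k / x."""
    return k_periods / span_s


def f0_from_striations(count_in_100_ms):
    """Ballpark average F0: striations counted in 100 ms, times 10."""
    return count_in_100_ms * 10


print(f0_from_period(0.005))        # the 5 ms example from the slide
print(f0_from_count(10, 0.05))      # 10 periods in 50 ms gives the same F0
print(f0_from_striations(14))       # hypothetical reading: 14 stripes in 100 ms
```

Counting several periods spreads any error in placing the start and end marks over k periods, which is why the alternate method tends to be steadier than timing a single period.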

Measuring Formants
- Use a wide band spectrogram
- Try to identify wide bars that move a bit up and down
- Measure the center frequency of the darkest or reddest part
- Note: I will provide formant tracks from WaveSurfer, which will put thin lines through the formants