Acoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13

Similar documents
SPEECH AND SPECTRAL ANALYSIS

Source-filter Analysis of Consonants: Nasals and Laterals

INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006

WaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels. Spectrogram. See Rogers chapter 7 8

CS 188: Artificial Intelligence Spring Speech in an Hour

COMP 546, Winter 2017 lecture 20 - sound 2

Speech Perception Speech Analysis Project. Record 3 tokens of each of the 15 vowels of American English in bvd or hvd context.

Resonance and resonators

Foundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels

Source-filter analysis of fricatives

About waves. Sounds of English. Different types of waves. Ever done the wave?? Why do we care? Tuning forks and pendulums

Acoustic Phonetics. Chapter 8

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065

Linguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review)

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping

Linguistic Phonetics. Spectral Analysis

The source-filter model of speech production"

Statistical NLP Spring Unsupervised Tagging?

Source-Filter Theory 1

Review: Frequency Response Graph. Introduction to Speech and Science. Review: Vowels. Response Graph. Review: Acoustic tube models

An introduction to physics of Sound

Chapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 14 Timbre / Tone quality II

EE482: Digital Signal Processing Applications

Subtractive Synthesis & Formant Synthesis

Complex Sounds. Reading: Yost Ch. 4

Linguistic Phonetics. The acoustics of vowels

Psychology of Language

Speech Synthesis; Pitch Detection and Vocoders

Digital Signal Processing

Sound. Production of Sound

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

From Ladefoged EAP, p. 11

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

DIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS

Speech Signal Analysis

Lecture Presentation Chapter 16 Superposition and Standing Waves

Physics I Notes: Chapter 13 Sound

AP Homework (Q2) Does the sound intensity level obey the inverse-square law? Why?

Acoustics and Fourier Transform Physics Advanced Physics Lab - Summer 2018 Don Heiman, Northeastern University, 1/12/2018

Sound, acoustics Slides based on: Rossing, The science of sound, 1990.

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley

Signals, systems, acoustics and the ear. Week 3. Frequency characterisations of systems & signals

Definition of Sound. Sound. Vibration. Period - Frequency. Waveform. Parameters. SPA Lundeen

Acoustics, signals & systems for audiology. Week 3. Frequency characterisations of systems & signals

Chapter 3 The Physics of Sound

Principles of Musical Acoustics

Digitized signals. Notes on the perils of low sample resolution and inappropriate sampling rates.

Music. Sound Part II

Signals & Systems for Speech & Hearing. Week 6. Practical spectral analysis. Bandpass filters & filterbanks. Try this out on an old friend

SPEECH ANALYSIS* Prof. M. Halle G. W. Hughes A. R. Adolph

Acoustics, signals & systems for audiology. Week 4. Signals through Systems

Chapter 12. Preview. Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect. Section 1 Sound Waves

Recap the waveform. Complex waves (dạnh sóng phức tạp) and spectra. Recap the waveform

SGN Audio and Speech Processing

Warm-Up. Think of three examples of waves. What do waves have in common? What, if anything, do waves carry from one place to another?

6.551j/HST.714j Acoustics of Speech and Hearing: Exam 2

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

Chapter 15 Supplement HPS. Harmonic Motion

A mechanical wave is a disturbance which propagates through a medium with little or no net displacement of the particles of the medium.

Converting Speaking Voice into Singing Voice

Math and Music: Understanding Pitch

No Brain Too Small PHYSICS

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing

Physics 1240: Sound and Music Scott Parker 1/31/06. Today: Sound sources, resonance, nature of sound waves (begin wave motion)

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Standing Waves, Natural Frequency, & Resonance. Physics 5 th /6 th 6wks

Chapter 18. Superposition and Standing Waves

Speech Recognition. Mitch Marcus CIS 421/521 Artificial Intelligence

Copyright 2009 Pearson Education, Inc.

Block diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals.

Mask-Based Nasometry A New Method for the Measurement of Nasalance

Preview. Sound Section 1. Section 1 Sound Waves. Section 2 Sound Intensity and Resonance. Section 3 Harmonics

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

Fundamentals of Music Technology

HCS 7367 Speech Perception

Speech Synthesis using Mel-Cepstral Coefficient Feature

Sound is the human ear s perceived effect of pressure changes in the ambient air. Sound can be modeled as a function of time.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Resonant Self-Destruction

Music: Sound that follows a regular pattern; a mixture of frequencies which have a clear mathematical relationship between them.

JOURNAL OF OBJECT TECHNOLOGY

Between physics and perception signal models for high level audio processing. Axel Röbel. Analysis / synthesis team, IRCAM. DAFx 2010 iem Graz

Quarterly Progress and Status Report. A note on the vocal tract wall impedance

L19: Prosodic modification of speech

Section 1 Sound Waves. Chapter 12. Sound Waves. Copyright by Holt, Rinehart and Winston. All rights reserved.

Lecture 7: Superposition and Fourier Theorem

MAKE SOMETHING THAT TALKS?

Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification. Daryush Mehta

Dept. of Computer Science, University of Copenhagen Universitetsparken 1, DK-2100 Copenhagen Ø, Denmark

Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R.

Complete the sound and music introductory lesson and the Musical Instruments Part I lesson. Gather supplies (see materials list).

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

Physics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound

PHYSICS 102N Spring Week 6 Oscillations, Waves, Sound and Music

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

What is Sound? Part II

Epoch Extraction From Speech Signals K. Sri Rama Murty and B. Yegnanarayana, Senior Member, IEEE

Transcription:

Acoustic Phonetics How speech sounds are physically represented Chapters 12 and 13 1

Sound Energy Travels through a medium to reach the ear Compression waves 2 Information from Phonetics for Dummies. William F. Katz. Making Waves: An Overview of Sound. 2013.

Periodic waves Simple (sine; sinusoid) Complex (actually a composite of many overlapping simple waves) 3

Sinusoid waves Simple periodic motion from perfectly oscillating bodies Found in in nature (e.g., swinging pendulum, sidewinder snake trail, airflow when you whistle) Sinusoids sound cold (e.g. flute) 4

Let s crank one out! Pg. 175 5

Frequency - Tones 6

Simple waves - key properties Frequency = cycles per sec (cps) = Hz Amplitude measured in decibels (db), 1/10 of a Bell (Note: db is on a log scale, increases by powers of 10) 7

Phase A measure of the position along the sinusoidal vibration These two waveforms are slightly out of phase (approx. 90 0 difference) Used in sound localization 8

Damping Loss of vibration due to friction 9

Quickie Quiz! Q: What is the frequency of this wave? HINT: It repeats twice in 10 msec 10

Answer: 200 Hz! (2 cycles in.01 sec = 200 cps) 11

Physical vs. perceptual PHYSICAL Fundamental frequency (F 0 ) PERCEPTUAL Pitch Amplitude/ Intensity Loudness Duration Length 12

Image from Fetal Hydrocephalus. The Amazing Owen. Great News from the Audiologist. March 23, 2009. Accessed June 13, 2016. http://fetalhydrocephalus.com/hydro/siblog/default.aspx?id=35&t=great-news-from-the-audiologist 13

Complex periodic waves Results from imperfectly oscillating bodies Demonstrate simple harmonic motion Examples - a vibrating string, the vocal folds 14

Frequency Tones/ Adding 15

Another example.. "http://www.askamathematician.com/wpcontent/uploads/2012/09/indykkatabipricehassanieh.jpg"> 16

Waveforms - Male Vowels 17

Waveforms - Female Vowels 18

Complex periodic waves cont d Consists of a fundamental (F 0 ) and harmonics Harmonics ( overtones ) consist of energy at integer multiples of the fundamental (x2, x3, x4 etc ) 19

Harmonic series Imagine you pluck a guitar string and could look at it with a really precise strobe light Here is what its vibration will look like 20

From complex wave to its components and the frequency spectrum Also known as a line spectrum Here, complex wave at the bottom..is broken into its component sin waves shown at the top (complex wave) 21

Fourier analysis 1768-1830 Complex wave component sinusoids Sound Light 22

Review of source characteristics Simple waves are a good way to learn about basic properties of frequency, amplitude, and phase. Examples include whistling; not really found much in speech Complex waves are found in nature for oscillating bodies that show simple harmonic motion (e.g., the vocal folds) 23 Information from Phonetics for Dummies. William F. Katz. Making Waves: An Overview of Sound. 2013.

Now let s look at the filter In speech, the filter is the supralaryngeal vocal tract (SLVT) The shape of the oral/pharyngeal cavity determines vowel quality SLVT shape is chiefly determined by tongue movement, but lips, velum and (indirectly) jaw also play a role 24

Resonance Reinforcement or shaping of frequencies as a function of the boundary conditions through which sound is passed FUN: Try producing a vowel with a paper towel roll placed over your mouth! The extra tube changes the resonance properties 25

Resonance / Formants The SLVT can be modeled as a kind of bottle with different shapes as sound passes through this chamber it achieves different sound qualities The resonant peaks of speech that relate to vowel quality are called formants. Thus, R1 = F1 ( first formant). R2 = F2, etc. F1 and F2 are critical determinants of vowel quality 26

Input SLVT final output 27

Vocal tract shape formant frequencies 28

Resonance FOUR basic rules F1 rule inversely related to jaw height. As the jaw goes down, F1 goes up, etc. F2 rule directly related to tongue fronting. As the tongue moves forward, F2 increases. F3 rule F3 drops with r-coloring Lip rounding rule All formants are lowered by liprounding (because lip protrusion lengthens the vocal tract tube ) 29 Information from Phonetics for Dummies. William F. Katz. Making Waves: An Overview of Sound. 2013.

Examples of resonance for /i/, /ɑ/, /u/ /i/ is made with the tongue high (thus, low F1) and fronted (high F2) /ɑ/ is made with the tongue low (high F1) and back (low F2) /i/ /ɑ/ /u/ 30

American English Vowels (Assmann & Katz, 2000) 31 Tables from Phonetics for Dummies. William F. Katz. Making Waves: An Overview of Sound. 2013.

F2 x F1 plot American English Vowels Peterson & Barney, 1952 32 Figure from Phonetics for Dummies. William F. Katz. Making Waves: An Overview of Sound. 2013.

Chap 13 Reading a sound spectrogram 33

The sound spectrograph Invented in the 1940s First called visible speech Originally thought to produce a speech fingerprint (?) We now know speech perception is far more complicated and ambiguous.. 34

Basics of spectrogram operation Original systems used bandpass filters Accumulated energy was represented by a dark image burned onto specially-treated paper Nowadays, performed digitally using variety of algorithms (e.g., DFT, LPC) 35

Relating line spectrum to spectrogram F3 F1 1 36

Sample of word spectrogram Pg. 192 Figure from Phonetics for Dummies. William F. Katz. Reading a Sound Spectrogram. 2013. 37

Vowel basics Here is /i ɑ i ɑ / produced with level pitch Wideband spectrogram (left); narrow band (right) Spectrogram from Ladefoged and Johnson, A course in phonetics 38

Let s find some vowels! 39 Figure from Phonetics for Dummies. William F. Katz. Reading a Sound Spectrogram. 2013.

Here they are: 40 Figure from Phonetics for Dummies. William F. Katz. Reading a Sound Spectrogram. 2013.

Consonants formant transitions An example of an F1 transition for the syllable /da/ 41 Figure from Phonetics for Dummies. William F. Katz. Reading a Sound Spectrogram. 2013.

American English vowels in /b_d/ context TOP ROW (front vowels): bead bid bade bed bad BOTTOM ROW (back vowels) bod bawd bode buhd booed 42 Spectrograms from Ladefoged and Johnson, A course in phonetics

Stops/ formant transitions Spectrograms of bab dad and gag Labials - point down, alveolars point to ~1700-1800 Hz, velars pinch F2 and F3 together Note: bottom-most fuzzy is the voice bar! Spectrogram from Ladefoged and Johnson, A course in phonetics 43

Voicing (voice of WK) 44

Fricatives Top row: /f/, theta, s, esh, Bottom row: /v/, ethe, z, long z Distribution of the spectral noise is the key here! 45 Spectrogram from Ladefoged and Johnson, A course in phonetics

The fricative /h/ Commonly excites all the formant cavities May look slightly different in varying vowel contexts 46 Spectrogram from Ladefoged and Johnson, A course in phonetics

Nasal stops Spectrograms of dinner dimmer dinger Marked by zeroes or formant regions with little energy Can also result in broadening of formant bandwidths (fuzzying the edges) Spectrogram from Ladefoged and Johnson, A course in phonetics 47

Approximants /ɹ/ - very low third formant, just above F2 /l/ - formants in the neighborhood of 250, 1200, and 2400 Hz; less apparent in final position. Higher formants considerable reduced in intensity Spectrogram from Ladefoged and Johnson, A course in phonetics 48

Common allophonic variations a toe a doe otto For full stops, there is about 100 ms of silence For tap, about 10-30 ms Spectrogram from Ladefoged and Johnson, A course in phonetics 49

Pseudo-colored example Here is an American English /æ/ (male) Analyzed in Wavesurfer Hot areas (in green/yellow/red) have more energy 50

Some tough cases. ALS (notice loss of formant frequency quality) Healthy 51

Women and children (High F 0 can cause problems estimating formants) 52