SPEECH AND SPECTRAL ANALYSIS
|
|
- Chad Walters
- 5 years ago
- Views:
Transcription
1 SPEECH AND SPECTRAL ANALYSIS 1
2 Sound waves: production in general: acoustic interference vibration (carried by some propagation medium) variations in air pressure speech: actions of the articulatory organs -> vibrations propagation medium -> airstream Representation of fluctuations in air pressure caused by a vibrating tuning fork (from P. Ladefoged, Elements of acoustic phonetics). 2
3 Sound waves: perception A schematic diagramm of the mechanism of the ear (from P. Ladefoged, Elements of acoustic phonetics). 3
4 Distinctive features of sound waves Frequency measured in cycles per second (Hz): A sound wave whose frequency is 100 Hz has 100 cycles in a second. cycle: the distance between two peaks (C) or rests (B) in the movement of the wave (i.e. it describes how close together the two points are) period: Period is the time required to complete one cycle of vibration, e.g. if 20 cycles are completed in 1 second, the period is 1/20th of a second (s), or 0.05 s. Amplitude the maximum distance between the peak (C) and the trough (A) peak-to-peak a. Fundamental frequency (of a voiced speech sound): 1/fundamental period (i.e. the time required to complete one cycle of the pattern as a whole) the frequency of vocal folds vibration depending on the size of the vocal apparatus human voice produces sounds within the ranges: male, female, children A wave of a 20 Hz frequency from Davenport & Hannahs, Introducing phonetics and phonology). 4
5 Simple and complex waves Two simple waves (pure tones, harmonics) of frequency 100 and 500 cps. The complex wave resulting from superposition of two simple waves of 100 and 500 cps (from P. Ladefoged, Elements of acoustic phonetics). 5
6 Distinctive features of sounds (1) Two sounds of the same duration (lenght) can differ with respect to: Pitch: subjective impression of the height of the sound related to fundamental frequency of the vibration which is an acoustic (objective) measure indicating the height of the sound two sounds of a different f. frequency (f 0 ) can be perceived as having the same pitch Loudness related to the amplitude of the sound: the higher the amplitudę, the louder the sound is perceived affected by the efficiency and distance of the propagating medium: the larger the distance, the less audible the sound becomes some materials, e.g. wood, are more efficient in carrying sounds than air 6
7 Distinctive features of sounds (2) quality (or colouring) results from differences in the shape of the propagation medium (hence differences in the perception of the same phoneme produced by different speakers, as well as differences in the vowel quality resulting from different shape of the vocal tract) and the material enclosing that medium (in case of musical instruments e.g. flute made of metal vs. wooden violin). Depending on the features (shape, size and material) of the propagation medium some harmonics of the sound will be emphasized and others will be damped. 7
8 Source-filter theory (1) speech production: a two stage process 1) the generation of a sound source 2) shaping/filtering of the sound source by the resonant properties of the vocal tract the input (source of sound): glottis or the supralaryngeal vocal tract the output: the lips or the nose (or both) The vocal tract filters the sound source. The vocal tract s acoustic response depends on its length & shape. 8
9 Source-filter theory (2) the effect of the vocal tract shape on the characteristics of the output sound: it determies whether there is a supralaryngeal sound source it determies the resonance frequencies (formant frequencies) of the vocal tract Examples of different types of source and vocal tract shape. 9
10 Source-filter theory (3) A resonator acts as a filter on the original source of sound: it rearranges the input energy so that frequencies that are at or near the resonance frequencies are amplified, at the expense of those frequencies that are not near the resonance frequencies (they become reduced). We can calculate the resonances given the length of the vocal tract (assume 17.5 cm for now) and the speed of sound (assume cm/s): F1 = c/4l, where: c = the speed of sound and L = the length of the tube For example, for a 17.5 cm tube, F1 = c/4l = 35000/70 = 500 Hz. 10
11 Periodic and aperiodic waves complex waves can be: periodic: regularly repeating pattern each complete cycle, or period, is like the last one aperiodic: irregular no regularly repeating pattern, thus no clear cycles, or periods the type of the complex waveform is determined by the sound source (excitation source): periodic: when the vocal folds vibrate regularly aperiodic: every other sound source, laryngeal and supralaryngeal 11
12 Periodic sound source in speech 1. Regular vibration of the vocal folds produces many different frequencies in a single glottal cycle, which results in a complex periodic waveform -> a periodic (= regularly repeating) sound source. 2. All periodic speech sounds are phonated, i.e. phonetically voiced. The source of periodic sound is always in the larynx at the glottis. 3. The period is the duration of one cycle of the pattern of a periodic wave (one glottal cycle). 4. The fundamental frequency (f0) is the reciprocal of the period: 1/period. 5. The percept of pitch is closely related to f0. A higher pitch has a higher f0, and hence faster glottal pulses. (Periodic sounds have pitch; aperiodic sounds do not.) 12
13 Aperiodic sound sources in speech 1. Aperiodic sound source results in turbulence noise or implosion noise (random noise = many frequencies, but forming irregular patterns). The vocal folds do not vibrate: such sounds are phonetically voiceless. 2. The aperiodic source may be laryngeal (located at the glottis) or supralaryngeal (located higher in the vocal tract): when the glottis is narrowed enough to produce aperiodic noise (but too wide to let the vocal folds vibrate), the result is whisper, [h] (= a voiceless vowel) or breathy voice for other aperiodic speech sounds, the source of sound is at a constriction in the oral cavity that is narrow enough to cause air to rush through it. These supralaryngeal constrictions result in voiceless stops, fricatives and affricates, e.g. [f s t ʧ]. 13
14 Mixed voiced and aperiodic sound source Periodic and aperiodic sources can be generated simultaneously to produce mixed voiced and aperiodic speech typical of sounds such as voiced fricatives. 14
15 Acoustic representations of sounds: spectrogram, waveform, spectrum (1) waveform variations in the air pressure associated with speech sounds changes in amplitude through time pulses corresponding to the vibrations of the vocal folds Waveform of a Polish utterance: Ostatnie przygody Korowiowa i Behemota (male speaker). 15
16 Acoustic representations of sounds (2): waveforms What kind of information can we derive from a waveform? amplitude, F0, the manner of articulation (to some extent): vowels, approximants and nasals pulses (voicing), high amplitude and energy (vowels, approximants and in the end nasals) voiced obstruents (plosives, fricatives and affricates) pulses and low energy and amplitude (fricative segments, plosives) voiceless obstruents empty spaces in case of stops, aperiodic variation in the amplitude in case of fricatives and fricative component of an affricate 16
17 Acoustic representations of sounds (3): spectrograms spectrogram variation in the frequency domain over the time vertical lines -> pulsations of the vocal folds frequency domain: certain frequencies are emphasized (dark marks) -> formants The frequency of the formant depends on the size and shape of the vocal tract, so in a spectrographic analysis it provides information on the place and manner of articulation. Spectrogram of a Polish utterance: Ostatnie przygody Korowiowa i Behemota (male speaker). 17
18 Acoustic representations of sounds (4): spectrograms In the analysis of speech the first four formants are taken into account and they are marked as F1, F2, F3 and F4 (from the lowest to the highest on the frequency scale). F1 and F2 are the most important indicators of vowel quality, whereas the higher formants reflect speaker s characteristics (voice quality). In the flow of articulation changes in formant frequencies which occur when the setting of the vocal tract is changed from one sound to another are called transitions. Spectrograms: optimal for analysis of duration, F0 and phonetic features (e.g. aspiration), and identification of different speech sounds (-> formant frequencies, transitions and vocal folds pulsations) 18
19 Acoustic representations of sounds (5): spectra spectrum (pl. spectra) is static: it shows the amplitude of each frequency present in the sound, usually during a single short section of the signall e.g. 25 or 50 ms you can obtain a spectrogram by arranging together a series of spectra types of spectral analysis: Fourier analysis (fft [fast Fourier transform] or dft [discrete Fourier transform]) Linear Predictive Coding (lpc) harmonics each component frequency in a periodic wave: H1, H2 (=2 x H1), H3 (=3 x H1), etc. the frequency of the lowest harmonic (the first harmonic) is equivalent to the fundamental frequency of the voice-> f0 = H1 harmonics formants Dft (jagged line) and lpc (smooth line) spectra of [uː] in It s too much. 19
INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006
1. Resonators and Filters INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006 Different vibrating objects are tuned to specific frequencies; these frequencies at which a particular
More informationThe source-filter model of speech production"
24.915/24.963! Linguistic Phonetics! The source-filter model of speech production" Glottal airflow Output from lips 400 200 0.1 0.2 0.3 Time (in secs) 30 20 10 0 0 1000 2000 3000 Frequency (Hz) Source
More informationSource-filter Analysis of Consonants: Nasals and Laterals
L105/205 Phonetics Scarborough Handout 11 Nov. 3, 2005 reading: Johnson Ch. 9 (today); Pickett Ch. 5 (Tues.) Source-filter Analysis of Consonants: Nasals and Laterals 1. Both nasals and laterals have voicing
More informationFrom Ladefoged EAP, p. 11
The smooth and regular curve that results from sounding a tuning fork (or from the motion of a pendulum) is a simple sine wave, or a waveform of a single constant frequency and amplitude. From Ladefoged
More informationReview: Frequency Response Graph. Introduction to Speech and Science. Review: Vowels. Response Graph. Review: Acoustic tube models
eview: requency esponse Graph Introduction to Speech and Science Lecture 5 ricatives and Spectrograms requency Domain Description Input Signal System Output Signal Output = Input esponse? eview: requency
More informationAcoustic Phonetics. How speech sounds are physically represented. Chapters 12 and 13
Acoustic Phonetics How speech sounds are physically represented Chapters 12 and 13 1 Sound Energy Travels through a medium to reach the ear Compression waves 2 Information from Phonetics for Dummies. William
More informationSpeech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065
Speech Processing Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 All course materials and handouts are the same for both versions. Differences: credits (20 for UG, 10 for PG);
More informationSource-filter analysis of fricatives
24.915/24.963 Linguistic Phonetics Source-filter analysis of fricatives Figure removed due to copyright restrictions. Readings: Johnson chapter 5 (speech perception) 24.963: Fujimura et al (1978) Noise
More informationAcoustic Phonetics. Chapter 8
Acoustic Phonetics Chapter 8 1 1. Sound waves Vocal folds/cords: Frequency: 300 Hz 0 0 0.01 0.02 0.03 2 1.1 Sound waves: The parts of waves We will be considering the parts of a wave with the wave represented
More informationCOMP 546, Winter 2017 lecture 20 - sound 2
Today we will examine two types of sounds that are of great interest: music and speech. We will see how a frequency domain analysis is fundamental to both. Musical sounds Let s begin by briefly considering
More informationLab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels
Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels A complex sound with particular frequency can be analyzed and quantified by its Fourier spectrum: the relative amplitudes
More informationStructure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping
Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics
More informationResonance and resonators
Resonance and resonators Dr. Christian DiCanio cdicanio@buffalo.edu University at Buffalo 10/13/15 DiCanio (UB) Resonance 10/13/15 1 / 27 Harmonics Harmonics and Resonance An example... Suppose you are
More informationCS 188: Artificial Intelligence Spring Speech in an Hour
CS 188: Artificial Intelligence Spring 2006 Lecture 19: Speech Recognition 3/23/2006 Dan Klein UC Berkeley Many slides from Dan Jurafsky Speech in an Hour Speech input is an acoustic wave form s p ee ch
More informationAspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification. Daryush Mehta
Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification Daryush Mehta SHBT 03 Research Advisor: Thomas F. Quatieri Speech and Hearing Biosciences and Technology 1 Summary Studied
More informationLinguistic Phonetics. Spectral Analysis
24.963 Linguistic Phonetics Spectral Analysis 4 4 Frequency (Hz) 1 Reading for next week: Liljencrants & Lindblom 1972. Assignment: Lip-rounding assignment, due 1/15. 2 Spectral analysis techniques There
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationAbout waves. Sounds of English. Different types of waves. Ever done the wave?? Why do we care? Tuning forks and pendulums
bout waves Sounds of English Topic 7 The acoustics of speech: Sound Waves Lots of examples in the world around us! an take all sorts of different forms Definition: disturbance that travels through a medium
More informationAn introduction to physics of Sound
An introduction to physics of Sound Outlines Acoustics and psycho-acoustics Sound? Wave and waves types Cycle Basic parameters of sound wave period Amplitude Wavelength Frequency Outlines Phase Types of
More informationLinguistics 401 LECTURE #2. BASIC ACOUSTIC CONCEPTS (A review)
Linguistics 401 LECTURE #2 BASIC ACOUSTIC CONCEPTS (A review) Unit of wave: CYCLE one complete wave (=one complete crest and trough) The number of cycles per second: FREQUENCY cycles per second (cps) =
More informationCommunications Theory and Engineering
Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation
More informationWaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels. Spectrogram. See Rogers chapter 7 8
WaveSurfer. Basic acoustics part 2 Spectrograms, resonance, vowels See Rogers chapter 7 8 Allows us to see Waveform Spectrogram (color or gray) Spectral section short-time spectrum = spectrum of a brief
More informationDigital Signal Processing
COMP ENG 4TL4: Digital Signal Processing Notes for Lecture #27 Tuesday, November 11, 23 6. SPECTRAL ANALYSIS AND ESTIMATION 6.1 Introduction to Spectral Analysis and Estimation The discrete-time Fourier
More informationSubtractive Synthesis & Formant Synthesis
Subtractive Synthesis & Formant Synthesis Prof Eduardo R Miranda Varèse-Gastprofessor eduardo.miranda@btinternet.com Electronic Music Studio TU Berlin Institute of Communications Research http://www.kgw.tu-berlin.de/
More informationReading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.
L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are
More informationEE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley
University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Speech Synthesis Spring,1999 Lecture 23 N.MORGAN
More informationConverting Speaking Voice into Singing Voice
Converting Speaking Voice into Singing Voice 1 st place of the Synthesis of Singing Challenge 2007: Vocal Conversion from Speaking to Singing Voice using STRAIGHT by Takeshi Saitou et al. 1 STRAIGHT Speech
More informationSource-Filter Theory 1
Source-Filter Theory 1 Vocal tract as sound production device Sound production by the vocal tract can be understood by analogy to a wind or brass instrument. sound generation sound shaping (or filtering)
More informationSpeech Synthesis; Pitch Detection and Vocoders
Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech
More informationFoundations of Language Science and Technology. Acoustic Phonetics 1: Resonances and formants
Foundations of Language Science and Technology Acoustic Phonetics 1: Resonances and formants Jan 19, 2015 Bernd Möbius FR 4.7, Phonetics Saarland University Speech waveforms and spectrograms A f t Formants
More informationInternational Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015
International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Analysis of Speech Signal Using Graphic User Interface Solly Joy 1, Savitha
More informationComplex Sounds. Reading: Yost Ch. 4
Complex Sounds Reading: Yost Ch. 4 Natural Sounds Most sounds in our everyday lives are not simple sinusoidal sounds, but are complex sounds, consisting of a sum of many sinusoids. The amplitude and frequency
More informationASPIRATION NOISE DURING PHONATION: SYNTHESIS, ANALYSIS, AND PITCH-SCALE MODIFICATION DARYUSH MEHTA
ASPIRATION NOISE DURING PHONATION: SYNTHESIS, ANALYSIS, AND PITCH-SCALE MODIFICATION by DARYUSH MEHTA B.S., Electrical Engineering (23) University of Florida SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING
More informationPsychology of Language
PSYCH 150 / LIN 155 UCI COGNITIVE SCIENCES syn lab Psychology of Language Prof. Jon Sprouse 01.10.13: The Mental Representation of Speech Sounds 1 A logical organization For clarity s sake, we ll organize
More informationStatistical NLP Spring Unsupervised Tagging?
Statistical NLP Spring 2008 Lecture 9: Speech Signal Dan Klein UC Berkeley Unsupervised Tagging? AKA part-of-speech induction Task: Raw sentences in Tagged sentences out Obvious thing to do: Start with
More informationPrinciples of Musical Acoustics
William M. Hartmann Principles of Musical Acoustics ^Spr inger Contents 1 Sound, Music, and Science 1 1.1 The Source 2 1.2 Transmission 3 1.3 Receiver 3 2 Vibrations 1 9 2.1 Mass and Spring 9 2.1.1 Definitions
More informationChapter 3. Description of the Cascade/Parallel Formant Synthesizer. 3.1 Overview
Chapter 3 Description of the Cascade/Parallel Formant Synthesizer The Klattalk system uses the KLSYN88 cascade-~arallel formant synthesizer that was first described in Klatt and Klatt (1990). This speech
More informationProject 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing
Project : Part 2 A second hands-on lab on Speech Processing Frequency-domain processing February 24, 217 During this lab, you will have a first contact on frequency domain analysis of speech signals. You
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationSpeech Perception Speech Analysis Project. Record 3 tokens of each of the 15 vowels of American English in bvd or hvd context.
Speech Perception Map your vowel space. Record tokens of the 15 vowels of English. Using LPC and measurements on the waveform and spectrum, determine F0, F1, F2, F3, and F4 at 3 points in each token plus
More informationDigitized signals. Notes on the perils of low sample resolution and inappropriate sampling rates.
Digitized signals Notes on the perils of low sample resolution and inappropriate sampling rates. 1 Analog to Digital Conversion Sampling an analog waveform Sample = measurement of waveform amplitude at
More informationEpoch Extraction From Speech Signals K. Sri Rama Murty and B. Yegnanarayana, Senior Member, IEEE
1602 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 8, NOVEMBER 2008 Epoch Extraction From Speech Signals K. Sri Rama Murty and B. Yegnanarayana, Senior Member, IEEE Abstract
More informationRecap the waveform. Complex waves (dạnh sóng phức tạp) and spectra. Recap the waveform
Recap the waveform Complex waves (dạnh sóng phức tạp) and spectra Cơ sở âm vị học và ngữ âm học Lecture 11 The waveform (dạnh sóng âm) is a representation of the amplitude (biên độ) of air pressure perturbations
More informationINTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)
INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) Proceedings of the 2 nd International Conference on Current Trends in Engineering and Management ICCTEM -214 ISSN
More informationCHAPTER 3. ACOUSTIC MEASURES OF GLOTTAL CHARACTERISTICS 39 and from periodic glottal sources (Shadle, 1985; Stevens, 1993). The ratio of the amplitude of the harmonics at 3 khz to the noise amplitude in
More informationCopyright 2009 Pearson Education, Inc.
Chapter 16 Sound 16-1 Characteristics of Sound Sound can travel through h any kind of matter, but not through a vacuum. The speed of sound is different in different materials; in general, it is slowest
More informationQuantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation
Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Peter J. Murphy and Olatunji O. Akande, Department of Electronic and Computer Engineering University
More informationINDIANA UNIVERSITY, DEPT. OF PHYSICS P105, Basic Physics of Sound, Spring 2010
Name: ID#: INDIANA UNIVERSITY, DEPT. OF PHYSICS P105, Basic Physics of Sound, Spring 2010 Midterm Exam #2 Thursday, 25 March 2010, 7:30 9:30 p.m. Closed book. You are allowed a calculator. There is a Formula
More informationDefinition of Sound. Sound. Vibration. Period - Frequency. Waveform. Parameters. SPA Lundeen
Definition of Sound Sound Psychologist's = that which is heard Physicist's = a propagated disturbance in the density of an elastic medium Vibrator serves as the sound source Medium = air 2 Vibration Periodic
More informationRespiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R.
Respiration, Phonation, and Resonation: How dependent are they on each other? (Kay-Pentax Lecture in Upper Airway Science) Ingo R. Titze Director, National Center for Voice and Speech, University of Utah
More informationPreview. Sound Section 1. Section 1 Sound Waves. Section 2 Sound Intensity and Resonance. Section 3 Harmonics
Sound Section 1 Preview Section 1 Sound Waves Section 2 Sound Intensity and Resonance Section 3 Harmonics Sound Section 1 TEKS The student is expected to: 7A examine and describe oscillatory motion and
More informationChapter 3 The Physics of Sound
Chapter 3 The Physics of Sound Sound lies at the very center of speech communication. A sound wave is both the end product of the speech production mechanism and the primary source of raw material from
More informationDetermination of instants of significant excitation in speech using Hilbert envelope and group delay function
Determination of instants of significant excitation in speech using Hilbert envelope and group delay function by K. Sreenivasa Rao, S. R. M. Prasanna, B.Yegnanarayana in IEEE Signal Processing Letters,
More informationLinguistic Phonetics. The acoustics of vowels
24.963 Linguistic Phonetics The acoustics of vowels No class on Tuesday 0/3 (Tuesday is a Monday) Readings: Johnson chapter 6 (for this week) Liljencrants & Lindblom (972) (for next week) Assignment: Modeling
More informationBlock diagram of proposed general approach to automatic reduction of speech wave to lowinformation-rate signals.
XIV. SPEECH COMMUNICATION Prof. M. Halle G. W. Hughes J. M. Heinz Prof. K. N. Stevens Jane B. Arnold C. I. Malme Dr. T. T. Sandel P. T. Brady F. Poza C. G. Bell O. Fujimura G. Rosen A. AUTOMATIC RESOLUTION
More informationMusical Acoustics, C. Bertulani. Musical Acoustics. Lecture 13 Timbre / Tone quality I
1 Musical Acoustics Lecture 13 Timbre / Tone quality I Waves: review 2 distance x (m) At a given time t: y = A sin(2πx/λ) A -A time t (s) At a given position x: y = A sin(2πt/t) Perfect Tuning Fork: Pure
More information8.3 Basic Parameters for Audio
8.3 Basic Parameters for Audio Analysis Physical audio signal: simple one-dimensional amplitude = loudness frequency = pitch Psycho-acoustic features: complex A real-life tone arises from a complex superposition
More informationSignal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2
Signal Processing for Speech Applications - Part 2-1 Signal Processing For Speech Applications - Part 2 May 14, 2013 Signal Processing for Speech Applications - Part 2-2 References Huang et al., Chapter
More informationAn Implementation of the Klatt Speech Synthesiser*
REVISTA DO DETUA, VOL. 2, Nº 1, SETEMBRO 1997 1 An Implementation of the Klatt Speech Synthesiser* Luis Miguel Teixeira de Jesus, Francisco Vaz, José Carlos Principe Resumo - Neste trabalho descreve-se
More informationAn Experimentally Measured Source Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model
Acoust Aust (2016) 44:187 191 DOI 10.1007/s40857-016-0046-7 TUTORIAL PAPER An Experimentally Measured Source Filter Model: Glottal Flow, Vocal Tract Gain and Output Sound from a Physical Model Joe Wolfe
More informationQuarterly Progress and Status Report. A note on the vocal tract wall impedance
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report A note on the vocal tract wall impedance Fant, G. and Nord, L. and Branderud, P. journal: STL-QPSR volume: 17 number: 4 year: 1976
More informationExperimental evaluation of inverse filtering using physical systems with known glottal flow and tract characteristics
Experimental evaluation of inverse filtering using physical systems with known glottal flow and tract characteristics Derek Tze Wei Chu and Kaiwen Li School of Physics, University of New South Wales, Sydney,
More informationA Physiologically Produced Impulsive UWB signal: Speech
A Physiologically Produced Impulsive UWB signal: Speech Maria-Gabriella Di Benedetto University of Rome La Sapienza Faculty of Engineering Rome, Italy gaby@acts.ing.uniroma1.it http://acts.ing.uniroma1.it
More informationGLOTTAL EXCITATION EXTRACTION OF VOICED SPEECH - JOINTLY PARAMETRIC AND NONPARAMETRIC APPROACHES
Clemson University TigerPrints All Dissertations Dissertations 5-2012 GLOTTAL EXCITATION EXTRACTION OF VOICED SPEECH - JOINTLY PARAMETRIC AND NONPARAMETRIC APPROACHES Yiqiao Chen Clemson University, rls_lms@yahoo.com
More informationChapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals
Chapter 2. Meeting 2, Measures and Visualizations of Sounds and Signals 2.1. Announcements Be sure to completely read the syllabus Recording opportunities for small ensembles Due Wednesday, 15 February:
More informationLocation of sound source and transfer functions
Location of sound source and transfer functions Sounds produced with source at the larynx either voiced or voiceless (aspiration) sound is filtered by entire vocal tract Transfer function is well modeled
More informationIMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey
Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,
More informationDIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS
DIVERSE RESONANCE TUNING STRATEGIES FOR WOMEN SINGERS John Smith Joe Wolfe Nathalie Henrich Maëva Garnier Physics, University of New South Wales, Sydney j.wolfe@unsw.edu.au Physics, University of New South
More informationL19: Prosodic modification of speech
L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture
More informationDigital Speech Processing and Coding
ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/
More informationALTERNATING CURRENT (AC)
ALL ABOUT NOISE ALTERNATING CURRENT (AC) Any type of electrical transmission where the current repeatedly changes direction, and the voltage varies between maxima and minima. Therefore, any electrical
More informationUSING A WHITE NOISE SOURCE TO CHARACTERIZE A GLOTTAL SOURCE WAVEFORM FOR IMPLEMENTATION IN A SPEECH SYNTHESIS SYSTEM
USING A WHITE NOISE SOURCE TO CHARACTERIZE A GLOTTAL SOURCE WAVEFORM FOR IMPLEMENTATION IN A SPEECH SYNTHESIS SYSTEM by Brandon R. Graham A report submitted in partial fulfillment of the requirements for
More informationSpeech Signal Analysis
Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for
More informationAcoustics and Fourier Transform Physics Advanced Physics Lab - Summer 2018 Don Heiman, Northeastern University, 1/12/2018
1 Acoustics and Fourier Transform Physics 3600 - Advanced Physics Lab - Summer 2018 Don Heiman, Northeastern University, 1/12/2018 I. INTRODUCTION Time is fundamental in our everyday life in the 4-dimensional
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationJOURNAL OF OBJECT TECHNOLOGY
JOURNAL OF OBJECT TECHNOLOGY Online at http://www.jot.fm. Published by ETH Zurich, Chair of Software Engineering JOT, 2009 Vol. 9, No. 1, January-February 2010 The Discrete Fourier Transform, Part 5: Spectrogram
More informationQuarterly Progress and Status Report. Acoustic properties of the Rothenberg mask
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Acoustic properties of the Rothenberg mask Hertegård, S. and Gauffin, J. journal: STL-QPSR volume: 33 number: 2-3 year: 1992 pages:
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence
More informationSOUND SOURCE RECOGNITION AND MODELING
SOUND SOURCE RECOGNITION AND MODELING CASA seminar, summer 2000 Antti Eronen antti.eronen@tut.fi Contents: Basics of human sound source recognition Timbre Voice recognition Recognition of environmental
More informationBasic Characteristics of Speech Signal Analysis
www.ijird.com March, 2016 Vol 5 Issue 4 ISSN 2278 0211 (Online) Basic Characteristics of Speech Signal Analysis S. Poornima Assistant Professor, VlbJanakiammal College of Arts and Science, Coimbatore,
More informationA() I I X=t,~ X=XI, X=O
6 541J Handout T l - Pert r tt Ofl 11 (fo 2/19/4 A() al -FA ' AF2 \ / +\ X=t,~ X=X, X=O, AF3 n +\ A V V V x=-l x=o Figure 3.19 Curves showing the relative magnitude and direction of the shift AFn in formant
More informationA mechanical wave is a disturbance which propagates through a medium with little or no net displacement of the particles of the medium.
Waves and Sound Mechanical Wave A mechanical wave is a disturbance which propagates through a medium with little or no net displacement of the particles of the medium. Water Waves Wave Pulse People Wave
More informationAn Investigation of Response Bias in Tone Glide Direction Identification. A Senior Honors Thesis
An Investigation of Response Bias in Tone Glide Direction Identification A Senior Honors Thesis Presented in Partial Fulfillment of the Requirements for graduation with distinction in Speech and Hearing
More informationAnnouncements. Today. Speech and Language. State Path Trellis. HMMs: MLE Queries. Introduction to Artificial Intelligence. V22.
Introduction to Artificial Intelligence Announcements V22.0472-001 Fall 2009 Lecture 19: Speech Recognition & Viterbi Decoding Rob Fergus Dept of Computer Science, Courant Institute, NYU Slides from John
More informationBetween physics and perception signal models for high level audio processing. Axel Röbel. Analysis / synthesis team, IRCAM. DAFx 2010 iem Graz
Between physics and perception signal models for high level audio processing Axel Röbel Analysis / synthesis team, IRCAM DAFx 2010 iem Graz Overview Introduction High level control of signal transformation
More informationChapter 1: Introduction to audio signal processing
Chapter 1: Introduction to audio signal processing KH WONG, Rm 907, SHB, CSE Dept. CUHK, Email: khwong@cse.cuhk.edu.hk http://www.cse.cuhk.edu.hk/~khwong/cmsc5707 Audio signal proce ssing Ch1, v.3c 1 Reference
More informationTransforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction
Transforming High-Effort Voices Into Breathy Voices Using Adaptive Pre-Emphasis Linear Prediction by Karl Ingram Nordstrom B.Eng., University of Victoria, 1995 M.A.Sc., University of Victoria, 2000 A Dissertation
More informationInterference & Superposition. Creating Complex Wave Forms
Interference & Superposition Creating Complex Wave Forms Waves & Interference I. Definitions and Types II. Parameters and Equations III. Sound IV. Graphs of Waves V. Interference - superposition - standing
More informationEE 225D LECTURE ON SYNTHETIC AUDIO. University of California Berkeley
University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Synthetic Audio Spring,1999 Lecture 2 N.MORGAN
More informationLinear Predictive Coding *
OpenStax-CNX module: m45345 1 Linear Predictive Coding * Kiefer Forseth This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 1 LPC Implementation Linear
More informationEC 6501 DIGITAL COMMUNICATION UNIT - II PART A
EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing
More informationDigital Signal Representation of Speech Signal
Digital Signal Representation of Speech Signal Mrs. Smita Chopde 1, Mrs. Pushpa U S 2 1,2. EXTC Department, Mumbai University Abstract Delta modulation is a waveform coding techniques which the data rate
More informationX. SPEECH ANALYSIS. Prof. M. Halle G. W. Hughes H. J. Jacobsen A. I. Engel F. Poza A. VOWEL IDENTIFIER
X. SPEECH ANALYSIS Prof. M. Halle G. W. Hughes H. J. Jacobsen A. I. Engel F. Poza A. VOWEL IDENTIFIER Most vowel identifiers constructed in the past were designed on the principle of "pattern matching";
More informationUnit 6: Waves and Sound
Unit 6: Waves and Sound Brent Royuk Phys-109 Concordia University Waves What is a wave? Examples Water, sound, slinky, ER Transverse vs. Longitudinal 2 Wave Properties The magic of waves. Great distances
More informationChapter 12. Preview. Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect. Section 1 Sound Waves
Section 1 Sound Waves Preview Objectives The Production of Sound Waves Frequency of Sound Waves The Doppler Effect Section 1 Sound Waves Objectives Explain how sound waves are produced. Relate frequency
More informationLecture Presentation Chapter 16 Superposition and Standing Waves
Lecture Presentation Chapter 16 Superposition and Standing Waves Suggested Videos for Chapter 16 Prelecture Videos Constructive and Destructive Interference Standing Waves Physics of Your Vocal System
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume, http://acousticalsociety.org/ ICA Montreal Montreal, Canada - June Musical Acoustics Session amu: Aeroacoustics of Wind Instruments and Human Voice II amu.
More informationSound Interference and Resonance: Standing Waves in Air Columns
Sound Interference and Resonance: Standing Waves in Air Columns Bởi: OpenStaxCollege Some types of headphones use the phenomena of constructive and destructive interference to cancel out outside noises.
More informationOn the glottal flow derivative waveform and its properties
COMPUTER SCIENCE DEPARTMENT UNIVERSITY OF CRETE On the glottal flow derivative waveform and its properties A time/frequency study George P. Kafentzis Bachelor s Dissertation 29/2/2008 Supervisor: Yannis
More information