Lecture 6: Nonspeech and Music
|
|
- Polly Allison
- 6 years ago
- Views:
Transcription
1 EE E682: Speech & Audio Processing & Recognition Lecture 6: Nonspeech and Music 1 Music & nonspeech Dan Ellis <dpwe@ee.columbia.edu> Michael Mandel <mim@ee.columbia.edu> 2 Environmental Sounds Columbia University Dept. of Electrical Engineering dpwe/e682 3 Music Synthesis Techniques 4 Sinewave Synthesis February 26, 29 E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 1 / 3
2 Outline 1 Music & nonspeech 2 Environmental Sounds 3 Music Synthesis Techniques 4 Sinewave Synthesis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 2 / 3
3 Music & nonspeech What is nonspeech? according to research effort: a little music in the world: most everything high speech music Information content low wind & water natural animal sounds contact/ collision Origin machines & engines man-made attributes? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 3 / 3
4 Sound attributes Attributes suggest model parameters What do we notice about general sound? psychophysics: pitch, loudness, timbre bright/dull; sharp/soft; grating/soothing sound is not abstract : tendency is to describe by source-events Ecological perspective what matters about sound is what happened our percepts express this more-or-less directly E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 4 / 3
5 Motivations for modeling Describe/classify cast sound into model because want to use the resulting parameters Store/transmit model implicitly exploits limited structure of signal Resynthesize/modify model separates out interesting parameters Sound Model parameter space E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 5 / 3
6 Analysis and synthesis Analysis is the converse of synthesis: Model / representation Synthesis Analysis Sound Can exist apart: analysis for classification synthesis of artificial sounds Often used together: encoding/decoding of compressed formats resynthesis based on analyses analysis-by-synthesis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 6 / 3
7 Outline 1 Music & nonspeech 2 Environmental Sounds 3 Music Synthesis Techniques 4 Sinewave Synthesis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 7 / 3
8 Environmental Sounds Where sound comes from: mechanical interactions contact / collisions rubbing / scraping ringing / vibrating Interest in environmental sounds carry information about events around us.. including indirect hints need to create them in virtual environments.. including soundtracks Approaches to synthesis recording / sampling synthesis algorithms E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 8 / 3
9 Collision sounds Factors influencing: colliding bodies: size, material, damping local properties at contact point (hardness) energy of collision Source-filter model source = excitation of collision event (energy, local properties at contact) filter = resonance and radiation of energy (body properties) Variety of strike/scraping sounds resonant freqs size/shape damping material HF content in excitation/strike mallet, force (from Gaver, 1993) E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 9 / 3
10 Sound textures What do we hear in: a city street a symphony orchestra How do we distinguish: waterfall rainfall applause static Applause4 Rain1 5 5 freq / Hz freq / Hz time / s Levels of ecological description E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 1 / 3 time / s
11 Sound texture modeling (Athineos) Model broad spectral structure with LPC could ust resynthesize with noise Model fine temporal structure in residual with linear prediction in time domain y[n] Sound TD-LP y[n] = Σia i y[n-i] e[n] Whitened residual Per-frame spectral parameters DCT E[k] Residual spectrum FD-LP E[k] = Σib i E[k-i] Per-frame temporal envelope parameters precise dual of LPC in frequency poles model temporal events Temporal envelopes (4 poles, 256ms) amplitude time / sec Allows modification / synthesis? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
12 Outline 1 Music & nonspeech 2 Environmental Sounds 3 Music Synthesis Techniques 4 Sinewave Synthesis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
13 Music synthesis techniques S A T B What is music? M E S S I A H 44. chorus could be anything flexible synthesis HALLELUJAH! needed! Key elements of conventional Allegro music # Soprano instruments & # c. J Œ. J note-events (time, pitch, accent level) # melody, harmony, Alto rhythm & # c Hal - - lu - ah!. Œ. patterns of repetition & variation # Tenor Synthesis framework: V # c Hal. le - lu - ah! J J J Œ - instruments: common? # framework for many notes Bass # c Hal - le - lu - ah!. score: sequence of (time, pitch, level) J J note J Œ. events 7 # & # J # & # # V # J J Œ le - lu - ah, Œ le - lu - ah, Œ le - lu - ah,? # # J Œ. Hal - le - lu - ah! J Hal - le - lu - ah,. Hal - le - lu - ah, J Œ Œ. J J J Œ Hal - le - lu - ah,. J J J Œ. R R Hal - le - lu - ah! Hal - le - r r Hal. le - lu - ah! Hal - le J J J R R - - Hal - le - lu - ah! Hal - le - J J J R R Hal - le - lu - ah! Hal - le - J J R R Hal - le - lu - ah, Hal - le -. r r Hal - le - lu - ah, Hal - le -. J J J R R Hal - le - lu - ah, Hal - le -. J J J R R J J R R J J lu - ah! J r lu - ah! Hal - le - lu - ah r J Hal - le - lu - ah J J R R J J lu - ah! Hal - le - lu - ah J J R R J J lu - ah! Hal - le - lu - ah J J R R J lu - ah, lu - ah, J Hal - le - lu - ah, r r Hal - le - lu - ah, J J R R J lu - ah, HA G. F. J Hal - le - lu - ah, J J R R J J E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
14 The nature of musical instrument notes Characterized by instrument (register), note, loudness/emphasis, articulation... Frequency Piano Violin Time Time Frequency Clarinet Trumpet Time Distinguish how? Time E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
15 Development of music synthesis Goals of music synthesis: generate realistic / pleasant new notes control / explore timbre (quality) Earliest computer systems in 196s (voice synthesis, algorithmic) Pure synthesis approaches: 197s: Analog synths 198s: FM (Stanford/Yamaha) 199s: Physical modeling, hybrids Analysis-synthesis methods: sampling / wavetables sinusoid modeling harmonics + noise (+ transients) others? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
16 Analog synthesis The minimum to make an interesting sound Trigger Pitch + Oscillator Vibrato t + Cutoff freq Filter f Envelope + t Gain Sound Elements: harmonics-rich oscillators time-varying filters time-varying envelope modulation: low frequency + envelope-based Result: time-varying spectrum, independent pitch E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
17 FM synthesis Fast frequency modulation sidebands: cos(ω c t + β sin(ω m t)) = n= J n(β) cos((ω c + nω m )t) a harmonic series if ωc = rω m J n (β) is a Bessel function: 1 J J 1 J2 J3 J4.5 J n (β) for β < n modulation index β Complex harmonic spectra by varying β 4 3 freq / Hz time / s ωc = 2 Hz, ω m = 2 Hz what use? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
18 Sampling synthesis Resynthesis from real notes vary pitch, duration, level Pitch: stretch (resample) waveform Hz 894 Hz time / s Duration: loop a sustain section time / s time / s time / s Level: cross-fade different examples.2 Soft.1 mix.2 Loud veloc time / s time / s need to line up source samples good & bad? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
19 Outline 1 Music & nonspeech 2 Environmental Sounds 3 Music Synthesis Techniques 4 Sinewave Synthesis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
20 Sinewave synthesis If patterns of harmonics are what matter, why not generate them all explicitly: s[n] = k A k[n] cos(k ω [n] n) particularly powerful model for pitched signals Analysis (as with speech): find peaks in STFT S[ω, n] & track or track fundamental ω (harmonics / autocorrelation) & sample STFT at k ω set of A k [n] to duplicate tone: freq / Hz mag time / s freq / Hz time / s Synthesis via bank of oscillators E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 2 / 3
21 Steps to sinewave modeling - 1 The underlying STFT: X [k, n ] = N 1 n= ( ) 2πkn x[n + n ] w[n] exp N what value for N (FFT length & window size)? what value for H (hop size: n = r H, r =, 1, 2... )? STFT window length determines freq. resolution: X w (e ω ) = X (e ω ) W (e ω ) Choose N long enough to resolve harmonics 2-3x longest (lowest) fundamental period e.g. 3-6 ms = khz choose H N/2 N too long lost time resolution limits sinusoid amplitude rate of change E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
22 Steps to sinewave modeling - 2 Choose candidate sinusoids at each time by picking peaks in each STFT frame: freq / Hz level / db Quadratic fit for peak: level / db time / s freq / Hz y ab 2 /4 b/2 y = ax(x-b) phase / rad freq / Hz x linear interpolation of unwrapped phase freq / Hz E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
23 Steps to sinewave modeling - 3 Which peaks to pick? Want true sinusoids, not noise fluctuations prominence threshold above smoothed spectrum level / db freq / Hz Sinusoids exhibit stability... of amplitude in time of phase derivative in time compare with adacent time frames to test? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
24 Steps to sinewave modeling - 4 Grow tracks by appending newly-found peaks to existing tracks: freq existing tracks birth death new peaks time ambiguous assignments possible Unclaimed new peak birth of new track backtrack to find earliest trace? No continuation peak for existing track death of track or: reduce peak threshold for hysteresis E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
25 Resynthesis of sinewave models After analysis, each track defines contours in frequency, amplitude f k [n], A k [n] (+ phase?) use to drive a sinewave oscillators & sum up freq / Hz level A k [n] n f k [n] A k [n] cos(2πf k [n] t) time / s Regularize to exactly harmonic f k [n] = k f [n] freq / Hz freq / Hz time / s time / s time / s E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
26 Modification in sinewave resynthesis Change duration by warping timebase may want to keep onset unwarped 5 freq / Hz time / s Change pitch by scaling frequencies either stretching or resampling envelope level / db freq / Hz level / db freq / Hz Change timbre by interpolating parameters E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
27 Sinusoids + residual Only prominent peaks became tracks remainder of spectral energy was noisy? model residual energy with noise How to obtain non-harmonic spectrum? zero-out spectrum near extracted peaks? or: resynthesize (exactly) & subtract waveforms e s [n] = s[n] k A k [n] cos(2πn f k [n]).. must preserve phase! mag / db 2-2 sinusoids original -4-6 residual LPC freq / Hz Can model residual signal with LPC flexible representation of noisy residual E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
28 Sinusoids + noise + transients Sound represented as sinusoids and noise: s[n] = k A k [n] cos(2πn f k [n]) + h n [n] b[n] Sinusoids Parameters are A k [n], f k [n], h n [n] freq / Hz time / s Separate out abrupt transients in residual? e s [n] = k t k[n] + h n [n] b [n] more specific more flexible Residual e s [n] E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
29 Summary Nonspeech audio i.e. sound in general characteristics: ecological Music synthesis control of pitch, duration, loudness, articulation evolution of techniques sinusoids + noise + transients Music analysis... and beyond? E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, / 3
30 References W.W. Gaver. Synthesizing auditory icons. In Proc. Conference on Human factors in computing systems INTERCHI-93, pages Addison-Wesley, M. Athineos and D. P. W. Ellis. Autoregressive modeling of temporal envelopes. IEEE Tr. Signal Proc., 15(11): , 27. URL X. Serra and J. Smith III. Spectral Modeling Synthesis: A Sound Analysis/Synthesis System Based on a Deterministic Plus Stochastic Decomposition. Computer Music Journal, 14(4):12 24, 199. T. S. Verma and T. H. Y. Meng. An analysis/synthesis tool for transient signals that allows aflexible sines+ transients+ noise model for audio. In Proc. ICASSP, pages VI , Seattle, E682 (Ellis & Mandel) L6: Nonspeech and Music February 26, 29 3 / 3
Lecture 6: Nonspeech and Music. Music & nonspeech
EE E682: Speech & Audio Processing & Recognition Lecture 6: Nonspeech and Music 2 3 4 5 Music and nonspeech Environmental sounds Music synthesis techniques Sinewave synthesis Music analysis Dan Ellis
More informationLecture 6: Nonspeech and Music
EE E682: Speech & Audio Processing & Recognition Lecture 6: Nonspeech and Music 1 2 3 4 5 Music and nonspeech Environmental sounds Music synthesis techniques Sinewave synthesis Music analysis Dan Ellis
More informationLecture 5: Sinusoidal Modeling
ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 5: Sinusoidal Modeling 1. Sinusoidal Modeling 2. Sinusoidal Analysis 3. Sinusoidal Synthesis & Modification 4. Noise Residual Dan Ellis Dept. Electrical Engineering,
More informationSound Synthesis Methods
Sound Synthesis Methods Matti Vihola, mvihola@cs.tut.fi 23rd August 2001 1 Objectives The objective of sound synthesis is to create sounds that are Musically interesting Preferably realistic (sounds like
More informationSynthesis Techniques. Juan P Bello
Synthesis Techniques Juan P Bello Synthesis It implies the artificial construction of a complex body by combining its elements. Complex body: acoustic signal (sound) Elements: parameters and/or basic signals
More informationA Parametric Model for Spectral Sound Synthesis of Musical Sounds
A Parametric Model for Spectral Sound Synthesis of Musical Sounds Cornelia Kreutzer University of Limerick ECE Department Limerick, Ireland cornelia.kreutzer@ul.ie Jacqueline Walker University of Limerick
More informationPreeti Rao 2 nd CompMusicWorkshop, Istanbul 2012
Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 o Music signal characteristics o Perceptual attributes and acoustic properties o Signal representations for pitch detection o STFT o Sinusoidal model o
More informationE : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21
E85.267: Lecture 8 Source-Filter Processing E85.267: Lecture 8 Source-Filter Processing 21-4-1 1 / 21 Source-filter analysis/synthesis n f Spectral envelope Spectral envelope Analysis Source signal n 1
More informationAdvanced audio analysis. Martin Gasser
Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high
More informationTIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis
TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis Cornelia Kreutzer, Jacqueline Walker Department of Electronic and Computer Engineering, University of Limerick, Limerick,
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationSOUND SOURCE RECOGNITION AND MODELING
SOUND SOURCE RECOGNITION AND MODELING CASA seminar, summer 2000 Antti Eronen antti.eronen@tut.fi Contents: Basics of human sound source recognition Timbre Voice recognition Recognition of environmental
More informationMUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting
MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting Julius O. Smith III (jos@ccrma.stanford.edu) Center for Computer Research in Music and Acoustics (CCRMA)
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,
More informationFormant Synthesis of Haegeum: A Sound Analysis/Synthesis System using Cpestral Envelope
Formant Synthesis of Haegeum: A Sound Analysis/Synthesis System using Cpestral Envelope Myeongsu Kang School of Computer Engineering and Information Technology Ulsan, South Korea ilmareboy@ulsan.ac.kr
More informationPerception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.
Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence
More informationLecture 5: Speech modeling. The speech signal
EE E68: Speech & Audio Processing & Recognition Lecture 5: Speech modeling 1 3 4 5 Modeling speech signals Spectral and cepstral models Linear Predictive models (LPC) Other signal models Speech synthesis
More informationL19: Prosodic modification of speech
L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationLecture 6: Speech modeling and synthesis
EE E682: Speech & Audio Processing & Recognition Lecture 6: Speech modeling and synthesis 1 2 3 4 5 Modeling speech signals Spectral and cepstral models Linear Predictive models (LPC) Other signal models
More informationLecture 9: Time & Pitch Scaling
ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 9: Time & Pitch Scaling 1. Time Scale Modification (TSM) 2. Time-Domain Approaches 3. The Phase Vocoder 4. Sinusoidal Approach Dan Ellis Dept. Electrical Engineering,
More informationSINUSOIDAL MODELING. EE6641 Analysis and Synthesis of Audio Signals. Yi-Wen Liu Nov 3, 2015
1 SINUSOIDAL MODELING EE6641 Analysis and Synthesis of Audio Signals Yi-Wen Liu Nov 3, 2015 2 Last time: Spectral Estimation Resolution Scenario: multiple peaks in the spectrum Choice of window type and
More informationPerception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.
Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions
More informationAudio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands
Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,
More informationVIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering
VIBRATO DETECTING ALGORITHM IN REAL TIME Minhao Zhang, Xinzhao Liu University of Rochester Department of Electrical and Computer Engineering ABSTRACT Vibrato is a fundamental expressive attribute in music,
More informationNon-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment
Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase Reassignment Geoffroy Peeters, Xavier Rodet Ircam - Centre Georges-Pompidou, Analysis/Synthesis Team, 1, pl. Igor Stravinsky,
More informationLinear Frequency Modulation (FM) Chirp Signal. Chirp Signal cont. CMPT 468: Lecture 7 Frequency Modulation (FM) Synthesis
Linear Frequency Modulation (FM) CMPT 468: Lecture 7 Frequency Modulation (FM) Synthesis Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University January 26, 29 Till now we
More informationADDITIVE SYNTHESIS BASED ON THE CONTINUOUS WAVELET TRANSFORM: A SINUSOIDAL PLUS TRANSIENT MODEL
ADDITIVE SYNTHESIS BASED ON THE CONTINUOUS WAVELET TRANSFORM: A SINUSOIDAL PLUS TRANSIENT MODEL José R. Beltrán and Fernando Beltrán Department of Electronic Engineering and Communications University of
More informationProfessorial Inaugural Lecture, 26 April 2001 DIGITAL SYNTHESIS OF MUSICAL SOUNDS. B.T.G. Tan Department of Physics National University of Singapore
Professorial Inaugural Lecture, 26 April 2001 DIGITAL SYNTHESIS OF MUSICAL SOUNDS B.T.G. Tan Department of Physics National University of Singapore MUSICAL SYNTHESIS The many music synthesizers and keyboards
More informationSINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum
SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase Reassigned Spectrum Geoffroy Peeters, Xavier Rodet Ircam - Centre Georges-Pompidou Analysis/Synthesis Team, 1, pl. Igor
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationWARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS
NORDIC ACOUSTICAL MEETING 12-14 JUNE 1996 HELSINKI WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS Helsinki University of Technology Laboratory of Acoustics and Audio
More informationINTRODUCTION TO COMPUTER MUSIC. Roger B. Dannenberg Professor of Computer Science, Art, and Music. Copyright by Roger B.
INTRODUCTION TO COMPUTER MUSIC FM SYNTHESIS A classic synthesis algorithm Roger B. Dannenberg Professor of Computer Science, Art, and Music ICM Week 4 Copyright 2002-2013 by Roger B. Dannenberg 1 Frequency
More informationI-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes
I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes in Electrical Engineering (LNEE), Vol.345, pp.523-528.
More informationBetween physics and perception signal models for high level audio processing. Axel Röbel. Analysis / synthesis team, IRCAM. DAFx 2010 iem Graz
Between physics and perception signal models for high level audio processing Axel Röbel Analysis / synthesis team, IRCAM DAFx 2010 iem Graz Overview Introduction High level control of signal transformation
More informationCS 591 S1 Midterm Exam
Name: CS 591 S1 Midterm Exam Spring 2017 You must complete 3 of problems 1 4, and then problem 5 is mandatory. Each problem is worth 25 points. Please leave blank, or draw an X through, or write Do Not
More informationALTERNATING CURRENT (AC)
ALL ABOUT NOISE ALTERNATING CURRENT (AC) Any type of electrical transmission where the current repeatedly changes direction, and the voltage varies between maxima and minima. Therefore, any electrical
More informationINFLUENCE OF FREQUENCY DISTRIBUTION ON INTENSITY FLUCTUATIONS OF NOISE
INFLUENCE OF FREQUENCY DISTRIBUTION ON INTENSITY FLUCTUATIONS OF NOISE Pierre HANNA SCRIME - LaBRI Université de Bordeaux 1 F-33405 Talence Cedex, France hanna@labriu-bordeauxfr Myriam DESAINTE-CATHERINE
More informationCMPT 468: Frequency Modulation (FM) Synthesis
CMPT 468: Frequency Modulation (FM) Synthesis Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University October 6, 23 Linear Frequency Modulation (FM) Till now we ve seen signals
More informationConverting Speaking Voice into Singing Voice
Converting Speaking Voice into Singing Voice 1 st place of the Synthesis of Singing Challenge 2007: Vocal Conversion from Speaking to Singing Voice using STRAIGHT by Takeshi Saitou et al. 1 STRAIGHT Speech
More informationFalcon Singles - Oud for Falcon
Falcon Singles - Oud for Falcon 2016 Simon Stockhausen Installation As there is no default location for 3rd party sound libraries for Falcon, you can just install the folder Oud which you extracted from
More informationI D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008
R E S E A R C H R E P O R T I D I A P Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath
More informationLinguistic Phonetics. Spectral Analysis
24.963 Linguistic Phonetics Spectral Analysis 4 4 Frequency (Hz) 1 Reading for next week: Liljencrants & Lindblom 1972. Assignment: Lip-rounding assignment, due 1/15. 2 Spectral analysis techniques There
More informationComplex Sounds. Reading: Yost Ch. 4
Complex Sounds Reading: Yost Ch. 4 Natural Sounds Most sounds in our everyday lives are not simple sinusoidal sounds, but are complex sounds, consisting of a sum of many sinusoids. The amplitude and frequency
More informationFFT analysis in practice
FFT analysis in practice Perception & Multimedia Computing Lecture 13 Rebecca Fiebrink Lecturer, Department of Computing Goldsmiths, University of London 1 Last Week Review of complex numbers: rectangular
More informationB.Tech III Year II Semester (R13) Regular & Supplementary Examinations May/June 2017 DIGITAL SIGNAL PROCESSING (Common to ECE and EIE)
Code: 13A04602 R13 B.Tech III Year II Semester (R13) Regular & Supplementary Examinations May/June 2017 (Common to ECE and EIE) PART A (Compulsory Question) 1 Answer the following: (10 X 02 = 20 Marks)
More informationSPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester
SPEECH TO SINGING SYNTHESIS SYSTEM Mingqing Yun, Yoon mo Yang, Yufei Zhang Department of Electrical and Computer Engineering University of Rochester ABSTRACT This paper describes a speech-to-singing synthesis
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationUniversity of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005
University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 Lecture 5 Slides Jan 26 th, 2005 Outline of Today s Lecture Announcements Filter-bank analysis
More informationPhysics 101. Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound
Physics 101 Lecture 21 Doppler Effect Loudness Human Hearing Interference of Sound Waves Reflection & Refraction of Sound Quiz: Monday Oct. 18; Chaps. 16,17,18(as covered in class),19 CR/NC Deadline Oct.
More informationOverview of Code Excited Linear Predictive Coder
Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances
More informationSignal Analysis. Peak Detection. Envelope Follower (Amplitude detection) Music 270a: Signal Analysis
Signal Analysis Music 27a: Signal Analysis Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD November 23, 215 Some tools we may want to use to automate analysis
More informationDigitalising sound. Sound Design for Moving Images. Overview of the audio digital recording and playback chain
Digitalising sound Overview of the audio digital recording and playback chain IAT-380 Sound Design 2 Sound Design for Moving Images Sound design for moving images can be divided into three domains: Speech:
More informationVOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL
VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL Narsimh Kamath Vishweshwara Rao Preeti Rao NIT Karnataka EE Dept, IIT-Bombay EE Dept, IIT-Bombay narsimh@gmail.com vishu@ee.iitb.ac.in
More informationLecture 5: Speech modeling
CSC 836: Speech & Audio Understanding Lecture 5: Speech modeling Dan Ellis CUNY Graduate Center, Computer Science Program http://mr-pc.org/t/csc836 With much content from Dan Ellis
More informationMusical Acoustics, C. Bertulani. Musical Acoustics. Lecture 13 Timbre / Tone quality I
1 Musical Acoustics Lecture 13 Timbre / Tone quality I Waves: review 2 distance x (m) At a given time t: y = A sin(2πx/λ) A -A time t (s) At a given position x: y = A sin(2πt/t) Perfect Tuning Fork: Pure
More informationDigital Speech Processing and Coding
ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/
More informationSound Modeling from the Analysis of Real Sounds
Sound Modeling from the Analysis of Real Sounds S lvi Ystad Philippe Guillemain Richard Kronland-Martinet CNRS, Laboratoire de Mécanique et d'acoustique 31, Chemin Joseph Aiguier, 13402 Marseille cedex
More informationEnhanced Waveform Interpolative Coding at 4 kbps
Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression
More informationComputer Audio. An Overview. (Material freely adapted from sources far too numerous to mention )
Computer Audio An Overview (Material freely adapted from sources far too numerous to mention ) Computer Audio An interdisciplinary field including Music Computer Science Electrical Engineering (signal
More informationFrom Ladefoged EAP, p. 11
The smooth and regular curve that results from sounding a tuning fork (or from the motion of a pendulum) is a simple sine wave, or a waveform of a single constant frequency and amplitude. From Ladefoged
More informationFinal Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015
Final Exam Study Guide: 15-322 Introduction to Computer Music Course Staff April 24, 2015 This document is intended to help you identify and master the main concepts of 15-322, which is also what we intend
More informationStructure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping
Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics
More informationMusic 270a: Modulation
Music 7a: Modulation Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) October 3, 7 Spectrum When sinusoids of different frequencies are added together, the
More informationSpectrum. Additive Synthesis. Additive Synthesis Caveat. Music 270a: Modulation
Spectrum Music 7a: Modulation Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) October 3, 7 When sinusoids of different frequencies are added together, the
More informationInternational Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015
International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Analysis of Speech Signal Using Graphic User Interface Solly Joy 1, Savitha
More informationSince the advent of the sine wave oscillator
Advanced Distortion Analysis Methods Discover modern test equipment that has the memory and post-processing capability to analyze complex signals and ascertain real-world performance. By Dan Foley European
More informationSound waves. septembre 2014 Audio signals and systems 1
Sound waves Sound is created by elastic vibrations or oscillations of particles in a particular medium. The vibrations are transmitted from particles to (neighbouring) particles: sound wave. Sound waves
More informationTempo and Beat Tracking
Lecture Music Processing Tempo and Beat Tracking Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Book: Fundamentals of Music Processing Meinard Müller Fundamentals
More informationSGN Audio and Speech Processing
Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations
More informationMusic 171: Amplitude Modulation
Music 7: Amplitude Modulation Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) February 7, 9 Adding Sinusoids Recall that adding sinusoids of the same frequency
More informationMusical Acoustics, C. Bertulani. Musical Acoustics. Lecture 14 Timbre / Tone quality II
1 Musical Acoustics Lecture 14 Timbre / Tone quality II Odd vs Even Harmonics and Symmetry Sines are Anti-symmetric about mid-point If you mirror around the middle you get the same shape but upside down
More informationAdaptive Line Enhancer (ALE)
Adaptive Line Enhancer (ALE) This demonstration illustrates the application of adaptive filters to signal separation using a structure called an adaptive line enhancer (ALE). In adaptive line enhancement,
More informationTimbral Distortion in Inverse FFT Synthesis
Timbral Distortion in Inverse FFT Synthesis Mark Zadel Introduction Inverse FFT synthesis (FFT ) is a computationally efficient technique for performing additive synthesis []. Instead of summing partials
More informationCommunications Theory and Engineering
Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation
More informationUniversity of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics
University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics ESE250 Spring 2013 Lab 4: Time and Frequency Representation Friday, February 1, 2013 For Lab Session: Thursday,
More informationDeveloping a Versatile Audio Synthesizer TJHSST Senior Research Project Computer Systems Lab
Developing a Versatile Audio Synthesizer TJHSST Senior Research Project Computer Systems Lab 2009-2010 Victor Shepardson June 7, 2010 Abstract A software audio synthesizer is being implemented in C++,
More informationDrum Transcription Based on Independent Subspace Analysis
Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford,
More informationAutomatic Evaluation of Hindustani Learner s SARGAM Practice
Automatic Evaluation of Hindustani Learner s SARGAM Practice Gurunath Reddy M and K. Sreenivasa Rao Indian Institute of Technology, Kharagpur, India {mgurunathreddy, ksrao}@sit.iitkgp.ernet.in Abstract
More informationAdvanced Functions of Java-DSP for use in Electrical and Computer Engineering Senior Level Courses
Advanced Functions of Java-DSP for use in Electrical and Computer Engineering Senior Level Courses Andreas Spanias Robert Santucci Tushar Gupta Mohit Shah Karthikeyan Ramamurthy Topics This presentation
More informationINHARMONIC DISPERSION TUNABLE COMB FILTER DESIGN USING MODIFIED IIR BAND PASS TRANSFER FUNCTION
INHARMONIC DISPERSION TUNABLE COMB FILTER DESIGN USING MODIFIED IIR BAND PASS TRANSFER FUNCTION Varsha Shah Asst. Prof., Dept. of Electronics Rizvi College of Engineering, Mumbai, INDIA Varsha_shah_1@rediffmail.com
More informationLaboratory Assignment 4. Fourier Sound Synthesis
Laboratory Assignment 4 Fourier Sound Synthesis PURPOSE This lab investigates how to use a computer to evaluate the Fourier series for periodic signals and to synthesize audio signals from Fourier series
More informationTempo and Beat Tracking
Lecture Music Processing Tempo and Beat Tracking Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Introduction Basic beat tracking task: Given an audio recording
More informationMusic 171: Sinusoids. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) January 10, 2019
Music 7: Sinusoids Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) January 0, 209 What is Sound? The word sound is used to describe both:. an auditory sensation
More informationLecture 6. Rhythm Analysis. (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller)
Lecture 6 Rhythm Analysis (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller) Definitions for Rhythm Analysis Rhythm: movement marked by the regulated succession of strong
More informationSpectral analysis based synthesis and transformation of digital sound: the ATSH program
Spectral analysis based synthesis and transformation of digital sound: the ATSH program Oscar Pablo Di Liscia 1, Juan Pampin 2 1 Carrera de Composición con Medios Electroacústicos, Universidad Nacional
More informationSpeech Signal Analysis
Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for
More informationHungarian Speech Synthesis Using a Phase Exact HNM Approach
Hungarian Speech Synthesis Using a Phase Exact HNM Approach Kornél Kovács 1, András Kocsor 2, and László Tóth 3 Research Group on Artificial Intelligence of the Hungarian Academy of Sciences and University
More informationOutline. Communications Engineering 1
Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband channels Signal space representation Optimal
More informationSTO Limited Warranty Installation Overview
v2.5 2 STO Limited Warranty ----------------------------------------------------3 Installation --------------------------------------------------4 Overview --------------------------------------------------------5
More informationApplications of Music Processing
Lecture Music Processing Applications of Music Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Singing Voice Detection Important pre-requisite
More informationSound Recognition. ~ CSE 352 Team 3 ~ Jason Park Evan Glover. Kevin Lui Aman Rawat. Prof. Anita Wasilewska
Sound Recognition ~ CSE 352 Team 3 ~ Jason Park Evan Glover Kevin Lui Aman Rawat Prof. Anita Wasilewska What is Sound? Sound is a vibration that propagates as a typically audible mechanical wave of pressure
More informationAUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS)
AUDL GS08/GAV1 Auditory Perception Envelope and temporal fine structure (TFS) Envelope and TFS arise from a method of decomposing waveforms The classic decomposition of waveforms Spectral analysis... Decomposes
More informationProject 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing
Project : Part 2 A second hands-on lab on Speech Processing Frequency-domain processing February 24, 217 During this lab, you will have a first contact on frequency domain analysis of speech signals. You
More informationCombining granular synthesis with frequency modulation.
Combining granular synthesis with frequey modulation. Kim ERVIK Department of music University of Sciee and Technology Norway kimer@stud.ntnu.no Øyvind BRANDSEGG Department of music University of Sciee
More informationSHOCK RESPONSE SPECTRUM SYNTHESIS VIA DAMPED SINUSOIDS Revision B
SHOCK RESPONSE SPECTRUM SYNTHESIS VIA DAMPED SINUSOIDS Revision B By Tom Irvine Email: tomirvine@aol.com April 5, 2012 Introduction Mechanical shock can cause electronic components to fail. Crystal oscillators
More informationFundamentals of Digital Audio *
Digital Media The material in this handout is excerpted from Digital Media Curriculum Primer a work written by Dr. Yue-Ling Wong (ylwong@wfu.edu), Department of Computer Science and Department of Art,
More informationANALYSIS AND EVALUATION OF IRREGULARITY IN PITCH VIBRATO FOR STRING-INSTRUMENT TONES
Abstract ANALYSIS AND EVALUATION OF IRREGULARITY IN PITCH VIBRATO FOR STRING-INSTRUMENT TONES William L. Martens Faculty of Architecture, Design and Planning University of Sydney, Sydney NSW 2006, Australia
More informationMusic Signal Processing
Tutorial Music Signal Processing Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Anssi Klapuri Queen Mary University of London anssi.klapuri@elec.qmul.ac.uk Overview Part I:
More information