Lecture 5: Sinusoidal Modeling

Similar documents
Lecture 9: Time & Pitch Scaling

Lecture 6: Nonspeech and Music

Lecture 6: Nonspeech and Music

Lecture 6: Nonspeech and Music. Music & nonspeech

MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting

Sound Synthesis Methods

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

FFT analysis in practice

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

Advanced audio analysis. Martin Gasser

Lecture 6: Speech modeling and synthesis

Lecture 5: Speech modeling. The speech signal

Synthesis Techniques. Juan P Bello

Lecture Schedule: Week Date Lecture Title

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

ADDITIVE SYNTHESIS BASED ON THE CONTINUOUS WAVELET TRANSFORM: A SINUSOIDAL PLUS TRANSIENT MODEL

Final Exam Practice Questions for Music 421, with Solutions

Formant Synthesis of Haegeum: A Sound Analysis/Synthesis System using Cpestral Envelope

L19: Prosodic modification of speech

Converting Speaking Voice into Singing Voice

8.3 Basic Parameters for Audio

Signal Analysis. Peak Detection. Envelope Follower (Amplitude detection) Music 270a: Signal Analysis

Wavelets and wavelet convolution and brain music. Dr. Frederike Petzschner Translational Neuromodeling Unit

Lecture 14: Source Separation

SINUSOIDAL MODELING. EE6641 Analysis and Synthesis of Audio Signals. Yi-Wen Liu Nov 3, 2015

EE482: Digital Signal Processing Applications

Linguistic Phonetics. Spectral Analysis

SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 13 Timbre / Tone quality I

The Discrete Fourier Transform. Claudia Feregrino-Uribe, Alicia Morales-Reyes Original material: Dr. René Cumplido

Frequency Domain Representation of Signals

Speech Signal Analysis

Identification of Nonstationary Audio Signals Using the FFT, with Application to Analysis-based Synthesis of Sound

Timbral Distortion in Inverse FFT Synthesis

TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis

VQ Source Models: Perceptual & Phase Issues

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

Applications of Music Processing

Enhanced Waveform Interpolative Coding at 4 kbps

Between physics and perception signal models for high level audio processing. Axel Röbel. Analysis / synthesis team, IRCAM. DAFx 2010 iem Graz

Lecture 7 Frequency Modulation

Topic. Spectrogram Chromagram Cesptrogram. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio

Chapter 18. Superposition and Standing Waves

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

ALTERNATING CURRENT (AC)

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing

Laboratory Assignment 4. Fourier Sound Synthesis

TRANSFORMS / WAVELETS

FIR/Convolution. Visulalizing the convolution sum. Convolution

CS 591 S1 Midterm Exam

URBANA-CHAMPAIGN. CS 498PS Audio Computing Lab. Audio DSP basics. Paris Smaragdis. paris.cs.illinois.

In Phase. Out of Phase

8A. ANALYSIS OF COMPLEX SOUNDS. Amplitude, loudness, and decibels

Phase Vocoder Report for Audio Signal Processing. Gerald Leung V

Since the advent of the sine wave oscillator

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

Modulation Spectral Filtering: A New Tool for Acoustic Signal Analysis

Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels

Drum Transcription Based on Independent Subspace Analysis

Frequency Division Multiplexing Spring 2011 Lecture #14. Sinusoids and LTI Systems. Periodic Sequences. x[n] = x[n + N]

Signal processing preliminaries

METHODS FOR SEPARATION OF AMPLITUDE AND FREQUENCY MODULATION IN FOURIER TRANSFORMED SIGNALS

Topic 2. Signal Processing Review. (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music)

PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation

SAMPLING THEORY. Representing continuous signals with discrete numbers

From Fourier Series to Analysis of Non-stationary Signals - VII

Signal Characterization in terms of Sinusoidal and Non-Sinusoidal Components

COMBINING ADVANCED SINUSOIDAL AND WAVEFORM MATCHING MODELS FOR PARAMETRIC AUDIO/SPEECH CODING

Singing Voice Detection. Applications of Music Processing. Singing Voice Detection. Singing Voice Detection. Singing Voice Detection

DSP First. Laboratory Exercise #11. Extracting Frequencies of Musical Tones

Physics 115 Lecture 13. Fourier Analysis February 22, 2018

THE CITADEL THE MILITARY COLLEGE OF SOUTH CAROLINA. Department of Electrical and Computer Engineering. ELEC 423 Digital Signal Processing

Audio processing methods on marine mammal vocalizations

POLYPHONIC PITCH DETECTION BY MATCHING SPECTRAL AND AUTOCORRELATION PEAKS. Sebastian Kraft, Udo Zölzer

Empirical Mode Decomposition: Theory & Applications

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

SGN Audio and Speech Processing

Digital Signal Processing

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Interpolation Error in Waveform Table Lookup

CLASSIFICATION OF MULTIPLE SIGNALS USING 2D MATCHING OF MAGNITUDE-FREQUENCY DENSITY FEATURES

Signal Processing First Lab 20: Extracting Frequencies of Musical Tones

Sampling and Reconstruction of Analog Signals

Digital Image Processing

CLASSIFICATION OF MULTIPLE SIGNALS USING 2D MATCHING OF MAGNITUDE-FREQUENCY DENSITY FEATURES

Digital Speech Processing and Coding

Waves ADD: Constructive Interference. Waves SUBTRACT: Destructive Interference. In Phase. Out of Phase

Application of The Wavelet Transform In The Processing of Musical Signals

CHAPTER. delta-sigma modulators 1.0

ece 429/529 digital signal processing robin n. strickland ece dept, university of arizona ECE 429/529 RNS

Audio Signal Compression using DCT and LPC Techniques

Lecture 5: Speech modeling

ECE 484 Digital Image Processing Lec 09 - Image Resampling

Signal Analysis. Young Won Lim 2/9/18

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement

Transcription:

ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 5: Sinusoidal Modeling 1. Sinusoidal Modeling 2. Sinusoidal Analysis 3. Sinusoidal Synthesis & Modification 4. Noise Residual Dan Ellis Dept. Electrical Engineering, Columbia University dpwe@ee.columbia.edu http://www.ee.columbia.edu/~dpwe/e4896/ E4896 Music Signal Processing (Dan Ellis) 213-2-18-1 /16

1. Sinusoidal Modeling Periodic sounds ridges in spectrogram each ridge is a sinusoidal harmonic.. with smoothly-varying parameters Violin.arco.ff.A4.. an efficient & flexible description? E4896 Music Signal Processing (Dan Ellis) 213-2-18-2 /16

Sinusoid Modeling Analogous to Fourier series model harmonics explicitly? e.g. x[n] =... for pitched signal with fundamental k[n] =k [n] n Additional constraints harmonicity smoothness of k a k [n] a k [n]cos( k [n]) [n] Arbitrarily accurate given enough sinusoids E4896 Music Signal Processing (Dan Ellis) 213-2-18 - /16

Examples Using Michael Klingbeil s SPEAR http://www.klingbeil.com/spear/ E4896 Music Signal Processing (Dan Ellis) 213-2-18-4 /16

Envelope Limitations Extracted envelope reflects analysis window.4 2.3.2.1 Frequency 15 1 5 1 2 3 4 5 6 7.4 2.5 1 1.5 2 Time.3.2.1 Frequency 15 1 5 1 2 3 4 5 6 7.5 1 1.5 2 Time Sharp window violates assumptions E4896 Music Signal Processing (Dan Ellis) 213-2-18-5 /16

2. Sinusoidal Analysis Sinusoids = peaks in spectrogram slices = DFT frames X[k, m] = N 1 n= DFT length N window determines frequency resolution: long enough to see harmonics x[n + ml] w[n]e j 2 e.g. 2-3x longest pitch cycle typically 5-1 ms a k [n] but: too long blurs amplitude envelope Hop advance L choose N/2 or N/4.. denser for simpler interpolation along time kn N X(e j ) W (e j ) E4896 Music Signal Processing (Dan Ellis) 213-2-18-6 /16

Sinusoidal Peak Picking Local maxima in DFT frames freq / Hz level / db 8 6 4 2.2.4.6.8.1.12.14.16.18 2-2 -4 Quadratic fit for sub-bin resolution level / db 2 1-1 -2 y ab 2 /4 b/2 y = ax(x-b) phase / rad 4 6 8 freq / Hz x 4 6 8 freq / Hz E4896 Music Signal Processing (Dan Ellis) 213-2-18-7 /16-5 -1 time / s -6 1 2 3 4 5 6 7 freq / Hz

Peak Selection Don t want every peak just true sinusoids threshold? level / db 2-2 -4-6 1 2 3 4 5 6 7 freq / Hz local shape - fits ( ) W (e j ) Look for stability of frequency & amplitude in successive time frames phase derivative in time/freq E4896 Music Signal Processing (Dan Ellis) 213-2-18-8 /16

Track Formation Connect peaks in adjacent frames to form sinusoids can be ambiguous if large frequency changes freq birth existing tracks death new peaks time Unclaimed peak create new track No continuation of track termination hysteresis E4896 Music Signal Processing (Dan Ellis) 213-2-18-9 /16

Pitch Tracking Extracted sinusoids could be anywhere but often expect them to be in harmonic series freq / Hz 6 4 2 freq / Hz 7 65 6.5.1.15.2 time / s 55.5.1.15.2 Find pitch by searching for common factor can then regularize pitch k[n] =k [n] time / s E4896 Music Signal Processing (Dan Ellis) 213-2-18-1/16

3. Sinusoidal Synthesis Each sinusoid track drives an oscillator {a k [n], k[n]} 3 3 level 2 1 7 a k [n] a k [n] cos( k [n] t) 2 1-1 freq / Hz 6 5 k [n].5.1.15.2 n time / s -2-3.5.1.15.2 time / s can interpolate amplitude, frequency samples Faster method synthesizes DFT frames then overlap-add trickier to achieve frequency modulation E4896 Music Signal Processing (Dan Ellis) 213-2-18-11/16

Sinusoidal Modification Sinusoidal description very easy to modify e.g. changing time base of sample points 5 freq / Hz 4 3 2 1 Frequency stretch preserve formant envelope? level / db.5.1.15.2.25.3.35.4.45.5 4 3 2 1 1 2 3 4 freq / Hz 1 2 3 4 E4896 Music Signal Processing (Dan Ellis) 213-2-18-12/16 level / db 4 3 2 1 time / s freq / Hz

4. Noise Residual Some energy is not well fit with sinusoids e.g. noisy energy Can just keep it as residual or model it some other way Leads to sinusoidal + noise model x[n] = a k [n]cos( k [n]n) + e[n] mag / db 2-2 k sinusoids original -4-6 -8 LPC 1 2 3 4 5 6 7 freq / Hz residual E4896 Music Signal Processing (Dan Ellis) 213-2-18-13/16

Sinusoids + Noise Decomposition Removing sines reveals noise & transients Guitar - original 4 3 Frequency 2 1 4.2.4.6.8 1 1.2 1.4 1.6 1.8 2 Time Guitar - sinusoid reconstruction 3 Frequency 2 1 4.2.4.6.8 1 1.2 1.4 1.6 1.8 2 Time Guitar - residual (original - sines) 3 Frequency 2 1.2.4.6.8 1 1.2 1.4 1.6 1.8 2 Time Different representation approaches... E4896 Music Signal Processing (Dan Ellis) 213-2-18-14/16

5. Limitations The spectrogram (mag STFT) is not linear superpositions suffer from phase effects freq / Hz 14 13 abs(stft(s1)) + abs(stft(s2)) 14 13 abs(stft(s1+s2)) 25 2 12 11 12 11 15 1 1 1 9 9 5 8.5 1 8.5 1 time / sec Separating sources is generally hard... parameters tracking E4896 Music Signal Processing (Dan Ellis) 213-2-18-15/16

Summary Spectrogram shows sinusoid harmonics in many sounds Peak picking in spectrogram can effectively extract them Sinusoidal domain extremely flexible for modification Noise residual can add even more realism E4896 Music Signal Processing (Dan Ellis) 213-2-18-16/16