SINUSOIDAL MODELING. EE6641 Analysis and Synthesis of Audio Signals. Yi-Wen Liu Nov 3, 2015

Similar documents
A Parametric Model for Spectral Sound Synthesis of Musical Sounds

REAL-TIME BROADBAND NOISE REDUCTION

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Chapter 5 Window Functions. periodic with a period of N (number of samples). This is observed in table (3.1).

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Acoustics, signals & systems for audiology. Week 4. Signals through Systems

Lecture 5: Sinusoidal Modeling

Identification of Nonstationary Audio Signals Using the FFT, with Application to Analysis-based Synthesis of Sound

DISCRETE FOURIER TRANSFORM AND FILTER DESIGN

Final Exam Practice Questions for Music 421, with Solutions

Signals & Systems for Speech & Hearing. Week 6. Practical spectral analysis. Bandpass filters & filterbanks. Try this out on an old friend

Discrete Fourier Transform (DFT)

Design of FIR Filter for Efficient Utilization of Speech Signal Akanksha. Raj 1 Arshiyanaz. Khateeb 2 Fakrunnisa.Balaganur 3

Advanced audio analysis. Martin Gasser

Different Approaches of Spectral Subtraction Method for Speech Enhancement

SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Complex Sounds. Reading: Yost Ch. 4

Communications Theory and Engineering

Timbral Distortion in Inverse FFT Synthesis

URBANA-CHAMPAIGN. CS 498PS Audio Computing Lab. Audio DSP basics. Paris Smaragdis. paris.cs.illinois.

PART II Practical problems in the spectral analysis of speech signals

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

HIGH ACCURACY FRAME-BY-FRAME NON-STATIONARY SINUSOIDAL MODELLING

Auditory modelling for speech processing in the perceptual domain

COMBINING ADVANCED SINUSOIDAL AND WAVEFORM MATCHING MODELS FOR PARAMETRIC AUDIO/SPEECH CODING

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

Topic. Spectrogram Chromagram Cesptrogram. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio

Topic 6. The Digital Fourier Transform. (Based, in part, on The Scientist and Engineer's Guide to Digital Signal Processing by Steven Smith)

You know about adding up waves, e.g. from two loudspeakers. AUDL 4007 Auditory Perception. Week 2½. Mathematical prelude: Adding up levels

EE 422G - Signals and Systems Laboratory

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

CMPT 368: Lecture 4 Amplitude Modulation (AM) Synthesis

Lab week 4: Harmonic Synthesis

Lecture 6: Nonspeech and Music

Sinusoidal Modeling. summer 2006 lecture on analysis, modeling and transformation of audio signals

Fourier Methods of Spectral Estimation

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

3.2 Measuring Frequency Response Of Low-Pass Filter :

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Signals and Systems Using MATLAB

Signal Characterization in terms of Sinusoidal and Non-Sinusoidal Components

Spectral analysis based synthesis and transformation of digital sound: the ATSH program

ALTERNATING CURRENT (AC)

Department of Electronic Engineering NED University of Engineering & Technology. LABORATORY WORKBOOK For the Course SIGNALS & SYSTEMS (TC-202)

AUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS)

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment

Monophony/Polyphony Classification System using Fourier of Fourier Transform

Introduction to Wavelets. For sensor data processing

Frequency slope estimation and its application for non-stationary sinusoidal parameter estimation

A NEW APPROACH TO TRANSIENT PROCESSING IN THE PHASE VOCODER. Axel Röbel. IRCAM, Analysis-Synthesis Team, France

Topic 2. Signal Processing Review. (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music)

Effect of filter spacing and correct tonotopic representation on melody recognition: Implications for cochlear implants

Lab 4 Fourier Series and the Gibbs Phenomenon

Sound Synthesis Methods

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting

Definition of Sound. Sound. Vibration. Period - Frequency. Waveform. Parameters. SPA Lundeen

The psychoacoustics of reverberation

Evaluation of Audio Compression Artifacts M. Herrera Martinez

SAMPLING THEORY. Representing continuous signals with discrete numbers

Qäf) Newnes f-s^j^s. Digital Signal Processing. A Practical Guide for Engineers and Scientists. by Steven W. Smith

Audible Aliasing Distortion in Digital Audio Synthesis

Biomedical Signals. Signals and Images in Medicine Dr Nabeel Anwar

The Fundamentals of FFT-Based Signal Analysis and Measurement Michael Cerna and Audrey F. Harvey

G(f ) = g(t) dt. e i2πft. = cos(2πf t) + i sin(2πf t)

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

VIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering

Music 171: Amplitude Modulation

Subtractive Synthesis without Filters

Laboratory Assignment 4. Fourier Sound Synthesis

Windows Connections. Preliminaries

Enhanced Waveform Interpolative Coding at 4 kbps

Sound waves. septembre 2014 Audio signals and systems 1

Signal processing preliminaries

TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis

VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL

FFT 1 /n octave analysis wavelet

Adaptive Filters Application of Linear Prediction

AM/FM Rate Estimation and Bias Correction for Time-Varying Sinusoidal Modeling

Signals, Sound, and Sensation

Interpolation Error in Waveform Table Lookup

ON THE VALIDITY OF THE NOISE MODEL OF QUANTIZATION FOR THE FREQUENCY-DOMAIN AMPLITUDE ESTIMATION OF LOW-LEVEL SINE WAVES

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Performance Analysis of Windowing Techniques in Automatic Speech Signal Segmentation

Resonator Factoring. Julius Smith and Nelson Lee

PART I: The questions in Part I refer to the aliasing portion of the procedure as outlined in the lab manual.

6 Sampling. Sampling. The principles of sampling, especially the benefits of coherent sampling

Laboratory Experiment #1 Introduction to Spectral Analysis

Measurement of RMS values of non-coherently sampled signals. Martin Novotny 1, Milos Sedlacek 2

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

Signal Processing Toolbox

Carrier Frequency Offset Estimation in WCDMA Systems Using a Modified FFT-Based Algorithm

An introduction to physics of Sound

TRANSFORMS / WAVELETS

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 14 Timbre / Tone quality II

Transcription:

1 SINUSOIDAL MODELING EE6641 Analysis and Synthesis of Audio Signals Yi-Wen Liu Nov 3, 2015

2 Last time: Spectral Estimation Resolution Scenario: multiple peaks in the spectrum Choice of window type and window length Rule of thumb: Dw = O(1/Dt) Accuracy Scenario: signal in noise Quadratic interpolation of FFT log-magnitude Fisher Information and Cramer-Rao lower bounds Coherent frequency estimation: Dw = O(Dt -3/2 )

3 Today s agenda Sinusoidal modeling Analysis Formula for QI-FFT Choice of window Peak tracking Synthesis Applications Spectral decomposition (S + N) vs. (S + N + T) Spectral subtraction

4 QI-FFT: a fast frequency estimator Windowing: time-domain multiplication with a shaping function Zero-padding: append zeros in time interpolate in frequency FFT: apply fast Fourier transform Peak detection: find every peak Quadratic interpolation: fit a parabola in log-magnitude The location of the parabola is the frequency estimate. db Reference: M. Abe and J. Smith, Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks, (AES 2004) Hz

5 A formula for QI-FFT db -1 0 1 k

6 Example: finding peaks in a spectrum Peaks can move Peaks can grow or attenuate Need to ignore side lobes But how? 100 db SPL 80 60 40 Signal magnitude spectrum Valid peaks Masking curve 20 0 0 2000 4000 6000 8000 10000 12000 Hz

Blackman window has > 50 db side-lobe suppression All side lobes beneath the masking threshold. 7

8 Example: Hann-windowed transform of trumpet sounds signal spectrum Masked region Hz

9 Psychoacoustic masking Masking refers to the fact that softer sounds cannot be heard due to the presence of stronger sounds. Forward masking Simultaneous Backward masking We will cover psychoacoustics later in this semester.

Sinusoidal Modeling: from discrete peaks to trajectories 10

11 Trajectory formation Basic peak tracking involves: Finding shortest link Resolving splits Advanced technique includes Forbidding large jumps Transient detection Highest peak chooses first

12 Frequency-trajectories of a speech signal, arrows indicate transient regions

13 Part II: Synthesis Sinusoidal modeling Analysis Synthesis Linear-interpolation Phase continuation A window-based synthesis method Spectral decomposition (S + N) vs. (S + N + T) Spectral subtraction Noise modeling (Nov. 16, 23)

14 Sinusoidal synthesis (ii): Sinusoidal synthesis (ii): the linear interpolation method the linear interpolation method..an amplitude envelope..an amplitude envelope..a frequency envelope..a frequency envelope Between time mh and (m+1)h, do this: A n = βa + (1 β)a f n = βf + 1 β f β = A m m A[n] A[n] A A m+1 m+1 Remark: Remark: amplitude amplitude can can be log be log or or linear linear scale. scale. 14

The linear-interpolation method (cont d) 15

From one trajectory to a signal: a window-based method Synthesis:..an amplitude envelope..a frequency envelope Continuous phase updates: time A m ht 16

17 summation over all trajectories time A m ht

18 Applications of sinusoidal modeling Noise removal/ speech enhancement Potentials in hearing aids/ cochlear implants Musical effects: Pitch shifting/ time warping Parametric audio coding: MPEG-4 structured audio B. Vercoe et al. (1998). Structured audio: Creation, transmission, and rendering of parametric sound representations, Proc. IEEE, Vol. 86, No. 5, 922-40.

19 Part III: Spectral decomposition Sinusoidal modeling Analysis Synthesis Linear-interpolation Phase continuation A window-based synthesis method Signal decomposition Spectral subtraction (S + N) vs. (S + N + T) Noise modeling (Possible final project idea)

Example: a watermarking system based on signal decomposition (Liu & Smith, 2007) 20

21 Sines + Noise + Transient (S+N+T) decomposition Sinsusoids

22 Spectral subtraction For each peak in the spectrum, do 1. fit the mainlobe of Blackman window 2. subtract the mainlobe 3. Inverse FFT to synthesis the residual ( noise )

(db) 100 80 60 40 20 Sines + noise decomposition by spectral subtraction Signal Signal 0 2k 4k 6k 8k 10k (Hz) 0 2k 4k 6k 8k 10k (Hz) Residual Residual (a) Trumpet (b) Dance music Parametric noise modeling? 23

24 Transient detection Sinsusoid s

25 Transient detection: there s no gold standard The following is just a heuristic approach Energy comparison Sine-to-residual ratio (SRR)

Entering and exiting the transients Once a transient is detected, sinusoidal model is halted for a period of time. 26

27 Possible project ideas Trajectory formation Causal implementation for real-time applications Applications of spectral subtraction Noise-removal / Speech enhancement Audio recognition Sound source segregation Audio fingerprinting/ watermarking And so on

28