Lecture 6. Rhythm Analysis. (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller)

Similar documents
Rhythm Analysis in Music

Rhythm Analysis in Music

Rhythm Analysis in Music

Tempo and Beat Tracking

Tempo and Beat Tracking

BEAT DETECTION BY DYNAMIC PROGRAMMING. Racquel Ivy Awuor

Music Signal Processing

COMPUTATIONAL RHYTHM AND BEAT ANALYSIS Nicholas Berkner. University of Rochester

Rhythmic Similarity -- a quick paper review. Presented by: Shi Yong March 15, 2007 Music Technology, McGill University

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

Drum Transcription Based on Independent Subspace Analysis

Onset Detection Revisited

REpeating Pattern Extraction Technique (REPET)

Energy-Weighted Multi-Band Novelty Functions for Onset Detection in Piano Music

Onset detection and Attack Phase Descriptors. IMV Signal Processing Meetup, 16 March 2017

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

A SEGMENTATION-BASED TEMPO INDUCTION METHOD

Real-time beat estimation using feature extraction

Research on Extracting BPM Feature Values in Music Beat Tracking Algorithm

SGN Audio and Speech Processing

Survey Paper on Music Beat Tracking

Transcription of Piano Music

SGN Audio and Speech Processing

MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting

Assessment Schedule 2014 Music: Demonstrate knowledge of conventions used in music scores (91094)

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

Applications of Music Processing

Biomedical Signals. Signals and Images in Medicine Dr Nabeel Anwar

POLYPHONIC PITCH DETECTION BY MATCHING SPECTRAL AND AUTOCORRELATION PEAKS. Sebastian Kraft, Udo Zölzer

Sound Synthesis Methods

Lecture 3: Audio Applications

Singing Voice Detection. Applications of Music Processing. Singing Voice Detection. Singing Voice Detection. Singing Voice Detection

DSP First. Laboratory Exercise #11. Extracting Frequencies of Musical Tones

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

SUB-BAND INDEPENDENT SUBSPACE ANALYSIS FOR DRUM TRANSCRIPTION. Derry FitzGerald, Eugene Coyle

Signal Processing First Lab 20: Extracting Frequencies of Musical Tones

Automatic Processing of Dance Dance Revolution

Advanced audio analysis. Martin Gasser

Automatic Transcription of Monophonic Audio to MIDI

Speech/Music Change Point Detection using Sonogram and AANN

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing


Using Audio Onset Detection Algorithms

Accurate Tempo Estimation based on Recurrent Neural Networks and Resonating Comb Filters

Limitations of Sum-of-Sinusoid Signals

Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events

EE 464 Short-Time Fourier Transform Fall and Spectrogram. Many signals of importance have spectral content that

EVALUATING THE ONLINE CAPABILITIES OF ONSET DETECTION METHODS

Lecture 5: Sinusoidal Modeling

MUSICAL GENRE CLASSIFICATION OF AUDIO DATA USING SOURCE SEPARATION TECHNIQUES. P.S. Lampropoulou, A.S. Lampropoulos and G.A.

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

DISCRIMINATION OF SITAR AND TABLA STROKES IN INSTRUMENTAL CONCERTS USING SPECTRAL FEATURES

AMUSIC signal can be considered as a succession of musical

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

Converting Speaking Voice into Singing Voice

University of Southern Queensland Faculty of Health, Engineering & Sciences. Investigation of Digital Audio Manipulation Methods

Harmonic Percussive Source Separation

Lecture 5: Pitch and Chord (1) Chord Recognition. Li Su

Pitch Estimation of Singing Voice From Monaural Popular Music Recordings

Musical tempo estimation using noise subspace projections

Time- frequency Masking

MUSIC is to a great extent an event-based phenomenon for

Deep learning architectures for music audio classification: a personal (re)view

Query by Singing and Humming

Comparison of Multirate two-channel Quadrature Mirror Filter Bank with FIR Filters Based Multiband Dynamic Range Control for audio

AUTOMATED MUSIC TRACK GENERATION

ENHANCED BEAT TRACKING WITH CONTEXT-AWARE NEURAL NETWORKS

Guitar Music Transcription from Silent Video. Temporal Segmentation - Implementation Details

CHORD DETECTION USING CHROMAGRAM OPTIMIZED BY EXTRACTING ADDITIONAL FEATURES

Audio Similarity. Mark Zadel MUMT 611 March 8, Audio Similarity p.1/23

OBTAIN: Real-Time Beat Tracking in Audio Signals

CMPT 468: Frequency Modulation (FM) Synthesis

LAB 2 Machine Perception of Music Computer Science 395, Winter Quarter 2005

EE482: Digital Signal Processing Applications

Lecture 6: Nonspeech and Music. Music & nonspeech

Lecture 6: Nonspeech and Music

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Drum Leveler. User Manual. Drum Leveler v Sound Radix Ltd. All Rights Reserved

Advanced Music Content Analysis

UNIT-4 POWER QUALITY MONITORING

Chapter 6: Power Amplifiers

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

ONLINE REPET-SIM FOR REAL-TIME SPEECH ENHANCEMENT

Since the advent of the sine wave oscillator

MULTI-FEATURE MODELING OF PULSE CLARITY: DESIGN, VALIDATION AND OPTIMIZATION

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

Automatic Evaluation of Hindustani Learner s SARGAM Practice

A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis

ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION. Frank Kurth, Alessia Cornaggia-Urrigshardt and Sebastian Urrigshardt

A MULTI-MODEL APPROACH TO BEAT TRACKING CONSIDERING HETEROGENEOUS MUSIC STYLES

TWO-DIMENSIONAL FOURIER PROCESSING OF RASTERISED AUDIO

Singing Expression Transfer from One Voice to Another for a Given Song

Digital Speech Processing and Coding

Supplementary Materials for

Real-time Drums Transcription with Characteristic Bandpass Filtering

ELT Receiver Architectures and Signal Processing Fall Mandatory homework exercises

A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification

FFT analysis in practice

PROBLEM SET 6. Note: This version is preliminary in that it does not yet have instructions for uploading the MATLAB problems.

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

Transcription:

Lecture 6 Rhythm Analysis (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller)

Definitions for Rhythm Analysis Rhythm: movement marked by the regulated succession of strong and weak elements, or of opposite or different conditions. Beat: basic unit of time in music ---- Oxford English Dictionary Tempo: speed or pace of a given piece, typically measured in beats per minute (BPM) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 2

More Definitions Onset: single instant marking the beginning of transient Onsets often occur on beats. Attack: sharp increase of energy Transient: a short duration with high amplitude within which signal evolves quickly Waveform of one piano note ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 3

More Definitions Measure (or bar): segment of time defined by a given number of beats A 4-beat measure drum pattern. [http://en.wikipedia.org/wiki/metre_(music)] ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 4

More Definitions Meter: organization of music into regularly recurring measures of stressed and unstressed beats Hypermeter: 4-beat measure and 4-measure hypermeasure. Hyperbeats in red. [http://en.wikipedia.org/wiki/metre_(music)] ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 5

Rhythm Analysis Tasks Onset Detection Beat Tracking Tempo Estimation Higher-level Structure Analysis ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 6

Intellectual merit Why is it important? Important component of music understanding Music cognition research Broad applications Identify/classify/retrieve by rhythmic similarity Music segmentation/summarization Audio/video synchronization Source separation ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 7

Signal processing: define a detection function Energy-based Spectral-based Phase-based Machine Learning: learn patterns from labeled data Probabilistic models Neural networks Onset Detection ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 8

Energy-based Onset Detection Waveform Signal Envelope (energy) Envelope Derivative (half-wave rectified) Thresholding Onsets ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 9

Energy-based Onset Detection Pros and Cons Simple Works well for percussive sounds Soft onsets by string/wind instruments are hard to detect Tremolo/vibrato can cause false detections How to improve Use logarithmic-energy to replace linear energy Perform analysis in different frequency bands, then summarize ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 0

Spectral-based Onset Detection STFT to get magnitude spectrogram χ (optional) compression Spectral flux: Take derivative w.r.t. time (half-wave rectified) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208

Spectral-based Onset Detection Pros and Cons More complex than energy-based Can weigh different frequencies differently Works better for soft onsets (e.g., legato notes) and polyphonic music Still doesn t work very well for vibrato ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 2

Tempo Estimation Tempo = beats / minutes Beat tracking is sufficient but not necessary condition for tempo estimation How to estimate tempo without tracking beats? Idea: look at the regularity of onsets Assumptions Onsets mostly occur on beats Tempo is constant within a period of time ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 3

Tempo Estimation Onset strength curve Onsets Take the onset strength curve and analyze its periodicity Autocorrelation STFT Tempogram ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 4

Beat Tracking Identify the beat times, i.e., the times to which we tap our feet Detected onsets provide useful but noisy information, since not all onsets are on beats. Estimated tempo tells us the space between two beats, but not the exact locations (i.e., phase). How to identify beats? To simply the problem, we assume Onsets, especially strong ones, are mostly on beats. Tempo is constant. ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 5

A 2-step approach Beat Tracking Step : Tempo estimation Step 2: Identify beats from onsets using the tempo Create an impulse train (i.e., comb ) with the tempo Cross-correlate the comb with the onset strength curve. The lag that gives us the highest cross-correlation value tells us the beat phase. ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 6

Beat Tracking A 2-step approach, illustration Onset strength curve Combs with the same tempo but different phases Problem: too rigid about beat spacing ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 7

Beat Tracking by Dynamic Programming Beat tracking: finding a sequence of beat locations such that they Score function ) are well aligned with strong onsets 2) mostly regularly spaced [Ellis, 2007] Rough estimate of beat spacing Beat sequence Onset strength Regularity penalty function Find B = (b, b 2,, b L ) that maximizes S(B) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 8

Beat Tracking by Dynamic Programming Suppose beat locations are precise to audio frames, and suppose there are N frames, then how many possible sequences? 2 N (although many are bad ones!) Can t enumerate all! Key idea: reuse calculations by recursion! ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 9

Beat Tracking by Dynamic Programming Consider a beat sequence B n = b, b 2,, b L where b L = n. Let D(n) be the maximal score over all such sequences ending at n. Then if L > if L = recursion ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 20

Beat Tracking by Dynamic Programming Considering the two cases, we have We can calculate D(n) from D =. Record the preceding beat Best score Trace back from to get the best sequence ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 2

Rhythmic Structure Tatom Tactus Measure 0-0.5.5 2 2.5 time (s) Beginning of Another one bites the dust by Queen. One approach: detect onsets; analyze tempo and beats at different levels. Another approach: analyze repetition of spectral content Beat spectrum ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 22

Definition Beat Spectrum Using the autocorrelation function, we can derive the beat spectrum [Foote et al., 200] Beginning of Another one bites the dust by Queen. 0-2 4 6 8 0 2 4 6 8 time (s) 0.5 0 0 2 4 6 8 0 2 4 6 lag (s) Beat Spectrum. ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 23

Use Beat Spectrum The beat spectrum reveals the hierarchically periodically repeating structure of the audio Periodicity at the measure level Sub-periodicity at the kick level 0 - Beginning of Another one bites the dust by Queen. 2 4 6 8 0 2 4 6 8 time (s) Sub-periodicity at the beat level 0.5 0 0 2 4 6 8 0 2 4 6 lag (s) Beat Spectrum. ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 24

Calculation 0 - Beat Spectrum Compute the power spectrogram from the audio using the STFT (square of magnitude spectrogram) Audio 2 4 6 8 0 2 4 6 8 time (s) frequency (khz) 2.5 0.5 x 0 4 Power spectrogram 2 4 6 8 0 2 4 6 8 time (s) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 25

Calculation Beat Spectrum Compute the autocorrelation of the rows of the spectrogram x 0 4 Power spectrogram x 0 4 Autocorrelation plots 2 2 frequency (khz).5 0.5 frequency (khz).5 0.5 2 2 4 6 8 0 2 4 6 8 time (s) Spectrogram at 0 khz 0 2 4 6 8 0 2 4 6 lag (s) Autocorrelation at 0 khz 0.5 0 2 4 6 8 0 2 4 6 8 time (s) 0 0 2 4 6 8 0 2 4 6 lag (s) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 26

Calculation Beat Spectrum Compute the mean of the autocorrelations (of the rows) x 0 4 Power spectrogram x 0 4 Autocorrelation plots 2 2 frequency (khz).5 0.5 frequency (khz).5 0.5 2 4 6 8 0 2 4 6 8 time (s) 0 2 4 6 8 0 2 4 6 lag (s) Beat spectrum 0.5 0 0 2 4 6 8 0 2 4 6 lag (s) ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 27

Notes Beat Spectrum The first highest peak in the beat spectrum does not always correspond to the repeating period! The beat spectrum does not indicate where the beats are or when a measure starts! This is how you find the period 0.5 0 0 2 4 6 8 0 2 4 6 lag (s) This is not Beat Spectrum. the period ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 28

Resources Some interesting links Dannenberg s articles on beat tracking: http://www.cs.cmu.edu/~rbd/bib-beattrack.html Goto s work on beat tracking: http://staff.aist.go.jp/m.goto/proj/bts.html Ellis Matlab codes for tempo estimation and beat tracking: http://labrosa.ee.columbia.edu/projects/beattrack/ MIREX s annual evaluation campaign for Music Information Retrieval (MIR) algorithms, including tasks such as onset detection, tempo extraction, and beat tracking: http://www.musicir.org/mirex/wiki/mirex_home ECE 272/472 (AME 272, TEE 272) Audio Signal Processing, Zhiyao Duan, 208 29