Drum Transcription Based on Independent Subspace Analysis
|
|
- Edward Thomas
- 5 years ago
- Views:
Transcription
1 Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford, CA ABSTRACT In automatic music transcription, metadata extraction from recorded audio data or speaker separation in video conferencing, it is a significant prerequisite task to analyze and separate the audio signal into their original source components. In this report, I study and analyze a set of methods of the extraction of percussive instruments metadata from polyphonic music. It mainly focuses on the stage of audio source separation, which consists of methods of Principal Components Analysis, Independent Components Analysis, and Non-negative Matrix Factorization. With this spectrogram decomposition method, different samples of music have been analyzed. The results show very encouraging when considering the extraction of non-pitch information rather than perfect note-to-note transcription. 1. MOTIVATION Rhythm is an essential concept for musical structure, and drum scores are prerequisite for further high level description of any rhythmical content, in that percussive instruments contribute to the rhythmical impression. Drum transcription is described by means of symbolic metadata including onsets timing and the types of drum. This information of rhythmical patterns enables further categorization of musical content such as genre classification and music moods analysis. Also, the measurement of less subjective music elements like tempo and musical meter significantly benefits from the availability of a drum score as well. In addition, the techniques of drum replacement in audio and automatic generation of drum score from recorded music have become increasingly popular in current musical entertainment industries like video games and iphone applications. Thus, automated transcription of the drum score is able to contribute to today s music retrieval algorithms immensely, and stimulates the development of varieties of applications in audio industry as well.
2 2. SYSTEM OVERVIEW PCM Audio Signals Time Frequency Transformation Peaks Picking & Onsets Detection Principal Components Analysis (PCA) Non-negative Independent Components Analysis (ICA) Non-negative Matrix Factorization (NMF) Features Extraction Sources Classification & Onsets Acceptance Training Sources Spectral Profiles Symbolic Data transcription & Midi Synthesis Drum Scores Figure 1 Drum Transcription System Overview An overview of the drum transcription system is presented in figure 1. The digital audio signals used for further signal processing chain are mono files with 16bit per sample at a sampling frequency of 44.1 khz. A spectral representation of the pre-processed time signal is computed using a Short Time Fourier Transformation (STFT). After the process of differentiation and half-wave rectification of the magnitude spectrogram, a non-negative difference-spectrogram is computed for further processing. Then, the detection of multiple local maxima associated with transient onset events in the musical signal is conducted in a basic peak picking method. The main concept of the further process is the storage of a short excerpt of the difference-spectrogram at the time of the onset t. From these onset frames the significant spectral profiles will be gathered in the next stages of PCA for dimensionality reduction and ICA for audio source separation. The
3 subsequent sections will give a more in depth account of the source separation stage endorsed the whole signal processing chain. 3. AUDIO SOURCE SEPARATION 3.1 Principal Components Analysis (PCA) Principal Components Analysis is a way of identifying patterns in data, and expressing the data in such a way as to highlight their similarities and differences. Since patterns can be difficult to find in high dimensional data, such graphical, audio or video representation, PCA is a powerful method for analyzing data. The other advantage is PCA can reduce the dimensionality of the original data without losing much information. From the preceding steps the information about the time of occurrence t and the spectral composition of the onsets t is deduced. In order to find only a limited number of significant pattern subspaces of the deduced high dimensional onsets spectra, Principal Component Analysis (PCA) is applied to reduce the dimensionality of the percussive sources. By using PCA, the whole set of collected spectra can be broken down to a relatively small number of de-correlated principal components, thus resulting in a good representation of the original spectra with small reconstruction error. First, we calculate the covariance matrix of the original spectra vectors subtracted by their means. Next we compute the covariance matrix s eigenvectors which is orthogonal to each others, and their identical eigenvalues. From the set of eigenvectors the ones related to the d largest eigenvalues are chosen to provide the coefficients for a linear combination of the original spectra vectors according to equation X X t T, where T describes a transformation matrix for the dimensionality reduction, since it turns out that the eigenvectors with the highest eigenvalues is the principle components of the data set. The components in X are decorrelated to each others and are also variance normalized. It can be subsequentially put into the ICA stage in the next section. 3.2 Non-negative Independent Components Analysis (ICA) Non-negative Independent Components Analysis is one of the approaches of Independent Component Analysis to separate a set of linear mix signals into their original sources. A requirement for optimum performance of the algorithm is the statistical independency of the sources, which can be satisfied by the process of PCA. Non-Negative ICA uses the very intuitive concept of optimizing a cost function describing the non-negativity of the components. This cost function is related to the reconstruction error introduced by axis pair rotations of two or more variables in the positive quadrant of the joint probability density function [2]. There are two assumptions for this ICA algorithm: the original source signals are both positive and to some extent linearly independent. The first constraint is always fulfilled because the vectors processed ICA are from the amplitude-spectrogram X, which has been differentiated
4 half-wave rectified in early stage, so it does not contain any negative values. For the second constraint, the spectra collected at onset times can be regarded as the superposition of a small set of original source spectra representing the involved percussive instruments. It may safely be assumed that there are some characteristic properties inherent to spectral profiles of drum sounds [4][1] that allow us to separate the whitened components into their potential percussive sources F according to equation F A X, where A denotes the d x d un-mixing matrix iterately estimated by the ICA optimization process[2], which actually separate the individual components X. The sources vectors F are named spectral profiles [4]. The original spectral profiles of involved percussive instruments for training reference are shown in figure 2, and the spectral profiles for one particular input sample are shown in figure 3. figure 2 spectral profiles reference figure 3 spectral profiles of input sample 3.3. Extraction of Amplitude Bases After computing a certain number of spectral profiles, they can be used to extract the spectrograms amplitude basis from here forward referred to as amplitude envelopes according to equation E F X, there is no further ICA computation applying on the extraction of the amplitude envelopes. But actually the extracted amplitude envelopes E, do offer decent detection functions with peaks and plateaus for the following detection and classification stage.
5 Figure 4 Extracted Amplitude Envelopes 3.4 Non-negative Matrix Factorization (NMF) Non-negative Matrix Factorization (NMF) [3] is another approach of Independent Component Analysis successfully used in several unsupervised learning tasks and also in the analysis of music signals. It is an identical method for computing the sources spectral profiles and amplitude envelopes. In the case of music signals, NMF has been used to separate the input signal into a sum of sources, each of which has a fixed spectrum and a time-varying gain. This model suits quite well for representing drum signals. The signal model for spectrum X t ( f ) in frame t can be N t ( n, t n 1 written as a weighted sum of source spectra S n ( f ), X f ) a S ( f ), where the ( f ) actually the spectral profiles of the involved drum sounds deducted from the ICA computation, n S n is and a n, t are identical to the extracted amplitude envelopes discussed in last section, as percussive events detection functions with peaks and plateaus. a, In NMF, both the spectra S n ( f ) and gains n t are restricted to be non-negative. In the case of audio source separation, this means the spectrograms are purely additive. It has turned out that the non-negativity constraint alone is sufficient for separating sources [5]. The method of NMF is similar to the computation of ICA, minimizing a cost function between the observed spectrum and the model to converge. The divergence is minimized by an iterative algorithm, which uses multiplicative updates [3]. The main difference between ICA and NMF is that in NMF both spectral profiles and gain envelops are obtained by iterative estimation algorithm. 3.5 Components Classification and Onsets Acceptance The assignment of spectral profiles to the pre-trained profiles of drum instruments is provided by a simple k-nearest neighbor classifier with spectral profiles of single percussive instruments as training database. The distance function is calculated from the correlation coefficient between reference profile and incoming profile. Drum-like onsets events are detected in the amplitude envelopes by using traditional peak picking approach. The value of the amplitude envelope s magnitude is assigned to every onset candidate at its position. If this value exceeds a predetermined adaptive threshold then the onset is accepted. The threshold varies over time according to the amount of energy in a relatively larger range surrounding the onsets. 4. RESULTS To quantify the abilities of the algorithm, the ground truth drum scores of 20 excerpts were extracted from identical midi files as a reference. Each excerpt consists of 40 seconds duration at 44.1 khz sampling rate and 16 bits quantization resolution. Different musical genres are contained among these examples featuring rock, pop, latin, and rap. They were chosen because of their distinct musical characteristics, and the intention to confront the system with a significant variety
6 of possible percussive instruments and sounds. In this research, we only consider a limited number of percussive instruments, including bass drum, snare drum, hi-hat, cymbal and tom. The results shown in Table 1 is based on ICA algorithm (NMF is under implementation), and they are evaluated by typical statistical method of recall rate, precision rate and F-score. Class Precision Rate Recall Rate F-Score Bass Drum 90% 93% 91% Snare Drum 82% 88% 85% Hi-Hat 74% 85% 79% Cymbal 68% 73% 70% Tom 73% 84% 78% Table 1 Drum Transcription Results From the results above, we found that the recall rate is basically better than the precision rate, which means the detection system is over sensitive so that adds more non-drum events. In addition, the results of high frequency instruments like hi-hat and cymbals are less accurate in contrast to lower sounding drums. Because the presence of very prominent and dynamical harmonic sustained instruments like expressive singing voice or guitar solos tends to affect the purity of the separated sources, so that increase the number of found onsets. 5. CONCLUSIONS This report presented a source separation method of independent subspace analysis for automatic detection and classification of percussive instruments in recorded audio signals. The results are promising when considering the extraction of non-pitch information rather than perfect note-to-note transcription. Further improvements will be made with regards to the stage of classification and the onset acceptance by seeking more adaptive methods. In addition, training data of the spectral profiles have to be improved by using larger and more standard datasets. 6. REFERENCES [1] C. Uhle, C. Dittmar and T. Sporer, Extraction of Drum Tracks from polyphonic Music using Independent Subspace Analysis, in Proc. of the Fourth International Symposium on Independent Component Analysis, Nara, Japan, 2003 [2] M. Plumbley, Algorithms for Non-Negative Independent Component Analysis, in IEEE Transactions on Neural Networks, 14 (3), pp , May 2003 [3] D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In T. Leen, T. Dietterich, and V. Tresp, editors, Advances in Neural Information Processing Systems, volume 13, pages MIT Press, [4] C. Dittmar, C. Uhle, Further Steps towards Drum Transcription of Polyphonic Music, Proc. Of the AES 116th Convention, Berlin, 2004 [5] J. Paulus and T. Virtanen, Drum Transcription with non-negative spectrogram factorisation, submitted in EUSIPCO 2005, Antalya, Turkey, Sept
SUB-BAND INDEPENDENT SUBSPACE ANALYSIS FOR DRUM TRANSCRIPTION. Derry FitzGerald, Eugene Coyle
SUB-BAND INDEPENDEN SUBSPACE ANALYSIS FOR DRUM RANSCRIPION Derry FitzGerald, Eugene Coyle D.I.., Rathmines Rd, Dublin, Ireland derryfitzgerald@dit.ie eugene.coyle@dit.ie Bob Lawlor Department of Electronic
More informationRhythmic Similarity -- a quick paper review. Presented by: Shi Yong March 15, 2007 Music Technology, McGill University
Rhythmic Similarity -- a quick paper review Presented by: Shi Yong March 15, 2007 Music Technology, McGill University Contents Introduction Three examples J. Foote 2001, 2002 J. Paulus 2002 S. Dixon 2004
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationMUSICAL GENRE CLASSIFICATION OF AUDIO DATA USING SOURCE SEPARATION TECHNIQUES. P.S. Lampropoulou, A.S. Lampropoulos and G.A.
MUSICAL GENRE CLASSIFICATION OF AUDIO DATA USING SOURCE SEPARATION TECHNIQUES P.S. Lampropoulou, A.S. Lampropoulos and G.A. Tsihrintzis Department of Informatics, University of Piraeus 80 Karaoli & Dimitriou
More informationA Novel Approach to Separation of Musical Signal Sources by NMF
ICSP2014 Proceedings A Novel Approach to Separation of Musical Signal Sources by NMF Sakurako Yazawa Graduate School of Systems and Information Engineering, University of Tsukuba, Japan Masatoshi Hamanaka
More informationApplications of Music Processing
Lecture Music Processing Applications of Music Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Singing Voice Detection Important pre-requisite
More informationCOMPUTATIONAL RHYTHM AND BEAT ANALYSIS Nicholas Berkner. University of Rochester
COMPUTATIONAL RHYTHM AND BEAT ANALYSIS Nicholas Berkner University of Rochester ABSTRACT One of the most important applications in the field of music information processing is beat finding. Humans have
More informationAUTOMATED MUSIC TRACK GENERATION
AUTOMATED MUSIC TRACK GENERATION LOUIS EUGENE Stanford University leugene@stanford.edu GUILLAUME ROSTAING Stanford University rostaing@stanford.edu Abstract: This paper aims at presenting our method to
More informationAutomatic Transcription of Monophonic Audio to MIDI
Automatic Transcription of Monophonic Audio to MIDI Jiří Vass 1 and Hadas Ofir 2 1 Czech Technical University in Prague, Faculty of Electrical Engineering Department of Measurement vassj@fel.cvut.cz 2
More informationSinging Voice Detection. Applications of Music Processing. Singing Voice Detection. Singing Voice Detection. Singing Voice Detection
Detection Lecture usic Processing Applications of usic Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Important pre-requisite for: usic segmentation
More informationBEAT DETECTION BY DYNAMIC PROGRAMMING. Racquel Ivy Awuor
BEAT DETECTION BY DYNAMIC PROGRAMMING Racquel Ivy Awuor University of Rochester Department of Electrical and Computer Engineering Rochester, NY 14627 rawuor@ur.rochester.edu ABSTRACT A beat is a salient
More informationLecture 6. Rhythm Analysis. (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller)
Lecture 6 Rhythm Analysis (some slides are adapted from Zafar Rafii and some figures are from Meinard Mueller) Definitions for Rhythm Analysis Rhythm: movement marked by the regulated succession of strong
More informationA Parametric Model for Spectral Sound Synthesis of Musical Sounds
A Parametric Model for Spectral Sound Synthesis of Musical Sounds Cornelia Kreutzer University of Limerick ECE Department Limerick, Ireland cornelia.kreutzer@ul.ie Jacqueline Walker University of Limerick
More informationRhythm Analysis in Music
Rhythm Analysis in Music EECS 352: Machine Perception of Music & Audio Zafar Rafii, Winter 24 Some Definitions Rhythm movement marked by the regulated succession of strong and weak elements, or of opposite
More informationSUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES
SUPERVISED SIGNAL PROCESSING FOR SEPARATION AND INDEPENDENT GAIN CONTROL OF DIFFERENT PERCUSSION INSTRUMENTS USING A LIMITED NUMBER OF MICROPHONES SF Minhas A Barton P Gaydecki School of Electrical and
More informationAudio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands
Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,
More informationSurvey Paper on Music Beat Tracking
Survey Paper on Music Beat Tracking Vedshree Panchwadkar, Shravani Pande, Prof.Mr.Makarand Velankar Cummins College of Engg, Pune, India vedshreepd@gmail.com, shravni.pande@gmail.com, makarand_v@rediffmail.com
More informationREpeating Pattern Extraction Technique (REPET)
REpeating Pattern Extraction Technique (REPET) EECS 32: Machine Perception of Music & Audio Zafar RAFII, Spring 22 Repetition Repetition is a fundamental element in generating and perceiving structure
More informationA SEGMENTATION-BASED TEMPO INDUCTION METHOD
A SEGMENTATION-BASED TEMPO INDUCTION METHOD Maxime Le Coz, Helene Lachambre, Lionel Koenig and Regine Andre-Obrecht IRIT, Universite Paul Sabatier, 118 Route de Narbonne, F-31062 TOULOUSE CEDEX 9 {lecoz,lachambre,koenig,obrecht}@irit.fr
More informationAutoScore: The Automated Music Transcriber Project Proposal , Spring 2011 Group 1
AutoScore: The Automated Music Transcriber Project Proposal 18-551, Spring 2011 Group 1 Suyog Sonwalkar, Itthi Chatnuntawech ssonwalk@andrew.cmu.edu, ichatnun@andrew.cmu.edu May 1, 2011 Abstract This project
More informationPreeti Rao 2 nd CompMusicWorkshop, Istanbul 2012
Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 o Music signal characteristics o Perceptual attributes and acoustic properties o Signal representations for pitch detection o STFT o Sinusoidal model o
More informationOnset Detection Revisited
simon.dixon@ofai.at Austrian Research Institute for Artificial Intelligence Vienna, Austria 9th International Conference on Digital Audio Effects Outline Background and Motivation 1 Background and Motivation
More informationTranscription of Piano Music
Transcription of Piano Music Rudolf BRISUDA Slovak University of Technology in Bratislava Faculty of Informatics and Information Technologies Ilkovičova 2, 842 16 Bratislava, Slovakia xbrisuda@is.stuba.sk
More informationRhythm Analysis in Music
Rhythm Analysis in Music EECS 352: Machine Perception of Music & Audio Zafar RAFII, Spring 22 Some Definitions Rhythm movement marked by the regulated succession of strong and weak elements, or of opposite
More informationCombining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music
Combining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music Tuomas Virtanen, Annamaria Mesaros, Matti Ryynänen Department of Signal Processing,
More informationCHORD DETECTION USING CHROMAGRAM OPTIMIZED BY EXTRACTING ADDITIONAL FEATURES
CHORD DETECTION USING CHROMAGRAM OPTIMIZED BY EXTRACTING ADDITIONAL FEATURES Jean-Baptiste Rolland Steinberg Media Technologies GmbH jb.rolland@steinberg.de ABSTRACT This paper presents some concepts regarding
More informationMultiresolution Analysis of Connectivity
Multiresolution Analysis of Connectivity Atul Sajjanhar 1, Guojun Lu 2, Dengsheng Zhang 2, Tian Qi 3 1 School of Information Technology Deakin University 221 Burwood Highway Burwood, VIC 3125 Australia
More informationADAPTIVE NOISE LEVEL ESTIMATION
Proc. of the 9 th Int. Conference on Digital Audio Effects (DAFx-6), Montreal, Canada, September 18-2, 26 ADAPTIVE NOISE LEVEL ESTIMATION Chunghsin Yeh Analysis/Synthesis team IRCAM/CNRS-STMS, Paris, France
More informationSound Recognition. ~ CSE 352 Team 3 ~ Jason Park Evan Glover. Kevin Lui Aman Rawat. Prof. Anita Wasilewska
Sound Recognition ~ CSE 352 Team 3 ~ Jason Park Evan Glover Kevin Lui Aman Rawat Prof. Anita Wasilewska What is Sound? Sound is a vibration that propagates as a typically audible mechanical wave of pressure
More informationSpeech/Music Change Point Detection using Sonogram and AANN
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 6, Number 1 (2016), pp. 45-49 International Research Publications House http://www. irphouse.com Speech/Music Change
More informationApplication of Hilbert-Huang Transform in the Field of Power Quality Events Analysis Manish Kumar Saini 1 and Komal Dhamija 2 1,2
Application of Hilbert-Huang Transform in the Field of Power Quality Events Analysis Manish Kumar Saini 1 and Komal Dhamija 2 1,2 Department of Electrical Engineering, Deenbandhu Chhotu Ram University
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationAn Optimization of Audio Classification and Segmentation using GASOM Algorithm
An Optimization of Audio Classification and Segmentation using GASOM Algorithm Dabbabi Karim, Cherif Adnen Research Unity of Processing and Analysis of Electrical and Energetic Systems Faculty of Sciences
More informationAudio Imputation Using the Non-negative Hidden Markov Model
Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han 1,, Gautham J. Mysore 2, and Bryan Pardo 1 1 EECS Department, Northwestern University 2 Advanced Technology Labs, Adobe Systems Inc.
More informationChapter 4 SPEECH ENHANCEMENT
44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More information(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
Tools and Applications Chapter Intended Learning Outcomes: (i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
More informationSpectro-Temporal Methods in Primary Auditory Cortex David Klein Didier Depireux Jonathan Simon Shihab Shamma
Spectro-Temporal Methods in Primary Auditory Cortex David Klein Didier Depireux Jonathan Simon Shihab Shamma & Department of Electrical Engineering Supported in part by a MURI grant from the Office of
More informationSingle-channel Mixture Decomposition using Bayesian Harmonic Models
Single-channel Mixture Decomposition using Bayesian Harmonic Models Emmanuel Vincent and Mark D. Plumbley Electronic Engineering Department, Queen Mary, University of London Mile End Road, London E1 4NS,
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationMichael Clausen Frank Kurth University of Bonn. Proceedings of the Second International Conference on WEB Delivering of Music 2002 IEEE
Michael Clausen Frank Kurth University of Bonn Proceedings of the Second International Conference on WEB Delivering of Music 2002 IEEE 1 Andreas Ribbrock Frank Kurth University of Bonn 2 Introduction Data
More informationAdvanced audio analysis. Martin Gasser
Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high
More informationAdaptive noise level estimation
Adaptive noise level estimation Chunghsin Yeh, Axel Roebel To cite this version: Chunghsin Yeh, Axel Roebel. Adaptive noise level estimation. Workshop on Computer Music and Audio Technology (WOCMAT 6),
More informationUnited Codec. 1. Motivation/Background. 2. Overview. Mofei Zhu, Hugo Guo, Deepak Music 422 Winter 09 Stanford University.
United Codec Mofei Zhu, Hugo Guo, Deepak Music 422 Winter 09 Stanford University March 13, 2009 1. Motivation/Background The goal of this project is to build a perceptual audio coder for reducing the data
More informationLecture 14: Source Separation
ELEN E896 MUSIC SIGNAL PROCESSING Lecture 1: Source Separation 1. Sources, Mixtures, & Perception. Spatial Filtering 3. Time-Frequency Masking. Model-Based Separation Dan Ellis Dept. Electrical Engineering,
More informationAutomobile Independent Fault Detection based on Acoustic Emission Using FFT
SINCE2011 Singapore International NDT Conference & Exhibition, 3-4 November 2011 Automobile Independent Fault Detection based on Acoustic Emission Using FFT Hamid GHADERI 1, Peyman KABIRI 2 1 Intelligent
More informationPRIMARY-AMBIENT SOURCE SEPARATION FOR UPMIXING TO SURROUND SOUND SYSTEMS
PRIMARY-AMBIENT SOURCE SEPARATION FOR UPMIXING TO SURROUND SOUND SYSTEMS Karim M. Ibrahim National University of Singapore karim.ibrahim@comp.nus.edu.sg Mahmoud Allam Nile University mallam@nu.edu.eg ABSTRACT
More informationVoice Activity Detection
Voice Activity Detection Speech Processing Tom Bäckström Aalto University October 2015 Introduction Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class
More informationSONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS
SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS AKSHAY CHANDRASHEKARAN ANOOP RAMAKRISHNA akshayc@cmu.edu anoopr@andrew.cmu.edu ABHISHEK JAIN GE YANG ajain2@andrew.cmu.edu younger@cmu.edu NIDHI KOHLI R
More informationReal-time beat estimation using feature extraction
Real-time beat estimation using feature extraction Kristoffer Jensen and Tue Haste Andersen Department of Computer Science, University of Copenhagen Universitetsparken 1 DK-2100 Copenhagen, Denmark, {krist,haste}@diku.dk,
More informationALTERNATING CURRENT (AC)
ALL ABOUT NOISE ALTERNATING CURRENT (AC) Any type of electrical transmission where the current repeatedly changes direction, and the voltage varies between maxima and minima. Therefore, any electrical
More informationModern spectral analysis of non-stationary signals in power electronics
Modern spectral analysis of non-stationary signaln power electronics Zbigniew Leonowicz Wroclaw University of Technology I-7, pl. Grunwaldzki 3 5-37 Wroclaw, Poland ++48-7-36 leonowic@ipee.pwr.wroc.pl
More informationRadar Signal Classification Based on Cascade of STFT, PCA and Naïve Bayes
216 7th International Conference on Intelligent Systems, Modelling and Simulation Radar Signal Classification Based on Cascade of STFT, PCA and Naïve Bayes Yuanyuan Guo Department of Electronic Engineering
More informationSGN Audio and Speech Processing
Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations
More informationPOLYPHONIC PITCH DETECTION BY MATCHING SPECTRAL AND AUTOCORRELATION PEAKS. Sebastian Kraft, Udo Zölzer
POLYPHONIC PITCH DETECTION BY MATCHING SPECTRAL AND AUTOCORRELATION PEAKS Sebastian Kraft, Udo Zölzer Department of Signal Processing and Communications Helmut-Schmidt-University, Hamburg, Germany sebastian.kraft@hsu-hh.de
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationSIGNAL PROCESSING OF POWER QUALITY DISTURBANCES
SIGNAL PROCESSING OF POWER QUALITY DISTURBANCES MATH H. J. BOLLEN IRENE YU-HUA GU IEEE PRESS SERIES I 0N POWER ENGINEERING IEEE PRESS SERIES ON POWER ENGINEERING MOHAMED E. EL-HAWARY, SERIES EDITOR IEEE
More informationME 535 Course Project: Analysis of Audio Using PCA
ME 535 Course Project: Analysis of Audio Using PCA SAMEER MESHRAM (M.S.), 1725972 VARUN RUPCHANDANI (M.S.), 1728011 VIDUR VIJ (M.S.), 1727990 June 4, 2018 EXECUTIVE SUMMARY: In this project we perform
More informationMULTIPLE F0 ESTIMATION IN THE TRANSFORM DOMAIN
10th International Society for Music Information Retrieval Conference (ISMIR 2009 MULTIPLE F0 ESTIMATION IN THE TRANSFORM DOMAIN Christopher A. Santoro +* Corey I. Cheng *# + LSB Audio Tampa, FL 33610
More informationRemoval of ocular artifacts from EEG signals using adaptive threshold PCA and Wavelet transforms
Available online at www.interscience.in Removal of ocular artifacts from s using adaptive threshold PCA and Wavelet transforms P. Ashok Babu 1, K.V.S.V.R.Prasad 2 1 Narsimha Reddy Engineering College,
More informationSGN Audio and Speech Processing
SGN 14006 Audio and Speech Processing Introduction 1 Course goals Introduction 2! Learn basics of audio signal processing Basic operations and their underlying ideas and principles Give basic skills although
More informationVIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering
VIBRATO DETECTING ALGORITHM IN REAL TIME Minhao Zhang, Xinzhao Liu University of Rochester Department of Electrical and Computer Engineering ABSTRACT Vibrato is a fundamental expressive attribute in music,
More informationTIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis
TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis Cornelia Kreutzer, Jacqueline Walker Department of Electronic and Computer Engineering, University of Limerick, Limerick,
More informationMonophony/Polyphony Classification System using Fourier of Fourier Transform
International Journal of Electronics Engineering, 2 (2), 2010, pp. 299 303 Monophony/Polyphony Classification System using Fourier of Fourier Transform Kalyani Akant 1, Rajesh Pande 2, and S.S. Limaye
More informationMUSIC is to a great extent an event-based phenomenon for
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING 1 A Tutorial on Onset Detection in Music Signals Juan Pablo Bello, Laurent Daudet, Samer Abdallah, Chris Duxbury, Mike Davies, and Mark B. Sandler, Senior
More informationStudy Of Sound Source Localization Using Music Method In Real Acoustic Environment
International Journal of Electronics Engineering Research. ISSN 975-645 Volume 9, Number 4 (27) pp. 545-556 Research India Publications http://www.ripublication.com Study Of Sound Source Localization Using
More informationSound Synthesis Methods
Sound Synthesis Methods Matti Vihola, mvihola@cs.tut.fi 23rd August 2001 1 Objectives The objective of sound synthesis is to create sounds that are Musically interesting Preferably realistic (sounds like
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationLecture 5: Sinusoidal Modeling
ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 5: Sinusoidal Modeling 1. Sinusoidal Modeling 2. Sinusoidal Analysis 3. Sinusoidal Synthesis & Modification 4. Noise Residual Dan Ellis Dept. Electrical Engineering,
More informationICA for Musical Signal Separation
ICA for Musical Signal Separation Alex Favaro Aaron Lewis Garrett Schlesinger 1 Introduction When recording large musical groups it is often desirable to record the entire group at once with separate microphones
More informationENF ANALYSIS ON RECAPTURED AUDIO RECORDINGS
ENF ANALYSIS ON RECAPTURED AUDIO RECORDINGS Hui Su, Ravi Garg, Adi Hajj-Ahmad, and Min Wu {hsu, ravig, adiha, minwu}@umd.edu University of Maryland, College Park ABSTRACT Electric Network (ENF) based forensic
More information8.3 Basic Parameters for Audio
8.3 Basic Parameters for Audio Analysis Physical audio signal: simple one-dimensional amplitude = loudness frequency = pitch Psycho-acoustic features: complex A real-life tone arises from a complex superposition
More informationEXPLORING PHASE INFORMATION IN SOUND SOURCE SEPARATION APPLICATIONS
EXPLORING PHASE INFORMATION IN SOUND SOURCE SEPARATION APPLICATIONS Estefanía Cano, Gerald Schuller and Christian Dittmar Fraunhofer Institute for Digital Media Technology Ilmenau, Germany {cano,shl,dmr}@idmt.fraunhofer.de
More informationRoberto Togneri (Signal Processing and Recognition Lab)
Signal Processing and Machine Learning for Power Quality Disturbance Detection and Classification Roberto Togneri (Signal Processing and Recognition Lab) Power Quality (PQ) disturbances are broadly classified
More informationSpectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition
Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Author Shannon, Ben, Paliwal, Kuldip Published 25 Conference Title The 8th International Symposium
More informationRhythm Analysis in Music
Rhythm Analysis in Music EECS 352: Machine Percep;on of Music & Audio Zafar Rafii, Winter 24 Some Defini;ons Rhythm movement marked by the regulated succession of strong and weak elements, or of opposite
More informationReal-time Drums Transcription with Characteristic Bandpass Filtering
Real-time Drums Transcription with Characteristic Bandpass Filtering Maximos A. Kaliakatsos Papakostas Computational Intelligence Laboratoty (CILab), Department of Mathematics, University of Patras, GR
More informationEE 464 Short-Time Fourier Transform Fall and Spectrogram. Many signals of importance have spectral content that
EE 464 Short-Time Fourier Transform Fall 2018 Read Text, Chapter 4.9. and Spectrogram Many signals of importance have spectral content that changes with time. Let xx(nn), nn = 0, 1,, NN 1 1 be a discrete-time
More informationMusical tempo estimation using noise subspace projections
Musical tempo estimation using noise subspace projections Miguel Alonso Arevalo, Roland Badeau, Bertrand David, Gaël Richard To cite this version: Miguel Alonso Arevalo, Roland Badeau, Bertrand David,
More informationDISCRIMINATION OF SITAR AND TABLA STROKES IN INSTRUMENTAL CONCERTS USING SPECTRAL FEATURES
DISCRIMINATION OF SITAR AND TABLA STROKES IN INSTRUMENTAL CONCERTS USING SPECTRAL FEATURES Abstract Dhanvini Gudi, Vinutha T.P. and Preeti Rao Department of Electrical Engineering Indian Institute of Technology
More informationON MEASURING SYNCOPATION TO DRIVE AN INTERACTIVE MUSIC SYSTEM
ON MEASURING SYNCOPATION TO DRIVE AN INTERACTIVE MUSIC SYSTEM George Sioros André Holzapfel Carlos Guedes Music Technology Group, Universitat Pompeu Fabra hannover@csd.uoc.gr Faculdade de Engenharia da
More informationGSM Interference Cancellation For Forensic Audio
Application Report BACK April 2001 GSM Interference Cancellation For Forensic Audio Philip Harrison and Dr Boaz Rafaely (supervisor) Institute of Sound and Vibration Research (ISVR) University of Southampton,
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationMUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting
MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting Julius O. Smith III (jos@ccrma.stanford.edu) Center for Computer Research in Music and Acoustics (CCRMA)
More informationRemote Sensing. The following figure is grey scale display of SPOT Panchromatic without stretching.
Remote Sensing Objectives This unit will briefly explain display of remote sensing image, geometric correction, spatial enhancement, spectral enhancement and classification of remote sensing image. At
More informationDesign and Implementation on a Sub-band based Acoustic Echo Cancellation Approach
Vol., No. 6, 0 Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA chen.zhixin.mt@gmail.com Abstract This paper
More informationHungarian Speech Synthesis Using a Phase Exact HNM Approach
Hungarian Speech Synthesis Using a Phase Exact HNM Approach Kornél Kovács 1, András Kocsor 2, and László Tóth 3 Research Group on Artificial Intelligence of the Hungarian Academy of Sciences and University
More informationIntroduction to Wavelet Transform. Chapter 7 Instructor: Hossein Pourghassem
Introduction to Wavelet Transform Chapter 7 Instructor: Hossein Pourghassem Introduction Most of the signals in practice, are TIME-DOMAIN signals in their raw format. It means that measured signal is a
More informationMusic Signal Processing
Tutorial Music Signal Processing Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Anssi Klapuri Queen Mary University of London anssi.klapuri@elec.qmul.ac.uk Overview Part I:
More informationEnhancing 3D Audio Using Blind Bandwidth Extension
Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,
More informationANALYSIS OF ACOUSTIC FEATURES FOR AUTOMATED MULTI-TRACK MIXING
th International Society for Music Information Retrieval Conference (ISMIR ) ANALYSIS OF ACOUSTIC FEATURES FOR AUTOMATED MULTI-TRACK MIXING Jeffrey Scott, Youngmoo E. Kim Music and Entertainment Technology
More informationPerformance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue - 8 August, 2014 Page No. 7727-7732 Performance Analysis of MFCC and LPCC Techniques in Automatic
More informationAutomatic Evaluation of Hindustani Learner s SARGAM Practice
Automatic Evaluation of Hindustani Learner s SARGAM Practice Gurunath Reddy M and K. Sreenivasa Rao Indian Institute of Technology, Kharagpur, India {mgurunathreddy, ksrao}@sit.iitkgp.ernet.in Abstract
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationHIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM
HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand
More informationHyperspectral image processing and analysis
Hyperspectral image processing and analysis Lecture 12 www.utsa.edu/lrsg/teaching/ees5083/l12-hyper.ppt Multi- vs. Hyper- Hyper-: Narrow bands ( 20 nm in resolution or FWHM) and continuous measurements.
More informationLab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels
Lab 8. ANALYSIS OF COMPLEX SOUNDS AND SPEECH ANALYSIS Amplitude, loudness, and decibels A complex sound with particular frequency can be analyzed and quantified by its Fourier spectrum: the relative amplitudes
More informationAdaptive Sampling and Processing of Ultrasound Images
Adaptive Sampling and Processing of Ultrasound Images Paul Rodriguez V. and Marios S. Pattichis image and video Processing and Communication Laboratory (ivpcl) Department of Electrical and Computer Engineering,
More informationSpeaker and Noise Independent Voice Activity Detection
Speaker and Noise Independent Voice Activity Detection François G. Germain, Dennis L. Sun,2, Gautham J. Mysore 3 Center for Computer Research in Music and Acoustics, Stanford University, CA 9435 2 Department
More informationExperimental Study on Super-resolution Techniques for High-speed UWB Radar Imaging of Human Bodies
PIERS ONLINE, VOL. 5, NO. 6, 29 596 Experimental Study on Super-resolution Techniques for High-speed UWB Radar Imaging of Human Bodies T. Sakamoto, H. Taki, and T. Sato Graduate School of Informatics,
More information