I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes

Similar documents
KONKANI SPEECH RECOGNITION USING HILBERT-HUANG TRANSFORM

Empirical Mode Decomposition (EMD) of Turner Valley Airborne Gravity Data in the Foothills of Alberta, Canada

Empirical Mode Decomposition: Theory & Applications

Ensemble Empirical Mode Decomposition: An adaptive method for noise reduction

Hilbert-Huang Transform, its features and application to the audio signal Ing.Michal Verner

Baseline wander Removal in ECG using an efficient method of EMD in combination with wavelet

Atmospheric Signal Processing. using Wavelets and HHT

Empirical Mode Decomposition (EMD) of Turner Valley Airborne Gravity Data in the Foothills of Alberta, Canada*

Application of Hilbert-Huang Transform in the Field of Power Quality Events Analysis Manish Kumar Saini 1 and Komal Dhamija 2 1,2

Assessment of Power Quality Events by Empirical Mode Decomposition based Neural Network

ANALYSIS OF POWER SYSTEM LOW FREQUENCY OSCILLATION WITH EMPIRICAL MODE DECOMPOSITION

Sound Synthesis Methods

The Application of the Hilbert-Huang Transform in Through-wall Life Detection with UWB Impulse Radar

Seismic application of quality factor estimation using the peak frequency method and sparse time-frequency transforms

INDUCTION MOTOR MULTI-FAULT ANALYSIS BASED ON INTRINSIC MODE FUNCTIONS IN HILBERT-HUANG TRANSFORM

ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Study of Phase Relationships in ECoG Signals Using Hilbert-Huang Transforms

Feature Extraction of ECG Signal Using HHT Algorithm

AdaBoost based EMD as a De-Noising Technique in Time Delay Estimation Application

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

The Improved Algorithm of the EMD Decomposition Based on Cubic Spline Interpolation

NOVEL APPROACH FOR FINDING PITCH MARKERS IN SPEECH SIGNAL USING ENSEMBLE EMPIRICAL MODE DECOMPOSITION

Noise Reduction in Cochlear Implant using Empirical Mode Decomposition

Method for Mode Mixing Separation in Empirical Mode Decomposition

Hilbert-Huang Transform and Its Applications in Engineering and Biomedical Signal Analysis

The characteristic identification of disc brake squeal based on ensemble empirical mode decomposition

Mel Spectrum Analysis of Speech Recognition using Single Microphone

THE CITADEL THE MILITARY COLLEGE OF SOUTH CAROLINA. Department of Electrical and Computer Engineering. ELEC 423 Digital Signal Processing

Development of a New Signal Processing Diagnostic Tool for Vibration Signals Acquired in Transient Conditions

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES

Linear Frequency Modulation (FM) Chirp Signal. Chirp Signal cont. CMPT 468: Lecture 7 Frequency Modulation (FM) Synthesis

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

Impact of Time Varying Angular Frequency on the Separation of Instantaneous Power Components in Stand-alone Power Systems

Rail Structure Analysis by Empirical Mode Decomposition and Hilbert Huang Transform

EE216B: VLSI Signal Processing. Wavelets. Prof. Dejan Marković Shortcomings of the Fourier Transform (FT)

Study on the Application of HHT in Bridge Health Monitoring

AM-FM demodulation using zero crossings and local peaks

Investigation on Fault Detection for Split Torque Gearbox Using Acoustic Emission and Vibration Signals

Diagnosis of root cause for oscillations in closed-loop chemical process systems

HIGH ACCURACY FRAME-BY-FRAME NON-STATIONARY SINUSOIDAL MODELLING

HILBERT SPECTRAL ANALYSIS OF VOWELS USING INTRINSIC MODE FUNCTIONS. Phillip L. De Leon

IDENTIFICATION OF NONLINEAR SITE RESPONSE FROM TIME VARIATIONS OF THE PREDOMINANT FREQUENCY

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

8.3 Basic Parameters for Audio

ADDITIVE SYNTHESIS BASED ON THE CONTINUOUS WAVELET TRANSFORM: A SINUSOIDAL PLUS TRANSIENT MODEL

SUMMARY THEORY. VMD vs. EMD

INHARMONIC DISPERSION TUNABLE COMB FILTER DESIGN USING MODIFIED IIR BAND PASS TRANSFER FUNCTION

Music 270a: Modulation

Direction-Dependent Physical Modeling of Musical Instruments

Study on Multi-tone Signals for Design and Testing of Linear Circuits and Systems

Broken-Rotor-Bar Diagnosis for Induction Motors

Laboratory Assignment 4. Fourier Sound Synthesis

Auditory modelling for speech processing in the perceptual domain

Sound Modeling from the Analysis of Real Sounds

Tribology in Industry. Bearing Health Monitoring

Speech Synthesis using Mel-Cepstral Coefficient Feature

IMPROVED CODING OF TONAL COMPONENTS IN MPEG-4 AAC WITH SBR

MULTI-FAULT ANALYSIS IN INDUCTION MOTORS USING MULTI-SENSOR FEATURES

Spectrum. Additive Synthesis. Additive Synthesis Caveat. Music 270a: Modulation

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

CMPT 468: Frequency Modulation (FM) Synthesis

Random and coherent noise attenuation by empirical mode decomposition Maïza Bekara, PGS, and Mirko van der Baan, University of Leeds

A Novel Approach to Improve the Smoothening the Wind Profiler Doppler Spectra Using Empirical Mode Decomposition with Moving Average Method

Noise-robust compressed sensing method for superresolution

ICA & Wavelet as a Method for Speech Signal Denoising

NOISE CORRUPTION OF EMPIRICAL MODE DECOMPOSITION AND ITS EFFECT ON INSTANTANEOUS FREQUENCY

Pattern Recognition Part 2: Noise Suppression

DIAGNOSIS OF ROLLING ELEMENT BEARING FAULT IN BEARING-GEARBOX UNION SYSTEM USING WAVELET PACKET CORRELATION ANALYSIS

SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Audio Fingerprinting using Fractional Fourier Transform

Fundamental frequency estimation of speech signals using MUSIC algorithm

VIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering

AN AUTOREGRESSIVE BASED LFM REVERBERATION SUPPRESSION FOR RADAR AND SONAR APPLICATIONS

Professorial Inaugural Lecture, 26 April 2001 DIGITAL SYNTHESIS OF MUSICAL SOUNDS. B.T.G. Tan Department of Physics National University of Singapore

The Application of Energy Operator Demodulation Approach Based on EMD in Mechanical System Identification

Mode shape reconstruction of an impulse excited structure using continuous scanning laser Doppler vibrometer and empirical mode decomposition

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

Detecting position dependent tremor with the Empirical mode decomposition

An Adaptive Non-Linearity Detection Algorithm for Process Control Loops

Rolling Bearing Diagnosis Based on LMD and Neural Network

ASSESSMENT OF POWER QUALITY EVENTS BY HILBERT TRANSFORM BASED NEURAL NETWORK. Shyama Sundar Padhi

Vibration-based Fault Detection of Wind Turbine Gearbox using Empirical Mode Decomposition Method

Location of Remote Harmonics in a Power System Using SVD *

A new instantaneous frequency extraction method for nonstationary response signals in civil engineering structures

Guan, L, Gu, F, Shao, Y, Fazenda, BM and Ball, A

Monophony/Polyphony Classification System using Fourier of Fourier Transform

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

By Shilpa R & Dr. P S Puttaswamy Vidya Vardhaka College of Engineering, India

Formant Synthesis of Haegeum: A Sound Analysis/Synthesis System using Cpestral Envelope

Separation of sinusoidal and chirp components using Compressive sensing approach

Lecture 2: Acoustics

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment

Cancellation of Unwanted Audio to Support Interactive Computer Music

Gearbox fault detection using a new denoising method based on ensemble empirical mode decomposition and FFT

Original Research Articles

Classification of Voltage Sag Using Multi-resolution Analysis and Support Vector Machine

TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis

A Novel Method of Bolt Detection Based on Variational Modal Decomposition 1

Transcription:

I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes in Electrical Engineering (LNEE), Vol.345, pp.523-528. (EI Journal, ISBN: 978-3-319-17313-9).

Chapter 67 A HHT-Based Music Synthesizer I-Hao Hsiao, Chun-Tang Chao, and Chi-Jo Wang Abstract Synthesizing musical sound plays an important role in modern music composition. Composers nowadays can easily take advantage of powerful and userfriendly personal computers to produce the desired musical sound with a good music synthesis method. In this chapter, the Hilbert-Huang Transform (HHT) timefrequency analysis method is employed, in an attempt to implement a new efficient music synthesizer. By applying the HHT technique, the original varying-pitch music signals can be decomposed into several intrinsic mode functions (IMF) based on the empirical mode decomposition (EMD). The instantaneous amplitude and frequency of IMFs can be further obtained by Hilbert transform. By extracting the main spectrum coefficients of the instantaneous amplitude and frequency of the IMFs, the original musical signal can be reconstructed with little error. Experimental results indicate the feasibility of the proposed method. Keywords Music synthesis Hilbert-Huang transform (HHT) Empirical mode decomposition (EMD) Intrinsic mode function (IMF) 67.1 Introduction For regular music synthesis methods, the two most popular methods may be the wavetable music synthesis [1] and FM synthesis [2]. A good music synthesis allows music creators to synthesize the sound signal accurately and quickly. However, the two methods have been unable to provide satisfactory quality for high performance applications. In recent years, the trend for a musical-tone generator has been based on physical modeling of sound production mechanisms [3]. The digital waveguide filter [4, 5] can be applied to simulate a wide class of musical instruments. Figure 67.1 shows the nonlinear predictive model of an instrument. The excitation unit (Exciter) is the nonlinear part, responsible for generating an oscillatory signal source. And the I.-H. Hsiao C.-T. Chao (*) C.-J. Wang Department of Electrical Engineering, Southern Taiwan University of Science and Technology, Tainan, Taiwan e-mail: tang@mail.stust.edu.tw Springer International Publishing Switzerland 2016 J. Juang (ed.), Proceedings of the 3rd International Conference on Intelligent Technologies and Engineering Systems (ICITES2014), Lecture Notes in Electrical Engineering 345, DOI 10.1007/978-3-319-17314-6_67 523

524 I.-H. Hsiao et al. Fig. 67.1 The nonlinear predictive model of an instrument RBF Network Delay Line H AP u[n] H LP Exciter Nonlinear Predictor Resonator Synthetic output wavetable y 0 y 1 y 2 y 3 y N z 1 z 1 z 1 w 0,1 w 0,2 w 0,3 w0,n Fig. 67.2 The nonlinear predictive model of an instrument resonance unit (Resonator) belongs to the linear filter part, responsible for modulating out the sound signal. Figure 67.2 shows a simple model-based structure implemented by IIR (infinite impulse response) synthesis, consisting of a prediction filter and a delay line to synthesize tones produced by instruments [6]. The design of the coefficients for the IIR synthesizer is accomplished by using a neural network (NN)-based training algorithm. A recurrent NN (RNN) is applied for the prediction filter design. However, such kinds of design approaches can be time-consuming during the training process. Huang et al. [7] in 1998 developed a new method called Hilbert-Huang Transform (HHT) for analyzing nonlinear and nonstationary data. The HHT should be more powerful and suitable in timbre analysis when compared with traditional Short-Time Fourier Transform (STFT). Through the understanding of the HHT method, this chapter proposes a more efficient HHT-Based Music Synthesizer. 67.2 The HHT and EMD The HHT was pioneered by Huang et al., for adaptively representing nonstationary signals as sums of zero-mean amplitude modulation frequency modulation components. The Fourier Transform views the signal as a combination of many

67 A HHT-Based Music Synthesizer 525 fixed-frequency and fixed-amplitude sinusoids. The HHT regards the signal as a combination of many intrinsic mode functions (IMF), which have time-varying frequency (instantaneous frequency) and time-varying amplitude (instantaneous amplitude) [8]. Thus, the HHT provides a more powerful analysis and synthesis tool for the pitch and timbre of a music sound. In this section, the HHT and EMD are briefly introduced. There are two steps in the HHT: (1) For a given signal x(t), extract the IMFs by means of empirical mode decomposition (EMD); and (2) apply the Hilbert Transform on each IMF to get the corresponding instantaneous frequency and amplitude. Step 1 is iteratively finished until the residue becomes a monotonic function or a function with only one cycle from which no more IMFs can be extracted. Equation (67.1) shows the decomposition of the x(t) into N-empirical modes, where c j (t) is the jth IMF and r N (t) is the final residue. xt ðþ¼ XN c j ðþþr t N ðþ t j¼1 ð67:1þ In Step 2, the Hilbert Transform is utilized to obtain an analytic complex representation z(t) for each IMF c(t), as shown in (67.2), where d(t) is the Hilbert Transform of c(t). The instantaneous amplitude and instantaneous phase are denoted as a(t) and θ(t), respectively. zt ðþ¼ct ðþþidðþ¼at t ðþe iθðþ t Then the original signal x(t) can be represented as 8 >< xt ðþ¼re >: XN j¼1 ð 9 a j ðþe t i ω j ðþdτ τ >= >; ð67:2þ ð67:3þ ðþ¼ dθðþ t where ω t dt is the instantaneous frequency and Re denotes real part. Equation (67.3) showsthedifferencebetweenthehhtandthediscretefourier Transform. In the HHT, each component is considered as time-varying amplitude andtime-varyingfrequencysinusoid.forbrevity, the instantaneous frequency and the instantaneous amplitude will be referred to as IF and IA in the following text. For each IMF, its corresponding IF and IA can be calculated. On the contrary, the IMF can be reconstructed by its corresponding IF and IA. In this chapter, the FFT will be applied to the IF and IA, instead of directly applied to the IMF. Using this approach, less FFT coefficients are needed and will yield better synthesis performance.

526 I.-H. Hsiao et al. 67.3 Simulation Results The simulation was implemented in the MATLAB environment. Different sound signals, including piano, trumpet, violin, and bird chirps, are provided. Figure 67.3 shows the EMD analysis of trumpet music (pitch A4 or A440), including the original signal, IMF1 IMF8, and the final residue. For each IMF, the corresponding IF and IA can be obtained. Figure 67.4 shows the IA analysis for each IMF. In the proposed method, only the first four IMFs (IMF1 IMF4) are considered and the latter IMFs (IMF5 IMF11) are omitted. The FFT is applied to IF and IA, and 128 main coefficients for IF and IA, respectively, are selected. Thus for the first four IMFs, all the 1,024 FFT coefficients are stored. The proposed synthesis method is compared with the original sound with direct 1,024-point FFT analysis, to demonstrate its efficiency and feasibility. Table 67.1 shows the synthesis error comparison under the same coefficients number 1,024 for different instruments, where the error is measured by the Euclidean distance defined in (67.4). dx; ð yþ ¼ sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi X n i¼1 ðx i y i Þ 2 ð67:4þ res. imf11imf10 imf9 imf8 imf7 imf6 imf5 imf4 imf3 imf2 imf1signal Fig. 67.3 EMD analysis of trumpet music (A4)

l 67 A HHT-Based Music Synthesizer 527 imf1 IA imf2 IA 5 1.5 4 3 1 2 imf3 IA 1.2 1 0.8 0.6 0.4 0.4 0.3 0.2 0.1 imf5 IA 0.5 imf4 IA 0.4 0.3 0.2 0.1 imf6 IA 0.2 0.15 0.1 0.05 imf7 IA 0.05 0.04 0.03 0.02 0.01 imf8 IA 0.15 0.1 0.05 Fig. 67.4 Instantaneous amplitude (IA) analysis for each IMF

528 I.-H. Hsiao et al. Table 67.1 Synthesis error comparison Method Sound The proposed method Direct FFT Piano 0.3407 0.8267 Trumpet 0.5202 0.8229 Violin 0.2152 0.8219 Bird chirps 0.0596 0.2952 67.4 Conclusion This chapter presents a music synthesizer based on the HHT. For some advanced model-based approaches, the procedure may be tedious and time-consuming for parameter learning. Since most practical music sounds are not stationary, especially in the beginning of the timbre, the conventional Fourier Transform cannot be expected to realistically synthesize the music sounds. The HHT is an advanced signal-processing technique for analyzing nonlinear and nonstationary time series data. The signal is first segregated into narrow band components, the IMFs, by performing EMD. The Hilbert transform is then applied on each mode to obtain the respective instantaneous frequency and the amplitude. By extracting the main FFT coefficients of the instantaneous frequency and the amplitude for each IMF, the original signal can be restored in a good performance. Simulation results show the feasibility of the proposed synthesis method. Further improvement should be developed for practical applications. References 1. Robert, B.J.: Wavetable synthesis 101, a fundamental perspective. In: Proceedings 101st Convention of the Audio Engineering Society, Los Angeles (1996) 2. John, M.: Chowning: the synthesis of complex audio spectra by means of frequency modulation. Comput. Music J. 1(2), 46 54 (1977) 3. Drioli, C., Rocchesso, D.: A generalized musical-tone generator with application to sound compression and synthesis. In: Proceeding of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, vol. 1, pp. 431 434 (1997) 4. Smith, J.O.: Physical modeling using digital waveguides. Comput. Music J. 16(4), 74 87 (1992) 5. Smith, J.O.: Efficient synthesis of stringed musical instruments. In: Proceedings of the 1993 International Computer Music Conference, pp. 64 71, Computer Music Association, Tokyo (1993) 6. Su, A.W.Y., Liang, S.F.: A new automatic IIR analysis/synthesis technique for plucked-string instruments. IEEE Trans. Speech Audio Process. 9(7), 747 754 (2001) 7. Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., Yen, N.C., Tung, C.C., Liu, H.H.: The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. Ser. 454(1971), 903 995 (1998) 8. Gloersen, P., Huang, N.E.: Comparison of interannual intrinsic modes in hemispheric sea ice covers and other geophysical parameters. IEEE Trans. Geosci. Remote Sens. 41(5), 1062 1074 (2003)