Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain

Size: px
Start display at page:

Download "Determination of Pitch Range Based on Onset and Offset Analysis in Modulation Frequency Domain"

Transcription

1 Determination o Pitch Range Based on Onset and Oset Analysis in Modulation Frequency Domain A. Mahmoodzadeh Speech Proc. Research Lab ECE Dept. Yazd University Yazd, Iran H. R. Abutalebi Speech Proc. Research Lab ECE Dept. Yazd University Yazd, Iran & Idiap Research Institute 1 Martigny, Switzerland H. Soltanian-Zadeh Control and Intelligent Processing Center o Excellence, University o Tehran Tehran, Iran & Image Analysis Lab. Henry Ford Health System Detroit, USA H. Sheikhzadeh EE Dept. Amirkabir University o Technology Tehran, Iran Abstract Auditory scene in a natural environment contains multiple sources. Auditory scene analysis (ASA) is the process in which the auditory system segregates a scene into streams corresponding to dierent sources. The determination o range o pitch requency is necessary or segmentation. We propose a system to determine the range o pitch requency by analyzing onsets and osets in modulation requency domain. In the proposed system, irst the modulation spectrum o speech is calculated and then, in each subband onsets and osets will be detected. Thereater, the segments are generated by matching corresponding onset and oset ront. Finally, by choosing the desired segments, the rage o pitch requency is determined. Systematic evaluation shows that the range o pitch requency is estimated with good accuracy. Keywords- pitch requency; onset/oset algorithm; modulation requency domain I. INTRODUCTION The pitch is deined as the undamental requency o quasiperiodic or voiced sounds [1]. In the speech signals, the pitch is produced by vibrations o the vocal cords. Pitch determination is a undamental problem that attracts much attention in speech analysis. A robust pitch detection algorithm (PDA) is needed or many applications including computational auditory scene analysis (CASA), prosody analysis, speech enhancement/separation [2], speech recognition, and speaker identiication [3]. Various methods have been proposed or the determination o the pitch requency. These methods are generally classiied into three categories: time-domain, requency-domain, and time-requency domain algorithms. Time-domain PDAs directly examine the temporal structure o a signal waveorm and estimates the period o the quasi-periodic signal. They use either the autocorrelation unction [4],[5], other physical [6][7] and geometric [8] criteria, least-square itting [9], pattern recognition [1], and neural networks [11]. Frequency domain PDAs utilize the harmonic structure in the short-term spectrum or distinguish the undamental requency. These methods include: the harmonic product spectrum; Cepstral analysis, and maximum likelihood. Time-requency domain algorithms perorm time-domain analysis on band-iltered signals obtained via a multichannel ront-end. In these methods, the estimation o pitch requency is done by raming the speech signal and estimation o pitch requency in each rame. Then by orming the pitch contour, the range o pitch requency is determined. In each rame, i the pitch requency changes, the pitch requency will not be correctly estimated. Moreover, in these methods we ace pitch halving and pitch doubling problems. Also, in a pitch curve, using interpolation or unvoiced regions, some values are applied. In addition, the existence o intererence between noise and speech signal deteriorates the perormance o such techniques. In recent years, multipitch methods are presented that have some complexities and problems. In this paper, we propose a system to determine the range o pitch requency by analyzing onsets and osets in modulation requency domain. The proposed method determines the range o pitch requency o voiced speech without any windowing. At irst, modulation spectrum o speech is calculated using the modulation transorm. Then, using the onset and oset algorithm, the onset and oset ronts are detected. Thereater, the segments are generated by matching the corresponding onset and oset ronts. Finally, by choosing the desired segments, the range o pitch requency is determined. By extending this system, we can determine the range o the multi-pitch requency o the two speakers. This, in turn, can be used or single channel speech separation. The undamental requency o speech varies rom 4 Hz or low-pitched male voices to 35 Hz or children or high-pitched emale voices. The pitch requency o everyone is not constant during the time; however it is bonded in a range. When the range o pitch requency is known, it may help in single channel speech separation. This paper is organized as ollows. In Section II and III, we propose a working deinition or modulation requency 1 H. R. Abutalebi has been on sabbatical at Idiap Research Institute during Fall 21-Summer 211.

2 analysis and onset and oset algorithm. In Section IV, we irst give a brie description o our system and then present the details o each stage. The results o the system on the determination o range o pitch requency are reported in Section V. The paper concludes with a discussion in Section VI. II. MODULATION FREQUENCY ANALYSIS The general modulation requency analysis ramework consists o a ilterbank (possibly decimated), ollowed by subband envelope detection and requency analysis o the subband envelopes [12]. In its most straightorward orm, the ilterbank is implemented using the Short-Time Fourier Transorm (STFT), envelope detection is deined as the magnitude or magnitude squared o the subband, and subband envelope requency analysis is perormed with the Fourier transorm. For a discrete signal x ( n ), the STFT can be expressed as kn X k ( m) = h( mm n) x ( n) W K n = (1) or k =,..., K 1 and the envelope detection and modulation requency analysis as im X l ( k, i ) = g ( ll m) X k ( m) W I. (2) m = or i =,..., I 1 j( 2 π / K ) where W K e g m are the acoustic and modulation requency analysis windows, respectively. Throughout the paper, we use the shorthand notations and =. h( n ) and ( ) { ( )} l (, ) T x n = X k i (3) 1 { (, ) } ( ) T X k i = x n (4) l to reer to the modulation requency analysis and synthesis, respectively. The magnitude o the sub-band envelope spectra Xl ( k, i ) is typically displayed in a modulation spectrogram representation. The vertical axis o this representation is regular K, and its horizontal axis is modulation acoustic requency ( ) requency ( i ), (iii). Gray- or color-scale intensity in the joint acoustic/modulation plane represents modulation spectral energy. The modulation analysis ramework is illustrated in Figure 1, and an example o a modulation spectrogram is shown in Figure 2. III. ONSET AND OFFSET Onsets and osets correspond to sudden amplitude increases and decreases [13]. A standard way to identiy such intensity changes is to take the irst-order derivative o the intensity with respect to modulation requency and then ind the peaks and valleys o the derivative. Because o intrinsic intensity luctuations, many peaks and valleys o the derivative do not correspond to actual onsets and osets. To reduce such luctuations, we smooth the intensity over modulation requency, as is commonly done in edge detection or image analysis. Smoothing can be perormed through either a diusion process or low-pass iltering. Onsets correspond to the peaks o the derivative above a certain threshold, and osets are the valleys below a certain threshold. The purpose o thresholdding is to remove peaks and valleys corresponding to insigniicant intensity luctuations. The above procedure is similar to the standard Canny edge detector in image processing [14]. An example o the above procedure is shown in Figure 3. Acoustic requency (Hz) Modulation requency (Hz) Figure 2. Modulation spectrogram o the male speaker. Figure 1. Modulation analysis ramework and the modulation spectrogram [12].

3 Figure 3. The upper panel shows the response intensity, and the lower panel shows the results o onset and oset detection using low-pass ilter. IV. SYSTEM DESCRIPTION This research estimates the range o the pitch requency. The proposed system estimates this range via an analysis o signal in modulation requency domain using onset and oset detection algorithm or one speaker. Although the proposed system is capable o determining the range o pitch requency o one speaker, by its expansion we can present a new system or determination o multipitch requency o speakers in one channel. The place o pitch energy in modulation requency spectrogram is an important eature or determination o the range o the pitch requency o speech. Figure 4 shows a block diagram o our proposed system. In the irst stage, the modulation spectrum o the speech signal is calculated. Then segments are generated in the modulation domain using the onset and oset algorithm and inally in the decision-making stage, the range o pitch requency is determined. A detailed description o the stages is as ollows: A. Cochlear iltering and modulation transorm At irst the spectrum o speech signal is calculated using cochlear iltering and STFT transorm. Then, using the modulation transorm, the modulation spectrum o speech signal is calculated. The vertical axis o modulation spectrum is acoustic requency, and its horizontal axis is modulation requency. Colour intensity in the joint acoustic/modulation plane represents modulation spectral energy. B. Smoothing Smoothing corresponds to low-pass iltering. Our system smoothes the intensity over modulation requency with a lowpass ilter. Let v ( c,,,) denote the initial intensity at modulation requency in ilter channel c. We have (,,, ) (,,,) ( ) v c s = v c h s (5) where h( s ) is a low-pass ilter (in the modulation requency domain with passband [, s ] in Hertz). Here, denotes convolution. The parameter ( s ) indicates the degree o smoothing. The smaller ( s ) is, the smoother v ( c s ) is.,,, C. Maximum detection: By detecting the onsets and osets and orming the onset and oset ront, the modulation spectrum o speech signal is segmented. A ew o these segments consist o inormation about the range o pitch requency. The speaker s pitch ranges have to be [6,35] Hz (or men, women and children). Thereore, in every subband (or each acoustic requency), the maximums in [6,35] Hz and above a certain threshold are ounded. Onset/oset detection and matching: onset and oset candidates are detected by marking peaks and valleys o the modulation requency derivative o the smoothed intensity d d v c s v c h s d d (,,, ) = (,,,) ( ) In each sub-band we select the onsets and osets that occurred around any speciied maxima. Ater determining onset and oset o each subband, onset and oset ronts should be ound and inally the bounds o segments will be identiied. (6) Range o pitch requency Decision Cochlear iltering Modulation Transorm Smoothing Maximum Detection Figure 4. The block diagram o the proposed system. Onset/oset Detection and matching

4 D. Decision-making: Obviously, in the requency domain o speech signal, there are some peaks in the pitch requency and its harmonic. Based on this, in the decision stage, we select the segments whose ranges o modulation requency are the harmonics o each other. In accord to the onset and oset ronts o the desired segment, and by calculating the mean o onset and oset o these ronts, we can ind the beginning and ending o the range o pitch requency. V. RESULTS We now present experimental results demonstrating the robustness and the accuracy o our method compared to the least square harmonic model [15] in dierent SNR s. In [15] an algorithm or harmonic decomposition o time-variant signals is derived rom a least squares harmonic (LSH) technique. The estimates o harmonic amplitudes and phases are ormulated as the solution o a set o linear equations that minimize the mean square error. The experimental results in [15] demonstrate the robustness and the accuracy o LMS method relative to standard algorithms such as RAPT [16] and the maximum a posterior estimator (MAP) o [17]. The signal requency is modeled by a linear or quadratic polynomial and obtained via a local search over polynomial coeicients. An initial estimate o signal requency is necessary to reduce computation time. To evaluate the accuracy o the proposed algorithm in pitch range estimation, we choose signals rom a corpus o 1 speech signal (male and emale speeches), that commonly used or CASA research [18] and TIMIT database. At irst we selected the clean speech rom the TIMIT database, x ( n ) (speaker: one two three our ive ). The speech was sampled at 16 khz. The algorithm parameters were set to M = 16, K = 512, L = 38, I = 512, and h( n ) and g ( m ) were a 48-point and 78-point Hanning window. The additive noise is a white noise with zero mean and unity variance. Figures 2 and 5(a) show the signal and modulation spectrum o the initial.8s o the speech ( one ). The applied ilter or smoothing o each subband is FIR low-pass ilter. By selecting a proper threshold, only the maxima above the certain value in each subband are chosen. In this way, we can avoid the production o undesired segments. The obtained pitch contours o the speech using the LSH model is shown in Figure 5(b) or dierent SNR s. The exact value and the obtained range o pitch requency using the proposed system and LSH model or cleaned speech is reported in Table 1. The perormance o the proposed system and the LSH model in terms o the range o the pitch requency or dierent SNR s is reported in Table 2. Table 3 shows the obtained mean error percentage o pitch range estimation or 1 speech signals ater using the proposed system and LSH model or dierent SNR s. According to Figure 5(b), we can observe that the LSH method can not accurately estimate the pitch requency in the transition region between voiced and unvoiced. According to these results and by comparing with the exact range o pitch requency, one may deduce that or low SNR s, the LSH model aces the error and its accuracy reduces, while using the proposed system, in low SNR s the range o pitch requency is estimated accurately. VI. CONCLUSION AND DISCUSSION We demonstrated that modulation requency localization o pitch energy is an important eature or determination o range o pitch requency o speech in modulation spectrogram that can be exploited or single channel speaker separation. We presented a new approach or determination o range o pitch requency based on modulation requency analysis and onset/oset algorithm. The proposed method is purposely simple and accurate. Also, the results show that the accuracy o the proposed algorithm is acceptable in noisy conditions and or dierent SNR s. By expansion o proposed system, we can present a new system or determination o multipitch requency in modulation requency domain with using pitch energy. TABLE I. THE EXACT AND THE OBTAINED RANGE OF PITCH FREQUENCY USING THE PROPOSED SYSTEM AND LSH MODEL FOR CLEANED SPEECH TABLE II. Amplitude Frequency(Hz) Clean speech True value Proposed system LSH model [91,125] [92,126] [93,126] 1.5 OBTAINED RANGE OF PITCH FREQUENCY USING PROPOSED SYSTEM AND LSH MODEL FOR DIFFERENT SNR S SNR Proposed system LSH model [85,117] [8,135] 5 [92,123] [79,126] 1 [92,124] [93,126] 15 [92,125] [93,127] Time(s) (b) 15 SNR=1dB SNR=15dB 1 (a) SNR=5dB SNR=dB Time(s) Figure 5. (a) Speaker s signal ( one ). (b) Pitch contours o speech signal obtained with using least square harmonic model or dierent SNR s.

5 TABLE III. THE OBTAINED MEAN ERROR PERCENTAGE OF PITCH RANGE ESTIMATION FOR 1 SPEECH SIGNAL SNR Proposed system LSH model REFERENCES [1] P. Vary and R. Martin,Digital Speech Transmission Enhancement, Coding And Error Concealment, Wiley, 26. [2] C. Breithaupt, T. Gerkmann, and R.Martin, A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing, Proceedings o ICASSP, pp , April 28. [3] C. MadsGraesboll and A. Jakobsson, Multi-Pitch Estimation-Synthesis Lectures on Speech and Audio Processing, M & C Publishers, 29. [4] Lawrence R. Rabiner, On the use o autocorrelation analysis or pitch detection, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 25, no. 1, pp , February [5] A. de Cheveign e and H. Kawahara, YIN, a undamental requency estimator or speech and music, J. Acoust. Soc. Am., vol. 111, no. 4, pp , Apr. 22. [6] J. C. Brown and M. S. Puckette, A high resolution undamental requency determination based on phase changes o the urier transorm, Journal o the Acoustical Society o America, vol. 94, no. 2, pp , [7] John E. Lane, Pitch detection using a tunable IIR ilter, Computer Music Journal, vol.14, no.3, pp.46 59, Fall 199. [8] D. Cooperand Kia C. Ng, A monophonic pitch-tracking algorithm based on waveorm periodicity determinations using landmark points, Computer Music Journal, vol. 2, no. 3, pp. 7 78, Fall [9] A. Choi, Real-time undamental requency estimation by least-square itting, IEEE Transactions on Speech and Audio Processing, vol. 5, no. 2, pp , March [1] J. C. Brown, Musical undamental requency tracking using a pattern recognition method, Journal o the Acoustical Society o America, vol. 92, no.3, pp , September [11] H. Sano and B. Keith Jenkins, A neural network model or pitch perception, Computer Music Journal, vol. 13, no. 3, pp , Fall [12] S. Schimmel, L. Atlas, and K. Nie, Feasibility o signal channel speaker separation based on modulation requency analysis, Proceedings o ICASSP, pp , 27. [13] G. Hu, and D.L. Wang, Auditory segmentation based on onset and oset analysis, IEEE Trans. Audio, Speech, Lang. Process. Vol. 15, no. 2, pp , February 27. [14] J. Canny, A computational approach to edge detection, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 8, pp , [15] Q. Li, L. Atlas, Time-variant least-squares harmonic modelling, Proceedings o IEEE International Conerence on Acoustics, Speech, and Signal Processing, Vol. 2, Hong-Kong, April 23. [16] D. Talkin, A robust algorithm or pitch tracking (RAPT), in Speech Coding and Synthesis, W. B. Klein and K. K. Paliwal, Eds.NewYork, NY: Elsevier, pp , [17] J. Tabrikian, S. Dubnov, and Y. Dickalov, Maximum a posterior probability pitch tracking in noisy environments using harmonic model, IEEE Trans. Speech Audio Process., vol. 12, pp , 24. [18] G. J. Brown, and M. P. Cooke, Computational auditory scene analysis, Comput. Speech Lang., vol. 8, pp , 1994.

Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method

Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method RESEARCH Open Access Single channel speech separation in modulation requency domain based on a novel pitch range estimation method Azar Mahmoodzadeh 1, Hamid Reza Abutalebi 1*, Hamid Soltanian-Zadeh 2,3

More information

Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation

Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation Shibani.H 1, Lekshmi M S 2 M. Tech Student, Ilahia college of Engineering and Technology, Muvattupuzha, Kerala,

More information

Monaural and Binaural Speech Separation

Monaural and Binaural Speech Separation Monaural and Binaural Speech Separation DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction CASA approach to sound separation Ideal binary mask as

More information

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech

More information

TIME-FREQUENCY ANALYSIS OF NON-STATIONARY THREE PHASE SIGNALS. Z. Leonowicz T. Lobos

TIME-FREQUENCY ANALYSIS OF NON-STATIONARY THREE PHASE SIGNALS. Z. Leonowicz T. Lobos Copyright IFAC 15th Triennial World Congress, Barcelona, Spain TIME-FREQUENCY ANALYSIS OF NON-STATIONARY THREE PHASE SIGNALS Z. Leonowicz T. Lobos Wroclaw University o Technology Pl. Grunwaldzki 13, 537

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Monophony/Polyphony Classification System using Fourier of Fourier Transform

Monophony/Polyphony Classification System using Fourier of Fourier Transform International Journal of Electronics Engineering, 2 (2), 2010, pp. 299 303 Monophony/Polyphony Classification System using Fourier of Fourier Transform Kalyani Akant 1, Rajesh Pande 2, and S.S. Limaye

More information

Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain {jordi.bonada,

Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain   {jordi.bonada, GENERATION OF GROWL-TYPE VOICE QUALITIES BY SPECTRAL MORPHING Jordi Bonada Merlijn Blaauw Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain Email: {jordi.bonada, merlijn.blaauw}@up.edu

More information

Fatigue Life Assessment Using Signal Processing Techniques

Fatigue Life Assessment Using Signal Processing Techniques Fatigue Lie Assessment Using Signal Processing Techniques S. ABDULLAH 1, M. Z. NUAWI, C. K. E. NIZWAN, A. ZAHARIM, Z. M. NOPIAH Engineering Faculty, Universiti Kebangsaan Malaysia 43600 UKM Bangi, Selangor,

More information

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 o Music signal characteristics o Perceptual attributes and acoustic properties o Signal representations for pitch detection o STFT o Sinusoidal model o

More information

Cyclostationarity-Based Spectrum Sensing for Wideband Cognitive Radio

Cyclostationarity-Based Spectrum Sensing for Wideband Cognitive Radio 9 International Conerence on Communications and Mobile Computing Cyclostationarity-Based Spectrum Sensing or Wideband Cognitive Radio Qi Yuan, Peng Tao, Wang Wenbo, Qian Rongrong Wireless Signal Processing

More information

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Single Channel Speaker Segregation using Sinusoidal Residual Modeling NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology

More information

SPEECH PARAMETERIZATION FOR AUTOMATIC SPEECH RECOGNITION IN NOISY CONDITIONS

SPEECH PARAMETERIZATION FOR AUTOMATIC SPEECH RECOGNITION IN NOISY CONDITIONS SPEECH PARAMETERIZATION FOR AUTOMATIC SPEECH RECOGNITION IN NOISY CONDITIONS Bojana Gajić Department o Telecommunications, Norwegian University o Science and Technology 7491 Trondheim, Norway gajic@tele.ntnu.no

More information

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,

More information

Detection and direction-finding of spread spectrum signals using correlation and narrowband interference rejection

Detection and direction-finding of spread spectrum signals using correlation and narrowband interference rejection Detection and direction-inding o spread spectrum signals using correlation and narrowband intererence rejection Ulrika Ahnström,2,JohanFalk,3, Peter Händel,3, Maria Wikström Department o Electronic Warare

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

NCCF ACF. cepstrum coef. error signal > samples

NCCF ACF. cepstrum coef. error signal > samples ESTIMATION OF FUNDAMENTAL FREQUENCY IN SPEECH Petr Motl»cek 1 Abstract This paper presents an application of one method for improving fundamental frequency detection from the speech. The method is based

More information

Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives

Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives Mathew Magimai Doss Collaborators: Vinayak Abrol, Selen Hande Kabil, Hannah Muckenhirn, Dimitri

More information

Noise Removal from ECG Signal and Performance Analysis Using Different Filter

Noise Removal from ECG Signal and Performance Analysis Using Different Filter International Journal o Innovative Research in Electronics and Communication (IJIREC) Volume. 1, Issue 2, May 214, PP.32-39 ISSN 2349-442 (Print) & ISSN 2349-45 (Online) www.arcjournal.org Noise Removal

More information

ELEC3106 Electronics. Lecture notes: non-linearity and noise. Objective. Non-linearity. Non-linearity measures

ELEC3106 Electronics. Lecture notes: non-linearity and noise. Objective. Non-linearity. Non-linearity measures ELEC316 Electronics Lecture notes: non-linearity and noise Objective The objective o these brie notes is to supplement the textbooks used in the course on the topic o non-linearity and electrical noise.

More information

Sinusoidal signal. Arbitrary signal. Periodic rectangular pulse. Sampling function. Sampled sinusoidal signal. Sampled arbitrary signal

Sinusoidal signal. Arbitrary signal. Periodic rectangular pulse. Sampling function. Sampled sinusoidal signal. Sampled arbitrary signal Techniques o Physics Worksheet 4 Digital Signal Processing 1 Introduction to Digital Signal Processing The ield o digital signal processing (DSP) is concerned with the processing o signals that have been

More information

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

International Journal of Modern Trends in Engineering and Research   e-issn No.: , Date: 2-4 July, 2015 International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Analysis of Speech Signal Using Graphic User Interface Solly Joy 1, Savitha

More information

A new zoom algorithm and its use in frequency estimation

A new zoom algorithm and its use in frequency estimation Waves Wavelets Fractals Adv. Anal. 5; :7 Research Article Open Access Manuel D. Ortigueira, António S. Serralheiro, and J. A. Tenreiro Machado A new zoom algorithm and its use in requency estimation DOI.55/wwaa-5-

More information

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Author Shannon, Ben, Paliwal, Kuldip Published 25 Conference Title The 8th International Symposium

More information

ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION. Frank Kurth, Alessia Cornaggia-Urrigshardt and Sebastian Urrigshardt

ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION. Frank Kurth, Alessia Cornaggia-Urrigshardt and Sebastian Urrigshardt 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION Frank Kurth, Alessia Cornaggia-Urrigshardt

More information

Introduction of Audio and Music

Introduction of Audio and Music 1 Introduction of Audio and Music Wei-Ta Chu 2009/12/3 Outline 2 Introduction of Audio Signals Introduction of Music 3 Introduction of Audio Signals Wei-Ta Chu 2009/12/3 Li and Drew, Fundamentals of Multimedia,

More information

An Efficient Extraction of Vocal Portion from Music Accompaniment Using Trend Estimation

An Efficient Extraction of Vocal Portion from Music Accompaniment Using Trend Estimation An Efficient Extraction of Vocal Portion from Music Accompaniment Using Trend Estimation Aisvarya V 1, Suganthy M 2 PG Student [Comm. Systems], Dept. of ECE, Sree Sastha Institute of Engg. & Tech., Chennai,

More information

ECEN 5014, Spring 2013 Special Topics: Active Microwave Circuits and MMICs Zoya Popovic, University of Colorado, Boulder

ECEN 5014, Spring 2013 Special Topics: Active Microwave Circuits and MMICs Zoya Popovic, University of Colorado, Boulder ECEN 5014, Spring 2013 Special Topics: Active Microwave Circuits and MMICs Zoya Popovic, University o Colorado, Boulder LECTURE 13 PHASE NOISE L13.1. INTRODUCTION The requency stability o an oscillator

More information

ECE 5655/4655 Laboratory Problems

ECE 5655/4655 Laboratory Problems Assignment #4 ECE 5655/4655 Laboratory Problems Make Note o the Following: Due Monday April 15, 2019 I possible write your lab report in Jupyter notebook I you choose to use the spectrum/network analyzer

More information

COMPENSATION OF CURRENT TRANSFORMERS BY MEANS OF FIELD PROGRAMMABLE GATE ARRAY

COMPENSATION OF CURRENT TRANSFORMERS BY MEANS OF FIELD PROGRAMMABLE GATE ARRAY METROLOGY AD MEASUREMET SYSTEMS Index 330930, ISS 0860-89 www.metrology.pg.gda.pl COMPESATIO OF CURRET TRASFORMERS BY MEAS OF FIELD PROGRAMMABLE GATE ARRAY Daniele Gallo, Carmine Landi, Mario Luiso Seconda

More information

MFCC-based perceptual hashing for compressed domain of speech content identification

MFCC-based perceptual hashing for compressed domain of speech content identification Available online www.jocpr.com Journal o Chemical and Pharmaceutical Research, 014, 6(7):379-386 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 MFCC-based perceptual hashing or compressed domain

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information

Further developments on gear transmission monitoring

Further developments on gear transmission monitoring Further developments on gear transmission monitoring Niola V., Quaremba G., Avagliano V. Department o Mechanical Engineering or Energetics University o Naples Federico II Via Claudio 21, 80125, Napoli,

More information

Wavelet Speech Enhancement based on the Teager Energy Operator

Wavelet Speech Enhancement based on the Teager Energy Operator Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose

More information

Fundamental frequency estimation of speech signals using MUSIC algorithm

Fundamental frequency estimation of speech signals using MUSIC algorithm Acoust. Sci. & Tech. 22, 4 (2) TECHNICAL REPORT Fundamental frequency estimation of speech signals using MUSIC algorithm Takahiro Murakami and Yoshihisa Ishida School of Science and Technology, Meiji University,,

More information

Optimizing Reception Performance of new UWB Pulse shape over Multipath Channel using MMSE Adaptive Algorithm

Optimizing Reception Performance of new UWB Pulse shape over Multipath Channel using MMSE Adaptive Algorithm IOSR Journal o Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 05, Issue 01 (January. 2015), V1 PP 44-57 www.iosrjen.org Optimizing Reception Perormance o new UWB Pulse shape over Multipath

More information

A MATLAB Model of Hybrid Active Filter Based on SVPWM Technique

A MATLAB Model of Hybrid Active Filter Based on SVPWM Technique International Journal o Electrical Engineering. ISSN 0974-2158 olume 5, Number 5 (2012), pp. 557-569 International Research Publication House http://www.irphouse.com A MATLAB Model o Hybrid Active Filter

More information

CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM

CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM Arvind Raman Kizhanatham, Nishant Chandra, Robert E. Yantorno Temple University/ECE Dept. 2 th & Norris Streets, Philadelphia,

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

Comparison of Spectral Analysis Methods for Automatic Speech Recognition

Comparison of Spectral Analysis Methods for Automatic Speech Recognition INTERSPEECH 2013 Comparison of Spectral Analysis Methods for Automatic Speech Recognition Venkata Neelima Parinam, Chandra Vootkuri, Stephen A. Zahorian Department of Electrical and Computer Engineering

More information

A Wavelet Approach to Wideband Spectrum Sensing for Cognitive Radios

A Wavelet Approach to Wideband Spectrum Sensing for Cognitive Radios A Wavelet Approach to Wideband Spectrum Sensing or Cognitive Radios Zhi Tian Department o Electrical & Computer Engineering Michigan Technological University Houghton, MI 4993 USA ztian@mtu.edu Georgios

More information

Software Defined Radio Forum Contribution

Software Defined Radio Forum Contribution Committee: Technical Sotware Deined Radio Forum Contribution Title: VITA-49 Drat Speciication Appendices Source Lee Pucker SDR Forum 604-828-9846 Lee.Pucker@sdrorum.org Date: 7 March 2007 Distribution:

More information

SPEECH ENHANCEMENT BASED ON ITERATIVE WIENER FILTER USING COMPLEX SPEECH ANALYSIS

SPEECH ENHANCEMENT BASED ON ITERATIVE WIENER FILTER USING COMPLEX SPEECH ANALYSIS SPEECH ENHANCEMENT BASED ON TERATVE WENER FLTER USNG COMPLEX SPEECH ANALYSS Keiichi Funaki Computing & Networking Center, Univ. o the Ryukyus Senbaru, Nishihara, Okinawa, 93-3, Japan phone: +(8)98-895-8946,

More information

A spectralõtemporal method for robust fundamental frequency tracking

A spectralõtemporal method for robust fundamental frequency tracking A spectralõtemporal method for robust fundamental frequency tracking Stephen A. Zahorian a and Hongbing Hu Department of Electrical and Computer Engineering, State University of New York at Binghamton,

More information

ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1

ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1 ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN SPEECH SIGNALS Zied Mnasri 1, Hamid Amiri 1 1 Electrical engineering dept, National School of Engineering in Tunis, University Tunis El

More information

EEE 311: Digital Signal Processing I

EEE 311: Digital Signal Processing I EEE 311: Digital Signal Processing I Course Teacher: Dr Newaz Md Syur Rahim Associated Proessor, Dept o EEE, BUET, Dhaka 1000 Syllabus: As mentioned in your course calendar Reerence Books: 1 Digital Signal

More information

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991 RASTA-PLP SPEECH ANALYSIS Hynek Hermansky Nelson Morgan y Aruna Bayya Phil Kohn y TR-91-069 December 1991 Abstract Most speech parameter estimation techniques are easily inuenced by the frequency response

More information

YOUR WAVELET BASED PITCH DETECTION AND VOICED/UNVOICED DECISION

YOUR WAVELET BASED PITCH DETECTION AND VOICED/UNVOICED DECISION American Journal of Engineering and Technology Research Vol. 3, No., 03 YOUR WAVELET BASED PITCH DETECTION AND VOICED/UNVOICED DECISION Yinan Kong Department of Electronic Engineering, Macquarie University

More information

Introduction to OFDM. Characteristics of OFDM (Orthogonal Frequency Division Multiplexing)

Introduction to OFDM. Characteristics of OFDM (Orthogonal Frequency Division Multiplexing) Introduction to OFDM Characteristics o OFDM (Orthogonal Frequency Division Multiplexing Parallel data transmission with very long symbol duration - Robust under multi-path channels Transormation o a requency-selective

More information

Epoch Extraction From Emotional Speech

Epoch Extraction From Emotional Speech Epoch Extraction From al Speech D Govind and S R M Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati Email:{dgovind,prasanna}@iitg.ernet.in Abstract

More information

AN EFFICIENT SET OF FEATURES FOR PULSE REPETITION INTERVAL MODULATION RECOGNITION

AN EFFICIENT SET OF FEATURES FOR PULSE REPETITION INTERVAL MODULATION RECOGNITION AN EFFICIENT SET OF FEATURES FOR PULSE REPETITION INTERVAL MODULATION RECOGNITION J-P. Kauppi, K.S. Martikainen Patria Aviation Oy, Naulakatu 3, 33100 Tampere, Finland, ax +358204692696 jukka-pekka.kauppi@patria.i,

More information

Voiced/nonvoiced detection based on robustness of voiced epochs

Voiced/nonvoiced detection based on robustness of voiced epochs Voiced/nonvoiced detection based on robustness of voiced epochs by N. Dhananjaya, B.Yegnanarayana in IEEE Signal Processing Letters, 17, 3 : 273-276 Report No: IIIT/TR/2010/50 Centre for Language Technologies

More information

A Method for Voiced/Unvoiced Classification of Noisy Speech by Analyzing Time-Domain Features of Spectrogram Image

A Method for Voiced/Unvoiced Classification of Noisy Speech by Analyzing Time-Domain Features of Spectrogram Image Science Journal of Circuits, Systems and Signal Processing 2017; 6(2): 11-17 http://www.sciencepublishinggroup.com/j/cssp doi: 10.11648/j.cssp.20170602.12 ISSN: 2326-9065 (Print); ISSN: 2326-9073 (Online)

More information

The Research of Electric Energy Measurement Algorithm Based on S-Transform

The Research of Electric Energy Measurement Algorithm Based on S-Transform International Conerence on Energy, Power and Electrical Engineering (EPEE 16 The Research o Electric Energy Measurement Algorithm Based on S-Transorm Xiyang Ou1,*, Bei He, Xiang Du1, Jin Zhang1, Ling Feng1,

More information

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing Project : Part 2 A second hands-on lab on Speech Processing Frequency-domain processing February 24, 217 During this lab, you will have a first contact on frequency domain analysis of speech signals. You

More information

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue - 8 August, 2014 Page No. 7727-7732 Performance Analysis of MFCC and LPCC Techniques in Automatic

More information

Experiment 7: Frequency Modulation and Phase Locked Loops Fall 2009

Experiment 7: Frequency Modulation and Phase Locked Loops Fall 2009 Experiment 7: Frequency Modulation and Phase Locked Loops Fall 2009 Frequency Modulation Normally, we consider a voltage wave orm with a ixed requency o the orm v(t) = V sin(ω c t + θ), (1) where ω c is

More information

A Comparative Study of Formant Frequencies Estimation Techniques

A Comparative Study of Formant Frequencies Estimation Techniques A Comparative Study of Formant Frequencies Estimation Techniques DORRA GARGOURI, Med ALI KAMMOUN and AHMED BEN HAMIDA Unité de traitement de l information et électronique médicale, ENIS University of Sfax

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

Change Point Determination in Audio Data Using Auditory Features

Change Point Determination in Audio Data Using Auditory Features INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 0, VOL., NO., PP. 8 90 Manuscript received April, 0; revised June, 0. DOI: /eletel-0-00 Change Point Determination in Audio Data Using Auditory Features

More information

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement

Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation

More information

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester SPEECH TO SINGING SYNTHESIS SYSTEM Mingqing Yun, Yoon mo Yang, Yufei Zhang Department of Electrical and Computer Engineering University of Rochester ABSTRACT This paper describes a speech-to-singing synthesis

More information

Sub-band Envelope Approach to Obtain Instants of Significant Excitation in Speech

Sub-band Envelope Approach to Obtain Instants of Significant Excitation in Speech Sub-band Envelope Approach to Obtain Instants of Significant Excitation in Speech Vikram Ramesh Lakkavalli, K V Vijay Girish, A G Ramakrishnan Medical Intelligence and Language Engineering (MILE) Laboratory

More information

Simulation of Radio Frequency Integrated Circuits

Simulation of Radio Frequency Integrated Circuits Simulation o Radio Frequency Integrated Circuits Based on: Computer-Aided Circuit Analysis Tools or RFIC Simulation: Algorithms, Features, and Limitations, IEEE Trans. CAS-II, April 2000. Outline Introduction

More information

Drum Transcription Based on Independent Subspace Analysis

Drum Transcription Based on Independent Subspace Analysis Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford,

More information

With the proposed technique, those two problems will be overcome. reduction is to eliminate the specific harmonics, which are the lowest orders.

With the proposed technique, those two problems will be overcome. reduction is to eliminate the specific harmonics, which are the lowest orders. CHAPTER 3 OPTIMIZED HARMONIC TEPPED-WAVEFORM TECHNIQUE (OHW The obective o the proposed optimized harmonic stepped-waveorm technique is to reduce, as much as possible, the harmonic distortion in the load

More information

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation

More information

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals 16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract

More information

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Ravindra d. Dhage, Prof. Pravinkumar R.Badadapure Abstract M.E Scholar, Professor. This paper presents a speech enhancement method for personal

More information

Implementation of an Intelligent Target Classifier with Bicoherence Feature Set

Implementation of an Intelligent Target Classifier with Bicoherence Feature Set ISSN: 39-8753 International Journal o Innovative Research in Science, (An ISO 397: 007 Certiied Organization Vol. 3, Issue, November 04 Implementation o an Intelligent Target Classiier with Bicoherence

More information

SEG/San Antonio 2007 Annual Meeting. Summary. Morlet wavelet transform

SEG/San Antonio 2007 Annual Meeting. Summary. Morlet wavelet transform Xiaogui Miao*, CGGVeritas, Calgary, Canada, Xiao-gui_miao@cggveritas.com Dragana Todorovic-Marinic and Tyler Klatt, Encana, Calgary Canada Summary Most geologic changes have a seismic response but sometimes

More information

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase Reassignment Geoffroy Peeters, Xavier Rodet Ircam - Centre Georges-Pompidou, Analysis/Synthesis Team, 1, pl. Igor Stravinsky,

More information

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,

More information

Speech/Music Discrimination via Energy Density Analysis

Speech/Music Discrimination via Energy Density Analysis Speech/Music Discrimination via Energy Density Analysis Stanis law Kacprzak and Mariusz Zió lko Department of Electronics, AGH University of Science and Technology al. Mickiewicza 30, Kraków, Poland {skacprza,

More information

Voice Excited Lpc for Speech Compression by V/Uv Classification

Voice Excited Lpc for Speech Compression by V/Uv Classification IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech

More information

CONVOLUTIONAL NEURAL NETWORK FOR ROBUST PITCH DETERMINATION. Hong Su, Hui Zhang, Xueliang Zhang, Guanglai Gao

CONVOLUTIONAL NEURAL NETWORK FOR ROBUST PITCH DETERMINATION. Hong Su, Hui Zhang, Xueliang Zhang, Guanglai Gao CONVOLUTIONAL NEURAL NETWORK FOR ROBUST PITCH DETERMINATION Hong Su, Hui Zhang, Xueliang Zhang, Guanglai Gao Department of Computer Science, Inner Mongolia University, Hohhot, China, 0002 suhong90 imu@qq.com,

More information

Traditional Analog Modulation Techniques

Traditional Analog Modulation Techniques Chapter 5 Traditional Analog Modulation Techniques Mikael Olosson 2002 2007 Modulation techniques are mainly used to transmit inormation in a given requency band. The reason or that may be that the channel

More information

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure

More information

A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis

A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis Colin Vaz, Vikram Ramanarayanan, and Shrikanth Narayanan USC SAIL Lab INTERSPEECH Articulatory Data

More information

Dimension Reduction of the Modulation Spectrogram for Speaker Verification

Dimension Reduction of the Modulation Spectrogram for Speaker Verification Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland Kong Aik Lee and

More information

Adaptive Filters Application of Linear Prediction

Adaptive Filters Application of Linear Prediction Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing

More information

Phase estimation in speech enhancement unimportant, important, or impossible?

Phase estimation in speech enhancement unimportant, important, or impossible? IEEE 7-th Convention of Electrical and Electronics Engineers in Israel Phase estimation in speech enhancement unimportant, important, or impossible? Timo Gerkmann, Martin Krawczyk, and Robert Rehr Speech

More information

Transcription of Piano Music

Transcription of Piano Music Transcription of Piano Music Rudolf BRISUDA Slovak University of Technology in Bratislava Faculty of Informatics and Information Technologies Ilkovičova 2, 842 16 Bratislava, Slovakia xbrisuda@is.stuba.sk

More information

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 1 Electronics and Communication Department, Parul institute of engineering and technology, Vadodara,

More information

Solid State Relays & Its

Solid State Relays & Its Solid State Relays & Its Applications Presented By Dr. Mostaa Abdel-Geliel Course Objectives Know new techniques in relay industries. Understand the types o static relays and its components. Understand

More information

Speech Recognition using FIR Wiener Filter

Speech Recognition using FIR Wiener Filter Speech Recognition using FIR Wiener Filter Deepak 1, Vikas Mittal 2 1 Department of Electronics & Communication Engineering, Maharishi Markandeshwar University, Mullana (Ambala), INDIA 2 Department of

More information

DERIVATION OF TRAPS IN AUDITORY DOMAIN

DERIVATION OF TRAPS IN AUDITORY DOMAIN DERIVATION OF TRAPS IN AUDITORY DOMAIN Petr Motlíček, Doctoral Degree Programme (4) Dept. of Computer Graphics and Multimedia, FIT, BUT E-mail: motlicek@fit.vutbr.cz Supervised by: Dr. Jan Černocký, Prof.

More information

Hungarian Speech Synthesis Using a Phase Exact HNM Approach

Hungarian Speech Synthesis Using a Phase Exact HNM Approach Hungarian Speech Synthesis Using a Phase Exact HNM Approach Kornél Kovács 1, András Kocsor 2, and László Tóth 3 Research Group on Artificial Intelligence of the Hungarian Academy of Sciences and University

More information

A Multipitch Tracking Algorithm for Noisy Speech

A Multipitch Tracking Algorithm for Noisy Speech IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 3, MAY 2003 229 A Multipitch Tracking Algorithm for Noisy Speech Mingyang Wu, Student Member, IEEE, DeLiang Wang, Senior Member, IEEE, and

More information

PLL AND NUMBER OF SAMPLE SYNCHRONISATION TECHNIQUES FOR ELECTRICAL POWER QUALITY MEASURMENTS

PLL AND NUMBER OF SAMPLE SYNCHRONISATION TECHNIQUES FOR ELECTRICAL POWER QUALITY MEASURMENTS XX IMEKO World Congress Metrology or Green Growth September 9 14, 2012, Busan, Republic o Korea PLL AND NUMBER OF SAMPLE SYNCHRONISATION TECHNIQUES FOR ELECTRICAL POWER QUALITY MEASURMENTS Richárd Bátori

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

DARK CURRENT ELIMINATION IN CHARGED COUPLE DEVICES

DARK CURRENT ELIMINATION IN CHARGED COUPLE DEVICES DARK CURRENT ELIMINATION IN CHARGED COUPLE DEVICES L. Kňazovická, J. Švihlík Department o Computing and Control Engineering, ICT Prague Abstract Charged Couple Devices can be ound all around us. They are

More information

Fundamentals of Spectrum Analysis. Christoph Rauscher

Fundamentals of Spectrum Analysis. Christoph Rauscher Fundamentals o Spectrum nalysis Christoph Rauscher Christoph Rauscher Volker Janssen, Roland Minihold Fundamentals o Spectrum nalysis Rohde & Schwarz GmbH & Co. KG, 21 Mühldorstrasse 15 81671 München Germany

More information

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,

More information

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical

More information

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Peter J. Murphy and Olatunji O. Akande, Department of Electronic and Computer Engineering University

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Nonlinear FM Waveform Design to Reduction of sidelobe level in Autocorrelation Function

Nonlinear FM Waveform Design to Reduction of sidelobe level in Autocorrelation Function 017 5 th Iranian Conerence on Electrical (ICEE) Nonlinear FM Waveorm Design to Reduction o sidelobe level in Autocorrelation Function Roohollah Ghavamirad Department o Electrical K. N. Toosi University

More information

Project 2. Project 2: audio equalizer. Fig. 1: Kinter MA-170 stereo amplifier with bass and treble controls.

Project 2. Project 2: audio equalizer. Fig. 1: Kinter MA-170 stereo amplifier with bass and treble controls. Introduction Project 2 Project 2: audio equalizer This project aims to motivate our study o ilters by considering the design and implementation o an audio equalizer. An equalizer (EQ) modiies the requency

More information