TRANSIENT NOISE REDUCTION BASED ON SPEECH RECONSTRUCTION

Size: px
Start display at page:

Download "TRANSIENT NOISE REDUCTION BASED ON SPEECH RECONSTRUCTION"

Transcription

1 TRANSIENT NOISE REDUCTION BASED ON SPEECH RECONSTRUCTION Jian Li 1,2, Shiwei Wang 1,2, Renhua Peng 1,2, Chengshi Zheng 1,2, Xiaodong Li 1,2 1. Communication Acoustics Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China 119, 2. Acoustics and Information Technology Laboratory, Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, 2121, This paper proposes a novel transient noise reduction (TNR) algorithm based on speech reconstruction. The proposed algorithm has two stages. First, the transient noise is detected by using linear prediction residual, which will be referred as Linear Prediction Residual (LPR)-based method. Second, we replace the frames that contain transient noise with the reconstructed speech by using packet loss concealment techniques, which can reduce speech distortion and suppress the transient noise in a robust way. Compared with traditional TNR algorithms, the proposed algorithm is computationally efficient. Moreover, the proposed algorithm can completely eliminate transient noise especially when the voiced speech and the transient noise exist simultaneously. Experimental results show that the proposed algorithm using speech reconstruction techniques can reduce the transient noise effectively, up to 3dB, without introducing audible speech distortion. 1. Introduction Transient noise, which is a type of non-stationary signal with short duration less than 5ms, often appears as an interference in speech communication systems, such as mobile phones, hearing aids and teleconference devices [1]. Since transient noise may seriously degrade the speech quality in practice, it is necessary to suppress it in an efficient way. In recent years, transient noise reduction (TNR) has become an attractive research topic and the researchers have already made some efforts to suppress this transient noise. In [1] and [2], Talmon and Cohen proposed an algorithm that can efficiently suppress transient noise with diffusion maps. However, this algorithm is computationally complex and non-causal. In [3]-[5], transient noise is suppressed in the time domain, wavelet domain or frequency domain, respectively. These algorithms can suppress transient noise with low delay while they may cause serious speech distortion when speech is erroneously detected as transient noise. Moreover, experimental studies show that the existing algorithms cannot completely eliminate transient noise in the sense of hearing. In this paper, we propose a novel TNR algorithm by using speech reconstruction. The proposed method is composed of two steps. In the first step, the Linear Prediction Residual (LPR)-based method is proposed to detect the transient noise as far as possible. In the second step, the transient ICSV21, Beijing, China, July 13-17, 214 1

2 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July 214 noise corrupted frames are removed and packet loss concealment techniques are used to reconstruct speech for continuity. The remainder of this paper is organized as follows. In Section II, we formulate the problem. The LPR-based method is presented in Section III. Then, a packet loss concealment technique is proposed to reconstruct speech in Section IV. Section V gives some experimental results to show the validity of the proposed algorithm. Some conclusions are presented in Section VI. 2. Problem formulation Let s(n) denote a clean speech signal and let d st (n) and d tr (n) be the additive stationary and transient noise signals, respectively. The signal received by a microphone is composed of these three components, written as: x (n) = s (n) + d st (n) + d tr (n) (1) Since the additive stationary noise can be removed by traditional single-channel speech enhancement algorithms [6], [7], we ignore the impact of d st (n) in this paper. Therefore, (1) can be rewritten as: x (n) = s (n) + d tr (n) (2) The microphone signal x(n) can be divided into short-time frames and the transient noise detection problem in the lth frame can be regarded as a binary hypothesis test, given by: { H1 (l) : x l (n) = s (Ml + n) + d tr (Ml + n) or x l (n) = d tr (Ml + n) (3) H (l) : x l (n) = s (Ml + n) where M is the frame shift length and n =,1,...N-1, with N the frame length. In this paper, we choose M=256 and N=512 with the sampling frequency of 16 khz. To eliminate the transients completely, it is better to detect the transient noise as far as possible even when both the speech and the transient noise are presented in the l frame, since we can solve this problem by applying speech reconstruction techniques. 3. Transient noise detection with LPR-based method In the following three parts, we introduce the LPR-based method. In the first part, we analyze the properties of the LPR for different types of signals. In the second part, the specific processes are given to distinguish the transient noise from the speech. In the final part, some experimental results are presented to show the validity of the this method. 3.1 The properties of the LPR In this paper, we assume that the energy of the transient noise mostly concentrates on a small range over the time scale and the temporal energy of the transient noise is significantly larger compared with the speech components. According to the traditional methods, the spectral coherence can be applied to distinguish the unvoiced speech from the transient noise[8]. Meanwhile, the harmonic property of the voiced speech is useful to differentiate the voiced speech and the transient noise [5]. However, when the voiced speech and the transient noise exist at the same time, the voiced speech has a large influence on characteristics of the transient noise and makes it difficult to detect the transient noise. To solve this problem, we propose the LPR-based method. For enhancing the characteristic difference between the transient noise and the speech, we whiten the noisy signal x(n) in each frame. Let x l (n) be the LPR in the lth frame, which can be written as: P x l (n) = x l (n) a l px l (n p) (4) ICSV21, Beijing, China, July 13-17, p=1

3 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July 214 where {a l p} P p=1 are the AR coefficients in the lth frame. In practice, we can apply the common Levinson-Durbin algorithm to estimate the AR coefficients. Different types of signals are shown in Figure. 1 within a short time frame, where (a) is the voiced speech, (b) is the transient noise, (c) is the voiced speech corrupted by transients and (d) is the unvoiced speech. Each type of signal is whitened using linear prediction and the results are shown in Figure. 2 respectively. It is observed that the LPR of the voiced speech is reduced to an impulse train, where the impulses show up periodically as shown in Figure. 2(a). Whereas, Figure.2(b) indicates that the transient noise concentrates its energy on a small window of time before and after linear prediction due to the fact that the transient noise has a short duration and a flat spectrum. Comparing Figure. 1(c) and Figure. 2(c), it can be seen that the transient noise becomes more obvious after linear prediction since the voiced speech is suppressed by the linear prediction while the whitened transient component retains most of its energy. The energy of the unvoiced speech is approximately uniformly distributed over the time which can be seen in Figure. 1(d) and Figure. 2(d) (a) (b) (c) (d) Samples (a) (b) (c) (d) Samples Figure 1: Waveforms of the original signals. Figure 2: Waveforms of the whitened signals. 3.2 A signal centroid-based method to detect transient noise Based on the different distributions between the transient noise and the other signals in the residual domain, we propose a signal centroid-based method to detect transient noise. The centroid of the LPR in the lth frame can be written as: C(l) = N 1 n= N 1 n x l (n) / n= x l (n) (5) Centered on the centroid C(l), the minimum time length which contains E% total energy is given by: C(l)+v x l (n) n=c(l) v B(l) = min E% (6) v N 1 x l (n) n= where E is recommended from and E = 9 is chosen in this paper. Our studies indicate that the B(l) is small under H 1 (l), which is based on the fact that the energy of the transient noise concentrates around a small range. Aiming at improving the detection probability, we introduce a weighted window function w(n) and (5) can be rewritten as: ICSV21, Beijing, China, July 13-17, 214 3

4 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July 214 C(l) = N 1 n= N 1 n w(n) x l (n) / n= w(n) x l (n) (7) where w(n) is a hanning window in practice. Our studies indicate that an appropriate choice of w(n) and overlap length will make the energy concentrated, which helps to detect the transient noise. However, we find that the speech phoneme onsets, which are characterized by sudden bursts, also concentrate their energy on a small range. To solve this problem, we propose to add some stationary noise into the original signal, which can mask the speech phoneme onset component but not mask the voiced speech and the transient noise. Notice should be given that even if the speech is erroneously detected as the transient noise, we can use the packet loss concealment technology to reconstruct the speech, which will be introduced in the following section. Through the above protective measures, the detection criterion can be given by: { B(l) Cth, accept H (l) (8) B(l) < C th, accept H 1 (l) where C th is the threshold and it is relevant to the frame length and the type of transient noise. A large amount of experiments show that C th = 15 is a good choice when the frame length is LPR-based method simulation In this part, we show the validity of the LPR-based method. The speech signal corrupted by transient noise is used for simulation and the results are shown in Figure. 3, where the dashed line represents the threshold C th. The results indicate that the LPR-based method can detect the transient noise effectively even when the speech and the transients exist simultaneously. 2 B(l) Time[Sec] Time[Sec] Figure 3: Simulation for the transient noise detection. 4. TNR based on speech reconstruction Traditional TNR algorithms cannot eliminate transient noise completely in practice. Unfortunately, human s auditory system is sensitive to the residual transient noise. Vaseghi and Rayner proposed a method for removing impulsive noise and reconstructing speech with interpolation algorithm [3]. In this paper, we replace the frames that contain transient noise with the reconstructed speech by using packet loss concealment techniques. Since the duration of transient noise is usually less than 5ms, once the frame is detected to contain transient noise, this frame and its two successive frames should be discarded to ensure that the transient noise can be totally eliminated. Various packet loss concealment techniques can be used to generate approximations of the discarded frames such as Waveform substitution algorithm and Waveform Similarity Overlap-add (WSOLA) algorithm [9], [1]. In this paper, we apply two-side pitch waveform replication (PWR) technique [11] to recontract the speech of the discarded frames. ICSV21, Beijing, China, July 13-17, 214 4

5 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July Pitch detection The pitch period of each frame can be estimated by computing the normalized autocorrelation of the signal and searching for the index that maximizes the normalized autocorrelation [12], i.e., L 1 x(n)x(n + τ) n= C nac (τ) =, τ = τ min...τ max (9) L 1 x(n) 2 L 1 x(n + τ) 2 n= n= { τ, τ L = min < τ N 2 N N τ, < τ τ (1) 2 max where L is the correlation size and τ min and τ max are the minimum and maximum values of pitch periods, respectively. The estimated pitch period is then used to reconstruct the speech. The more accurate method of pitch estimation is beyond the scope of this paper. 4.2 Speech reconstruction Based on whether the forward frame or the backward frame is voiced, we consider four different conditions[11]: both are voiced (BV), only the previous frame is voiced (PV), only the next frame is voiced (NV) and both are unvoiced (BU). The reconstruction methods of the 4 conditions are given in detail Both voiced condition For the BV condition, an algorithm based on phase synchronization and pitch adjustment is used to reconstruct the discarded frames[13]. We assume that pitch period of the forward frame is P f and the pitch period of the backward frame is P b. In the forward frame, we choose P f samples nearest to the discarded frames to be previous pitch waveform, referred as PPW. In the backward frame, we choose P b samples nearest to the discarded frames to be next pitch waveform, referred as NPW. Assuming that there are r samples to be reconstructed, and the number of reconstructed pitch waveform(referred as RPW) is N p, given by: N p = round( round(r/p f) + round(r/p b ) ) (11) 2 In general, P f is not equal to P b so the length of each RPW is different. For instance, if P f < P b, the length of the ith RPW i is given by: P i = P f + round( P b P f N p + 1 i), i = 1, 2...N p (12) If r P 1 +P 2...+P Np, P i should be slightly modified to satisfy the criteria r = P 1 +P 2...+P Np. To get the ith RPW i with length of P i, we apply interpolation method to modify PPW into modified- PPW with P i samples, referred as PPW i m. Likewise, the same method can be used to modify NPW into modified-npw with P i samples, referred as NPW i m. The ith RPW i can be written as: RPW i (k) = w i f(k) PPW i m(k) + w i b(k) NPW i m(k), k = 1, 2...P i (13) wf(k) i = r g, w i r b(k) = g r, g = P 1 + P 2...P i 1 + k (14) where wf(k), i wb(k) i are the gain patterns used for adjusting the contributing ratio of forward and backward component. We combine all the RPWs so as to get the reconstructed speech. ICSV21, Beijing, China, July 13-17, 214 5

6 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July Other conditions For the PV, NV and BU conditions, a simple recovery approach [11] is used to reconstruct the discarded frames. As to the PV condition, the last pitch segment of the forward frame is repeated to fulfill the region of the discarded frames and the gain patterns are used to adjust the amplitude. A similar method can be used to process the NV condition. In case of the BU condition, the rear half of the forward frame and the first half of the backward frame are respectively extended throughout the discarded frames with an amplitude adjustment process. 4.3 Simulation In this part, different types of speech signals are used for simulation. The waveforms of the o- riginal signals are shown in Figure. 4(a)-(c) and the waveforms of the reconstructed signals are shown in Figure. 4(d)-(f). Notice should be given that in Figure. 4(d)-(f) the dashed lines represent the forward and backward frames while the solid lines represent the reconstructed frames. The results show that the two-side PWR algorithm can reconstruct speech effectively without significant distortion (a) (d) (b) (e) (c) Samples (f) Samples Figure 4: Waveforms of the original and the reconstructed signals. 5. Experiments In this section, some experimental results are given to show the validity of the proposed algorithm. In the first part, we use the speech corrupted by the mouse clicking for simulation and illustrate the validity of the proposed algorithm. In the second part, two objective measures are applied to compare the proposed algorithm with the traditional ENV-TNR algorithm in [14]. 5.1 Validity of the proposed algorithm In this part, the transient noise corrupted speech signal sampled at 16 khz is used to show the validity of the proposed algorithm and the results are shown in Figure. 5. Our experiments show that the proposed algorithm can detect the transient noise accurately and suppress the transient noise effectively without introducing audible speech distortion. 5.2 Quantitative Results In this part, the quantitative results of the perceptual evaluation of speech quality (PESQ) and the amount of noise reduction are given to show the validity of the proposed algorithm. Both the keyboard typing noise and the mouse clicking noise are used to compare the proposed algorithm with ICSV21, Beijing, China, July 13-17, 214 6

7 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July (a) Fre[Hz] (d) (b) Fre[Hz] (e) (c) Time[Sec] Fre[Hz] (f) Time[Sec] Figure 5: Waveforms of (a) Clean speech; (b) Noisy speech; (c) Enhanced speech and speech spectrograms of (d) Clean speech; (e) Noisy speech; (f) Enhanced speech. the traditional ENV-TNR algorithm. The comparison results are presented in Table 1. This table clearly demonstrates that the proposed algorithm could reduce the transient noise and improve the PESQ simultaneously. This is based on the fact that the proposed algorithm can eliminate the transient noise completely and reconstruct the speech effectively without significant speech distortion. Table 1: Comparison results of the amount of noise reduction and the PESQ. Noise Type Noise Reduction [db] PESQ ENV-TNR Proposed Nosiy ENV-TNR Proposed Keyboard Typing Mouse Clicking Conclusion This paper proposes a new LPR-based transient noise detection method and a new transient noise reduction algorithm based on speech reconstruction. Compared with the traditional TNR algorithms, the proposed algorithm can completely eliminate transient noise without introducing audible speech distortion, even when the voiced speech and the transient noise exist simultaneously. Experimental results verify the validity of the proposed algorithm in reducing transient noise and improving the speech quality. Future work should concentrate on improving the transient noise detection method and reconstructing the speech more accurately to further avoid audible speech distortion. Acknowledgement This work was supported by NSFC (National Science Fund of China) under Grant No and No This work was also supported in part by the tri-networks integration under No. KGZD-EW-13-5(3). REFERENCES 1 Talmon, R., Cohen, I. and Gannot. S. Single-Channel Transient Interference Suppression With Diffusion Maps, IEEE Transactions on Audio, Speech and Language Processing, 21(1), , January, (213). ICSV21, Beijing, China, July 13-17, 214 7

8 21st International Congress on Sound and Vibration (ICSV21), Beijing, China, July Talmon, R., Cohen, I. and Gannot. S. Transient Noise Reduction Using Nonlocal Diffusion Filters, IEEE Transactions on Audio, Speech and Language Processing, 19(6), , August, (211). 3 Vaseghi, S. V. and Rayner, P. J. W. Detection and Suppression of Impulsive Noise in Speech Communication Systems, IEE Proceedings I (Communications, Speech and Vision), 137(1), 38 46, February, (199). 4 Nongpiur, R. C. Impulse Noise Removal in Speech Using Wavelets, Proceedings of the 28 IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vagas, USA, , April, (28). 5 Zheng, C. S., Chen, X. L., Wang, S. W., Peng, R. H. and Li, X. D. Delayless Method to Suppress Transient Noise Using Speech Properties and Spectral Coherence, Proceedings of the 125 th Audio Engineering Society Convention, New York, USA, 17 2 October, (213). 6 Hu, X. H., Wang, S. W., Zheng, C. S. and Li, X. D. A Cepstrum-based Preprocessing and Postprocessing For Speech Enhancement in Adverse Environments, Applied Acoustics, 74(12), , December, (213). 7 Wang, J., Liu, H., Zheng, C. S. and Li, X. D. Spectral Subtraction based on Two-Stage Sspectral Estimation Modified Cepstrum Thresholding, Applied Acoustics, 74(3), , March, (213). 8 Zheng, C. S., Yang, H. F. and Li, X. D. On Generalized Auto-Spectral Coherence Function and Its Applications to Signal Detection, IEEE Signal Processing Letters, 21(5), , May, (214). 9 Goodman, D. J., Lockhart, G. B., Wasem, O. J. and Wong, W. C. Waveform Substitution Techniques for Recovering Missing Speech Segments in Packet Voice Communications, IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP(34), , December, (1986). 1 Verhelst, W. and Roelands, M. An Overlap-Add Technique Based on Waveform Similarity (W- SOLA) for High Quality Time-Scale Modification of Speech, IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, MN, USA, 2, , April, (1993). 11 Liao, W. T., Chen, J. C. and Chen, M. S. Adaptive Recovery Techniques for Real-Time Audio Streams, Proceedings of the 2th Annual Joint Conference of the IEEE Computer and Communications Socitiies, Anchorage, AK, 2, , April, (21). 12 Medan, Y. Yair, E. and Chazan, D. Super Resolution Pitch Signal Determination of Speech Signal, IEEE Transactions on Signal Processing, 39(1), 4 48, January, (1991). 13 Li, Z. B., Zhao, S. H., Wang, J. and Kuang, J. M. A Side Information Based Packet Loss Recovery Algorithm in VoIP, Congress on Image and Signal Processing, 28, Sanya, China, 5, , May, (28). 14 Manohar, K. and Rao, P. Speech Enhancement in Nonstationary Noise Environments Using Noise Properties, Speech Communication, 48(1), 96 19, January, (26). ICSV21, Beijing, China, July 13-17, 214 8

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS

MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS 1 S.PRASANNA VENKATESH, 2 NITIN NARAYAN, 3 K.SAILESH BHARATHWAAJ, 4 M.P.ACTLIN JEEVA, 5 P.VIJAYALAKSHMI 1,2,3,4,5 SSN College of Engineering,

More information

Transient noise reduction in speech signal with a modified long-term predictor

Transient noise reduction in speech signal with a modified long-term predictor RESEARCH Open Access Transient noise reduction in speech signal a modified long-term predictor Min-Seok Choi * and Hong-Goo Kang Abstract This article proposes an efficient median filter based algorithm

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions

More information

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals 16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract

More information

Monaural and Binaural Speech Separation

Monaural and Binaural Speech Separation Monaural and Binaural Speech Separation DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction CASA approach to sound separation Ideal binary mask as

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Chapter 4 SPEECH ENHANCEMENT

Chapter 4 SPEECH ENHANCEMENT 44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or

More information

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Single Channel Speaker Segregation using Sinusoidal Residual Modeling NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology

More information

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio >Bitzer and Rademacher (Paper Nr. 21)< 1 Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio Joerg Bitzer and Jan Rademacher Abstract One increasing problem for

More information

Audio Watermarking Based on Multiple Echoes Hiding for FM Radio

Audio Watermarking Based on Multiple Echoes Hiding for FM Radio INTERSPEECH 2014 Audio Watermarking Based on Multiple Echoes Hiding for FM Radio Xuejun Zhang, Xiang Xie Beijing Institute of Technology Zhangxuejun0910@163.com,xiexiang@bit.edu.cn Abstract An audio watermarking

More information

Pushpraj Tanwar Research Scholar in ECE Dept. Maulana Azad National Institute of Technology Bhopal, India

Pushpraj Tanwar Research Scholar in ECE Dept. Maulana Azad National Institute of Technology Bhopal, India International Journal of Computer Applications (975 8887) Volume 125 No.5, September 215 Unwanted Transients Reduction in Voice Signal by Applying a Predictor and Spectral Subtraction Process Pushpraj

More information

ROBUST PITCH TRACKING USING LINEAR REGRESSION OF THE PHASE

ROBUST PITCH TRACKING USING LINEAR REGRESSION OF THE PHASE - @ Ramon E Prieto et al Robust Pitch Tracking ROUST PITCH TRACKIN USIN LINEAR RERESSION OF THE PHASE Ramon E Prieto, Sora Kim 2 Electrical Engineering Department, Stanford University, rprieto@stanfordedu

More information

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester SPEECH TO SINGING SYNTHESIS SYSTEM Mingqing Yun, Yoon mo Yang, Yufei Zhang Department of Electrical and Computer Engineering University of Rochester ABSTRACT This paper describes a speech-to-singing synthesis

More information

Pitch Period of Speech Signals Preface, Determination and Transformation

Pitch Period of Speech Signals Preface, Determination and Transformation Pitch Period of Speech Signals Preface, Determination and Transformation Mohammad Hossein Saeidinezhad 1, Bahareh Karamsichani 2, Ehsan Movahedi 3 1 Islamic Azad university, Najafabad Branch, Saidinezhad@yahoo.com

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation

More information

Audio Restoration Based on DSP Tools

Audio Restoration Based on DSP Tools Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract

More information

Bilateral Waveform Similarity Overlap Add Approach based on Time Scale Modification Principle for Packet Loss Concealment of Speech Signals

Bilateral Waveform Similarity Overlap Add Approach based on Time Scale Modification Principle for Packet Loss Concealment of Speech Signals Bilateral Waveform Similarity Overlap Add Approach based on Time Scale Modification Principle for Pacet Loss Concealment of Speech Signals Miss. Rohini D. Patil Research Student, Department of Electronics,

More information

REAL-TIME BROADBAND NOISE REDUCTION

REAL-TIME BROADBAND NOISE REDUCTION REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time

More information

Overview of Code Excited Linear Predictive Coder

Overview of Code Excited Linear Predictive Coder Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech

More information

Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech

Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Project Proposal Avner Halevy Department of Mathematics University of Maryland, College Park ahalevy at math.umd.edu

More information

Wavelet Speech Enhancement based on the Teager Energy Operator

Wavelet Speech Enhancement based on the Teager Energy Operator Wavelet Speech Enhancement based on the Teager Energy Operator Mohammed Bahoura and Jean Rouat ERMETIS, DSA, Université du Québec à Chicoutimi, Chicoutimi, Québec, G7H 2B1, Canada. Abstract We propose

More information

ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1

ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1 ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN SPEECH SIGNALS Zied Mnasri 1, Hamid Amiri 1 1 Electrical engineering dept, National School of Engineering in Tunis, University Tunis El

More information

Open Access Research of Dielectric Loss Measurement with Sparse Representation

Open Access Research of Dielectric Loss Measurement with Sparse Representation Send Orders for Reprints to reprints@benthamscience.ae 698 The Open Automation and Control Systems Journal, 2, 7, 698-73 Open Access Research of Dielectric Loss Measurement with Sparse Representation Zheng

More information

Speech Coding using Linear Prediction

Speech Coding using Linear Prediction Speech Coding using Linear Prediction Jesper Kjær Nielsen Aalborg University and Bang & Olufsen jkn@es.aau.dk September 10, 2015 1 Background Speech is generated when air is pushed from the lungs through

More information

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Ravindra d. Dhage, Prof. Pravinkumar R.Badadapure Abstract M.E Scholar, Professor. This paper presents a speech enhancement method for personal

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,

More information

Introduction of Audio and Music

Introduction of Audio and Music 1 Introduction of Audio and Music Wei-Ta Chu 2009/12/3 Outline 2 Introduction of Audio Signals Introduction of Music 3 Introduction of Audio Signals Wei-Ta Chu 2009/12/3 Li and Drew, Fundamentals of Multimedia,

More information

Online Version Only. Book made by this file is ILLEGAL. 2. Mathematical Description

Online Version Only. Book made by this file is ILLEGAL. 2. Mathematical Description Vol.9, No.9, (216), pp.317-324 http://dx.doi.org/1.14257/ijsip.216.9.9.29 Speech Enhancement Using Iterative Kalman Filter with Time and Frequency Mask in Different Noisy Environment G. Manmadha Rao 1

More information

An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet

An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet Journal of Information & Computational Science 8: 14 (2011) 3027 3034 Available at http://www.joics.com An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet Jianguo JIANG

More information

Real time noise-speech discrimination in time domain for speech recognition application

Real time noise-speech discrimination in time domain for speech recognition application University of Malaya From the SelectedWorks of Mokhtar Norrima January 4, 2011 Real time noise-speech discrimination in time domain for speech recognition application Norrima Mokhtar, University of Malaya

More information

ACOUSTIC feedback problems may occur in audio systems

ACOUSTIC feedback problems may occur in audio systems IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL 20, NO 9, NOVEMBER 2012 2549 Novel Acoustic Feedback Cancellation Approaches in Hearing Aid Applications Using Probe Noise and Probe Noise

More information

A METHOD OF SPEECH PERIODICITY ENHANCEMENT BASED ON TRANSFORM-DOMAIN SIGNAL DECOMPOSITION

A METHOD OF SPEECH PERIODICITY ENHANCEMENT BASED ON TRANSFORM-DOMAIN SIGNAL DECOMPOSITION 8th European Signal Processing Conference (EUSIPCO-2) Aalborg, Denmark, August 23-27, 2 A METHOD OF SPEECH PERIODICITY ENHANCEMENT BASED ON TRANSFORM-DOMAIN SIGNAL DECOMPOSITION Feng Huang, Tan Lee and

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

Signal Processing 91 (2011) Contents lists available at ScienceDirect. Signal Processing. journal homepage:

Signal Processing 91 (2011) Contents lists available at ScienceDirect. Signal Processing. journal homepage: Signal Processing 9 (2) 55 6 Contents lists available at ScienceDirect Signal Processing journal homepage: www.elsevier.com/locate/sigpro Fast communication Minima-controlled speech presence uncertainty

More information

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE

EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE EFFECTS OF PHYSICAL CONFIGURATIONS ON ANC HEADPHONE PERFORMANCE Lifu Wu Nanjing University of Information Science and Technology, School of Electronic & Information Engineering, CICAEET, Nanjing, 210044,

More information

Enhanced Waveform Interpolative Coding at 4 kbps

Enhanced Waveform Interpolative Coding at 4 kbps Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression

More information

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand

More information

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

International Journal of Modern Trends in Engineering and Research   e-issn No.: , Date: 2-4 July, 2015 International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Analysis of Speech Signal Using Graphic User Interface Solly Joy 1, Savitha

More information

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 89 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 666 676 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Comparison of Speech

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

Advanced Signal Processing and Digital Noise Reduction

Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK ~ W I lilteubner L E Y A Partnership between

More information

Audio Imputation Using the Non-negative Hidden Markov Model

Audio Imputation Using the Non-negative Hidden Markov Model Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han 1,, Gautham J. Mysore 2, and Bryan Pardo 1 1 EECS Department, Northwestern University 2 Advanced Technology Labs, Adobe Systems Inc.

More information

Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments

Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments Modified Kalman Filter-based Approach in Comparison with Traditional Speech Enhancement Algorithms from Adverse Noisy Environments G. Ramesh Babu 1 Department of E.C.E, Sri Sivani College of Engg., Chilakapalem,

More information

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding. Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement

More information

Sound pressure level calculation methodology investigation of corona noise in AC substations

Sound pressure level calculation methodology investigation of corona noise in AC substations International Conference on Advanced Electronic Science and Technology (AEST 06) Sound pressure level calculation methodology investigation of corona noise in AC substations,a Xiaowen Wu, Nianguang Zhou,

More information

ACOUSTIC DATA TRANSMISSION IN AIR USING TRANSDUCER ARRAY

ACOUSTIC DATA TRANSMISSION IN AIR USING TRANSDUCER ARRAY ACOUSTIC DATA TRANSMISSION IN AIR USING TRANSDUCER ARRAY Ziying Yu, Zheng Kuang, Ming Wu and Jun Yang State Key Laboratory of Acoustics and Key Laboratory of Noise and Vibration Research, Institute of

More information

Open Access Sparse Representation Based Dielectric Loss Angle Measurement

Open Access Sparse Representation Based Dielectric Loss Angle Measurement 566 The Open Electrical & Electronic Engineering Journal, 25, 9, 566-57 Send Orders for Reprints to reprints@benthamscience.ae Open Access Sparse Representation Based Dielectric Loss Angle Measurement

More information

Implementation of SYMLET Wavelets to Removal of Gaussian Additive Noise from Speech Signal

Implementation of SYMLET Wavelets to Removal of Gaussian Additive Noise from Speech Signal Implementation of SYMLET Wavelets to Removal of Gaussian Additive Noise from Speech Signal Abstract: MAHESH S. CHAVAN, * NIKOS MASTORAKIS, MANJUSHA N. CHAVAN, *** M.S. GAIKWAD Department of Electronics

More information

Speech Enhancement for Nonstationary Noise Environments

Speech Enhancement for Nonstationary Noise Environments Signal & Image Processing : An International Journal (SIPIJ) Vol., No.4, December Speech Enhancement for Nonstationary Noise Environments Sandhya Hawaldar and Manasi Dixit Department of Electronics, KIT

More information

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing

More information

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2 Signal Processing for Speech Applications - Part 2-1 Signal Processing For Speech Applications - Part 2 May 14, 2013 Signal Processing for Speech Applications - Part 2-2 References Huang et al., Chapter

More information

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,

More information

Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks

Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks Australian Journal of Basic and Applied Sciences, 4(7): 2093-2098, 2010 ISSN 1991-8178 Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks 1 Mojtaba Bandarabadi,

More information

Journal of American Science 2015;11(7)

Journal of American Science 2015;11(7) Design of Efficient Noise Reduction Scheme for Secure Speech Masked by Signals Hikmat N. Abdullah 1, Saad S. Hreshee 2, Ameer K. Jawad 3 1. College of Information Engineering, AL-Nahrain University, Baghdad-Iraq

More information

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands Audio Engineering Society Convention Paper Presented at the th Convention May 5 Amsterdam, The Netherlands This convention paper has been reproduced from the author's advance manuscript, without editing,

More information

L19: Prosodic modification of speech

L19: Prosodic modification of speech L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture

More information

Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model

Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model Harjeet Kaur Ph.D Research Scholar I.K.Gujral Punjab Technical University Jalandhar, Punjab, India Rajneesh Talwar Principal,Professor

More information

Time-Frequency Enhancement Technique for Bevel Gear Fault Diagnosis

Time-Frequency Enhancement Technique for Bevel Gear Fault Diagnosis Time-Frequency Enhancement Technique for Bevel Gear Fault Diagnosis Dennis Hartono 1, Dunant Halim 1, Achmad Widodo 2 and Gethin Wyn Roberts 3 1 Department of Mechanical, Materials and Manufacturing Engineering,

More information

HUMAN speech is frequently encountered in several

HUMAN speech is frequently encountered in several 1948 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 7, SEPTEMBER 2012 Enhancement of Single-Channel Periodic Signals in the Time-Domain Jesper Rindom Jensen, Student Member,

More information

IN a natural environment, speech often occurs simultaneously. Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation

IN a natural environment, speech often occurs simultaneously. Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 5, SEPTEMBER 2004 1135 Monaural Speech Segregation Based on Pitch Tracking and Amplitude Modulation Guoning Hu and DeLiang Wang, Fellow, IEEE Abstract

More information

Acoustic Echo Cancellation using LMS Algorithm

Acoustic Echo Cancellation using LMS Algorithm Acoustic Echo Cancellation using LMS Algorithm Nitika Gulbadhar M.Tech Student, Deptt. of Electronics Technology, GNDU, Amritsar Shalini Bahel Professor, Deptt. of Electronics Technology,GNDU,Amritsar

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT

A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT L. Koenig (,2,3), R. André-Obrecht (), C. Mailhes (2) and S. Fabre (3) () University of Toulouse, IRIT/UPS, 8 Route de Narbonne, F-362 TOULOUSE

More information

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 o Music signal characteristics o Perceptual attributes and acoustic properties o Signal representations for pitch detection o STFT o Sinusoidal model o

More information

SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING

SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING K.Ramalakshmi Assistant Professor, Dept of CSE Sri Ramakrishna Institute of Technology, Coimbatore R.N.Devendra Kumar Assistant

More information

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Vol., No. 6, 0 Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA chen.zhixin.mt@gmail.com Abstract This paper

More information

Joint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W.

Joint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W. Joint dereverberation and residual echo suppression of speech signals in noisy environments Habets, E.A.P.; Gannot, S.; Cohen, I.; Sommen, P.C.W. Published in: IEEE Transactions on Audio, Speech, and Language

More information

Research Article Subband DCT and EMD Based Hybrid Soft Thresholding for Speech Enhancement

Research Article Subband DCT and EMD Based Hybrid Soft Thresholding for Speech Enhancement Advances in Acoustics and Vibration, Article ID 755, 11 pages http://dx.doi.org/1.1155/1/755 Research Article Subband DCT and EMD Based Hybrid Soft Thresholding for Speech Enhancement Erhan Deger, 1 Md.

More information

Real Time Noise Suppression in Social Settings Comprising a Mixture of Non-stationary and Transient Noise

Real Time Noise Suppression in Social Settings Comprising a Mixture of Non-stationary and Transient Noise th European Signal Processing Conference (EUSIPCO) Real Noise Suppression in Social Settings Comprising a Mixture of Non-stationary and Transient Noise Pei Chee Yong, Sven Nordholm Department of Electrical

More information

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical

More information

Speech Enhancement Based On Noise Reduction

Speech Enhancement Based On Noise Reduction Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion

More information

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech

More information

Vibration Signal Pre-processing For Spall Size Estimation in Rolling Element Bearings Using Autoregressive Inverse Filtration

Vibration Signal Pre-processing For Spall Size Estimation in Rolling Element Bearings Using Autoregressive Inverse Filtration Vibration Signal Pre-processing For Spall Size Estimation in Rolling Element Bearings Using Autoregressive Inverse Filtration Nader Sawalhi 1, Wenyi Wang 2, Andrew Becker 2 1 Prince Mahammad Bin Fahd University,

More information

Available online at ScienceDirect. Procedia Computer Science 54 (2015 )

Available online at   ScienceDirect. Procedia Computer Science 54 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 54 (2015 ) 574 584 Eleventh International Multi-Conference on Information Processing-2015 (IMCIP-2015) Speech Enhancement

More information

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds

More information

Mikko Myllymäki and Tuomas Virtanen

Mikko Myllymäki and Tuomas Virtanen NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,

More information

Speech Signal Enhancement Techniques

Speech Signal Enhancement Techniques Speech Signal Enhancement Techniques Chouki Zegar 1, Abdelhakim Dahimene 2 1,2 Institute of Electrical and Electronic Engineering, University of Boumerdes, Algeria inelectr@yahoo.fr, dahimenehakim@yahoo.fr

More information

Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation

Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation 1 Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation Zhangli Chen* and Volker Hohmann Abstract This paper describes an online algorithm for enhancing monaural

More information

Fundamental Frequency Detection

Fundamental Frequency Detection Fundamental Frequency Detection Jan Černocký, Valentina Hubeika {cernocky ihubeika}@fit.vutbr.cz DCGM FIT BUT Brno Fundamental Frequency Detection Jan Černocký, Valentina Hubeika, DCGM FIT BUT Brno 1/37

More information

Sound Synthesis Methods

Sound Synthesis Methods Sound Synthesis Methods Matti Vihola, mvihola@cs.tut.fi 23rd August 2001 1 Objectives The objective of sound synthesis is to create sounds that are Musically interesting Preferably realistic (sounds like

More information

EC209 - Improving Signal-To-Noise Ratio (SNR) for Optimizing Repeatable Auditory Brainstem Responses

EC209 - Improving Signal-To-Noise Ratio (SNR) for Optimizing Repeatable Auditory Brainstem Responses EC209 - Improving Signal-To-Noise Ratio (SNR) for Optimizing Repeatable Auditory Brainstem Responses Aaron Steinman, Ph.D. Director of Research, Vivosonic Inc. aaron.steinman@vivosonic.com 1 Outline Why

More information

Guan, L, Gu, F, Shao, Y, Fazenda, BM and Ball, A

Guan, L, Gu, F, Shao, Y, Fazenda, BM and Ball, A Gearbox fault diagnosis under different operating conditions based on time synchronous average and ensemble empirical mode decomposition Guan, L, Gu, F, Shao, Y, Fazenda, BM and Ball, A Title Authors Type

More information

Speech Compression Using Voice Excited Linear Predictive Coding

Speech Compression Using Voice Excited Linear Predictive Coding Speech Compression Using Voice Excited Linear Predictive Coding Ms.Tosha Sen, Ms.Kruti Jay Pancholi PG Student, Asst. Professor, L J I E T, Ahmedabad Abstract : The aim of the thesis is design good quality

More information

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com

More information

Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking

Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking The 7th International Conference on Signal Processing Applications & Technology, Boston MA, pp. 476-480, 7-10 October 1996. Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic

More information

Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts

Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts POSTER 25, PRAGUE MAY 4 Testing of Objective Audio Quality Assessment Models on Archive Recordings Artifacts Bc. Martin Zalabák Department of Radioelectronics, Czech Technical University in Prague, Technická

More information

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation Peter J. Murphy and Olatunji O. Akande, Department of Electronic and Computer Engineering University

More information

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Brochure More information from http://www.researchandmarkets.com/reports/569388/ Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications Description: Multimedia Signal

More information

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012 Signal segmentation and waveform characterization Biosignal processing, 5173S Autumn 01 Short-time analysis of signals Signal statistics may vary in time: nonstationary how to compute signal characterizations?

More information

A LPC-PEV Based VAD for Word Boundary Detection

A LPC-PEV Based VAD for Word Boundary Detection 14 A LPC-PEV Based VAD for Word Boundary Detection Syed Abbas Ali (A), NajmiGhaniHaider (B) and Mahmood Khan Pathan (C) (A) Faculty of Computer &Information Systems Engineering, N.E.D University of Engg.

More information

ORTHOGONAL frequency division multiplexing (OFDM)

ORTHOGONAL frequency division multiplexing (OFDM) 144 IEEE TRANSACTIONS ON BROADCASTING, VOL. 51, NO. 1, MARCH 2005 Performance Analysis for OFDM-CDMA With Joint Frequency-Time Spreading Kan Zheng, Student Member, IEEE, Guoyan Zeng, and Wenbo Wang, Member,

More information