Speech Enhancement using Wiener filtering
S. Chirtmay and M. Tahernezhadi
Department of Electrical Engineering, Northern Illinois University, DeKalb, IL

ABSTRACT

The problem of reducing the disturbing effects of additive white noise on a speech signal is considered when a noise reference is not available. Wiener filtering with all-pole modeling built upon line spectral pair (LSP) frequencies is considered. The filter parameters have been optimized to achieve the highest reduction of noise. The noise is filtered using iterative LSP-based estimation of the LPC parameters. The speech model filter uses an accurate, updated estimate of the current noise power spectral density with the aid of a voice activity detector.

I. INTRODUCTION

The problem examined here is the enhancement of speech disturbed by additive noise. The basic assumption is that the enhancement system does not have access to any signal other than the corrupted speech itself. That is, no noise reference signal is available, which could allow one to employ classical adaptive noise canceling [1]. The objective of obtaining higher quality and/or intelligibility of the noisy speech may have a fundamental impact on applications like speech compression, speech recognition, and speaker verification, by improving the performance of the relevant digital voice processor. The technique considered in this paper is based on the all-pole model of the vocal tract and uses the estimated coefficients to process the noisy speech with a Wiener filter. This is a new and improved iterative speech enhancement technique based on spectral constraints. The iterative technique, originally formulated by Lim and Oppenheim [2], attempts to solve for the maximum likelihood estimate of a speech waveform in additive white noise using linear predictive coding (LPC). Thus the LPC parameters are estimated using the output of the Wiener filter.
The LPC model poles from this estimation are complex numbers in the z-plane, and complex poles cannot readily be associated across frames for inter-frame smoothing; the LPC poles are therefore represented as line spectral pairs (LSPs). Inter-frame spectral constraints are applied to the LSP parameters across time on a fixed-frame basis. These constraints are applied to ensure that vocal tract characteristics do not vary wildly from frame to frame when speech is present. This method allows constraints to be efficiently applied to speech model pole movements across time so that formants lie along smooth tracks.

The pole positions of an N-th order LPC model are equivalently represented by a set of N/2 LSP position roots and N/2 difference roots. The position roots (P) and the difference roots (Q) represent lossless models of the vocal tract with the glottis closed and open, respectively. They lie on the unit circle in the complex z-plane. The lightly damped formant locations in the signal's LPC model spectrum are highly correlated with the LSP position roots, and the bandwidths of the LPC spectrum at these formants are highly correlated with the LSP difference roots. For a stable LPC model, there is a root at z = -1 for P and a root at z = 1 for Q, and the P and Q roots alternate around the unit circle.

At each iteration, each LPC pole of the speech estimate is smoothed across neighboring frames using its LSP equivalent roots. A lower bound on the minimum distance of a difference root to the adjacent position root is applied to restrain the sharpness of any LPC model's formant so that it remains speech-like. Here, we have considered one future frame and one past frame for smoothing in a particular iteration. The smoothed LSP roots are then converted back to smoothed LPC parameters. The smoothed LPC model power spectrum and the current noise power estimate are used to obtain the next iteration of the Wiener filter.
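The LPC-to-LSP round trip described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`lpc_to_lsp`, `lsp_to_lpc`, `smooth_lsp`) and the three-frame smoothing weights are hypothetical, the LPC polynomial is taken as A(z) = 1 + a1 z^-1 + ... + aN z^-N with N even, and the minimum-distance constraint on the difference roots is not included.

```python
import numpy as np

def lpc_to_lsp(a, eps=1e-8):
    """Return (position_angles, difference_angles) in radians for a stable A(z).

    a = [1, a1, ..., aN] with N even. The angles are the upper-half-plane
    unit-circle roots of P(z) and Q(z), excluding the fixed roots at z = -1 and z = +1.
    """
    a_ext = np.concatenate([np.asarray(a, dtype=float), [0.0]])
    p_poly = a_ext + a_ext[::-1]   # P(z) = A(z) + z^-(N+1) A(1/z), root at z = -1
    q_poly = a_ext - a_ext[::-1]   # Q(z) = A(z) - z^-(N+1) A(1/z), root at z = +1

    def upper_angles(poly):
        ang = np.angle(np.roots(poly))
        return np.sort(ang[(ang > eps) & (ang < np.pi - eps)])

    return upper_angles(p_poly), upper_angles(q_poly)

def lsp_to_lpc(p_angles, q_angles):
    """Rebuild [1, a1, ..., aN] from the LSP angles via A(z) = (P(z) + Q(z)) / 2."""
    p_poly = np.array([1.0, 1.0])      # factor (1 + z^-1)
    for w in p_angles:
        p_poly = np.convolve(p_poly, [1.0, -2.0 * np.cos(w), 1.0])
    q_poly = np.array([1.0, -1.0])     # factor (1 - z^-1)
    for w in q_angles:
        q_poly = np.convolve(q_poly, [1.0, -2.0 * np.cos(w), 1.0])
    return 0.5 * (p_poly + q_poly)[:-1]

def smooth_lsp(prev, cur, nxt, weights=(0.25, 0.5, 0.25)):
    """Smooth LSP angles over one past and one future frame (weights are an assumption)."""
    w0, w1, w2 = weights
    return w0 * np.asarray(prev) + w1 * np.asarray(cur) + w2 * np.asarray(nxt)

# Example: a 4th-order model built from two pole pairs inside the unit circle.
poles = [0.9 * np.exp(1j * 0.5), 0.9 * np.exp(-1j * 0.5),
         0.8 * np.exp(1j * 1.5), 0.8 * np.exp(-1j * 1.5)]
a = np.real(np.poly(poles))
p, q = lpc_to_lsp(a)
a_back = lsp_to_lpc(p, q)
```

Because the smoothing is a convex combination of angle sets that each interlace, the smoothed P and Q angles still alternate, so the rebuilt LPC model remains stable.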
The output from the previous Wiener filter iteration is used along with the original input data to get a less muffled-sounding speech estimate, with a tradeoff of slightly increased residual noise in the output. When we initially input the noisy signal to the Wiener filter, we also input the fast Fourier transform (FFT) of the signal to the Voice Activity Detector (VAD).

II. ALGORITHM

We know that over a given frame of speech, say [...] coefficients. The method by Lim and Oppenheim is based on maximum a posteriori (MAP) estimation of the LP coefficients, gain, and noise-free speech. The method is an iterative one in which the LP parameters and speech frame are repeatedly re-estimated. It is assumed that all unknown
parameters are random with a priori Gaussian pdfs. The resulting MAP estimator, which maximizes the conditional pdf of the parameters given the observations, corresponds to the solution of a set of nonlinear equations for the additive white Gaussian noise (AWGN) case. In the noisy case, the estimator requires that $a_k$, $g_k$, and $s_I$ be chosen to maximize the pdf $p(a_k, g_k, s_I \mid y)$. Essentially, we wish to perform joint MAP estimation of the LP speech modeling parameters and the noise-free speech by maximizing the joint probability density $p(a_k, s_k \mid y, g_k, s_I)$, where the terms $g_k$ and $s_I$ are assumed to be known (or estimated). Lim and Oppenheim consider a suboptimal solution using sequential MAP estimation of $s_k$, followed by MAP estimation of $a_k$, $g_k$ given $s_k$. The sequential estimation procedure is linear at each iteration and continues until some criterion is satisfied. With further simplifying assumptions, it can be shown that MAP estimation of $s_k$ is equivalent to noncausal Wiener filtering of the noisy speech $y$. Lim and Oppenheim showed that this technique, under certain conditions, increases the joint likelihood of $a_k$ and $s_k$ with each iteration. It can also be shown to be the optimal solution in the MSE sense for a [...]

The input noisy signal is digitized at a rate of 8 kHz, and the time series is processed in frames. The number of samples considered per frame is 256 (32 ms) for determining the speech signal. The noise spectral density, or the noise variance in the white Gaussian case, must be estimated during non-speech activity.

The resulting equation for estimating the noise-free speech is simply the optimum Wiener filter [...], where the extra index $k$ is included to indicate the $k$-th iteration. If the Gaussian assumption on the parameters holds, this is the optimum processor in an MSE sense. If the Gaussian assumption does not hold, this filter is the best linear processor for obtaining the next speech estimate. With this relation, sequential MAP estimation of the LP parameters and the speech frame generally follows these steps:

Step 1. Estimate $a_k$ from $s_k$ using either:
a.) the first M values as the initial condition vector, or
b.) always assume a zero initial condition, $s_k = 0$.

Step 2. Estimate $s_k(N)$ given the present estimate $a_k(N)$.
a.) Using the estimated $a_k$, estimate the speech spectrum: [...]

Here $N$ is the length of the signal frame and $k$ is the iteration number. The first step is performed via LP parameter estimation and the second step through adaptive Wiener filtering. The final implementation of the algorithm is presented below.
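The two-step iteration can be sketched as below. The filter equation did not survive in this transcription, so the sketch assumes the standard noncausal Wiener gain H(w) = P_s(w) / (P_s(w) + sigma^2), with the all-pole speech spectrum P_s(w) = g^2 / |A(e^{jw})|^2. The function names, the model order of 10, the iteration count, and the use of the prediction-error power as the gain term are all assumptions made for illustration. The LSP smoothing and the VAD-driven noise update are left out.

```python
import numpy as np

def lpc_autocorr(x, order):
    """Autocorrelation-method LPC. Returns (a, g2) for A(z) = 1 - sum_k a[k-1] z^-k,
    where g2 is the prediction-error power used here as the model gain."""
    n = len(x)
    r = np.array([np.dot(x[:n - k], x[k:]) for k in range(order + 1)]) / n
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += (1e-9 * r[0] + 1e-12) * np.eye(order)   # light diagonal loading for stability
    a = np.linalg.solve(R, r[1:])
    g2 = r[0] - np.dot(a, r[1:])
    return a, max(g2, 1e-12)

def iterative_wiener(y, noise_var, order=10, n_iter=3):
    """Alternate LP estimation (step 1) and noncausal Wiener filtering (step 2)."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    Y = np.fft.rfft(y)
    s = y.copy()                                  # initial speech estimate: the noisy frame
    for _ in range(n_iter):
        a, g2 = lpc_autocorr(s, order)            # step 1: LP parameters from current estimate
        A = np.fft.rfft(np.concatenate([[1.0], -a]), n)
        Ps = g2 / np.abs(A) ** 2                  # all-pole speech power spectrum
        H = Ps / (Ps + noise_var)                 # Wiener gain for white noise of known variance
        s = np.fft.irfft(H * Y, n)                # step 2: always filter the original noisy frame
    return s

# Example on one 256-sample frame: a synthetic resonant AR(2) signal at 10 dB SNR.
rng = np.random.default_rng(0)
n, warm = 256, 200
e = rng.standard_normal(n + warm)
clean = np.zeros(n + warm)
c1, c2 = 2 * 0.95 * np.cos(0.3 * np.pi), -0.95 ** 2
for i in range(2, n + warm):
    clean[i] = c1 * clean[i - 1] + c2 * clean[i - 2] + e[i]
clean = clean[warm:]
noise_var = np.var(clean) / 10.0
y = clean + np.sqrt(noise_var) * rng.standard_normal(n)
s_hat = iterative_wiener(y, noise_var)
err_in = np.mean((y - clean) ** 2)
err_out = np.mean((s_hat - clean) ** 2)
```

Filtering is done in the frequency domain with a real, zero-phase gain, which is one simple way to realize a noncausal filter on a single frame. A real system would add windowing and overlap between frames.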
Voice Activity Detector

The most critical component of the system is the VAD. A VAD operating in a mobile environment must be able to detect speech in the presence of a range of very diverse types of acoustic background noise. The biggest difficulty in detecting speech arises at very low signal-to-noise ratios. Thus, a VAD based on the spectral characteristics of the input signal is used in this paper. The block diagram of this VAD is shown in Fig. 1. It incorporates an inverse filter, the coefficients of which are derived during noise-only periods. When speech is present, the noise is attenuated by the filter, leaving only speech. The energy of the inverse-filtered signal is compared to a threshold which is updated only during noise-only periods. This threshold rides above the energy of the noise signal after it has been filtered. If the energy is greater than the threshold, then speech is detected.

There are variables which need to be updated only when noise is present, but it is obviously dangerous to use the output of the VAD to decide when to update them, because this output is itself a function of these variables. For this we use a secondary VAD. The secondary VAD detects only noise periods but does not endpoint the speech. It makes its decision based on the fact that if the frames have a similar spectral shape for a long period of time, then the input is either noise or a sustained speech sound. Therefore, if the distortion between frames is below a fixed threshold for a sufficiently long period of time, it is assumed that noise has been detected, unless a steady pitch component has been detected, in which case the input was probably a vowel sound. Another criterion we have used for this VAD is that if the above VAD detects a noise frame in between 6 speech frames, or before 3 speech frames, or after 3 speech frames, then that frame is considered to be unvoiced and is finally treated as a speech frame. This is a fail-safe for cases where it is not possible to distinguish between unvoiced speech and noise.

III. SIMULATION RESULTS

The purpose of this computer simulation is to test the performance of the above technique. The first step is to test the performance of the VAD. We need the VAD to distinguish whether the current frame is pure noise or noisy speech. When the SNR is too low, it is very hard to distinguish noise frames from unvoiced frames. We use the sentence "Don't ask me to carry an oily rag like that" with added noise to test the performance of the simulation programs. We also compare the simulation results at different SNRs. At larger SNR, the processed speech has better quality. Fig. 3 shows the output for noisy input speech at an SNR of 10 dB. We have also performed speech enhancement at an SNR of about 5 dB, but as the SNR becomes considerably lower it is impossible to distinguish between unvoiced speech and noise. Hence this algorithm does not work for very low SNR. Also, an overall SNR of 5 dB or lower means a very low SNR for the unvoiced portions of speech. Thus this aspect also needs to be considered in future study. The table below shows the input SNR and output SNR for voiced, unvoiced, and noise frames. We can see that the enhancement is about 7-8 dB.
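The primary VAD decision and the fail-safe relabeling rule can be sketched as follows. This is only one reading of the description, under several assumptions: the noise-only flags are supplied by a secondary detector that is not implemented here, the inverse filter coefficients are given, and the threshold margin, the smoothing constant, and the function names are hypothetical. The relabeling rule is interpreted as "a noise frame directly preceded or directly followed by three speech frames becomes a speech frame".

```python
import numpy as np

def energy_vad(frames, inv_filter, noise_only, margin=3.0, alpha=0.9):
    """Flag a frame as speech when its inverse-filtered energy exceeds a threshold.

    The threshold is margin * (smoothed noise energy) and is updated only on
    frames that the secondary detector marks as noise-only.
    """
    noise_energy = None
    decisions = []
    for frame, is_noise in zip(frames, noise_only):
        residual = np.convolve(frame, inv_filter, mode="valid")
        energy = float(np.mean(residual ** 2))
        if is_noise:
            if noise_energy is None:
                noise_energy = energy
            else:
                noise_energy = alpha * noise_energy + (1.0 - alpha) * energy
        threshold = np.inf if noise_energy is None else margin * noise_energy
        decisions.append(energy > threshold)
    return np.array(decisions)

def relabel_near_speech(is_speech, run=3):
    """Relabel a noise frame as speech if the `run` frames just before it,
    or the `run` frames just after it, were all detected as speech."""
    is_speech = np.asarray(is_speech, dtype=bool)
    out = is_speech.copy()
    n = len(is_speech)
    for i in np.flatnonzero(~is_speech):
        before = i >= run and is_speech[i - run:i].all()
        after = i + run < n and is_speech[i + 1:i + 1 + run].all()
        if before or after:
            out[i] = True
    return out

# Example: five noise-only frames followed by four tone-plus-noise frames of 256 samples.
rng = np.random.default_rng(1)
noise_frames = [0.1 * rng.standard_normal(256) for _ in range(5)]
tone = np.sin(0.3 * np.arange(256))
speech_frames = [tone + 0.1 * rng.standard_normal(256) for _ in range(4)]
frames = noise_frames + speech_frames
noise_only = [True] * 5 + [False] * 4
decisions = energy_vad(frames, inv_filter=np.array([1.0]), noise_only=noise_only)
relabeled = relabel_near_speech([True, True, True, False, True, True, True,
                                 False, False, False, False])
```

With white background noise the inverse filter reduces to the identity, which is why the example passes `[1.0]`. For colored noise the coefficients would come from an LPC fit to noise-only frames.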
Fig. 2 - Table for speech enhancement
Fig. 3 - Original speech and the enhanced speech

REFERENCES

1) John H. L. Hansen and Mark A. Clements, "Constrained Iterative Speech Enhancement with Application to Automatic Speech Recognition," IEEE.
2) Jae S. Lim and Alan V. Oppenheim, "Enhancement and Bandwidth Compression of Noisy Speech," invited paper, IEEE.
3) John H. L. Hansen and Levent M. Arslan, "Robust Feature-Estimation and Objective Quality Assessment for Noisy Speech Recognition Using the Credit Card Corpus," IEEE Trans. Speech and Audio Processing, Vol. 3, No. 3, May.
4) Woodson Wynn, "Transmitted Noise Reduction in Communications Systems," Patent Cooperation Treaty.
COMPRESSIVE SAMPLING OF SPEECH SIGNALS by Mona Hussein Ramadan BS, Sebha University, 25 Submitted to the Graduate Faculty of Swanson School of Engineering in partial fulfillment of the requirements for
More informationPerformance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment
BABU et al: VOICE ACTIVITY DETECTION ALGORITHM FOR ROBUST SPEECH RECOGNITION SYSTEM Journal of Scientific & Industrial Research Vol. 69, July 2010, pp. 515-522 515 Performance analysis of voice activity
More informationHIGH ORDER MODULATION SHAPED TO WORK WITH RADIO IMPERFECTIONS
HIGH ORDER MODULATION SHAPED TO WORK WITH RADIO IMPERFECTIONS Karl Martin Gjertsen 1 Nera Networks AS, P.O. Box 79 N-52 Bergen, Norway ABSTRACT A novel layout of constellations has been conceived, promising
More informationRobust Linear Prediction Analysis for Low Bit-Rate Speech Coding
Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Nanda Prasetiyo Koestoer B. Eng (Hon) (1998) School of Microelectronic Engineering Faculty of Engineering and Information Technology Griffith
More informationVQ Source Models: Perceptual & Phase Issues
VQ Source Models: Perceptual & Phase Issues Dan Ellis & Ron Weiss Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA {dpwe,ronw}@ee.columbia.edu
More informationA Sphere Decoding Algorithm for MIMO
A Sphere Decoding Algorithm for MIMO Jay D Thakar Electronics and Communication Dr. S & S.S Gandhy Government Engg College Surat, INDIA ---------------------------------------------------------------------***-------------------------------------------------------------------
More informationVocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA
Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA ECE-492/3 Senior Design Project Spring 2015 Electrical and Computer Engineering Department Volgenau
More informationSynthesis Algorithms and Validation
Chapter 5 Synthesis Algorithms and Validation An essential step in the study of pathological voices is re-synthesis; clear and immediate evidence of the success and accuracy of modeling efforts is provided
More informationDigital Signal Processing of Speech for the Hearing Impaired
Digital Signal Processing of Speech for the Hearing Impaired N. Magotra, F. Livingston, S. Savadatti, S. Kamath Texas Instruments Incorporated 12203 Southwest Freeway Stafford TX 77477 Abstract This paper
More informationAnalysis of LMS and NLMS Adaptive Beamforming Algorithms
Analysis of LMS and NLMS Adaptive Beamforming Algorithms PG Student.Minal. A. Nemade Dept. of Electronics Engg. Asst. Professor D. G. Ganage Dept. of E&TC Engg. Professor & Head M. B. Mali Dept. of E&TC
More informationComparison of CELP speech coder with a wavelet method
University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2006 Comparison of CELP speech coder with a wavelet method Sriram Nagaswamy University of Kentucky, sriramn@gmail.com
More informationEffects of Reverberation on Pitch, Onset/Offset, and Binaural Cues
Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation
More informationComputational Complexity of Multiuser. Receivers in DS-CDMA Systems. Syed Rizvi. Department of Electrical & Computer Engineering
Computational Complexity of Multiuser Receivers in DS-CDMA Systems Digital Signal Processing (DSP)-I Fall 2004 By Syed Rizvi Department of Electrical & Computer Engineering Old Dominion University Outline
More informationCHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS
66 CHAPTER 4 VOICE ACTIVITY DETECTION ALGORITHMS 4.1 INTRODUCTION New frontiers of speech technology are demanding increased levels of performance in many areas. In the advent of Wireless Communications
More informationA LPC-PEV Based VAD for Word Boundary Detection
14 A LPC-PEV Based VAD for Word Boundary Detection Syed Abbas Ali (A), NajmiGhaniHaider (B) and Mahmood Khan Pathan (C) (A) Faculty of Computer &Information Systems Engineering, N.E.D University of Engg.
More informationEE 6422 Adaptive Signal Processing
EE 6422 Adaptive Signal Processing NANYANG TECHNOLOGICAL UNIVERSITY SINGAPORE School of Electrical & Electronic Engineering JANUARY 2009 Dr Saman S. Abeysekera School of Electrical Engineering Room: S1-B1c-87
More informationMonophony/Polyphony Classification System using Fourier of Fourier Transform
International Journal of Electronics Engineering, 2 (2), 2010, pp. 299 303 Monophony/Polyphony Classification System using Fourier of Fourier Transform Kalyani Akant 1, Rajesh Pande 2, and S.S. Limaye
More informationStudy of Turbo Coded OFDM over Fading Channel
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 3, Issue 2 (August 2012), PP. 54-58 Study of Turbo Coded OFDM over Fading Channel
More informationA Survey and Evaluation of Voice Activity Detection Algorithms
A Survey and Evaluation of Voice Activity Detection Algorithms Seshashyama Sameeraj Meduri (ssme09@student.bth.se, 861003-7577) Rufus Ananth (anru09@student.bth.se, 861129-5018) Examiner: Dr. Sven Johansson
More informationImproved Detection by Peak Shape Recognition Using Artificial Neural Networks
Improved Detection by Peak Shape Recognition Using Artificial Neural Networks Stefan Wunsch, Johannes Fink, Friedrich K. Jondral Communications Engineering Lab, Karlsruhe Institute of Technology Stefan.Wunsch@student.kit.edu,
More information