Multichannel Wiener Filtering for Speech Enhancement in Modulation Domain
|
|
- Audrey Mitchell
- 5 years ago
- Views:
Transcription
1 Multichannel Wiener Filtering for Speech Enhancement in Modulation Domain Muhammad Awais This thesis is presented as part of Degree of Master of Sciences in Electrical ering with emphasis on Signal Processing Blekinge Institute of Technology School of ering Department of Electrical ering Blekinge Institute of Technology, Sweden Supervisor: Dr. Benny Sällberg Examiner: Dr. Benny Sällberg
2 Contact Information: Author: Muhammad Awais Supervisor and Examiner: Dr.Benny Sällberg Signal Processing Group (SPG) School of ering, BTH Blekinge Institute of Technology, Sweden
3 Abstract Normally speech signals are contaminated with noise and interference that reduces the intelligibility of speech during communication. In order to make speech signals effective and useful, they need to be enhanced from the noisy speech signal. In speech processing field many speech enhancement techniques are developed and are providing very good results. Multichannel microphone array is also one of the techniques used for speech enhancement, that provides better results than the single channel speech enhancement. Moreover, Wiener filtering is the most commonly used technique for multichannel microphone array for speech enhancement. The main focus of this thesis is to implement multichannel microphone array using Wiener filtering in the modulation domain system and also in the time domain system to enhance the speech. Both are implemented successfully and validated effectively by considering different parameters, i.e., Signal to Noise Ratio (SNR), Signal to Noise Ratio Improvement (SNRI) and Mean Opinion Score (MOS). Both the systems are tested with one female and one male speech distorted with different types of noise at -5dB,dB,5dB,dB,5dB and db SNR. The values are calculated for the different number of subbands, as the main focus is on modulation domain system. It provided efficient results in terms of SNRI and MOS score. While listening to modulation system, results are clearer than time domain. MOS has been shown between 3.9 and. for both systems.
4 iv
5 Acknowledgements I would like to thanks almighty God for blessing me with health, peace of mind and patience to carry out this thesis work. He gave me this ability and opportunity to understand, learn and gain the in-depth knowledge at the Blekinge Institute of Technology (BTH). I sincerely appreciate the way my supervisor Dr. Benny Sällberg helped me during whole thesis work. He guided me throughout my work in a very nice way. His deep knowledge in this field allowed me to learn and to carry out thesis work smoothly. His knowledge and guidance is of great value to me. I would not have been able to do this work without the support and encouragement from my parents and family.they helped me throughout my educational carrier and motivated me. They supported me both financially and morally. I would like to dedicate my till date education carrier to them. I would also like to pay my regards to my friends and faculty at BTH. I was able to learn a great knowledge and wisdom in a pleasant learning environment. Muhammad Awais, Sweden
6
7 Contents Abstract Acknowledgements iii v Introduction. Introduction Contents Modulation Domain 3. Introduction Non-Coherent Envelope Detection Hilbert Envelope Detection Magnitude Detection Limitations of Non-coherent Envelope Detection Motivation for Complex Valued Modulator Coherent Envelope Detection Smoothed Hilbert Carrier Estimator Instantaneous Frequency Carrier Estimator Frequency Reassignment Carrier Estimator Spectral Center of Gravity MultiChannel Speech Enhancement 3 3. Single Channel Speech Enhancement Microphone Array Speech Enhancement Multichannel Wiener Filtering in Time Domain Multichannel Wiener Filtering in Modulation Domain... 6 Evaluation 9. Consideration Results Results without Modulation Domain Results with Modulation Domain Spectrogram Analysis for Modulation Domain vii
8 viii 5 Conclusion 7 Bibliography 9
9 Chapter Introduction. Introduction Speech signals are low frequency modulating signals which modulate high frequency carriers. From several studies it is observed that the modulating signal of speech is very important for speech reception. Speech signal can be represented by x(t) = m(t)c(t) (.) where, m(t) is the modulator signal and c(t) is the carrier signal. New applications are emerging for speech acquisitions with the development in speech processing technologies and the vogue of telecommunications. The lust for improving interactivity between individuals, is motivated while providing flexibility, quality and ease of use. Increasingly between personal computers and communication devices, telephone and other interactive devices are powered by voice. To improve speech intelligibility in noise, speech processing techniques are proven very effective. Normally receiver is at remote distance from the speech transmitter, that causes the speech to be computed with environment noise, interference and reverberation from walls or ceilings []. Hence, speech enhancement techniques should provide speech dereverberation and efficient noise reduction. In speech processing field there are many techniques proposed for handling these issues. Echo cancelation is used widely in the last decades. In reverberant environment speech enhancement is used. Background noise reduction methods using one microphone [ ]and using multiple microphones are addressed [5 9]. Multiple microphone also called microphone array, which is derived from classical signal processing. For instance blind source separation leads to speech separation algorithms. Spectral subtraction, signal subspace, adaptive gain equalizer etc., are the other techniques. The spatial correlation of multiple received signals has enabled the combined temporal and spatial filtering algorithms known as beamforming techniques [5]. Multichannel array using Wiener filtering for speech
10 Chapter. Introduction enhancement has already been implemented in the time frequency domain. In this thesis, I have implemented the same but also implemented in the modulation system domain and compared the results. The modulation system is to use Wiener filter on modulators of every subband. The modulation domain is mainly research on modulator signal which is discussed in detail in chapter. Multichannel Microphone Array Wiener Filtering for Speech Enhancement is implemented in time frequency domain as well as in modulation domain discussed in detail in chapter 3.. Contents This thesis consists of five chapters.chapter describes about modulation domain. Chapter 3 is about single and multichannel algorithms, mainly multichannel Wiener filtering. Chapter evaluates multichannel techniques using Wiener filtering in both domains. Finally in chapter 5 thesis is concluded.
11 Chapter Modulation Domain. Introduction Normally natural signals are represented as low frequency modulators which modulate high frequency carriers. Generally signal can be represented by s(t) = m(t)c(t) (.) where, m(t) and c(t) are modulator and carrier respectively. From several studies it is observed that the modulator of speech signal is very important for speech reception. For intelligible speech it is necessary to preserve modulator of speech. The modulation domain system basically splits the signal into its modulator and carrier and then modulator is analyzed. A method commonly used to obtain the modulating signal of broadband signals is to divide the signal into narrowband frequency subbands, using filterbanks and then decomposes each subband into carrier and a modulating signal. To decompose each subband into modulator and carrier, coherent and non-coherent envelope detection methods are used. In this thesis coherent envelope detection estimating carrier using the Center of Gravity (COG) [] method is used to decompose signal into modulator and carrier. The generalized model of modulation domain system is presented in figure.. In the modulation domain system filterbank is the heart of the whole domain. Broadband signal, say x(t) passes through filterbank which is set of LTI bandpass filters h k (t), x k (t) = x(t) h k (t) for k =,..., K. The resulting subbands from filterbank decomposes to modulator and carrier by two types of envelope detection, coherent and non-coherent. Then achieved modulator signal from each subband is filtered by LTI filter g k (t), m k (t) = m k (t) g(t) (.) After this, the modulators recombined with the original subband carriers is x k (t) = m k (t)c k (t) (.3) 3
12 Chapter. Modulation Domain Figure.: Generalized Model of Modulation Domain. Then modulation filtered broadband signal is obtained or reconstructed referred, as filterbank reconstruction by filterbank summation method, x (t) = K x k (t) (.) k= In the modulation domain system envelope detection method is very critical part. Magnitude or magnitude like methods are used in non-coherent envelope detection where carrier estimation methods are used in coherent envelope detection.. Non-Coherent Envelope Detection The two methods used in non-coherent envelope detection are based on Hilbert envelope (for real-valued subband) and the magnitude operator (for complexvalued subband). Each type is discussed below.. Hilbert Envelope Detection As from generalized framework of modulation system, based on a filterbank with the set of real-valued LTI bandpass filters h k (t). For subband x k (t), Hilbert envelope detector is magnitude of the analytic subband x + k (t), i.e., m k (t) = D H {x k (t)} (.5) = x + k (t) (.6) = x k (t) + jh{x k (t)} (.7)
13 Chapter. Modulation Domain 5 To obtain carrier from subband using Hilbert transform, it is defined by, c k (t) = D c H{x k (t)} (.8) = cos{arg[x + k (t)]} (.9) = cos{arg[x k (t) + jh{x k (t)}]} (.) where, D H and D c H are the envelope and carrier detection operators respectively. H is the Hilbert transform symbol. arg gives the complex valued number argument []... Magnitude Detection The magnitude detector D for an analytic subband signal x + k (t) is defined as, m k (t) = D x + k (t) (.) = x + k (t) (.) whereas, the carrier detector D c is defined as, c k (t) = D c x + k (t) (.3) = e jarg[x+ k (t)] (.)..3 Limitations of Non-coherent Envelope Detection There are some limitations in non-coherent envelope detection which are stated below []; the magnitude and phase spectrums of the subbands exceed the bandwidth of the subband signal. it assumes a conjugate symmetric spectrum of the modulator which is unrealistic for natural signals. modulator domain is not closed under convolution..3 Motivation for Complex Valued Modulator How can we overcome the limitations of non-coherent envelope detector? For this we have to revisit the basics, as or we can rewrite this in polar form, x(t) = m(t)c(t) (.5) a x (t)e jφx(t) = [a m (t)e jφm(t) ][a c (t)e jφc(t) ] (.6)
14 6 Chapter. Modulation Domain this allows us to decompose multiplicative amplitude and an additive phase, a x (t) = a m (t)a c (t) (.7) φ x (t) = φ m (t) + φ c (t) (.8) Main objective of this is to detect a m (t) and φ m (t) given a x (t) and φ x (t), which is very difficult to decompose in amplitude and phase without any additional restrictions. In non-coherent detection technique, this ambiguity is resolved by assuming that φ m (t) = and a c (t) = for all t, such that a m (t) = a x (t) and φ c (t) = φ x (t). But these assumptions causes problems as discussed above. We can improve the behavior of envelope detector if we set that φ m (t) for all t, which says that modulator is complex valued. By this assumption limitations in non-coherent technique can be solved. From original signal, carrier phase φ c (t) can be estimated by low pass filtering an estimate of the signal instantaneous frequency []. α c (t) = IF {φ x (t)} h lp (t). (.9) which can be defined as the derivative of the signal phase, IF {φ x (t)} = dφ x(t). (.) dt by integrating the instantaneous frequency signal α c (t) over time, φ c (t) = t α c (τ)dτ. (.) carrier signal is given by c(t) = e jφc(t). (.) so modulator signal m(t) using coherent demodulation, m(t) = x(t) c(t) = x(t) e jφc(t) = x(t)e jφc(t) (.3) Hence, this type of detectors and carrier estimators are called as coherent envelope detectors and coherent carrier estimators.. Coherent Envelope Detection.. Smoothed Hilbert Carrier Estimator An analytic subband can be presented in polar form as, x + k (t) = α k(t)e jφ k(t). The phase signal of subband φ k (t) can be written as [], φ k (t) = φ k,m t + φ k, + θ k (t) (.)
15 Chapter. Modulation Domain 7 appropriate values of φ k,m and φ k, should be selected so that φ k (t) has zero mean. Where, φ k,m is the average frequency of the subband x k (t). Smoothed carrier is as follows, φ k (t) = Ω k,m t + Ω k, + [θ k (t) h lp (t)] (.5) from the smoothed subband phase signal φ k (t), carrier can be estimated as [], and the modulator as, c k = D c shx k (t) (.6) = e jφ k(t) (.7) m k = D sh x k (t) (.8) = x + k (t)e j φ k (t) (.9).. Instantaneous Frequency Carrier Estimator The Instantaneous Frequency (IF) [3] is the modification of the differential FM detector []. Signal x(t), decomposes its subband into real and imaginary parts. The estimator derives an unnormalized IF estimate, Z k (m) = Z i,k (m) + jz q,k (m), (.3) A phase-only IF estimate α k (m) is obtained, ( ) Zk (m) α k (m) = Z k (m) α k (m ) Z k (m) > ɛ Z k (m) ɛ where, Z k (m) is unnormalized IF estimate for very small ɛ. This α k (m) smoothed IF is obtained from condition W k ( ) =, W k () = α k () or general recursive equation W k (m) = W k (m )α k (m). (.3) From estimate of phase W k (m) of subband X k (m), carrier estimated will be C k (m) = D c IF X k (m) (.3) = W k (m) (.33) So modulator can be found by the demodulation of the subband signal M k (m) = D IF X k (m) (.3) = X k (m)c k(m). (.35)
16 8 Chapter. Modulation Domain..3 Frequency Reassignment Carrier Estimator The IF estimator depends on the finite central phase difference as, α k (m) = expj [φ k(m + ) φ k (m )] (.36) IF estimator is linear approximation to the true subband instantaneous frequency. The accuracy of this approximation decreases with the increase in decimation factor. This can be avoided by using the frequency reassignment operator from time frequency reassignment [5]. This method overcomes the problems associated with IF method. This method operated on continues time moving window transform, can be expressed as X(τ, Ω) = x(t + τ)w(t)e jωt dt (.37) where, w(t) is analysis window. Signal x(t) can be reconstructed from x(t) = X(τ, Ω) = A(τ, Ω)e jψ(τ,ω) (.38) A(τ, Ω)h(τ t)e j[ψ(τ,ω) Ωτ+Ωt] dωdτ (.39) for maximum contribution of the reconstruction integral Ω should satisfy the condition, [ψ(τ, Ω) Ωτ + Ωt] = (.) τ or equivalently, ψ(τ, Ω) ˆΩ(τ, Ω) = (.) τ the reassignment frequency can be computed by, ˆΩ(τ, Ω) = Ω + Im X Dh(τ, Ω)X (τ, Ω) X(τ, Ω) (.) where, X(τ, Ω) is the short time Fourier transform of the signal x(t) by analysis window h(t) and X Dh (τ, Ω) by h D (t), which is time derivative as h D (t) = dh(t). dt So coherent carrier and modulator using frequency reassignment will be and C Ω (τ) = D c F RX Ω (τ) (.3) = exp[j τ ˆΩ(ζ, Ω)dζ] (.) M Ω (τ) = D F R X Ω (τ) (.5) = X Ω (τ)c Ω(τ) (.6)
17 Chapter. Modulation Domain 9.. Spectral Center of Gravity The COG is the time varying IF, which means that the IF at time t as the COG of a windowed segment centered at t []. Time varying IF can be calculated as, w k (t) = wx k(w, t)dw X k(w, t)dw (.7) where X k (w, t) = f(t τ)x(t)e jwt dt (.8) Hence, we can see that time varying IF is basically short time Fourier transform. We can calculate the phase of the carrier by φ k (t) = t w k (i) (.9) i= The carrier c k will be c k (t) = e jφ k(t) After estimating carrier it is easy to compute complex modulator m k (t) by, (.5) m k (t) = x k (t)c k(t) (.5) The filterbank is the main part of modulation domain. It is used in this thesis for different number of subbands. For instance, figure. shows the frequency and impulse response of the filterbank used. It has 3 subbands and down sampling factor of 6. To observe one of the subband spectrogram, female speech is used which is shown in figure.3. Spectrogram of the subband of speech signal is shown in figure.. This implementation is done through [6] and also used in [7].
18 Chapter. Modulation Domain db magnitude Subband filter frequency responses Filterbank impulse response (analysis + synthesis) db magnitude Frequency (Hz) Figure.: Overall Response of Filter Bank.. Female speech (fs=6khz) x Figure.3: Female Speech Signal.
19 Chapter. Modulation Domain Spectral COG Modulation Spectrum 3 5 Carrier number Modulation frequency (Hz) Figure.: Modulation Spectrogram of Female Speech.
20
21 Chapter 3 MultiChannel Speech Enhancement Researchers proposed many algorithms for speech enhancements. They can be divided into two parts: Single channel speech enhancement and multichannel (or microphone array) speech enhancement. 3. Single Channel Speech Enhancement This technique of speech enhancement is a method which performs processing on data available in a single channel provided by one microphone, i.e., M= x k [n] = T k τ= s k [n τ] + v k [n] (3.) Mostly speech enhancement algorithms are single channel processing. In these algorithms only one microphone is used so as easily embedded in devices such as telephones, mobile phones, computers, etc. So comparatively these algorithms have less computational complexity. Algorithms proposed for single channel speech enhancement are [8] : Short time spectrum based algorithm (spectral subtraction, improved spectral subtraction, Wiener filtering, etc.). Statistical model based algorithms (maximum likelihood estimator, the minimum mean square error estimator, posteriori estimators). Hearing model based algorithm. Speech generation model based algorithm. Subspace algorithm. 3
22 Chapter 3. MultiChannel Speech Enhancement Wavelet algorithm. Single channel speech separation algorithms. 3. Microphone Array Speech Enhancement Microphone array techniques are favorable than single channel when it turns to the very low SNR. Methods used in multi microphone are spatiotemporal filtering or beamforming. The need for movable microphone or telephone is eliminated by microphone beamforming. Frequency domain representation of input- output relation for beamforming FIR filters w m [k] [n] is [9], y [k] [n] = M m= L k l= w [k] m [l] x [k] m [n l] (3.) where, x [k] m [n l] is the received microphone signal for a set of I spatial sources with an additive noise in frequency domain. w m [k] [n] is the frequency representation of w m [t]. Frequency domain beamforming representation is shown in figure below: Filter coefficients w k can be designed in many different ways for different algorithms. Algorithms used for microphone array are: Delay and Sum (DAS). Linear Constraint Minimum Variance (LCMV). Adaptive Noise Canceling (ANC). Post filtering. Generalized Sidelobe Canceling (GSC). Blind Source Separation (BSS). Subband processing. In microphone array noise is suppressed and speech is enhanced through beamforming. The simplest algorithm for beamforming is Delay and Sum (DAS) beamforming. But it has very low efficiency, that under the ideal conditions to get db enhancement, must use at least array of microphones. The Linear Constrained Minimum Variance (LCMV) algorithm, uses present signal and also the delayed samples for constructing the beamforming, which may give better enhanced speech than DAS algorithm. The Adaptive Noise Canceling (ANC) is used to cancel highly correlated noise. It is used for many kind of noises also with less complexity. but the problem with
23 Chapter 3. MultiChannel Speech Enhancement 5 this algorithm is if the speech signal leaked into its referential channel, the speech will also cancel and quality may degrade. Post filtering algorithm uses Wiener filter to suppress noise for the enhancement of speech by the DAS algorithm. Furthermore Wiener filter coefficients processed through multichannel noisy speech signal. The Generalized Sidelobe Canceling (GSC) is a form of LCMV algorithm. It is very important algorithm for speech enhancement using microphone array. Fixed beamformer used to suppress uncorrelated noise, its adaptive noise canceler with blocking matrix may cancel the correlated noise. So GSC may suppress both correlated and uncorrelated noises. This makes it more practical. The drawback with GSC is the blocking matrix, that can t completely block the speech signal which make partial cancelation of the speech in the enhanced speech. Improve GSC algorithm have been introduced to overcome this drawback. The Blind Sources Separation (BSS) works in the criteria that speech and noise are independent. It is very difficult to find separation matrix when elements are time variable in the mixed matrix in BSS. In Subband processing algorithms noisy speech signals are divided into group of subbands. So every subband falls in narrow frequency band. Hence, cancelation of noise is performed with shorter length of the filter. It reduces the complexity of algorithm and speech is enhanced. 3.. Multichannel Wiener Filtering in Time Domain Wiener filtering is most widely used technique for the enhancement of speech. In multichannel, as shown in figure 3. [9], broadband noisy speech signal is divided into multiple subbands through filterbank called analysis. Each subband then filtered from Wiener filter for enhancement. All the outputs from Wiener filter recombined through inverse filterbank are called synthesis. In order to formulate Wiener filter, it is required to have a desired signal d [k] [n]. We need to have autocorrelation matrix and cross correlation vector, which can be calculated as, R [k] x = E{x [k] [n]x [k]h [n]} (3.3) r [k] dx = E{d[k] [n]x [k] [n]} (3.) Practically, these are calculated according to the array model and sources. Wiener Hopf can be calculated as [], R x [k] w k = r [k] dx (3.5) w [k] = R x [k] r [k] dx (3.6) Error signal can be calculated from the difference between desired signal and the beamforming output signal as, e[n] = d [k] [n] w [k]h x [k] [n] (3.7)
24 6 Chapter 3. MultiChannel Speech Enhancement Figure 3.: Generalized Multichannel. 3.. Multichannel Wiener Filtering in Modulation Domain In this thesis main focus is on implementing Wiener filter in modulation domain for speech enhancement. Same methodology is implemented in modulation domain with a small difference. As discussed in previous section broadband signal is divided by a filter bank of K bandpass filters into K subbands in analysis part. x k (n) = h k x(n) (3.8) where, x(n) is the broadband noisy speech signal that is divided into K sub-bands x k (n) and h k is the impulse response of sub-band filter of filter bank. This can be shown in time domain as x(n) = K x k (n) = k= K d k (n) + v k (n) (3.9) k= where, d k (n) and v k (n) are the desired speech and noise signal of the every k th sub-band. Now from every subband COG is used to estimate the carrier and the modulator signal. Only modulators are considered here for reducing noise by Wiener filtering. So Wiener filtering is applied to every modulator to enhance
25 Chapter 3. MultiChannel Speech Enhancement 7 the speech and then multiplied with the same carrier estimated as, m k (n) = w k x k (n) (3.) Y k (n) = m k (n) c k (n) (3.) where, m k (n) is the modulator signal resulted from the k th sub-band Wiener filter w k, which is then multiplied with the carrier c k (n) to get Y k (n) for reconstruction. Improved speech signal output y(n) is then obtained from the inverse filter-bank called as synthesis part. The modulation system for multi-channel looks like figure 3., Figure 3.: Generalized Multichannel in Modulation Domain. Hence, it can be observed from the figure 3. and figure 3., that the only difference between two domain is to split every subband into modulator and carrier which is not the case in simple existing multichannel technique.
26
27 Chapter Evaluation. Consideration In this thesis, the model implemented for modulation domain is shown in figure.. It is considered to have microphone array of three microphones placed in a room. The room dimension and parameters are taken by using image method []. I, here used vertical array of three microphones placed with a small difference to overcome aliasing. In figure. d(t) is the clean speech signal and v(t) is the noise signal. d,d,d3 are the speeches on microphones and v,v,v3 are the noise on three microphones respectively. x (t) is the sum of speech and noise scaled with the required Signal to Noise Ratio (SNR) using d (t)+ SNR v (t) for microphone and so on. M d,c d,m x,c x,m v,c v are the modulator and carrier signals of d (t),x (t) and v (t) respectively for microphone and so on. Wiener filter is applied to the modulators and then after remodulation output signals y d,y x, y v generated through inverse filterbank. Same scenario is used for implementing multichannel Wiener filtering without demodulation filterbank. This technique is already implemented previously by many researchers. It is implemented in this thesis for the comparison of performance of method in both domains. The speech signal used in this thesis is one female and one male speech at sampling frequency of 6KHz. Both speeches are tested using three kind of noises, and scaled to have -5dB,dB,5dB,dB,5dB and db SNR. The results are taken by varying the number of subbands. Number of subbands used are,,8,6 and 3. The performance is measured through the Signal to Noise Ratio Improvement(SNRI), Perceptual Evaluation of Speech Quality(PESQ) and spectrograms are shown to evaluate the results. 9
28 Chapter. Evaluation. Results Figure.: Modulation Domain Model... Results without Modulation Domain It is noticed that SNRI value is around 7dB for all the cases used for both the female and male speeches without modulation domain. It is giving clear sound while listening but with a very small background noise. MOS value calculated through PESQ for this is estimated to. for all cases. This MOS value is considered as good for speech. Results obtained in thesis by considering same scenario for both the domains. In section. obtained plots of SNRI and MOS for both the female and male speech are shown. For illustration results achieved only for SNR -5dB,dB and 5dB. Figure. to figure. shows SNRI plots for female speech while figure.5 to figure.7 are the SNRI plots for male speech. As discussed MOS value is calculated also. MOS value plots for female speech are shown from figure.8 to figure. and MOS value results are presented in figure. to figure.3.
29 Chapter. Evaluation 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at -5dB SNR. 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.3: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at db SNR.
30 Chapter. Evaluation 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at 5dB SNR. 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.5: Signal to Noise Ratio Improvement Using male Speech Signal with Noise at -5dB SNR.
31 Chapter. Evaluation 3 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.6: Signal to Noise Ratio Improvement Using male Speech Signal with Noise at db SNR. 8 Signal to Noise Ratio Improvement 6 SNR Improvement [db] Figure.7: Signal to Noise Ratio Improvement Using male Speech Signal with Noise at 5dB SNR.
32 Chapter. Evaluation.5 MOS Score through PESQ 3.5 MOS Score Figure.8: MOS Score by PESQ of female Speech for -5dB SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.9: MOS Score by PESQ of female Speech for db SNR.
33 Chapter. Evaluation 5.5 MOS Score through PESQ 3.5 MOS Score Figure.: MOS Score by PESQ of female Speech for 5dB SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.: MOS Score by PESQ of male Speech for -5dB SNR.
34 6 Chapter. Evaluation.5 MOS Score through PESQ 3.5 MOS Score Figure.: MOS Score by PESQ of male Speech for db SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.3: MOS Score by PESQ of male Speech for 5dB SNR.
35 Chapter. Evaluation 7.. Results with Modulation Domain The main emphasis of this thesis is on implementation of multichannel Wiener filtering in modulation domain. For modulation domain, if we consider SNRI first it is observed that the SNRI is almost the same while using different number of subbands. SNRI for male speech is around 5-8dB for almost all engine, factory and noises used, is shown in fig. to fig.5. It is calculated for -5dB,dB, 5dB, db, 5dB and db SNR. It is very small variation in values between different noises. SNRI for female speech is shown in figure. to figure.9 for SNR -5dB, db, db, 5dB and db respectively. It can be seen form figures that SNRI varies for female speech is 6-9dB for all engine, factory and noises. SNRI for both male and female speeches are shown in figures for different number of subbands. PESQ is taken by comparison of d(t) and y s (t) and gives the results how can be the enhanced speech is graded. The Mean Opinion Score (MOS) as calculated by PESQ is around to. for most of tests for both speeches with different number of subbands. This score of MOS is considered to be a very good score for speech signals. For consideration only, MOS value plots are only shown at -5dB, db, 5dB, db and 5dB SNR for both male and female speeches. It is shown from figure.6 to figure.35 that MOS score varies for different number of subbands. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at -5dB SNR.
36 8 Chapter. Evaluation Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.5: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at db SNR. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.6: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at 5dB SNR.
37 Chapter. Evaluation 9 Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.7: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at db SNR. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.8: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at 5dB SNR.
38 3 Chapter. Evaluation Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.9: Signal to Noise Ratio Improvement Using Female Speech Signal with Noise at db SNR. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at -5dB SNR.
39 Chapter. Evaluation 3 Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at db SNR. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at 5dB SNR.
40 3 Chapter. Evaluation Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.3: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at db SNR. Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at 5dB SNR.
41 Chapter. Evaluation 33 Signal to Noise Ratio Improvement 8 6 SNR Improvement [db] Figure.5: Signal to Noise Ratio Improvement Using Male Speech Signal with Noise at db SNR..5 MOS Score through PESQ MOS Score Figure.6: MOS Score by PESQ of Female Speech for -5dB SNR.
42 3 Chapter. Evaluation.5 MOS Score through PESQ 3.5 MOS Score Figure.7: MOS Score by PESQ of Female Speech for db SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.8: MOS Score by PESQ of Female Speech for 5dB SNR.
43 Chapter. Evaluation 35.5 MOS Score through PESQ 3.5 MOS Score Figure.9: MOS Score by PESQ of Female Speech for db SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.3: MOS Score by PESQ of Female Speech for 5dB SNR.
44 36 Chapter. Evaluation.5 MOS Score through PESQ 3.5 MOS Score Figure.3: MOS Score by PESQ of Male Speech for -5dB SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.3: MOS Score by PESQ of Male Speech for db SNR.
45 Chapter. Evaluation 37.5 MOS Score through PESQ 3.5 MOS Score Figure.33: MOS Score by PESQ of Male Speech for 5dB SNR..5 MOS Score through PESQ 3.5 MOS Score Figure.3: MOS Score by PESQ of Male Speech for db SNR.
46 38 Chapter. Evaluation.5 MOS Score through PESQ 3.5 MOS Score Figure.35: MOS Score by PESQ of Male Speech for 5dB SNR.
47 Chapter. Evaluation 39.3 Spectrogram Analysis for Modulation Domain In this section spectrum analysis is represented. Effect of speech enhancement is easy to observe in spectrograms which are presented in this section. The spectrogram of clean female speech signal is represented in figure.36. The spectrogram analysis is shown only by taking 3 number of subband with 5dB and -5dB SNR. Figure.37 and figure.38 shows the enhanced female speech for -5dB and 5dB SNR. It can be observed from enhanced spectrograms that noise is removed from the speech while formants remained. Though noise spread around speech but after enhancement it can be listened clearly with very small lost in energy of speech. From figure.39 to figure. represents the same female speech for -5dB and 5dB SNR but mixed with the factory noise. It can be seen from the enhanced speech figures that formants of the speech are very clear than that for noise. Figure. to figure. shows the female speech mixed with engine noise for -5dB and 5dB SNR. Enhanced speech spectrogram shows clear formants for -5dB SNR and 5dB SNR. It offers very good improvement in speech for engine noise. Also same noises behavior is tested for male speech. From figure.3 to figure. represents enhanced male speech for noise. It shows little amount of information is lost same as for female speech. For factory noise mixture it gives better result than noise. It can be seen from figure.5 to figure.6 that formants in enhanced speech spectrograms are very much clear for factory noise. The male speech test for engine noise is shown from figure.7 to figure.8. From enhanced speech spectrograms one can find formants clear and with no loss of speech energy. Hence, all the tested spectrograms suggests that multichannel Wiener filtering is providing effective results in modulation domain also. It has valuable implementation in this domain.
48 Chapter. Evaluation Orginal Speech Signal Frequency Time x Figure.36: Clean Female Speech Enhanced Speech Through Modulation Domain Frequency Time x Figure.37: Processed Female Speech for -5dB SNR with Noise.
49 Chapter. Evaluation Enhanced Speech Through Modulation Domain Frequency Time x Figure.38: Processed Female Speech for 5dB SNR with Noise. Enhanced Speech Through Modulation Domain Frequency Time x Figure.39: Processed Female Speech for -5dB SNR with Noise.
50 Chapter. Evaluation Enhanced Speech Through Modulation Domain Frequency Time Figure.: Processed Female Speech for 5dB SNR with Noise. Enhanced Speech Through Modulation Domain Frequency Time x Figure.: Processed Female Speech for -5dB SNR with Noise.
51 Chapter. Evaluation 3 Enhanced Speech Through Modulation Domain Frequency Time x Figure.: Processed Female Speech for 5dB SNR with Noise.
52 Chapter. Evaluation Enhanced Speech Through Modulation Domain Frequency Time Figure.3: Processed Male Speech for -5dB SNR with Noise. Enhanced Speech Through Modulation Domain Frequency Time Figure.: Processed Male Speech for 5dB SNR with Noise.
53 Chapter. Evaluation 5 Enhanced Speech Through Modulation Domain Frequency Time Figure.5: Processed Male Speech for -5dB SNR with Noise. Enhanced Speech Through Modulation Domain Frequency Time Figure.6: Processed Male Speech for 5dB SNR with Noise.
54 6 Chapter. Evaluation Enhanced Speech Through Modulation Domain Frequency Time Figure.7: Processed Male Speech for -5dB SNR with Noise. Enhanced Speech Through Modulation Domain Frequency Time Figure.8: Processed Male Speech for 5dB SNR with Noise.
55 Chapter 5 Conclusion Multichannel microphone using Wiener filtering is implemented both in the time and modulation domain for the purpose of enhancement in speech. Multichannel already has advantages over single channel speech enhancement algorithms. In this thesis it is compared with the modulation domain. Complete analysis is given in the thesis about implementation in each domain. Three different noises used for the testing of this method. After observing the results it is concluded that multichannel implemented in modulation domain gives better result while listening enhanced speech. While from MOS and SNRI values, it gives us the observation that results are almost same for both the domains. So this method is successfully implemented and validated to achieve the results. Plots and spectrograms shown in thesis gives the better view of results. The problem observed in the results is that a very little background noise is present which requires to be improved. Though this noise is not effecting much the intelligibility of speech but can be improved. This can lead to future work of this thesis. It can also be compared with other enhancement schemes implemented in modulation domain for future work. 7
56
57 Bibliography [] J. R. Deller, J. G. Proakis, and J. H. L. Dudgeon, Discrete-Time Processing of Speech Signals. Macmillan, 993. [] J. Yang, Frequency domain noise suppression approaches in mobile telephone systems, IEEE International Conference on Acoustics, Speech and Signal Processing, vol.. [3] P. Vary, Noise suppression by spectral magnitude estimation mechanism and theoretical limits, Elsevier Signal Processing, vol. 8. [] S. Boll, Suppression of acoustical noise in speech using spectral subtraction, IEEE Transactions on Accoustics, Speech and Signal Processing, vol. 7. [5] D. Johnsonn and D. Dugeon, Array Signal Processing - Concepts and Techniques. Pentice Hall, 993. [6] Y. Kaneda and J. Ohga, Adaptive microphone array system for noise reduction, IEEE Transaction Conference on Acoustics, Speech and Signal Processing, vol. 3, no. 6, pp. 39, 986. [7] Y. Grenier and M.Xu, An adaptive array for speech input in cars, Proceedings of International Symposium of Automative Technology and Automation. [8] N. Grbic, Speech signal extraction- a multichannel approach. [9] N. Grbic, M. Dahl, and I. Claesson, Neural network based adaptive microphone array system for speech enhancement, IEEE World Congress on Computational Intelligence, no. ISBN X, May 998. [] P. Clark and L. E. Atlas, Time-frequency coherent modulation filtering of nonstationary signals, IEEE Transaction on Signal Processing, vol. 57, no., Nov. 9. [] S. M. Schimmel, Theory of modulation frequency analysis and modulation filtering, with applications to hearing devices, Ph.D. dissertation, 7. 9
58 5 Bibliography [] Q. Li and L. Atlas, Coherent modulation filtering for speech, IEEE,Acoustics, Speech and Signal Processing, ICASSP, pp. 8 8, 8. [3] L. Atlas and C. Janssen, Coherent modulation spectral filtering for single channel music source seperation, IEEE-International Conference ICASSP. [] J. Glas, A differential rm detector for low-if radios, IEEE-Vechicular Technology Conference VTC. [5] S. Schimmel, K. Fitz, and L. Atlas, Frequency reassignment for coherent modulation filtering, IEEE,Acoustics, Speech and Signal Processing, ICASSP, vol. 5, pp. V V, 6. [6] L. Atlas, P. Clark, and S. Schimmel, Modulation toolbox for matlab, [7] R. Ishaq, Adaptive gain equalizer and modulation frequency domain for noise reduction, Master s thesis,. [8] Z. Yermeche, Subband beamforming for speech enhancement in hands-free communication, Ph.D. dissertation,. [9] B. Sallberg, Applied methods for blind speech enhancement, Ph.D. dissertation, Sweden, 8. [] S. Haykin, Adaptive Filter Theory. John Wiley and Sons,. [] J. B. and D. A. Berkley, Image method for efficiently simulating small-room acoustics, Acoustics Research Department, Bell Laboratories Murray Hill, New Jersey 797, vol. 65, April 979.
Modulator Domain Adaptive Gain Equalizer for Speech Enhancement
Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Ravindra d. Dhage, Prof. Pravinkumar R.Badadapure Abstract M.E Scholar, Professor. This paper presents a speech enhancement method for personal
More informationModulation Domain Improved Adaptive Gain Equalizer for Single Channel Speech Enhancement
Master Thesis Electrical Engineering Modulation Domain Improved Adaptive Gain Equalizer for Single Channel Speech Enhancement ADITHYA VALLI NETTEM SHAKIRA SHAHEEN This thesis is presented as part of Degree
More informationChapter 4 SPEECH ENHANCEMENT
44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or
More informationAcoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface
MEE-2010-2012 Acoustic Beamforming for Hearing Aids Using Multi Microphone Array by Designing Graphical User Interface Master s Thesis S S V SUMANTH KOTTA BULLI KOTESWARARAO KOMMINENI This thesis is presented
More informationComparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement
Comparison of LMS and NLMS algorithm with the using of 4 Linear Microphone Array for Speech Enhancement Mamun Ahmed, Nasimul Hyder Maruf Bhuyan Abstract In this paper, we have presented the design, implementation
More informationDifferent Approaches of Spectral Subtraction Method for Speech Enhancement
ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches
More informationRecent Advances in Acoustic Signal Extraction and Dereverberation
Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing
More informationIntegrated Speech Enhancement Technique for Hands-Free Mobile Phones
Master Thesis Electrical Engineering August 2012 Integrated Speech Enhancement Technique for Hands-Free Mobile Phones ANEESH KALUVA School of Engineering Department of Electrical Engineering Blekinge Institute
More informationThe Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals Maria G. Jafari and Mark D. Plumbley Centre for Digital Music, Queen Mary University of London, UK maria.jafari@elec.qmul.ac.uk,
More informationHUMAN speech is frequently encountered in several
1948 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 7, SEPTEMBER 2012 Enhancement of Single-Channel Periodic Signals in the Time-Domain Jesper Rindom Jensen, Student Member,
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationMMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2
MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 1 Electronics and Communication Department, Parul institute of engineering and technology, Vadodara,
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationDesign and Implementation on a Sub-band based Acoustic Echo Cancellation Approach
Vol., No. 6, 0 Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach Zhixin Chen ILX Lightwave Corporation Bozeman, Montana, USA chen.zhixin.mt@gmail.com Abstract This paper
More informationROBUST echo cancellation requires a method for adjusting
1030 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 3, MARCH 2007 On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk Jean-Marc Valin, Member,
More informationREAL-TIME BROADBAND NOISE REDUCTION
REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationSpeech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech
Speech Enhancement: Reduction of Additive Noise in the Digital Processing of Speech Project Proposal Avner Halevy Department of Mathematics University of Maryland, College Park ahalevy at math.umd.edu
More informationBiomedical Signals. Signals and Images in Medicine Dr Nabeel Anwar
Biomedical Signals Signals and Images in Medicine Dr Nabeel Anwar Noise Removal: Time Domain Techniques 1. Synchronized Averaging (covered in lecture 1) 2. Moving Average Filters (today s topic) 3. Derivative
More informationProceedings of the 5th WSEAS Int. Conf. on SIGNAL, SPEECH and IMAGE PROCESSING, Corfu, Greece, August 17-19, 2005 (pp17-21)
Ambiguity Function Computation Using Over-Sampled DFT Filter Banks ENNETH P. BENTZ The Aerospace Corporation 5049 Conference Center Dr. Chantilly, VA, USA 90245-469 Abstract: - This paper will demonstrate
More informationChapter 2: Signal Representation
Chapter 2: Signal Representation Aveek Dutta Assistant Professor Department of Electrical and Computer Engineering University at Albany Spring 2018 Images and equations adopted from: Digital Communications
More informationSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure
More informationMODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS
MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS 1 S.PRASANNA VENKATESH, 2 NITIN NARAYAN, 3 K.SAILESH BHARATHWAAJ, 4 M.P.ACTLIN JEEVA, 5 P.VIJAYALAKSHMI 1,2,3,4,5 SSN College of Engineering,
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationSpeech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 4 April 2015, Page No. 11143-11147 Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya
More informationarxiv: v1 [cs.it] 9 Mar 2016
A Novel Design of Linear Phase Non-uniform Digital Filter Banks arxiv:163.78v1 [cs.it] 9 Mar 16 Sakthivel V, Elizabeth Elias Department of Electronics and Communication Engineering, National Institute
More informationBroadband Microphone Arrays for Speech Acquisition
Broadband Microphone Arrays for Speech Acquisition Darren B. Ward Acoustics and Speech Research Dept. Bell Labs, Lucent Technologies Murray Hill, NJ 07974, USA Robert C. Williamson Dept. of Engineering,
More informationModulation Spectral Filtering: A New Tool for Acoustic Signal Analysis
Modulation Spectral Filtering: A New Tool for Acoustic Signal Analysis Prof. Les Atlas Department of Electrical Engineering University of Washington Special thans to, Qin Li, Jon Cutter, and Steve Schimmel,
More informationMINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE
MINUET: MUSICAL INTERFERENCE UNMIXING ESTIMATION TECHNIQUE Scott Rickard, Conor Fearon University College Dublin, Dublin, Ireland {scott.rickard,conor.fearon}@ee.ucd.ie Radu Balan, Justinian Rosca Siemens
More informationDual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation
Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation Gal Reuven Under supervision of Sharon Gannot 1 and Israel Cohen 2 1 School of Engineering, Bar-Ilan University,
More informationMichael Brandstein Darren Ward (Eds.) Microphone Arrays. Signal Processing Techniques and Applications. With 149 Figures. Springer
Michael Brandstein Darren Ward (Eds.) Microphone Arrays Signal Processing Techniques and Applications With 149 Figures Springer Contents Part I. Speech Enhancement 1 Constant Directivity Beamforming Darren
More informationScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech
More informationSPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS
17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS Jürgen Freudenberger, Sebastian Stenzel, Benjamin Venditti
More informationA Three-Microphone Adaptive Noise Canceller for Minimizing Reverberation and Signal Distortion
American Journal of Applied Sciences 5 (4): 30-37, 008 ISSN 1546-939 008 Science Publications A Three-Microphone Adaptive Noise Canceller for Minimizing Reverberation and Signal Distortion Zayed M. Ramadan
More informationAudio Restoration Based on DSP Tools
Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract
More informationESE531 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing
University of Pennsylvania Department of Electrical and System Engineering Digital Signal Processing ESE531, Spring 2017 Final Project: Audio Equalization Wednesday, Apr. 5 Due: Tuesday, April 25th, 11:59pm
More informationStudy Of Sound Source Localization Using Music Method In Real Acoustic Environment
International Journal of Electronics Engineering Research. ISSN 975-645 Volume 9, Number 4 (27) pp. 545-556 Research India Publications http://www.ripublication.com Study Of Sound Source Localization Using
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationEmanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas
Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor Presented by Amir Kiperwas 1 M-element microphone array One desired source One undesired source Ambient noise field Signals: Broadband Mutually
More informationLOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION. Hans Knutsson Carl-Fredrik Westin Gösta Granlund
LOCAL MULTISCALE FREQUENCY AND BANDWIDTH ESTIMATION Hans Knutsson Carl-Fredri Westin Gösta Granlund Department of Electrical Engineering, Computer Vision Laboratory Linöping University, S-58 83 Linöping,
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationBlind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model
Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial
More informationDigital Signal Processing
Digital Signal Processing Fourth Edition John G. Proakis Department of Electrical and Computer Engineering Northeastern University Boston, Massachusetts Dimitris G. Manolakis MIT Lincoln Laboratory Lexington,
More informationLocalization of underwater moving sound source based on time delay estimation using hydrophone array
Journal of Physics: Conference Series PAPER OPEN ACCESS Localization of underwater moving sound source based on time delay estimation using hydrophone array To cite this article: S. A. Rahman et al 2016
More informationCHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS
46 CHAPTER 3 SPEECH ENHANCEMENT ALGORITHMS 3.1 INTRODUCTION Personal communication of today is impaired by nearly ubiquitous noise. Speech communication becomes difficult under these conditions; speech
More informationSpeech and Audio Processing Recognition and Audio Effects Part 3: Beamforming
Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Engineering
More informationAutomotive three-microphone voice activity detector and noise-canceller
Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationMultiple Sound Sources Localization Using Energetic Analysis Method
VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova
More informationRobust Low-Resource Sound Localization in Correlated Noise
INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem
More informationNoise Reduction for L-3 Nautronix Receivers
Noise Reduction for L-3 Nautronix Receivers Jessica Manea School of Electrical, Electronic and Computer Engineering, University of Western Australia Roberto Togneri School of Electrical, Electronic and
More informationDSP First. Laboratory Exercise #7. Everyday Sinusoidal Signals
DSP First Laboratory Exercise #7 Everyday Sinusoidal Signals This lab introduces two practical applications where sinusoidal signals are used to transmit information: a touch-tone dialer and amplitude
More informationFrequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement
Frequency Domain Analysis for Noise Suppression Using Spectral Processing Methods for Degraded Speech Signal in Speech Enhancement 1 Zeeshan Hashmi Khateeb, 2 Gopalaiah 1,2 Department of Instrumentation
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationPerceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter
Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School
More informationCalibration of Microphone Arrays for Improved Speech Recognition
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Calibration of Microphone Arrays for Improved Speech Recognition Michael L. Seltzer, Bhiksha Raj TR-2001-43 December 2001 Abstract We present
More informationOutline. Communications Engineering 1
Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband channels Signal space representation Optimal
More informationA Computational Efficient Method for Assuring Full Duplex Feeling in Hands-free Communication
A Computational Efficient Method for Assuring Full Duplex Feeling in Hands-free Communication FREDRIC LINDSTRÖM 1, MATTIAS DAHL, INGVAR CLAESSON Department of Signal Processing Blekinge Institute of Technology
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationSpeech, music, images, and video are examples of analog signals. Each of these signals is characterized by its bandwidth, dynamic range, and the
Speech, music, images, and video are examples of analog signals. Each of these signals is characterized by its bandwidth, dynamic range, and the nature of the signal. For instance, in the case of audio
More informationA Review on Beamforming Techniques in Wireless Communication
A Review on Beamforming Techniques in Wireless Communication Hemant Kumar Vijayvergia 1, Garima Saini 2 1Assistant Professor, ECE, Govt. Mahila Engineering College Ajmer, Rajasthan, India 2Assistant Professor,
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationDEMODULATION divides a signal into its modulator
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 8, NOVEMBER 2010 2051 Solving Demodulation as an Optimization Problem Gregory Sell and Malcolm Slaney, Fellow, IEEE Abstract We
More informationWAVELET OFDM WAVELET OFDM
EE678 WAVELETS APPLICATION ASSIGNMENT WAVELET OFDM GROUP MEMBERS RISHABH KASLIWAL rishkas@ee.iitb.ac.in 02D07001 NACHIKET KALE nachiket@ee.iitb.ac.in 02D07002 PIYUSH NAHAR nahar@ee.iitb.ac.in 02D07007
More informationELT Receiver Architectures and Signal Processing Fall Mandatory homework exercises
ELT-44006 Receiver Architectures and Signal Processing Fall 2014 1 Mandatory homework exercises - Individual solutions to be returned to Markku Renfors by email or in paper format. - Solutions are expected
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationRECENTLY, there has been an increasing interest in noisy
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In
More informationReal-time Adaptive Concepts in Acoustics
Real-time Adaptive Concepts in Acoustics Real-time Adaptive Concepts in Acoustics Blind Signal Separation and Multichannel Echo Cancellation by Daniel W.E. Schobben, Ph. D. Philips Research Laboratories
More informationAdaptive Noise Reduction Algorithm for Speech Enhancement
Adaptive Noise Reduction Algorithm for Speech Enhancement M. Kalamani, S. Valarmathy, M. Krishnamoorthi Abstract In this paper, Least Mean Square (LMS) adaptive noise reduction algorithm is proposed to
More informationEE3723 : Digital Communications
EE3723 : Digital Communications Week 8-9: Bandpass Modulation MPSK MASK, OOK MFSK 04-May-15 Muhammad Ali Jinnah University, Islamabad - Digital Communications - EE3723 1 In-phase and Quadrature (I&Q) Representation
More informationDEMODULATION divides a signal into its modulator
Solving Demodulation as an Optimization Problem Gregory Sell and Malcolm Slaney, Fellow, IEEE Abstract We introduce two new methods for the demodulation of acoustic signals by posing the problem in a convex
More informationANALOGUE TRANSMISSION OVER FADING CHANNELS
J.P. Linnartz EECS 290i handouts Spring 1993 ANALOGUE TRANSMISSION OVER FADING CHANNELS Amplitude modulation Various methods exist to transmit a baseband message m(t) using an RF carrier signal c(t) =
More informationEnhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients
ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds
More informationADAPTIVE ANTENNAS. TYPES OF BEAMFORMING
ADAPTIVE ANTENNAS TYPES OF BEAMFORMING 1 1- Outlines This chapter will introduce : Essential terminologies for beamforming; BF Demonstrating the function of the complex weights and how the phase and amplitude
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationTheory of Telecommunications Networks
Theory of Telecommunications Networks Anton Čižmár Ján Papaj Department of electronics and multimedia telecommunications CONTENTS Preface... 5 1 Introduction... 6 1.1 Mathematical models for communication
More informationReducing comb filtering on different musical instruments using time delay estimation
Reducing comb filtering on different musical instruments using time delay estimation Alice Clifford and Josh Reiss Queen Mary, University of London alice.clifford@eecs.qmul.ac.uk Abstract Comb filtering
More informationECE5713 : Advanced Digital Communications
ECE5713 : Advanced Digital Communications Bandpass Modulation MPSK MASK, OOK MFSK 04-May-15 Advanced Digital Communications, Spring-2015, Week-8 1 In-phase and Quadrature (I&Q) Representation Any bandpass
More informationDownloaded from 1
VII SEMESTER FINAL EXAMINATION-2004 Attempt ALL questions. Q. [1] How does Digital communication System differ from Analog systems? Draw functional block diagram of DCS and explain the significance of
More informationProblems from the 3 rd edition
(2.1-1) Find the energies of the signals: a) sin t, 0 t π b) sin t, 0 t π c) 2 sin t, 0 t π d) sin (t-2π), 2π t 4π Problems from the 3 rd edition Comment on the effect on energy of sign change, time shifting
More informationCG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003
CG40 Advanced Dr Stuart Lawson Room A330 Tel: 23780 e-mail: ssl@eng.warwick.ac.uk 03 January 2003 Lecture : Overview INTRODUCTION What is a signal? An information-bearing quantity. Examples of -D and 2-D
More informationComparison of Multirate two-channel Quadrature Mirror Filter Bank with FIR Filters Based Multiband Dynamic Range Control for audio
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 3, Ver. IV (May - Jun. 2014), PP 19-24 Comparison of Multirate two-channel Quadrature
More informationEC 6501 DIGITAL COMMUNICATION UNIT - II PART A
EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing
More informationCepstrum alanysis of speech signals
Cepstrum alanysis of speech signals ELEC-E5520 Speech and language processing methods Spring 2016 Mikko Kurimo 1 /48 Contents Literature and other material Idea and history of cepstrum Cepstrum and LP
More informationUniversity of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005
University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 Lecture 5 Slides Jan 26 th, 2005 Outline of Today s Lecture Announcements Filter-bank analysis
More informationUniversity Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco
Research Journal of Applied Sciences, Engineering and Technology 8(9): 1132-1138, 2014 DOI:10.19026/raset.8.1077 ISSN: 2040-7459; e-issn: 2040-7467 2014 Maxwell Scientific Publication Corp. Submitted:
More informationCommunication Channels
Communication Channels wires (PCB trace or conductor on IC) optical fiber (attenuation 4dB/km) broadcast TV (50 kw transmit) voice telephone line (under -9 dbm or 110 µw) walkie-talkie: 500 mw, 467 MHz
More informationSAMPLING THEORY. Representing continuous signals with discrete numbers
SAMPLING THEORY Representing continuous signals with discrete numbers Roger B. Dannenberg Professor of Computer Science, Art, and Music Carnegie Mellon University ICM Week 3 Copyright 2002-2013 by Roger
More informationEffects of Fading Channels on OFDM
IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719, Volume 2, Issue 9 (September 2012), PP 116-121 Effects of Fading Channels on OFDM Ahmed Alshammari, Saleh Albdran, and Dr. Mohammad
More informationSpeech Enhancement for Nonstationary Noise Environments
Signal & Image Processing : An International Journal (SIPIJ) Vol., No.4, December Speech Enhancement for Nonstationary Noise Environments Sandhya Hawaldar and Manasi Dixit Department of Electronics, KIT
More informationECE438 - Laboratory 7a: Digital Filter Design (Week 1) By Prof. Charles Bouman and Prof. Mireille Boutin Fall 2015
Purdue University: ECE438 - Digital Signal Processing with Applications 1 ECE438 - Laboratory 7a: Digital Filter Design (Week 1) By Prof. Charles Bouman and Prof. Mireille Boutin Fall 2015 1 Introduction
More informationTopic 2. Signal Processing Review. (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music)
Topic 2 Signal Processing Review (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music) Recording Sound Mechanical Vibration Pressure Waves Motion->Voltage Transducer
More informationAdaptive STFT-like Time-Frequency analysis from arbitrary distributed signal samples
Adaptive STFT-like Time-Frequency analysis from arbitrary distributed signal samples Modris Greitāns Institute of Electronics and Computer Science, University of Latvia, Latvia E-mail: modris greitans@edi.lv
More informationOnline Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation
1 Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation Zhangli Chen* and Volker Hohmann Abstract This paper describes an online algorithm for enhancing monaural
More informationADSP ADSP ADSP ADSP. Advanced Digital Signal Processing (18-792) Spring Fall Semester, Department of Electrical and Computer Engineering
ADSP ADSP ADSP ADSP Advanced Digital Signal Processing (18-792) Spring Fall Semester, 201 2012 Department of Electrical and Computer Engineering PROBLEM SET 5 Issued: 9/27/18 Due: 10/3/18 Reminder: Quiz
More information(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
Tools and Applications Chapter Intended Learning Outcomes: (i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
More informationFaculty of science, Ibn Tofail Kenitra University, Morocco Faculty of Science, Moulay Ismail University, Meknès, Morocco
Design and Simulation of an Adaptive Acoustic Echo Cancellation (AEC) for Hands-ree Communications using a Low Computational Cost Algorithm Based Circular Convolution in requency Domain 1 *Azeddine Wahbi
More informationDESIGN OF LATTICE FORM OPTICAL DELAY LINE STRUCTURE FOR MICROWAVE BAND PASS FILTER APPLICATIONS
Progress In Electromagnetics Research C, Vol. 32, 197 206, 2012 DESIGN OF LATTICE FORM OPTICAL DELAY LINE STRUCTURE FOR MICROWAVE BAND PASS FILTER APPLICATIONS P. Praash and M. Ganesh Madhan * Department
More information