Can binary masks improve intelligibility?
|
|
- Bertina Stokes
- 5 years ago
- Views:
Transcription
1 Can binary masks improve intelligibility? Mike Brookes (Imperial College London) & Mark Huckvale (University College London) Apparently so... 2
2 How does it work? 3 Time-frequency grid of local SNR e s = speech energy, e n = noise energy, w() = frequency weighting - F() is some monotonic function - index is increased if attenuation applied in each cell where e n > e s - i.e. where local SNR < db 4 2
3 Use of classifier to estimate binary mask 5 Replication Similarities IEEE sentences as training testing materials Single male talker Babble and speech-shaped -5dB SNR Signals at 2, samples/sec Acoustic features based on modulation spectrum - code provided by Kim Feature vector incorporates time & frequency deltas SNR thresholds for constructing target mask on training data GMM classifier design, using full covariance Four GMMs to classify feature vectors based on division of training vectors into groups based on SNR. Differences We used a different, British English, talker We used babble from NOISEX ROM Thanks to: Toby Davies 6 3
4 Classifier performance -5dB SNR) SNR > Cells % Speech-shaped noise Hits False- Alarms Hits Babble noise False- Alarms Kim et al Ours 7 Classifier performance (@ -5dB SNR) SNR > Cells % Speech-shaped noise Hits False- Alarms Hits Babble noise False- Alarms Kim et al Ours
5 performance -5dB SNR) Words % Speech-shaped noise Babble noise No proc. Proc. Ideal No proc. Proc. Ideal Kim et al Ours 9 performance (@ -5dB SNR) Words % Speech-shaped noise Babble noise No proc. Proc. Ideal No proc. Proc. Ideal Kim et al Ours
6 Binary Mask Enhancement LTASS -5dB Recognised Mask Ideal Mask Binary Mask Enhancement Babble -5dB Recognised Mask Ideal Mask 2 6
7 What is going on? There are a number of arbitrary parameter settings in Kim et al (29) Sampling rate, window size, number of channels Down-sampling of modulation spectrum SNR thresholds for binary mask choice These may have become optimised for particular data set they used Overall performance may be very sensitive to small changes in system design We need to investigate and understand details of algorithm... over to Mike 3 What is the perfect binary mask? Original idea [Wang25]: Select Time-Frequency (TF) cells with S ( t, f ) N( t, f ) > where S and N are speech and noise power spectral densities in db and L is a threshold ( Local Criterion ) L Motivation: Masking Exclude TF cells with poor SNR since they give little information and may mask adjacent frequency bands However If we plot intelligibility versus L for different SNR levels the results do not match this theory 4 7
8 of Binary Masked Speech L= > db SNR L= > 2dB SNR SNR= 6 db: 9 db < L <-5 db Two independent sources of information [Kjems et al 2]:. Noisy speech signal SNR > & (L SNR) < 2. Noise-vocoded signal 3 db < (L SNR) < db The benefit of binary masking comes entirely from component 2 [Kjems et al, EUSIPCO-2] 5 Noise-Vocoded component Define Relative Criterion : R = L SNR = L ( S ( f ) N ( f )) Mask becomes: S( t, f ) N( t, f ) > R + S ( f ) N ( f ) Eliminate noise dependency by taking N ( t, f ) = N ( f ) S ( t, f ) S ( f ) > R Target Binary Mask Clean Speech TF analysis Active Level LTASS db db + R Threshold LTASS Noise TF analysis Mask TF synth 6 8
9 Unimodal Psychometric Function Modelling Product of two logistic curves Fixed guess/lapse rates 4 free parameters Modify to remove interaction between low and high slopes No change if low and high slopes are equal Negligible change if slopes are widely separated Estimation is easier and more stable Use 5% as a single figure of merit Not always ideal UTBM on LTASS noise, fft=4 ms, ov=4 < 4. > UTBM on LTASS noise, fft=2 ms < 8. > Psychometric Function Evaluation Digit triples: male+female Forced choice experiment Bayesian estimation of pdf of 4-D parameter vector Update pdf after each trial Select next R to give greatest expected entropy reduction Very quick convergence (e.g. 6 trials) After trial : UTBM on LTASS noise, fft=4 ms, ov= UTBM on LTASS noise, fft=4 ms, ov=4 Normalized semi-width (db) Ln up slope (ln prob/db) < 4. > Peak position (db) Ln down slope (ln prob/db)
10 Effect of FFT length TF analysis/synthesis Hamming window of length T Freq resolution ~.8/T Modulation bandwidth ~.9/T T=4ms, R can vary by 4 T=6ms performance worse: too much smoothing in modulation T=2ms performance worse: cannot resolve T=ms performance still OK File: psy23655.txt UTBM on LTASS noise, fft= ms, ov=4 < 32.4 > File: psy23545.txt UTBM on LTASS noise, fft=4 ms, ov=4 < 4. > File: psy23858.txt UTBM on LTASS noise, fft=2 ms, ov=4 < 2.2 > -2. File: psy2379.txt UTBM on LTASS noise, fft=6 ms, ov=4 < 35.7 > T=2 ms f res =9 Hz f mod <45 Hz T= ms f res =8 Hz f mod <9 Hz T=4 ms f res =45 Hz f mod <22.5 Hz T=6 ms f res = Hz f mod <5.6 Hz 9 Non-uniform frequency resolution FFT length kept at 5 ms f res =36 Hz, f mod <8 Hz Change mask resolution Estimate mask in erb domain,.5, and 2 erb resolution Observations Even at a resolution of.5 erb, the intelligibility is noticeably worse [surprising] Substantial degradation at erb resolution.5.5 File: psy22578.txt UTBM on LTASS noise, fft=5 ms df=.5 erb < 33.7 > File: psy22572.txt UTBM on LTASS noise, fft=5 ms df=. erb < 2. > File: psy txt UTBM on LTASS noise, fft=5 ms df= erb < 49.5 > File: psy22573.txt UTBM on LTASS noise, fft=5 ms df=2. erb < 9.6 > T=5 ms f res = erb T=5 ms f res =.5 erb T=5 ms f res =. erb T=5 ms f res =2. erb
11 Modulation Domain Model determined by accuracy of modulation domain spectrum [Taal et al, ICASSP 2] Encompasses both regions of the graph within one concept Measure by correlation coefficient between clean and masked speech in 4ms window for each frequency bin Maximize by comparing with low pass filtered version of spectrogram: Clean Speech TF analysis LP filter db db + R Threshold 2 Time Correlation based mask LP filter operates on power spectrum in time domain Hamming window impulse resp LP cutoff =.9/T LP Correlation coeff between clean and masked max when R= Observations Poor intelligibility compared to previous for short T LP Very noisy: mask tries to match noise when no speech energy Use noise floor threshold File: psy25834.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=6 ms x Hz File: psy2584.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=4 ms x Hz < 37. > < 33.7 > File: psy25845.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp= ms x Hz < 2. > T LP =6 ms F mod > Hz T LP =4 ms F mod >2.3 Hz T LP = ms F mod >9 Hz 22
12 Time-Freq Correlation based Mask Seems reasonable to try matching modulation in both time and frequency Apply LP filter in both directions Fix T LP =8 ms giving mod domain HP at. Hz Vary filter width in frequency direction Observations Makes rather little difference F LP =2Hz gives some benefit File: psy25859.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=8 ms x Hz File: psy2595.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=8 ms x 3 Hz File: psy259.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=8 ms x 6 Hz < 47.9 > < 45.2 > < 44.4 > File: psy2596.txt Band-pass TBM on LTASS noise, fft=4*5 ms, lp=8 ms x 2 Hz < 5 > T LP =8 ms F mod >. Hz F LP = Hz T LP =8 ms F mod >. Hz F LP =3 Hz T LP =8 ms F mod >. Hz F LP =6 Hz T LP =8 ms F mod >. Hz F LP =2 Hz 23 Summary benefits arise from the noise vocoded component of the masked speech Rapid estimation of unimodal psychometric functions is possible of noise vocoded speech Relative criterion can vary by ~4 db without loss of intelligibility FFT length can vary between 2 and 6 ms without loss of int Uniform frequency resolution is better than non-uniform (erb) Maximizing correlation in modulation domain is equivalent to HP filtering the spectrogram (when R=) Nice idea but little benefit Seems logical to extend it to freq axis but gives small improvement 24 2
13 Can Binary Masks Improve? Replication of Kim et al (29) show mask enhancement not straightforward to achieve Binary mask has two effects Preserve speech information in noisy signal when SNR good enough Encode speech information in vocoded noise when SNR poor Former just like any enhancement algorithm Latter relies on pattern recognition system Which may perform badly at low SNR just when it would be most useful 25 3
Speech Enhancement using Wiener filtering
Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationSOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC
SOBM - A BINARY MASK FOR NOISY SPEECH THAT OPTIMISES AN OBJECTIVE INTELLIGIBILITY METRIC Leo Lightburn and Mike Brookes Dept. of Electrical and Electronic Engineering, Imperial College London, UK ABSTRACT
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationBandwidth Extension for Speech Enhancement
Bandwidth Extension for Speech Enhancement F. Mustiere, M. Bouchard, M. Bolic University of Ottawa Tuesday, May 4 th 2010 CCECE 2010: Signal and Multimedia Processing 1 2 3 4 Current Topic 1 2 3 4 Context
More informationCHAPTER 6 SIGNAL PROCESSING TECHNIQUES TO IMPROVE PRECISION OF SPECTRAL FIT ALGORITHM
CHAPTER 6 SIGNAL PROCESSING TECHNIQUES TO IMPROVE PRECISION OF SPECTRAL FIT ALGORITHM After developing the Spectral Fit algorithm, many different signal processing techniques were investigated with the
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationAuditory Based Feature Vectors for Speech Recognition Systems
Auditory Based Feature Vectors for Speech Recognition Systems Dr. Waleed H. Abdulla Electrical & Computer Engineering Department The University of Auckland, New Zealand [w.abdulla@auckland.ac.nz] 1 Outlines
More informationDifferent Approaches of Spectral Subtraction Method for Speech Enhancement
ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches
More informationLecture 9: Time & Pitch Scaling
ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 9: Time & Pitch Scaling 1. Time Scale Modification (TSM) 2. Time-Domain Approaches 3. The Phase Vocoder 4. Sinusoidal Approach Dan Ellis Dept. Electrical Engineering,
More informationThe role of temporal resolution in modulation-based speech segregation
Downloaded from orbit.dtu.dk on: Dec 15, 217 The role of temporal resolution in modulation-based speech segregation May, Tobias; Bentsen, Thomas; Dau, Torsten Published in: Proceedings of Interspeech 215
More informationNonuniform multi level crossing for signal reconstruction
6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationREAL-TIME BROADBAND NOISE REDUCTION
REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time
More informationDimension Reduction of the Modulation Spectrogram for Speaker Verification
Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland Kong Aik Lee and
More informationSpectrum Sensing Using Bayesian Method for Maximum Spectrum Utilization in Cognitive Radio
5 Spectrum Sensing Using Bayesian Method for Maximum Spectrum Utilization in Cognitive Radio Anurama Karumanchi, Mohan Kumar Badampudi 2 Research Scholar, 2 Assoc. Professor, Dept. of ECE, Malla Reddy
More information(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
Tools and Applications Chapter Intended Learning Outcomes: (i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods
More informationThe function is composed of a small number of subfunctions detailed below:
Maximum Chirplet Transform Code These notes complement the Maximum Chirplet Transform Matlab code written by Fabien Millioz and Mike Davies, last updated 2016. This is a software implementation of the
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationPerceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter
Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School
More informationHCS 7367 Speech Perception
HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Power spectrum model of masking Assumptions: Only frequencies within the passband of the auditory filter contribute to masking. Detection is based
More informationVocoder (LPC) Analysis by Variation of Input Parameters and Signals
ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of
More informationAudio Restoration Based on DSP Tools
Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract
More informationSignal Processing Toolbox
Signal Processing Toolbox Perform signal processing, analysis, and algorithm development Signal Processing Toolbox provides industry-standard algorithms for analog and digital signal processing (DSP).
More informationVQ Source Models: Perceptual & Phase Issues
VQ Source Models: Perceptual & Phase Issues Dan Ellis & Ron Weiss Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA {dpwe,ronw}@ee.columbia.edu
More informationINSTANTANEOUS FREQUENCY ESTIMATION FOR A SINUSOIDAL SIGNAL COMBINING DESA-2 AND NOTCH FILTER. Yosuke SUGIURA, Keisuke USUKURA, Naoyuki AIKAWA
INSTANTANEOUS FREQUENCY ESTIMATION FOR A SINUSOIDAL SIGNAL COMBINING AND NOTCH FILTER Yosuke SUGIURA, Keisuke USUKURA, Naoyuki AIKAWA Tokyo University of Science Faculty of Science and Technology ABSTRACT
More informationINTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006
1. Resonators and Filters INTRODUCTION TO ACOUSTIC PHONETICS 2 Hilary Term, week 6 22 February 2006 Different vibrating objects are tuned to specific frequencies; these frequencies at which a particular
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationDigital Filtering: Realization
Digital Filtering: Realization Digital Filtering: Matlab Implementation: 3-tap (2 nd order) IIR filter 1 Transfer Function Differential Equation: z- Transform: Transfer Function: 2 Example: Transfer Function
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationRECENTLY, there has been an increasing interest in noisy
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In
More informationStudents: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa
Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions
More informationSpectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition
Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Author Shannon, Ben, Paliwal, Kuldip Published 25 Conference Title The 8th International Symposium
More informationSpeech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationUnderstanding Probability of Intercept for Intermittent Signals
2013 Understanding Probability of Intercept for Intermittent Signals Richard Overdorf & Rob Bordow Agilent Technologies Agenda Use Cases and Signals Time domain vs. Frequency Domain Probability of Intercept
More informationA Novel Technique or Blind Bandwidth Estimation of the Radio Communication Signal
International Journal of ISSN 0974-2107 Systems and Technologies IJST Vol.3, No.1, pp 11-16 KLEF 2010 A Novel Technique or Blind Bandwidth Estimation of the Radio Communication Signal Gaurav Lohiya 1,
More informationReading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.
L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are
More informationCS 188: Artificial Intelligence Spring Speech in an Hour
CS 188: Artificial Intelligence Spring 2006 Lecture 19: Speech Recognition 3/23/2006 Dan Klein UC Berkeley Many slides from Dan Jurafsky Speech in an Hour Speech input is an acoustic wave form s p ee ch
More informationBlind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model
Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model Jong-Hwan Lee 1, Sang-Hoon Oh 2, and Soo-Young Lee 3 1 Brain Science Research Center and Department of Electrial
More informationDimension Reduction of the Modulation Spectrogram for Speaker Verification
Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland tkinnu@cs.joensuu.fi
More informationAn Adaptive Adjacent Channel Interference Cancellation Technique
SJSU ScholarWorks Faculty Publications Electrical Engineering 2009 An Adaptive Adjacent Channel Interference Cancellation Technique Robert H. Morelos-Zaragoza, robert.morelos-zaragoza@sjsu.edu Shobha Kuruba
More informationCOM 12 C 288 E October 2011 English only Original: English
Question(s): 9/12 Source: Title: INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2009-2012 Audience STUDY GROUP 12 CONTRIBUTION 288 P.ONRA Contribution Additional
More informationOFDM Transmission Corrupted by Impulsive Noise
OFDM Transmission Corrupted by Impulsive Noise Jiirgen Haring, Han Vinck University of Essen Institute for Experimental Mathematics Ellernstr. 29 45326 Essen, Germany,. e-mail: haering@exp-math.uni-essen.de
More informationLinguistic Phonetics. Spectral Analysis
24.963 Linguistic Phonetics Spectral Analysis 4 4 Frequency (Hz) 1 Reading for next week: Liljencrants & Lindblom 1972. Assignment: Lip-rounding assignment, due 1/15. 2 Spectral analysis techniques There
More informationSpeech Enhancement for Nonstationary Noise Environments
Signal & Image Processing : An International Journal (SIPIJ) Vol., No.4, December Speech Enhancement for Nonstationary Noise Environments Sandhya Hawaldar and Manasi Dixit Department of Electronics, KIT
More informationDigital Signal Processing of Speech for the Hearing Impaired
Digital Signal Processing of Speech for the Hearing Impaired N. Magotra, F. Livingston, S. Savadatti, S. Kamath Texas Instruments Incorporated 12203 Southwest Freeway Stafford TX 77477 Abstract This paper
More informationQuestion 1 Draw a block diagram to illustrate how the data was acquired. Be sure to include important parameter values
Data acquisition Question 1 Draw a block diagram to illustrate how the data was acquired. Be sure to include important parameter values The block diagram illustrating how the signal was acquired is shown
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationEffective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a
R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,
More informationSPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION. Changkyu Choi, Seungho Choi, and Sang-Ryong Kim
SPEECH ENHANCEMENT USING SPARSE CODE SHRINKAGE AND GLOBAL SOFT DECISION Changkyu Choi, Seungho Choi, and Sang-Ryong Kim Human & Computer Interaction Laboratory Samsung Advanced Institute of Technology
More informationSpur Detection, Analysis and Removal Stable32 W.J. Riley Hamilton Technical Services
Introduction Spur Detection, Analysis and Removal Stable32 W.J. Riley Hamilton Technical Services Stable32 Version 1.54 and higher has the capability to detect, analyze and remove discrete spectral components
More informationTunable Multi Notch Digital Filters A MATLAB demonstration using real data
Tunable Multi Notch Digital Filters A MATLAB demonstration using real data Jon Bell CSIRO ATNF 27 Sep 2 1 Introduction Many people are investigating a wide range of interference suppression techniques.
More informationLong Range Acoustic Classification
Approved for public release; distribution is unlimited. Long Range Acoustic Classification Authors: Ned B. Thammakhoune, Stephen W. Lang Sanders a Lockheed Martin Company P. O. Box 868 Nashua, New Hampshire
More informationAdvanced Cell Averaging Constant False Alarm Rate Method in Homogeneous and Multiple Target Environment
Advanced Cell Averaging Constant False Alarm Rate Method in Homogeneous and Multiple Target Environment Mrs. Charishma 1, Shrivathsa V. S 2 1Assistant Professor, Dept. of Electronics and Communication
More informationKeywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.
Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement
More informationEnhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients
ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds
More informationImproved Detection by Peak Shape Recognition Using Artificial Neural Networks
Improved Detection by Peak Shape Recognition Using Artificial Neural Networks Stefan Wunsch, Johannes Fink, Friedrich K. Jondral Communications Engineering Lab, Karlsruhe Institute of Technology Stefan.Wunsch@student.kit.edu,
More informationA Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis
A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis Colin Vaz, Vikram Ramanarayanan, and Shrikanth Narayanan USC SAIL Lab INTERSPEECH Articulatory Data
More informationEnhancement of Speech in Noisy Conditions
Enhancement of Speech in Noisy Conditions Anuprita P Pawar 1, Asst.Prof.Kirtimalini.B.Choudhari 2 PG Student, Dept. of Electronics and Telecommunication, AISSMS C.O.E., Pune University, India 1 Assistant
More informationCepstrum alanysis of speech signals
Cepstrum alanysis of speech signals ELEC-E5520 Speech and language processing methods Spring 2016 Mikko Kurimo 1 /48 Contents Literature and other material Idea and history of cepstrum Cepstrum and LP
More informationMultiple Sound Sources Localization Using Energetic Analysis Method
VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova
More informationTelecommunication Electronics
Politecnico di Torino ICT School Telecommunication Electronics C5 - Special A/D converters» Logarithmic conversion» Approximation, A and µ laws» Differential converters» Oversampling, noise shaping Logarithmic
More informationNon-coherent pulse compression - concept and waveforms Nadav Levanon and Uri Peer Tel Aviv University
Non-coherent pulse compression - concept and waveforms Nadav Levanon and Uri Peer Tel Aviv University nadav@eng.tau.ac.il Abstract - Non-coherent pulse compression (NCPC) was suggested recently []. It
More informationOperational Amplifiers
Operational Amplifiers Continuing the discussion of Op Amps, the next step is filters. There are many different types of filters, including low pass, high pass and band pass. We will discuss each of the
More informationTarget Echo Information Extraction
Lecture 13 Target Echo Information Extraction 1 The relationships developed earlier between SNR, P d and P fa apply to a single pulse only. As a search radar scans past a target, it will remain in the
More informationRF Characterization Report
SMA-J-P-H-ST-MT1 Mated with: RF316-01SP1-01BJ1-0305 Description: 50-Ω SMA Board Mount Jack, Mixed Technology Samtec, Inc. 2005 All Rights Reserved Table of Contents Introduction...1 Product Description...1
More informationLearning New Articulator Trajectories for a Speech Production Model using Artificial Neural Networks
Learning New Articulator Trajectories for a Speech Production Model using Artificial Neural Networks C. S. Blackburn and S. J. Young Cambridge University Engineering Department (CUED), England email: csb@eng.cam.ac.uk
More informationChannel selection in the modulation domain for improved speech intelligibility in noise
Channel selection in the modulation domain for improved speech intelligibility in noise Kamil K. Wójcicki and Philipos C. Loizou a) Department of Electrical Engineering, University of Texas at Dallas,
More information6.555 Lab1: The Electrocardiogram
6.555 Lab1: The Electrocardiogram Tony Hyun Kim Spring 11 1 Data acquisition Question 1: Draw a block diagram to illustrate how the data was acquired. The EKG signal discussed in this report was recorded
More informationENF PHASE DISCONTINUITY DETECTION BASED ON MULTI-HARMONICS ANALYSIS
U.P.B. Sci. Bull., Series C, Vol. 77, Iss. 4, 2015 ISSN 2286-3540 ENF PHASE DISCONTINUITY DETECTION BASED ON MULTI-HARMONICS ANALYSIS Valentin A. NIŢĂ 1, Amelia CIOBANU 2, Robert Al. DOBRE 3, Cristian
More informationAdaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks
Australian Journal of Basic and Applied Sciences, 4(7): 2093-2098, 2010 ISSN 1991-8178 Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks 1 Mojtaba Bandarabadi,
More informationVoice Activity Detection for Speech Enhancement Applications
Voice Activity Detection for Speech Enhancement Applications E. Verteletskaya, K. Sakhnov Abstract This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity
More informationAnalog and Telecommunication Electronics
Politecnico di Torino - ICT School Analog and Telecommunication Electronics D5 - Special A/D converters» Differential converters» Oversampling, noise shaping» Logarithmic conversion» Approximation, A and
More informationEE390 Final Exam Fall Term 2002 Friday, December 13, 2002
Name Page 1 of 11 EE390 Final Exam Fall Term 2002 Friday, December 13, 2002 Notes 1. This is a 2 hour exam, starting at 9:00 am and ending at 11:00 am. The exam is worth a total of 50 marks, broken down
More informationAutomatic Evaluation of Hindustani Learner s SARGAM Practice
Automatic Evaluation of Hindustani Learner s SARGAM Practice Gurunath Reddy M and K. Sreenivasa Rao Indian Institute of Technology, Kharagpur, India {mgurunathreddy, ksrao}@sit.iitkgp.ernet.in Abstract
More informationAUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS)
AUDL GS08/GAV1 Auditory Perception Envelope and temporal fine structure (TFS) Envelope and TFS arise from a method of decomposing waveforms The classic decomposition of waveforms Spectral analysis... Decomposes
More informationWARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS
NORDIC ACOUSTICAL MEETING 12-14 JUNE 1996 HELSINKI WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS Helsinki University of Technology Laboratory of Acoustics and Audio
More informationFactors Governing the Intelligibility of Speech Sounds
HSR Journal Club JASA, vol(19) No(1), Jan 1947 Factors Governing the Intelligibility of Speech Sounds N. R. French and J. C. Steinberg 1. Introduction Goal: Determine a quantitative relationship between
More informationC/N Ratio at Low Carrier Frequencies in SFQ
Application Note C/N Ratio at Low Carrier Frequencies in SFQ Products: TV Test Transmitter SFQ 7BM09_0E C/N ratio at low carrier frequencies in SFQ Contents 1 Preliminaries... 3 2 Description of Ranges...
More informationA New General Purpose, PC based, Sound Recognition System
A New General Purpose, PC based, Sound Recognition System Neil J Boucher (1), Michihiro Jinnai (2), Ian Gynther (3) (1) Principal Engineer, Compustar, Brisbane, Australia (2) Takamatsu National College
More informationAdvanced bridge instrument for the measurement of the phase noise and of the short-term frequency stability of ultra-stable quartz resonators
Advanced bridge instrument for the measurement of the phase noise and of the short-term frequency stability of ultra-stable quartz resonators F. Sthal, X. Vacheret, S. Galliou P. Salzenstein, E. Rubiola
More informationElectrical & Computer Engineering Technology
Electrical & Computer Engineering Technology EET 419C Digital Signal Processing Laboratory Experiments by Masood Ejaz Experiment # 1 Quantization of Analog Signals and Calculation of Quantized noise Objective:
More informationDepartment of Electronics and Communication Engineering 1
UNIT I SAMPLING AND QUANTIZATION Pulse Modulation 1. Explain in detail the generation of PWM and PPM signals (16) (M/J 2011) 2. Explain in detail the concept of PWM and PAM (16) (N/D 2012) 3. What is the
More informationSignal Processing for Digitizers
Signal Processing for Digitizers Modular digitizers allow accurate, high resolution data acquisition that can be quickly transferred to a host computer. Signal processing functions, applied in the digitizer
More informationComparison of Spectral Analysis Methods for Automatic Speech Recognition
INTERSPEECH 2013 Comparison of Spectral Analysis Methods for Automatic Speech Recognition Venkata Neelima Parinam, Chandra Vootkuri, Stephen A. Zahorian Department of Electrical and Computer Engineering
More informationChapter IV THEORY OF CELP CODING
Chapter IV THEORY OF CELP CODING CHAPTER IV THEORY OF CELP CODING 4.1 Introduction Wavefonn coders fail to produce high quality speech at bit rate lower than 16 kbps. Source coders, such as LPC vocoders,
More informationSpeech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 4 April 2015, Page No. 11143-11147 Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya
More informationAn Adaptive Kernel-Growing Median Filter for High Noise Images. Jacob Laurel. Birmingham, AL, USA. Birmingham, AL, USA
An Adaptive Kernel-Growing Median Filter for High Noise Images Jacob Laurel Department of Electrical and Computer Engineering, University of Alabama at Birmingham, Birmingham, AL, USA Electrical and Computer
More informationROBUST PITCH TRACKING USING LINEAR REGRESSION OF THE PHASE
- @ Ramon E Prieto et al Robust Pitch Tracking ROUST PITCH TRACKIN USIN LINEAR RERESSION OF THE PHASE Ramon E Prieto, Sora Kim 2 Electrical Engineering Department, Stanford University, rprieto@stanfordedu
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationFrequency Domain Representation of Signals
Frequency Domain Representation of Signals The Discrete Fourier Transform (DFT) of a sampled time domain waveform x n x 0, x 1,..., x 1 is a set of Fourier Coefficients whose samples are 1 n0 X k X0, X
More informationPitch Period of Speech Signals Preface, Determination and Transformation
Pitch Period of Speech Signals Preface, Determination and Transformation Mohammad Hossein Saeidinezhad 1, Bahareh Karamsichani 2, Ehsan Movahedi 3 1 Islamic Azad university, Najafabad Branch, Saidinezhad@yahoo.com
More informationUtilization of Multipaths for Spread-Spectrum Code Acquisition in Frequency-Selective Rayleigh Fading Channels
734 IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 49, NO. 4, APRIL 2001 Utilization of Multipaths for Spread-Spectrum Code Acquisition in Frequency-Selective Rayleigh Fading Channels Oh-Soon Shin, Student
More informationTarget detection in side-scan sonar images: expert fusion reduces false alarms
Target detection in side-scan sonar images: expert fusion reduces false alarms Nicola Neretti, Nathan Intrator and Quyen Huynh Abstract We integrate several key components of a pattern recognition system
More informationNon-intrusive intelligibility prediction for Mandarin speech in noise. Creative Commons: Attribution 3.0 Hong Kong License
Title Non-intrusive intelligibility prediction for Mandarin speech in noise Author(s) Chen, F; Guan, T Citation The 213 IEEE Region 1 Conference (TENCON 213), Xi'an, China, 22-25 October 213. In Conference
More information