AN INTEGRATED APPROACH FOR NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION IN HEARING AIDS

Size: px
Start display at page:

Download "AN INTEGRATED APPROACH FOR NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION IN HEARING AIDS"

Transcription

1 AN INTEGRATED APPROACH FOR NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION IN HEARING AIDS KimNgo 1,SimonDoclo 1,2,AnnSpriet 1,3,MarcMoonen 1,JanWouters 3 andsørenholdtjensen 4 1 KatholiekeUniversiteitLeuven Department of Electrical Engineering, ESAT-SCD Kasteelpark Arenberg, B-31 Leuven, Belgium kim.ngo@esat.kuleuven.be 3 KatholiekeUniversiteitLeuven Department of Neurosciences, ExpORL, O.& N2 Herestraat 49/721, 3 Leuven, Belgium Jan.Wouters@med.kuleuven.be 2 NXPSemiconductors Corporate I&T-Research, Sound and Systems Group Interleuvenlaan, 31 Leuven, Belgium simon.doclo@nxp.com 4 AalborgUniversity Department of Electronic Systems, MISP Niels Jernes Vej A6-3, 92 Aalborg, Denmark shj@es.aau.dk ABSTRACT Hearing aids typically use a serial concatenation of Noise Reduction(NR) and Dynamic Range Compression(). However, the in such a concatenation negatively affects the performance of thenrstage:theresidualnoiseafternrisamplifiedbythe, resulting in a signal-to-noise-ratio(snr) degradation. In this paper, we present an integrated solution for NR and. The solution isbasedonanestimateoftheamountofspeechandnoiseineach timesegment.incasethespeechisdominant,thenrislessactive anditisdesirabletohaveasmuchaspossible,whereasina noisedominantsegmentthenrismoreactiveandtheideaisnotto compromise this operation by applying. Experimental results confirmed that a serial concatenation of NR and degrades the SNR improvement, and that the proposed solution offers a better SNR improvement compared to a serial concatenation. 1. INTRODUCTION Reduced audibility and reduced dynamic range between threshold anddiscomfortlevelaresomeoftheproblemsthatpeoplewitha sensorineural hearing loss are dealing with. Furthermore, background noise(multiple speakers, traffic etc.) is a great problem and is especially damaging to speech intelligibility. It is known that hearing impaired people need a higher signal-to-noise-ratio(snr) to communicate effectively[1]. Therefore, Noise Reduction(NR) and Dynamic Range Compression() are basic components in hearing aids nowadays[2], but generally these components are developed and evaluated independently of each other. Although sophisticatedalgorithmsfornrandexistthereisstillanopen questionastohowthesealgorithmshouldbecombinedintoanintegrated approach, which has not received a lot of attention so far. The interesting issue now is to analyse undesired effects when these algorithms operate together in an integrated scheme. The integration of hearing aid algorithms is a challenging task since each algorithm can counteract and limit the functionality of other algorithms. When NR and are serially concatenated, undesired interaction effects typically occur, since each algorithm serves different purposes. For instance, can counteract NR by amplifying the residual noise after NR, which consequently degrades the SNRanddefeatsthepurposeofusingNR.AnintegrationofsinglechannelNRandwasproposedin[3]whereaminimummean square error and a maximum a posteriori optimal estimator are proposed that incorporate in the derivation of the NR algorithm. ThisresearchworkwascarriedoutattheESATlaboratoryofthe Katholieke Universiteit Leuven, in the frame of the Marie-Curie Fellowship EST-SIGNAL program( under contract No. MEST-CT , and the Concerted Research Action GOA- AMBioRICS. Ann Spriet is a postdoctoral researcher funded by F.W.O.- Vlaanderen. The scientific responsibility is assumed by its authors. Another issue is the evaluation of such integrated schemes where the lack of an overall evaluation criterion indeed makes the integration more difficult. In the evaluation the crucial question willbeastowhicheffectsaremostdamagingtospeechintelligibility, e.g. the amount of background noise or the audibility. In this work, objective quality measures are used to evaluate the integrated scheme, such as SNR and signal distortion measures. Subjective evaluation using hearing aid users is not included in this work. InthispaperwewillfocusonintegratingNRandandinvestigate if any undesired effect occurs when combining NR and. The integrated scheme applies to both single-channel and multi-channel NR. The paper is organised as follows. In Section 2 the standard scheme is introduced. Section 3 discusses the integration of NR and. In Section 4 experimental results are presented. The work is summarized in Section DYNAMIC RANGE COMPRESSION In this Section, we briefly introduce the basic concept behind. Theroleofistomapthewidedynamicrangeofaspeech signal into the reduced dynamic range of a hearing impaired listener. Thebasicconceptofistoautomaticallyadjustthegainbased ontheintensityleveloftheinputsignal. Segmentswithahigh intensity level are attenuated while segments with a low intensity are amplified. This makes weak sounds audible while loud sounds are not becoming uncomfortably loud. is typically defined by the following parameters: Compression threshold(ct). Compression ratio(cr). Attack and release time. HearingaidgainG db. CTisdefinedindBandisthepointwherebecomesactive. BelowCTthegainislinearandabovetheCT,isactivei.e.the gain is reduced. CR determines the degree of compression. A CR of2(i.e.2:1)meansthatforevery2dbincreaseintheinputsignal, theoutputsignalincreasesby1db.theattackandreleasetimeis defined in milliseconds and specifies how fast the gain is changed according to changes in the input signal. The attack time is defined asthetimeittakesforthecompressortoreacttoanincreaseininput signal level. The release time is the time taken for the compressor toreacttoadecreaseininputlevel. ThehearingaidgainG db is the maximum amount of amplification in db which is specified by thelinearparti.e.belowctofthecurve.acurvewith CR=2,CT=3dBandG db =dbisshowninfigure1. WedefinePin,dB andp out,db astheinputandoutputpowerin db of the, respectively, as in,db =log ( P in (ω,k) 2 ) (1)

2 NR 7 6 (a) Existing method for combining NR and (serial concatenation) CR NR CT Figure1:curve(CRdefineshowtheslopeischangedandCT isthepointatwhichtheslopechanges). and out,db =log ( P out (ω,k) 2 ) (2) attimeinstantkandfrequency ω=2πf.thecurveisdefined basedonalinearcurveandacompressioncurveineq.3andeq. 4, respectively: P linear,db = in,db +G db (3) P compression,db = CT + 1 CR (P in,db CT)+G db (4) wherepin,db istheinputpowerindb.thecurveisgivenby Eq.5 out,db = { Plinear,dB ifpin,db <CT P compression,db ifpin,db CT (5) ThegainindBiscalculatedastheoutputlevelminustheinput level, i.e. G,dB = out,db P in,db (6) Theattackandreleasetimearethenappliedtothegain G,dB usingafirst-orderrecursiveaveragingfilter,beforethe gainisappliedtotheinputpin,db. Thepowerestimationintheschemeusedinthispaper isbasedonindividualfftbins. Ifitisdesiredtohavethe working on a specific number of frequency bands e.g. critical bands, thiscanbeachievedbycombiningthefftbins(e.g. byusing individual FFT bins at low frequencies and combining FFT bins at higher frequencies) as is typically done in hearing aid applications [4]. 3. INTEGRATION OF NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION ThegoalofaNRschemeistoimprovespeechintelligibilitybyreducing the effects of any noise source(e.g. multiple speakers, traffic etc). The NR can be a single-channel or a multi-channel algorithm. Theontheotherhandamplifiessignalsbasedontheintensity level and makes no distinction between speech or noise. This means that noise already attenuated by the NR algorithm can be amplified bythe.thisisanundesiredeffectleadingtoadegradation ofthesnr,sincetheresidualnoiseisamplifiedandthespeechis attenuated. This is one of the crucial problems at hand when cascadingnrand.anexistingmethodforcascadingnrand is a simple serial concatenation, depicted in figure 2(a). Dual (b) Novel method for combining NR and (Dual-) Figure2:AnexistingandanovelapproachforcombiningNRand 3.1 Integration Concept Weintroduceadual-concepttointegrateNRand.The goal is to control the without counteracting the NR performance. The integrated approach is depicted in figure 2(b). The basicideabehindthis,istoidentifytheamountofspeechandnoise presentinasignalsegment.theamountofspeechandnoiseisestimatedbasedontheinputandtheoutputofthenr(seealsosection 3.3).Theappliedgaininthedual-dependsontheamountof speech and noise. This distinction between speech and noise makes itpossibletoapplytomakethespeechaudiblewithoutamplifyingtheresidualnoise.notethatastandardschemeisbased only on the input intensity level and does not make any distinction between speech and noise. Thebasicconceptistoapplyadifferenttothespeechand the noise segments. We therefore introduce two curves which aredefinedsimilarlyasineq.3-5. P s,db -speech(speechdominantcase). P n,db -noise(noisedominantcase). Thesuperscriptssandnareusedtorefertospeechandnoise,respectively. In the case where speech is dominant we apply the speechbasedonp s,db. Ifnoiseisdominantitisundesirabletoamplifythenoiseandthereforealowergainisapplied i.e.weapplythenoisebasedonp n,db.inthecasewhere speechandnoisearepresentatthesametimewedefineaweighted sumofthetwocurvesgivenineq.7. P dual,db = (1 ) P n,db + Ps,dB (7) Here isanestimateoftheprobabilitythatspeechornoiseis present. If =1thereisnodual-andtheisbasedon P,dB s. As <1dual-isactivewithatradeoffbetween P,dB s andpn,db.thedual-gainis, G dual,db =P dual,db in,db () whichisappliedtotheoutputofthenralgorithm. 3.2 Speech and Noise curves The basic extension from a standard scheme to the dual- isthatinthedual-weusetwocurves. Thereasonthat weneedtwocurveisthatwewishtotradeoffbetweenspeech andnoisesegments.p s,db andpn,db arebasciallydefinedby chosingdifferentvaluesforthect,crandg db definedineq.3 andeq.4.thedual-curvesareshowninfigure3.thetradeoff parameter betweenthetwocurvesaredefinedineq.7. Therationalebehindthenoisecurveisthatithasalower gaincomparedtothespeechcurve,asweindeedwishtoapply

3 1 6 4 s n (a) Dual-1 s n (b) Dual-2 Figure 3: Two different approaches for dual-. alowergaintothenoisesegments.asmentionedwewanttoapply without compromising the NR but setting the noise curvetoolowwemightcompromisetheoperationofthe.the goalofdual-isthustofindapropertradeoffbetweennrand. For dual-, we introduce two different approaches which differinthewayp,db n isdefined. Inthefirstapproachdual- 1,thenoisecurveisdefinedbyalinearcurvewitha lineargaing n db ofdb.thisisdepictedinfigure3(a).thedashed linerepresentsthenoisecurvep,db n andthesolidlinerepresentsthespeechcurvep,db s.asmentioned,thetradeoff betweenthesetwocurvesisdefinedwiththeparameter. With dual-1theimpactof isreducedwhenthecrisincreased, sincebeyondtheintersectionbetweenp,db s andpn,db the dual- concept is not active. Dual-1 can have advantages whenthenoiseisdominantorhasalowintensitylevel,whichis wheredual-1hasthelargesttradeoff,orifalowcrisdesired. Thesecondapproachisdual-2.HereP,dB n hasthesame CRandCTasP,dB s butisshiftedtowardslowergainsi.e.gs db > G n db,seefigure3(b).indual-2therangefor iskeptconstant whenthecrisincreased.ifthegaing n db issetclosertogs db the integration is approaching a serial concatenation of NR and. 3.3 Speech and Noise Detection Theparameter thatisusedtotradeoffbetweenspeechandnoise isbasedonthepowerratiobetweentheoutputandtheinput ofthenralgorithm,definedineq.9, α NR (ω,k) = Ps out,nr (ω,k)+pn out,nr (ω,k) Pin,NR s (ω,k)+pn in,nr (ω,k) (9) 1 P n,db P dual,db P s,db α min α max 1 α NR Figure 4: Tradeoff parameters for dual- where Pout,NR s (ω,k) and Pn out,nr (ω,k) are the speech and the noisecomponentsattheoutputofthenrandpin,nr s (ω,k)and Pin,NR n (ω,k)arethespeechandthenoisecomponentsattheinput ofthenr. α NR isusedtodeterminewhetherspeechornoiseisdominant. The NR algorithm preserves speech while attenuating the noise. Hence,ifspeechisdominant α NR willapproachone.ifweassume that the noise is dominant the following statements can be written, Pout,NR s (ω,k) Ps in,nr (ω,k) Pn in,nr (ω,k),pn out,nr (ω,k) () so that, α NR (ω,k) Pn out,nr (ω,k) P n in,nr (ω,k) (11) which is considerably smaller than one. This means that we can evaluate the NR performance by observing the noise power before andafternr.ifthereisnoiseatlowerinputsnrthenralgorithm ismoreactivewhichmeansthat α NR issmall. Inthiscase should not counteract the NR and amplify the residual noise. On theotherhand,ifthereismorespeechpresentathighersnrthe NRislessactiveand α NR willbeclosertoone. Inthiscase,itis desirabletoapply.basically,wewanttotradeoffbetweennr andbasedonthespeechandthenoisecontributiondefinedby α NR. Thenextstepinthedual-approachistomap α NR into based on a threshold function. The threshold function can also be considered as a soft Voice Activity Detector which is illustrated in figure 4. Thethresholdfunctioniscontrolledbyparameters α min and α max,andisgivenineq. = =1 if α NR α max = if α NR α min = α NR α min α max α min otherwise () If α NR islargerthan α max accordingtop s,db isapplied, andifitisbelow α min theisbasedonp n,db. Inbetween thereisatradeoffaccordingtothepowerratiobetweentheoutputandinputofthenr.when α min and α max arebothsettozero the integration corresponds to a serial concatenation, where NR is performed before. Tosummarize,wefirstestimate α NR whichreflectstheamount ofspeechandnoiseintheinput. α NR isthenusedtoestimate the tradeoff parameter. Finally, a dual- gain is computed basedonaspeechandanoisecurve.theobjectivehereisthat the speech is amplified/compressed while the noise is attenuated or at least amplified less than the speech. 4. EXPERIMENTAL RESULTS In this Section, experimental results for the integrated approach for NR and are presented.

4 4.1 Set-up and performance measures The multi-microphone NR scheme used in this paper is the wellknown Generalized Sidelobe Canceler(GSC)[5] consisting of a fixed spatial pre-processor and a multichannel adaptive noise canceler(anc).thenralgorithmimplementedhereisbasedona Frequency Domain Adaptive Filter(FDAF)[6] using a Weighted Overlap-Add(WOLA) analysis/synthesis structure. We have performed simulations with a 2-microphone behind-the-ear hearing aid.thespeechandthenoisesourcesarelocatedat and1,respectively. The speech signals consist of sentences from the HINTdatabase[7]. The noise signal consist of a multi-talker babble from Auditec[],atdBinputSNR.Theinputlevelwassetto65dB SPLatthehearingaidmicrophones. TheFFTframesizewasset to256(i.e.,16ms),ata16khzsamplingrate,with7.5%overlap between sucessive frames. Each frame is weighted with a Hanning window. To assess the NR performance the intelligibility-weighted signal-to-noise ratio(snr)[9] is used which is defined as SNR intellig = I i (SNR i,out SNR i,in ) (13) i where I i is the band importance function defined in [] and SNR i,out andsnr i,in representstheoutputsnrandtheinputsnr (in db) of the ith band, respectively. For measuring the signal distortion a frequency-weighted log-spectral signal distortion(sd) is used defined as SD = 1 K ( f u Pout,k s K w ERB (f) log (f) ) 2 k=1 f l Pin,k s (f) df () wherekisthenumberofframes,pout,k s (f)istheoutputpowerspectrumofthekthframe,pin,k s (f)istheinputpowerspectrumofthekth frameandfisthefrequencyindex.thesdmeasureiscalculated withafrequency-weightingfactorw ERB (f)givingequalweightfor each auditory critical band, as defined by the equivalent rectangular bandwidth(erb) of the auditory filter[11]. The SD is measured between the compressed version of the clean speech reference signal(after the fixed spatial pre-processor) andthespeechcomponentoftheoutputsignalofthetotalscheme containing NR and. The clean speech reference signal is compressedwithouttheuseofdual-,wherethegainiscomputedbasedonthecleanspeechitself. Itisassumedthatnoise dominantframeswillhavealargereffectonthesdmeasureinthe dual- scheme. SimulationsareperformedfordifferentCRand α max forboth dual-1 and dual-2 and compared to a serial concatenation ofnrand.thefirstsimulationistoverifythatahighercr deteriorates the SNR for a serial concatenation compared to dual-.inthesecondsimulation,thecrisfixedandtheeffectof α max ondual-1comparedtodual-2isevaluated.experiments with dual-1 and dual-2 are basically to show that the performance of the integrated scheme is affected by the way the curves are defined. Forallsimulationstheattackandreleasetimearefixedto at=5msandrt=7ms.thehearingaidgaing s db issetto3dband thect s =3dB. 4.2 EffectofCRonSNRimprovementandSD In the first simulation the following setttings are used, SerialconcatenationofNRand(α min =, α max =) Dual-1(α min =.,α max =.7,G n db =db) Dual-2(α min =., α max =.7,G n db =db) CRisvariedfromonetofive.Figure5(a)showstheSNRimprovementforthecasewheretheCRisincreased. ThesolidlinerepresentstheSNRimprovementwhentheNRandareserially SNR (db) 15 5 input SNR db Serial concatenation Dual 1 Dual Compression Ratio (CR) (a) SNR improvement for dual- compared to a serial concatenation SD (db) 6 Serial concatenation Dual 1 Dual Compression Ratio (CR) (b) SD for dual- compared to a serial concatenation Figure 5: Results for dual- compared to a serial concatenation asafunctionofcr. concatenated, where a CR=1 corresponds to the SNR improvement forthenrwithout.thedashedlineandthe( )markerline represents the SNR improvement for dual-1 and dual-2, respectively. Notice that with dual- there is an SNR improvementsinceisappliedtothespeechsegments,buttheresidual noiseisnotamplified.atlowcrthedual-1hasabettersnr improvement but at higher CR dual-2 is better. This happens fortworeasons:firsttheimpactof isreducedindual-1when CRisincreased.Secondly,G n db fordual-1isdbandfordual- 2G n dbissettodb,whichmeansthat fordual-1has agreaterimpactatlowcr.settingg n db todbindual-2will resultinabettersnrimprovementforallvaluesofcr,butthis comesatthecostofless.fordual-2thesnrimprovementcanbecontrolledbychangingg n db,ifthisvalueiscloserto G s db theintegrationisapproachingaserialconcatenation. Figure5(b)showstheSDforthethreecasesandheretheSDis lowestwhennodual-isapplied.fordual-1thesdisinitially higher which corresponds to the better SNR improvement, and thenthesdisdecreasingascrisincreased,whichcorrespondsto smaller SNR improvement. For dual-2 the SD is higher when CR is increased which is also the case for the serial concatenation. 4.3 Effectof α max onsnrimprovementandsd In the second simulation the following settings are used, Dual-1(CR=1.5,G n db =db) Dual-1(CR=3,G n db =db) Dual-2(CR=1.5,G n db =db) Dual-2(CR=3,G n db =db)

5 SNR (db) SD (db) 16 6 input SNR db Dual 1 CR=1.5 Dual 1 CR=3 Dual 2 CR=1.5 Dual 2 CR= α max (a) SNR improvement for dual- for CR=1.5 and Dual 1 CR=1.5 Dual 1 CR=3 Dual 2 CR=1.5 Dual 2 CR= α max (b)sdfordual-forcr=1.5and3 Figure 6: Results for dual-1 and Dual-2 as a function of α max. α min =.and α max isvariedfrom α min toone.figure6(a)shows thesnrimprovementforthecasewhere α max isincreased. The solid line represents the dual-1 for CR=1.5 and this approach outperforms dual-2 for CR=1.5 represented by( ) marker line. When CR=3 the dual-2 represented by the( ) marker line outperforms dual-1 represented by the dashed line, and it is clear thatforcr=3dual-1cannotimprovethesnrmuchwhichis againduetothefactthattheimpactof isreducedwithhighercr. Figure6(b)showstheSDwherethedashedand( )markerline show almost similar distortion, but here it is worth noting that dual- 2 still has a significant SNR improvement compared to dual- 1withCR=3.ForthecasewithCR=1.5thedual-1shows asignificantsnrimprovement,butthiscomesatthecostofhigher SD(shownwiththesolidlinewhichshouldbecomparedtothe( ) marker line). Overall, the SNR improvement comes at the cost of greater distortion. The SD basically represents how far away the dual- is fromtheoriginalcurvep s,db,whichmeansthatthesdis higherwhentheimpactof islargerresultinginmoreactivedual-.inotherwords,thereisatradeoffbetweensnrimprovement andhowclosethedual-istotheoriginalcurve. The dual- plays an important role when the noise is dominant i.e. theisapproachingp n,db.thisisthecasewherethesdis highest, which means that especially the noise dominant segments contribute to the SD. We therefore assume that the speech dominant part is less distorted. measureoftheamountofspeechandnoiseintheinput.this measure,definedby α NR,isestimatedbasedonthepowerratioof theoutputandtheinputofthenr. α NR isusedtotradeoffbetween the amount of that is applied without counteracting the NR performance. Weintroducetradeoffparameters α min and α max to control the integration. When these parameters are set to zero, the integration corresponds to a serial concatenation of NR and. Two dual- approaches have been proposed, dual-1 has a better performance at lower CR whereas dual-2 shows more flexibilityandworksoverawiderangeofcrsettings. WehaveshownthatincreasingtheCRleadstoareducedSNR improvement, when NR and are serially concatenated. Dual- resultedinanimprovementinsnrcomparedtoaserialconcatenationwhencrisincreased.withthecrfixedwehaveshown thatbyincreasing α max itispossibletoimprovethesnrasaresult of the dual- becoming more active. REFERENCES [1] H. Dillon, Hearing Aids. Boomerang Press, Thieme, 1. [2] V. Hamacher, J. Chalupper, J. Eggers, E. Fischer, U. Kornagel, H. Puder, and U.Rass, Signal processing in high-end hearing aids: State of the art, challenges, and future trends, EURASIP Journal on Applied Signal Processing, vol. 1, pp , 5. [3] D.Mauler,A.M.Nagathil,andR.Martin, Onoptimalestimation of compressed speech for hearing aids, Proc. Interspeech, pp , August 27-31, 7. [4] J. M. Kates, Principles of digital dynamic-range compression, Trends In Amplification, vol. 9, no. 2, pp , 5. [5] L. Griffiths and C. Jim, An alternative approach to linearly constrained adaptive beamforming, IEEE Transactions on Antennas and Propagation, vol. 3, no. 1, pp , Jan 192. [6] J. Shynk, Frequency-domain and multirate adaptive filtering, Signal Processing Magazine, IEEE, vol. 9, no. 1, pp. 37, Jan [7] M.Nilsson,S.D.Soli,andA.Sullivan, Developmentofthe Hearing in Noise Test for the measurement of speech receptionthresholdsinquietandinnoise, jasa,vol.95,no.2,pp. 5 99, Feb [] Auditec, Auditory Tests(Revised), Compact Disc, Auditec, St. Louis, St. Louis, [9] J. E. Greenberg, P. M. Peterson, and P. M. Zurek, Intelligibility-weighted measures of speech-to-interference ratio and speech system performance, J. Acoustic. Soc. Am., vol.94,no.5,pp.39 3,Nov [] Acoustical Society of America, ANSI S American National Standard Methods for calculation of the speech intelligibility index, June [11] B. Moore, An Introduction to the Psychology of Hearing, 5th ed. Academic Press, CONCLUSIONS In this paper, we have presented a novel approach for integrating NRandbasedonadual-concept.Thedual-usesa

Published in: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control

Published in: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control Aalborg Universitet Variable Speech Distortion Weighted Multichannel Wiener Filter based on Soft Output Voice Activity Detection for Noise Reduction in Hearing Aids Ngo, Kim; Spriet, Ann; Moonen, Marc;

More information

Adaptive Feedback Cancellation in Hearing Aids using a Sinusoidal near-end Signal Model 1

Adaptive Feedback Cancellation in Hearing Aids using a Sinusoidal near-end Signal Model 1 Katholieke Universiteit Leuven Departement Elektrotechniek ESAT-SISTA/TR 09-185 Adaptive Feedback Cancellation in Hearing Aids using a Sinusoidal near-end Signal Model 1 Kim Ngo 2, Toon van Waterschoot

More information

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS

A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS 18th European Signal Processing Conference (EUSIPCO-21) Aalborg, Denmark, August 23-27, 21 A COHERENCE-BASED ALGORITHM FOR NOISE REDUCTION IN DUAL-MICROPHONE APPLICATIONS Nima Yousefian, Kostas Kokkinakis

More information

Adaptive beamforming using pipelined transform domain filters

Adaptive beamforming using pipelined transform domain filters Adaptive beamforming using pipelined transform domain filters GEORGE-OTHON GLENTIS Technological Education Institute of Crete, Branch at Chania, Department of Electronics, 3, Romanou Str, Chalepa, 73133

More information

REAL-TIME BROADBAND NOISE REDUCTION

REAL-TIME BROADBAND NOISE REDUCTION REAL-TIME BROADBAND NOISE REDUCTION Robert Hoeldrich and Markus Lorber Institute of Electronic Music Graz Jakoministrasse 3-5, A-8010 Graz, Austria email: robert.hoeldrich@mhsg.ac.at Abstract A real-time

More information

Reducing comb filtering on different musical instruments using time delay estimation

Reducing comb filtering on different musical instruments using time delay estimation Reducing comb filtering on different musical instruments using time delay estimation Alice Clifford and Josh Reiss Queen Mary, University of London alice.clifford@eecs.qmul.ac.uk Abstract Comb filtering

More information

Automotive three-microphone voice activity detector and noise-canceller

Automotive three-microphone voice activity detector and noise-canceller Res. Lett. Inf. Math. Sci., 005, Vol. 7, pp 47-55 47 Available online at http://iims.massey.ac.nz/research/letters/ Automotive three-microphone voice activity detector and noise-canceller Z. QI and T.J.MOIR

More information

A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February :54

A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February :54 A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February 2009 09:54 The main focus of hearing aid research and development has been on the use of hearing aids to improve

More information

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2

MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 MMSE STSA Based Techniques for Single channel Speech Enhancement Application Simit Shah 1, Roma Patel 2 1 Electronics and Communication Department, Parul institute of engineering and technology, Vadodara,

More information

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER

A BINAURAL HEARING AID SPEECH ENHANCEMENT METHOD MAINTAINING SPATIAL AWARENESS FOR THE USER A BINAURAL EARING AID SPEEC ENANCEMENT METOD MAINTAINING SPATIAL AWARENESS FOR TE USER Joachim Thiemann, Menno Müller and Steven van de Par Carl-von-Ossietzky University Oldenburg, Cluster of Excellence

More information

Can binary masks improve intelligibility?

Can binary masks improve intelligibility? Can binary masks improve intelligibility? Mike Brookes (Imperial College London) & Mark Huckvale (University College London) Apparently so... 2 How does it work? 3 Time-frequency grid of local SNR + +

More information

ENERGY-VS-PERFORMANCE TRADE-OFFS IN SPEECH ENHANCEMENT IN WIRELESS ACOUSTIC SENSOR NETWORKS

ENERGY-VS-PERFORMANCE TRADE-OFFS IN SPEECH ENHANCEMENT IN WIRELESS ACOUSTIC SENSOR NETWORKS ENERGY-VS-PERFORMANCE TRADE-OFFS IN SPEECH ENHANCEMENT IN WIRELESS ACOUSTIC SENSOR NETWORKS Fernando de la Hucha Arce 1, Fernando Rosas, Marc Moonen 1, Marian Verhelst, Alexander Bertrand 1 KU Leuven,

More information

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B.

Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya 2, B. Yamuna 2, H. Divya 2, B. Shiva Kumar 2, B. www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue 4 April 2015, Page No. 11143-11147 Speech Enhancement Using Beamforming Dr. G. Ramesh Babu 1, D. Lavanya

More information

Microphone Array Feedback Suppression. for Indoor Room Acoustics

Microphone Array Feedback Suppression. for Indoor Room Acoustics Microphone Array Feedback Suppression for Indoor Room Acoustics by Tanmay Prakash Advisor: Dr. Jeffrey Krolik Department of Electrical and Computer Engineering Duke University 1 Abstract The objective

More information

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter Sana Alaya, Novlène Zoghlami and Zied Lachiri Signal, Image and Information Technology Laboratory National Engineering School

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

AN547 - Why you need high performance, ultra-high SNR MEMS microphones

AN547 - Why you need high performance, ultra-high SNR MEMS microphones AN547 AN547 - Why you need high performance, ultra-high SNR MEMS Table of contents 1 Abstract................................................................................1 2 Signal to Noise Ratio (SNR)..............................................................2

More information

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

HCS 7367 Speech Perception

HCS 7367 Speech Perception HCS 7367 Speech Perception Dr. Peter Assmann Fall 212 Power spectrum model of masking Assumptions: Only frequencies within the passband of the auditory filter contribute to masking. Detection is based

More information

Nonuniform multi level crossing for signal reconstruction

Nonuniform multi level crossing for signal reconstruction 6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven

More information

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor Presented by Amir Kiperwas 1 M-element microphone array One desired source One undesired source Ambient noise field Signals: Broadband Mutually

More information

HUMAN speech is frequently encountered in several

HUMAN speech is frequently encountered in several 1948 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 7, SEPTEMBER 2012 Enhancement of Single-Channel Periodic Signals in the Time-Domain Jesper Rindom Jensen, Student Member,

More information

A generalized framework for binaural spectral subtraction dereverberation

A generalized framework for binaural spectral subtraction dereverberation A generalized framework for binaural spectral subtraction dereverberation Alexandros Tsilfidis, Eleftheria Georganti, John Mourjopoulos Audio and Acoustic Technology Group, Department of Electrical and

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

ACOUSTIC feedback problems may occur in audio systems

ACOUSTIC feedback problems may occur in audio systems IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL 20, NO 9, NOVEMBER 2012 2549 Novel Acoustic Feedback Cancellation Approaches in Hearing Aid Applications Using Probe Noise and Probe Noise

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa

Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Students: Avihay Barazany Royi Levy Supervisor: Kuti Avargel In Association with: Zoran, Haifa Spring 2008 Introduction Problem Formulation Possible Solutions Proposed Algorithm Experimental Results Conclusions

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

IC1004 TD(13)07037 Ilmenau, Germany 2013/May/28-31

IC1004 TD(13)07037 Ilmenau, Germany 2013/May/28-31 EUROPEAN COOPERATION IN THE FIELD OF SCIENTIFIC AND TECHNICAL RESEARCH EURO-COST IC1004 TD(13)07037 Ilmenau, Germany 2013/May/28-31 SOURCE: Departement Elektrotechniek TELEMIC, Katholieke Universiteit

More information

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation

Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation Dual Transfer Function GSC and Application to Joint Noise Reduction and Acoustic Echo Cancellation Gal Reuven Under supervision of Sharon Gannot 1 and Israel Cohen 2 1 School of Engineering, Bar-Ilan University,

More information

IN REVERBERANT and noisy environments, multi-channel

IN REVERBERANT and noisy environments, multi-channel 684 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 11, NO. 6, NOVEMBER 2003 Analysis of Two-Channel Generalized Sidelobe Canceller (GSC) With Post-Filtering Israel Cohen, Senior Member, IEEE Abstract

More information

M any clinicians use prescriptive formulae

M any clinicians use prescriptive formulae J Am Acad Audiol 10 : 458-465 (1999) Variables Affecting the Use of Prescriptive Formulae to Fit Modern Nonlinear Hearing Aids Francis K. Kuk* Carl Ludvigsent Abstract It is routine for audiologists to

More information

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi,

Towards an intelligent binaural spee enhancement system by integrating me signal extraction. Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi, JAIST Reposi https://dspace.j Title Towards an intelligent binaural spee enhancement system by integrating me signal extraction Author(s)Chau, Duc Thanh; Li, Junfeng; Akagi, Citation 2011 International

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Resource allocation in DMT transmitters with per-tone pulse shaping

Resource allocation in DMT transmitters with per-tone pulse shaping Resource allocation in DMT transmitters with per-tone pulse shaping Prabin Pandey, M. Moonen, Luc Deneire To cite this version: Prabin Pandey, M. Moonen, Luc Deneire. Resource allocation in DMT transmitters

More information

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY INTER-NOISE 216 WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY Shumpei SAKAI 1 ; Tetsuro MURAKAMI 2 ; Naoto SAKATA 3 ; Hirohumi NAKAJIMA 4 ; Kazuhiro NAKADAI

More information

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 7, Issue, Ver. I (Mar. - Apr. 7), PP 4-46 e-issn: 9 4, p-issn No. : 9 497 www.iosrjournals.org Speech Enhancement Using Spectral Flatness Measure

More information

Spectral contrast enhancement: Algorithms and comparisons q

Spectral contrast enhancement: Algorithms and comparisons q Speech Communication 39 (2003) 33 46 www.elsevier.com/locate/specom Spectral contrast enhancement: Algorithms and comparisons q Jun Yang a, Fa-Long Luo b, *, Arye Nehorai c a Fortemedia Inc., 20111 Stevens

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information

SELECTIVE TIME-REVERSAL BLOCK SOLUTION TO THE STEREOPHONIC ACOUSTIC ECHO CANCELLATION PROBLEM

SELECTIVE TIME-REVERSAL BLOCK SOLUTION TO THE STEREOPHONIC ACOUSTIC ECHO CANCELLATION PROBLEM 7th European Signal Processing Conference (EUSIPCO 9) Glasgow, Scotland, August 4-8, 9 SELECIVE IME-REVERSAL BLOCK SOLUION O HE SEREOPHONIC ACOUSIC ECHO CANCELLAION PROBLEM Dinh-Quy Nguyen, Woon-Seng Gan,

More information

Factors impacting the speech quality in VoIP scenarios and how to assess them

Factors impacting the speech quality in VoIP scenarios and how to assess them HEAD acoustics Factors impacting the speech quality in Vo scenarios and how to assess them Dr.-Ing. H.W. Gierlich HEAD acoustics GmbH Ebertstraße 30a D-52134 Herzogenrath, Germany Tel: +49 2407/577 0!

More information

Assessment of Dereverberation Algorithms for Large Vocabulary Speech Recognition Systems 1

Assessment of Dereverberation Algorithms for Large Vocabulary Speech Recognition Systems 1 Katholieke Universiteit Leuven Departement Elektrotechniek ESAT-SISTA/TR 23-5 Assessment of Dereverberation Algorithms for Large Vocabulary Speech Recognition Systems 1 Koen Eneman, Jacques Duchateau,

More information

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding. Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement

More information

Introduction to Audio Watermarking Schemes

Introduction to Audio Watermarking Schemes Introduction to Audio Watermarking Schemes N. Lazic and P. Aarabi, Communication over an Acoustic Channel Using Data Hiding Techniques, IEEE Transactions on Multimedia, Vol. 8, No. 5, October 2006 Multimedia

More information

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner.

Perception of pitch. Importance of pitch: 2. mother hemp horse. scold. Definitions. Why is pitch important? AUDL4007: 11 Feb A. Faulkner. Perception of pitch AUDL4007: 11 Feb 2010. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum, 2005 Chapter 7 1 Definitions

More information

The Hybrid Simplified Kalman Filter for Adaptive Feedback Cancellation

The Hybrid Simplified Kalman Filter for Adaptive Feedback Cancellation The Hybrid Simplified Kalman Filter for Adaptive Feedback Cancellation Felix Albu Department of ETEE Valahia University of Targoviste Targoviste, Romania felix.albu@valahia.ro Linh T.T. Tran, Sven Nordholm

More information

HARMONIC INSTABILITY OF DIGITAL SOFT CLIPPING ALGORITHMS

HARMONIC INSTABILITY OF DIGITAL SOFT CLIPPING ALGORITHMS HARMONIC INSTABILITY OF DIGITAL SOFT CLIPPING ALGORITHMS Sean Enderby and Zlatko Baracskai Department of Digital Media Technology Birmingham City University Birmingham, UK ABSTRACT In this paper several

More information

Distortion products and the perceived pitch of harmonic complex tones

Distortion products and the perceived pitch of harmonic complex tones Distortion products and the perceived pitch of harmonic complex tones D. Pressnitzer and R.D. Patterson Centre for the Neural Basis of Hearing, Dept. of Physiology, Downing street, Cambridge CB2 3EG, U.K.

More information

Digitally controlled Active Noise Reduction with integrated Speech Communication

Digitally controlled Active Noise Reduction with integrated Speech Communication Digitally controlled Active Noise Reduction with integrated Speech Communication Herman J.M. Steeneken and Jan Verhave TNO Human Factors, Soesterberg, The Netherlands herman@steeneken.com ABSTRACT Active

More information

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,

More information

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio

Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio >Bitzer and Rademacher (Paper Nr. 21)< 1 Detection, Interpolation and Cancellation Algorithms for GSM burst Removal for Forensic Audio Joerg Bitzer and Jan Rademacher Abstract One increasing problem for

More information

Comparison of adaptive noise reduction algorithms in dual microphone hearing aids q

Comparison of adaptive noise reduction algorithms in dual microphone hearing aids q Speech Communication 48 (2006) 957 970 www.elsevier.com/locate/specom Comparison of adaptive noise reduction algorithms in dual microphone hearing aids q Jean-Baptiste Maj a,b, Liesbeth Royackers a, Jan

More information

Factors Governing the Intelligibility of Speech Sounds

Factors Governing the Intelligibility of Speech Sounds HSR Journal Club JASA, vol(19) No(1), Jan 1947 Factors Governing the Intelligibility of Speech Sounds N. R. French and J. C. Steinberg 1. Introduction Goal: Determine a quantitative relationship between

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD

STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD STUDIES OF EPIDAURUS WITH A HYBRID ROOM ACOUSTICS MODELLING METHOD Tapio Lokki (1), Alex Southern (1), Samuel Siltanen (1), Lauri Savioja (1), 1) Aalto University School of Science, Dept. of Media Technology,

More information

THE problem of acoustic echo cancellation (AEC) was

THE problem of acoustic echo cancellation (AEC) was IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 6, NOVEMBER 2005 1231 Acoustic Echo Cancellation and Doubletalk Detection Using Estimated Loudspeaker Impulse Responses Per Åhgren Abstract

More information

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS

SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS 17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 SPECTRAL COMBINING FOR MICROPHONE DIVERSITY SYSTEMS Jürgen Freudenberger, Sebastian Stenzel, Benjamin Venditti

More information

Stefan Launer, Lyon, January 2011 Phonak AG, Stäfa, CH

Stefan Launer, Lyon, January 2011 Phonak AG, Stäfa, CH State of art and Challenges in Improving Speech Intelligibility in Hearing Impaired People Stefan Launer, Lyon, January 2011 Phonak AG, Stäfa, CH Content Phonak Stefan Launer, Speech in Noise Workshop,

More information

Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves

Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves Tadashi MATSUI 1 ; Daisuke IKEFUJI 1 ; Masato NAKAYAMA 2 ;Takanobu NISHIURA 2 1 Graduate School of Information Science

More information

LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function

LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function IEICE TRANS. INF. & SYST., VOL.E97 D, NO.9 SEPTEMBER 2014 2533 LETTER Pre-Filtering Algorithm for Dual-Microphone Generalized Sidelobe Canceller Using General Transfer Function Jinsoo PARK, Wooil KIM,

More information

Live multi-track audio recording

Live multi-track audio recording Live multi-track audio recording Joao Luiz Azevedo de Carvalho EE522 Project - Spring 2007 - University of Southern California Abstract In live multi-track audio recording, each microphone perceives sound

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 5: 12 Feb 2009. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence

More information

Robust Low-Resource Sound Localization in Correlated Noise

Robust Low-Resource Sound Localization in Correlated Noise INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem

More information

Technical features For internal use only / For internal use only Copy / right Copy Sieme A All rights re 06. All rights re se v r ed.

Technical features For internal use only / For internal use only Copy / right Copy Sieme A All rights re 06. All rights re se v r ed. For internal use only / Copyright Siemens AG 2006. All rights reserved. Contents Technical features Wind noise reduction 3 Automatic microphone system 9 Directional microphone system 15 Feedback cancellation

More information

Effect of fast-acting compression on modulation detection interference for normal hearing and hearing impaired listeners

Effect of fast-acting compression on modulation detection interference for normal hearing and hearing impaired listeners Effect of fast-acting compression on modulation detection interference for normal hearing and hearing impaired listeners Yi Shen a and Jennifer J. Lentz Department of Speech and Hearing Sciences, Indiana

More information

Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems

Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems INTERSPEECH 2015 Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems Hyeonjoo Kang 1, JeeSo Lee 1, Soonho Bae 2, and Hong-Goo Kang 1 1 Dept. of

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Waves C360 SurroundComp. Software Audio Processor. User s Guide

Waves C360 SurroundComp. Software Audio Processor. User s Guide Waves C360 SurroundComp Software Audio Processor User s Guide Waves C360 software guide page 1 of 10 Introduction and Overview Introducing Waves C360, a Surround Soft Knee Compressor for 5 or 5.1 channels.

More information

DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM. Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W.

DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM. Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W. DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM Jun Yang (IEEE Senior Member), Philip Hilmes, Brian Adair, David W. Krueger Amazon Lab126, Sunnyvale, CA 94089, USA Email: {junyang, philmes,

More information

RECENTLY, there has been an increasing interest in noisy

RECENTLY, there has been an increasing interest in noisy IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In

More information

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation Takahiro FUKUMORI ; Makoto HAYAKAWA ; Masato NAKAYAMA 2 ; Takanobu NISHIURA 2 ; Yoichi YAMASHITA 2 Graduate

More information

x ( Primary Path d( P (z) - e ( y ( Adaptive Filter W (z) y( S (z) Figure 1 Spectrum of motorcycle noise at 40 mph. modeling of the secondary path to

x ( Primary Path d( P (z) - e ( y ( Adaptive Filter W (z) y( S (z) Figure 1 Spectrum of motorcycle noise at 40 mph. modeling of the secondary path to Active Noise Control for Motorcycle Helmets Kishan P. Raghunathan and Sen M. Kuo Department of Electrical Engineering Northern Illinois University DeKalb, IL, USA Woon S. Gan School of Electrical and Electronic

More information

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,

More information

Speech Enhancement Techniques using Wiener Filter and Subspace Filter

Speech Enhancement Techniques using Wiener Filter and Subspace Filter IJSTE - International Journal of Science Technology & Engineering Volume 3 Issue 05 November 2016 ISSN (online): 2349-784X Speech Enhancement Techniques using Wiener Filter and Subspace Filter Ankeeta

More information

Multiple Sound Sources Localization Using Energetic Analysis Method

Multiple Sound Sources Localization Using Energetic Analysis Method VOL.3, NO.4, DECEMBER 1 Multiple Sound Sources Localization Using Energetic Analysis Method Hasan Khaddour, Jiří Schimmel Department of Telecommunications FEEC, Brno University of Technology Purkyňova

More information

Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking

Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic Masking The 7th International Conference on Signal Processing Applications & Technology, Boston MA, pp. 476-480, 7-10 October 1996. Encoding a Hidden Digital Signature onto an Audio Signal Using Psychoacoustic

More information

Speech & Audio Processing / Part-II. Digital Audio Signal Processing DASP. Marc Moonen

Speech & Audio Processing / Part-II. Digital Audio Signal Processing DASP. Marc Moonen Speech & Audio Processing / Part-II Digital Audio Signal Processing DASP Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be homes.esat.kuleuven.be/~moonen/ Overview Aims/scope

More information

Terminology (1) Chapter 3. Terminology (3) Terminology (2) Transmitter Receiver Medium. Data Transmission. Direct link. Point-to-point.

Terminology (1) Chapter 3. Terminology (3) Terminology (2) Transmitter Receiver Medium. Data Transmission. Direct link. Point-to-point. Terminology (1) Chapter 3 Data Transmission Transmitter Receiver Medium Guided medium e.g. twisted pair, optical fiber Unguided medium e.g. air, water, vacuum Spring 2012 03-1 Spring 2012 03-2 Terminology

More information

Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation

Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation Dominant Voiced Speech Segregation Using Onset Offset Detection and IBM Based Segmentation Shibani.H 1, Lekshmi M S 2 M. Tech Student, Ilahia college of Engineering and Technology, Muvattupuzha, Kerala,

More information

MULTICHANNEL systems are often used for

MULTICHANNEL systems are often used for IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 52, NO. 5, MAY 2004 1149 Multichannel Post-Filtering in Nonstationary Noise Environments Israel Cohen, Senior Member, IEEE Abstract In this paper, we present

More information

Modulation Domain Spectral Subtraction for Speech Enhancement

Modulation Domain Spectral Subtraction for Speech Enhancement Modulation Domain Spectral Subtraction for Speech Enhancement Author Paliwal, Kuldip, Schwerin, Belinda, Wojcicki, Kamil Published 9 Conference Title Proceedings of Interspeech 9 Copyright Statement 9

More information

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner.

Perception of pitch. Definitions. Why is pitch important? BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb A. Faulkner. Perception of pitch BSc Audiology/MSc SHS Psychoacoustics wk 4: 7 Feb 2008. A. Faulkner. See Moore, BCJ Introduction to the Psychology of Hearing, Chapter 5. Or Plack CJ The Sense of Hearing Lawrence Erlbaum,

More information

Speaker Localization in Noisy Environments Using Steered Response Voice Power

Speaker Localization in Noisy Environments Using Steered Response Voice Power 112 IEEE Transactions on Consumer Electronics, Vol. 61, No. 1, February 2015 Speaker Localization in Noisy Environments Using Steered Response Voice Power Hyeontaek Lim, In-Chul Yoo, Youngkyu Cho, and

More information

Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays

Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 22, NO. 7, JULY 2014 1195 Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays Maja Taseska, Student

More information

Audio Fingerprinting using Fractional Fourier Transform

Audio Fingerprinting using Fractional Fourier Transform Audio Fingerprinting using Fractional Fourier Transform Swati V. Sutar 1, D. G. Bhalke 2 1 (Department of Electronics & Telecommunication, JSPM s RSCOE college of Engineering Pune, India) 2 (Department,

More information

Recurrent Timing Neural Networks for Joint F0-Localisation Estimation

Recurrent Timing Neural Networks for Joint F0-Localisation Estimation Recurrent Timing Neural Networks for Joint F0-Localisation Estimation Stuart N. Wrigley and Guy J. Brown Department of Computer Science, University of Sheffield Regent Court, 211 Portobello Street, Sheffield

More information

Binaural Hearing. Reading: Yost Ch. 12

Binaural Hearing. Reading: Yost Ch. 12 Binaural Hearing Reading: Yost Ch. 12 Binaural Advantages Sounds in our environment are usually complex, and occur either simultaneously or close together in time. Studies have shown that the ability to

More information

A Research Concept on Bit Rate Detection using Carrier offset through Analysis of MC-CDMA SYSTEM

A Research Concept on Bit Rate Detection using Carrier offset through Analysis of MC-CDMA SYSTEM Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,

More information

Measuring procedures for the environmental parameters: Acoustic comfort

Measuring procedures for the environmental parameters: Acoustic comfort Measuring procedures for the environmental parameters: Acoustic comfort Abstract Measuring procedures for selected environmental parameters related to acoustic comfort are shown here. All protocols are

More information

Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms

Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms Philipos C. Loizou a) Department of Electrical Engineering University of Texas at Dallas

More information

HIGH ORDER MODULATION SHAPED TO WORK WITH RADIO IMPERFECTIONS

HIGH ORDER MODULATION SHAPED TO WORK WITH RADIO IMPERFECTIONS HIGH ORDER MODULATION SHAPED TO WORK WITH RADIO IMPERFECTIONS Karl Martin Gjertsen 1 Nera Networks AS, P.O. Box 79 N-52 Bergen, Norway ABSTRACT A novel layout of constellations has been conceived, promising

More information

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds

More information

SELECTIVE NOISE FILTERING OF SPEECH SIGNALS USING AN ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM AS A FREQUENCY PRE-CLASSIFIER

SELECTIVE NOISE FILTERING OF SPEECH SIGNALS USING AN ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM AS A FREQUENCY PRE-CLASSIFIER SELECTIVE NOISE FILTERING OF SPEECH SIGNALS USING AN ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM AS A FREQUENCY PRE-CLASSIFIER SACHIN LAKRA 1, T. V. PRASAD 2, G. RAMAKRISHNA 3 1 Research Scholar, Computer Sc.

More information

Digital Signal Processing of Speech for the Hearing Impaired

Digital Signal Processing of Speech for the Hearing Impaired Digital Signal Processing of Speech for the Hearing Impaired N. Magotra, F. Livingston, S. Savadatti, S. Kamath Texas Instruments Incorporated 12203 Southwest Freeway Stafford TX 77477 Abstract This paper

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

III. Publication III. c 2005 Toni Hirvonen.

III. Publication III. c 2005 Toni Hirvonen. III Publication III Hirvonen, T., Segregation of Two Simultaneously Arriving Narrowband Noise Signals as a Function of Spatial and Frequency Separation, in Proceedings of th International Conference on

More information

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling

Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Two-channel Separation of Speech Using Direction-of-arrival Estimation And Sinusoids Plus Transients Modeling Mikko Parviainen 1 and Tuomas Virtanen 2 Institute of Signal Processing Tampere University

More information

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS

THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS PACS Reference: 43.66.Pn THE PERCEPTION OF ALL-PASS COMPONENTS IN TRANSFER FUNCTIONS Pauli Minnaar; Jan Plogsties; Søren Krarup Olesen; Flemming Christensen; Henrik Møller Department of Acoustics Aalborg

More information