ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY

Size: px
Start display at page:

Download "ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY"

Transcription

1 ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY D. Nagajyothi 1 and P. Siddaiah 2 1 Department of Electronics and Communication Engineering, Vardhaman College of Engineering, Shamshabad, Telangana, India 2 Department of Electrical and Computer Engineering, University College of Engineering and Technology, Acharya Nagarjuna University, Guntur, India nagajyothi1998@gmail.com ABSTRACT WTIMIT, which is a derivative of TIMIT emerged as a latest technique for speech quality. The technique has good wideband characteristics over a range of 50-7 KHz. in this paper, a study on the performance of phoneme recognition system has been performed. The study includes the effect of decimating the signal to 8 KHz in the conventional case. Further it is possible to evaluate the AMR-wideband codec for several acoustic models. It is possible to propose the WTIMIT type of wideband channel data from training interactive voice receiving system. Keywords: speech codecs, IVR, AMR-Wb, TIMIT. 1. INTRODUCTION The Typical Bandwidth of Speech is less than 4 KHz for applications in telephonic operations. This is often termed as narrow band. The IVR operates at a sampling rate of 8 KHz for Operation [1-3]. Citing this, the advanced speech service system expanded their BW to wideband (WB) frequency range of KHz. The influence of a conventional telephony network on the N- TIMIT is used to evaluate the performance features of recognition system in traditional telephony. The Phoneme error rate (PER) suppressed by a huge extent due to direct WB speech. Similar in NB case, 23% relative PER degradations is identified. It is also reported that there is an evidence of the impact of a WB mobile network using the WTIMIT corpus. An enhancement of 19% PER with respect to direct WB speech is observed while there is a suppressing 3% PER relative to narrow band. In spite of these efforts it is to note that investigation pertaining to effects of telephony network are to be validated. This is more desirous in IVR based Telephony system. The development of DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus paved a way for evaluating automatic speech recognition (ASR) systems [4]. It constitutes wideband speech recordings which are sampled at 16 KHz. They typically containing in the rate of 50 Hz to 7 khz with respect to 630 native speakers. This is with reference to 8 major regions in the US. For training ten phonetically rich sentences are collected from every speech. In every utterance several features are extracted along with speech waveform, time aligned orthographic, phonetic and word transcriptions are taken. With reference to these efforts as of now there are five TIMIT derivatives namely FFMTIMIT, NTIMIT, CTIMIT, HTIMIT and STC-TIMIT. The FFMTIMIT can be abbreviated as Free Field Microphone TIMIT typical composed of natural TIMIT database. It typically uses a free field device for recording. NTIMIT (Network TIMIT) is adjunct to TIMIT with database constituting the speech wave form [4]. Over a telephone handset, Similarly CTIMIT constitutes of the original TIMIT recordings were passed through cellular telephone circuits. However, in the case of HTIMIT (Handset TIMIT) the data base consists of two subset with 192 male and 192 female speakers [5]. The corresponding speech signals are those which are transmitted through different telephone handsets. This typically helps in the investigation of telephone transducer effects on speech. For STCTIMIT which is single channel, the speech signals were sent through a real and, in contrast to NTIMIT, all these can be turned as the derivation of wideband speech [6-11]. While some are telephony are containing narrowband speech. The sampling is at the rate of 8 khz with a range of 200 Hz to 3.4 khz. Inspite of all these it is to be noted that there is no availability of real world wideband telephony speech corpus. Several versions of wideband speech codes like G.722 (1988), G.722 (1999) G (2001) and G (2008) have been into operation with several techniques like ADPCM, 3GPP [6] and wide band PCM. It is interesting to note that the wide band telephony speech transmission system is wide available and adaptable. In contrast to ever increasing mobile networks citing this, it its essential to have wideband system in the TIMIT for a wide range of scientific investigations. There are several advantages and applications associated with WBSTS. The integrated speech recognition system provides remote dictation or spelling. This was not a possible case with the earlier telephony system. In this paper, an investigation on the performance of the speech CODECS in terms of Bit rates is performed. The analysis is based experimentation carried out in MATLAB on windows platform in an i3 with 4 GB RAM. Further, the paper is organized as follows. A brief discussion on the standards and the corresponding bit rates of several speech codecs are presented in section 2. Results pertaining to synthesis followed by analysis of speech codecs is given section 3. Overall conclusion is given in section SPEECH CODECS In this Section, a brief introduction to the speech codecs is given. The aim of the speech codec is to compress the speech signal in order to reduce the bandwidth and requires minimum storage space. When we will reconstruct, it must be very close to original one. 1386

2 Based on intelligibility and naturalness we will measure perceived quality of the signal. Here we have considered two types of the networks GSM and VoIP. Table-1 and Table-2 shows the specification summary of the all the supported narrowband wideband codecs. Table-1. ITU-T approved VoIP supported narrowband and wideband speech codecs. Coding standard Algorithm Sampling frequency (khz) Bit rates (kbps) G.711 (A /U) Companded PCM 8 64 G.726 ADPCM 8 16/24/ 32/40 G.729 CS-ACELP 8 8 G.723.1A ACELP / MP-MLQ / 6.3 G.722 (WB) SB-ADPCM 16 48, 56, 64 G (WB) Companded PCM, MDCT 16 64, 80, 96 G (WB) CELP, TD-BWE, TDAC G (AMR-WB) (Multi Rate) MRWB- ACELP , 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, Table-2. ETSI/3GPP approved GSM supported narrowband and wideband speech codecs. Coding standard Algorithm Sampling frequency (khz) Bit rates (kbps) GSM FR RPE-LTP 8 13 GSMEFR ACELP GSM HR VSELP (Multi Rate) MR- 4.75, 5.15, 5.90, 6.70, 7.40, GSM AMR 8 ACELP 7.95, 10.2, , 8.85, 12.65, 14.25, 15.85, GSM AMR-WB MRWB-ACELP , 19.85, 23.05, RESULTS AND DISCUSSIONS Results pertaining to the technique and proposed method are presented in this Section. Testing of the Coded Data with G.711-Coded Models (8-kHz HMMs). The 8- khz un-coded and coded speech data that is coded with all other wireline codecs, such as G.711, G.726 and G.729, is tested with the G.711-coded models (8-kHz trained HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported in the following tables. Case-1: Testing the wireline coded data (NB Codecs) with G.711-coded trained models The corresponding comparative analysis based on ASR accuracy with respect to G.711, G.726, G.729 and Un-coded are as shown in Figure-1. It is evident that the G.726 and G.711 have reported high accuracy coefficient. Table-3. Results of testing the wireline coded data (NB Codecs) with G.711-coded trained models (8kHzHMMs. Coded data used in testing Un-coded G G G

3 Figure-1. Graphic results of testing the wireline coded data (NB Codecs) with G.711-coded trained models (8kHzHMMs). Case-2: Testing of the coded data with G.729-coded models (8-kHz HMMs) is coded with all other wireline codecs, such as G.711, G.726 and G.729, is tested with the G.729-coded models (8-kHz HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireline coded data (NB codecs) with G.729-coded trained models (8kHzHMMs). Table-4. Testing of the coded data with G.729-coded models (8-kHz HMMs). Coded data used in testing Un-coded G G G It can be inferred from the comparative results shown in Figure.2 for G.711, G.726, G.729 and Un-coded for testing the wireline Codecs with 8 KHz G.729 coded HMMs, that the impact of the respective coding is minimal. Figure-2. Graphic results of testing the wireline coded data (NB codecs) with G.729-coded trained models (8kHzHMMs). Case-3: Testing of the coded data with HR-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, is tested with the HR-coded models (8-kHz HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireless coded data (NB Codecs) with HR-coded trained models (8kHzHMMs. Table-5. Testing of the coded data with HR-coded models (8-kHz HMMs). Coded data used in testing Un-coded FR EFR HR

4 when compared as shown in Figure-3. HR reported minimal, however almost similar to EFR for CD-8gau. Figure-3. Graphic results of testing the wireless coded data (NB Codecs) with HR-coded trained models (8kHzHMMs). FR reported to be having high accuracy coefficient when compared with EFR, HR and Un-coded Case-4: Testing of the coded data with AMR4.75-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, and wireline codecs, such as G.711, G.726 and G.729 are tested with the coded models (8-kHz HMMs) for the CI, and CD-tied triphone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs) Table-6. Testing of the coded data with AMR4.75-coded models (8-kHz HMMs). Coded data used in testing Un-coded FR EFR HR AMR@ AMR@ G G G When compared with respect to accuracy as shown in Figure-4, it is clearly evident that the FR expressed high accuracy while Un-coaded produced poor results while testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs). Figure-4. Graphic results of testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs). Case-5: Testing of the AMR coded data with AMR12.2-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, is tested with the AMR@12.2kbps coded models for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported in the following table. 1389

5 Table-7. Results of testing the AMR coded data (NB codecs) with trained models (8kHzHMMs). Coded data used in testing Un-coded FR EFR HR models. The ASR performance is almost same when tested with either 16-kHz coded models or 8-kHz un-coded models. The ASR performance is poor when tested with 16-kHz un-coded models. REFERENCES [1] X. Huang, A. Acero and H. W. Hon Spoken Language Processing: A Guide to Theory, Algorithm and System Development, Prentice Hall. Figure-5. Graphic results of testing the wireless coded data (NB Codecs) with AMR@12.2-coded trained models (8kHzHMMs). While comparing during the testing the wireless coded data (NB Codecs) with AMR@12.2-coded trained models (8kHzHMMs) in connection with the previous analysis in Case-4, the respective FR produced accuracy at high degree with respect to EFR, HR, AMR@4.75 and AMR@12.2 also including the un-coded. 4. CONCLUSIONS ASR accuracies accuracy for un-coded and coded data when tested with the different coded models that include 8-kHz and 16-kHz HMMs. The major observations made are as follows. The ASR accuracy always increases with 8-kHz coded trained models when compared to 8-kHz un-coded models for all the narrowband codecs. The ASR accuracy of coded data of any particular codec increases by at least 2% when the same type of coded models is used. The ASR results for coded data for specific codecs, such as G.711, G.729, HR and AMR12.2 for un-coded and respective coded models, are re-organized to see the ASR improvements. Coded data (G.711/G.729/HR/AMR12.2) tested with 8-kHz uncoded HMMs while the Coded data (G.711/G.729/HR/AMR12.2) tested with 16-kHz uncoded HMMs. Similarly, the Coded data (G.711/G.729/HR/AMR12.2) tested with 8-kHz Coded (G.711/G.729/HR/AMR12.2) HMMs whereas the Coded data (G.711/G.729/HR/AMR12.2) tested with 16-kHz Coded (G.711/G.729/HR/AMR12.2) HMMs. All these codecs perform well for the respective 8-kHz coded [2] K. W. Church and R. L. Mercer Introduction to the Special Issue on Computational Linguistics Using Large Corpora. Computational Linguistics. 19(1): [3] Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms. ETSI. [4] J. S. Garofolo et al TIMIT Acoustic-Phonetic Continuous Speech Corpus. Linguistic Data Consortium, Philadelphia, USA. [5] J. S. Garofolo et al FFMTIMIT. Linguistic Data Consortium, Philadelphia, USA. [6] 3GPP Mandatory Speech Codec Speech Processing Functions: AMR Speech Codec; Transcoding Functions (3G TS ). [7] C. Jankowski et al NTIMIT: A Phonetically Balanced, Continuous Speech, Telephone Bandwidth Speech Database. In Proc. of ICASSP, pp [8] K.-F. Lee and H.-W. Hon Speaker-Independent Phone Recognition Using Hidden Markov Models. IEEE Transactions on Acoustics, Speech and Signal Processing. 37(11):

6 [9] N. Morales et al STC-TIMIT: Generation of a Single-channel Telephone Corpus. In Proc. of LREC. pp [10] D. A. Reynolds HTIMIT and LLHDB: Speech Corpora for the Study of Handset Transducer Effects. In Proc. of ICASSP. 2: [11] P. Bauer and T. Fingscheidt WTIMIT 1.0. Linguistic Data Consortium, Philadelphia. 1391

International Journal of Computer Engineering and Applications, Volume XI, Issue XII, Dec. 17, ISSN

International Journal of Computer Engineering and Applications, Volume XI, Issue XII, Dec. 17,   ISSN SPEECH-ENABLED IVR USING ARTIFICIAL BANDWIDTH EXTENSION TECHNIQUE Mohan Dholvan 1, Dr. Anitha Sheela Kancharla 2 1 Department of Electronics and Computer Engineering, SNIST, Hyderabad, Telangana, India

More information

Automatic Speech Recognition (ASR) Over VoIP and Wireless Networks

Automatic Speech Recognition (ASR) Over VoIP and Wireless Networks Final Report of the UGC Sponsored Major Research Project on Automatic Speech Recognition (ASR) Over VoIP and Wireless Networks UGC Sanction Letter: 41-600/2012 (SR) Dated 18th July 2012 by Prof.P.Laxminarayana

More information

Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation

Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation Johannes Abel and Tim Fingscheidt Institute

More information

Wideband Speech Coding & Its Application

Wideband Speech Coding & Its Application Wideband Speech Coding & Its Application Apeksha B. landge. M.E. [student] Aditya Engineering College Beed Prof. Amir Lodhi. Guide & HOD, Aditya Engineering College Beed ABSTRACT: Increasing the bandwidth

More information

Transcoding free voice transmission in GSM and UMTS networks

Transcoding free voice transmission in GSM and UMTS networks Transcoding free voice transmission in GSM and UMTS networks Sara Stančin, Grega Jakus, Sašo Tomažič University of Ljubljana, Faculty of Electrical Engineering Abstract - Transcoding refers to the conversion

More information

Technical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing

Technical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing Technical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing 2 Reference DTR/STQ-00196m Keywords QoS, quality, speech 650 Route des Lucioles F-06921

More information

3GPP TS V5.0.0 ( )

3GPP TS V5.0.0 ( ) TS 26.171 V5.0.0 (2001-03) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Speech Codec speech processing functions; AMR Wideband

More information

Acoustics of wideband terminals: a 3GPP perspective

Acoustics of wideband terminals: a 3GPP perspective Acoustics of wideband terminals: a 3GPP perspective Orange Labs Stéphane RAGOT Orange Delegate in 3GPP & 3GPP SA4 Vice-Chair Co-Rapporteur of 3GPP work item on "Requirements and Test Methods for Wideband

More information

Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G Codec

Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G Codec Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G.722.2 Codec Fatiha Merazka Telecommunications Department USTHB, University of science & technology Houari Boumediene P.O.Box 32 El Alia 6 Bab

More information

Using RASTA in task independent TANDEM feature extraction

Using RASTA in task independent TANDEM feature extraction R E S E A R C H R E P O R T I D I A P Using RASTA in task independent TANDEM feature extraction Guillermo Aradilla a John Dines a Sunil Sivadas a b IDIAP RR 04-22 April 2004 D a l l e M o l l e I n s t

More information

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com

More information

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Václav Eksler, Bruno Bessette, Milan Jelínek, Tommy Vaillancourt University of Sherbrooke, VoiceAge Corporation Montreal, QC,

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Signal Processing in Acoustics Session 2pSP: Acoustic Signal Processing

More information

Impact of the GSM AMR Speech Codec on Formant Information Important to Forensic Speaker Identification

Impact of the GSM AMR Speech Codec on Formant Information Important to Forensic Speaker Identification PAGE 483 Impact of the GSM AMR Speech Codec on Formant Information Important to Forensic Speaker Identification Bernard J Guillemin, Catherine I Watson Department of Electrical & Computer Engineering The

More information

COM 12 C 288 E October 2011 English only Original: English

COM 12 C 288 E October 2011 English only Original: English Question(s): 9/12 Source: Title: INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2009-2012 Audience STUDY GROUP 12 CONTRIBUTION 288 P.ONRA Contribution Additional

More information

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,

More information

Technical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3

Technical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3 TSGS#7(00)0028 Technical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3 Source: TSG-S4 Title: AMR Wideband Permanent project document WB-4:

More information

TECHNICAL REPORT Speech and multimedia Transmission Quality (STQ); Speech samples and their use for QoS testing

TECHNICAL REPORT Speech and multimedia Transmission Quality (STQ); Speech samples and their use for QoS testing TR 103 138 V1.3.1 (2015-03) TECHNICAL REPORT Speech and multimedia Transmission Quality (STQ); Speech samples and their use for QoS testing 2 TR 103 138 V1.3.1 (2015-03) Reference RTR/STQ-00203m Keywords

More information

Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions

Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions INTERSPEECH 01 Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions Hannu Pulakka 1, Ville Myllylä 1, Anssi Rämö, and Paavo Alku 1 Microsoft

More information

Ninad Bhatt Yogeshwar Kosta

Ninad Bhatt Yogeshwar Kosta DOI 10.1007/s10772-012-9178-9 Implementation of variable bitrate data hiding techniques on standard and proposed GSM 06.10 full rate coder and its overall comparative evaluation of performance Ninad Bhatt

More information

Bandwidth Extension for Speech Enhancement

Bandwidth Extension for Speech Enhancement Bandwidth Extension for Speech Enhancement F. Mustiere, M. Bouchard, M. Bolic University of Ottawa Tuesday, May 4 th 2010 CCECE 2010: Signal and Multimedia Processing 1 2 3 4 Current Topic 1 2 3 4 Context

More information

Practical Limitations of Wideband Terminals

Practical Limitations of Wideband Terminals Practical Limitations of Wideband Terminals Dr.-Ing. Carsten Sydow Siemens AG ICM CP RD VD1 Grillparzerstr. 12a 8167 Munich, Germany E-Mail: sydow@siemens.com Workshop on Wideband Speech Quality in Terminals

More information

International Journal of Advanced Engineering Technology E-ISSN

International Journal of Advanced Engineering Technology E-ISSN Research Article ARCHITECTURAL STUDY, IMPLEMENTATION AND OBJECTIVE EVALUATION OF CODE EXCITED LINEAR PREDICTION BASED GSM AMR 06.90 SPEECH CODER USING MATLAB Bhatt Ninad S. 1 *, Kosta Yogesh P. 2 Address

More information

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile 8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques

More information

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

22. Konferenz Elektronische Sprachsignalverarbeitung (ESSV), September 2011, Aachen, Germany (TuDPress, ISBN )

22. Konferenz Elektronische Sprachsignalverarbeitung (ESSV), September 2011, Aachen, Germany (TuDPress, ISBN ) BINAURAL WIDEBAND TELEPHONY USING STEGANOGRAPHY Bernd Geiser, Magnus Schäfer, and Peter Vary Institute of Communication Systems and Data Processing ( ) RWTH Aachen University, Germany {geiser schaefer

More information

CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT

CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT 7.1 INTRODUCTION Originally developed to be used in GSM by the Europe Telecommunications Standards Institute (ETSI), the AMR speech codec

More information

The Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market

The Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market 5 th Nov, 2008 The Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market PN101 Roger Chung of Freescale Semiconductor, Inc. All other product or service names are the property

More information

IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM

IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM Jinyu Li, Dong Yu, Jui-Ting Huang, and Yifan Gong Microsoft Corporation, One Microsoft Way, Redmond, WA 98052 ABSTRACT

More information

An audio watermark-based speech bandwidth extension method

An audio watermark-based speech bandwidth extension method Chen et al. EURASIP Journal on Audio, Speech, and Music Processing 2013, 2013:10 RESEARCH Open Access An audio watermark-based speech bandwidth extension method Zhe Chen, Chengyong Zhao, Guosheng Geng

More information

Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding?

Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding? WIDEBAND SPEECH CODING STANDARDS AND WIRELESS SERVICES Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding? Peter Jax and Peter Vary, RWTH Aachen University

More information

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec Akira Nishimura 1 1 Department of Media and Cultural Studies, Tokyo University of Information Sciences,

More information

Improving Sound Quality by Bandwidth Extension

Improving Sound Quality by Bandwidth Extension International Journal of Scientific & Engineering Research, Volume 3, Issue 9, September-212 Improving Sound Quality by Bandwidth Extension M. Pradeepa, M.Tech, Assistant Professor Abstract - In recent

More information

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES N. Sunil 1, K. Sahithya Reddy 2, U.N.D.L.mounika 3 1 ECE, Gurunanak Institute of Technology, (India) 2 ECE,

More information

Voice Coding, PCM Voice, Voice Quality, E-model

Voice Coding, PCM Voice, Voice Quality, E-model Voice Coding, PCM Voice, Voice Quality, E-model PCM ~ Pulse Code Modulation Sampling Quantizing Linear Non-linear Quantizing error PCM frame structure Other Voice coding algorithms E-model, Voice quality

More information

ETSI TS V ( )

ETSI TS V ( ) TS 126 171 V14.0.0 (2017-04) TECHNICAL SPECIFICATION Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Speech codec speech processing

More information

Enabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends

Enabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends Distributed Speech Recognition Enabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends David Pearce & Chairman

More information

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients

Enhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds

More information

Single Channel Speaker Segregation using Sinusoidal Residual Modeling

Single Channel Speaker Segregation using Sinusoidal Residual Modeling NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology

More information

EFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS. Pramod Bachhav, Massimiliano Todisco and Nicholas Evans

EFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS. Pramod Bachhav, Massimiliano Todisco and Nicholas Evans EFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS Pramod Bachhav, Massimiliano Todisco and Nicholas Evans EURECOM, Sophia Antipolis, France {bachhav,todisco,evans}@eurecom.fr

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Perceptual wideband speech and audio quality measurement. Dr Antony Rix Psytechnics Limited

Perceptual wideband speech and audio quality measurement. Dr Antony Rix Psytechnics Limited Perceptual wideband speech and audio quality measurement Dr Antony Rix Psytechnics Limited Agenda Background Perceptual models BS.1387 PEAQ P.862 PESQ Scope Extension to wideband Performance of wideband

More information

Subjective Voice Quality Evaluation of Artificial Bandwidth Extension: Comparing Different Audio Bandwidths and Speech Codecs

Subjective Voice Quality Evaluation of Artificial Bandwidth Extension: Comparing Different Audio Bandwidths and Speech Codecs INTERSPEECH 01 Subjective Voice Quality Evaluation of Artificial Bandwidth Extension: Comparing Different Audio Bandwidths and Speech Codecs Hannu Pulakka 1, Anssi Rämö, Ville Myllylä 1, Henri Toukomaa,

More information

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech

More information

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat Audio Transmission Technology for Multi-point Mobile Voice Chat Voice Chat Multi-channel Coding Binaural Signal Processing Audio Transmission Technology for Multi-point Mobile Voice Chat We have developed

More information

ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC.

ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC. ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC Jérémie Lecomte, Adrian Tomasek, Goran Marković, Michael Schnabel, Kimitaka Tsutsumi, Kei Kikuiri Fraunhofer IIS, Erlangen, Germany,

More information

Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008

Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008 Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems Speech Communication Channels in a Vehicle 2 Into the vehicle Within the vehicle Out of the vehicle Speech

More information

SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS

SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS 1 WAHYU KUSUMA R., 2 PRINCE BRAVE GUHYAPATI V 1 Computer Laboratory Staff., Department of Information Systems, Gunadarma University,

More information

Quality comparison of wideband coders including tandeming and transcoding

Quality comparison of wideband coders including tandeming and transcoding ETSI Workshop on Speech and Noise In Wideband Communication, 22nd and 23rd May 2007 - Sophia Antipolis, France Quality comparison of wideband coders including tandeming and transcoding Catherine Quinquis

More information

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods for objective and subjective assessment of quality

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods for objective and subjective assessment of quality International Telecommunication Union ITU-T TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU P.862.3 (11/2007) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods

More information

Deriving Equipment Impairment Factors for Wideband Speech Codecs

Deriving Equipment Impairment Factors for Wideband Speech Codecs Deriving Equipment Impairment Factors for Wideband Speech Codecs Sebastian Möller 1, Alexander Raake 1, Vincent Barriac 2, Catherine Quinquis 2 1 IKA, Ruhr-University Bochum, Germany 2 France Télécom R&D,

More information

Lesson 8 Speech coding

Lesson 8 Speech coding Lesson 8 coding Encoding Information Transmitter Antenna Interleaving Among Frames De-Interleaving Antenna Transmission Line Decoding Transmission Line Receiver Information Lesson 8 Outline How information

More information

TELECOMMUNICATION SYSTEMS

TELECOMMUNICATION SYSTEMS TELECOMMUNICATION SYSTEMS By Syed Bakhtawar Shah Abid Lecturer in Computer Science 1 MULTIPLEXING An efficient system maximizes the utilization of all resources. Bandwidth is one of the most precious resources

More information

The Channel Vocoder (analyzer):

The Channel Vocoder (analyzer): Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.

More information

Bandwidth Efficient Mixed Pseudo Analogue-Digital Speech Transmission

Bandwidth Efficient Mixed Pseudo Analogue-Digital Speech Transmission Bandwidth Efficient Mixed Pseudo Analogue-Digital Speech Transmission Carsten Hoelper and Peter Vary {hoelper,vary}@ind.rwth-aachen.de ETSI Workshop on Speech and Noise in Wideband Communication 22.-23.

More information

Scalable Speech Coding for IP Networks

Scalable Speech Coding for IP Networks Santa Clara University Scholar Commons Engineering Ph.D. Theses Student Scholarship 8-24-2015 Scalable Speech Coding for IP Networks Koji Seto Santa Clara University Follow this and additional works at:

More information

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.

Keywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding. Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement

More information

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a

Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

ETSI TS V8.0.0 ( ) Technical Specification

ETSI TS V8.0.0 ( ) Technical Specification Technical Specification Digital cellular telecommunications system (Phase 2+); Enhanced Full Rate (EFR) speech processing functions; General description () GLOBAL SYSTEM FOR MOBILE COMMUNICATIONS R 1 Reference

More information

DETECTION OF CLIPPING IN CODED SPEECH SIGNALS. James Eaton and Patrick A. Naylor

DETECTION OF CLIPPING IN CODED SPEECH SIGNALS. James Eaton and Patrick A. Naylor DETECTION OF CLIPPING IN CODED SPEECH SIGNALS James Eaton and Patrick A. Naylor Department of Electrical and Electronic Engineering, Imperial College, London, UK {j.eaton, p.naylor}@imperial.ac.uk ABSTRACT

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013 1840 An Overview of Distributed Speech Recognition over WMN Jyoti Prakash Vengurlekar vengurlekar.jyoti13@gmai l.com

More information

Information. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract

Information. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding Takehiro Moriya Abstract Line Spectrum Pair (LSP) technology was accepted as an IEEE (Institute of Electrical and Electronics

More information

Call Quality Measurement for Telecommunication Network and Proposition of Tariff Rates

Call Quality Measurement for Telecommunication Network and Proposition of Tariff Rates Call Quality Measurement for Telecommunication Network and Proposition of Tariff Rates Akram Aburas School of Engineering, Design and Technology, University of Bradford Bradford, West Yorkshire, United

More information

Preface, Motivation and The Speech Coding Scene

Preface, Motivation and The Speech Coding Scene Preface, Motivation and The Speech Coding Scene In the era of third-generation (3G) wireless personal communications standards, despite the emergence of broad-band access network standard proposals, the

More information

DERIVATION OF TRAPS IN AUDITORY DOMAIN

DERIVATION OF TRAPS IN AUDITORY DOMAIN DERIVATION OF TRAPS IN AUDITORY DOMAIN Petr Motlíček, Doctoral Degree Programme (4) Dept. of Computer Graphics and Multimedia, FIT, BUT E-mail: motlicek@fit.vutbr.cz Supervised by: Dr. Jan Černocký, Prof.

More information

EUROPEAN pr ETS TELECOMMUNICATION November 1996 STANDARD

EUROPEAN pr ETS TELECOMMUNICATION November 1996 STANDARD FINAL DRAFT EUROPEAN pr ETS 300 723 TELECOMMUNICATION November 1996 STANDARD Source: ETSI TC-SMG Reference: DE/SMG-020651 ICS: 33.060.50 Key words: EFR, digital cellular telecommunications system, Global

More information

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Voiced/nonvoiced detection based on robustness of voiced epochs

Voiced/nonvoiced detection based on robustness of voiced epochs Voiced/nonvoiced detection based on robustness of voiced epochs by N. Dhananjaya, B.Yegnanarayana in IEEE Signal Processing Letters, 17, 3 : 273-276 Report No: IIIT/TR/2010/50 Centre for Language Technologies

More information

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of

More information

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes

SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,

More information

Voice Excited Lpc for Speech Compression by V/Uv Classification

Voice Excited Lpc for Speech Compression by V/Uv Classification IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech

More information

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact

More information

A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT

A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT L. Koenig (,2,3), R. André-Obrecht (), C. Mailhes (2) and S. Fabre (3) () University of Toulouse, IRIT/UPS, 8 Route de Narbonne, F-362 TOULOUSE

More information

Voice Coding, PCM Voice, Voice Quality, E-model

Voice Coding, PCM Voice, Voice Quality, E-model Voice Coding, PCM Voice, Voice Quality, E-model! PCM ~ Pulse Code Modulation Sampling Quantizing Linear Non-linear Quantizing error! PCM frame structure! Other Voice coding algorithms! E-model, Voice quality

More information

INTERNATIONAL TELECOMMUNICATION UNION

INTERNATIONAL TELECOMMUNICATION UNION INTERNATIONAL TELECOMMUNICATION UNION ITU-T P.862 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (02/2001) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods

More information

Overview of Code Excited Linear Predictive Coder

Overview of Code Excited Linear Predictive Coder Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances

More information

Transcoding of Narrowband to Wideband Speech

Transcoding of Narrowband to Wideband Speech University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University

More information

Voice Activity Detection for Speech Enhancement Applications

Voice Activity Detection for Speech Enhancement Applications Voice Activity Detection for Speech Enhancement Applications E. Verteletskaya, K. Sakhnov Abstract This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity

More information

Enhancing 3D Audio Using Blind Bandwidth Extension

Enhancing 3D Audio Using Blind Bandwidth Extension Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline

LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP Benjamin W. Wah Department of Electrical and Computer Engineering and the Coordinated Science Laboratory University of Illinois at Urbana-Champaign

More information

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur

More information

ARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION

ARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION ARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION Tenkasi Ramabadran and Mark Jasiuk Motorola Labs, Motorola Inc., 1301 East Algonquin Road, Schaumburg, IL 60196,

More information

RIR Estimation for Synthetic Data Acquisition

RIR Estimation for Synthetic Data Acquisition RIR Estimation for Synthetic Data Acquisition Kevin Venalainen, Philippe Moquin, Dinei Florencio Microsoft ABSTRACT - Automatic Speech Recognition (ASR) works best when the speech signal best matches the

More information

Speech Quality Assessment for Wideband Communication Scenarios

Speech Quality Assessment for Wideband Communication Scenarios Speech Quality Assessment for Wideband Communication Scenarios H. W. Gierlich, S. Völl, F. Kettler (HEAD acoustics GmbH) P. Jax (IND, RWTH Aachen) Workshop on Wideband Speech Quality in Terminals and Networks

More information

Radio Relay - Vocality to Vocality

Radio Relay - Vocality to Vocality Radio Relay - Vocality to Vocality Application Note AN230 Revision v1.5 November 2013 AN230 Radio relay - Vocality to Vocality 1 Overview This Application Note describes how you can set up two Vocality

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical

More information

Combining Voice Activity Detection Algorithms by Decision Fusion

Combining Voice Activity Detection Algorithms by Decision Fusion Combining Voice Activity Detection Algorithms by Decision Fusion Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti Speech and Image Processing Unit, University of Eastern Finland, Joensuu, Finland

More information

Speech Enhancement Using a Mixture-Maximum Model

Speech Enhancement Using a Mixture-Maximum Model IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 6, SEPTEMBER 2002 341 Speech Enhancement Using a Mixture-Maximum Model David Burshtein, Senior Member, IEEE, and Sharon Gannot, Member, IEEE

More information

10 Speech and Audio Signals

10 Speech and Audio Signals 0 Speech and Audio Signals Introduction Speech and audio signals are normally converted into PCM, which can be stored or transmitted as a PCM code, or compressed to reduce the number of bits used to code

More information

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC Jimmy Lapierre 1, Roch Lefebvre 1, Bruno Bessette 1, Vladimir Malenovsky 1, Redwan Salami 2 1 Université de Sherbrooke, Sherbrooke (Québec),

More information

Ap A ril F RRL RRL P ro r gra r m By Dick AH6EZ/W9

Ap A ril F RRL RRL P ro r gra r m By Dick AH6EZ/W9 April 2013 FRRL Program By Dick AH6EZ/W9 Why Digital Voice? Data speed or RF bandwidth reduction Transmission by shared digital media such as T1s Security and encryption PCM or ADPCM first US Patent in

More information

Speech Enhancement Based On Noise Reduction

Speech Enhancement Based On Noise Reduction Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Waveform Coding Algorithms: An Overview

Waveform Coding Algorithms: An Overview August 24, 2012 Waveform Coding Algorithms: An Overview RWTH Aachen University Compression Algorithms Seminar Report Summer Semester 2012 Adel Zaalouk - 300374 Aachen, Germany Contents 1 An Introduction

More information

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,

More information