ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY
|
|
- Blaise Daniels
- 5 years ago
- Views:
Transcription
1 ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY D. Nagajyothi 1 and P. Siddaiah 2 1 Department of Electronics and Communication Engineering, Vardhaman College of Engineering, Shamshabad, Telangana, India 2 Department of Electrical and Computer Engineering, University College of Engineering and Technology, Acharya Nagarjuna University, Guntur, India nagajyothi1998@gmail.com ABSTRACT WTIMIT, which is a derivative of TIMIT emerged as a latest technique for speech quality. The technique has good wideband characteristics over a range of 50-7 KHz. in this paper, a study on the performance of phoneme recognition system has been performed. The study includes the effect of decimating the signal to 8 KHz in the conventional case. Further it is possible to evaluate the AMR-wideband codec for several acoustic models. It is possible to propose the WTIMIT type of wideband channel data from training interactive voice receiving system. Keywords: speech codecs, IVR, AMR-Wb, TIMIT. 1. INTRODUCTION The Typical Bandwidth of Speech is less than 4 KHz for applications in telephonic operations. This is often termed as narrow band. The IVR operates at a sampling rate of 8 KHz for Operation [1-3]. Citing this, the advanced speech service system expanded their BW to wideband (WB) frequency range of KHz. The influence of a conventional telephony network on the N- TIMIT is used to evaluate the performance features of recognition system in traditional telephony. The Phoneme error rate (PER) suppressed by a huge extent due to direct WB speech. Similar in NB case, 23% relative PER degradations is identified. It is also reported that there is an evidence of the impact of a WB mobile network using the WTIMIT corpus. An enhancement of 19% PER with respect to direct WB speech is observed while there is a suppressing 3% PER relative to narrow band. In spite of these efforts it is to note that investigation pertaining to effects of telephony network are to be validated. This is more desirous in IVR based Telephony system. The development of DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus paved a way for evaluating automatic speech recognition (ASR) systems [4]. It constitutes wideband speech recordings which are sampled at 16 KHz. They typically containing in the rate of 50 Hz to 7 khz with respect to 630 native speakers. This is with reference to 8 major regions in the US. For training ten phonetically rich sentences are collected from every speech. In every utterance several features are extracted along with speech waveform, time aligned orthographic, phonetic and word transcriptions are taken. With reference to these efforts as of now there are five TIMIT derivatives namely FFMTIMIT, NTIMIT, CTIMIT, HTIMIT and STC-TIMIT. The FFMTIMIT can be abbreviated as Free Field Microphone TIMIT typical composed of natural TIMIT database. It typically uses a free field device for recording. NTIMIT (Network TIMIT) is adjunct to TIMIT with database constituting the speech wave form [4]. Over a telephone handset, Similarly CTIMIT constitutes of the original TIMIT recordings were passed through cellular telephone circuits. However, in the case of HTIMIT (Handset TIMIT) the data base consists of two subset with 192 male and 192 female speakers [5]. The corresponding speech signals are those which are transmitted through different telephone handsets. This typically helps in the investigation of telephone transducer effects on speech. For STCTIMIT which is single channel, the speech signals were sent through a real and, in contrast to NTIMIT, all these can be turned as the derivation of wideband speech [6-11]. While some are telephony are containing narrowband speech. The sampling is at the rate of 8 khz with a range of 200 Hz to 3.4 khz. Inspite of all these it is to be noted that there is no availability of real world wideband telephony speech corpus. Several versions of wideband speech codes like G.722 (1988), G.722 (1999) G (2001) and G (2008) have been into operation with several techniques like ADPCM, 3GPP [6] and wide band PCM. It is interesting to note that the wide band telephony speech transmission system is wide available and adaptable. In contrast to ever increasing mobile networks citing this, it its essential to have wideband system in the TIMIT for a wide range of scientific investigations. There are several advantages and applications associated with WBSTS. The integrated speech recognition system provides remote dictation or spelling. This was not a possible case with the earlier telephony system. In this paper, an investigation on the performance of the speech CODECS in terms of Bit rates is performed. The analysis is based experimentation carried out in MATLAB on windows platform in an i3 with 4 GB RAM. Further, the paper is organized as follows. A brief discussion on the standards and the corresponding bit rates of several speech codecs are presented in section 2. Results pertaining to synthesis followed by analysis of speech codecs is given section 3. Overall conclusion is given in section SPEECH CODECS In this Section, a brief introduction to the speech codecs is given. The aim of the speech codec is to compress the speech signal in order to reduce the bandwidth and requires minimum storage space. When we will reconstruct, it must be very close to original one. 1386
2 Based on intelligibility and naturalness we will measure perceived quality of the signal. Here we have considered two types of the networks GSM and VoIP. Table-1 and Table-2 shows the specification summary of the all the supported narrowband wideband codecs. Table-1. ITU-T approved VoIP supported narrowband and wideband speech codecs. Coding standard Algorithm Sampling frequency (khz) Bit rates (kbps) G.711 (A /U) Companded PCM 8 64 G.726 ADPCM 8 16/24/ 32/40 G.729 CS-ACELP 8 8 G.723.1A ACELP / MP-MLQ / 6.3 G.722 (WB) SB-ADPCM 16 48, 56, 64 G (WB) Companded PCM, MDCT 16 64, 80, 96 G (WB) CELP, TD-BWE, TDAC G (AMR-WB) (Multi Rate) MRWB- ACELP , 8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05, Table-2. ETSI/3GPP approved GSM supported narrowband and wideband speech codecs. Coding standard Algorithm Sampling frequency (khz) Bit rates (kbps) GSM FR RPE-LTP 8 13 GSMEFR ACELP GSM HR VSELP (Multi Rate) MR- 4.75, 5.15, 5.90, 6.70, 7.40, GSM AMR 8 ACELP 7.95, 10.2, , 8.85, 12.65, 14.25, 15.85, GSM AMR-WB MRWB-ACELP , 19.85, 23.05, RESULTS AND DISCUSSIONS Results pertaining to the technique and proposed method are presented in this Section. Testing of the Coded Data with G.711-Coded Models (8-kHz HMMs). The 8- khz un-coded and coded speech data that is coded with all other wireline codecs, such as G.711, G.726 and G.729, is tested with the G.711-coded models (8-kHz trained HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported in the following tables. Case-1: Testing the wireline coded data (NB Codecs) with G.711-coded trained models The corresponding comparative analysis based on ASR accuracy with respect to G.711, G.726, G.729 and Un-coded are as shown in Figure-1. It is evident that the G.726 and G.711 have reported high accuracy coefficient. Table-3. Results of testing the wireline coded data (NB Codecs) with G.711-coded trained models (8kHzHMMs. Coded data used in testing Un-coded G G G
3 Figure-1. Graphic results of testing the wireline coded data (NB Codecs) with G.711-coded trained models (8kHzHMMs). Case-2: Testing of the coded data with G.729-coded models (8-kHz HMMs) is coded with all other wireline codecs, such as G.711, G.726 and G.729, is tested with the G.729-coded models (8-kHz HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireline coded data (NB codecs) with G.729-coded trained models (8kHzHMMs). Table-4. Testing of the coded data with G.729-coded models (8-kHz HMMs). Coded data used in testing Un-coded G G G It can be inferred from the comparative results shown in Figure.2 for G.711, G.726, G.729 and Un-coded for testing the wireline Codecs with 8 KHz G.729 coded HMMs, that the impact of the respective coding is minimal. Figure-2. Graphic results of testing the wireline coded data (NB codecs) with G.729-coded trained models (8kHzHMMs). Case-3: Testing of the coded data with HR-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, is tested with the HR-coded models (8-kHz HMMs) for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireless coded data (NB Codecs) with HR-coded trained models (8kHzHMMs. Table-5. Testing of the coded data with HR-coded models (8-kHz HMMs). Coded data used in testing Un-coded FR EFR HR
4 when compared as shown in Figure-3. HR reported minimal, however almost similar to EFR for CD-8gau. Figure-3. Graphic results of testing the wireless coded data (NB Codecs) with HR-coded trained models (8kHzHMMs). FR reported to be having high accuracy coefficient when compared with EFR, HR and Un-coded Case-4: Testing of the coded data with AMR4.75-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, and wireline codecs, such as G.711, G.726 and G.729 are tested with the coded models (8-kHz HMMs) for the CI, and CD-tied triphone models with 1, 2, 4 and 8 Gaussians per state, and are reported. Results of testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs) Table-6. Testing of the coded data with AMR4.75-coded models (8-kHz HMMs). Coded data used in testing Un-coded FR EFR HR AMR@ AMR@ G G G When compared with respect to accuracy as shown in Figure-4, it is clearly evident that the FR expressed high accuracy while Un-coaded produced poor results while testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs). Figure-4. Graphic results of testing the wireless coded data (NB Codecs) with AMR@4.75-coded trained models (8kHzHMMs). Case-5: Testing of the AMR coded data with AMR12.2-coded models (8-kHz HMMs) is coded with all other NB wireless codecs, such as FR, EFR, HR, and AMR, is tested with the AMR@12.2kbps coded models for the CI, and CD-tied tri-phone models with 1, 2, 4 and 8 Gaussians per state, and are reported in the following table. 1389
5 Table-7. Results of testing the AMR coded data (NB codecs) with trained models (8kHzHMMs). Coded data used in testing Un-coded FR EFR HR models. The ASR performance is almost same when tested with either 16-kHz coded models or 8-kHz un-coded models. The ASR performance is poor when tested with 16-kHz un-coded models. REFERENCES [1] X. Huang, A. Acero and H. W. Hon Spoken Language Processing: A Guide to Theory, Algorithm and System Development, Prentice Hall. Figure-5. Graphic results of testing the wireless coded data (NB Codecs) with AMR@12.2-coded trained models (8kHzHMMs). While comparing during the testing the wireless coded data (NB Codecs) with AMR@12.2-coded trained models (8kHzHMMs) in connection with the previous analysis in Case-4, the respective FR produced accuracy at high degree with respect to EFR, HR, AMR@4.75 and AMR@12.2 also including the un-coded. 4. CONCLUSIONS ASR accuracies accuracy for un-coded and coded data when tested with the different coded models that include 8-kHz and 16-kHz HMMs. The major observations made are as follows. The ASR accuracy always increases with 8-kHz coded trained models when compared to 8-kHz un-coded models for all the narrowband codecs. The ASR accuracy of coded data of any particular codec increases by at least 2% when the same type of coded models is used. The ASR results for coded data for specific codecs, such as G.711, G.729, HR and AMR12.2 for un-coded and respective coded models, are re-organized to see the ASR improvements. Coded data (G.711/G.729/HR/AMR12.2) tested with 8-kHz uncoded HMMs while the Coded data (G.711/G.729/HR/AMR12.2) tested with 16-kHz uncoded HMMs. Similarly, the Coded data (G.711/G.729/HR/AMR12.2) tested with 8-kHz Coded (G.711/G.729/HR/AMR12.2) HMMs whereas the Coded data (G.711/G.729/HR/AMR12.2) tested with 16-kHz Coded (G.711/G.729/HR/AMR12.2) HMMs. All these codecs perform well for the respective 8-kHz coded [2] K. W. Church and R. L. Mercer Introduction to the Special Issue on Computational Linguistics Using Large Corpora. Computational Linguistics. 19(1): [3] Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms. ETSI. [4] J. S. Garofolo et al TIMIT Acoustic-Phonetic Continuous Speech Corpus. Linguistic Data Consortium, Philadelphia, USA. [5] J. S. Garofolo et al FFMTIMIT. Linguistic Data Consortium, Philadelphia, USA. [6] 3GPP Mandatory Speech Codec Speech Processing Functions: AMR Speech Codec; Transcoding Functions (3G TS ). [7] C. Jankowski et al NTIMIT: A Phonetically Balanced, Continuous Speech, Telephone Bandwidth Speech Database. In Proc. of ICASSP, pp [8] K.-F. Lee and H.-W. Hon Speaker-Independent Phone Recognition Using Hidden Markov Models. IEEE Transactions on Acoustics, Speech and Signal Processing. 37(11):
6 [9] N. Morales et al STC-TIMIT: Generation of a Single-channel Telephone Corpus. In Proc. of LREC. pp [10] D. A. Reynolds HTIMIT and LLHDB: Speech Corpora for the Study of Handset Transducer Effects. In Proc. of ICASSP. 2: [11] P. Bauer and T. Fingscheidt WTIMIT 1.0. Linguistic Data Consortium, Philadelphia. 1391
International Journal of Computer Engineering and Applications, Volume XI, Issue XII, Dec. 17, ISSN
SPEECH-ENABLED IVR USING ARTIFICIAL BANDWIDTH EXTENSION TECHNIQUE Mohan Dholvan 1, Dr. Anitha Sheela Kancharla 2 1 Department of Electronics and Computer Engineering, SNIST, Hyderabad, Telangana, India
More informationAutomatic Speech Recognition (ASR) Over VoIP and Wireless Networks
Final Report of the UGC Sponsored Major Research Project on Automatic Speech Recognition (ASR) Over VoIP and Wireless Networks UGC Sanction Letter: 41-600/2012 (SR) Dated 18th July 2012 by Prof.P.Laxminarayana
More informationArtificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation
Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation Johannes Abel and Tim Fingscheidt Institute
More informationWideband Speech Coding & Its Application
Wideband Speech Coding & Its Application Apeksha B. landge. M.E. [student] Aditya Engineering College Beed Prof. Amir Lodhi. Guide & HOD, Aditya Engineering College Beed ABSTRACT: Increasing the bandwidth
More informationTranscoding free voice transmission in GSM and UMTS networks
Transcoding free voice transmission in GSM and UMTS networks Sara Stančin, Grega Jakus, Sašo Tomažič University of Ljubljana, Faculty of Electrical Engineering Abstract - Transcoding refers to the conversion
More informationTechnical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing
Technical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing 2 Reference DTR/STQ-00196m Keywords QoS, quality, speech 650 Route des Lucioles F-06921
More information3GPP TS V5.0.0 ( )
TS 26.171 V5.0.0 (2001-03) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Speech Codec speech processing functions; AMR Wideband
More informationAcoustics of wideband terminals: a 3GPP perspective
Acoustics of wideband terminals: a 3GPP perspective Orange Labs Stéphane RAGOT Orange Delegate in 3GPP & 3GPP SA4 Vice-Chair Co-Rapporteur of 3GPP work item on "Requirements and Test Methods for Wideband
More informationWideband Speech Encryption Based Arnold Cat Map for AMR-WB G Codec
Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G.722.2 Codec Fatiha Merazka Telecommunications Department USTHB, University of science & technology Houari Boumediene P.O.Box 32 El Alia 6 Bab
More informationUsing RASTA in task independent TANDEM feature extraction
R E S E A R C H R E P O R T I D I A P Using RASTA in task independent TANDEM feature extraction Guillermo Aradilla a John Dines a Sunil Sivadas a b IDIAP RR 04-22 April 2004 D a l l e M o l l e I n s t
More informationSpeech Coding Technique And Analysis Of Speech Codec Using CS-ACELP
Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com
More informationFlexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders
Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Václav Eksler, Bruno Bessette, Milan Jelínek, Tommy Vaillancourt University of Sherbrooke, VoiceAge Corporation Montreal, QC,
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Signal Processing in Acoustics Session 2pSP: Acoustic Signal Processing
More informationImpact of the GSM AMR Speech Codec on Formant Information Important to Forensic Speaker Identification
PAGE 483 Impact of the GSM AMR Speech Codec on Formant Information Important to Forensic Speaker Identification Bernard J Guillemin, Catherine I Watson Department of Electrical & Computer Engineering The
More informationCOM 12 C 288 E October 2011 English only Original: English
Question(s): 9/12 Source: Title: INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2009-2012 Audience STUDY GROUP 12 CONTRIBUTION 288 P.ONRA Contribution Additional
More informationSynchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech
INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,
More informationTechnical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3
TSGS#7(00)0028 Technical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3 Source: TSG-S4 Title: AMR Wideband Permanent project document WB-4:
More informationTECHNICAL REPORT Speech and multimedia Transmission Quality (STQ); Speech samples and their use for QoS testing
TR 103 138 V1.3.1 (2015-03) TECHNICAL REPORT Speech and multimedia Transmission Quality (STQ); Speech samples and their use for QoS testing 2 TR 103 138 V1.3.1 (2015-03) Reference RTR/STQ-00203m Keywords
More informationSpeech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions
INTERSPEECH 01 Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions Hannu Pulakka 1, Ville Myllylä 1, Anssi Rämö, and Paavo Alku 1 Microsoft
More informationNinad Bhatt Yogeshwar Kosta
DOI 10.1007/s10772-012-9178-9 Implementation of variable bitrate data hiding techniques on standard and proposed GSM 06.10 full rate coder and its overall comparative evaluation of performance Ninad Bhatt
More informationBandwidth Extension for Speech Enhancement
Bandwidth Extension for Speech Enhancement F. Mustiere, M. Bouchard, M. Bolic University of Ottawa Tuesday, May 4 th 2010 CCECE 2010: Signal and Multimedia Processing 1 2 3 4 Current Topic 1 2 3 4 Context
More informationPractical Limitations of Wideband Terminals
Practical Limitations of Wideband Terminals Dr.-Ing. Carsten Sydow Siemens AG ICM CP RD VD1 Grillparzerstr. 12a 8167 Munich, Germany E-Mail: sydow@siemens.com Workshop on Wideband Speech Quality in Terminals
More informationInternational Journal of Advanced Engineering Technology E-ISSN
Research Article ARCHITECTURAL STUDY, IMPLEMENTATION AND OBJECTIVE EVALUATION OF CODE EXCITED LINEAR PREDICTION BASED GSM AMR 06.90 SPEECH CODER USING MATLAB Bhatt Ninad S. 1 *, Kosta Yogesh P. 2 Address
More informationtechniques are means of reducing the bandwidth needed to represent the human voice. In mobile
8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques
More informationCellular systems & GSM Wireless Systems, a.a. 2014/2015
Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More information22. Konferenz Elektronische Sprachsignalverarbeitung (ESSV), September 2011, Aachen, Germany (TuDPress, ISBN )
BINAURAL WIDEBAND TELEPHONY USING STEGANOGRAPHY Bernd Geiser, Magnus Schäfer, and Peter Vary Institute of Communication Systems and Data Processing ( ) RWTH Aachen University, Germany {geiser schaefer
More informationCHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT
CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT 7.1 INTRODUCTION Originally developed to be used in GSM by the Europe Telecommunications Standards Institute (ETSI), the AMR speech codec
More informationThe Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market
5 th Nov, 2008 The Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market PN101 Roger Chung of Freescale Semiconductor, Inc. All other product or service names are the property
More informationIMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM
IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM Jinyu Li, Dong Yu, Jui-Ting Huang, and Yifan Gong Microsoft Corporation, One Microsoft Way, Redmond, WA 98052 ABSTRACT
More informationAn audio watermark-based speech bandwidth extension method
Chen et al. EURASIP Journal on Audio, Speech, and Music Processing 2013, 2013:10 RESEARCH Open Access An audio watermark-based speech bandwidth extension method Zhe Chen, Chengyong Zhao, Guosheng Geng
More informationBandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding?
WIDEBAND SPEECH CODING STANDARDS AND WIRELESS SERVICES Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding? Peter Jax and Peter Vary, RWTH Aachen University
More informationAn objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec
An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec Akira Nishimura 1 1 Department of Media and Cultural Studies, Tokyo University of Information Sciences,
More informationImproving Sound Quality by Bandwidth Extension
International Journal of Scientific & Engineering Research, Volume 3, Issue 9, September-212 Improving Sound Quality by Bandwidth Extension M. Pradeepa, M.Tech, Assistant Professor Abstract - In recent
More informationAUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES
AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES N. Sunil 1, K. Sahithya Reddy 2, U.N.D.L.mounika 3 1 ECE, Gurunanak Institute of Technology, (India) 2 ECE,
More informationVoice Coding, PCM Voice, Voice Quality, E-model
Voice Coding, PCM Voice, Voice Quality, E-model PCM ~ Pulse Code Modulation Sampling Quantizing Linear Non-linear Quantizing error PCM frame structure Other Voice coding algorithms E-model, Voice quality
More informationETSI TS V ( )
TS 126 171 V14.0.0 (2017-04) TECHNICAL SPECIFICATION Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Speech codec speech processing
More informationEnabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends
Distributed Speech Recognition Enabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends David Pearce & Chairman
More informationEnhancement of Speech Signal by Adaptation of Scales and Thresholds of Bionic Wavelet Transform Coefficients
ISSN (Print) : 232 3765 An ISO 3297: 27 Certified Organization Vol. 3, Special Issue 3, April 214 Paiyanoor-63 14, Tamil Nadu, India Enhancement of Speech Signal by Adaptation of Scales and Thresholds
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationEFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS. Pramod Bachhav, Massimiliano Todisco and Nicholas Evans
EFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS Pramod Bachhav, Massimiliano Todisco and Nicholas Evans EURECOM, Sophia Antipolis, France {bachhav,todisco,evans}@eurecom.fr
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationPerceptual wideband speech and audio quality measurement. Dr Antony Rix Psytechnics Limited
Perceptual wideband speech and audio quality measurement Dr Antony Rix Psytechnics Limited Agenda Background Perceptual models BS.1387 PEAQ P.862 PESQ Scope Extension to wideband Performance of wideband
More informationSubjective Voice Quality Evaluation of Artificial Bandwidth Extension: Comparing Different Audio Bandwidths and Speech Codecs
INTERSPEECH 01 Subjective Voice Quality Evaluation of Artificial Bandwidth Extension: Comparing Different Audio Bandwidths and Speech Codecs Hannu Pulakka 1, Anssi Rämö, Ville Myllylä 1, Henri Toukomaa,
More informationSimulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder
COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech
More informationSpatial Audio Transmission Technology for Multi-point Mobile Voice Chat
Audio Transmission Technology for Multi-point Mobile Voice Chat Voice Chat Multi-channel Coding Binaural Signal Processing Audio Transmission Technology for Multi-point Mobile Voice Chat We have developed
More informationENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC.
ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC Jérémie Lecomte, Adrian Tomasek, Goran Marković, Michael Schnabel, Kimitaka Tsutsumi, Kei Kikuiri Fraunhofer IIS, Erlangen, Germany,
More informationGerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008
Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems Speech Communication Channels in a Vehicle 2 Into the vehicle Within the vehicle Out of the vehicle Speech
More informationSIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS
SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS 1 WAHYU KUSUMA R., 2 PRINCE BRAVE GUHYAPATI V 1 Computer Laboratory Staff., Department of Information Systems, Gunadarma University,
More informationQuality comparison of wideband coders including tandeming and transcoding
ETSI Workshop on Speech and Noise In Wideband Communication, 22nd and 23rd May 2007 - Sophia Antipolis, France Quality comparison of wideband coders including tandeming and transcoding Catherine Quinquis
More informationSERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods for objective and subjective assessment of quality
International Telecommunication Union ITU-T TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU P.862.3 (11/2007) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods
More informationDeriving Equipment Impairment Factors for Wideband Speech Codecs
Deriving Equipment Impairment Factors for Wideband Speech Codecs Sebastian Möller 1, Alexander Raake 1, Vincent Barriac 2, Catherine Quinquis 2 1 IKA, Ruhr-University Bochum, Germany 2 France Télécom R&D,
More informationLesson 8 Speech coding
Lesson 8 coding Encoding Information Transmitter Antenna Interleaving Among Frames De-Interleaving Antenna Transmission Line Decoding Transmission Line Receiver Information Lesson 8 Outline How information
More informationTELECOMMUNICATION SYSTEMS
TELECOMMUNICATION SYSTEMS By Syed Bakhtawar Shah Abid Lecturer in Computer Science 1 MULTIPLEXING An efficient system maximizes the utilization of all resources. Bandwidth is one of the most precious resources
More informationThe Channel Vocoder (analyzer):
Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.
More informationBandwidth Efficient Mixed Pseudo Analogue-Digital Speech Transmission
Bandwidth Efficient Mixed Pseudo Analogue-Digital Speech Transmission Carsten Hoelper and Peter Vary {hoelper,vary}@ind.rwth-aachen.de ETSI Workshop on Speech and Noise in Wideband Communication 22.-23.
More informationScalable Speech Coding for IP Networks
Santa Clara University Scholar Commons Engineering Ph.D. Theses Student Scholarship 8-24-2015 Scalable Speech Coding for IP Networks Koji Seto Santa Clara University Follow this and additional works at:
More informationKeywords Decomposition; Reconstruction; SNR; Speech signal; Super soft Thresholding.
Volume 5, Issue 2, February 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Speech Enhancement
More informationEffective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a
R E S E A R C H R E P O R T I D I A P Effective post-processing for single-channel frequency-domain speech enhancement Weifeng Li a IDIAP RR 7-7 January 8 submitted for publication a IDIAP Research Institute,
More informationDifferent Approaches of Spectral Subtraction Method for Speech Enhancement
ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches
More informationETSI TS V8.0.0 ( ) Technical Specification
Technical Specification Digital cellular telecommunications system (Phase 2+); Enhanced Full Rate (EFR) speech processing functions; General description () GLOBAL SYSTEM FOR MOBILE COMMUNICATIONS R 1 Reference
More informationDETECTION OF CLIPPING IN CODED SPEECH SIGNALS. James Eaton and Patrick A. Naylor
DETECTION OF CLIPPING IN CODED SPEECH SIGNALS James Eaton and Patrick A. Naylor Department of Electrical and Electronic Engineering, Imperial College, London, UK {j.eaton, p.naylor}@imperial.ac.uk ABSTRACT
More informationInternational Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN
International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013 1840 An Overview of Distributed Speech Recognition over WMN Jyoti Prakash Vengurlekar vengurlekar.jyoti13@gmai l.com
More informationInformation. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract
LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding Takehiro Moriya Abstract Line Spectrum Pair (LSP) technology was accepted as an IEEE (Institute of Electrical and Electronics
More informationCall Quality Measurement for Telecommunication Network and Proposition of Tariff Rates
Call Quality Measurement for Telecommunication Network and Proposition of Tariff Rates Akram Aburas School of Engineering, Design and Technology, University of Bradford Bradford, West Yorkshire, United
More informationPreface, Motivation and The Speech Coding Scene
Preface, Motivation and The Speech Coding Scene In the era of third-generation (3G) wireless personal communications standards, despite the emergence of broad-band access network standard proposals, the
More informationDERIVATION OF TRAPS IN AUDITORY DOMAIN
DERIVATION OF TRAPS IN AUDITORY DOMAIN Petr Motlíček, Doctoral Degree Programme (4) Dept. of Computer Graphics and Multimedia, FIT, BUT E-mail: motlicek@fit.vutbr.cz Supervised by: Dr. Jan Černocký, Prof.
More informationEUROPEAN pr ETS TELECOMMUNICATION November 1996 STANDARD
FINAL DRAFT EUROPEAN pr ETS 300 723 TELECOMMUNICATION November 1996 STANDARD Source: ETSI TC-SMG Reference: DE/SMG-020651 ICS: 33.060.50 Key words: EFR, digital cellular telecommunications system, Global
More informationNOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING
NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationVoiced/nonvoiced detection based on robustness of voiced epochs
Voiced/nonvoiced detection based on robustness of voiced epochs by N. Dhananjaya, B.Yegnanarayana in IEEE Signal Processing Letters, 17, 3 : 273-276 Report No: IIIT/TR/2010/50 Centre for Language Technologies
More informationVocoder (LPC) Analysis by Variation of Input Parameters and Signals
ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationVoice Excited Lpc for Speech Compression by V/Uv Classification
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech
More informationAdaptive time scale modification of speech for graceful degrading voice quality in congested networks
Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact
More informationA NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT
A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT L. Koenig (,2,3), R. André-Obrecht (), C. Mailhes (2) and S. Fabre (3) () University of Toulouse, IRIT/UPS, 8 Route de Narbonne, F-362 TOULOUSE
More informationVoice Coding, PCM Voice, Voice Quality, E-model
Voice Coding, PCM Voice, Voice Quality, E-model! PCM ~ Pulse Code Modulation Sampling Quantizing Linear Non-linear Quantizing error! PCM frame structure! Other Voice coding algorithms! E-model, Voice quality
More informationINTERNATIONAL TELECOMMUNICATION UNION
INTERNATIONAL TELECOMMUNICATION UNION ITU-T P.862 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (02/2001) SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods
More informationOverview of Code Excited Linear Predictive Coder
Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances
More informationTranscoding of Narrowband to Wideband Speech
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University
More informationVoice Activity Detection for Speech Enhancement Applications
Voice Activity Detection for Speech Enhancement Applications E. Verteletskaya, K. Sakhnov Abstract This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity
More informationEnhancing 3D Audio Using Blind Bandwidth Extension
Enhancing 3D Audio Using Blind Bandwidth Extension (PREPRINT) Tim Habigt, Marko Ðurković, Martin Rothbucher, and Klaus Diepold Institute for Data Processing, Technische Universität München, 829 München,
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationLOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline
LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP Benjamin W. Wah Department of Electrical and Computer Engineering and the Coordinated Science Laboratory University of Illinois at Urbana-Champaign
More informationIMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM
IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur
More informationARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION
ARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION Tenkasi Ramabadran and Mark Jasiuk Motorola Labs, Motorola Inc., 1301 East Algonquin Road, Schaumburg, IL 60196,
More informationRIR Estimation for Synthetic Data Acquisition
RIR Estimation for Synthetic Data Acquisition Kevin Venalainen, Philippe Moquin, Dinei Florencio Microsoft ABSTRACT - Automatic Speech Recognition (ASR) works best when the speech signal best matches the
More informationSpeech Quality Assessment for Wideband Communication Scenarios
Speech Quality Assessment for Wideband Communication Scenarios H. W. Gierlich, S. Völl, F. Kettler (HEAD acoustics GmbH) P. Jax (IND, RWTH Aachen) Workshop on Wideband Speech Quality in Terminals and Networks
More informationRadio Relay - Vocality to Vocality
Radio Relay - Vocality to Vocality Application Note AN230 Revision v1.5 November 2013 AN230 Radio relay - Vocality to Vocality 1 Overview This Application Note describes how you can set up two Vocality
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationIMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey
Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical
More informationCombining Voice Activity Detection Algorithms by Decision Fusion
Combining Voice Activity Detection Algorithms by Decision Fusion Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti Speech and Image Processing Unit, University of Eastern Finland, Joensuu, Finland
More informationSpeech Enhancement Using a Mixture-Maximum Model
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 6, SEPTEMBER 2002 341 Speech Enhancement Using a Mixture-Maximum Model David Burshtein, Senior Member, IEEE, and Sharon Gannot, Member, IEEE
More information10 Speech and Audio Signals
0 Speech and Audio Signals Introduction Speech and audio signals are normally converted into PCM, which can be stored or transmitted as a PCM code, or compressed to reduce the number of bits used to code
More informationNOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC
NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC Jimmy Lapierre 1, Roch Lefebvre 1, Bruno Bessette 1, Vladimir Malenovsky 1, Redwan Salami 2 1 Université de Sherbrooke, Sherbrooke (Québec),
More informationAp A ril F RRL RRL P ro r gra r m By Dick AH6EZ/W9
April 2013 FRRL Program By Dick AH6EZ/W9 Why Digital Voice? Data speed or RF bandwidth reduction Transmission by shared digital media such as T1s Security and encryption PCM or ADPCM first US Patent in
More informationSpeech Enhancement Based On Noise Reduction
Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationWaveform Coding Algorithms: An Overview
August 24, 2012 Waveform Coding Algorithms: An Overview RWTH Aachen University Compression Algorithms Seminar Report Summer Semester 2012 Adel Zaalouk - 300374 Aachen, Germany Contents 1 An Introduction
More informationSpeech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter
Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter 1 Gupteswar Sahu, 2 D. Arun Kumar, 3 M. Bala Krishna and 4 Jami Venkata Suman Assistant Professor, Department of ECE,
More information