Chapter IV THEORY OF CELP CODING

Size: px
Start display at page:

Download "Chapter IV THEORY OF CELP CODING"

Transcription

1 Chapter IV THEORY OF CELP CODING

2 CHAPTER IV THEORY OF CELP CODING 4.1 Introduction Wavefonn coders fail to produce high quality speech at bit rate lower than 16 kbps. Source coders, such as LPC vocoders, operates at the rates as low as 2 kbps but fail to provide speech quality appropriate for the commercial telephone application in the wire line as well as wireless medium. Hybrid coders based on Analysis by Synthesis (AbS) speech coding produces toll quality speech at rate less than 10 kbps. The chosen codec for this study is a time domain hybrid coder is based on Algebraic Code Excited Linear prediction (ACELP) architecture. The basic structure is that of the CELP with the secondary excitation searched from a memory less algebraic codebook. In this chapter the basic theory of the standard CELP based speech coding algorithm is described. Commencing with the generalized AbS coding, the analytical tools are briefly discussed. In order to maintain good quality in the reconstructed speech despite less computational complexity and spectrally efficient speech coding, various modifications are possible in the implementations of the different sections of the CELP based speech coding algorithm. 4.2 Generalized AbS-LPC Speech Coding In AbS-LPC coding [6] [46] schemes the locally synthesized signal is compared with original speech signal and coder parameters are selected so as to produce minimum mean square error between the original speech signal and the reconstructed speech signal. The perfonnance of this scheme is better due to the involvement of the closed loop optimization procedure for parameters estimate. The basic structure of the - 32-

3 AbS-LPC scheme [47] is as illustrated in figure 4.1. Functionally the entire AbS-LPC scheme can be divided into three sections or parts: (i) (ii) (iii) Time varying filter Excitation signal generation Error minimization procedure!"'"""" Original speech I Excitation signal generator Time varying filter Error minimization coder Excitation signal generator Time varying filter ~ ~." j s~;~~~c I Figure No. 4.1: Generalized AbS-LPC scheme Time Varying Filters The time varying filter in the model is combination of the two linear predictors namely- the STP or LPC and the LTP or pitch filter. The Short term prediction filter exploits the correlation between the adjacent speech samples. The long term predictor removes the correlation between the distant samples, normally one pitch or multiple pitch away. The STP filter is generally implemented as linear prediction time -33-

4 varying filter, whereas the L TP is generally implemented as adaptive codebook in the excitation synthesis and search Excitation Signal Generator Excitation signal represents the input to the time varying filter (normally L TP) and is the most important part of the ABS scheme. The discrimination between the various AbS-LPC schemes is based on the way in which the excitation signal vector has been represented. In CELP based speech coding algorithms, the excitation signal is chosen from a predefined codebook. The excitation signal vector can be selected from one single codebook or it may be summation of two sub-vectors from two different excitation sources. Majority of the AbS-LPC schemes reported to use two codebooks, one fixed codebook and another adaptive codebook. In this study the developed spectrally efficient speech coding algorithm uses a fixed algebraic codebook, for fixed excitation and pitch adaptive codebook to generate the excitation for the LP synthesis Error Minimization Procedure The criterion of minimization over several errors, such as absolute error, maximum error, mean square error etc. is possible; the most commonly used error minimization criterion is minimum mean square error (MMSE). The mean square error between two signals 1 n=n MSE =-l;(s,'(n)-si(n)) N n=o (4.1) criterion in the error analysis. A perceptually or weighted mean square error criterion can also be used as -34-

5 4.2.4 Types Of AbS Speech Coders There are various implementations of the speech coding algorithms based on the analysis by synthesis speech coding [48], as listed below: SELP - Self excited linear predictor MPLPC - Multi pulse excited linear predictor RPE-LPC- Regular pulse excited linear predictor CELP - Code excited linear predictor. These differ more or less in the type of excitations used in the ABS scheme. For each group, however there can be different internal variation in the design and implementation of the coding algorithms. Theoretical details of the CELP based speech coding algorithm is discussed with specific emphasis to the Algebraic CELP, as it is the technique used in the present study. 4.3 CELP: Theoretical Aspects The theoretical aspects of the CELP based implementation of the ABS speech coding scheme is presented as follows Basic Principle The redundancies in the speech signal are almost removed after the short term prediction and long term prediction of the speech signal and the residual has very little correlation left in it. Then an excitation is searched which synthesizes the speech and the codebook index and gain are searched from the fixed codebook. The optimum codebook index selection criterion is based on MMSE between the locally synthesized speech and the original speech signal. Atal and Schroeder first proposed the CELP long back in 1984 [6], but until recently CELP has got the attention as speech coding -35-

6 algorithm for spectrally efficient speech coding. The standard model of the CELP is illustrated in the figure no. 4.2 ~-.. - " "'"'"""""'"'''"''''''''- -" ""''"'''"''''"""""''"''"'''''''''''''''"'''"''-" Input speech ~ Windowing and LP analysis Zero excitation Zero excitation ~I LP Synthesis LTP ~I LP Synthesis t r s~i~~~ ~i;~;; ct~i~; l.o... r... ~~~.. ~~~.~....r.... codebook ~ ~L-_L_P_S_y_n_th_es_is_...J! :..... Select index and gain...! Figure No. 4.2: Block diagram of standard CELP coding algorithm Operation Of The CELP Algorithm The illustrated CELP operates as follows: I. The original speech signal is portioned into frames of I Oms - 20ms and LP analysis is performed. LP model parameters are estimated using one of the various LP analysis methods. The memory of the STP is flushed out before further processing. 2. The L TP analysis is then performed over the target signal, which depends on the method used. Target signal is generally the LP residual obtained by the LP inverse -36-

7 filtering in the open loop method (OLM)or modified open loop method (MOLM), whereas the original speech is used as the target signal in the closed loop method (CLM) [22]. The pitch delay and pitch gain are the two LTP model parameters estimated in the L TP analysis. 3. The new target for the fixed codebook is then obtained by considering the STP and L TP contribution from the original speech signal. Secondary excitation is then determined by performing the exhaustive search of the fixed codebook, selection criterion being the MMSE. Codebook index and codebook gain are the selected parameters of the fixed code book. 4. The decoding algorithm for the CELP is as depicted in the figure No At the decoder the excitation ids constructed from the L TP parameters and the code book parameters. The synthesized excitation is then fed to the LP synthesis filter.the update of the excitation is usually performed on the sub multiples of the LP analysis frame. r / ~ I! Zo~~b~:k STP parameters. 'I parameters LP synthesis Or roo~:';"' Synthesized speech! ~-- -"''''''''''''''''''""~ "'"''"''''""""""'''''''''''''''''" " '! Figure No. 4.3: Block diagram of standard CELP decoding algorithm -37-

8 4.3.3 Secondary Excitation Codebook The vectors contained in the codebook are very important part of the CELP based speech coding algorithms. It is used to generate the excitation for the time varying filter which synthesizes the speech at the decoder end. The contribution of the secondary excitation is more useful during the unvoiced portion or the inactive portion of the speech as for the voiced portion L TP provides more contribution. The codebook population of the excitation vectors and the search procedure of the excitation vectors are the two most important issues in the secondary excitation of the CELP based speech coding. The requirement of quality, lower search complexity and reduced memory for the storage of the codebook excitation vectors resulted in the the different type of the secondary excitation codebooks. A lot of research has been focused on reducing the complexity of the speech coding algorithm by using different code book architecture and efficient search procedures. As a consequence a variety of codebook structures has been developed. Some of the codebooks used are: I. Sparse codebook 2. Ternary codebook 3. Overlapping codebook 4. Binary pulse excited codebook 5. Algebraic codebook The secondary excitation code books have faster search if the the code book are structured, consequently CELP based coding sues the structured code book. There are three types of structured code books: 1. Sparse codebook 2. Ternary codebook 3. Algebraic codebook 38-

9 Sparse codebook A zero mean unit variance Gaussian random process is used to populate this type of the code book. Variables are usually set to zero whenever their absolute values are less than some predefined threshold. This type of code vectors has the ability tp produce natural sounding reconstructed speech. The design of the codebook, larger search complexity and the large storage needed are the limitation in the use of this code book Ternary codebook A ternary excitation codebook vector is sparse excitation codebook vector in which the nonzero values are replaced either by -1 (value<o) or +I (value>o). This result in a code vector consists of only three possible values. The computational complexity reduces due to the fact that multiplications are reduced to summation as the magnitude of the code vector elements is either zero or one Algebraic codebook The codebook uses algebraic codes; based in interlaced permutation codes (IPC) excitation vectors are derived. Earlier schemes of the algebraic code books used the binary codes to populate the codebook vectors. In the IPC the vectors contains few non zero pulses with predefined set of positions and pulses are allowed to take fixed amplitude, either + 1 or -1. Each pulse has a set of possible positions, distinct from the position of other pulses. The excitation code vector is determined by the position and amplitude of the non zero pulses. This codebook structure has a several merits, firstly it does not requires storage at the decoder or encoder, as the codebook index defines the code vectors completely. Secondly it defines inherent robustness against he channel errors. Finally and the most important merit of the algebraic codebook is the better search efficiency. -39-

10 4.3.4 Codebook Search Most of the computational complexity results due to the exhaustive search of the codebook vectors. To search for the optimum code vector out of the entire code book, exhaustive search of the code book is performed [). The search criterion is the minimum mean square error between the synthesized and the original speech. The mean square error can be minimized by maximizing the Tk, given by (4.2) alternative form as Where c k is code vector and, s k is the energy. This can be expressed in an (4.3) Where X is the target vector and H is the lower triangular matrix of impulse response of the STP synthesis filter. 4.4 CELP Implementation Issues From the above discussion it is clear that the computation can e broken down into three blocks: (ii) LPC analysis or STP (iii) Pitch analysis L TP (iv) Codebook search The LPC analysis and the L TP analysis have already been explained in the chapter II and needs no repetition here. First issue in the CLEP is the complexity resulted due to the exhaustive search of the fixed codebook (time complexity) and the storage of the code vectors (space complexity). A lot of research has been focused on reducing the -40-

11 complexity of the speech coding algorithm by using different codebook architecture and efficient search procedures. As a consequence a variety of codebook structures has been developed Second major issue in the CELP based speech coding algorithm is that of the efficient and transparent quantization of the LSF parameters, so as to encode the LP parameters into as few bits as possible and at the same time optimizing the computational complexity. The quantization issues of the LSF parameters have been discussed in the chapter III. Lastly, the issue of the search complexity of the pitch analysis or the adaptive codebook search. The exhaustive search for the entire range of the pitch delay has to carry out in order to estimate the pitch delay. A lot of focus is on the efficient pitch search algorithm as it is computational very complex. A pre selection based pitch lag search techniques has been implemented in this work. 4.5 Performance Evaluation Of The Speech Coders A speech coding algorithm is evaluated based on the bit rate, quality of reconstructed speech, complexity of the algorithm, algorithmic delay and robustness to channel errors. In general high quality speech coding at low bit rate is achieved by large complexity algorithms and hence longer algorithmic delay. The quality of reconstructed speech in the clean speech conditions as well as the algorithm evaluation has to perform with speech corrupted by background noise. Moreover, in some application the speech coding algorithm performance has to be checked for non speech signal such as DTMF (dual tone multifrquency) and codec performance in tandem [ 17]. general categories: For digital communication of speech the quality is classified into four Broadcast Network or toll

12 Communication Synthetic Broadcast quality refers to high quality "commentary" speech, generally achieved at a rate above 64 Kbps. Toll quality refers to the quality comparable to that of classical analog speech communication (200Hz to 3300 Hz). Toll quality can be achieved at the mid range of data rates. Communication quality implies high intelligibility, may be slightly degraded in quality but natural speech with speaker recognition. Communication quality can be achieved at rates above 4.8 Kbps. Synthetic quality is intelligible, and can be unnatural without speaker recognizability. Speech coder operating below 4.8 Kbps can generate synthetic quality. The quality of the reconstructed speech, based on the four class of quality can be quantified either based on objective measure or the subjective measure Objective measure The signal to noise ratio (SNR) is one of the most popular and common objective measure for evaluating the quality performance of a compression algorithm. It is a long term measure for the accuracy of the reconstructed speech. SNR is the ratio of average speech signal power of length N, to the reconstruction error difference. It can be expressed as N-1 z:s2(n) SNR = ~-""""" 0 '--- N-1 Z::(s(n)-s(n)) n:o (4.4) Where s(n) is the reconstructed speech and s(n) is the original speech. Temporal variation in the quality of reconstructed speech can be better evaluated by segmental SNR (SEGSNR) which is given by -42-

13 n N-l L-1 :~::>2 (in+ n) SEGSNR = lo L log "" 0 (4.5) L n N-1 2,., :L(s(iN+n)-s(iN+n)) n O Where N is the length of the segment and L are the number of segments. As an averaging operation occurs after the logarithm, the SGSNR penalizes the speech coding algorithm more, whose performance is variable Subjective Measure The previously discussed objective measure is often sensitive to both gain and delay variations and do not account for the perceptual properties of the ear. The selection of most of the low and medium bit rate coders is determined by the perceptual criterion therefore subjective evaluation [49] [50] is required. There are number of ways to subjectively evaluate the performance but the most popular method is Mean opinion score (MOS). The MOS measure is widely used to quantify the subjective performance of the reconstructed speech through the coding algorithm and original speech as well. The MOS usually involves number of listeners, who are instructed to rate to a five level scale, the quality of speech, as given in table No.4.!. MOS scale Speech quality I Bad 2 Poor 3 Fair 4 Good 5 Excellent Table No. 4.1: Mean opinion scores -43-

14 The MOS rating is obtained by averaging the values of several scores. The MOS range relates to speech quality as follows: MOS from 4.0 to 4.5 implies network quality. MOS from 3.5 to 4.0 implies communication quality. MOS from 2.5 to 3.5 implies synthetic quality -ooo- -44-

Enhanced Waveform Interpolative Coding at 4 kbps

Enhanced Waveform Interpolative Coding at 4 kbps Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression

More information

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile 8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques

More information

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:

More information

Analysis/synthesis coding

Analysis/synthesis coding TSBK06 speech coding p.1/32 Analysis/synthesis coding Many speech coders are based on a principle called analysis/synthesis coding. Instead of coding a waveform, as is normally done in general audio coders

More information

Overview of Code Excited Linear Predictive Coder

Overview of Code Excited Linear Predictive Coder Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

The Channel Vocoder (analyzer):

The Channel Vocoder (analyzer): Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.

More information

Speech Compression Using Voice Excited Linear Predictive Coding

Speech Compression Using Voice Excited Linear Predictive Coding Speech Compression Using Voice Excited Linear Predictive Coding Ms.Tosha Sen, Ms.Kruti Jay Pancholi PG Student, Asst. Professor, L J I E T, Ahmedabad Abstract : The aim of the thesis is design good quality

More information

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com

More information

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals 16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract

More information

Speech Synthesis; Pitch Detection and Vocoders

Speech Synthesis; Pitch Detection and Vocoders Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech

More information

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech

More information

International Journal of Advanced Engineering Technology E-ISSN

International Journal of Advanced Engineering Technology E-ISSN Research Article ARCHITECTURAL STUDY, IMPLEMENTATION AND OBJECTIVE EVALUATION OF CODE EXCITED LINEAR PREDICTION BASED GSM AMR 06.90 SPEECH CODER USING MATLAB Bhatt Ninad S. 1 *, Kosta Yogesh P. 2 Address

More information

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Spring,1999 Medium & High Rate Coding Lecture 26

More information

Low Bit Rate Speech Coding

Low Bit Rate Speech Coding Low Bit Rate Speech Coding Jaspreet Singh 1, Mayank Kumar 2 1 Asst. Prof.ECE, RIMT Bareilly, 2 Asst. Prof.ECE, RIMT Bareilly ABSTRACT Despite enormous advances in digital communication, the voice is still

More information

Digital Speech Processing and Coding

Digital Speech Processing and Coding ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

MASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering

MASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering 2004:003 CIV MASTER'S THESIS Speech Compression and Tone Detection in a Real-Time System Kristina Berglund MSc Programmes in Engineering Department of Computer Science and Electrical Engineering Division

More information

APPLICATIONS OF DSP OBJECTIVES

APPLICATIONS OF DSP OBJECTIVES APPLICATIONS OF DSP OBJECTIVES This lecture will discuss the following: Introduce analog and digital waveform coding Introduce Pulse Coded Modulation Consider speech-coding principles Introduce the channel

More information

COMPRESSIVE SAMPLING OF SPEECH SIGNALS. Mona Hussein Ramadan. BS, Sebha University, Submitted to the Graduate Faculty of

COMPRESSIVE SAMPLING OF SPEECH SIGNALS. Mona Hussein Ramadan. BS, Sebha University, Submitted to the Graduate Faculty of COMPRESSIVE SAMPLING OF SPEECH SIGNALS by Mona Hussein Ramadan BS, Sebha University, 25 Submitted to the Graduate Faculty of Swanson School of Engineering in partial fulfillment of the requirements for

More information

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Václav Eksler, Bruno Bessette, Milan Jelínek, Tommy Vaillancourt University of Sherbrooke, VoiceAge Corporation Montreal, QC,

More information

Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding

Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Nanda Prasetiyo Koestoer B. Eng (Hon) (1998) School of Microelectronic Engineering Faculty of Engineering and Information Technology Griffith

More information

Page 0 of 23. MELP Vocoder

Page 0 of 23. MELP Vocoder Page 0 of 23 MELP Vocoder Outline Introduction MELP Vocoder Features Algorithm Description Parameters & Comparison Page 1 of 23 Introduction Traditional pitched-excited LPC vocoders use either a periodic

More information

Comparison of CELP speech coder with a wavelet method

Comparison of CELP speech coder with a wavelet method University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2006 Comparison of CELP speech coder with a wavelet method Sriram Nagaswamy University of Kentucky, sriramn@gmail.com

More information

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia SILK Speech Codec TDP 10/11 Xavier Anguera I Ciro Gracia SILK Codec Audio codec desenvolupat per Skype (Febrer 2009) Previament usaven el codec SVOPC (Sinusoidal Voice Over Packet Coder): LPC analysis.

More information

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec Akira Nishimura 1 1 Department of Media and Cultural Studies, Tokyo University of Information Sciences,

More information

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing

More information

COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY

COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY V.C.TOGADIYA 1, N.N.SHAH 2, R.N.RATHOD 3 Assistant Professor, Dept. of ECE, R.K.College of Engg & Tech, Rajkot, Gujarat, India 1 Assistant

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008

I D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008 R E S E A R C H R E P O R T I D I A P Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath

More information

Speech Coding in the Frequency Domain

Speech Coding in the Frequency Domain Speech Coding in the Frequency Domain Speech Processing Advanced Topics Tom Bäckström Aalto University October 215 Introduction The speech production model can be used to efficiently encode speech signals.

More information

Audio Signal Compression using DCT and LPC Techniques

Audio Signal Compression using DCT and LPC Techniques Audio Signal Compression using DCT and LPC Techniques P. Sandhya Rani#1, D.Nanaji#2, V.Ramesh#3,K.V.S. Kiran#4 #Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,

More information

Lesson 8 Speech coding

Lesson 8 Speech coding Lesson 8 coding Encoding Information Transmitter Antenna Interleaving Among Frames De-Interleaving Antenna Transmission Line Decoding Transmission Line Receiver Information Lesson 8 Outline How information

More information

Ninad Bhatt Yogeshwar Kosta

Ninad Bhatt Yogeshwar Kosta DOI 10.1007/s10772-012-9178-9 Implementation of variable bitrate data hiding techniques on standard and proposed GSM 06.10 full rate coder and its overall comparative evaluation of performance Ninad Bhatt

More information

Communications Theory and Engineering

Communications Theory and Engineering Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation

More information

LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline

LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP Benjamin W. Wah Department of Electrical and Computer Engineering and the Coordinated Science Laboratory University of Illinois at Urbana-Champaign

More information

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of

More information

Transcoding of Narrowband to Wideband Speech

Transcoding of Narrowband to Wideband Speech University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

DEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD

DEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD NOT MEASUREMENT SENSITIVE 20 December 1999 DEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD ANALOG-TO-DIGITAL CONVERSION OF VOICE BY 2,400 BIT/SECOND MIXED EXCITATION LINEAR PREDICTION (MELP)

More information

Datenkommunikation SS L03 - TDM Techniques. Time Division Multiplexing (synchronous, statistical) Digital Voice Transmission, PDH, SDH

Datenkommunikation SS L03 - TDM Techniques. Time Division Multiplexing (synchronous, statistical) Digital Voice Transmission, PDH, SDH TM Techniques Time ivision Multiplexing (synchronous, statistical) igital Voice Transmission, PH, SH Agenda Introduction Synchronous (eterministic) TM Asynchronous (Statistical) TM igital Voice Transmission

More information

IN RECENT YEARS, there has been a great deal of interest

IN RECENT YEARS, there has been a great deal of interest IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL 12, NO 1, JANUARY 2004 9 Signal Modification for Robust Speech Coding Nam Soo Kim, Member, IEEE, and Joon-Hyuk Chang, Member, IEEE Abstract Usually,

More information

Scalable speech coding spanning the 4 Kbps divide

Scalable speech coding spanning the 4 Kbps divide University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2003 Scalable speech coding spanning the 4 Kbps divide J Lukasiak University

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

Waveform interpolation speech coding

Waveform interpolation speech coding University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 1998 Waveform interpolation speech coding Jun Ni University of

More information

Wideband Speech Coding & Its Application

Wideband Speech Coding & Its Application Wideband Speech Coding & Its Application Apeksha B. landge. M.E. [student] Aditya Engineering College Beed Prof. Amir Lodhi. Guide & HOD, Aditya Engineering College Beed ABSTRACT: Increasing the bandwidth

More information

Wireless Communications

Wireless Communications Wireless Communications Lecture 5: Coding / Decoding and Modulation / Demodulation Module Representive: Prof. Dr.-Ing. Hans D. Schotten schotten@eit.uni-kl.de Lecturer: Dr.-Ing. Bin Han binhan@eit.uni-kl.de

More information

Problem Sheet 1 Probability, random processes, and noise

Problem Sheet 1 Probability, random processes, and noise Problem Sheet 1 Probability, random processes, and noise 1. If F X (x) is the distribution function of a random variable X and x 1 x 2, show that F X (x 1 ) F X (x 2 ). 2. Use the definition of the cumulative

More information

Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt

Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt Pattern Recognition Part 6: Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Institute of Electrical and Information Engineering Digital Signal Processing and System Theory

More information

Voice Activity Detection

Voice Activity Detection Voice Activity Detection Speech Processing Tom Bäckström Aalto University October 2015 Introduction Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class

More information

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of

More information

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement

Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Modulator Domain Adaptive Gain Equalizer for Speech Enhancement Ravindra d. Dhage, Prof. Pravinkumar R.Badadapure Abstract M.E Scholar, Professor. This paper presents a speech enhancement method for personal

More information

Voice Excited Lpc for Speech Compression by V/Uv Classification

Voice Excited Lpc for Speech Compression by V/Uv Classification IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech

More information

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact

More information

Voice and Audio Compression for Wireless Communications

Voice and Audio Compression for Wireless Communications page 1 Voice and Audio Compression for Wireless Communications by c L. Hanzo, F.C.A. Somerville, J.P. Woodard, H-T. How School of Electronics and Computer Science, University of Southampton, UK page i

More information

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur

More information

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues

Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation

More information

Voice mail and office automation

Voice mail and office automation Voice mail and office automation by DOUGLAS L. HOGAN SPARTA, Incorporated McLean, Virginia ABSTRACT Contrary to expectations of a few years ago, voice mail or voice messaging technology has rapidly outpaced

More information

REAL-TIME IMPLEMENTATION OF A VARIABLE RATE CELP SPEECH CODEC

REAL-TIME IMPLEMENTATION OF A VARIABLE RATE CELP SPEECH CODEC REAL-TIME IMPLEMENTATION OF A VARIABLE RATE CELP SPEECH CODEC Robert Zopf B.A.Sc. Simon Fraser University, 1993 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF

More information

ENEE408G Multimedia Signal Processing

ENEE408G Multimedia Signal Processing ENEE408G Multimedia Signal Processing Design Project on Digital Speech Processing Goals: 1. Learn how to use the linear predictive model for speech analysis and synthesis. 2. Implement a linear predictive

More information

Improving Sound Quality by Bandwidth Extension

Improving Sound Quality by Bandwidth Extension International Journal of Scientific & Engineering Research, Volume 3, Issue 9, September-212 Improving Sound Quality by Bandwidth Extension M. Pradeepa, M.Tech, Assistant Professor Abstract - In recent

More information

A new quad-tree segmented image compression scheme using histogram analysis and pattern matching

A new quad-tree segmented image compression scheme using histogram analysis and pattern matching University of Wollongong Research Online University of Wollongong in Dubai - Papers University of Wollongong in Dubai A new quad-tree segmented image compression scheme using histogram analysis and pattern

More information

Techniques for low-rate scalable compression of speech signals

Techniques for low-rate scalable compression of speech signals University of Wollongong Research Online University of Wollongong Thesis Collection University of Wollongong Thesis Collections 2002 Techniques for low-rate scalable compression of speech signals Jason

More information

Analog and Telecommunication Electronics

Analog and Telecommunication Electronics Politecnico di Torino - ICT School Analog and Telecommunication Electronics D5 - Special A/D converters» Differential converters» Oversampling, noise shaping» Logarithmic conversion» Approximation, A and

More information

Speech Coding using Linear Prediction

Speech Coding using Linear Prediction Speech Coding using Linear Prediction Jesper Kjær Nielsen Aalborg University and Bang & Olufsen jkn@es.aau.dk September 10, 2015 1 Background Speech is generated when air is pushed from the lungs through

More information

Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback

Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback PURPOSE This lab will introduce you to the laboratory equipment and the software that allows you to link your computer to the hardware.

More information

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two Chapter Two Layout: 1. Introduction. 2. Pulse Code Modulation (PCM). 3. Differential Pulse Code Modulation (DPCM). 4. Delta modulation. 5. Adaptive delta modulation. 6. Sigma Delta Modulation (SDM). 7.

More information

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,

More information

6/29 Vol.7, No.2, February 2012

6/29 Vol.7, No.2, February 2012 Synthesis Filter/Decoder Structures in Speech Codecs Jerry D. Gibson, Electrical & Computer Engineering, UC Santa Barbara, CA, USA gibson@ece.ucsb.edu Abstract Using the Shannon backward channel result

More information

Spanning the 4 kbps divide using pulse modeled residual

Spanning the 4 kbps divide using pulse modeled residual University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2002 Spanning the 4 kbps divide using pulse modeled residual J Lukasiak

More information

A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder

A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder Jing Wang, Jingg Kuang, and Shenghui Zhao Research Center of Digital Communication Technology,Department of Electronic

More information

Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding

Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2000 Improved signal analysis and time-synchronous reconstruction in waveform

More information

ARIB STD-T V Audio codec processing functions; Extended Adaptive Multi-Rate - Wideband (AMR-WB+) codec; Transcoding functions

ARIB STD-T V Audio codec processing functions; Extended Adaptive Multi-Rate - Wideband (AMR-WB+) codec; Transcoding functions ARIB STD-T63-26.290 V12.0.0 Audio codec processing functions; Extended Adaptive Multi-Rate - Wideband (AMR-WB+) codec; Transcoding functions (Release 12) Refer to Industrial Property Rights (IPR) in the

More information

STUDY OF THE PERFORMANCE OF THE LINEAR AND NON-LINEAR NARROW BAND RECEIVERS FOR 2X2 MIMO SYSTEMS WITH STBC MULTIPLEXING AND ALAMOTI CODING

STUDY OF THE PERFORMANCE OF THE LINEAR AND NON-LINEAR NARROW BAND RECEIVERS FOR 2X2 MIMO SYSTEMS WITH STBC MULTIPLEXING AND ALAMOTI CODING International Journal of Electrical and Electronics Engineering Research Vol.1, Issue 1 (2011) 68-83 TJPRC Pvt. Ltd., STUDY OF THE PERFORMANCE OF THE LINEAR AND NON-LINEAR NARROW BAND RECEIVERS FOR 2X2

More information

Adaptive Forward-Backward Quantizer for Low Bit Rate. High Quality Speech Coding. University of Missouri-Columbia. Columbia, MO 65211

Adaptive Forward-Backward Quantizer for Low Bit Rate. High Quality Speech Coding. University of Missouri-Columbia. Columbia, MO 65211 Adaptive Forward-Backward Quantizer for Low Bit Rate High Quality Speech Coding Jozsef Vass Yunxin Zhao y Xinhua Zhuang Department of Computer Engineering & Computer Science University of Missouri-Columbia

More information

Final draft ETSI EN V1.3.0 ( )

Final draft ETSI EN V1.3.0 ( ) European Standard (Telecommunications series) Terrestrial Trunked Radio (TETRA); Speech codec for full-rate traffic channel; Part 2: TETRA codec 2 Reference REN/TETRA-05059 Keywords TETRA, radio, codec

More information

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) e-issn: 2278-1676,p-ISSN: 2320-3331, Volume 9, Issue 2 Ver. I (Mar Apr. 2014), PP 07-12 Implementation of attractive Speech Quality for

More information

Telecommunication Electronics

Telecommunication Electronics Politecnico di Torino ICT School Telecommunication Electronics C5 - Special A/D converters» Logarithmic conversion» Approximation, A and µ laws» Differential converters» Oversampling, noise shaping Logarithmic

More information

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC Jimmy Lapierre 1, Roch Lefebvre 1, Bruno Bessette 1, Vladimir Malenovsky 1, Redwan Salami 2 1 Université de Sherbrooke, Sherbrooke (Québec),

More information

May A uthor -... LIB Depof "Elctrical'Engineering and 'Computer Science May 21, 1999

May A uthor -... LIB Depof Elctrical'Engineering and 'Computer Science May 21, 1999 Postfiltering Techniques in Low Bit-Rate Speech Coders by Azhar K Mustapha S.B., Massachusetts Institute of Technology (1998) Submitted to the Department of Electrical Engineering and Computer Science

More information

EEE 309 Communication Theory

EEE 309 Communication Theory EEE 309 Communication Theory Semester: January 2016 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Part 05 Pulse Code

More information

CODING TECHNIQUES FOR ANALOG SOURCES

CODING TECHNIQUES FOR ANALOG SOURCES CODING TECHNIQUES FOR ANALOG SOURCES Prof.Pratik Tawde Lecturer, Electronics and Telecommunication Department, Vidyalankar Polytechnic, Wadala (India) ABSTRACT Image Compression is a process of removing

More information

Nonuniform multi level crossing for signal reconstruction

Nonuniform multi level crossing for signal reconstruction 6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven

More information

Published in: Proceesings of the 11th International Workshop on Acoustic Echo and Noise Control

Published in: Proceesings of the 11th International Workshop on Acoustic Echo and Noise Control Aalborg Universitet Voice Activity Detection Based on the Adaptive Multi-Rate Speech Codec Parameters Giacobello, Daniele; Semmoloni, Matteo; eri, Danilo; Prati, Luca; Brofferio, Sergio Published in: Proceesings

More information

Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes Petr Motlicek 12, Hynek Hermansky 123, Sriram Ganapathy 13, and Harinath Garudadri 4 1 IDIAP Research

More information

SNR Scalability, Multiple Descriptions, and Perceptual Distortion Measures

SNR Scalability, Multiple Descriptions, and Perceptual Distortion Measures SNR Scalability, Multiple Descriptions, Perceptual Distortion Measures Jerry D. Gibson Department of Electrical & Computer Engineering University of California, Santa Barbara gibson@mat.ucsb.edu Abstract

More information

ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP

ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP A. Spanias, V. Atti, Y. Ko, T. Thrasyvoulou, M.Yasin, M. Zaman, T. Duman, L. Karam, A. Papandreou, K. Tsakalis

More information

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS Mark W. Chamberlain Harris Corporation, RF Communications Division 1680 University Avenue Rochester, New York 14610 ABSTRACT The U.S. government has developed

More information

RECENTLY, there has been an increasing interest in noisy

RECENTLY, there has been an increasing interest in noisy IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 9, SEPTEMBER 2005 535 Warped Discrete Cosine Transform-Based Noisy Speech Enhancement Joon-Hyuk Chang, Member, IEEE Abstract In

More information

Evaluation of MELP Quality and Principles Marcus Ek Lars Pääjärvi Martin Sehlstedt Lule_a Technical University in cooperation with Ericsson Erisoft AB

Evaluation of MELP Quality and Principles Marcus Ek Lars Pääjärvi Martin Sehlstedt Lule_a Technical University in cooperation with Ericsson Erisoft AB Evaluation of MELP Quality and Principles Marcus Ek Lars Pääjärvi Martin Sehlstedt Lule_a Technical University in cooperation with Ericsson Erisoft AB, T/RV 3th May 2 2 Abstract This report presents an

More information

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 Analog Sound to Digital Sound Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Signal Processing in Acoustics Session 2pSP: Acoustic Signal Processing

More information

Preface, Motivation and The Speech Coding Scene

Preface, Motivation and The Speech Coding Scene Preface, Motivation and The Speech Coding Scene In the era of third-generation (3G) wireless personal communications standards, despite the emergence of broad-band access network standard proposals, the

More information

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM

HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand

More information

L19: Prosodic modification of speech

L19: Prosodic modification of speech L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture

More information

Fundamental Frequency Detection

Fundamental Frequency Detection Fundamental Frequency Detection Jan Černocký, Valentina Hubeika {cernocky ihubeika}@fit.vutbr.cz DCGM FIT BUT Brno Fundamental Frequency Detection Jan Černocký, Valentina Hubeika, DCGM FIT BUT Brno 1/37

More information

Frequency Domain Implementation of Advanced Speech Enhancement System on TMS320C6713DSK

Frequency Domain Implementation of Advanced Speech Enhancement System on TMS320C6713DSK Frequency Domain Implementation of Advanced Speech Enhancement System on TMS320C6713DSK Zeeshan Hashmi Khateeb Student, M.Tech 4 th Semester, Department of Instrumentation Technology Dayananda Sagar College

More information

Department of Electronics and Communication Engineering 1

Department of Electronics and Communication Engineering 1 UNIT I SAMPLING AND QUANTIZATION Pulse Modulation 1. Explain in detail the generation of PWM and PPM signals (16) (M/J 2011) 2. Explain in detail the concept of PWM and PAM (16) (N/D 2012) 3. What is the

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information