Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP
|
|
- Homer Barker
- 6 years ago
- Views:
Transcription
1 Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India Abstract Speech coding has been major issue in the area of digital speech processing. Speech coding is the application of data compression of digital audio signal containing speech. The speech signal in a more compressed form, which can then be transmitted with few numbers of binary digits. It is not possible to access unlimited bandwidth of a channel each time we send a signal across it which leads to code and compress speech signals. Speech compression aims to compress the speech signal to attain maximum channel capacity with lower bit rate and highest quality. Code Excited Linear Prediction (CS-ACELP) algorithm, operating at a bit rate of 8Kbps for discrete speech samples sampled at a rate of 8000 samples per second. G.729 operates to take every 10ms speech frame, the input speech signal is analyzed to extract the parameters. These parameters are then coded. G.729 is one of the widely used standard in ITU-T for speech compression. Speech compression is applied in long distance communication, high class speech storage, and message encryption. Speech coding is a lossy type of coding and hence the output signal does not exactly sound like the input.speech coding techniques discussed here are Linear predictive coding, waveform coding, Code excited linear predictive coding, etc. Keywords: Speech coding, Linear Predictive Coding (LPC), Wave form coding, Conjugate Structure Algebric Code Excited Linear Predictive Coding(CS-ACELP). I. Introduction Speech coding is the process of representing a voice signal for efficient transmission or storage. These codes will be sent over both band limited wired and Wireless channels. The aim of speech coding is to represent the samples of a speech signal in a compact form thus having the less code symbols Without degrading the quality of the speech signal. Speech coding is very important in Cellular and Mobile Communication. It has applications in Voice over internet protocol (VOIP), Videoconferencing, electronic toys, archiving, Digital simultaneous voice and data (DSVD), numerous computer based gaming and Multimedia applications, Most of the speech applications require minimum coding delay in order to avoid hindering the flow of the speech conversation because of long coding delays, A speech coder is one which converts a digitized speech signal into the coded representation and transmits it in a form of frames, At the receiving end, the speech decoder receives the coded frames and synthesizes reconstructed speech signal. The decoded speech should be audible distinguishable from the original speech signal. G.729(8kbps) is one of the famous standard for speech compression by ITU-T. The ITU-T recommendation is based on Conjugate Structure Adaptive Code Excited Linear Prediction (CS-ACELP) algorithm, operating at a bit rate of 8Kbps for discrete speech samples sampled at a rate of 8000 samples per second. G.729 operates to take every 10ms speech frame, the input speech signal is analyzed to extract the parameters. These parameters are then coded. The G.729 is a low complexity continuous data transmission scheme for VoIP applications and provide good synthesized speech quality at low bit rate. IJESAT May-Jun
2 CS-ACELP can only be used for human voice (due to the model used) and is relatively complex. 1. Speech coding techniques Speech coding techniques are mainly two types which are l.lossless and 2.lossy coding methods, The lossy coding technique have the reconstructed speech signal perceptually different from the original speech signal Whereas the lossless coding technique, the reconstructed signal at the decoder end has exactly the same shape as the input speech signal, Mostly the speech coding techniques are based on the lossy coding technique because it removes the information which is irrelevant from the perceptual quality point of View. Speech coders are classier based on the bit-rate at which they produce output with reasonable quality and on the type of coding techniques used for coding the speech signal. 2. Waveform coding Waveform coding is the simplest technique for speech coding, Waveform coders analyze code and reconstruct original signal, sample by sample. Waveform coders are used to reproduce the exact shape of the speech signal waveform, Without considering nature of human speech production and delivering system. The most commonly used waveform coding algorithms are uniform 16-bit PCM, companded 8-bit PCM and ADPCM. Waveform coding is explored in both time and frequency domain. 3. Differential Pulse Code Modulation Differential PCM (DPCM) is designed to calculate this difference and then transmit this small difference signal instead of the entire input sample signal, Since the difference between input samples is less than an entire input sample, the number of bits required for transmission is reduce. Using DPCM can reduce the bit rate of voice transmission down to 48 kbps. The input signal is sampled and modulated. The sampled input signal is stored in a predictor, The predictor takes the stored sample signal and sends it through a differentiator, The differentiator compares the previous sample signal with the current sample signal and sends this difference to the quantizing and coding phase of PCM (this phase can be uniform quantizing or companding with A-law or u-law). After quantizing and coding, the difference signal is transmitted to its final destination, At the receiving end of the network, everything is reversed, First the difference signal is dequantized, Then this difference signal is added to a sample signal stored in a predictor and sent to a low-pass filter that reconstructs the original input signal, DPCM faces some problems While dealing With voice quality. To solve this problem, adaptive DPCM is developed. 4. Linear Predictive Coding Linear Predictive Coding (LPC) is a powerful, good quality, low bit rate speech analysis technique for encoding a speech signal. The source filter model used in LPC is also known as the linear predictive coding model, It has two main components LPC analysis (encoding) and LPC synthesis (decoding), The goal of the LPC analysis is to estimate Whether the speech signal is voiced or unvoiced, to find the pitch of each frame and to the parameters needed to build the source filter model, These parameters are transmitted to the receiver will carry out LPC synthesis using the received parameters. 5. Code Excited Linear Prediction The basic principle that all speech coders exploit is the fact that speech signals are highly correlated waveforms. Speech can be represented using an autoregressive (AR) model. Along with its variants, such as algebraic CELP, relaxed CELP,low delay CELP and vector some excited linear prediction, it is currently the most widely used speech coding algorithm. It is also used in MPEG-4 audio speech coding. II. CS-ACELP DESCRIPTION A. Encoder IJESAT May-Jun
3 The CS-ACELP coder processes input signals on a frame-by-frame and sub frame-by-sub frame basis. The frame length is 10 ms and consists of two 5 ms sub frames. The algorithm utilizes vector quantization method, both the adaptive codebook and fixed codebook are vector quantized to form conjugate structure. The 8kbps core speech coder is derived from G.729 coder[2] and the coder is based on Code-Excited Linear Predictive(CELP) coding model operating on speech frame using analysis-by-synthesis method. The encoding principle of CS-ACELP is shown in Fig. 1. The encoding stages of CS-ACELP mainly contain six blocks. 1. Pre Processing Preprocessing block contains 2 stages scaling and high pass filtering.the input to the speech encoder is assumed to be a 16-bit PCM signal and it then undergoes combined scaling and high pass filtering. The scaling means, dividing the input signal by a factor two to avoid the possibility of overflows in the fixed-point implementation of coder. For high pass filtering a second order pole/zero filter with a cut-off frequency of 140 Hz is used. Both the scaling and high-pass filtering are combined and the resulting filter is given by (z) = The input signal filtered through Hh1(z) is referred to as s(n), and will be used in all subsequent coder operations. 2. LP Analysis The linear prediction(lp) technique, taking the advantage of order of linear prediction filter, is the most frequently used technique for speech analysis. LP analysis block is shown in Fig. 3. Reflection coefficients are obtained as by product of Levinson Durbin algorithm in LP analysis. The short term analysis and synthesis are based on 10th order LP filter. The LP synthesis filter is defined as: = The quantization of LSP parameters are obtained by using predictive twostage quantization. Fig. 1. LP ANALYSIS BLOCK The interpolated quantized and unquantized filters are converted back to the LP filter coefficients (to construct the synthesis and weighting filters for each sub frame). IJESAT May-Jun
4 where and are the quantized adaptive and fixed-codebook gains, respectively, v(n) is the adaptive codebook vector (interpolated past excitation), and C(n) is the fixed-codebook vector including harmonic enhancement. The states of the filters can be updated by filtering the signal r(n) -u(n) (difference between residual and excitation) through the filter and A( )/A( for the 40 sample subframe and saving the states of the filters. This would require three filter operations. A simpler approach, which requires only one filter operation, is as follows. The locally reconstructed speech (n) is computed by filtering the excitation signal through. The output of the filter due to the input r(n) -u(n) is equivalent to e(n) = s(n) - (n). So the states of the synthesis filter l/ (z) are given by e(n), r1 = 30,...,39. Updating the states of the filter A(z/ )/A(z/ ) can be done by filtering the error signal e(n) through this filter to find the perceptually weighted error ew(n). However, the signal ew(n) can be equivalently found by: ew(n) = X(n)- Y(n)- Z(n) Since the signals x(n), y(n) and z(n) are available, the states of the weighting filter are updated by computing ew(n) as in equation for n = 30,...,39. This saves two filter operations. Fig. 2. CS-ACELP ALGORITHM An update of the states of the synthesis and weighting filters is needed to compute the target signal in the next sub frame. After the two gains are quantized, the excitation signal, u(n), in the present sub frame is obtained using: u(n)= V(n)+ C(n) n=0,3,39 III. DECODER The decoder principle is shown in Figure 2 (b). First, the parameter s indices are extracted from the received bit stream. These indices are decoded to obtain the coder parameters corresponding to a 10 ms speech frame. These parameters are the LSP coefficients, the two fractional pitch delays, the two fixed codebook vectors, and the two sets of adaptive and fixed codebook gains. The LSP coefficients are interpolated and IJESAT May-Jun
5 converted to LP filter coefficients for each sub frame. Figure 3. Decoding principle of CS-ACELP Codec IV. METHOD The CS-ACELP algorithm was simulated using MATLAB R2010. The MATLAB application supports the import and export of data in various file format. The objective measurements like Segmented SNR (segsnr), Log Likelihood ratio (LLR), Weighted Spectral Slope Measures (WSS) and Perceptual Evaluation of Speech Quality (PESQ) of CS-ACELP was also calculated using MATLAB. Segmental SNR (SSNR) is defined as the average of SNR values over segments with speech activity. LLR which compares LPC vector of original speech signal with reconstructed speech. The Weighted Spectral Slope (WSS) distance measure is a direct spectral distance measure. It is based on comparison of smoothed spectra from the clean and distorted speech samples. The PESQ, is a family of standards comprising a test methodology for automated assessment of the speech quality as experienced by a user of a telephony system. PESQ values ranges from 0.5 to 4.5. Higher values of PSEQ provides better quality. For analysis of the output speech files, Praat software package was used. Praat consist of two part, Praat object and Praat picture. Spectrogram analysis of input and output speech of CS-ACELP was performed. Time domain representation of speech signal, information regarding pitch, intensity and formants was extracted. Formant frequencies f1, f2, f3,f4 and bandwidth information were also extracted. The Praat Objects window is used to open existing sound files or create. V. RESULT This section describes the results of CS-ACELP algorithm obtained using the MATLAB R2010a. A handel.wav file is used as a test signal for coder implementation and handeldec is the reconstructed signal. First the file compression results are performed, then the analysis details and objective measurements were displayed. The objective measurements of are shown in Table 4. From the analysis of PSEQ value of speech signal, coder works with a better quality. From the Praat analysis of input and output of CS- ACELP coder, sound pressure or amplitude of the sound waves in Pascal are shown in Table 2. The very slight differences in the minimum, maximum, root mean square (RMS) and mean values are representative of the good quality of output of the vocoder. The formant frequencies and corresponding bandwidths are listed in Table 3. The combined time domain representation of the input and output sound waves are shown in Fig. 6. From the time domain representation, it can be concluded that the overall shape has been preserved. But peaks have been clipped at some portions The waveforms of intensity, pitch and formants of the sound files are shown in Figs.4 to Figs. 5 Table 1. CS-ACELP CODER AMPLITUDE (SOUND PRESSURE) Input(handel.wav) Output(handeldec.wav) Amplitude(in pa) maximum minimum mean 2.49x 5.05x rms Table 2. CS-ACELP CODER FORMATS AND BANDWIDTH Parameter( in Hz) handel(wav) handeldec(wav) F F F F BW IJESAT May-Jun
6 BW BW BW Table 3. OBJECTIVE MEASUREMENTS File name Objective Measures SNRseg WSS LLR PESQ handel.wav male.wav child.wav female.wav (b) CS-ACELP output file (a) Original sound file IJESAT May-Jun
7 (a) Original sound file Fig.4. Intensity waveforms (b) CS-ACELP output file IJESAT May-Jun
8 Fig. 5. Pitch waveforms (b) CS-ACELP output file (a) Original sound file Fig. 6. Formants of speech V. CONCLUSION The implementation and analysis of an efficient algorithm for providing secured speech transmission for various application with different speech input is described in this paper. Praat tools have proven to be very handy in speech file analysis. From the analysis of formant, pitch and intensity graphs of the input and output files clearly have very great similarity with the input wave form. From the experimental results, it is evident that the algorithm yields good compression and obtain very good perceptual quality. REFERENCE: [1] Nimisha Susan Jacob,Ancy S. Anselam, Performance Analysis Of CS-ACELP Speech Coder ( IJEAT) ISSN: ISSUE-5,June [2] An Efficient Algebric Codebook Search for G.729 speech codec IEEE,19 June IJESAT May-Jun
9 [3] Koji Seto and Tokunbo Ogunfunni, Scalable Wideband Speech Coding for IP Networks, Dec.2012 [4] T. Ogunfunmi and M. 1. Narasimha, "Speech over VoIP Networks: Advanced Signal Processing and System Implementation," IEEE Circuits and Systems Magazine, Vol. 12, no. 2, pp , [5] K. Seto and T. Ogunfunmi, "Scalable Multi-Rate ilbc," Proceedings of IEEE International Symposium on Circuits and Systems, [6] G. Madre et al., Design of a variable rate algorithm for CS-ACELP coder, IEEE, [7] A. Johnston, S. Donovan, R. Sparks, C. Cunningham, and K. Summers, Session Initiation Protocol (SIP) Public Switched Telephone Network (PSTN) call flows, RFC 3666, December [8] ITU-T Recommendation P.862 Amendment 1, Source code for reference implementation and conformance tests, March [9] ITU-T G.729: Coding of speech at 8 kb/s using CS-ACELP.R. Salami et al., Description of the proposed ITU-T 8 kb/s speech coding standard, in Proc. IEEE Workshop on Speech Coding. IJESAT May-Jun
Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder
COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech
More informationOverview of Code Excited Linear Predictive Coder
Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances
More informationDigital Speech Processing and Coding
ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationProceedings of Meetings on Acoustics
Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Signal Processing in Acoustics Session 2pSP: Acoustic Signal Processing
More informationAudio Signal Compression using DCT and LPC Techniques
Audio Signal Compression using DCT and LPC Techniques P. Sandhya Rani#1, D.Nanaji#2, V.Ramesh#3,K.V.S. Kiran#4 #Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,
More informationChapter IV THEORY OF CELP CODING
Chapter IV THEORY OF CELP CODING CHAPTER IV THEORY OF CELP CODING 4.1 Introduction Wavefonn coders fail to produce high quality speech at bit rate lower than 16 kbps. Source coders, such as LPC vocoders,
More informationtechniques are means of reducing the bandwidth needed to represent the human voice. In mobile
8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques
More informationEnhanced Waveform Interpolative Coding at 4 kbps
Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationEC 6501 DIGITAL COMMUNICATION UNIT - II PART A
EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing
More informationCOMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY
COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY V.C.TOGADIYA 1, N.N.SHAH 2, R.N.RATHOD 3 Assistant Professor, Dept. of ECE, R.K.College of Engg & Tech, Rajkot, Gujarat, India 1 Assistant
More informationAPPLICATIONS OF DSP OBJECTIVES
APPLICATIONS OF DSP OBJECTIVES This lecture will discuss the following: Introduce analog and digital waveform coding Introduce Pulse Coded Modulation Consider speech-coding principles Introduce the channel
More informationThe Optimization of G.729 Speech codec and Implementation on the TMS320VC5402
4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 015) The Optimization of G.79 Speech codec and Implementation on the TMS30VC540 1 Geng wang 1, a, Wei
More informationCellular systems & GSM Wireless Systems, a.a. 2014/2015
Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:
More informationComparison of CELP speech coder with a wavelet method
University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2006 Comparison of CELP speech coder with a wavelet method Sriram Nagaswamy University of Kentucky, sriramn@gmail.com
More informationThe Channel Vocoder (analyzer):
Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.
More informationTranscoding of Narrowband to Wideband Speech
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University
More informationEE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley
University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Spring,1999 Medium & High Rate Coding Lecture 26
More informationVocoder (LPC) Analysis by Variation of Input Parameters and Signals
ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of
More informationWideband Speech Coding & Its Application
Wideband Speech Coding & Its Application Apeksha B. landge. M.E. [student] Aditya Engineering College Beed Prof. Amir Lodhi. Guide & HOD, Aditya Engineering College Beed ABSTRACT: Increasing the bandwidth
More informationVoice Excited Lpc for Speech Compression by V/Uv Classification
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech
More informationImplementation of attractive Speech Quality for Mixed Excited Linear Prediction
IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) e-issn: 2278-1676,p-ISSN: 2320-3331, Volume 9, Issue 2 Ver. I (Mar Apr. 2014), PP 07-12 Implementation of attractive Speech Quality for
More informationFlexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders
Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Václav Eksler, Bruno Bessette, Milan Jelínek, Tommy Vaillancourt University of Sherbrooke, VoiceAge Corporation Montreal, QC,
More informationWaveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two
Chapter Two Layout: 1. Introduction. 2. Pulse Code Modulation (PCM). 3. Differential Pulse Code Modulation (DPCM). 4. Delta modulation. 5. Adaptive delta modulation. 6. Sigma Delta Modulation (SDM). 7.
More informationSpeech Compression Using Voice Excited Linear Predictive Coding
Speech Compression Using Voice Excited Linear Predictive Coding Ms.Tosha Sen, Ms.Kruti Jay Pancholi PG Student, Asst. Professor, L J I E T, Ahmedabad Abstract : The aim of the thesis is design good quality
More informationScalable Speech Coding for IP Networks
Santa Clara University Scholar Commons Engineering Ph.D. Theses Student Scholarship 8-24-2015 Scalable Speech Coding for IP Networks Koji Seto Santa Clara University Follow this and additional works at:
More informationInformation. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract
LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding Takehiro Moriya Abstract Line Spectrum Pair (LSP) technology was accepted as an IEEE (Institute of Electrical and Electronics
More informationLOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline
LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP Benjamin W. Wah Department of Electrical and Computer Engineering and the Coordinated Science Laboratory University of Illinois at Urbana-Champaign
More informationE : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21
E85.267: Lecture 8 Source-Filter Processing E85.267: Lecture 8 Source-Filter Processing 21-4-1 1 / 21 Source-filter analysis/synthesis n f Spectral envelope Spectral envelope Analysis Source signal n 1
More informationAnalysis/synthesis coding
TSBK06 speech coding p.1/32 Analysis/synthesis coding Many speech coders are based on a principle called analysis/synthesis coding. Instead of coding a waveform, as is normally done in general audio coders
More information10 Speech and Audio Signals
0 Speech and Audio Signals Introduction Speech and audio signals are normally converted into PCM, which can be stored or transmitted as a PCM code, or compressed to reduce the number of bits used to code
More informationNOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or
NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or other reproductions of copyrighted material. Any copying
More informationCommunications Theory and Engineering
Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation
More informationWaveform Coding Algorithms: An Overview
August 24, 2012 Waveform Coding Algorithms: An Overview RWTH Aachen University Compression Algorithms Seminar Report Summer Semester 2012 Adel Zaalouk - 300374 Aachen, Germany Contents 1 An Introduction
More informationAdaptive time scale modification of speech for graceful degrading voice quality in congested networks
Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact
More informationPulse Code Modulation
Pulse Code Modulation EE 44 Spring Semester Lecture 9 Analog signal Pulse Amplitude Modulation Pulse Width Modulation Pulse Position Modulation Pulse Code Modulation (3-bit coding) 1 Advantages of Digital
More informationData Transmission at 16.8kb/s Over 32kb/s ADPCM Channel
IOSR Journal of Engineering (IOSRJEN) ISSN: 2250-3021 Volume 2, Issue 6 (June 2012), PP 1529-1533 www.iosrjen.org Data Transmission at 16.8kb/s Over 32kb/s ADPCM Channel Muhanned AL-Rawi, Muaayed AL-Rawi
More informationAudio Compression using the MLT and SPIHT
Audio Compression using the MLT and SPIHT Mohammed Raad, Alfred Mertins and Ian Burnett School of Electrical, Computer and Telecommunications Engineering University Of Wollongong Northfields Ave Wollongong
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationComparative Analysis between DWT and WPD Techniques of Speech Compression
IOSR Journal of Engineering (IOSRJEN) ISSN: 225-321 Volume 2, Issue 8 (August 212), PP 12-128 Comparative Analysis between DWT and WPD Techniques of Speech Compression Preet Kaur 1, Pallavi Bahl 2 1 (Assistant
More informationAn objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec
An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec Akira Nishimura 1 1 Department of Media and Cultural Studies, Tokyo University of Information Sciences,
More informationPulse Code Modulation
Pulse Code Modulation Modulation is the process of varying one or more parameters of a carrier signal in accordance with the instantaneous values of the message signal. The message signal is the signal
More informationEEE 309 Communication Theory
EEE 309 Communication Theory Semester: January 2016 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Part 05 Pulse Code
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationDigital Communication (650533) CH 3 Pulse Modulation
Philadelphia University/Faculty of Engineering Communication and Electronics Engineering Digital Communication (650533) CH 3 Pulse Modulation Instructor: Eng. Nada Khatib Website: http://www.philadelphia.edu.jo/academics/nkhatib/
More informationON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP
ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP A. Spanias, V. Atti, Y. Ko, T. Thrasyvoulou, M.Yasin, M. Zaman, T. Duman, L. Karam, A. Papandreou, K. Tsakalis
More informationSpeech Coding using Linear Prediction
Speech Coding using Linear Prediction Jesper Kjær Nielsen Aalborg University and Bang & Olufsen jkn@es.aau.dk September 10, 2015 1 Background Speech is generated when air is pushed from the lungs through
More informationDEPARTMENT OF INFORMATION TECHNOLOGY QUESTION BANK. Subject Name: Information Coding Techniques UNIT I INFORMATION ENTROPY FUNDAMENTALS
DEPARTMENT OF INFORMATION TECHNOLOGY QUESTION BANK Subject Name: Year /Sem: II / IV UNIT I INFORMATION ENTROPY FUNDAMENTALS PART A (2 MARKS) 1. What is uncertainty? 2. What is prefix coding? 3. State the
More informationWideband Speech Encryption Based Arnold Cat Map for AMR-WB G Codec
Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G.722.2 Codec Fatiha Merazka Telecommunications Department USTHB, University of science & technology Houari Boumediene P.O.Box 32 El Alia 6 Bab
More informationA Novel Approach of Compressing Images and Assessment on Quality with Scaling Factor
A Novel Approach of Compressing Images and Assessment on Quality with Scaling Factor Umesh 1,Mr. Suraj Rana 2 1 M.Tech Student, 2 Associate Professor (ECE) Department of Electronic and Communication Engineering
More information6/29 Vol.7, No.2, February 2012
Synthesis Filter/Decoder Structures in Speech Codecs Jerry D. Gibson, Electrical & Computer Engineering, UC Santa Barbara, CA, USA gibson@ece.ucsb.edu Abstract Using the Shannon backward channel result
More informationPerformance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue - 8 August, 2014 Page No. 7727-7732 Performance Analysis of MFCC and LPCC Techniques in Automatic
More informationDigital Audio. Lecture-6
Digital Audio Lecture-6 Topics today Digitization of sound PCM Lossless predictive coding 2 Sound Sound is a pressure wave, taking continuous values Increase / decrease in pressure can be measured in amplitude,
More informationAnalog and Telecommunication Electronics
Politecnico di Torino - ICT School Analog and Telecommunication Electronics D5 - Special A/D converters» Differential converters» Oversampling, noise shaping» Logarithmic conversion» Approximation, A and
More informationHIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM
HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand
More informationSpeech Synthesis; Pitch Detection and Vocoders
Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech
More informationMASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering
2004:003 CIV MASTER'S THESIS Speech Compression and Tone Detection in a Real-Time System Kristina Berglund MSc Programmes in Engineering Department of Computer Science and Electrical Engineering Division
More informationInternational Journal of Advanced Engineering Technology E-ISSN
Research Article ARCHITECTURAL STUDY, IMPLEMENTATION AND OBJECTIVE EVALUATION OF CODE EXCITED LINEAR PREDICTION BASED GSM AMR 06.90 SPEECH CODER USING MATLAB Bhatt Ninad S. 1 *, Kosta Yogesh P. 2 Address
More informationECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2
ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 Analog Sound to Digital Sound Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre
More informationCompression. Encryption. Decryption. Decompression. Presentation of Information to client site
DOCUMENT Anup Basu Audio Image Video Data Graphics Objectives Compression Encryption Network Communications Decryption Decompression Client site Presentation of Information to client site Multimedia -
More information-/$5,!4%$./)3% 2%&%2%.#% 5.)4 -.25
INTERNATIONAL TELECOMMUNICATION UNION )454 0 TELECOMMUNICATION (02/96) STANDARDIZATION SECTOR OF ITU 4%,%0(/.% 42!.3-)33)/. 15!,)49 -%4(/$3 &/2 /"*%#4)6%!.$ 35"*%#4)6%!33%33-%.4 /& 15!,)49 -/$5,!4%$./)3%
More informationLesson 8 Speech coding
Lesson 8 coding Encoding Information Transmitter Antenna Interleaving Among Frames De-Interleaving Antenna Transmission Line Decoding Transmission Line Receiver Information Lesson 8 Outline How information
More informationVoice Transmission --Basic Concepts--
Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Telephone Handset (has 2-parts) 2 1. Transmitter
More informationNOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING
NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of
More informationDepartment of Electronics and Communication Engineering 1
UNIT I SAMPLING AND QUANTIZATION Pulse Modulation 1. Explain in detail the generation of PWM and PPM signals (16) (M/J 2011) 2. Explain in detail the concept of PWM and PAM (16) (N/D 2012) 3. What is the
More informationQUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold
QUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold circuit 2. What is the difference between natural sampling
More informationTelecommunication Electronics
Politecnico di Torino ICT School Telecommunication Electronics C5 - Special A/D converters» Logarithmic conversion» Approximation, A and µ laws» Differential converters» Oversampling, noise shaping Logarithmic
More informationCHAPTER 3 Syllabus (2006 scheme syllabus) Differential pulse code modulation DPCM transmitter
CHAPTER 3 Syllabus 1) DPCM 2) DM 3) Base band shaping for data tranmission 4) Discrete PAM signals 5) Power spectra of discrete PAM signal. 6) Applications (2006 scheme syllabus) Differential pulse code
More informationNOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC
NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC Jimmy Lapierre 1, Roch Lefebvre 1, Bruno Bessette 1, Vladimir Malenovsky 1, Redwan Salami 2 1 Université de Sherbrooke, Sherbrooke (Québec),
More informationEEE 309 Communication Theory
EEE 309 Communication Theory Semester: January 2017 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Types of Modulation
More informationDatenkommunikation SS L03 - TDM Techniques. Time Division Multiplexing (synchronous, statistical) Digital Voice Transmission, PDH, SDH
TM Techniques Time ivision Multiplexing (synchronous, statistical) igital Voice Transmission, PH, SH Agenda Introduction Synchronous (eterministic) TM Asynchronous (Statistical) TM igital Voice Transmission
More informationRobust Linear Prediction Analysis for Low Bit-Rate Speech Coding
Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Nanda Prasetiyo Koestoer B. Eng (Hon) (1998) School of Microelectronic Engineering Faculty of Engineering and Information Technology Griffith
More informationLow Bit Rate Speech Coding
Low Bit Rate Speech Coding Jaspreet Singh 1, Mayank Kumar 2 1 Asst. Prof.ECE, RIMT Bareilly, 2 Asst. Prof.ECE, RIMT Bareilly ABSTRACT Despite enormous advances in digital communication, the voice is still
More informationPULSE CODE MODULATION (PCM)
PULSE CODE MODULATION (PCM) 1. PCM quantization Techniques 2. PCM Transmission Bandwidth 3. PCM Coding Techniques 4. PCM Integrated Circuits 5. Advantages of PCM 6. Delta Modulation 7. Adaptive Delta Modulation
More informationSpeech synthesizer. W. Tidelund S. Andersson R. Andersson. March 11, 2015
Speech synthesizer W. Tidelund S. Andersson R. Andersson March 11, 2015 1 1 Introduction A real time speech synthesizer is created by modifying a recorded signal on a DSP by using a prediction filter.
More informationLecture Outline. Data and Signals. Analogue Data on Analogue Signals. OSI Protocol Model
Lecture Outline Data and Signals COMP312 Richard Nelson richardn@cs.waikato.ac.nz http://www.cs.waikato.ac.nz Analogue Data on Analogue Signals Digital Data on Analogue Signals Analogue Data on Digital
More informationInternational Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015
International Journal of Modern Trends in Engineering and Research www.ijmter.com e-issn No.:2349-9745, Date: 2-4 July, 2015 Analysis of Speech Signal Using Graphic User Interface Solly Joy 1, Savitha
More informationRealization and Performance Evaluation of New Hybrid Speech Compression Technique
Realization and Performance Evaluation of New Hybrid Speech Compression Technique Javaid A. Sheikh Post Graduate Department of Electronics & IT University of Kashmir Srinagar, India E-mail: sjavaid_29ku@yahoo.co.in
More informationSpeech Enhancement using Wiener filtering
Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing
More informationEC 2301 Digital communication Question bank
EC 2301 Digital communication Question bank UNIT I Digital communication system 2 marks 1.Draw block diagram of digital communication system. Information source and input transducer formatter Source encoder
More informationTree Encoding in the ITU-T G Speech Coder
Tree Encoding in the ITU-T G.711.1 Speech Abdul Hannan Khan Department of Electrical Computer and Software Engineering McGill University Montreal, Canada November, A thesis submitted to McGill University
More informationON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY
ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY D. Nagajyothi 1 and P. Siddaiah 2 1 Department of Electronics and Communication Engineering, Vardhaman College of Engineering, Shamshabad, Telangana,
More informationLow Bit Rate Speech Coding Using Differential Pulse Code Modulation
Advances in Research 8(3): 1-6, 2016; Article no.air.30234 ISSN: 2348-0394, NLM ID: 101666096 SCIENCEDOMAIN international www.sciencedomain.org Low Bit Rate Speech Coding Using Differential Pulse Code
More informationA Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder
A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder Jing Wang, Jingg Kuang, and Shenghui Zhao Research Center of Digital Communication Technology,Department of Electronic
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationImage Compression using DPCM
GRD Journals- Global Research and Development Journal for Engineering Volume 2 Issue 4 March 2017 ISSN: 2455-5703 Image Compression using DPCM Reenu Sharma PG Student Department of Electronics & Communication
More informationIMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM
IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationPhysical Layer: Outline
18-345: Introduction to Telecommunication Networks Lectures 3: Physical Layer Peter Steenkiste Spring 2015 www.cs.cmu.edu/~prs/nets-ece Physical Layer: Outline Digital networking Modulation Characterization
More informationCHAPTER 4. PULSE MODULATION Part 2
CHAPTER 4 PULSE MODULATION Part 2 Pulse Modulation Analog pulse modulation: Sampling, i.e., information is transmitted only at discrete time instants. e.g. PAM, PPM and PDM Digital pulse modulation: Sampling
More informationFinal draft ETSI EN V1.3.0 ( )
European Standard (Telecommunications series) Terrestrial Trunked Radio (TETRA); Speech codec for full-rate traffic channel; Part 2: TETRA codec 2 Reference REN/TETRA-05059 Keywords TETRA, radio, codec
More information3GPP TS V5.0.0 ( )
TS 26.171 V5.0.0 (2001-03) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Speech Codec speech processing functions; AMR Wideband
More informationAdaptive Forward-Backward Quantizer for Low Bit Rate. High Quality Speech Coding. University of Missouri-Columbia. Columbia, MO 65211
Adaptive Forward-Backward Quantizer for Low Bit Rate High Quality Speech Coding Jozsef Vass Yunxin Zhao y Xinhua Zhuang Department of Computer Engineering & Computer Science University of Missouri-Columbia
More informationAudio and Speech Compression Using DCT and DWT Techniques
Audio and Speech Compression Using DCT and DWT Techniques M. V. Patil 1, Apoorva Gupta 2, Ankita Varma 3, Shikhar Salil 4 Asst. Professor, Dept.of Elex, Bharati Vidyapeeth Univ.Coll.of Engg, Pune, Maharashtra,
More informationREAL-TIME IMPLEMENTATION OF A VARIABLE RATE CELP SPEECH CODEC
REAL-TIME IMPLEMENTATION OF A VARIABLE RATE CELP SPEECH CODEC Robert Zopf B.A.Sc. Simon Fraser University, 1993 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF
More informationComparison of Low-Rate Speech Transcoders in Electronic Warfare Situations: Ambe-3000 to G.711, G.726, CVSD
Comparison of Low-Rate Speech Transcoders in Electronic Warfare Situations: Ambe-3000 to G.711, G.726, CVSD V. Govindu Department of ECE, UCEK, JNTUK, Kakinada, India 533003. Parthraj Tripathi Defence
More informationAN ABSTRACT OF THE THESIS OF. Meeta Bhutani for the degree of Master of Science in Electrical and Computer
AN ABSTRACT OF THE THESIS OF Meeta Bhutani for the degree of Master of Science in Electrical and Computer Engineering presented on August 31,1998. Title: Comparison of DPCM and Subband Codec Performance
More informationDilpreet Singh 1, Parminder Singh 2 1 M.Tech. Student, 2 Associate Professor
A Novel Approach for Waveform Compression Dilpreet Singh 1, Parminder Singh 2 1 M.Tech. Student, 2 Associate Professor CSE Department, Guru Nanak Dev Engineering College, Ludhiana Abstract Waveform Compression
More informationSpeech Coding in the Frequency Domain
Speech Coding in the Frequency Domain Speech Processing Advanced Topics Tom Bäckström Aalto University October 215 Introduction The speech production model can be used to efficiently encode speech signals.
More information