The Optimization of G.729 Speech codec and Implementation on the TMS320VC5402

Similar documents
Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Overview of Code Excited Linear Predictive Coder

EE482: Digital Signal Processing Applications

Comparison of Low-Rate Speech Transcoders in Electronic Warfare Situations: Ambe-3000 to G.711, G.726, CVSD

COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY

APPLICATIONS OF DSP OBJECTIVES

Transcoding of Narrowband to Wideband Speech

A New Simulation Technology Research for Missile Control System based on DSP. Bin Tian*, Jianqiao Yu, Yuesong Mei

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

Speech Compression Using Voice Excited Linear Predictive Coding

Open Access Improved Frame Error Concealment Algorithm Based on Transform- Domain Mobile Audio Codec

A DSP IMPLEMENTED DIGITAL FM MULTIPLEXING SYSTEM

10 Speech and Audio Signals

Experiment # (3) PCM Modulator

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

FIR Filter for Audio Signals Based on FPGA: Design and Implementation

Design Implementation Description for the Digital Frequency Oscillator

DSP BASED SYSTEM FOR SYNCHRONOUS GENERATOR EXCITATION CONTROLL

CHAPTER 3 Syllabus (2006 scheme syllabus) Differential pulse code modulation DPCM transmitter

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Enhanced Waveform Interpolative Coding at 4 kbps

BPSK_DEMOD. Binary-PSK Demodulator Rev Key Design Features. Block Diagram. Applications. General Description. Generic Parameters

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

Proceedings of Meetings on Acoustics

Datenkommunikation SS L03 - TDM Techniques. Time Division Multiplexing (synchronous, statistical) Digital Voice Transmission, PDH, SDH

YEDITEPE UNIVERSITY ENGINEERING FACULTY COMMUNICATION SYSTEMS LABORATORY EE 354 COMMUNICATION SYSTEMS

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders

IMPLEMENTATION OF G.726 ITU-T VOCODER ON A SINGLE CHIP USING VHDL

CHAPTER 5. Digitized Audio Telemetry Standard. Table of Contents

Audio Signal Compression using DCT and LPC Techniques

Voice Excited Lpc for Speech Compression by V/Uv Classification

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two

Chapter IV THEORY OF CELP CODING

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction

The Channel Vocoder (analyzer):

SIMULATION AND PROGRAM REALIZATION OF RECURSIVE DIGITAL FILTERS

Communications Theory and Engineering

LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline

Analog and Telecommunication Electronics

Analysis/synthesis coding

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley

Design of stepper motor position control system based on DSP. Guan Fang Liu a, Hua Wei Li b

UNIT TEST I Digital Communication

FPGA Design of Speech Compression by Using Discrete Wavelet Transform

Voice Transmission --Basic Concepts--

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

A MULTI-CHANNEL SIGNAL SOURCES BASED ON FPGA AND DDS

DIGITAL COMMUNICATION

DIGITAL SIGNAL PROCESSING LABORATORY

A GENERAL SYSTEM DESIGN & IMPLEMENTATION OF SOFTWARE DEFINED RADIO SYSTEM

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec

Comparison of CELP speech coder with a wavelet method

Digital Speech Processing and Coding

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

Review Article AVS-M Audio: Algorithm and Implementation

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

International Journal of Advanced Engineering Technology E-ISSN

Telecommunication Electronics

3. DAC Architectures and CMOS Circuits


INTRODUCTION TO COMMUNICATION SYSTEMS LABORATORY IV. Binary Pulse Amplitude Modulation and Pulse Code Modulation

Information. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract

ZLS38500 Firmware for Handsfree Car Kits

CHAPTER 4. PULSE MODULATION Part 2

Design of Spread-Spectrum Communication System Based on FPGA

Speech Synthesis using Mel-Cepstral Coefficient Feature

The Design and Simulation of Embedded FIR Filter based on FPGA and DSP Builder

Design and FPGA Implementation of High-speed Parallel FIR Filters

Chapter 5: Signal conversion

Data Transmission at 16.8kb/s Over 32kb/s ADPCM Channel

PULSE CODE MODULATION (PCM)

CHAPTER 2 FIR ARCHITECTURE FOR THE FILTER BANK OF SPEECH PROCESSOR

Design & Implementation of an Adaptive Delta Sigma Modulator

MODULATION AND MULTIPLE ACCESS TECHNIQUES

DESIGN AND IMPLEMENTATION OF CELP SPEECH PROCESSING SYSTEM USING TMS320C30

Design of the Chaotic Signal Generator Based on LABVIEW

SGN Audio and Speech Processing

Corona Current Data Acquisition Card Based on USB Bus in Extra High Voltage Environment

Block Diagram. i_in. q_in (optional) clk. 0 < seed < use both ports i_in and q_in

Pulse Code Modulation

EE 264 DSP Project Report

A Software Phase-Locked Loop from Theory to Practice: TMS320C6000 DSP Based Implementation and Analysis

Speech synthesizer. W. Tidelund S. Andersson R. Andersson. March 11, 2015

Optimized BPSK and QAM Techniques for OFDM Systems

System analysis and signal processing

Design of double loop-locked system for brush-less DC motor based on DSP

DATASHEET HSP Features. Description. Applications. Ordering Information. Block Diagram. Digital QPSK Demodulator. FN4162 Rev 3.

Experiment # 4. Frequency Modulation

CODING TECHNIQUES FOR ANALOG SOURCES

AN IMPLEMENTATION OF MULTI-DSP SYSTEM ARCHITECTURE FOR PROCESSING VARIANT LENGTH FRAME FOR WEATHER RADAR

Chapter 2: Digitization of Sound

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia

DSP Project. Reminder: Project proposal is due Friday, October 19, 2012 by 5pm in my office (Small 239).

USB4. Encoder Data Acquisition USB Device Page 1 of 8. Description. Features

Synthesis of speech with a DSP

Exploration of Digital Frequency Band System

Media Devices: Audio. CTEC1465/2018S Computer System Support

Introduction to Real-Time Digital Signal Processing

Transcription:

4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 015) The Optimization of G.79 Speech codec and Implementation on the TMS30VC540 1 Geng wang 1, a, Wei cheng, b The School of Electronic and Information Engineering, Hubei University of Science and Technology,Xianning, 437100,China The School of Electronic and Information Engineering, Hubei University of Science and Technology, Xianning, 437100, China a wanggeng000198@16.com, b799040609@qq.com Keywords: G.79;TMS30VC540;TLV30AIC3;Speech Codec Abstract.This paper designs a realtime speech codec system based on the TMS30C540 platform, from two aspects of data transmission interface and control interface for the TMS30C540 and TLV30AIC3 interfacedesign, and through the g. 79 speech decoding algorithm principle research and program optimization on the platform to realize the real-time algorithm. Introduction The G.79 algorithm is very complicated, and the real-time effect is dissatisfactory, but with rapid development of the DSP which has a strong operation ability and research on all kinds of optimization methods, this algorithm has become the mainstream in the moderate and low bit rate speech coding algorithm. The input signal of the algorithm process is a 16-bit linear PCM encoded speech signal by 8kHz sampling. The output rate is 8kbps of binary bit stream with a compression ratio of 16: 1 or so after the encoding processing. When algorithm DSP processed speech signal is used, the communication module need peripheral speech signal acquisition and necessary processing, such as signal amplification, A / D. Bitstream coded after storage and transmission can recover high-quality voice signal through the corresponding decoding process. Currently, G.79 codec has been widely used in various fields of data communications, such as H.33 and IP Phone. Codec Program Design And Optimization Codec Program Design The object processed by the encoder is each 10ms frame of speech, which is 80 sampling points, and the average signal for each frame is divided into two sub-frames. Each frame signal is analyzed to extract the relevant model and excitation parameters, which are to be encoded. To reduce the computational overflow phenomenon, the amplitude of the input signal is processed by half with a cutoff frequency of 140Hz high-pass filter to filter out low frequency noise during the preprocessing phase. In linear prediction analysis stage, the autocorrelation coefficients should be calculated first, and then the 60Hz bandwidth expansion autocorrelation coefficient correction, and last linear prediction coefficients can be obtained using the resulting coefficients after Levinson - Durbin algorithm processing[1]. For quantization and interpolation, line spectrum pair should be transferred into prediction coefficients. The line spectrum pair coefficients are quantized and interpolated, and finally reduced to line spectrum pair coefficients from the linear prediction coefficients. Perceptual weighting is used to linear prediction coefficients without quantization. Analysis of the adaptive codebook search range is obtained by ring opening of genetic analysis to reduce the codebook search complexity[]. Adaptive codebook and fixed codebook search are performed for each subframe. Then, the next subframe update the parameters of the synthesis filter and weighting filter should be used, and finally the resulting parameters could be encoded in a certain order. The decoding process is relatively simple.the parameter should be extracted firstly, the interpolated line spectral pair coefficients should be converted into linear prediction coefficients for each subframe; 015. The authors - Published by Atlantis Press 887

and the adaptive codebook and fixed codebook should be multiplied by the excitation signal after it has been gained[3]. A linear predictive synthesis filter reconstructed speech could be made by the excitation signal, together with the completion of the speech after adaptive post-reconstruction filtering and high-pass filtering processing. Optimization of Codec Program In order to achieve real-time systems, series of optimization needed to be done for the source code G.79 from ITU, whose optimization degree can be divided into the algorithms level, C language level and assembler-level optimization[4]. The main algorithm-level optimization has the following work: First, cancel 5ms preview: In the LP analysis, the original algorithm which has the forward-looking 5ms data, all of which can be replaced with zeros in operation, bringing you a large reduction of computation; Second, the open-loop gene search using roughened search mode. In the original algorithm, when the correlation coefficient is calculated, the search is increased in steps of, saving half of the time, due to smaller changes of the continuous addition of speech data frames in pitch delay value. When changes decreased within a certain range, search is unnecessary, which could be directly replaced by the value of the previous frame. Third, in all the multiplication operation, the operation result of zero could be abandoned. Fourth, the fixed codebook search algorithm should be changed into the reset pulse sequence method: The first 40 possible pulse positions may be turned into equation (1) to calculate the contribution to the value of a single pulse, and then it should be reordered according to the size of the contribution to the value in the same track. The first track of the location of each of the four pulse Reset post could be selected to search for: 39 ( dnc ( ) k ( n)) C k n 0 t E Ck Ck (1) The methods of C language level and assembler-level optimization are mainly as follows: Omitting unnecessary overflow judgment; the arrangement of the same nature with the same function, which will help the compiler to compile it into a code having parallel computing structure; when you call the cycle, the loop should be shortened as much as possible, and the transfer of the judge sentences should be avoided; fewer merge command functions, such as the autocorrelation function, windowing functions to save on the stack of the operating time[5]; The number of calls to the number of instructions are fewer functions, whose former name with a keyword inline, the compiler when comparing the province, which is a space for time optimization tools. In a PC, the algorithm is implemented in software platform CCS.0, which could be used to some of this optimization method based on proven platforms, such as the use Intrinsic functions, the options open of C / C ++ compiler of CCS, Release-mode compilation, Debug information exclusion, which have a greater impact on system performance. System Design System block diagram is shown in Figure 1. Voice signal input has been made through the line or microphone, which generates 16-bit linear PCM by TLV30AIC3 chip pretreatment[6]. TLV30AIC3 chip is controlled by DSP through the on-chip MCBSP0, and exchanged data through the on-chip MCBSP1. PC through the JTAG port of the device is programmed into the DSP, 16-bit PCM voice signal through MCBSP1 enter DSP, then processed by compression encoding algorithm to generate bit stream. When bit stream through MCBSP1 to an external communication system or module, voice could be restored at the decoding side. If it is to verify the correctness of the algorithm and reconstruction of voice quality distortion on a single DSP, after the bit stream decoding, the data could be sent back to TLV30AIC3 chip, and be reconstructed speech with D / A and its amplification. 888

Fig.1. System block diagram Hardware Design Hardware design is mainly that of the interface between AIC3 and DSP. TLV30AIC3 ADC and DAC components of highly integrated in the chip, can provide 16bit, 0bit, 4bit and 3bit sampling in the frequency range of 8K ~ 96K. Voice signal can produce 16-bit linear PCM signal sampling rate of 8KHz after TLV30AIC3 through acquisition, providing an input signal processing algorithms in line. TLV30AIC3 pins can be divided into signal input and output pins, control pins, pin data transfer and power supply pins, etc., in which the signal input, output pins and the power pin connection are relatively simple,which could be completed in reference to typical circuit chip t materials. There are four control pins, respectively, SCLK, CS, SDIN, MODE,which are used to coordinate the host DSP initialization TLV30AIC3, there are five data transmission pins, namely BCLK, LRCIN, LRCOUT, DIN, DOUT, which are used with the host DSP voice data exchange. DSP's over six-channel buffered serial port pins can be divided into control pins and data pins.the control pin is the clock sending and receiving pins BCLKX, BCLKR, frame sending and receiving pins BFSX, BFSR, and the data pins is BDX, BDR. Pin connections between TLV30AIC3 and DSP design shown in Figure and 3. Fig.. Control Interface Fig.3. Data interface Software Design TLV30AIC3 chip initialization and work processes are implemented through software control, so are the DSP receiving, processing and transmitting data. The software design uses the modular, structured programming thought tradition. The program can be divided into five modules: DSP initialization, MCBSP0 communication with TLV30AIC3, MCBSP1 exchange data with TLV30AIC3, G.79 coding procedures, G.79 decoder. The overall software design process is shown in Figure 4: 889

Figure4. Software design flow Experimental Results The optimized program is downloaded to the DSP through the emulator and JTAG, and then one s own voice recording could be input to the input interface of TLV30AIC3 through the PC line. The speech codec can be restored and reconstructed by the system. Compared with the original speech, only a very small distortion occurs. The original speech and the reconstructed speech waveform are shown in Figure 5 and 6. If a series of in-depth optimization could be implemented to the coding and decoding procedure and system, a better effect would be obtained. Fig.5. Original speech waveform Fig.6. Reconstruction speech waveform Acknowledgement Fund Project: Hubei University of Science and Technology University's scientific research project(ky104). References [1] ITU-T Rec.G.79, Coding of Speech at 8kbit/s using Conjugate Structure Algebraic Code Exited Linear Prediction(CS-ACELP) ), March 1996. [] M. Banerjee, B.A.Vani, and G.R.Krishna, Optimalreal time DSP implementation of ITU G.79 speech codec, in Proc.IEEE 60th Vehicular Technology Conf. (VTC 04) vol. 6, pp.3908 391, Sep. 004. [3] Zhengwen Zhang,Geng Wang,Yongjie Zhang. Implementation of G.79 algorithm based on ARM9, The International Conference on Consumer Electronics, Communications and Networks (CECNet 01), China, 01. 890

[4] Tsai S-M,Yang J-F.Eficient algebraic code-excited liner predictive codebook search[j].ieee Proc Vis Image Signal Process, 006,153(6):761.768. [5] Hochong Park,Younchang Choi,Doyoon Lee.Eficient codebook search method for ACELP speech codecs[c].speech Coding,IEEE Workshop Proceedings,00:17-19. [6] TMS30VC540 Data Sheet(Rev.F)Texas Instruments,February 005. 891