Digital Signal Representation of Speech Signal


Mrs. Smita Chopde 1, Mrs. Pushpa U S 2
1,2 EXTC Department, Mumbai University

Abstract

Delta modulation is a waveform coding technique that reduces the data rate to a large extent in data communication. The problem encountered in delta modulation is slope overload error, which is inherent in the system. For the signal to have good fidelity, the slope overload error needs to be as small as possible. Hence adaptive techniques need to be applied to delta modulation to reduce noise. Adaptive delta modulation reduces the slope overload error and at the same time increases the dynamic range and tracking capability relative to fixed-step-size delta modulation. The adaptive algorithm adjusts the step size (within a range of step sizes) to the power level of the signal and thus enhances the dynamic range of the coding system. This paper discusses experiments carried out using quantization, delta modulation and adaptive delta modulation, and the improvement each offers over the others.

On the whole, the process of speech originates at the linguistic level in the speaker's mind, descends to the physiological level during pronunciation and then to the acoustic level during transmission. The listener brings it back to the physiological level during hearing, and finally the sensation produced at the inner ear ends at the same linguistic level in the listener's brain.

Keywords: Quantization, Delta modulation, Adaptive delta modulation, Digital communication and Signal Processing.

I. INTRODUCTION

The endeavour to communicate with one another is a primary activity of human beings. The desire to convey one's point of view and to relate to one another is a never-ending process. Of all forms of communication, speech has evolved over a long period of time as the primary means of communication between human beings. The convenience and popularity of communication through speech is due to a few features of speech:
Speech is natural
Speech is efficient
Speech is flexible
Speech is robust
Speech signals are easily manageable

A. Mechanism of Speech Production

To communicate, a speaker must produce a speech signal in the form of a sound pressure wave that travels from the speaker's mouth to the listener's ears. The communication process has several stages. The process begins with the thought that the speaker wants to convey to a listener. The thought is converted into a linguistic structure by choosing appropriate words and phrases to express its meaning. The words are expressed as a sequence of sound units with local and global sound effects, and finally the human brain issues several commands to form suitable articulatory gestures, producing acoustic vibration in the physical world. The acoustic vibrations are captured at the listener's ears and result in the perception of the speaker's intended meaning in the listener's mind.

Figure 1. Speech chain

B. Anatomy and Physiology of the Human Speech Production System

Speech is an acoustic sound pressure wave that originates from coordinated movements of the anatomical structures that make up the human speech production system. The components of this system are the lungs, trachea (windpipe), larynx (organ of voice production), pharyngeal cavity (throat), oral cavity (mouth) and nasal cavity (nose). The pharyngeal and oral cavities are grouped together and referred to as the vocal tract. The vocal cords, velum, tongue, teeth, jaws and lips together complete the speech production system and are called articulators.

A principal feature of speech sounds is the manner of excitation of the vocal tract. Voiced and unvoiced excitation are the elemental forms of excitation. During normal speech production, air pressure builds up below the larynx through the efforts of the muscles and lungs, and starts pushing the vocal cords apart. The glottal slit begins to open and the accumulated air is forced up from the lungs through the glottis (the space between the vocal folds) and through any narrowing in the vocal tract. The increased airflow reduces the local air pressure, and the vocal cords try to close due to their elastic tension. This cycle repeats, and the air flowing through the glottis sets the vocal folds into vibration. The sounds produced in this process are called voiced sounds. The time between successive vocal fold openings is the pitch period, and its reciprocal, the rate of vibration, is referred to as the fundamental frequency or pitch. Unvoiced sounds are produced when the vocal tract is sufficiently closed by the formation of a constriction at some point along it and air is forced through this constriction, producing turbulence, as in the last sound of the word "fish".

Figure 2. Anatomy of human speech production

II. METHODS

A. Uniform Quantization

The quantization ranges and levels may be chosen in a variety of ways depending on the intended application of the digital representation. With uniform quantization, the dynamic range R of the signal is divided into N equal-sized intervals, each of length Δ, the quantization step size. With interval boundaries $x_i$ and quantization levels $\hat{x}_i$,

$$x_i - x_{i-1} = \Delta \tag{3.1}$$

$$\hat{x}_i - \hat{x}_{i-1} = \Delta \tag{3.2}$$

Any value in the i-th interval is mapped to the middle of this interval, i.e.

$$Q(x) = \hat{x}_i = X_{\min} + (i-1)\Delta + \frac{\Delta}{2}, \quad \text{if } x_{i-1} \le x < x_i \tag{3.3}$$
The relationship between the input (unquantized value) and the output (quantized value) of a uniform quantizer is shown in Figure 3. In Figure 3, $x_i$ represents the right boundary of interval $i$ and $\hat{x}_i$ the quantization level of this interval.

Figure 3. 3-bit quantizer

Given the signal range R, a uniform quantizer has only one parameter: the number of levels N or the quantization step size Δ, which are related by

$$\Delta = \frac{R}{N} \tag{3.4}$$

The number of levels N is generally chosen to be of the form $2^B$ so as to make the most efficient use of B-bit binary codewords. If the signal has a symmetrical probability density function so that $|x(n)| \le X_{\max}$, or $R = 2X_{\max}$, then

$$\Delta = \frac{2X_{\max}}{2^B} \tag{3.5}$$

The quantized samples are represented as

$$\hat{x}(n) = x(n) + e(n) \tag{3.6}$$

where $x(n)$ is the unquantized sample and $e(n)$ is the quantization error or noise. The signal-to-noise ratio is

$$\text{SNR} = 6B - 7.2 \ \text{(dB)} \tag{3.7}$$

This shows that every additional bit contributes a 6 dB improvement in SNR. The actual SNR for a given value of B depends on the relation between $X_{\max}$ and $\sigma_x$, which depends on the probability distribution of the signal. In order to improve the fidelity of representation with uniform quantization so that it is perceptually acceptable, it is necessary to use more bits.
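The quantizer described by (3.3) and (3.5) can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: the function names `uniform_quantize` and `snr_db`, the use of NumPy, and the uniformly distributed test signal are all assumptions made for the example. Because the constant in (3.7) depends on how $X_{\max}$ relates to $\sigma_x$, the test signal is only expected to reproduce the roughly 6 dB per bit slope, not the 7.2 dB offset.

```python
import numpy as np

def uniform_quantize(x, x_max, bits):
    """Uniform quantizer over [-x_max, x_max) with 2**bits levels (hypothetical helper)."""
    n_levels = 2 ** bits
    delta = 2.0 * x_max / n_levels                  # step size, as in (3.5)
    x = np.asarray(x, dtype=float)
    # Zero-based interval index (i - 1 in the notation of (3.3));
    # inputs outside the range are clipped to the first or last interval.
    idx = np.floor((x + x_max) / delta)
    idx = np.clip(idx, 0, n_levels - 1).astype(int)
    x_hat = -x_max + idx * delta + delta / 2.0      # middle of the interval, as in (3.3)
    return idx, x_hat

def snr_db(x, x_hat):
    """Ratio of signal power to quantization noise power, in dB."""
    noise = x_hat - x
    return 10.0 * np.log10(np.sum(x ** 2) / np.sum(noise ** 2))

# Assumed test signal: samples uniformly distributed over the full range.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 50_000)
snrs = {b: snr_db(x, uniform_quantize(x, 1.0, b)[1]) for b in (4, 6, 8)}
for b, value in snrs.items():
    print(f"B = {b}: SNR = {value:.1f} dB")
```

Comparing the printed values for successive word lengths gives a quick check of the per-bit improvement stated after (3.7).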

B. Predictive Coding

In a typical speech waveform, adjacent samples take similar values, except at transitions between different phonemes. One way to exploit this correlation is linear predictive coding. It first predicts the present sample $x(n)$ using a linear combination of previously reconstructed samples $\hat{x}(n-k)$, so that

$$x_p(n) = \sum_{k} a_k \, \hat{x}(n-k) \tag{3.8}$$

Figure 4. The scheme representing the process of quantization

The error between the actual sample value and the predicted one,

$$d(n) = x(n) - x_p(n) \tag{3.9}$$

is quantized to $\hat{d}(n)$ and coded into a codeword $c(n)$. In the decoder, the same predicted value is first produced from previously decoded samples. This value is then added to the decoded quantized error for the current sample, i.e.

$$\hat{x}(n) = x_p(n) + \hat{d}(n) \tag{3.10}$$

Delta Modulation

A simple predictive coding system is the delta modulation (DM) system. In this case the quantizer for the prediction error has only two levels and the step size is fixed. The positive quantization level is represented by $c(n) = 0$ and the negative level by $c(n) = 1$. The quantized error $\hat{d}(n)$ is given by

$$\hat{d}(n) = \begin{cases} \Delta, & c(n) = 0, & \text{if } d(n) \ge 0 \\ -\Delta, & c(n) = 1, & \text{if } d(n) < 0 \end{cases} \tag{3.11}$$

Figure 5. Uniform quantization over the entire length of the signal. Figure 5 represents the complete length of the signal quantized using uniform quantization.

A simple first-order predictor is used, i.e. $x_p(n) = \hat{x}(n-1)$, which corresponds to a predictor coefficient $\alpha = 1$. The reconstruction $\hat{x}(n)$ then satisfies the difference equation

$$\hat{x}(n) = \hat{x}(n-1) + \hat{d}(n) \tag{3.12}$$

With $\alpha = 1$, the above equation is the digital equivalent of integration, i.e. it represents the accumulation of positive and negative increments of magnitude Δ. The input to the quantizer is

$$d(n) = x(n) - \hat{x}(n-1) = x(n) - x(n-1) - e(n-1) \tag{3.13}$$

Thus, except for the quantization error in $\hat{x}(n-1)$, $d(n)$ is a first-order backward difference of $x(n)$, which can be viewed as a digital approximation to the derivative of the input, the inverse of the digital integration process. Because the error quantizer has only two levels, delta modulation has a bit rate of 1 bit/sample.
If it is applied to a 16 bit/sample sequence, this leads to a compression ratio (CR) of 16.

Figure 6. Uniform quantization on a fixed length of the signal (axes: time, amplitude). Figure 6 shows uniform quantization on a fixed length of the signal stretched over the time interval t=5 to t=51. Digitizing the signal in this way requires a larger number of bits.
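The delta modulation recursion in (3.11) and (3.12) can be sketched as a short encoder and decoder pair. This is an illustrative sketch under stated assumptions, not the code used in the paper: the names `dm_encode` and `dm_decode`, the initial reconstruction value of zero, the 50 Hz test tone and the step size of 0.05 are all chosen for the example. Only the 8 kHz sampling rate is taken from the experiments described in the text.

```python
import numpy as np

def dm_encode(x, delta):
    """Return codewords c(n): 0 for a +delta step, 1 for a -delta step, as in (3.11)."""
    x = np.asarray(x, dtype=float)
    bits = np.empty(len(x), dtype=np.uint8)
    x_hat_prev = 0.0                          # assumed initial reconstruction
    for n, sample in enumerate(x):
        d = sample - x_hat_prev               # prediction error, x_p(n) = x_hat(n-1)
        if d >= 0:
            bits[n] = 0
            x_hat_prev += delta               # accumulate +delta, as in (3.12)
        else:
            bits[n] = 1
            x_hat_prev -= delta               # accumulate -delta, as in (3.12)
    return bits

def dm_decode(bits, delta):
    """Rebuild x_hat(n) by accumulating +/- delta steps, starting from zero."""
    steps = np.where(np.asarray(bits) == 0, delta, -delta)
    return np.cumsum(steps)

# Assumed test tone: its largest change per sample (about 0.02) is below delta,
# so the staircase can follow it without slope overload.
fs = 8000
t = np.arange(fs // 10) / fs
x = 0.5 * np.sin(2 * np.pi * 50 * t)
delta = 0.05
bits = dm_encode(x, delta)
x_hat = dm_decode(bits, delta)
print("max tracking error:", np.max(np.abs(x_hat - x)))
```

The decoder needs only the bit stream and the step size, which is why the bit rate is 1 bit/sample. Raising the tone frequency or lowering `delta` in this sketch makes the staircase fall behind the input, which is the slope overload condition.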

Figure 7. Block diagram of a delta modulation system (encoder and decoder)

For delta modulation to work well, the step size must be chosen properly to match the signal variation. This is a difficult task, as the signal characteristics often change from tone to tone. The quantization process of delta modulation with a fine step size is shown in Figure 7. When the step size is too small, the quantized signal lags behind the actual signal magnitude. If the step size is too large, as in the latter portion, the quantized signal oscillates about the actual signal. For better performance, the step size should be adjusted adaptively.

Figure 8. Flowchart for performing delta modulation

Adaptive Delta Modulation

The scheme presented in this paper is a feedback-type scheme in which the step size for the two-level quantizer is adapted from the output codewords. The general form of such a system is shown in Figure 9. Such schemes retain the advantage that no synchronization of bit patterns is required: in the absence of errors, the step size information can be derived from the codeword sequence at both the transmitter and the receiver. The use of adaptive quantization in delta modulation is discussed through a specific adaptation algorithm.

Figure 9. General form of adaptive delta modulation (encoder and decoder)

This scheme was studied extensively by N. S. Jayant. In Jayant's algorithm for adaptive delta modulation, the step size is given by

$$\Delta(n) = M \, \Delta(n-1) \tag{3.14}$$

$$\Delta_{\min} \le \Delta(n) \le \Delta_{\max} \tag{3.15}$$

The rule for choosing the step size multiplier is

$$M = P > 1 \quad \text{if } c(n) = c(n-1)$$
$$M = Q < 1 \quad \text{if } c(n) \ne c(n-1)$$

Figure 10. Quantization done by adaptive delta modulation

Figure 10 shows how a waveform would be quantized by an adaptive delta modulator. For convenience the parameters of the system are set to $P = 2$, $Q = 1/2$, $\alpha = 1$, and the minimum step size is shown in the figure.
It can be seen that the region of large positive slope still produces runs of 0s, but in this case the step size increases exponentially so as to follow the increase in slope of the waveform. The region of granularity to the right of the figure is again signalled by an alternating sequence of 0s and 1s, but in this case the step size falls rapidly to the minimum ($\Delta_{\min}$) and remains there as long as the slope is small.
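The adaptation rule in (3.14) and (3.15) can be sketched as follows. This is a minimal sketch, not the authors' implementation: the names `next_step`, `adm_encode` and `adm_decode`, the starting step size of $\Delta_{\min}$, the zero initial reconstruction, and the particular values of $\Delta_{\min}$ and $\Delta_{\max}$ are assumptions for illustration. The multipliers $P = 2$ and $Q = 1/2$ follow the values quoted in the text.

```python
import numpy as np

def next_step(delta_prev, c, c_prev, p, q, d_min, d_max):
    """Jayant-style update: grow the step on repeated bits, shrink it on alternation."""
    m = p if c == c_prev else q                         # multiplier M in (3.14)
    return min(max(m * delta_prev, d_min), d_max)       # limits in (3.15)

def adm_encode(x, p=2.0, q=0.5, d_min=0.01, d_max=1.0):
    """Return the bit stream and the encoder's own reconstruction."""
    bits, x_hat = [], []
    recon, delta, c_prev = 0.0, d_min, None             # assumed initial state
    for sample in np.asarray(x, dtype=float):
        c = 0 if sample - recon >= 0 else 1             # sign of the prediction error
        if c_prev is not None:
            delta = next_step(delta, c, c_prev, p, q, d_min, d_max)
        recon += delta if c == 0 else -delta
        bits.append(c)
        x_hat.append(recon)
        c_prev = c
    return np.array(bits, dtype=np.uint8), np.array(x_hat)

def adm_decode(bits, p=2.0, q=0.5, d_min=0.01, d_max=1.0):
    """Recover the reconstruction from the bits alone; no step size is transmitted."""
    x_hat = []
    recon, delta, c_prev = 0.0, d_min, None
    for c in (int(b) for b in bits):
        if c_prev is not None:
            delta = next_step(delta, c, c_prev, p, q, d_min, d_max)
        recon += delta if c == 0 else -delta
        x_hat.append(recon)
        c_prev = c
    return np.array(x_hat)

# Assumed test input: a sudden jump to 1.0, which a fixed step of 0.01
# would need 100 samples to reach.
x_step = np.ones(20)
bits, x_hat = adm_encode(x_step)
decoded = adm_decode(bits)
print(bits[:10], np.round(x_hat[:10], 2))
```

Because the decoder repeats the same update from the received codewords, it arrives at the same step sizes as the encoder, which illustrates why the step size can be derived at both ends from the codeword sequence.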

Figure 11. Flowchart for performing adaptive delta modulation

III. RESULTS

A. Delta Modulation

Figure 12 represents delta modulation performed on a complete sentence recorded at a sampling frequency of 8 kHz for a duration of 2 seconds.

Figure 12. Delta modulation performed on the complete signal (axes: time, amplitude)

Figure 12 shows delta modulation performed on the voice of a female speaker, recorded for a duration of 2 seconds, over the complete length of the signal. The experiment shows that granular noise and hunting error are present to a very large extent, which causes information to be lost. Making the step size adaptive addresses this problem.

B. Adaptive Delta Modulation

Figure 13. Adaptive delta modulation performed on the complete sentence (plot title: "Illustration of the adaptive delta modulation"; legend: original signal)

Figure 13 represents adaptive delta modulation performed on the complete sentence recorded at a sampling frequency of 8 kHz for a duration of 2 seconds.

IV. CONCLUSION

The experiments above showed that adaptive delta modulation gives better results than delta modulation, because the step size used for quantization adapts to the input signal given to the system. The signal-to-noise ratio is also improved to a large extent in adaptive delta modulation. These techniques can be useful in vocoders and encoders that encode the speech signal for transmission over long distances.

REFERENCES

[1] J. Makhoul, "Linear Prediction: A Tutorial Review," Proceedings of the IEEE, Vol. 63, No. 4, pp. 561-580, Apr. 1975.
[2] Rabiner and Schafer, "Digital Signal Processing," Prentice Hall.
[3] B. P. Lathi and Zhi Ding, "Modern Digital and Analog Communication Systems," Oxford University Press, International Fourth edition, 2010.
[4] N. S. Jayant, "Adaptive quantization with one word memory," The Bell System Technical Journal, Vol. 52, No. 7, pp. 1119-1144.
[5] Douglas O'Shaughnessy, "Speech Communication," University Press, Second edition.
[6] H. R. Schindler, "Delta Modulation," IEEE Spectrum, Vol. 7, pp. 69-78, October 1970.
[7] L. R. Rabiner, R. W. Schafer, "Digital Processing of Speech Signals," Pearson edition.
[8] J. G. Proakis, "Digital Signal Processing," Pearson edition.