Digital Speech Processing and Coding

Size: px
Start display at page:

Download "Digital Speech Processing and Coding"

Transcription

1 ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park ENEE408G Capstone -- Multimedia Signal Processing (S'03)

2 Last Lecture UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Course overview and logistics Bring multimedia to digital world: sampling & quantization Introduction to speech processing Different aspects of speech Friday Lab Session Warm-up with Matlab and PocketPC Today: speech processing and coding ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [2]

3 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Speech Production ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [3]

4 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Visualize Speech via Spectrogram Short-time Fourier Transform (STFT) of the windowed speech waveform is X ( ω, τ ) = x[ n, τ ]exp[ jω n] A window function is applied to the speech signal Spectrogram is Visualized using different brightness in a 2-D time-frequency plot S( ω, τ) = X( ω, τ) 2 x [ n, τ ] = w[ n, τ ] x[ n] ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [4]

5 Example of Speech Spectrogram Figure 3 of SPM May 98 Speech Survey ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [5]

6 Play Speech Figure 1 of SPM May 98 Speech Survey UMCP ENEE408G Slides (created by M.Wu 2003) First speaking machine in 1930s NY World s Fair 14 keys, 1 wristband, 1 pedal Modeling speech production as a linear system Sound sources Either voiced or unvoiced Voice sound Modeled by a generator of pulses Unvoiced sound Modeled by white noise generator Articulation Modeled by a cascade of singleresonance (pole) digital filters ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [6]

7 UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Linear Separable Model for Speech Production Vocal tract is modeled as a linear time-varying system Parameters of the linear system are slowly varying Excited by time-varying source (voiced or unvoiced) Practical models Model each speech frame as Linear Time-Invariant Excited by either voiced or unvoiced source Allow overlaps in neighbouring frames Figure 3.2 of Furui s book ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [7]

8 Pitch UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Pitch period: time duration of one glottal cycle (of glottis) Pitch: reciprocal of pitch period Also known as the fundamental frequency Vowel typically has one to four pitch periods Pitch ranges from about 60Hz to 400 Hz Typical pitch range of female pitch is higher than male pitch ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [8]

9 Pitch of Male and Female Speakers UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Distribution, mean, and standard deviation of male and female pitch Figure 2.12 & 2.11 of Furui s book ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [9]

10 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Formant Figure 2.5 & 3.5 of Furui s book Formants: the resonance frequencies of the vocal tract Vowels typically have three formants: F1, F2, F3 Formant changes when vocal tract changes to produce different sound ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [10]

11 Formants of Vowels Figure 2.6 & 2.7 of Furui s book UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [11]

12 Speech Coding ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [12]

13 PCM coding UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) I(x,y) Sampler Quantizer Encoder Input signal digitize/capture device How to encode a digital image into bits? Sampling and perform uniform quantization Pulse Coded Modulation (PCM) 8 bits per sample ~ good for speech 16 bits ~ needed for high-quality music Tradeoff between fidelity and file size How to squeeze out redundancy? transmit ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [13]

14 Discussion on Improving PCM (1) UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Uniform quantization may give inconsistent range of relative amount errors E.g., +/- 2 incurs 20% vs. 2% at amplitude 10 and 100 Also is not optimal to non-uniformly distributed sources Non-uniform quantization Assign smaller quantization step size at small amplitude to maintain consistent range of relative quantization errors over the entire dynamic range Can apply non-linear transform before uniform quantization via companding (compression-expansion) μ-law companding: international standard for 64kbps speech ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [14]

15 Discussion on Improving PCM (2) UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Quantized PCM values may not be equally likely Can we do better than encode each value using same # bits? Example P( 0 ) = 0.5, P( 1 ) = 0.25, P( 2 ) = 0.125, P( 3 ) = If use same # bits for all values Need 2 bits to represent the four possibilities if treat If use less bits for likely values 0 ~ Variable Length Codes (VLC) 0 => [0], 1 => [10], 2 => [110], 3 => [111] Use 1.75 bits on average ~ saves 0.25 bit per sample! Bring probability into the picture Use probability distribution to reduce average # bits per quantized sample ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [15]

16 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Entropy coding Idea: use less bits for commonly seen values Examples: Huffman coding (used in JPEG and MPEG) Lemple-Ziv coding (used in Unix) How many # bits needed? Compressability depends on the source s characteristics Limit of compression => Entropy Measures the uncertainty or amount of avg. information of a source ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [16]

17 List of Compression Tools UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Lossless encoding tools Entropy coding: Huffman, Lemple-Ziv, and others (Arithmetic coding) [ Run-length coding ] Lossy tools for reducing redundancy Quantization and truncations Predictive coding Encode prediction parameters and residues with less bits Subband and Transform coding Encode different frequency bands Transform into a domain with improved energy compaction ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [17]

18 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) How to Encode Correlated Sequence? Consider: high correlation between successive samples Predictive coding Basic principle: Remove redundancy between successive pixels and only encode residual between actual and predicted Residue usually has much smaller dynamic range Allow fewer quantization levels for the same MSE => get compression Compression efficiency depends on intersample redundancy First try e Q (n) u(n) Predictor _ e(n) Quantizer u P (n) = u(n-1) u Q (n) e Q (n) Encoder + u P (n) = u Q (n-1) Predictor Decoder ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [18]

19 UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Predictive Coding (cont d) Problem with 1 st try Input to predictor are different at encoder and decoder decoder doesn t know u(n)! Mismatch error could propagate to future reconstructed samples Solution: Differential PCM (DPCM) Use quantized sequence u Q (n) for prediction at both encoder and decoder Prediction error e(n) Quantized prediction error e Q (n) Distortion d(n) = e(n) e Q (n) u(n) e Q (n) + _ + u P (n) = u Q (n-1) e(n) u P (n) =u Q (n-1) Encoder Quantizer u Q (n) Predictor Predictor Decoder Think: what predictor to use? e Q (n) u Q (n) ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [19]

20 All-Pole Modeling of Speech UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Auto-regressive (AR) model: all-pole filter u[n] H(z) is the overall transfer function Glottal Flow G(z), Vocal Tract V(z), Radiation R(z), Gain β Synthesis process: u[n]: the vocal tract input, s[n]: speech output u[n] β H ( z) = s[n] A( z) β H ( z) = s[n] A( z) ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [20]

21 Model-based Coding UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Linear Prediction Coder (LPC) LPC Vocoder ( voice coder ) Divide speech into frames (several tens milliseconds) and encode the LPC coefficients of each frame Additional parameters to facilitate synthesis: voiced/unvoiced flag, gain, pitch (for voiced) Line Spectrum Pair (LSP) Coding Hybrid Coding: LPC Residual Coding Between LPC and waveform coding ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [21]

22 Line Spectrum Pair (LSP) Coding UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Pros and Cons of LPC method Good performance at coding rate down to 2.4kbps Synthesized voice becomes unnatural when below 2.4kbps When the poles are near the unit circle, quantization in LPC coefficients may result in instability. LSP parameters LSP are frequencies extracted from polynomials constructed from LPC coefficients Frequency domain features (similar to formant) => produce less distortion due to quantization [See details in Design Project on Speech] ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [22]

23 Hybrid Coding UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Hybrid between LPC and waveform coding LPC Residual Coding: encode and slowly update LPC coefficients, and send the LPC residual (e.g. encoded using Vector Quantization) Advantages: Free from quality degradation due to source modeling Low-frequency waveform is exactly reproduced Spectral information of the entire frequency range is preserved No need of pitch period estimation and voiced/unvoiced decision ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [23]

24 Code-Excited Linear Predictive Coding (CELP) UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Multipulse-Excised Linear Predictive Coding (MPC) Do not distinguish voiced/unvoiced sound explicitly Code-Excited Linear Predictive Coding (CELP) Replace the multi-pulses of MPC with vector-quantized sequences based on long-term prediction of periodicity and short-term prediction Figure 6.32 of Furui s book ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [24]

25 Speech Coding Methods Table 6.1 of Furui s book UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Waveform coding; Hybrid coding; Analysis-synthesis coding ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [25]

26 Speech Quality vs. Transmission Rate UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Figure 6.2 of Furui s book ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [26]

27 Comparison of Different Speech Coding Tech. UMCP ENEE408G Slides (created by R. Liu & M.Wu 2002) Table 6.2 of Furui s book ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [27]

28 Put Together: A Digital Telephone System UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) 8kHz and 8-bit per sample for telephone speech => 64kbps Anti-aliasing filter before sampling Non-uniform quantization (e.g., through μ-law or A-law companding ~ signal compression-expansion) ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [28]

29 Summary UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) Speech production and analysis Spectrogram; Pitch, Formant Linear prediction model Speech coding Basic compression tools This week s Lab session: Design project#1 on Speech Next lecture: speech recognition ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [29]

30 Assignments UMCP ENEE408G Slides (created by M.Wu & R.Liu 2002) The Past, Present, and Future of Speech Processing IEEE Signal Processing Magazine, May 1998 Hard copy handout -- Read Section I, II, III Preparing for Friday Lab Lab manual will be posted online by Wednesday afternoon Go over Part I and II before coming to lab Already have background to do Part I & II Can play with Part III, IV-1, and V Start working on Part VI (Pocket PC) Team-up and Instructor and TA your team information by Thursday 5pm ENEE408G Capstone -- Multimedia Signal Processing (S'03) Lec1 Introduction 2/4/03 [30]

-voiced. +voiced. /z/ /s/ Last Lecture. Digital Speech Processing. Overview of Speech Processing. Example on Sound Source Feature

-voiced. +voiced. /z/ /s/ Last Lecture. Digital Speech Processing. Overview of Speech Processing. Example on Sound Source Feature ENEE408G Lecture-6 Digital Speech rocessing URL: http://www.ece.umd.edu/class/enee408g/ Slides included here are based on Spring 005 offering in the order of introduction, image, video, speech, and audio.

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

ENEE408G Multimedia Signal Processing

ENEE408G Multimedia Signal Processing ENEE408G Multimedia Signal Processing Design Project on Digital Speech Processing Goals: 1. Learn how to use the linear predictive model for speech analysis and synthesis. 2. Implement a linear predictive

More information

Overview of Code Excited Linear Predictive Coder

Overview of Code Excited Linear Predictive Coder Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances

More information

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com

More information

APPLICATIONS OF DSP OBJECTIVES

APPLICATIONS OF DSP OBJECTIVES APPLICATIONS OF DSP OBJECTIVES This lecture will discuss the following: Introduce analog and digital waveform coding Introduce Pulse Coded Modulation Consider speech-coding principles Introduce the channel

More information

Enhanced Waveform Interpolative Coding at 4 kbps

Enhanced Waveform Interpolative Coding at 4 kbps Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara E-mail: [oded, gersho]@scl.ece.ucsb.edu Signal Compression

More information

Communications Theory and Engineering

Communications Theory and Engineering Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation

More information

The Channel Vocoder (analyzer):

The Channel Vocoder (analyzer): Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.

More information

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals 16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract

More information

Speech Synthesis; Pitch Detection and Vocoders

Speech Synthesis; Pitch Detection and Vocoders Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech

More information

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Spring,1999 Medium & High Rate Coding Lecture 26

More information

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing

More information

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of

More information

Linguistic Phonetics. Spectral Analysis

Linguistic Phonetics. Spectral Analysis 24.963 Linguistic Phonetics Spectral Analysis 4 4 Frequency (Hz) 1 Reading for next week: Liljencrants & Lindblom 1972. Assignment: Lip-rounding assignment, due 1/15. 2 Spectral analysis techniques There

More information

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

L19: Prosodic modification of speech

L19: Prosodic modification of speech L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture

More information

MASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering

MASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering 2004:003 CIV MASTER'S THESIS Speech Compression and Tone Detection in a Real-Time System Kristina Berglund MSc Programmes in Engineering Department of Computer Science and Electrical Engineering Division

More information

Audio Signal Compression using DCT and LPC Techniques

Audio Signal Compression using DCT and LPC Techniques Audio Signal Compression using DCT and LPC Techniques P. Sandhya Rani#1, D.Nanaji#2, V.Ramesh#3,K.V.S. Kiran#4 #Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Speech Compression Using Voice Excited Linear Predictive Coding

Speech Compression Using Voice Excited Linear Predictive Coding Speech Compression Using Voice Excited Linear Predictive Coding Ms.Tosha Sen, Ms.Kruti Jay Pancholi PG Student, Asst. Professor, L J I E T, Ahmedabad Abstract : The aim of the thesis is design good quality

More information

Comparison of CELP speech coder with a wavelet method

Comparison of CELP speech coder with a wavelet method University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2006 Comparison of CELP speech coder with a wavelet method Sriram Nagaswamy University of Kentucky, sriramn@gmail.com

More information

Chapter IV THEORY OF CELP CODING

Chapter IV THEORY OF CELP CODING Chapter IV THEORY OF CELP CODING CHAPTER IV THEORY OF CELP CODING 4.1 Introduction Wavefonn coders fail to produce high quality speech at bit rate lower than 16 kbps. Source coders, such as LPC vocoders,

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile 8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques

More information

Converting Speaking Voice into Singing Voice

Converting Speaking Voice into Singing Voice Converting Speaking Voice into Singing Voice 1 st place of the Synthesis of Singing Challenge 2007: Vocal Conversion from Speaking to Singing Voice using STRAIGHT by Takeshi Saitou et al. 1 STRAIGHT Speech

More information

Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding

Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Robust Linear Prediction Analysis for Low Bit-Rate Speech Coding Nanda Prasetiyo Koestoer B. Eng (Hon) (1998) School of Microelectronic Engineering Faculty of Engineering and Information Technology Griffith

More information

Digital Audio. Lecture-6

Digital Audio. Lecture-6 Digital Audio Lecture-6 Topics today Digitization of sound PCM Lossless predictive coding 2 Sound Sound is a pressure wave, taking continuous values Increase / decrease in pressure can be measured in amplitude,

More information

Analysis/synthesis coding

Analysis/synthesis coding TSBK06 speech coding p.1/32 Analysis/synthesis coding Many speech coders are based on a principle called analysis/synthesis coding. Instead of coding a waveform, as is normally done in general audio coders

More information

Page 0 of 23. MELP Vocoder

Page 0 of 23. MELP Vocoder Page 0 of 23 MELP Vocoder Outline Introduction MELP Vocoder Features Algorithm Description Parameters & Comparison Page 1 of 23 Introduction Traditional pitched-excited LPC vocoders use either a periodic

More information

Voice Excited Lpc for Speech Compression by V/Uv Classification

Voice Excited Lpc for Speech Compression by V/Uv Classification IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech

More information

EC 2301 Digital communication Question bank

EC 2301 Digital communication Question bank EC 2301 Digital communication Question bank UNIT I Digital communication system 2 marks 1.Draw block diagram of digital communication system. Information source and input transducer formatter Source encoder

More information

Chapter 9 Image Compression Standards

Chapter 9 Image Compression Standards Chapter 9 Image Compression Standards 9.1 The JPEG Standard 9.2 The JPEG2000 Standard 9.3 The JPEG-LS Standard 1IT342 Image Compression Standards The image standard specifies the codec, which defines how

More information

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) Proceedings of the 2 nd International Conference on Current Trends in Engineering and Management ICCTEM -214 ISSN

More information

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact

More information

Audio processing methods on marine mammal vocalizations

Audio processing methods on marine mammal vocalizations Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio http://labrosa.ee.columbia.edu Sound to Signal sound is pressure

More information

Compression. Encryption. Decryption. Decompression. Presentation of Information to client site

Compression. Encryption. Decryption. Decompression. Presentation of Information to client site DOCUMENT Anup Basu Audio Image Video Data Graphics Objectives Compression Encryption Network Communications Decryption Decompression Client site Presentation of Information to client site Multimedia -

More information

DEPARTMENT OF INFORMATION TECHNOLOGY QUESTION BANK. Subject Name: Information Coding Techniques UNIT I INFORMATION ENTROPY FUNDAMENTALS

DEPARTMENT OF INFORMATION TECHNOLOGY QUESTION BANK. Subject Name: Information Coding Techniques UNIT I INFORMATION ENTROPY FUNDAMENTALS DEPARTMENT OF INFORMATION TECHNOLOGY QUESTION BANK Subject Name: Year /Sem: II / IV UNIT I INFORMATION ENTROPY FUNDAMENTALS PART A (2 MARKS) 1. What is uncertainty? 2. What is prefix coding? 3. State the

More information

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 Analog Sound to Digital Sound Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre

More information

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21 E85.267: Lecture 8 Source-Filter Processing E85.267: Lecture 8 Source-Filter Processing 21-4-1 1 / 21 Source-filter analysis/synthesis n f Spectral envelope Spectral envelope Analysis Source signal n 1

More information

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007 MIT OpenCourseWare http://ocw.mit.edu HST.582J / 6.555J / 16.456J Biomedical Signal and Image Processing Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Voice Transmission --Basic Concepts--

Voice Transmission --Basic Concepts-- Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Telephone Handset (has 2-parts) 2 1. Transmitter

More information

Multimedia Systems Entropy Coding Mahdi Amiri February 2011 Sharif University of Technology

Multimedia Systems Entropy Coding Mahdi Amiri February 2011 Sharif University of Technology Course Presentation Multimedia Systems Entropy Coding Mahdi Amiri February 2011 Sharif University of Technology Data Compression Motivation Data storage and transmission cost money Use fewest number of

More information

Distributed Speech Recognition Standardization Activity

Distributed Speech Recognition Standardization Activity Distributed Speech Recognition Standardization Activity Alex Sorin, Ron Hoory, Dan Chazan Telecom and Media Systems Group June 30, 2003 IBM Research Lab in Haifa Advanced Speech Enabled Services ASR App

More information

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia SILK Speech Codec TDP 10/11 Xavier Anguera I Ciro Gracia SILK Codec Audio codec desenvolupat per Skype (Febrer 2009) Previament usaven el codec SVOPC (Sinusoidal Voice Over Packet Coder): LPC analysis.

More information

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing

Project 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing Project : Part 2 A second hands-on lab on Speech Processing Frequency-domain processing February 24, 217 During this lab, you will have a first contact on frequency domain analysis of speech signals. You

More information

Advanced audio analysis. Martin Gasser

Advanced audio analysis. Martin Gasser Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high

More information

Fundamentals of Digital Communication

Fundamentals of Digital Communication Fundamentals of Digital Communication Network Infrastructures A.A. 2017/18 Digital communication system Analog Digital Input Signal Analog/ Digital Low Pass Filter Sampler Quantizer Source Encoder Channel

More information

EEE 309 Communication Theory

EEE 309 Communication Theory EEE 309 Communication Theory Semester: January 2017 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Types of Modulation

More information

Pitch Period of Speech Signals Preface, Determination and Transformation

Pitch Period of Speech Signals Preface, Determination and Transformation Pitch Period of Speech Signals Preface, Determination and Transformation Mohammad Hossein Saeidinezhad 1, Bahareh Karamsichani 2, Ehsan Movahedi 3 1 Islamic Azad university, Najafabad Branch, Saidinezhad@yahoo.com

More information

Wideband Speech Coding & Its Application

Wideband Speech Coding & Its Application Wideband Speech Coding & Its Application Apeksha B. landge. M.E. [student] Aditya Engineering College Beed Prof. Amir Lodhi. Guide & HOD, Aditya Engineering College Beed ABSTRACT: Increasing the bandwidth

More information

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 Speech Processing Undergraduate course code: LASC10061 Postgraduate course code: LASC11065 All course materials and handouts are the same for both versions. Differences: credits (20 for UG, 10 for PG);

More information

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS Mark W. Chamberlain Harris Corporation, RF Communications Division 1680 University Avenue Rochester, New York 14610 ABSTRACT The U.S. government has developed

More information

Speech Coding in the Frequency Domain

Speech Coding in the Frequency Domain Speech Coding in the Frequency Domain Speech Processing Advanced Topics Tom Bäckström Aalto University October 215 Introduction The speech production model can be used to efficiently encode speech signals.

More information

Pulse Code Modulation

Pulse Code Modulation Pulse Code Modulation Modulation is the process of varying one or more parameters of a carrier signal in accordance with the instantaneous values of the message signal. The message signal is the signal

More information

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two Chapter Two Layout: 1. Introduction. 2. Pulse Code Modulation (PCM). 3. Differential Pulse Code Modulation (DPCM). 4. Delta modulation. 5. Adaptive delta modulation. 6. Sigma Delta Modulation (SDM). 7.

More information

Digital Signal Processing

Digital Signal Processing COMP ENG 4TL4: Digital Signal Processing Notes for Lecture #27 Tuesday, November 11, 23 6. SPECTRAL ANALYSIS AND ESTIMATION 6.1 Introduction to Spectral Analysis and Estimation The discrete-time Fourier

More information

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.

Reading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday. L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are

More information

SPEECH AND SPECTRAL ANALYSIS

SPEECH AND SPECTRAL ANALYSIS SPEECH AND SPECTRAL ANALYSIS 1 Sound waves: production in general: acoustic interference vibration (carried by some propagation medium) variations in air pressure speech: actions of the articulatory organs

More information

EEE 309 Communication Theory

EEE 309 Communication Theory EEE 309 Communication Theory Semester: January 2016 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Part 05 Pulse Code

More information

NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or

NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or other reproductions of copyrighted material. Any copying

More information

ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP

ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP A. Spanias, V. Atti, Y. Ko, T. Thrasyvoulou, M.Yasin, M. Zaman, T. Duman, L. Karam, A. Papandreou, K. Tsakalis

More information

Overview of Signal Processing

Overview of Signal Processing Overview of Signal Processing Chapter Intended Learning Outcomes: (i) Understand basic terminology in signal processing (ii) Differentiate digital signal processing and analog signal processing (iii) Describe

More information

Signal Analysis. Peak Detection. Envelope Follower (Amplitude detection) Music 270a: Signal Analysis

Signal Analysis. Peak Detection. Envelope Follower (Amplitude detection) Music 270a: Signal Analysis Signal Analysis Music 27a: Signal Analysis Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD November 23, 215 Some tools we may want to use to automate analysis

More information

COMP 546, Winter 2017 lecture 20 - sound 2

COMP 546, Winter 2017 lecture 20 - sound 2 Today we will examine two types of sounds that are of great interest: music and speech. We will see how a frequency domain analysis is fundamental to both. Musical sounds Let s begin by briefly considering

More information

Analog and Telecommunication Electronics

Analog and Telecommunication Electronics Politecnico di Torino - ICT School Analog and Telecommunication Electronics D5 - Special A/D converters» Differential converters» Oversampling, noise shaping» Logarithmic conversion» Approximation, A and

More information

COMPRESSIVE SAMPLING OF SPEECH SIGNALS. Mona Hussein Ramadan. BS, Sebha University, Submitted to the Graduate Faculty of

COMPRESSIVE SAMPLING OF SPEECH SIGNALS. Mona Hussein Ramadan. BS, Sebha University, Submitted to the Graduate Faculty of COMPRESSIVE SAMPLING OF SPEECH SIGNALS by Mona Hussein Ramadan BS, Sebha University, 25 Submitted to the Graduate Faculty of Swanson School of Engineering in partial fulfillment of the requirements for

More information

Speech Coding using Linear Prediction

Speech Coding using Linear Prediction Speech Coding using Linear Prediction Jesper Kjær Nielsen Aalborg University and Bang & Olufsen jkn@es.aau.dk September 10, 2015 1 Background Speech is generated when air is pushed from the lungs through

More information

TCET3202 Analog and digital Communications II

TCET3202 Analog and digital Communications II NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York DEPARTMENT: SUBJECT CODE AND TITLE: COURSE DESCRIPTION: REQUIRED COURSE Electrical and Telecommunications Engineering Technology TCET3202

More information

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2 Signal Processing for Speech Applications - Part 2-1 Signal Processing For Speech Applications - Part 2 May 14, 2013 Signal Processing for Speech Applications - Part 2-2 References Huang et al., Chapter

More information

Realization and Performance Evaluation of New Hybrid Speech Compression Technique

Realization and Performance Evaluation of New Hybrid Speech Compression Technique Realization and Performance Evaluation of New Hybrid Speech Compression Technique Javaid A. Sheikh Post Graduate Department of Electronics & IT University of Kashmir Srinagar, India E-mail: sjavaid_29ku@yahoo.co.in

More information

Telecommunication Electronics

Telecommunication Electronics Politecnico di Torino ICT School Telecommunication Electronics C5 - Special A/D converters» Logarithmic conversion» Approximation, A and µ laws» Differential converters» Oversampling, noise shaping Logarithmic

More information

Compression and Image Formats

Compression and Image Formats Compression Compression and Image Formats Reduce amount of data used to represent an image/video Bit rate and quality requirements Necessary to facilitate transmission and storage Required quality is application

More information

Waveform Coding Algorithms: An Overview

Waveform Coding Algorithms: An Overview August 24, 2012 Waveform Coding Algorithms: An Overview RWTH Aachen University Compression Algorithms Seminar Report Summer Semester 2012 Adel Zaalouk - 300374 Aachen, Germany Contents 1 An Introduction

More information

10 Speech and Audio Signals

10 Speech and Audio Signals 0 Speech and Audio Signals Introduction Speech and audio signals are normally converted into PCM, which can be stored or transmitted as a PCM code, or compressed to reduce the number of bits used to code

More information

University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005

University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 Lecture 5 Slides Jan 26 th, 2005 Outline of Today s Lecture Announcements Filter-bank analysis

More information

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech

More information

Audio and Speech Compression Using DCT and DWT Techniques

Audio and Speech Compression Using DCT and DWT Techniques Audio and Speech Compression Using DCT and DWT Techniques M. V. Patil 1, Apoorva Gupta 2, Ankita Varma 3, Shikhar Salil 4 Asst. Professor, Dept.of Elex, Bharati Vidyapeeth Univ.Coll.of Engg, Pune, Maharashtra,

More information

A SURVEY ON DICOM IMAGE COMPRESSION AND DECOMPRESSION TECHNIQUES

A SURVEY ON DICOM IMAGE COMPRESSION AND DECOMPRESSION TECHNIQUES A SURVEY ON DICOM IMAGE COMPRESSION AND DECOMPRESSION TECHNIQUES Shreya A 1, Ajay B.N 2 M.Tech Scholar Department of Computer Science and Engineering 2 Assitant Professor, Department of Computer Science

More information

Voice mail and office automation

Voice mail and office automation Voice mail and office automation by DOUGLAS L. HOGAN SPARTA, Incorporated McLean, Virginia ABSTRACT Contrary to expectations of a few years ago, voice mail or voice messaging technology has rapidly outpaced

More information

Overview of Digital Signal Processing

Overview of Digital Signal Processing Overview of Digital Signal Processing Chapter Intended Learning Outcomes: (i) Understand basic terminology in digital signal processing (ii) Differentiate digital signal processing and analog signal processing

More information

Ch. 3: Image Compression Multimedia Systems

Ch. 3: Image Compression Multimedia Systems 4/24/213 Ch. 3: Image Compression Multimedia Systems Prof. Ben Lee (modified by Prof. Nguyen) Oregon State University School of Electrical Engineering and Computer Science Outline Introduction JPEG Standard

More information

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING A Thesis Submitted to the Graduate Faculty of the University of New Orleans in partial fulfillment of the requirements for the degree of

More information

Digital Communication (650533) CH 3 Pulse Modulation

Digital Communication (650533) CH 3 Pulse Modulation Philadelphia University/Faculty of Engineering Communication and Electronics Engineering Digital Communication (650533) CH 3 Pulse Modulation Instructor: Eng. Nada Khatib Website: http://www.philadelphia.edu.jo/academics/nkhatib/

More information

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) e-issn: 2278-1676,p-ISSN: 2320-3331, Volume 9, Issue 2 Ver. I (Mar Apr. 2014), PP 07-12 Implementation of attractive Speech Quality for

More information

QUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold

QUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold QUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold circuit 2. What is the difference between natural sampling

More information

2.1. General Purpose Run Length Encoding Relative Encoding Tokanization or Pattern Substitution

2.1. General Purpose Run Length Encoding Relative Encoding Tokanization or Pattern Substitution 2.1. General Purpose There are many popular general purpose lossless compression techniques, that can be applied to any type of data. 2.1.1. Run Length Encoding Run Length Encoding is a compression technique

More information

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey Workshop on Spoken Language Processing - 2003, TIFR, Mumbai, India, January 9-11, 2003 149 IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES P. K. Lehana and P. C. Pandey Department of Electrical

More information

Chapter 7. Frequency-Domain Representations 语音信号的频域表征

Chapter 7. Frequency-Domain Representations 语音信号的频域表征 Chapter 7 Frequency-Domain Representations 语音信号的频域表征 1 General Discrete-Time Model of Speech Production Voiced Speech: A V P(z)G(z)V(z)R(z) Unvoiced Speech: A N N(z)V(z)R(z) 2 DTFT and DFT of Speech The

More information

CHAPTER 3 Syllabus (2006 scheme syllabus) Differential pulse code modulation DPCM transmitter

CHAPTER 3 Syllabus (2006 scheme syllabus) Differential pulse code modulation DPCM transmitter CHAPTER 3 Syllabus 1) DPCM 2) DM 3) Base band shaping for data tranmission 4) Discrete PAM signals 5) Power spectra of discrete PAM signal. 6) Applications (2006 scheme syllabus) Differential pulse code

More information

Syllabus. osmania university UNIT - I UNIT - II UNIT - III CHAPTER - 1 : INTRODUCTION TO DIGITAL COMMUNICATION CHAPTER - 3 : INFORMATION THEORY

Syllabus. osmania university UNIT - I UNIT - II UNIT - III CHAPTER - 1 : INTRODUCTION TO DIGITAL COMMUNICATION CHAPTER - 3 : INFORMATION THEORY i Syllabus osmania university UNIT - I CHAPTER - 1 : INTRODUCTION TO Elements of Digital Communication System, Comparison of Digital and Analog Communication Systems. CHAPTER - 2 : DIGITAL TRANSMISSION

More information

Components for Signal Compression

Components for Signal Compression 4509ch09.qxd/skm 6/18/99 11:48 AM Page 290 9 Components for Signal Compression The process of signal analysis and modeling described in the previous chapter results in a compact formulation of the information-bearing

More information

ITM 1010 Computer and Communication Technologies

ITM 1010 Computer and Communication Technologies ITM 1010 Computer and Communication Technologies Lecture #20 Review: Communication Technologies 2003 香港中文大學, 電子工程學系 (Prof. H.K.Tsang) ITM 1010 計算機與通訊技術 1 Review of Communication Technologies! Information

More information

COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY

COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY COMPARATIVE REVIEW BETWEEN CELP AND ACELP ENCODER FOR CDMA TECHNOLOGY V.C.TOGADIYA 1, N.N.SHAH 2, R.N.RATHOD 3 Assistant Professor, Dept. of ECE, R.K.College of Engg & Tech, Rajkot, Gujarat, India 1 Assistant

More information

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Speech Synthesis Spring,1999 Lecture 23 N.MORGAN

More information

ECE/OPTI533 Digital Image Processing class notes 288 Dr. Robert A. Schowengerdt 2003

ECE/OPTI533 Digital Image Processing class notes 288 Dr. Robert A. Schowengerdt 2003 Motivation Large amount of data in images Color video: 200Mb/sec Landsat TM multispectral satellite image: 200MB High potential for compression Redundancy (aka correlation) in images spatial, temporal,

More information

Synthesis Algorithms and Validation

Synthesis Algorithms and Validation Chapter 5 Synthesis Algorithms and Validation An essential step in the study of pathological voices is re-synthesis; clear and immediate evidence of the success and accuracy of modeling efforts is provided

More information

Auditory modelling for speech processing in the perceptual domain

Auditory modelling for speech processing in the perceptual domain ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract

More information

Multimedia Communications. Lossless Image Compression

Multimedia Communications. Lossless Image Compression Multimedia Communications Lossless Image Compression Old JPEG-LS JPEG, to meet its requirement for a lossless mode of operation, has chosen a simple predictive method which is wholly independent of the

More information