Enhanced Waveform Interpolative Coding at 4 kbps
|
|
- Thomas Hood
- 6 years ago
- Views:
Transcription
1 Enhanced Waveform Interpolative Coding at 4 kbps Oded Gottesman, and Allen Gersho Signal Compression Lab. University of California, Santa Barbara [oded, gersho]@scl.ece.ucsb.edu Signal Compression Lab., University of California, Santa Barbara 1
2 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 2
3 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 3
4 Waveform Interpolation (WI) Voiced speech is nearly periodic. Succeeding pitch cycles, have a slowly evolving shape. A continuously evolving sequence of pitch cycle waveforms can be generated. Extract a subsequence of these waveforms ( subsampling ) for quantization, then synthesis of speech can be performed by interpolating missing waveforms ( upsampling ). Signal Compression Lab., University of California, Santa Barbara 4
5 Waveform Decomposition Slowly Evolving Waveform (SEW) - voiced component, is coded: (1) at low temporal resolution, (2) high spectral resolution, (3) with spectral masking distortion. Rapidly Evolving Waveform (REW) - unvoiced component, is coded: (1) at high temporal, (2) low spectral resolution, (3) with spectral and temporal masking. The system is universal for all speech sounds (no V/UV classification needed) Signal Compression Lab., University of California, Santa Barbara 5
6 Waveform Decomposition (cont d( cont d) A speech segment typically contains both voiced and unvoiced attributes. Different perceived character of voiced and unvoiced components suggests a separation of the components and distinct perceptually based coding. Signal Compression Lab., University of California, Santa Barbara 6
7 Waveform Extraction Time Pitch-Normalized Cycle Phase Signal Compression * From Kleijn Lab., University and Haagen, of California, ICASSP95 Santa Barbara 7
8 Waveform Decomposition Signal Compression Lab., University of California, Santa Barbara 8 * From Kleijn and Haagen, ICASSP95
9 Waveform Synthesis φ () π t = φ ( t ) d p() τ τ m + 2 t t m Instantaneous phase contour t m-2 t m-1 Time t m t m+1 t m+2 Pitch-Normalized Cycle Phase Signal Compression Lab., University of California, Santa Barbara 9 * From Kleijn and Haagen, ICASSP95
10 WI Encoder Speech S(n) A(z) Residual r(n) Waveform Extraction Waveform Normalization DFT LPC Analysis Pitch Extraction Waveform Alignment LPC Interpolator Pitch Interpolator Waveform Power Delay LPC Quantizer Pitch Quantizer CW Waveform Decomposition Local Decoder AbS AGC REW SEW 10 Gain Quantizer REW Quantizer SEW Quantizer Signal Compression Lab., University of California, Santa Barbara 10
11 WI Decoder Delay Waveform Interpolation SEW Decoder REW Decoder Waveform Alignment 10 SEW Phase REW Phase + Power Normalization x IDFT Power Decoder Overlap- Add Interpolation Pitch Decoder LPC Decoder Pitch Interpolator LPC Interpolator p(n) Phase Contour φ (n) 1 A(z) Reconst. Residual Reconst. Speech Signal Compression Lab., University of California, Santa Barbara 11
12 Continuous Waveform Interpolation Over the interpolation interval t t m t m+1, the continuous reconstructed excitation signal is given by the time dependent Fourier series: rt () = [ ] K() t ( ( t)) Ak( tm) + ( t) Ak( tm ) cos( k ( t)) α α 1 φ [ t B t t B t ] k = 0 ( 1 α( )) k( m) + α( ) k( m ) sin( k ( t)) + 1 φ π φ() t = φ( tm ) + 2 dτ p( τ ) t t m where φ(t), is the instantaneous phase contour, and α(t) is some (increasing) interpolation function in the range 0 α(t) 1, which can be a simple linear function of the time, t, or of the pitchnormalized phase φ. Signal Compression Lab., University of California, Santa Barbara 12
13 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 13
14 SEW Quantization SEW is downsampled - quantized - upsampled Non-ideal filters used: aliasing + mirroring Distortion most notable in the transitions How can we improve quantization? Signal Compression Lab., University of California, Santa Barbara 14
15 SEW Filter Bank CW C 0 (t) LPF h(t) Anti-Aliasing (Decomposition) SEW M M LPF Interpolation Quantized SEW e j ω 0 t C 1 (t) LPF h(t) Anti-Aliasing (Decomposition) M SEW VQ M LPF Interpolation X SEW excitation + e j ω 0 Kt C K (t) LPF h(t) Anti-Aliasing (Decomposition) M M LPF Interpolation X Signal Compression Lab., University of California, Santa Barbara 15
16 Waveform Based SEW AbS LPC Analysis LPC Interpolation Speech A(z) Residual Waveform Extraction + Alignment + Decomposition Pitch Extraction r 1 r M+L-1 A 1 A M+L-1 Waveform Codebooks z -1 r^ 0 ^r M Waveform Synthesizer ( M Interpolation +Lookahead extrapolation) ~ r Lookahead ~ r M+L W 1 (z) A 1 (z) W M+L-1 (z) A M+L-1 (z) * 2 * 2 [1-α(t M+L-1 )] 2 + min D wi M - Number of waveforms per frame L - Number of lookahead waveforms Signal Compression Lab., University of California, Santa Barbara 16
17 SEW Optimization Examples 5000 Original 1 x 104 Original Amplitude 0 Amplitude Optimized x 104 Optimized Amplitude 0 Amplitude Non-optimized x 104 Non-optimized Amplitude 0 Amplitude Time (sec) Time (sec) Signal Compression Lab., University of California, Santa Barbara 17
18 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 18
19 Observations: Phase Quantization Phase is of secondary perceptual significance No efficient phase quantization scheme is known. Two extremes: Waveform coders (CELP) implicitly allocate a perceptually-excessive number of bits to the phase. In parametric coders the phase information is commonly not transmitted: How can the phase be quantized efficiently? Signal Compression Lab., University of California, Santa Barbara 19
20 Dispersion Phase ϕ Dispersion Phase: Pitch cycle extracted from residual signal cyclically shifted such that its pulse is located at position zero. resulting DFT phase: ϕ ϕ determines (along with the magnitude), the waveform s pulse shape. Signal Compression Lab., University of California, Santa Barbara 20
21 AbS Phase Dispersion Quantization Pitch-Cycle Waveform s DFT Crude Linear- Phase Alignment Refined Linear- Phase Alignment Magnitude r ^ r^ x - Codebook + r Phase Codebook e jϕ^ W(z) A(z) Pitch min * 2 Signal Compression Lab., University of California, Santa Barbara 21
22 Vector Quantization Design Variable-Dimension VQ Eight pitch-range dependent codebooks Pitch changes over time cause the quantizer to switch among the pitch-range codebooks. Solution: overlapped training clusters used, to achieve smooth phase variations whenever such switch occurs. 0-6 bit codebooks were designed and tested. Signal Compression Lab., University of California, Santa Barbara 22
23 Vector Quantization Design (cont d( cont d) Range for... codebook #1 Range for codebook #7 Range for codebook # Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 K Number of harmonics (vector dimension) Cluster 7 Cluster 8 Signal Compression Lab., University of California, Santa Barbara 23
24 Phase Code-Vector Examples Codebook #1 Codebook #2 Codebook #3 Codebook # Codebook #5 Codebook #6 Codebook #7 Codebook # Signal Compression Lab., University of California, Santa Barbara 24-3
25 Segmental Weighted SNR of 14 Phase VQ Seg. Weighted SNR: db Non-MIRS (Flat) MIRS Phase Bits Signal Compression Lab., University of California, Santa Barbara 25
26 Subjective Results Subjective Score 50.00% 45.00% 40.00% 35.00% 30.00% 25.00% 20.00% 15.00% 10.00% 5.00% 0.00% Female 4bit VQ Male Extracted No Preference 1 2 Male Results of subjective A/B test for comparison between the 4-bit phase VQ, and male extracted fixed phase. Signal Compression Lab., University of California, Santa Barbara 26
27 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 27
28 Pitch Search Pitch search robustness needed for: high and low pitch transitions segments with rapidly varying pitch Our suggested solutions: Combined temporal/spectral domain search Higher rate temporal domain search Varying boundaries based search Average pitch - using normalized correlation as weighting Signal Compression Lab., University of California, Santa Barbara 28
29 Pitch Search Algorithm Speech Spectral domain pitch search + tracker 100 Hz No Good Pitch? Yes Weighted speech Temporal domain pitch refinement 500 Hz Temporal domain pitch search No Good Pitches? Good Pitches? Yes 500 Hz Yes No Use 4 ms waveform length Weighted-Average Pitch Signal Compression Lab., University of California, Santa Barbara Hz
30 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 30
31 Switched-Predictive Gain VQ using Temporal Weighting VQ of Log-Gain (no gain down/up-sampling). Temporal weighting used for quantization avoids smear of plosives and onsets. Switched-Predictive VQ allows different gain predictors, and reduces outliers. Switched DC levels improve performance. Signal Compression Lab., University of California, Santa Barbara 31
32 Switched-Predictive AbS Log-Gain Gain VQ g(m) + DC Codebook D i Predictor Codebook Vector Quantizer Codebook P i c ij (m) Synthesis Filter 1 1 z P i 1 ^ t(m) + t(m) Temporal Weighting min * 2 Signal Compression Lab., University of California, Santa Barbara 32
33 Bit Allocation of EWI Coder Parameter Bits / Frame Bits / second LPC Pitch 2x6= Gain 2x6= REW SEW magnitude SEW phase Total Frame size: 20 ms Two subframes per frame Signal Compression Lab., University of California, Santa Barbara 33
34 Description of Waveform-Interpolative coder Our WI Coder s Novel Techniques AbS SEW VQ AbS Dispersion-Phase VQ Pitch search for transitions AbS Switched-Predictive Gain VQ Subjective Tests Summary Demo Outline Signal Compression Lab., University of California, Santa Barbara 34
35 A/B subjective Test 14 Listeners 12 male + 12 female sentences. MIRS filtered speech. Test 4 kbps WI 4 kbps MPEG-4 Female 65% 35% Male 62% 38% Total 64% 36% With 95% certainty the WI preference lies in [59%, 69%] WI is preferred in % certainty. Signal Compression Lab., University of California, Santa Barbara 35
36 A/B subjective Test (cont d( cont d) Test 4 kbps WI 5.3 kbps G Female 58% 42% Male 61% 39% Total 60% 40% With 95% certainty the WI preference lies in [54%, 65%] WI is preferred in 99.98% certainty. Test 4 kbps WI 6.3 kbps G Female 55% 45% Male 53% 47% Total 54% 46% With 95% certainty the WI preference lies in [49%, 59%] WI is preferred in 92.21% certainty. Signal Compression Lab., University of California, Santa Barbara 36
37 Summary AbS VQ for the SEW enhances coding efficiency. AbS VQ for the dispersion phase improves speech naturalness and waveform matching. New pitch search improves accuracy and robustness. Switched-Predictive Gain VQ avoids gain smear, and reduces outliers. Enhanced WI coder slightly exceeds quality of G at 6.3 kbps. Signal Compression Lab., University of California, Santa Barbara 37
38 References [1] O. Gottesman, Dispersion Phase Vector Quantization For Enhancement of Waveform Interpolative Coder, IEEE ICASSP 99, vol. 1, pp , [2] O. Gottesman and A. Gersho, "Enhanced Waveform Interpolative Coding at 4 kbps", IEEE Workshop on Speech Coding Proceedings, pp , 1999, Finland. [3] O. Gottesman and A. Gersho, "Enhanced Analysis-by- Synthesis Waveform Interpolative Coding at 4 kbps", EUROSPEECH 99, 1999, Hungary. Signal Compression Lab., University of California, Santa Barbara 38
39 SCW 99 Best Paper Award Signal Compression Lab., University of California, Santa Barbara 39
A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder
A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder Jing Wang, Jingg Kuang, and Shenghui Zhao Research Center of Digital Communication Technology,Department of Electronic
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationChapter IV THEORY OF CELP CODING
Chapter IV THEORY OF CELP CODING CHAPTER IV THEORY OF CELP CODING 4.1 Introduction Wavefonn coders fail to produce high quality speech at bit rate lower than 16 kbps. Source coders, such as LPC vocoders,
More informationImproved signal analysis and time-synchronous reconstruction in waveform interpolation coding
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2000 Improved signal analysis and time-synchronous reconstruction in waveform
More informationDigital Speech Processing and Coding
ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/
More informationAnalysis/synthesis coding
TSBK06 speech coding p.1/32 Analysis/synthesis coding Many speech coders are based on a principle called analysis/synthesis coding. Instead of coding a waveform, as is normally done in general audio coders
More informationOverview of Code Excited Linear Predictive Coder
Overview of Code Excited Linear Predictive Coder Minal Mulye 1, Sonal Jagtap 2 1 PG Student, 2 Assistant Professor, Department of E&TC, Smt. Kashibai Navale College of Engg, Pune, India Abstract Advances
More informationThe Channel Vocoder (analyzer):
Vocoders 1 The Channel Vocoder (analyzer): The channel vocoder employs a bank of bandpass filters, Each having a bandwidth between 100 Hz and 300 Hz. Typically, 16-20 linear phase FIR filter are used.
More informationspeech signal S(n). This involves a transformation of S(n) into another signal or a set of signals
16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract
More informationDistributed Speech Recognition Standardization Activity
Distributed Speech Recognition Standardization Activity Alex Sorin, Ron Hoory, Dan Chazan Telecom and Media Systems Group June 30, 2003 IBM Research Lab in Haifa Advanced Speech Enabled Services ASR App
More informationCommunications Theory and Engineering
Communications Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Speech and telephone speech Based on a voice production model Parametric representation
More informationSpeech Coding Technique And Analysis Of Speech Codec Using CS-ACELP
Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP Monika S.Yadav Vidarbha Institute of Technology Rashtrasant Tukdoji Maharaj Nagpur University, Nagpur, India monika.yadav@rediffmail.com
More informationL19: Prosodic modification of speech
L19: Prosodic modification of speech Time-domain pitch synchronous overlap add (TD-PSOLA) Linear-prediction PSOLA Frequency-domain PSOLA Sinusoidal models Harmonic + noise models STRAIGHT This lecture
More informationTranscoding of Narrowband to Wideband Speech
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2005 Transcoding of Narrowband to Wideband Speech Christian H. Ritz University
More informationSpeech Coding in the Frequency Domain
Speech Coding in the Frequency Domain Speech Processing Advanced Topics Tom Bäckström Aalto University October 215 Introduction The speech production model can be used to efficiently encode speech signals.
More informationSpeech Synthesis; Pitch Detection and Vocoders
Speech Synthesis; Pitch Detection and Vocoders Tai-Shih Chi ( 冀泰石 ) Department of Communication Engineering National Chiao Tung University May. 29, 2008 Speech Synthesis Basic components of the text-to-speech
More informationAccurate Delay Measurement of Coded Speech Signals with Subsample Resolution
PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,
More informationEE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley
University of California Berkeley College of Engineering Department of Electrical Engineering and Computer Sciences Professors : N.Morgan / B.Gold EE225D Spring,1999 Medium & High Rate Coding Lecture 26
More informationEffects of Reverberation on Pitch, Onset/Offset, and Binaural Cues
Effects of Reverberation on Pitch, Onset/Offset, and Binaural Cues DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction Human performance Reverberation
More informationWaveform interpolation speech coding
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 1998 Waveform interpolation speech coding Jun Ni University of
More informationPreeti Rao 2 nd CompMusicWorkshop, Istanbul 2012
Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012 o Music signal characteristics o Perceptual attributes and acoustic properties o Signal representations for pitch detection o STFT o Sinusoidal model o
More informationRobust Voice Activity Detection Based on Discrete Wavelet. Transform
Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper
More informationDEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD
NOT MEASUREMENT SENSITIVE 20 December 1999 DEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD ANALOG-TO-DIGITAL CONVERSION OF VOICE BY 2,400 BIT/SECOND MIXED EXCITATION LINEAR PREDICTION (MELP)
More informationA 600 BPS MELP VOCODER FOR USE ON HF CHANNELS
A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS Mark W. Chamberlain Harris Corporation, RF Communications Division 1680 University Avenue Rochester, New York 14610 ABSTRACT The U.S. government has developed
More informationEC 6501 DIGITAL COMMUNICATION UNIT - II PART A
EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing
More informationSimulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder
COMPUSOFT, An international journal of advanced computer technology, 3 (3), March-204 (Volume-III, Issue-III) ISSN:2320-0790 Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech
More informationTechniques for low-rate scalable compression of speech signals
University of Wollongong Research Online University of Wollongong Thesis Collection University of Wollongong Thesis Collections 2002 Techniques for low-rate scalable compression of speech signals Jason
More informationSpeech Synthesis using Mel-Cepstral Coefficient Feature
Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract
More informationPage 0 of 23. MELP Vocoder
Page 0 of 23 MELP Vocoder Outline Introduction MELP Vocoder Features Algorithm Description Parameters & Comparison Page 1 of 23 Introduction Traditional pitched-excited LPC vocoders use either a periodic
More informationConverting Speaking Voice into Singing Voice
Converting Speaking Voice into Singing Voice 1 st place of the Synthesis of Singing Challenge 2007: Vocal Conversion from Speaking to Singing Voice using STRAIGHT by Takeshi Saitou et al. 1 STRAIGHT Speech
More informationDECOMPOSITION OF SPEECH INTO VOICED AND UNVOICED COMPONENTS BASED ON A KALMAN FILTERBANK
DECOMPOSITIO OF SPEECH ITO VOICED AD UVOICED COMPOETS BASED O A KALMA FILTERBAK Mark Thomson, Simon Boland, Michael Smithers 3, Mike Wu & Julien Epps Motorola Labs, Botany, SW 09 Cross Avaya R & D, orth
More informationSpeech Compression Using Voice Excited Linear Predictive Coding
Speech Compression Using Voice Excited Linear Predictive Coding Ms.Tosha Sen, Ms.Kruti Jay Pancholi PG Student, Asst. Professor, L J I E T, Ahmedabad Abstract : The aim of the thesis is design good quality
More informationI D I A P R E S E A R C H R E P O R T. June published in Interspeech 2008
R E S E A R C H R E P O R T I D I A P Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain Sriram Ganapathy a b Petr Motlicek a Hynek Hermansky a b Harinath
More informationIN RECENT YEARS, there has been a great deal of interest
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL 12, NO 1, JANUARY 2004 9 Signal Modification for Robust Speech Coding Nam Soo Kim, Member, IEEE, and Joon-Hyuk Chang, Member, IEEE Abstract Usually,
More informationFlexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders
Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders Václav Eksler, Bruno Bessette, Milan Jelínek, Tommy Vaillancourt University of Sherbrooke, VoiceAge Corporation Montreal, QC,
More informationAdvanced audio analysis. Martin Gasser
Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high
More informationAdaptive Forward-Backward Quantizer for Low Bit Rate. High Quality Speech Coding. University of Missouri-Columbia. Columbia, MO 65211
Adaptive Forward-Backward Quantizer for Low Bit Rate High Quality Speech Coding Jozsef Vass Yunxin Zhao y Xinhua Zhuang Department of Computer Engineering & Computer Science University of Missouri-Columbia
More informationVoice Excited Lpc for Speech Compression by V/Uv Classification
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech
More informationVocoder (LPC) Analysis by Variation of Input Parameters and Signals
ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of
More informationSNR Scalability, Multiple Descriptions, and Perceptual Distortion Measures
SNR Scalability, Multiple Descriptions, Perceptual Distortion Measures Jerry D. Gibson Department of Electrical & Computer Engineering University of California, Santa Barbara gibson@mat.ucsb.edu Abstract
More informationComparison of CELP speech coder with a wavelet method
University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2006 Comparison of CELP speech coder with a wavelet method Sriram Nagaswamy University of Kentucky, sriramn@gmail.com
More informationImplementation of attractive Speech Quality for Mixed Excited Linear Prediction
IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) e-issn: 2278-1676,p-ISSN: 2320-3331, Volume 9, Issue 2 Ver. I (Mar Apr. 2014), PP 07-12 Implementation of attractive Speech Quality for
More informationLow Bit Rate Speech Coding
Low Bit Rate Speech Coding Jaspreet Singh 1, Mayank Kumar 2 1 Asst. Prof.ECE, RIMT Bareilly, 2 Asst. Prof.ECE, RIMT Bareilly ABSTRACT Despite enormous advances in digital communication, the voice is still
More informationQuantisation mechanisms in multi-protoype waveform coding
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 1996 Quantisation mechanisms in multi-protoype waveform coding
More information3GPP TS V8.0.0 ( )
TS 46.022 V8.0.0 (2008-12) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Half rate speech; Comfort noise aspects for the half rate
More informationEEE 309 Communication Theory
EEE 309 Communication Theory Semester: January 2016 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Part 05 Pulse Code
More informationON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP
ON-LINE LABORATORIES FOR SPEECH AND IMAGE PROCESSING AND FOR COMMUNICATION SYSTEMS USING J-DSP A. Spanias, V. Atti, Y. Ko, T. Thrasyvoulou, M.Yasin, M. Zaman, T. Duman, L. Karam, A. Papandreou, K. Tsakalis
More informationSpanning the 4 kbps divide using pulse modeled residual
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2002 Spanning the 4 kbps divide using pulse modeled residual J Lukasiak
More informationCellular systems & GSM Wireless Systems, a.a. 2014/2015
Cellular systems & GSM Wireless Systems, a.a. 2014/2015 Un. of Rome La Sapienza Chiara Petrioli Department of Computer Science University of Rome Sapienza Italy 2 Voice Coding 3 Speech signals Voice coding:
More informationSPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN. Yu Wang and Mike Brookes
SPEECH ENHANCEMENT USING A ROBUST KALMAN FILTER POST-PROCESSOR IN THE MODULATION DOMAIN Yu Wang and Mike Brookes Department of Electrical and Electronic Engineering, Exhibition Road, Imperial College London,
More informationAudio Signal Compression using DCT and LPC Techniques
Audio Signal Compression using DCT and LPC Techniques P. Sandhya Rani#1, D.Nanaji#2, V.Ramesh#3,K.V.S. Kiran#4 #Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,
More informationA Full-Band Adaptive Harmonic Representation of Speech
A Full-Band Adaptive Harmonic Representation of Speech Gilles Degottex and Yannis Stylianou {degottex,yannis}@csd.uoc.gr University of Crete - FORTH - Swiss National Science Foundation G. Degottex & Y.
More informationSynchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech
INTERSPEECH 5 Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech M. A. Tuğtekin Turan and Engin Erzin Multimedia, Vision and Graphics Laboratory,
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationMASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering
2004:003 CIV MASTER'S THESIS Speech Compression and Tone Detection in a Real-Time System Kristina Berglund MSc Programmes in Engineering Department of Computer Science and Electrical Engineering Division
More informationtechniques are means of reducing the bandwidth needed to represent the human voice. In mobile
8 2. LITERATURE SURVEY The available radio spectrum for the wireless radio communication is very limited hence to accommodate maximum number of users the speech is compressed. The speech compression techniques
More informationA new quad-tree segmented image compression scheme using histogram analysis and pattern matching
University of Wollongong Research Online University of Wollongong in Dubai - Papers University of Wollongong in Dubai A new quad-tree segmented image compression scheme using histogram analysis and pattern
More informationScalable speech coding spanning the 4 Kbps divide
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2003 Scalable speech coding spanning the 4 Kbps divide J Lukasiak University
More informationQUESTION BANK. SUBJECT CODE / Name: EC2301 DIGITAL COMMUNICATION UNIT 2
QUESTION BANK DEPARTMENT: ECE SEMESTER: V SUBJECT CODE / Name: EC2301 DIGITAL COMMUNICATION UNIT 2 BASEBAND FORMATTING TECHNIQUES 1. Why prefilterring done before sampling [AUC NOV/DEC 2010] The signal
More information10 Speech and Audio Signals
0 Speech and Audio Signals Introduction Speech and audio signals are normally converted into PCM, which can be stored or transmitted as a PCM code, or compressed to reduce the number of bits used to code
More informationProject 0: Part 2 A second hands-on lab on Speech Processing Frequency-domain processing
Project : Part 2 A second hands-on lab on Speech Processing Frequency-domain processing February 24, 217 During this lab, you will have a first contact on frequency domain analysis of speech signals. You
More informationARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION
ARTIFICIAL BANDWIDTH EXTENSION OF NARROW-BAND SPEECH SIGNALS VIA HIGH-BAND ENERGY ESTIMATION Tenkasi Ramabadran and Mark Jasiuk Motorola Labs, Motorola Inc., 1301 East Algonquin Road, Schaumburg, IL 60196,
More informationEEE 309 Communication Theory
EEE 309 Communication Theory Semester: January 2017 Dr. Md. Farhad Hossain Associate Professor Department of EEE, BUET Email: mfarhadhossain@eee.buet.ac.bd Office: ECE 331, ECE Building Types of Modulation
More informationVQ Source Models: Perceptual & Phase Issues
VQ Source Models: Perceptual & Phase Issues Dan Ellis & Ron Weiss Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Eng., Columbia Univ., NY USA {dpwe,ronw}@ee.columbia.edu
More information651 Analysis of LSF frame selection in voice conversion
651 Analysis of LSF frame selection in voice conversion Elina Helander 1, Jani Nurminen 2, Moncef Gabbouj 1 1 Institute of Signal Processing, Tampere University of Technology, Finland 2 Noia Technology
More informationReduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter
Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC
More informationON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN. 1 Introduction. Zied Mnasri 1, Hamid Amiri 1
ON THE RELATIONSHIP BETWEEN INSTANTANEOUS FREQUENCY AND PITCH IN SPEECH SIGNALS Zied Mnasri 1, Hamid Amiri 1 1 Electrical engineering dept, National School of Engineering in Tunis, University Tunis El
More informationENEE408G Multimedia Signal Processing
ENEE408G Multimedia Signal Processing Design Project on Digital Speech Processing Goals: 1. Learn how to use the linear predictive model for speech analysis and synthesis. 2. Implement a linear predictive
More informationEC 2301 Digital communication Question bank
EC 2301 Digital communication Question bank UNIT I Digital communication system 2 marks 1.Draw block diagram of digital communication system. Information source and input transducer formatter Source encoder
More informationCommunications I (ELCN 306)
Communications I (ELCN 306) c Samy S. Soliman Electronics and Electrical Communications Engineering Department Cairo University, Egypt Email: samy.soliman@cu.edu.eg Website: http://scholar.cu.edu.eg/samysoliman
More informationACCURATE SPEECH DECOMPOSITION INTO PERIODIC AND APERIODIC COMPONENTS BASED ON DISCRETE HARMONIC TRANSFORM
5th European Signal Processing Conference (EUSIPCO 007), Poznan, Poland, September 3-7, 007, copyright by EURASIP ACCURATE SPEECH DECOMPOSITIO ITO PERIODIC AD APERIODIC COMPOETS BASED O DISCRETE HARMOIC
More informationAPPLICATIONS OF DSP OBJECTIVES
APPLICATIONS OF DSP OBJECTIVES This lecture will discuss the following: Introduce analog and digital waveform coding Introduce Pulse Coded Modulation Consider speech-coding principles Introduce the channel
More informationDefense Technical Information Center Compilation Part Notice
UNCLASSIFIED Defense Technical Information Center Compilation Part Notice ADP010883 TITLE: The Turkish Narrow Band Voice Coding and Noise Pre-Processing NATO Candidate DISTRIBUTION: Approved for public
More informationSTUFF HAPPENS. A Naive/Ideal Communication System Flat Fading What if... idealized system. 9: Stuff Happens
STUFF HAPPENS A Naive/Ideal Communication System Flat Fading What if... idealized system Software Receiver Design Johnson/Sethares/Klein / 5 A Naive/Ideal Communication System With a perfect (i.e. gain
More informationMUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting
MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting Julius O. Smith III (jos@ccrma.stanford.edu) Center for Computer Research in Music and Acoustics (CCRMA)
More informationE : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21
E85.267: Lecture 8 Source-Filter Processing E85.267: Lecture 8 Source-Filter Processing 21-4-1 1 / 21 Source-filter analysis/synthesis n f Spectral envelope Spectral envelope Analysis Source signal n 1
More informationIMPROVED CODING OF TONAL COMPONENTS IN MPEG-4 AAC WITH SBR
IMPROVED CODING OF TONAL COMPONENTS IN MPEG-4 AAC WITH SBR Tomasz Żernici, Mare Domańsi, Poznań University of Technology, Chair of Multimedia Telecommunications and Microelectronics, Polana 3, 6-965, Poznań,
More informationWaveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two
Chapter Two Layout: 1. Introduction. 2. Pulse Code Modulation (PCM). 3. Differential Pulse Code Modulation (DPCM). 4. Delta modulation. 5. Adaptive delta modulation. 6. Sigma Delta Modulation (SDM). 7.
More informationEnhanced Variable Rate Codec, Speech Service Options 3, 68, 70, and 73 for Wideband Spread Spectrum Digital Systems
GPP C.S00-D Version.0 October 00 Enhanced Variable Rate Codec, Speech Service Options,, 0, and for Wideband Spread Spectrum Digital Systems 00 GPP GPP and its Organizational Partners claim copyright in
More informationAdaptive time scale modification of speech for graceful degrading voice quality in congested networks
Adaptive time scale modification of speech for graceful degrading voice quality in congested networks Prof. H. Gokhan ILK Ankara University, Faculty of Engineering, Electrical&Electronics Eng. Dept 1 Contact
More informationAudio processing methods on marine mammal vocalizations
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio http://labrosa.ee.columbia.edu Sound to Signal sound is pressure
More informationSpeech/Non-speech detection Rule-based method using log energy and zero crossing rate
Digital Speech Processing- Lecture 14A Algorithms for Speech Processing Speech Processing Algorithms Speech/Non-speech detection Rule-based method using log energy and zero crossing rate Single speech
More informationHST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007
MIT OpenCourseWare http://ocw.mit.edu HST.582J / 6.555J / 16.456J Biomedical Signal and Image Processing Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationCompression. Encryption. Decryption. Decompression. Presentation of Information to client site
DOCUMENT Anup Basu Audio Image Video Data Graphics Objectives Compression Encryption Network Communications Decryption Decompression Client site Presentation of Information to client site Multimedia -
More informationIMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM
IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM Mr. M. Mathivanan Associate Professor/ECE Selvam College of Technology Namakkal, Tamilnadu, India Dr. S.Chenthur
More informationAn objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec
An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec Akira Nishimura 1 1 Department of Media and Cultural Studies, Tokyo University of Information Sciences,
More informationENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC.
ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC Jérémie Lecomte, Adrian Tomasek, Goran Marković, Michael Schnabel, Kimitaka Tsutsumi, Kei Kikuiri Fraunhofer IIS, Erlangen, Germany,
More informationECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2
ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 Analog Sound to Digital Sound Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre
More informationStructure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping
Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics
More informationSGN Audio and Speech Processing
Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations
More informationYEDITEPE UNIVERSITY ENGINEERING FACULTY COMMUNICATION SYSTEMS LABORATORY EE 354 COMMUNICATION SYSTEMS
YEDITEPE UNIVERSITY ENGINEERING FACULTY COMMUNICATION SYSTEMS LABORATORY EE 354 COMMUNICATION SYSTEMS EXPERIMENT 3: SAMPLING & TIME DIVISION MULTIPLEX (TDM) Objective: Experimental verification of the
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationAn Approach to Very Low Bit Rate Speech Coding
Computing For Nation Development, February 26 27, 2009 Bharati Vidyapeeth s Institute of Computer Applications and Management, New Delhi An Approach to Very Low Bit Rate Speech Coding Hari Kumar Singh
More informationImproving Sound Quality by Bandwidth Extension
International Journal of Scientific & Engineering Research, Volume 3, Issue 9, September-212 Improving Sound Quality by Bandwidth Extension M. Pradeepa, M.Tech, Assistant Professor Abstract - In recent
More informationReading: Johnson Ch , Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday.
L105/205 Phonetics Scarborough Handout 7 10/18/05 Reading: Johnson Ch.2.3.3-2.3.6, Ch.5.5 (today); Liljencrants & Lindblom; Stevens (Tues) reminder: no class on Thursday Spectral Analysis 1. There are
More informationAudio Compression using the MLT and SPIHT
Audio Compression using the MLT and SPIHT Mohammed Raad, Alfred Mertins and Ian Burnett School of Electrical, Computer and Telecommunications Engineering University Of Wollongong Northfields Ave Wollongong
More informationSpeech Signal Analysis
Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for
More informationChapter 7. Frequency-Domain Representations 语音信号的频域表征
Chapter 7 Frequency-Domain Representations 语音信号的频域表征 1 General Discrete-Time Model of Speech Production Voiced Speech: A V P(z)G(z)V(z)R(z) Unvoiced Speech: A N N(z)V(z)R(z) 2 DTFT and DFT of Speech The
More informationLOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP. Outline
LOSS CONCEALMENTS FOR LOW-BIT-RATE PACKET VOICE IN VOIP Benjamin W. Wah Department of Electrical and Computer Engineering and the Coordinated Science Laboratory University of Illinois at Urbana-Champaign
More informationMULTIMEDIA SYSTEMS
1 Department of Computer Engineering, Faculty of Engineering King Mongkut s Institute of Technology Ladkrabang 01076531 MULTIMEDIA SYSTEMS Pk Pakorn Watanachaturaporn, Wt ht Ph.D. PhD pakorn@live.kmitl.ac.th,
More information