ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

Similar documents
Digital Audio. Lecture-6

Pulse Code Modulation

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two

Basic Concepts in Data Transmission

EEE 309 Communication Theory

Chapter 2: Digitization of Sound

EEE 309 Communication Theory

Voice Transmission --Basic Concepts--

Digital Communication (650533) CH 3 Pulse Modulation

10 Speech and Audio Signals

Communications IB Paper 6 Handout 3: Digitisation and Digital Signals


Pulse Code Modulation

EEE482F: Problem Set 1

CHAPTER 4. PULSE MODULATION Part 2

Stream Information. A real-time voice signal must be digitized & transmitted as it is produced Analog signal level varies continuously in time

CSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued

CSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued

INTRODUCTION TO COMMUNICATION SYSTEMS LABORATORY IV. Binary Pulse Amplitude Modulation and Pulse Code Modulation

Music 270a: Fundamentals of Digital Audio and Discrete-Time Signals

Downloaded from 1

UNIT TEST I Digital Communication

DIGITAL COMMINICATIONS

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

CMPT 318: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals

Continuous vs. Discrete signals. Sampling. Analog to Digital Conversion. CMPT 368: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals

QUESTION BANK. SUBJECT CODE / Name: EC2301 DIGITAL COMMUNICATION UNIT 2

6. has units of bits/second. a. Throughput b. Propagation speed c. Propagation time d. (b)or(c)

Communications I (ELCN 306)

Chapter 3 Data and Signals

ANALOGUE AND DIGITAL COMMUNICATION

SOME PHYSICAL LAYER ISSUES. Lecture Notes 2A

EITF25 Internet Techniques and Applications L2: Physical layer. Stefan Höst

QUESTION BANK EC 1351 DIGITAL COMMUNICATION YEAR / SEM : III / VI UNIT I- PULSE MODULATION PART-A (2 Marks) 1. What is the purpose of sample and hold

Department of Electronics and Communication Engineering 1

EE390 Final Exam Fall Term 2002 Friday, December 13, 2002

Data Encoding g(p (part 2)

TELECOMMUNICATION SYSTEMS

Sampling, interpolation and decimation issues

Physical Layer: Outline

Chapter 3 Digital Transmission Fundamentals

MULTIMEDIA SYSTEMS

Fundamentals of Digital Communication

Sampling and Pulse Code Modulation Chapter 6

EE482: Digital Signal Processing Applications

QUESTION BANK SUBJECT: DIGITAL COMMUNICATION (15EC61)

Chapter 3 Data Transmission COSC 3213 Summer 2003

Lecture #6: Analog-to-Digital Converter

Communication Theory II

Signals and Systems. Lecture 13 Wednesday 6 th December 2017 DR TANIA STATHAKI

Digital Sampling. This Lecture. Engr325 Instrumentation. Dr Curtis Nelson. Digital sampling Sample rate. Bit depth. Other terms. Types of conversion.

Chapter 4. Digital Audio Representation CS 3570

Chapter 3 Digital Transmission Fundamentals

<#)*,$+0"$#)* ?">& B"$"')*+0"$#)* ?">&? F. S. Blair March 24, Analog and Digital Signals

Lecture 3: Data Transmission

Lecture Schedule: Week Date Lecture Title

Figure 1: Block diagram of Digital signal processing

Analog and Telecommunication Electronics

Data Communication. Chapter 3 Data Transmission

Media Devices: Audio. CTEC1465/2018S Computer System Support

Chapter-3 Waveform Coding Techniques

In this lecture. System Model Power Penalty Analog transmission Digital transmission

Terminology (1) Chapter 3. Terminology (3) Terminology (2) Transmitter Receiver Medium. Data Transmission. Direct link. Point-to-point.

Fundamentals of Data and Signals

Sampling and Reconstruction of Analog Signals

System on a Chip. Prof. Dr. Michael Kraft

Solutions to Information Theory Exercise Problems 5 8

Telecommunication Electronics

Fundamentals of Digital Audio *

Islamic University of Gaza. Faculty of Engineering Electrical Engineering Department Spring-2011

EC 2301 Digital communication Question bank

Chapter Two. Fundamentals of Data and Signals. Data Communications and Computer Networks: A Business User's Approach Seventh Edition

Pulse Code Modulation (PCM)

YEDITEPE UNIVERSITY ENGINEERING FACULTY COMMUNICATION SYSTEMS LABORATORY EE 354 COMMUNICATION SYSTEMS

UNIT I Source Coding Systems

Transmission Impairments

CODING TECHNIQUES FOR ANALOG SOURCES

ECE 2111 Signals and Systems Spring 2012, UMD Experiment 9: Sampling

AUDL Final exam page 1/7 Please answer all of the following questions.

Exercise Problems: Information Theory and Coding

SAMPLING THEORY. Representing continuous signals with discrete numbers

Lecture 3 Concepts for the Data Communications and Computer Interconnection

Design IV. E232 Spring 07

CS441 Mobile & Wireless Computing Communication Basics

Lecture Fundamentals of Data and signals

Communication Theory II

CT111 Introduction to Communication Systems Lecture 9: Digital Communications

2: Audio Basics. Audio Basics. Mark Handley

Data and Computer Communications. Chapter 3 Data Transmission

SIGNALS AND SYSTEMS LABORATORY 13: Digital Communication

Advantages of Analog Representation. Varies continuously, like the property being measured. Represents continuous values. See Figure 12.


COMP467. Local Asynchronous Communication. Goals. Data is usually sent over a single channel one bit at a time.

Introduction to Communications Part Two: Physical Layer Ch3: Data & Signals

END-OF-YEAR EXAMINATIONS ELEC321 Communication Systems (D2) Tuesday, 22 November 2005, 9:20 a.m. Three hours plus 10 minutes reading time.

EXPERIMENTAL INVESTIGATION INTO THE OPTIMAL USE OF DITHER

Chapter 2. Physical Layer

Cyber-Physical Systems ADC / DAC

Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback

DIGITAL COMMUNICATION

Transcription:

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

Analog Sound to Digital Sound

Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz 20KHz] Speech: [200Hz 8KHz]

Digital Representation of Audio Must convert wave form to digital sample quantize compress

Signal Sampling Measure amplitude at regular intervals Sampling is converting a continuous time signal into a discrete time signal Categories: Impulse (ideal) sampling Natural Sampling Sample and Hold operation

Impulse Sampling

Impulse Sampling Impulse train spaced at T s multiplies the signal x(t) in time domain, creating discrete time, continuous amplitude signal x s (t) Impulse train spaced at f s convolutes the signal X(f) in frequency domain, creating Repeating spectrum X s (f) spaced at f s

The Aliasing Effect f s > 2f m f s < 2f m Aliasing happens

Aliasing Under sampling will result in aliasing that will create spectral overlap

Ideal Sampling and Aliasing Sampled signal is discrete in time domain with spacing T s Spectrum will repeat for every f s Hz Aliasing (spectral overlapping) if f s is too small (f s < 2f m ) Nyquist sampling rate f s = 2f m Generally oversampling is done f s > 2f m

Natural Sampling

Natural Sampling Sampling pulse train has a finite width τ Sampled spectrum will repeat itself with a Sinc envelope More realistic modeling Distortion after recovery depends on τ/t s

Different Sampling Models

Nyquist Theorem For lossless digitization, the sampling rate should be at least twice the maximum frequency response. In mathematical terms: f s > 2*f m where f s is sampling frequency and f m is the maximum frequency in the signal

Nyquist Limit max data rate = 2 H log 2 V bits/second, where H = bandwidth (in Hz) V = discrete levels (bits per signal change) Shows the maximum number of bits that can be sent per second on a noiseless channel with a bandwidth of H, if V bits are sent per signal Example: what is the maximum data rate for a 3kHz channel that transmits data using 2 levels (binary)? Solution: H = 3kHz; V = 2; max. data rate = 2x3,000xln2=6,000bits/second

Limited Sampling But what if one cannot sample fast enough? Reduce signal frequency to half of maximum sampling frequency low-pass filter removes higher-frequencies e.g., if max sampling frequency is 22kHz, must low-pass filter a signal down to 11kHz

Sampling Ranges Auditory range 20Hz to 22.05 khz must sample up to to 44.1kHz common examples are 8.000 khz, 11.025 khz, 16.000 khz, 22.05 khz, and 44.1 KHz Speech frequency [200 Hz, 8 khz] sample up to 16 khz but typically 4 khz to 11 khz is used

Sampling Quantization Mapping Quantization Quantization is done to make the signal amplitude discrete 1.5 1 0.5 0-0.5-1 -1.5 Analog Signal Discrete Time Cont. Ampl. Signal Discrete Time & Discrete Ampl Signal Binary Sequence

Sampling and 4-bit quantization

Quantization Typically use 8 bits = 256 levels 16 bits = 65,536 levels How should the levels be distributed? Linearly? (PCM) Perceptually? (u-law) Differential? (DPCM) Adaptively? (ADPCM)

Linear Quantization L levels (L-1)q = 2V p = V pp For large L Lq V pp

Linear Quantization Summary Mean Squared Error (MSE) = q 2 /12 Mean signal power = E[m 2 (t)] Mean SNR = 12 E[m 2 (t)]/q 2 For binary PCM, L = 2 n n bits/sample Let signal bandwidth = B Hz If Nyquist sampling 2B samples/sec If 20% oversampling 1.2(2B) samples/sec Bit rate = 2nB bits/sec Required channel bandwidth = nb Hz

Pulse Code Modulation Pulse modulation Use discrete time samples of analog signals Transmission is composed of analog information sent at different times Variation of pulse amplitude or pulse timing allowed to vary continuously over all values PCM Analog signal is quantized into a number of discrete levels CS 414 - Spring 2012

PCM Mapping

PCM Quantization and Digitization Quantization Digitization

Quantization Index Linear Quantization (PCM) Divide amplitude spectrum into N units (for log 2 N bit quantization) 300 250 200 150 100 50 0 0 50 100 150 200 250 300 Sound Intensity

Non-uniform Quantization

Non-Uniform Quantization In speech signals, very low speech volumes predominates Only 15% of the time, the voltage exceeds the RMS value These low level signals are under represented with uniform quantization Same noise power (q 2 /12) but low signal power The answer is non uniform quantization

Non-uniform Quantization Compress the signal first Then perform linear quantization Result in nonlinear quantization

Uniform Non-Uniform

Quantization Index Perceptual Quantization (u-law) Want intensity values logarithmically mapped over N quantization units 300 250 200 150 100 50 0 0 50 100 150 200 250 300 Sound Intensity

µ-law and A-law Widely used compression algorithms

Differential Pulse Code Modulation (DPCM) What if we look at sample differences, not the samples themselves? d t = x t -x t-1 Differences tend to be smaller Use 4 bits instead of 12, maybe?

Differential Pulse Code Modulation (DPCM) Changes between adjacent samples small Send value, then relative changes value uses full bits, changes use fewer bits E.g., 220, 218, 221, 219, 220, 221, 222, 218,.. (all values between 218 and 222) Difference sequence sent: 220, +2, -3, 2, -1, -1, -1, +4... Result: originally for encoding sequence 0..255 numbers need 8 bits; Difference coding: need only 3 bits

Adaptive Differential Pulse Code Modulation (ADPCM) Adaptive similar to DPCM, but adjusts the width of the quantization steps Encode difference in 4 bits, but vary the mapping of bits to difference dynamically If rapid change, use large differences If slow change, use small differences

Signal-to-Noise Ratio (metric to quantify quality of digital audio)

Signal To Noise Ratio Measures strength of signal to noise SNR (in DB)= Signal energy 10log10( Noiseenergy Given sound form with amplitude in [-A, A] ) Signal energy = 2 A 2 A 1 0.75 0.5 0 0.25 0-0.25-0.5 CS 414 - Spring 2012 -A -0.75-1

Quantization Error Difference between actual and sampled value amplitude between [-A, A] quantization levels = N 2A N e.g., if A = 1, N = 8, = 1/4 1 0.75 0.5 0.25 0-0.25 CS 414 - Spring 2012-0.5

Compute Signal to Noise Ratio Signal energy = ; Noise energy = ; A 2 2 2 12 2A N Noise energy = 2 A 3 N 2 Signal to noise = 2 3N 10log 2 Every bit increases SNR by ~ 6 decibels

Example Consider a full load sinusoidal modulating signal of amplitude A, which utilizes all the representation levels provided The average signal power is P= A 2 /2 The total range of quantizer is 2A because modulating signal swings between A and A. Therefore, if it is N=16 (4-bit quantizer), Δ = 2A/2 4 = A/8 The quantization noise is Δ 2 /12 = A 2 /768 The S/N ratio is (A 2 /2)/(A 2 /768) = 384; SNR (in db) 25.8 db

Data Rates Data rate = sample rate * quantization * channel Compare rates for CD vs. mono audio 8000 samples/second * 8 bits/sample * 1 channel = 8 kbytes / second 44,100 samples/second * 16 bits/sample * 2 channel = 176 kbytes / second ~= 10MB / minute

Comparison and Sampling/Coding Techniques CS 414 - Spring 2012

Summary