Adaptive Filters Application of Linear Prediction

Similar documents
Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt

Adaptive Filters Linear Prediction

Adaptive Filters Wiener Filter

EE482: Digital Signal Processing Applications

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Digital Signal Processing

System analysis and signal processing

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

The University of Texas at Austin Dept. of Electrical and Computer Engineering Midterm #1

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Speech Synthesis using Mel-Cepstral Coefficient Feature

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Advanced Digital Signal Processing Part 5: Digital Filters

The University of Texas at Austin Dept. of Electrical and Computer Engineering Final Exam

EE 470 Signals and Systems

ECE438 - Laboratory 7a: Digital Filter Design (Week 1) By Prof. Charles Bouman and Prof. Mireille Boutin Fall 2015

RASTA-PLP SPEECH ANALYSIS. Aruna Bayya. Phil Kohn y TR December 1991

Signal Processing Toolbox

Gerhard Schmidt / Tim Haulick Recent Tends for Improving Automotive Speech Enhancement Systems. Geneva, 5-7 March 2008

Pattern Recognition Part 2: Noise Suppression

DIGITAL FILTERS. !! Finite Impulse Response (FIR) !! Infinite Impulse Response (IIR) !! Background. !! Matlab functions AGC DSP AGC DSP

(i) Understanding of the characteristics of linear-phase finite impulse response (FIR) filters

Digital Filters IIR (& Their Corresponding Analog Filters) Week Date Lecture Title

4. Design of Discrete-Time Filters

Problem Point Value Your score Topic 1 28 Discrete-Time Filter Analysis 2 24 Upconversion 3 30 Filter Design 4 18 Potpourri Total 100

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Understanding Digital Signal Processing

Communications Theory and Engineering

(i) Understanding of the characteristics of linear-phase finite impulse response (FIR) filters

Signals. Continuous valued or discrete valued Can the signal take any value or only discrete values?

Signals and Systems Using MATLAB

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

Overview of Code Excited Linear Predictive Coder

Advanced audio analysis. Martin Gasser


WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

Copyright S. K. Mitra

Part B. Simple Digital Filters. 1. Simple FIR Digital Filters

Design of FIR Filter for Efficient Utilization of Speech Signal Akanksha. Raj 1 Arshiyanaz. Khateeb 2 Fakrunnisa.Balaganur 3

Advanced Digital Signal Processing Part 2: Digital Processing of Continuous-Time Signals

DSP Laboratory (EELE 4110) Lab#10 Finite Impulse Response (FIR) Filters

University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005

AN AUTOREGRESSIVE BASED LFM REVERBERATION SUPPRESSION FOR RADAR AND SONAR APPLICATIONS

APPLICATIONS OF DSP OBJECTIVES

Lecture 4 Biosignal Processing. Digital Signal Processing and Analysis in Biomedical Systems

CG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003

Chapter 4 SPEECH ENHANCEMENT

Multimedia Signal Processing: Theory and Applications in Speech, Music and Communications

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

The University of Texas at Austin Dept. of Electrical and Computer Engineering Midterm #2

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Spectral analysis of seismic signals using Burg algorithm V. Ravi Teja 1, U. Rakesh 2, S. Koteswara Rao 3, V. Lakshmi Bharathi 4

ELEC-C5230 Digitaalisen signaalinkäsittelyn perusteet

Digital Filtering: Realization

Signal segmentation and waveform characterization. Biosignal processing, S Autumn 2012

Digital Signal Processing

EE482: Digital Signal Processing Applications

Digital Filters FIR and IIR Systems

Speech Enhancement using Wiener filtering

Digital Signal Processing

Problem Point Value Your score Topic 1 28 Filter Analysis 2 24 Filter Implementation 3 24 Filter Design 4 24 Potpourri Total 100

FOURIER analysis is a well-known method for nonparametric

Level I Signal Modeling and Adaptive Spectral Analysis

Applications of Music Processing

A Comparative Study of Formant Frequencies Estimation Techniques

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

Topic. Spectrogram Chromagram Cesptrogram. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio

Audio Engineering Society. Convention Paper. Presented at the 115th Convention 2003 October New York, New York

Discrete-Time Signal Processing (DTSP) v14

Multirate Digital Signal Processing

Advanced Signal Processing and Digital Noise Reduction

AUTOMATIC EQUALIZATION FOR IN-CAR COMMUNICATION SYSTEMS

Signal processing preliminaries

Instruction Manual for Concept Simulators. Signals and Systems. M. J. Roberts

Speech Compression Using Voice Excited Linear Predictive Coding

Revision of Channel Coding

F I R Filter (Finite Impulse Response)

IIR Filter Design Chapter Intended Learning Outcomes: (i) Ability to design analog Butterworth filters

EE 422G - Signals and Systems Laboratory

Electrical & Computer Engineering Technology

2.1 BASIC CONCEPTS Basic Operations on Signals Time Shifting. Figure 2.2 Time shifting of a signal. Time Reversal.

Equalizers. Contents: IIR or FIR for audio filtering? Shelving equalizers Peak equalizers

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

Team proposals are due tomorrow at 6PM Homework 4 is due next thur. Proposal presentations are next mon in 1311EECS.

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING. Department of Signal Theory and Communications. c/ Gran Capitán s/n, Campus Nord, Edificio D5

University Ibn Tofail, B.P. 133, Kenitra, Morocco. University Moulay Ismail, B.P Meknes, Morocco

Design of infinite impulse response (IIR) bandpass filter structure using particle swarm optimization

SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS

Synthesis Algorithms and Validation

Frequency Response Analysis

REAL TIME DIGITAL SIGNAL PROCESSING

Image acquisition. Midterm Review. Digitization, line of image. Digitization, whole image. Geometric transformations. Interpolation 10/26/2016

Mark Analyzer. Mark Editor. Single Values

Cepstrum alanysis of speech signals

DIGITAL SIGNAL PROCESSING WITH VHDL

Theory of Telecommunications Networks

Chapter 2: Signal Representation

Singing Voice Detection. Applications of Music Processing. Singing Voice Detection. Singing Voice Detection. Singing Voice Detection

Transcription:

Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing and System Theory Slide 1

Contents of the Lecture Today: Repetition of linear prediction Properties of prediction filters Application examples Improving the convergence speed of adaptive filters Speech and speaker recognition Filter design Slide 2

Repetition Structure Consisting of an Prediction Filter and of an Inverse Prediction Filter Prediction filter Prediction error filter Prediction filter Inverse prediction error filter Slide 3

Repetition Design of a Prediction Filter Cost function: Minimizing the mean squared error Solution: Yule-Walker equation system Robust and efficient implementation: Levinson-Durbin recursion Slide 4

Repetition Levinson-Durbin Recursion Initialization: Predictor: Error power (optional): Recursion: PARCOR coefficient: Forward predictor: Backward predictor: Error power (optional): Termination: Numerical problems: If, use the coefficients of the previous step and stop the recursion. Final order reached: If has reached the desired order stop the recursion. Slide 5

Repetition Impact of a Prediction Error Filter in the Frequency Domain Part 1 Estimated power spectral densities Input signal (speech) Decorrelated signal (filter order = 16) Frequency in Hz Slide 6

Repetition Impact of a Prediction Error Filter in the Frequency Domain Part 2 Prediction filter Prediction error filter Power adjustment Prediction filter Inverse prediction error filter Inverse power adjustment Slide 7

Repetition Impact of a Prediction Error Filter in the Frequency Domain Part 3 Inverse prediction error filter (order = 1) Power adjusted filter Power spectral density of the input signal Inverse prediction error filter (order = 2) Power adjusted filter Power spectral density of the input signal Inverse prediction error filter (order = 4) Power adjusted filter Power spectral density of the input signal Inverse prediction error filter (order = 8) Power adjusted filter Power spectral density of the input signal Inverse prediction error filter (order = 16) Power adjusted filter Power spectral density of the input signal Frequency in Hz Inverse prediction error filter (order = 32) Power adjusted filter Power spectral density of the input signal Frequency in Hz Slide 8

Prediction Error Filter Properties Part 1 Minimization without restrictions (included in the filter structure) Cost function: The resulting filter has minimum phase: An FIR filter is computed with all its zeros within the unit circle. Signals can pass the filter with minimum delay. The inverse prediction filter is stable, since all zeros become poles and the zeros are located in the unit circle. Normalized filters are generated Part 1: Frequency response of the filter: Frequency response of the inverse: Slide 9

Prediction Error Filter Properties Part 2 Normalized filters are generated (true for the prediction filter as well as for the inverse filter) Part 2: Frequency response of the prediction filter: Frequency response of the inverse filter: Type of normalization: Slide 10

Prediction Error Filter Properties Part 3 Normalized filters are generated (true for the prediction filter as well as for the inverse filter) Part 3: Prediction error filter (FIR, filter order = 16) Inverse prediction error filter (IIR, filter order = 16) Frequency in Hz Slide 11

Inverse Prediction Error Filter Estimation of the Spectral Envelope Parametric estimation of the spectral envelope: Reducing the amount of parameters required to describe the specral envelope (compared to short-term spectrum) Independence of other signal properties (such as the pitch frequency) Short-term spectrum of a vowel Spectrum of the corresponding inverse prediction error filter Frequency in Hz Slide 12

Applications of Linear Prediction Part 1 Improving the Speed of Convergence of Adaptive Filters Slide 13

Improving the Speed of Convergence of Adaptive Filters Part 1 Simulation example: Excitation: colored noise (power spectral density [PSD] of the excitation is changed after 1000 samples) db Excitation Samples PSD (first 1000 samples) PSD (second 1000 samples) Distortion: white noise Monitoring the error power and the system distance Normalized frequency Normalized frequency Distortion Samples System distance db Error power Samples Slide 14

Improving the Speed of Convergence of Adaptive Filters Part 2 Time-invariant decorrelation: Prediction error filter Inverse prediction error filter Decorrelated signal domain Prediction error filter Slide 15

Improving the Speed of Convergence of Adaptive Filters Part 3 Simplified time-invariant decorrelation: The adaptive filter has to model the (unknown) system in series with the inverse prediction error filter (the convolution of both impulse responses) Wiener solution: Prediction error filter Decorrelated signal domain Slide 16

Improving the Speed of Convergence of Adaptive Filters Part 4 Time-variant decorrelation Every 10 to 50 ms the prediction filters are updated. With the update also the signal memory of the adaptive filters needs to be corrected. This can be realized in an efficient manner by using a so-called double-filter structure. Prediction error filter Prediction error filter Decorrelated signal domain Slide 17

System distance in db Applications of Linear Prediction Improving the Speed of Convergence of Adaptive Filters Part 5 Convergence runs (averaged over several simulations, speech was used as excitation): Without decorrelation Time-invariant dec. (1. order) Time invariant dec.(2. order) Time-variant dec. (10. order) Time-variant dec. (18. order) Time in seconds Slide 18

Application of Linear Prediction Part 2 Speech and Speaker Recognition Slide 19

Basics of Speaker Recognition Part 1 Basic Principle: To recognize a speaker, first features are extracted out of the signal, e.g. the spectral envelope. This is performed every 5 to 30 ms. After extracting the feature vector it is compared with all entries of a codebook and the entry with minimum distance is detected. This has to be done for several codebooks, each belonging to an individual speaker. For each codebook the minimum distances are accumulated. The accumulated minimum distances determine which speaker is the one with the largest likelihood. Models for known speakers are competing with universal models. Often the winning codebook is adapted according to the new features. Slide 20

Basics of Speaker Recognition Part 2 Codebook of the first speaker Best entry of the first codebook Current spectral envelope db Codebook of the second speaker Frequency Best entry of the second codebook Slide 21

Appropriate Cost Functions for Speech and Speaker Recognition Part 1 Requirements: An appropriate cost function should measure the perceived distance between spectral envelopes. Similar envelopes should result in a small distance, very different envelopes in a large one, and the distance of equal envelopes should be zero. The cost function should be invariant to different amplitude settings when recording the speech signal. The cost function should have low computational complexity. The cost function should mimic the human perception (e.g. having a logarithmic loudness scale). Ansatz: Cepstral distance Slide 22

Appropriate Cost Functions for Speech and Speaker Recognition Part 2 Ansatz: Cepstral distance Envelope 1 Envelope 2 Frequency in Hz Slide 23

Appropriate Cost Functions for Speech and Speaker Recognition Part 3 A well known alternative The (mean) squared error: Quadratic distance (squared error) Envelope 1 Envelope 2 Frequency in Hz Slide 24

Appropriate Cost Functions for Speech and Speaker Recognition Part 4 Cepstral distance: Parseval mit Slide 25

Appropriate Cost Functions for Speech and Speaker Recognition Part 5 Efficient transformation of prediction into cepstral coefficients: Definition Fourier transform for discrete signals and systems Replacing with (z-transform) Slide 26

Appropriate Cost Functions for Speech and Speaker Recognition Part 6 Efficient transformation of prediction into cepstral coefficients: Previous result Inserting the structure of an inverse prediction error filter Slide 27

Appropriate Cost Functions for Speech and Speaker Recognition Part 7 Efficient transformation of prediction into cepstral coefficients: Previous result Computing the coefficients with non-positive index: Using the following series: Inserting Slide 28

Appropriate Cost Functions for Speech and Speaker Recognition Part 8 Efficient transformation of prediction into cepstral coefficients: Computing the coefficients with non-positive index After inserting the result of the last slide we get: Thus, we obtain All coefficients with non-positive index are zero! Slide 29

Appropriate Cost Functions for Speech and Speaker Recognition Part 9 Efficient transformation of prediction into cepstral coefficients: Previous result Differentiation Multiplication of both sides with [ ] Slide 30

Appropriate Cost Functions for Speech and Speaker Recognition Part 10 Efficient transformation of prediction into cepstral coefficients: Previous result Comparing the coefficients for Comparing the coefficients for Slide 31

Appropriate Cost Functions for Speech and Speaker Recognition Part 11 Efficient transformation of prediction into cepstral coefficients: Recursive method with low complexity. The sum can be truncated after 3/2 N, since cepstral coefficients with a larger index usually do not contribute significantly to the result. Slide 32

Applications of Linear Prediction Part 3 Filter Design Slide 33

Filter Design Part 1 Specification of a tolerance scheme: Often a lowpass, bandpass, bandstop, or highpass filter is specified. The solution is computed iteratively (e.g. by means of programs such as Matlab). FIR or IIR filters can be designed. Linear plot Normalized frequency Logarithmic plot Magnitude response Ideal response Tolerance scheme Magnitude response Ideal response Tolerance scheme Normalized frequency Slide 34

Filter Design Part 2 but what to do, if e.g. a filter with arbitrary (known only at run-time) frequency response should be designed. db Frequency the filter should have either FIR or IIR structure (or a mix of both). a mininum-phase filter should be designed (minimum group delay). only limited computational power and memory are available for the design process. Slide 35

Filter Design for Prediction Filters Part 1 Autocorrelation function Levinson-Durbin recursion Power adjustment Inverse prediction error filter (IIR filter) Slide 36

Filter Design for Prediction Filters Part 2 Design desired magnitude frequency response (square afterwards to obtain power spectral density ) IDFT Autocorrelation function Levinson-Durbin recursion Power adjustment Inverse prediction error filter (IIR filter) Slide 37

Filter Design for Prediction Filters Part 3 Design desired magnitude frequency response (square afterwards to obtain power spectral density ) Robust inversion (avoid divisions by zero) IDFT Autocorrelation function Levinson-Durbin recursion Power adjustment Inverse prediction error filter (IIR filter) Comparison IDFT Autocorrelation function Levinson-Durbin recursion Power adjustment Prediction error filter (FIR filter) Filter type selection (FIR or IIR) Slide 38

Design Example Slide 39

Applications of Prediction-based Filter Design Part 1 Application examples: For adaptively adjusting limiters. For low-delay noise reduction filters. For frequency selective gain adjustment of the output of speech prompters and hands-free systems (loudspeaker output). Gain Shaping (frequency selective) Input signal Low order FIR filter Power normalization Output signal Power spectral density of the noise Power spectral density of the echo Computaion of the gain and the spectral shape Slide 40

Applications of Prediction-based Filter Design Part 2 Measurement: Binaural recording while acceleration of a car (left ear signal depicted). Intelligibility improvement Intelligibility improvement Details: B. Iser, G. Schmidt: Receive Side Processing in a Hands-Free Application, Proc. HSCMA, 2008 Slide 41

Adpative Filters Applications of Linear Prediction Summary and Outlook This week: Repetition of linear prediction Properties of prediction filters Application examples Improving the convergence speed of adaptive filters Speech and speaker recognition Filter design Slide 42