Signal Resampling Technique Combining Level Crossing and Auditory Features

Similar documents
Nonuniform multi level crossing for signal reconstruction

Weight functions for signal reconstruction based on level crossings

Digital Audio. Lecture-6

A New Class of Asynchronous Analog-to-Digital Converters Based on Time Quantization

Chapter 2: Digitization of Sound

Image Enhancement in Spatial Domain

10 Speech and Audio Signals

Compensation of Analog-to-Digital Converter Nonlinearities using Dither

CHAPTER 4. PULSE MODULATION Part 2

Time division multiplexing The block diagram for TDM is illustrated as shown in the figure

Asynchronous analog-to-digital converter based on level-crossing sampling scheme

EEE 309 Communication Theory

Communications IB Paper 6 Handout 3: Digitisation and Digital Signals

DIGITAL COMMUNICATION

Downloaded from 1

Pulse Code Modulation

Course 2: Channels 1 1

Performance Analysis of Cognitive Radio based on Cooperative Spectrum Sensing

Waveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two

Blind Blur Estimation Using Low Rank Approximation of Cepstrum

PULSE CODE MODULATION (PCM)

A Soft-Limiting Receiver Structure for Time-Hopping UWB in Multiple Access Interference

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

ECE/OPTI533 Digital Image Processing class notes 288 Dr. Robert A. Schowengerdt 2003

SIGNAL DETECTION IN NON-GAUSSIAN NOISE BY A KURTOSIS-BASED PROBABILITY DENSITY FUNCTION MODEL

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

NOISE ESTIMATION IN A SINGLE CHANNEL

UNEQUAL POWER ALLOCATION FOR JPEG TRANSMISSION OVER MIMO SYSTEMS. Muhammad F. Sabir, Robert W. Heath Jr. and Alan C. Bovik

ON SYMBOL TIMING RECOVERY IN ALL-DIGITAL RECEIVERS

Nyquist, Shannon and the information carrying capacity of signals

TIME encoding of a band-limited function,,

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

Mel Spectrum Analysis of Speech Recognition using Single Microphone

A Survey on Non-Uniform Sampling using Level Crossing Techniques

Audio Imputation Using the Non-negative Hidden Markov Model

Video, Image and Data Compression by using Discrete Anamorphic Stretch Transform

UNIT TEST I Digital Communication

EEE 309 Communication Theory

Compression and Image Formats

From Encoding Sound to Encoding Touch

Comparison of Spectral Analysis Methods for Automatic Speech Recognition

A Novel Technique or Blind Bandwidth Estimation of the Radio Communication Signal

Data Encoding g(p (part 2)

Chapter 4 Digital Transmission 4.1

Communications and Signals Processing

Fundamentals of Digital Communication

Optimized threshold calculation for blanking nonlinearity at OFDM receivers based on impulsive noise estimation

Communications Theory and Engineering

FYS3240 PC-based instrumentation and microcontrollers. Signal sampling. Spring 2017 Lecture #5

Pulse Code Modulation

1.Discuss the frequency domain techniques of image enhancement in detail.

I. INTRODUCTION. Fig. 1. Nonuniform sampling of input y(t) in a level-crossing ADC. The samples are shown as dots.

CHAPTER. delta-sigma modulators 1.0

Design of an Asynchronous 1 Bit Charge Sharing Digital to Analog Converter for a Level Crossing ADC

JPEG Image Transmission over Rayleigh Fading Channel with Unequal Error Protection

TELECOMMUNICATION SYSTEMS

SUCCESSIVE approximation register (SAR) analog-todigital

Noise Power Ratio for the GSPS

IMPROVING AUDIO WATERMARK DETECTION USING NOISE MODELLING AND TURBO CODING

Effects of Basis-mismatch in Compressive Sampling of Continuous Sinusoidal Signals

Wavelet Speech Enhancement based on the Teager Energy Operator

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

FIBER OPTICS. Prof. R.K. Shevgaonkar. Department of Electrical Engineering. Indian Institute of Technology, Bombay. Lecture: 22.

For the system to have the high accuracy needed for many measurements,

SIGNALS AND SYSTEMS LABORATORY 13: Digital Communication

Overview of Code Excited Linear Predictive Coder

Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback

Sampling and Pulse Code Modulation Chapter 6

Content Based Image Retrieval Using Color Histogram

EEE482F: Problem Set 1

Subband Analysis of Time Delay Estimation in STFT Domain

CODING TECHNIQUES FOR ANALOG SOURCES

Lecture 10 Performance of Communication System: Bit Error Rate (BER) EE4900/EE6720 Digital Communications

Lossless Huffman coding image compression implementation in spatial domain by using advanced enhancement techniques

Timing Error Analysis in Digital-to-Analog Converters

Comparing CSI and PCA in Amalgamation with JPEG for Spectral Image Compression

Lecture #6: Analog-to-Digital Converter

Telecommunication Electronics

Lab/Project Error Control Coding using LDPC Codes and HARQ

Performance of Combined Error Correction and Error Detection for very Short Block Length Codes

A spatial squeezing approach to ambisonic audio compression

Image Compression Using Huffman Coding Based On Histogram Information And Image Segmentation

Analog and Telecommunication Electronics

ELT Receiver Architectures and Signal Processing Fall Mandatory homework exercises


Audio Watermark Detection Improvement by Using Noise Modelling

EE390 Final Exam Fall Term 2002 Friday, December 13, 2002

(Refer Slide Time: 2:23)

Speech Coding in the Frequency Domain

IN A direct-sequence code-division multiple-access (DS-

Choosing the Best ADC Architecture for Your Application Part 3:

ENGINEERING FOR RURAL DEVELOPMENT Jelgava, EDUCATION METHODS OF ANALOGUE TO DIGITAL CONVERTERS TESTING AT FE CULS

Analog to Digital Conversion

SNR Estimation in Nakagami-m Fading With Diversity Combining and Its Application to Turbo Decoding

Compressive Through-focus Imaging

Audio /Video Signal Processing. Lecture 1, Organisation, A/D conversion, Sampling Gerald Schuller, TU Ilmenau

Optimum Power Allocation in Cooperative Networks

Analysis of Complex Modulated Carriers Using Statistical Methods

MULTILEVEL CODING (MLC) with multistage decoding

(Refer Slide Time: 01:45)

Transcription:

Signal Resampling Technique Combining Level Crossing and Auditory Features Nagesha and G Hemantha Kumar Dept of Studies in Computer Science, University of Mysore, Mysore - 570 006, India shan bk@yahoo.com Abstract. Level crossing based sampling might be used as an alternative to Nyquist theory based sampling of a signal. Level crossing based approach take advantage of statistical properties of the signal, providing cues to efficient nonuniform sampling. This paper presents new threshold level allocation schemes for level crossing based nonuniform sampling. Intuitively, it is more reasonable if the information rich regions of the signal are sampled finer and those with sparse information are sampled coarser. To achieve this objective, we proposed non-linear quantization functions which dynamically assign the number of quantization levels depending on the importance of the given amplitude range. Various aspects of proposed techniques are discussed and experimentally validated. Its efficacy is investigated by comparison with Nyquist based sampling. 1 Introduction The concept of level crossing sampling scheme has been proposed for analog to digital conversion [3]. Several case studies in analog to digital converters shows that level crossing based sampling technique can be more effective than existing synchronous Nyquist analog to digital converters (ADC). The level crossing sampling scheme has also demonstrated for speech applications using CMOS technology and a voltage mode approach for the analog parts of the converter [4]. Electrical simulations proved that the Figure of Merit of asynchronous level crossing converters increased compared to synchronous Nyquist ADCs. Level crossing sampling scheme has been suggested in literature for bursty signals, non-bandlimited signal[5] and band limited Gaussian random processes[6]. Conventional Nyquist based signal sampling is with uniform time-step variable amplitude. The new class of nonuniform sampling approach has been used to improve the performance of Nyquist ADCs exploiting the inner statistical properties of the signal. Several signals has interesting statistical properties, but Nyquist based sampling do not take the advantage of them. Signals such as electro cardiograms, speech signals, temperature sensors, pressure sensors are almost always constant and may vary significantly during brief moments. Due to Nyquist theory, which is to ensure the sampling to be at least twice that of the Corresponding author: 0821-2510789. A. Ghosh, R.K. De, and S.K. Pal (Eds.): PReMI 2007, LNCS 4815, pp. 447 454, 2007. c Springer-Verlag Berlin Heidelberg 2007

448 Nagesha and G.H. Kumar input signal frequency bandwidth, it is obvious that, in the time domain, this condition may result in a large number of samples with redundant information. It has been proved in [1,4] that level crossing sampling approach can lead to reduction in number of samples. Level crossing scheme sends a pulse whenever the source signal crosses a threshold level. The threshold levels are uniformly distributed in the amplitude range [1,3,4,5]. The uniform threshold step size used in the level crossing regardless of the signal amplitude characteristics. The linear threshold allocation will result in a higher SNR at the region of higher amplitude than the region of lower amplitude. Hence, increased SNR at the higher signal amplitude does not increase the perceived audio quality because humans are most sensitive to lower amplitude components. To exploit this factor, a nonuniform threshold level allocation schemes based on Incomplete Beta Function (IBF), logarithmic and linear function are proposed in this paper. The parameters of the IBF are chosen in such a way that it focuses on sensitive lower amplitude regions. The paper is organized as follows. Section 2 describes the level crossing based sampling approach with the proposed nonuniform threshold allocation scheme. Section 3 discusses the incorporation of IBF, logarithmic and linear functions to formulate a rule for allocation of nonuniform threshold levels in multilevel crossing. Section 4 explains the experimentation setup for testing the proposed approach and gives the results and analysis. Section 5 is devoted to major conclusions as well as gives directions for future lines of work. 2 Irregular Sampling Model Level Crossing Analysis represents an approach to interpretation and characterization of time signals by relating frequency and amplitude information. Measurement of level crossing of a signal is defined as the crossings of a threshold level l by consecutive samples. The level sampling with a level allocation function f(s) is the mapping L f(s) : R f(s)z : L f(s) = f(s) s/f(f(s)) where L =(l 1,l 2,,l N ) set of nonuniform spaced levels. Since the quantization levels are irregularly spaced across the amplitude range of the signal, it increases the efficiency of bit usage. The spacing of the levels is decided by the importance of the amplitude segments which is discussed in section 3. A sample is recorded when the input signal crosses one of the nonuniformly spaced levels. The precession of time of the recorded sample is decided by the local timer T. Let L =(l 1,l 2,...,l N ) be the set of nonuniform spaced levels and 2 b 1=N quantization levels with b bit resolution. Definition 2.1 : The level crossing of the threshold level l L by signal s(t) with period T is given by [ ][ [ ] ] i 1 i 1 L f(s) (s, I ni )=l iff s n T s n T l < 0 (1)

Signal Resampling Technique Combining Level Crossing 449 where n subintervals are defined by I ni = [ i 1 n T, i n T ],i =1, 2,,n.Ifasample is recorded and transmitted every time a level crossing occurs, the encoding procedure is called asynchronous delta modulation[2]. 3 Rules for Irregular Sampling Determining the threshold levels is very important as it has a huge impact on the performance of coding. Unfortunately there is no theory available to determine those values. The uniform threshold levels are not the efficient coding of the levels because they do not take advantage of the statistical properties of the signal. The present study considers nonuniform threshold level allocation scheme depending on the signal amplitude characteristics and rules which controls the importance of amplitude regions. As a result, signals with lesser activity in higher amplitude regions compared to the lower amplitude regions, will have less number of levels at higher amplitude region. In this section, IBF, logarithmic and linear function rules are analyzed. 3.1 Incomplete Beta Function The lack of data to decide the exact number of levels for a given amplitude range, creates problems concerning the selection of number of levels. In such cases, an expert will have to assume the levels. For this reason, the flexible incomplete beta distribution, capable of attaining a variety of shapes could be used in Level Crossing applications. Because of its extreme flexibility, the distribution appears ideally suited for the computation of number of levels for a specific amplitude region of a signal. The Incomplete beta function I(z,α,β) is defined by [7]: I(z,α,β) 1 B(α, β) z 0 u α 1 (1 u) β 1 du α > 0,β >0, 0 z 1 (2) Eq.2 has the limiting values I 0 (α, β) =0,I 1 (α, β) = 1. The shape of the incomplete beta function obtained from Eq.2 depends on the choice of its two parameters α and β. Estimating these parameters is a challenge since these parameters control the number of levels for a given amplitude range of a speech signal along with the signal probability density function (PDF). We have empirically chosen the values of α and β such that IBF gives more weight to amplitude regions of importance. 3.2 Linear Function Although human auditory perception is certainly does not use a linear function, this group of mapping methods renders acceptable results for a wide range of applications. Its strength is its simplicity and speed. The linear function is defined by linear(n) =n (3)

450 Nagesha and G.H. Kumar However, computing the importance of amplitude regions in linear scale is not merely a mater of mathematical convenience. There is more compelling, physical consideration to be taken into account, related to the importance of amplitude regions. Natural worst case representation for characterization of physical systems is linear. This means that all amplitude regions become equally important. 3.3 Logarithmic Function A logarithmic of a number x in base b is a number n such that x = b n,where the value b must be neither 0 nor a root of 1. It is usually written as log b (x) =n (4) When x and b are further restricted to positive real numbers, the logarithm is a unique real number. Representation of importance of amplitude on a logarithmic scale can be helpful when the importance of regions varies slowly. Logarithmic rule assigns less number of levels to the corner amplitude regions and more levels are assigned logarithmically in important amplitude regions. The center amplitude regions (near zero amplitude regions) are considered to be important amplitude regions. This issue, however is not whether to accept or reject logarithmic rule but to appreciate where it fits in, and where it does not. 3.4 Level Estimation In a deterministic environment, the accuracy of the signal reconstruction depends on several parameters such as positioning of the levels, total number of levels, statistical properties of the signal etc. Specifically, for a given signal we analyze the structural behavior by estimating its PDF. The signal histogram is approximated to obtain the signal PDF p(x). Now consider a signal with PDF p(x) and level allocation rule R(x). Let N be the total number of levels. The locations of N levels are estimated by L(x) =p(x) R(x) (5) The symbol represents linear filtering. L(x) gives the probability of distribution of levels and it guides the distribution of N levels over the amplitude range. As expected, the spacing of N levels are not uniform they are nonuniformly spaced over the amplitude range. Each level can be represented with log 2 (N) bits. Hence only amplitude regions with high activity will be allocated more number of levels using the rule R(x) and signal amplitude PDF. 4 Experimental Evaluation In this section, the performance of the proposed approach is evaluated for speech signals. We have run simulations for the level crossing based sampling of speech

Signal Resampling Technique Combining Level Crossing 451 Fig. 1. Experimental results for IBF (a) Histogram bin versus SNR. (b) Quantization level versus ratio (c) Quantization level versus SNR.(d) SNR versus Compression ratio. signals from TIMIT database. The TIMIT speech signals are sampled by 16 KHz sampling rate and each sample size is 16 bit. The PDF of the speech signal is estimated by computing the amplitude histogram of the signal with 100 bins. The total numbers of quantization levels needed to sample the given signal are set to 16, 32, 64 and 128. The accuracy of distribution of the levels computed from Eq.5, also depends on the number of bins used to compute convoluted PDF of the signal. The levels are estimated for 20, 40, 60, 80, 100 bins for comparison and analysis. We evaluated the system with proposed rules IBF, logarithmic and linear functions. By comparing IBF, logarithmic and linear rule results, we analyze the performances. SNR of the resampled signal generally improves as the bins increase for all the levels. The IBF rule gives high SNR consistently compared to the logarithmic and linear rule at all bins. The performance of logarithmic rule is slightly less than that of IBF rule for all the levels (Fig.1(a) and Fig.3(a)). The best performance is observed for IBF rule with 128 levels. In case of linear rule, 1 db drop in SNR is observed compared to IBF for 128 levels. Similarly, higher SNR is achieved for IBF and logarithmic rule compared to linear rule at all levels (Fig.1(a), Fig.2(a) and Fig.3(a)).

452 Nagesha and G.H. Kumar Fig. 2. Experimental results for linear function (a) Histogram bin versus SNR. (b) Quantization level versus ratio (c) Quantization level versus SNR. (d) SNR versus Compression ratio. The comparison of compression ratio at various levels for the three rules is shown in Fig.1(b), Fig.2(b) and Fig.3(b). We observe that logarithmic rule slightly outperforms IBF rule. The logarithmic rule gives higher SNR for lesser levels and the ratio decreases as the levels are increased. Linear rule results in low compression ratio for all levels. For higher levels all the rules give similar results. The results of SNR versus levels (Fig.1(c), Fig.2(c) and Fig.3(c)) show that IBF and logarithmic rule performance is superior to linear rule at all levels. It is observed that IBF and logarithmic rules produce almost similar performance results. Minimum SNR for 16 levels is near 3 db in IBF and logarithmic rule whereas minimum SNR in linear rule for 16 levels is 0.4 db. Fig.1(d), Fig.2(d) and Fig.3(d) shows the plot of SNR versus compression ratio. The characteristic curve appears to be concave for IBF rule, linear for logarithmic rule and convex for linear rule. Performance of IBF and logarithmic rules are considerably better than linear rule, with higher SNR for higher compression ratio, which is nonetheless better performance. Comparison of IBF, logarithmic and linear rule shows that, the IBF rule and logarithmic outperforms linear rule. Also, performances of IBF slightly outperforms logarithmic rule in compression ratio and SNR. The behavioral patterns of IBF, logarithmic and linear rule appear to be similar except in SNR versus ratio analysis. IBF rule is based on the auditory properties of the humans. IBF rule distributes more levels in the critical amplitude regions. Similar to IBF, logarithmic rule also considers that near zero amplitude regions are important than the corner amplitude regions. The priority

Signal Resampling Technique Combining Level Crossing 453 Fig. 3. Experimental results for logarithmic function (a) Histogram bin versus SNR. (b) Quantization level versus ratio (c) Quantization level versus SNR. (d) SNR versus Compression ratio. of the amplitude regions varies logarithmically from corner amplitude regions to near zero value amplitude regions. Linear rule considers that each amplitude region is equally important. Hence, the SNR of the resampled signal remains consistently superior to linear rule. Lack of levels at critical amplitude regions of the signal decreases the SNR of the resampled signal. The performance of the proposed approaches is fairly consistent with that of Sayiner[3]. This experimental analysis illustrates that signal with special statistical behavior such as speech, medical signals are unsuitable for uniform sampling. These types of signals can be more efficiently sampled using a level crossing scheme. 5 Conclusion In summary, this paper presents new threshold level allocation schemes for level crossing based nonuniform sampling which dynamically assigns the number of quantization levels depending on the importance of the given amplitude range of the input signal. Proposed methods take the advantage of statistical properties of the signal and allocate the nonuniformly spaced quantization levels across the amplitude range. The proposed level allocation scheme for nonuniform sampling based on level crossing may motivate directed attempts to augment traditional

454 Nagesha and G.H. Kumar methods that will improve their ability. Overall, these results motivate continued work on level crossing based nonuniform sampling for improving sampling performance and analysis the signals as a whole. Acknowledgement The authors profoundly thank Mr.Noushath and Mr.Aradhya, Research Scholars, University of Mysore, for helping them to typeset the manuscript in L A TEX. References 1. Mitchell, R.J., Gonzalez, R.C.: Multilevel crossing rates for automated signal classification. In: Proc. of ICASSP 1978, vol. 3, pp. 218 222 (April 1978) 2. Inose, H., Aoki, T., Wantanable, K.: Asynchronous delta modulation systems, Electron. Commun., Japan, pp. 34 42 (March 1966) 3. Sayiner, N., Sorensen, H.V., Viswanathan, T.R.: A level crossing sampling scheme for A/D conversion. IEEE Transactions on Circuits and Systems II 43, 335 339 (1996) 4. Allier, E., Sicard, G., Fesquet, L., Renaudin, M.: Asynchronous level crossing analog to digital converters. Measurement Journal 37, 296 309 (2005) 5. Guan, K., Singer, A.C.: A Level Crossing Sampling Scheme for Non-Bandlimited Signals. In: Proc. of ICASSP 2006, vol. 3, pp. III-381-383 (May 2006) 6. Miskowicz, M.: Efficiency of Level-Crossing Sampling for Bandlimited Gaussian Random Processes. In: Proc. of IEEE International Workshop on Factory Communication Systems-2006, pp. 137 142 (June 2006) 7. Press, W.H., Teukolsky, S.A., William, T.V., Brian, P.F.: Numerical Recipes in C++, 2nd edn. Cambridge university press, Cambridge (2002)