ME 535 Course Project: Analysis of Audio Using PCA
|
|
- Martina Marshall
- 5 years ago
- Views:
Transcription
1 ME 535 Course Project: Analysis of Audio Using PCA SAMEER MESHRAM (M.S.), VARUN RUPCHANDANI (M.S.), VIDUR VIJ (M.S.), June 4, 2018 EXECUTIVE SUMMARY: In this project we perform audio separation through of mixed signal through implementation of Variational Mode Decomposition and Principal Component Analysis (PCA). It is seen that mixture of pure sinusoids of different frequencies is separated very efficiently while that of instruments - cymbals and bass drums are separated satisfactorily. Comparison of original and unmixed (separated) signal is made through various metrics such as correlation coefficient, percentage error, root mean square error, etc. and visually by through of spectrographs. A brief overview of audio acquisition, representation and storage is also provided.
2 1. INTRODUCTION The aim of this report is to provide the reader with a basic understanding of a algorithm used for single channel blind source separation (SCBSS) using variational mode decomposition (VMD) and principal component analysis (PCA). A brief description of audio acquisition, representation and storage is also included in the present section. Section 2 describes the proposed method, results are displayed in section 3 and discussion upon the work done followed by concluding remarks are presented in section 4. The program for algorithm implementation can be found in Appendix. 1A] AUDIO ACQUISITION, REPRESENTATION and STORAGE Audio acquisition is the process of converting the physical phenomenon we call sound into a format suitable for digital processing, the representation is the problem of extracting from the sound information necessary to perform a specific task, and the storage is the task of reducing the number of bits necessary to encode the acoustic signals [1]. A typical sound wave can be represented by: s(t) = A sin(2πft + ) = A sin ( 2π T t + ) where A is called the amplitude and it represents the maximum distance from equilibrium position, φ is the phase, T is called period which is the time interval length between two instances of s(t). Also, f = 1 is the frequency measured in Hz. T The only sounds that are used in audio applications are those that can be perceived by human beings, the intensities can be measured through the ratio I I 0 to the threshold of hearing (THO) I 0, i.e., the minimum intensity that can be detected by human ears. However, this is problematic as the value of I 0 corresponds to W/m 2, while the maximum value of I that can be tolerated without permanent physiological damages is I max = 10 3 W/m 2. The ratio I I 0 can thus range across 15 orders of magnitude and this makes it difficult to manage different intensity values. Hence, the ratio I I 0 is measured using the decibel (db) scale: I = 10 log 10 ( I I o ) where I is the intensity that is measured in db. a] Sampling In the process of sampling, the displacement of measurement membrane is measured at regular time steps. The number F of measurements per seconds is called sampling frequency or sampling rate and
3 the time interval T c = 1 between two consecutive measurements is called sampling period. The F relationship between the analog signal s(t) and the sampled signal s[n] is as follows: s[n] = s(nt c ) A common rule of thumb when deciding upon the sampling frequency can be ascertained by the Nyquist or Shannon frequency, given as F > 2f max, where f max is the highest frequency represented in the sound signal. The following table shows common sampling rates along with bandwidth: Sound Quality Required Bandwidth Sampling Rate Telephone 200 Hz to 3.4 khz 8000 Hz CD Audio 5 Hz to 20 khz Hz DAT Audio 5 Hz to 20 khz Hz DVD Audio 0 Hz to 96 khz Hz DVD Audio with stereomixes 0 Hz to 96 khz Hz Table 1: Common sampling rates and bandwidths b] Audio Representation Since sound is longitudinal pressure wave it consists of continuous values as opposed to digitized ones. Since most signal processing is handled by digital devices it is necessary to digitize sound. Digitization implies conversion to a stream of numbers, preferably integers for increased processing efficiency [2]. Sampling in the amplitude or voltage dimension is called quantization. The most straightforward method to perform such a task is known as pulse code modulation (PCM). PCM involves three steps viz. sampling, quantization (Linear or non-linear) and coding (A-Law or μ-law). The figure below can be used to conceptualize sampling and quantization: Figure 1: Left: Sampling the analog signal in time; Right: Quantization is sampling the analog signal in amplitude dimension [2]
4 c] Audio Storage The number of bits (B) used to represent audio samples has an important role in storage and transmission. The higher the number of bits, the larger is the memory space needed to store a recording and transmit through a channel. The number of bits per unit time necessary to represent a signal is called bit-rate. The area dealing with reducing bit-rate while at the same time preserving quality of the signal is called audio encoding. The primary encoding methods result in audio formats such as MPEG, WAV, mp3, etc. Audio Format Sampling Rate Bit Rate MPEG-1 32 to 48 khz 32 to 384 kbits/s MPEG-2 16 to 24 khz 8 to 144 kbits/s WAV 8 to 48 khz 4.8 to 176 kbits/s Mp3 8 to 44 khz 8 to 320 kbits/s Table 2: Common audio formats with sampling and bit rate 1B] BLIND SOURCE SEPARATION Blind source separation (BSS) is the extraction of single sound sources that comprise a mixture of signals and as such has several applications in bio-medical signal processing, speech processing, speech recognition, communications, etc. The independent source signals are usually multiplied with different weights (mixing matrix) and the summed mixture is recorded. The source signals are expected to be recovered only from the available mixture of the signal with no other background knowledge. This is often compared to the Cocktail Party Effect which is the ability of human brain to focus on specific human voice, while filtering out other voices or backgrounds [3,4]. In this particular text we will concentrate on single channel blind source separation (SCBSS). a] A Brief Review of Blind Source Separation Paradigms Several methods have been proposed for SCBSS in literature. In [5], single channel-independent component analysis (SCICA) has been used to separate the sources. A new method based on wavelet transform along with FastICA has been used in [6]. In [7], the technique of ensemble empirical mode decomposition (EEMD) has been used to decompose a single observation into a number of intrinsic mode functions (IMFs). Principle component analysis (PCA) was used to select the required number of IMFs to select the required number of IMFs to be given as input to ICA algorithm. A learning framework has been applied to BSS in [8], which treats BSS as a generalized eigenvalue problem. Our project deals with SCBSS employing variational mode decomposition (VMD) and PCA. b] Variational Mode Decomposition VMD decomposes a signal into a number of modes concurrently and non-recursively such that the combination of all the modes reconstruct the original signal. It also searches for the center frequency of the modes and each mode is band-limited about this center frequency. It decomposes a real valued signal into a finite number of modes simultaneously. These modes have specific sparsity
5 properties. The center frequency is also determined along with the modes. Each of the modes are compact around its center frequency. The bandwidth of each mode and reconstruction fidelity are controlled by different parameters. Details of VMD implementation can be found in section 2. c] Principal Component Analysis PCA identifies the dynamics and behavior of a system from a seemingly complex and incoherent set of data. Even without any prior knowledge of governing systems we can produce low level reductions. This procedure is called the Principal Component Analysis (PCA), Proper Mode Decomposition (POD) or the Karhunen-Loeve Decomposition. PCA can be done by eigenvalue decomposition of a data covariance matrix or singular value decomposition of a data matrix, usually after mean centering and normalizing the data matrix for each attribute. Figure 2: Representation of principal component analysis (PCA) If we imagine a random set of n-dimensional data, then PCA can be visualized as transforming that data into a n-dimensional ellipsoid with its axes serving as each of the principal components. When the axis of the ellipsoid is small, then the variance along that axis is also small, and by omitting that axis and its corresponding principal component from our representation of the dataset, we lose only a small amount of information. Thus, we can achieve reduction and identification of data from PCA.
6 2. PROPOSED METHOD In this work, a mixture of two sources x 1 (t) and x 2 (t) has been considered. The single channel containing the mixture of these two sources is given by: x(t) = a 1 x 1 (t) + a 2 x 2 (t) + n(t) Here, a 1 and a 2 are weighting coefficients and n(t) is any noise that may be introduced in the system. Our objective is to extract x 1 (t) and x 2 (t). The proposed method consists of applying VMD on x(t).and thus decomposing it into n number of modes. Each mode belongs to a different spectral band. If the signal x(t) is a mixture of source signals belonging to different spectral bands, then they are captured by the different modes in VMD. Not only that, the outband high frequency noise gets captured in other modes. If the noisy modes are rejected, then the remaining modes are the extracted source signals. For selecting the required signals modes PCA is used. PCA transforms a set of observed correlated vectors to set of uncorrelated vectors. The transformation is as follows: y = A T x where, A is known as orthogonal matrix, x is the set of correlated vectors and y is the set of uncorrelated vectors. The elements of the vector set are called the principal components. These principal components of y are maximized under the constraint that their principal component vectors are orthogonal to each other. Only the first m principal components (where m is the number of source signals), corresponding to the highest eigenvalues are selected. The eigenvalues corresponding to the modes containing noise would be much less as compared to the ones containing the source signals. Thus, PCA helps to select the m modes which directly gives the source components. Algorithm for the proposed method 1. Input observed signal x(t) 2. Apply VMD on x(t) to get n modes : M 1, M 2, - up to M n 3. Apply PCA on M 1 to M n to select m principal components : P 1, P 2,- up to P m 4. Extracted sources : P 1, P 2,- up to P m Table 3: Algorithm of the proposed method 2A] SIGNAL CONSTRUCTION a] Pure Sinusoids We first use pure sinusoids of different frequencies to test the accuracy of the proposed method. In this implementation x 1 (t) = sin (3t) and x 2 (t) = sin (7t). The below figure shows the individual
7 signals and their mixture. Figure 3: Pure sinusoids and their mixture b] Instruments - Cymbal and Bass Drum Signals for Cymbals and Bass are shown below. These instruments are chosen as they operate in nonoverlapping frequency range. Cymbals operate in the range 3 to 5 khz and bass drums between 60 to 100 Hz. Figure 4: Sound signal for cymbal, bass drum and their mixture
8 2B] ALGORITHM IMPLEMENTATION Variational mode decomposition is applied on the single channel mixed signal x(t) to decompose it into K band separated modes. For each type of signal K is kept 2 as it is believed to a combination of two dominant modes. The function VMD() contributed by [9,10] is used for this purpose. Principal components of the K mode is found out by implementing inbuilt Matlab function pca(). The signal reconstruction can be obtained by using the following equation: PCA reconstruction = PC scores Eigenvectors T + Mean Results obtained from the implementation of the above algorithm are summarized in section 3.
9 3. RESULTS The algorithm as discussed in section 2 is implemented with the following results. a] Separation of Pure Sinusoids The following figure are obtained for separation of pure sinusoidal mixed signals. Data is shown in both in terms of amplitude and frequency (spectrogram) versus time for comparison. Figure 5: Separated and original sinusoidal signal 1. Figure 6: Spectrogram of separated and original sinusoidal signal 1.
10 The following metrics were calculated to determine the efficiency of the separation algorithm: For Signal 1 Correlation coefficient Percentage error Root mean square error Table 4: Metrics for separated and original signal 1 Figure 7: Separated and original sinusoidal signal 2. Figure 8: Spectrogram of original and separated sinusoidal signal 2.
11 The following metrics were calculated to determine the efficiency of the separation algorithm: For Signal 2 Correlation coefficient Percentage error Root mean square error Table 5: Metrics for separated and original signal 2 b] Separation of Cymbal and Bass Drum The following figure are obtained for separation of pure sinusoidal mixed signals. Data is shown in both in terms of amplitude and frequency (spectrogram) versus time for comparison. Figure 9: Separated and original cymbal
12 Figure 10: Spectrogram of original and separated cymbal The following metrics were calculated to determine the efficiency of the separation algorithm: For Cymbal Correlation coefficient Percentage error Root mean square error Table 6: Metrics for separated and original Cymbal signal Figure 11: Separated and original bass drum
13 Figure 12: Spectrogram of original and separated bass drum The following metrics were calculated to determine the efficiency of the separation algorithm: For Bass Drum Correlation coefficient Percentage error Root mean square error Table 7: Metrics for separated and original Bass Drum signal 4. DICSUSSION and CONCLUSION As we can see for the results section 3 pure sinusoids are separated very efficiently by the mentioned algorithm. This behavior is anticipated as they are created through addition of pure sinusoids comprising of one dominant frequency each. However, the separation of Cymbals and Bass Drum is not very efficient. This can be attributed to the presence of several frequency modes and noise that might be introduced during recording of Cymbals and Bass Drum. The algorithm thus fairly performs separation and has scope for improvement.
14 REFERENCES [1] F. Camastra, A. Vinciarelli, Machine Learning for Audio, Image and Video Analysis, Advanced Information and Knowledge Processing, Springer-Verlag London 2015, Accessed: 5/22/2018 4:00 PM. [2] Z-N. Li, M. Drew, J. Liu, Fundamentals of Multimedia, Prentice Hall [3] P. Dey, U. Satija, B. Ramakumar, Single Channel Blind Source Separation Based on Variational Mode Decomposition and PCA, IEEE INDICON 2015 [4] Deep Learning Machine Solves the Cocktail Party Problem, Online: Accessed: 5/25/2018 3:28 PM [5] M. E. Davies and C. J. James, Source separation using single channel ICA, Signal Process [6] M-G Lopez, H. Lozano, L.P. Sanchez and L.N.O. Moreno, Blind Source Separation of audio signals using independent component analysis and wavelets, Int. Conf. Electr. Commun. Comput., [7] Y. Guo, S. Huang, Y. Li, Single-Mixture source separation using dimensionality reduction of ensemble empirical mode decomposition and independent component analysis, Circuits Syst. Signal Process., [8] H. Liu, Y. Cheung, A learning framework for blind source separation using generalized eigenvalues, Lecture Notes in Comput. Sci. Vol., [9] D. Zosso, Variational Mode Decomposition, Online: Accessed: 5/26/2018 5:18 PM [10] K. Dragomiretskiy, D. Zosso, Variational Mode Decomposition, IEEE Trans. Signal Processing, 2014
15 APPENDIX % ME 535 Project % Analysis of Audio using PCA clear all;close all, clc t = linspace(0, 6000, 100); fs = 1; %44100 s1 = sin(3*t)'; %Trying to seperate this and s2 %[s,fs] = audioread('cymbal_recording_clip.mp3'); %s_2 = s(:,1); s2 = sin(7*t)'; %s_2(3000:3999)'; mix = s1'+ s2'%0.3*s1'+ 0.7*s2'; figure(1) subplot(3,1,1) plot(s1,'-b'); title('signal 1: x_1 (t)') subplot(3,1,2) plot(s2,'-b'); title('signal 2: x_2 (t)') ylabel('amplitude'); subplot(3,1,3) plot(mix,'-b'); title('source Mixture: x(t) = 0.3x_1 (t) + 0.7*x_2 (t)'); xlabel('time (t)'); Implementation of VMD alpha = 2000; tau = 0; K = 2; DC = 0; init = 1; tol = 1e-7; % moderate bandwidth constraint % noise-tolerance (no strict fidelity enforcement) % K modes % no DC part imposed % initialize omegas uniformly [u, u_hat, omega] = VMD(mix, alpha, tau, K, DC, init, tol) % u - the collection of decomposed modes % u_hat - spectra of the modes % omega - estimated mode center-frequencies figure(2) plot(u); title('u') figure(3) plot(u_hat);title('u-hat') figure(4) plot(omega); title('omega')
16 Implementation of PCA and Signal Reconstruction close all; [coeff, score, latent, tsquared, explained, mu] = pca(u); %coeff = pca(u') recon = score*coeff'+repmat(mu,k,1) figure(5) plot(s1,'-b', 'Linewidth',1.1) hold on;xlabel('') plot (recon(1,:),'--r', 'Linewidth',1.5); xlabel('time (t)');ylabel('amplitude'); title('signal 1') legend('original', 'Separated') figure(6) plot(s2,'-b', 'Linewidth',1.1) hold on; plot (recon(2,:),'--r', 'Linewidth',1.5); xlabel('time (t)');ylabel('amplitude'); title('signal 2') legend('original', 'Separated') Spectrogram of Original Signal (s1) figure(7) spectrogram(s1, 4, 3/4*4, [], fs, 'yaxis') box on set(gca, 'FontName', 'Times New Roman', 'FontSize', 14) xlabel('time, s') ylabel('frequency, Hz') title('spectrogram of sin(3t)-original') h = colorbar; set(h, 'FontName', 'Times New Roman', 'FontSize', 14) ylabel(h, 'Magnitude, db') Spectrogram of Reconstructed Signal (s1) figure(8) spectrogram(recon(1,:), 1024, 3/4*1024, [], fs, 'yaxis') box on set(gca, 'FontName', 'Times New Roman', 'FontSize', 14) xlabel('time, s') ylabel('frequency, Hz') title('spectrogram of sin(3t)-unmixed') h = colorbar; set(h, 'FontName', 'Times New Roman', 'FontSize', 14) ylabel(h, 'Magnitude, db')
17 Spectrogram of Original Signal (s2) figure(9) spectrogram(s2, 1024, 3/4*1024, [], fs, 'yaxis') %1024 or 4 box on set(gca, 'FontName', 'Times New Roman', 'FontSize', 14) xlabel('time, s') ylabel('frequency, Hz') title('spectrogram of sin(7t)-original') h = colorbar; set(h, 'FontName', 'Times New Roman', 'FontSize', 14) ylabel(h, 'Magnitude, db') Spectrogram of Unmixed Signal (s2) figure(10) spectrogram(recon(2,:), 1024, 3/4*1024, [], fs, 'yaxis') box on set(gca, 'FontName', 'Times New Roman', 'FontSize', 14) xlabel('time, s') ylabel('frequency, Hz') title('spectrogram of sin(7t)-unmixed') h = colorbar; set(h, 'FontName', 'Times New Roman', 'FontSize', 14) ylabel(h, 'Magnitude, db') Metrics for comparison R_1 = corrcoef(s1,recon(1,:)) R_2 = corrcoef(s2,recon(2,:)) Norm_s1 = norm(s1-recon(1,:)); per_err = 100*((spectrogram(s1)-spectrogram(recon(1,:))))./spectrogram(s1); per_err_abs = abs(per_err); per_err_vector = per_err_abs(:) %RMS error signal RMSE = sqrt(mean((s1 - recon(1,:)).^2)) %RMS error spectrogram RMSE = mean(abs(sqrt(mean((spectrogram(s1) - spectrogram(recon(1,:))).^2)))) %mean_percentage_error = mean(per_err_vector) Published with MATLAB R2017b
Drum Transcription Based on Independent Subspace Analysis
Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford,
More informationEnhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis
Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins
More informationECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2
ECE 556 BASICS OF DIGITAL SPEECH PROCESSING Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2 Analog Sound to Digital Sound Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre
More informationAuditory modelling for speech processing in the perceptual domain
ANZIAM J. 45 (E) ppc964 C980, 2004 C964 Auditory modelling for speech processing in the perceptual domain L. Lin E. Ambikairajah W. H. Holmes (Received 8 August 2003; revised 28 January 2004) Abstract
More informationIntroduction of Audio and Music
1 Introduction of Audio and Music Wei-Ta Chu 2009/12/3 Outline 2 Introduction of Audio Signals Introduction of Music 3 Introduction of Audio Signals Wei-Ta Chu 2009/12/3 Li and Drew, Fundamentals of Multimedia,
More informationSUB-BAND INDEPENDENT SUBSPACE ANALYSIS FOR DRUM TRANSCRIPTION. Derry FitzGerald, Eugene Coyle
SUB-BAND INDEPENDEN SUBSPACE ANALYSIS FOR DRUM RANSCRIPION Derry FitzGerald, Eugene Coyle D.I.., Rathmines Rd, Dublin, Ireland derryfitzgerald@dit.ie eugene.coyle@dit.ie Bob Lawlor Department of Electronic
More informationCSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued
CSCD 433 Network Programming Fall 2016 Lecture 5 Physical Layer Continued 1 Topics Definitions Analog Transmission of Digital Data Digital Transmission of Analog Data Multiplexing 2 Different Types of
More informationSOME PHYSICAL LAYER ISSUES. Lecture Notes 2A
SOME PHYSICAL LAYER ISSUES Lecture Notes 2A Delays in networks Propagation time or propagation delay, t prop Time required for a signal or waveform to propagate (or move) from one point to another point.
More informationMultiple Input Multiple Output (MIMO) Operation Principles
Afriyie Abraham Kwabena Multiple Input Multiple Output (MIMO) Operation Principles Helsinki Metropolia University of Applied Sciences Bachlor of Engineering Information Technology Thesis June 0 Abstract
More informationEnsemble Empirical Mode Decomposition: An adaptive method for noise reduction
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 5, Issue 5 (Mar. - Apr. 213), PP 6-65 Ensemble Empirical Mode Decomposition: An adaptive
More informationReview of Lecture 2. Data and Signals - Theoretical Concepts. Review of Lecture 2. Review of Lecture 2. Review of Lecture 2. Review of Lecture 2
Data and Signals - Theoretical Concepts! What are the major functions of the network access layer? Reference: Chapter 3 - Stallings Chapter 3 - Forouzan Study Guide 3 1 2! What are the major functions
More informationSpeech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm
International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,
More informationMichael F. Toner, et. al.. "Distortion Measurement." Copyright 2000 CRC Press LLC. <
Michael F. Toner, et. al.. "Distortion Measurement." Copyright CRC Press LLC. . Distortion Measurement Michael F. Toner Nortel Networks Gordon W. Roberts McGill University 53.1
More informationTerminology (1) Chapter 3. Terminology (3) Terminology (2) Transmitter Receiver Medium. Data Transmission. Direct link. Point-to-point.
Terminology (1) Chapter 3 Data Transmission Transmitter Receiver Medium Guided medium e.g. twisted pair, optical fiber Unguided medium e.g. air, water, vacuum Spring 2012 03-1 Spring 2012 03-2 Terminology
More informationCommunications IB Paper 6 Handout 3: Digitisation and Digital Signals
Communications IB Paper 6 Handout 3: Digitisation and Digital Signals Jossy Sayir Signal Processing and Communications Lab Department of Engineering University of Cambridge jossy.sayir@eng.cam.ac.uk Lent
More informationData Communication. Chapter 3 Data Transmission
Data Communication Chapter 3 Data Transmission ١ Terminology (1) Transmitter Receiver Medium Guided medium e.g. twisted pair, coaxial cable, optical fiber Unguided medium e.g. air, water, vacuum ٢ Terminology
More informationCHAPTER 6 INTRODUCTION TO SYSTEM IDENTIFICATION
CHAPTER 6 INTRODUCTION TO SYSTEM IDENTIFICATION Broadly speaking, system identification is the art and science of using measurements obtained from a system to characterize the system. The characterization
More informationPulse Code Modulation
Pulse Code Modulation EE 44 Spring Semester Lecture 9 Analog signal Pulse Amplitude Modulation Pulse Width Modulation Pulse Position Modulation Pulse Code Modulation (3-bit coding) 1 Advantages of Digital
More informationElectrical & Computer Engineering Technology
Electrical & Computer Engineering Technology EET 419C Digital Signal Processing Laboratory Experiments by Masood Ejaz Experiment # 1 Quantization of Analog Signals and Calculation of Quantized noise Objective:
More informationBiomedical Signals. Signals and Images in Medicine Dr Nabeel Anwar
Biomedical Signals Signals and Images in Medicine Dr Nabeel Anwar Noise Removal: Time Domain Techniques 1. Synchronized Averaging (covered in lecture 1) 2. Moving Average Filters (today s topic) 3. Derivative
More informationRemoval of ocular artifacts from EEG signals using adaptive threshold PCA and Wavelet transforms
Available online at www.interscience.in Removal of ocular artifacts from s using adaptive threshold PCA and Wavelet transforms P. Ashok Babu 1, K.V.S.V.R.Prasad 2 1 Narsimha Reddy Engineering College,
More informationSignal Processing. Naureen Ghani. December 9, 2017
Signal Processing Naureen Ghani December 9, 27 Introduction Signal processing is used to enhance signal components in noisy measurements. It is especially important in analyzing time-series data in neuroscience.
More informationCG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003
CG40 Advanced Dr Stuart Lawson Room A330 Tel: 23780 e-mail: ssl@eng.warwick.ac.uk 03 January 2003 Lecture : Overview INTRODUCTION What is a signal? An information-bearing quantity. Examples of -D and 2-D
More informationVoice Transmission --Basic Concepts--
Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Telephone Handset (has 2-parts) 2 1. Transmitter
More informationCOMP211 Physical Layer
COMP211 Physical Layer Data and Computer Communications 7th edition William Stallings Prentice Hall 2004 Computer Networks 5th edition Andrew S.Tanenbaum, David J.Wetherall Pearson 2011 Material adapted
More informationCSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued
CSCD 433 Network Programming Fall 2016 Lecture 5 Physical Layer Continued 1 Topics Definitions Analog Transmission of Digital Data Digital Transmission of Analog Data Multiplexing 2 Different Types of
More informationFundamentals of Digital Audio *
Digital Media The material in this handout is excerpted from Digital Media Curriculum Primer a work written by Dr. Yue-Ling Wong (ylwong@wfu.edu), Department of Computer Science and Department of Art,
More informationTerminology (1) Chapter 3. Terminology (3) Terminology (2) Transmitter Receiver Medium. Data Transmission. Simplex. Direct link.
Chapter 3 Data Transmission Terminology (1) Transmitter Receiver Medium Guided medium e.g. twisted pair, optical fiber Unguided medium e.g. air, water, vacuum Corneliu Zaharia 2 Corneliu Zaharia Terminology
More informationChapter 2: Digitization of Sound
Chapter 2: Digitization of Sound Acoustics pressure waves are converted to electrical signals by use of a microphone. The output signal from the microphone is an analog signal, i.e., a continuous-valued
More informationSound pressure level calculation methodology investigation of corona noise in AC substations
International Conference on Advanced Electronic Science and Technology (AEST 06) Sound pressure level calculation methodology investigation of corona noise in AC substations,a Xiaowen Wu, Nianguang Zhou,
More informationDigital Audio. Lecture-6
Digital Audio Lecture-6 Topics today Digitization of sound PCM Lossless predictive coding 2 Sound Sound is a pressure wave, taking continuous values Increase / decrease in pressure can be measured in amplitude,
More informationDigital Sampling. This Lecture. Engr325 Instrumentation. Dr Curtis Nelson. Digital sampling Sample rate. Bit depth. Other terms. Types of conversion.
Digital Sampling Engr325 Instrumentation Dr Curtis Nelson Digital sampling Sample rate. Bit depth. Other terms. Types of conversion. This Lecture 1 Data Acquisition and Control Computers are nearly always
More informationWaveform Encoding - PCM. BY: Dr.AHMED ALKHAYYAT. Chapter Two
Chapter Two Layout: 1. Introduction. 2. Pulse Code Modulation (PCM). 3. Differential Pulse Code Modulation (DPCM). 4. Delta modulation. 5. Adaptive delta modulation. 6. Sigma Delta Modulation (SDM). 7.
More informationAudio and Speech Compression Using DCT and DWT Techniques
Audio and Speech Compression Using DCT and DWT Techniques M. V. Patil 1, Apoorva Gupta 2, Ankita Varma 3, Shikhar Salil 4 Asst. Professor, Dept.of Elex, Bharati Vidyapeeth Univ.Coll.of Engg, Pune, Maharashtra,
More informationIntroduction to Wavelet Transform. Chapter 7 Instructor: Hossein Pourghassem
Introduction to Wavelet Transform Chapter 7 Instructor: Hossein Pourghassem Introduction Most of the signals in practice, are TIME-DOMAIN signals in their raw format. It means that measured signal is a
More informationCommunications I (ELCN 306)
Communications I (ELCN 306) c Samy S. Soliman Electronics and Electrical Communications Engineering Department Cairo University, Egypt Email: samy.soliman@cu.edu.eg Website: http://scholar.cu.edu.eg/samysoliman
More informationMultiresolution Analysis of Connectivity
Multiresolution Analysis of Connectivity Atul Sajjanhar 1, Guojun Lu 2, Dengsheng Zhang 2, Tian Qi 3 1 School of Information Technology Deakin University 221 Burwood Highway Burwood, VIC 3125 Australia
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationAudio Imputation Using the Non-negative Hidden Markov Model
Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han 1,, Gautham J. Mysore 2, and Bryan Pardo 1 1 EECS Department, Northwestern University 2 Advanced Technology Labs, Adobe Systems Inc.
More informationSPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING
SPEECH ENHANCEMENT WITH SIGNAL SUBSPACE FILTER BASED ON PERCEPTUAL POST FILTERING K.Ramalakshmi Assistant Professor, Dept of CSE Sri Ramakrishna Institute of Technology, Coimbatore R.N.Devendra Kumar Assistant
More informationE40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1
E40M Sound and Music M. Horowitz, J. Plummer, R. Howe 1 LED Cube Project #3 In the next several lectures, we ll study Concepts Coding Light Sound Transforms/equalizers Devices LEDs Analog to digital converters
More informationOriginal Research Articles
Original Research Articles Researchers A.K.M Fazlul Haque Department of Electronics and Telecommunication Engineering Daffodil International University Emailakmfhaque@daffodilvarsity.edu.bd FFT and Wavelet-Based
More informationSingle Channel Speaker Segregation using Sinusoidal Residual Modeling
NCC 2009, January 16-18, IIT Guwahati 294 Single Channel Speaker Segregation using Sinusoidal Residual Modeling Rajesh M Hegde and A. Srinivas Dept. of Electrical Engineering Indian Institute of Technology
More informationAn Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
Proceedings of the th WSEAS International Conference on Signal Processing, Istanbul, Turkey, May 7-9, 6 (pp4-44) An Adaptive Algorithm for Speech Source Separation in Overcomplete Cases Using Wavelet Packets
More informationProblem Sheet 1 Probability, random processes, and noise
Problem Sheet 1 Probability, random processes, and noise 1. If F X (x) is the distribution function of a random variable X and x 1 x 2, show that F X (x 1 ) F X (x 2 ). 2. Use the definition of the cumulative
More informationCHAPTER 4. PULSE MODULATION Part 2
CHAPTER 4 PULSE MODULATION Part 2 Pulse Modulation Analog pulse modulation: Sampling, i.e., information is transmitted only at discrete time instants. e.g. PAM, PPM and PDM Digital pulse modulation: Sampling
More informationTransmission Fundamentals
College of Computer & Information Science Wireless Networks Northeastern University Lecture 1 Transmission Fundamentals Signals Data rate and bandwidth Nyquist sampling theorem Shannon capacity theorem
More informationDigital Communication (650533) CH 3 Pulse Modulation
Philadelphia University/Faculty of Engineering Communication and Electronics Engineering Digital Communication (650533) CH 3 Pulse Modulation Instructor: Eng. Nada Khatib Website: http://www.philadelphia.edu.jo/academics/nkhatib/
More informationData Communications & Computer Networks
Data Communications & Computer Networks Chapter 3 Data Transmission Fall 2008 Agenda Terminology and basic concepts Analog and Digital Data Transmission Transmission impairments Channel capacity Home Exercises
More informationClassification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise
Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise Noha KORANY 1 Alexandria University, Egypt ABSTRACT The paper applies spectral analysis to
More informationSGN Audio and Speech Processing
Introduction 1 Course goals Introduction 2 SGN 14006 Audio and Speech Processing Lectures, Fall 2014 Anssi Klapuri Tampere University of Technology! Learn basics of audio signal processing Basic operations
More informationEffects of Basis-mismatch in Compressive Sampling of Continuous Sinusoidal Signals
Effects of Basis-mismatch in Compressive Sampling of Continuous Sinusoidal Signals Daniel H. Chae, Parastoo Sadeghi, and Rodney A. Kennedy Research School of Information Sciences and Engineering The Australian
More informationChapter 3. Data Transmission
Chapter 3 Data Transmission Reading Materials Data and Computer Communications, William Stallings Terminology (1) Transmitter Receiver Medium Guided medium (e.g. twisted pair, optical fiber) Unguided medium
More informationIslamic University of Gaza. Faculty of Engineering Electrical Engineering Department Spring-2011
Islamic University of Gaza Faculty of Engineering Electrical Engineering Department Spring-2011 DSP Laboratory (EELE 4110) Lab#4 Sampling and Quantization OBJECTIVES: When you have completed this assignment,
More informationICA & Wavelet as a Method for Speech Signal Denoising
ICA & Wavelet as a Method for Speech Signal Denoising Ms. Niti Gupta 1 and Dr. Poonam Bansal 2 International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(3), pp. 035 041 DOI: http://dx.doi.org/10.21172/1.73.505
More informationAnalysis on Extraction of Modulated Signal Using Adaptive Filtering Algorithms against Ambient Noises in Underwater Communication
International Journal of Signal Processing Systems Vol., No., June 5 Analysis on Extraction of Modulated Signal Using Adaptive Filtering Algorithms against Ambient Noises in Underwater Communication S.
More informationPerformance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches
Performance study of Text-independent Speaker identification system using & I for Telephone and Microphone Speeches Ruchi Chaudhary, National Technical Research Organization Abstract: A state-of-the-art
More informationDetection Algorithm of Target Buried in Doppler Spectrum of Clutter Using PCA
Detection Algorithm of Target Buried in Doppler Spectrum of Clutter Using PCA Muhammad WAQAS, Shouhei KIDERA, and Tetsuo KIRIMOTO Graduate School of Electro-Communications, University of Electro-Communications
More informationCommunication Systems Lecture-12: Delta Modulation and PTM
Communication Systems Lecture-12: Delta Modulation and PTM Department of Electrical and Computer Engineering Lebanese American University chadi.abourjeily@lau.edu.lb October 26, 2017 Delta Modulation (1)
More informationMusic 270a: Fundamentals of Digital Audio and Discrete-Time Signals
Music 270a: Fundamentals of Digital Audio and Discrete-Time Signals Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego October 3, 2016 1 Continuous vs. Discrete signals
More informationLecture Schedule: Week Date Lecture Title
http://elec3004.org Sampling & More 2014 School of Information Technology and Electrical Engineering at The University of Queensland Lecture Schedule: Week Date Lecture Title 1 2-Mar Introduction 3-Mar
More informationApplications of Music Processing
Lecture Music Processing Applications of Music Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Singing Voice Detection Important pre-requisite
More informationSUMMARY THEORY. VMD vs. EMD
Seismic Denoising Using Thresholded Adaptive Signal Decomposition Fangyu Li, University of Oklahoma; Sumit Verma, University of Texas Permian Basin; Pan Deng, University of Houston; Jie Qi, and Kurt J.
More informationTopic 2. Signal Processing Review. (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music)
Topic 2 Signal Processing Review (Some slides are adapted from Bryan Pardo s course slides on Machine Perception of Music) Recording Sound Mechanical Vibration Pressure Waves Motion->Voltage Transducer
More informationNonuniform multi level crossing for signal reconstruction
6 Nonuniform multi level crossing for signal reconstruction 6.1 Introduction In recent years, there has been considerable interest in level crossing algorithms for sampling continuous time signals. Driven
More informationBEAT DETECTION BY DYNAMIC PROGRAMMING. Racquel Ivy Awuor
BEAT DETECTION BY DYNAMIC PROGRAMMING Racquel Ivy Awuor University of Rochester Department of Electrical and Computer Engineering Rochester, NY 14627 rawuor@ur.rochester.edu ABSTRACT A beat is a salient
More informationEE482: Digital Signal Processing Applications
Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/
More informationBasic Signals and Systems
Chapter 2 Basic Signals and Systems A large part of this chapter is taken from: C.S. Burrus, J.H. McClellan, A.V. Oppenheim, T.W. Parks, R.W. Schafer, and H. W. Schüssler: Computer-based exercises for
More informationECE 201: Introduction to Signal Analysis
ECE 201: Introduction to Signal Analysis Prof. Paris Last updated: October 9, 2007 Part I Spectrum Representation of Signals Lecture: Sums of Sinusoids (of different frequency) Introduction Sum of Sinusoidal
More informationFundamentals of Digital Communication
Fundamentals of Digital Communication Network Infrastructures A.A. 2017/18 Digital communication system Analog Digital Input Signal Analog/ Digital Low Pass Filter Sampler Quantizer Source Encoder Channel
More informationCMPT 318: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals
CMPT 318: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University January 16, 2006 1 Continuous vs. Discrete
More informationTE 302 DISCRETE SIGNALS AND SYSTEMS. Chapter 1: INTRODUCTION
TE 302 DISCRETE SIGNALS AND SYSTEMS Study on the behavior and processing of information bearing functions as they are currently used in human communication and the systems involved. Chapter 1: INTRODUCTION
More informationTHE CITADEL THE MILITARY COLLEGE OF SOUTH CAROLINA. Department of Electrical and Computer Engineering. ELEC 423 Digital Signal Processing
THE CITADEL THE MILITARY COLLEGE OF SOUTH CAROLINA Department of Electrical and Computer Engineering ELEC 423 Digital Signal Processing Project 2 Due date: November 12 th, 2013 I) Introduction In ELEC
More informationContinuous vs. Discrete signals. Sampling. Analog to Digital Conversion. CMPT 368: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals
Continuous vs. Discrete signals CMPT 368: Lecture 4 Fundamentals of Digital Audio, Discrete-Time Signals Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University January 22,
More informationQUESTION BANK SUBJECT: DIGITAL COMMUNICATION (15EC61)
QUESTION BANK SUBJECT: DIGITAL COMMUNICATION (15EC61) Module 1 1. Explain Digital communication system with a neat block diagram. 2. What are the differences between digital and analog communication systems?
More informationSAMPLING THEORY. Representing continuous signals with discrete numbers
SAMPLING THEORY Representing continuous signals with discrete numbers Roger B. Dannenberg Professor of Computer Science, Art, and Music Carnegie Mellon University ICM Week 3 Copyright 2002-2013 by Roger
More informationDigital Video and Audio Processing. Winter term 2002/ 2003 Computer-based exercises
Digital Video and Audio Processing Winter term 2002/ 2003 Computer-based exercises Rudolf Mester Institut für Angewandte Physik Johann Wolfgang Goethe-Universität Frankfurt am Main 6th November 2002 Chapter
More informationCOMPARATIVE STUDY OF VARIOUS FIXED AND VARIABLE ADAPTIVE FILTERS IN WIRELESS COMMUNICATION FOR ECHO CANCELLATION USING SIMULINK MODEL
COMPARATIVE STUDY OF VARIOUS FIXED AND VARIABLE ADAPTIVE FILTERS IN WIRELESS COMMUNICATION FOR ECHO CANCELLATION USING SIMULINK MODEL Mr. R. M. Potdar 1, Mr. Mukesh Kumar Chandrakar 2, Mrs. Bhupeshwari
More informationChapter 2: Signal Representation
Chapter 2: Signal Representation Aveek Dutta Assistant Professor Department of Electrical and Computer Engineering University at Albany Spring 2018 Images and equations adopted from: Digital Communications
More informationSignals, sampling & filtering
Signals, sampling & filtering Scientific Computing Fall, 2018 Paul Gribble 1 Time domain representation of signals 1 2 Frequency domain representation of signals 2 3 Fast Fourier transform (FFT) 2 4 Sampling
More informationE40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1
E40M Sound and Music M. Horowitz, J. Plummer, R. Howe 1 LED Cube Project #3 In the next several lectures, we ll study Concepts Coding Light Sound Transforms/equalizers Devices LEDs Analog to digital converters
More informationAudio Compression using the MLT and SPIHT
Audio Compression using the MLT and SPIHT Mohammed Raad, Alfred Mertins and Ian Burnett School of Electrical, Computer and Telecommunications Engineering University Of Wollongong Northfields Ave Wollongong
More informationDigital Speech Processing and Coding
ENEE408G Spring 2006 Lecture-2 Digital Speech Processing and Coding Spring 06 Instructor: Shihab Shamma Electrical & Computer Engineering University of Maryland, College Park http://www.ece.umd.edu/class/enee408g/
More informationIn this lecture. System Model Power Penalty Analog transmission Digital transmission
System Model Power Penalty Analog transmission Digital transmission In this lecture Analog Data Transmission vs. Digital Data Transmission Analog to Digital (A/D) Conversion Digital to Analog (D/A) Conversion
More informationNOISE ESTIMATION IN A SINGLE CHANNEL
SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina
More informationMedia Devices: Audio. CTEC1465/2018S Computer System Support
Media Devices: Audio CTEC1465/2018S Computer System Support Learning Objective Describe how to implement sound in a PC Introduction The process by which sounds are stored in electronic format on your PC
More informationSampling and Reconstruction of Analog Signals
Sampling and Reconstruction of Analog Signals Chapter Intended Learning Outcomes: (i) Ability to convert an analog signal to a discrete-time sequence via sampling (ii) Ability to construct an analog signal
More informationTELECOMMUNICATION SYSTEMS
TELECOMMUNICATION SYSTEMS By Syed Bakhtawar Shah Abid Lecturer in Computer Science 1 MULTIPLEXING An efficient system maximizes the utilization of all resources. Bandwidth is one of the most precious resources
More informationLecture 3 Concepts for the Data Communications and Computer Interconnection
Lecture 3 Concepts for the Data Communications and Computer Interconnection Aim: overview of existing methods and techniques Terms used: -Data entities conveying meaning (of information) -Signals data
More informationAudio Signal Compression using DCT and LPC Techniques
Audio Signal Compression using DCT and LPC Techniques P. Sandhya Rani#1, D.Nanaji#2, V.Ramesh#3,K.V.S. Kiran#4 #Student, Department of ECE, Lendi Institute Of Engineering And Technology, Vizianagaram,
More informationWavelet Packets Best Tree 4 Points Encoded (BTE) Features
Wavelet Packets Best Tree 4 Points Encoded (BTE) Features Amr M. Gody 1 Fayoum University Abstract The research aimed to introduce newly designed features for speech signal. The newly developed features
More informationFinite Word Length Effects on Two Integer Discrete Wavelet Transform Algorithms. Armein Z. R. Langi
International Journal on Electrical Engineering and Informatics - Volume 3, Number 2, 211 Finite Word Length Effects on Two Integer Discrete Wavelet Transform Algorithms Armein Z. R. Langi ITB Research
More informationThe Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals Maria G. Jafari and Mark D. Plumbley Centre for Digital Music, Queen Mary University of London, UK maria.jafari@elec.qmul.ac.uk,
More informationAudio /Video Signal Processing. Lecture 1, Organisation, A/D conversion, Sampling Gerald Schuller, TU Ilmenau
Audio /Video Signal Processing Lecture 1, Organisation, A/D conversion, Sampling Gerald Schuller, TU Ilmenau Gerald Schuller gerald.schuller@tu ilmenau.de Organisation: Lecture each week, 2SWS, Seminar
More informationHIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM
HIGH QUALITY AUDIO CODING AT LOW BIT RATE USING WAVELET AND WAVELET PACKET TRANSFORM DR. D.C. DHUBKARYA AND SONAM DUBEY 2 Email at: sonamdubey2000@gmail.com, Electronic and communication department Bundelkhand
More informationStructure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping
Structure of Speech Physical acoustics Time-domain representation Frequency domain representation Sound shaping Speech acoustics Source-Filter Theory Speech Source characteristics Speech Filter characteristics
More informationLaboratory Assignment 2 Signal Sampling, Manipulation, and Playback
Laboratory Assignment 2 Signal Sampling, Manipulation, and Playback PURPOSE This lab will introduce you to the laboratory equipment and the software that allows you to link your computer to the hardware.
More informationHigh-speed Noise Cancellation with Microphone Array
Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent
More informationA TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION. Scott Deeann Chen and Pierre Moulin
A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION Scott Deeann Chen and Pierre Moulin University of Illinois at Urbana-Champaign Department of Electrical and Computer Engineering 5 North Mathews
More informationPerformance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue - 8 August, 2014 Page No. 7727-7732 Performance Analysis of MFCC and LPCC Techniques in Automatic
More information