Research Article Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition

Size: px
Start display at page:

Download "Research Article Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition"

Transcription

1 Mathematical Problems in Engineering, Article ID , 7 pages Research Article Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition Neng-Sheng Pai, Hua-Jui Kuang, Ting-Yuan Chang, Ying-Che Kuo, and Chun-Yuan Lai Department of Electrical Engineering, National Chin-Yi University of Technology, No. 57, Section 2, Zhongshan Road, Taiping District, Taichung 41170, Taiwan Correspondence should be addressed to Neng-Sheng Pai; pai@mail.ncut.edu.tw Received 3 January 2014; Accepted 1 March 2014; Published 24 March 2014 Academic Editor: Her-Terng Yau Copyright 2014 Neng-Sheng Pai et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This paper applied speech recognition and RFID technologies to develop an omni-directional mobile robot into a robot with voice control and guide introduction functions. For speech recognition, the speech signals were captured by short-time processing. The speaker first recorded the isolated words for the robot to create speech database of specific speakers. After the speech pre-processing of this speech database, the feature parameters of cepstrum and delta-cepstrum were obtained using linear predictive coefficient (LPC). Then, the Hidden Markov Model (HMM) was used for model training of the speech database, and the Viterbi algorithm was used to find an optimal state sequence as the reference sample for speech recognition. The trained reference model was put into the industrial computer on the robot platform, and the user entered the isolated words to be tested. After processing by the same reference model and comparing with previous reference model, the path of the maximum total probability in various models found using the Viterbi algorithm in the recognition was the recognition result. Finally, the speech recognition and RFID systems were achieved in an actual environment to prove its feasibility and stability, and implemented into the omni-directional mobile robot. 1. Introduction For speech recognition, the dissimilarity between the signal characteristic values and the characteristic values in the database was calculated in early stages to identify the minimum difference as the recognition result. However, this method has a problem of poor recognition effect due to different talking speeds. Afterwards, some scholars proposed the dynamic time warping (DTW) to improve the recognition effect [1, 2]. This method assumes two speech signal segments to be compared, and the short-time feature parameters of the two segments of speech are extracted, namely, separated into a string of frames to determine a group of parameters from each frame. The comparison between two segments of speech is indeed the comparison between two sequence feature parameters. The DTW can adjust the speech length to reduce the errors in the speech time span. In the recognition system following DTW, the Artificial Neural Network (ANN) and HMM algorithms were proposed. The ANN is a method often used in the artificial intelligence domain [3, 4]. The ANN does not need to know the mathematical model of the system in experimental data modeling and highly complex recognition of images, letters, or sound, so that it can replace system models. Knowing the typeofoutputsgeneratedfromthetypeofinputcouldachieve good recognition effect after learning and repeated training. However, the ANN updates weights and bias iteratively, and the amount of calculation is large, so it consumes a lot of computer resources. The HMM uses probability model to describe the pronunciation in statistics [5 7]. A continuous statetransitioninmarkovmodelcanberegardedasthe phonation of a short speech segment, namely, a string of connected HMM, which is the representative for segment of speech. The HMM is a method mostly used in speech

2 2 Mathematical Problems in Engineering RF 9 V Computer side RFID reader RFID tag 24 V lead acid battery Transform to 5 V. 9 V. 24 V Power circuit RS232 RF 110 V 5 V 5 V User side Industrial computer Motor encoder I/O RS232 Encoder board Wireless handset Bluetooth I/O USB 5 V 9 V RFID tag Bluetooth dongle Bluetooth serial adapter RS232 PIC microcontroller 12 V lead acid battery 5 V Ultrasonic seneor I/O 9 V 12 C PIC microcontroller Omnidirectional mobile robot side 500 W power inverter DC motor 24 V PWM Motor driver Figure 1: Robot system hardware link. recognition in recent years. This paper uses the HMM as the speech recognition core. 2. Hardware Design The direction of the voice controlled guide type omnidirectional mobile robot is controlled by voice, and the robot has the RFID guide system and infrared image tracking and ultrasonic obstacle avoidance functions [8]. All of the proposed robot systems are configured with three subsystems; they are omnidirectional mobile robot side, computer side, and user side and shown in Figure 1. ThePeripheral Interface Controller (PIC) microcontroller is the core on the omnidirectional mobile robot side; its main function includes signal process of peripheral devices and motor drive control for three wheels. The computer side uses an industrial computer processing speech recognition calculation, RFID guidesystem,andinfraredimagetracking.theusersideuses a wireless headset and the RFID active tag as voice control equipments. 3. HMM-Based Speech Recognition System 3.1. Preprocessing and Feature Parameter Extraction. The speech signals are preprocessed before speech recognition. The speech preprocessing contains sampling, frame, endpoint detection, preemphasis, and windowing. After the speech signal preprocessing, the characteristic feature parameters are identified for subsequent recognition calculation. In this paper, the Linear Predictive Coefficient (LPC) is used to deduce the cepstrum and delta-cepstrum as the most important feature parameters. The concept of linear prediction originates from that the amplitude of a sampling point is related to the amplitude of an adjacent sampling point during pronunciation. If the postsampling sequence of speech signals is S(n), the present sample of speech signal, that is, S(n), isthesamplevaluesof time n.if S(n) isthepredictedvalueofs(n),sincetheremust be an error between the predicted value and the actual value, the predicted error can be expressed as e(n), as follows: e (n) =S(n) S (n) =S(n) l j=1 a j S(n j), (1) where a j is the linear predictive coding and l is the order number of linear prediction. The coefficient a j is adjusted; as long as the squared error value of (1) is minimized, an optimal linear predictive coefficient a j can be obtained. The autocorrelation is determined before solving the linear predictive coefficient, and then the wanted linear predictive coefficient is obtained from the obtained autocorrelation using the Durbin algorithm. After determining the LPC, the cepstrum coefficient is deduced from the LPC [9]. The cepstrum coefficient separates the vocal tract model from excitation signal, and it can calculate the vocal tract parameters more precisely, so as to control

3 Mathematical Problems in Engineering 3 Start RFID actives tag transmits signal No Whether RFID reader receives tag signal or not RFID reader continues detection Yes Fixed tag is read Tag ID recognition Tag in user's hand is read No Introduce or not (speech recognition system) Yes Introduce the site End Enter verbal command on direction Speech recognition system Robot movement control Figure 2: System operation flow of voice controlled guide type omnidirectional mobile robot. j 1 c j =a j + k=1 (1 k j ) a k c j k, (1 < j l), c j = p k=1 (1 k j ) a k c j k, (j > l). (2) In a practical environment, the external noise influences the speech receiving, so that the tone in the spectrum is disturbed and distorted. The delta-cepstrum can reduce this noise effect. The delta-cepstrum parameter Δ c j (t) is shown in (3), where τ isthenumberofrelatedformer( K)orlatter frames (K). The cepstrum and delta-cepstrum parameters are to be used as feature parameters for recognition: Figure 3: Omnidirectional mobile robot. the speech spectrum characteristics. The cepstrum coefficient c j is determined from the linear predictive coefficient a j, where l is the order number of linear prediction, shown as follows: c 1 =a 1, Δ c j (t) = d c j (t) dt 3.2. HMM and Training Reference Model = K τ= K τ c j (t+τ) K τ= K τ2. (3) Build the Initial Model. The states and frames are separated averagely from the audio part of a segment of speech according to the preset HMM state number, and the feature vectors in the frames are used to calculate the mean value M j and variance Var j,asshownin(4), where A is a state of HMM, i is the frame, j is the feature parameter, T is thenumberofframesinastate,andq is the number of feature vectors of cepstrum and delta-cepstrum. This paper uses 15

4 4 Mathematical Problems in Engineering (a) (b) (c) (d) Figure 4: User using speech to control robot to move forward and turn left. (a) User commands robot to move forward. (b) User commands robot to stop. (c) User commands robot to turn left. (d) Robot turns left and moves on. cepstrum and 15 delta-cepstrum as characteristic values, and q is 30: M j = T i=1 A (i, j), 1 j q, T Var j = T i=1 [A (i, j)]2 (M T j ) 2, 1 j q. (4) Viterbi Algorithm. In order to obtain the correct relationshipbetweenframeandhmmstatemoreaccurately,this paperusesagaussian probabilityfunction [10] to determine thesimilarityprobabilityvalueofstateandframe.ahigher probability value indicates a higher similarity between the corresponding frame and the state, as shown in (5), where G i (x T ) is the probability value of each state corresponding to its frame, d is the feature vector dimension, x T is the feature vector, τ Ri isthemeanvalueofstates,r i is the covariance

5 Mathematical Problems in Engineering 5 (a) (b) (c) (d) Figure 5: Robot guide experiment. (a) User commands robot to move forward. (b) Robot detects tag and asks user whether he needs any introduction to the place or not. (c) User says YES. (d) Robot plays video. matrix of the density function, and G i (x T ) is the probability value of similarity between the feature vector x T and state i: 1 G i (x T )= exp { 1 (2π) d R 2 (x T τ Ri ) T R 1 i (x T τ Ri )}. i (5) The HMM can be represented by λ={π, A, B, S, V}, (6) where S = {s 1,s 2,...,s N } is the state sequence, N is the state number, V is the observed results, π = {π i } is the initial state probability, A = {a ij } is the state transition probability, B={b j (O t )} is the state observation probability, O t ={O 1,O 2,...,O T } is the observation sequence, and T is thesequencelength. The Gaussian probability density function determines the probability value between frame and state. The HMM has many optional paths for state transition, and the path with the maximum total probability value among all possible paths is

6 6 Mathematical Problems in Engineering required to be found. This paper uses the Viterbi algorithm [11, 12], as shown in (7) (10), where δ i (i) is the probability of staying in state i at time t. ψ t (i) is the probability of reaching state i at time t, p isthefinalprobabilityvalueoftheviterbi algorithm, and S T is the optimal state sequence. Step 1. Initializing Step 2. Recursing Step 3. Terminating δ t (i) =π i b i (o 1 ), ψ t (i) =0. 1 i N, δ t+1 (j) = max 1 i N [δ t (i) a ij ] b j (o t+1 ) Step 4. Path backtracking ψ t+1 (j) = arg max [δ t (i) a ij ], 1 i N 1 t T 1; 1 j N. p= max [δ t (i)], 1 i N S T = arg max [δ t (i)]. 1 i N (7) (8) (9) S t =ψ t+1 (S t+! ), t=t 1,T 2,T 3,...1. (10) Reevaluation. After the new relationship between state and frame is obtained using the Viterbi algorithm, the mean value and variance in old state are updated, and the Gaussian density function is used to determine the updated probabilitybetweenstateandframeagain.thenewtotal probability value is obtained using the Viterbi algorithm. The update continues until the maximum total probability value is converged, and this is the reference model after training Speech Recognition. The needed commands are trained into models, which serve as reference database of speech recognition. The feature parameters are determined according to previous procedure during recognition. The reference modelsofdatabasearecomparedusingtheviterbialgorithm to determine the probability value of each model and find the optimal state sequence. The time warping of speech signals is solved automatically when corresponding to a sequence of frames to the state sequence. The key point in the speech training procedure is to identify the correlation betweenframeandstate.therelationshipbetweenframeand state should be updated by continuous path backtracking of Viterbi, until the path with the maximum total probability is determined. The most important step in the recognition procedure is to compare the reference models of training and obtain the maximum total probability value in reference models. Table 1: Recognition rates for the speaker dependent and speaker independent. Speaker dependent Recognition rates Chun-Yuan 96.7% Jian-Min 93.3% Yi-Chung 90% Wei 96.7% Hung-Hui 93.3% Average recognition rates 94% Speaker independent Recognition rates Jason 66.7% Ian 73.3% Andy 90% Momo 83.3% Apple 50% Average recognition rates 74.7% 4. Experiment Results Figure 2 shows the system operation flow of the voice controlled guide type omnidirectional mobile robot. In the RFID guide system, the Reader captures Tag data and then attaches environmental information to the Tags of different ID codes or starts up the speech function. Figure 3 shows the picture of the proposed omnidirectional mobile robot. We place the robot in the actual environment and test various moving actions (forward, backward, turn left, turn right, stop, and turn back). The voice control of speaker dependent and speaker independent are tested by five users, respectively, and the experimental results of speech recognition rates are shown in Table 1. Figure 4 shows the experiment of the user using speech to control the robot to move forward and turn left. Figure 5 shows the user using speech to control the robot to move forward, receiving the Tag of the classroom when passingbytheclassroom,theusercanuseyesornotochoose whether accessing detailed information on the site. The site is introduced in the video format, so that the user can get acquainted with the environment quickly. 5. Conclusions This paper used the HMM-based speech recognition method to complete a voice controlled guide type omnidirectional mobile robot. The first convenience of voice control is that the operation does not require manual operation, which makes the robot more user-friendly. The guide system based on RFID technology enables the users to know the information of an unfamiliar environment quickly. Finally, the robot movement experiment and the robot guide system experiment proved the feasibility and stability of this voice controlled guide type omnidirectional mobile robot. Conflict of Interests The authors declare no conflict of interests.

7 Mathematical Problems in Engineering 7 Acknowledgment The financial support of this research by the National Science Council of Taiwan, under Grant no. NSC E is greatly appreciated. References [1] H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing,vol.26,no.1,pp.43 49, [2] C. Kim and K.-D. Seo, Robust DTW-based recognition algorithmforhand-heldconsumerdevices, IEEE Transactions on Consumer Electronics,vol.51,no.2,pp ,2005. [3] D. P. Morgan and C. L. Scofield, Eds., Neural Networks and Speech Processing, Kluwer Academic Publishers, [4] C.-F. Juang, C.-T. Chiou, and C.-L. Lai, Hierarchical singletontype recurrent neural fuzzy networks for noisy speech recognition, IEEE Transactions on Neural Networks, vol.18,no.3,pp , [5] L. R. Rabiner, A tutorial on hidden Markov Models and selected applications in speech recognition, IEEE T Acoust Speech, vol. 77, pp , [6] S.Yoshizawa,N.Wada,N.Hayasaka,andY.Miyanaga, Scalable architecture for word HMM-based speech recognition and VLSI implementation in complete system, IEEE Transactions on Circuits and Systems I: Regular Papers,vol.53,no.1,pp.70 77, [7] J.-H. Im and S.-Y. Lee, Unified training of feature extractor and HMM classifier for speech recognition, IEEE Signal Processing Letters,vol.19,no.2,pp ,2012. [8] S. F. Huang, Design and Implementation of an Autonomous Following Omni-Directional Mobile Robot, National Digital Library of Theses and Dissertations, Taipei, Taiwan, [9] Y. Yuan, P. Zhao, and Q. Zhou, Research of speaker recognition based on combination of LPCC and MFCC, in Proceedings of the IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 10), pp , Xiamen, China, October [10] L. Liu and J. He, On the use of orthogonal GMM in speaker recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 99), pp , Phoenix, Ariz, USA, March [11] C. C. Wen, Ed., Multimedia Applications for Speech Recognition System, National Digital Library of Theses and Dissertations, Taipei, Taiwan, [12] D. F. Tseng, Robust decoding for convolutionally coded systems impaired by memoryless impulsive noise, IEEE Transactions on Communications,vol.61,pp ,2013.

8 Advances in Operations Research Advances in Decision Sciences Applied Mathematics Algebra Probability and Statistics The Scientific World Journal International Differential Equations Submit your manuscripts at International Advances in Combinatorics Mathematical Physics Complex Analysis International Mathematics and Mathematical Sciences Mathematical Problems in Engineering Mathematics Discrete Mathematics Discrete Dynamics in Nature and Society Function Spaces Abstract and Applied Analysis International Stochastic Analysis Optimization

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

Research Article Small-Size Meandered Loop Antenna for WLAN Dongle Devices

Research Article Small-Size Meandered Loop Antenna for WLAN Dongle Devices Antennas and Propagation Volume 214, Article ID 89764, 7 pages http://dx.doi.org/1.11/214/89764 Research Article Small-Size Meandered Loop Antenna for WLAN Dongle Devices Wen-Shan Chen, Chien-Min Cheng,

More information

High-speed Noise Cancellation with Microphone Array

High-speed Noise Cancellation with Microphone Array Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis High-speed Noise Cancellation with Microphone Array We propose the use of a microphone array based on independent

More information

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS Kuldeep Kumar 1, R. K. Aggarwal 1 and Ankita Jain 2 1 Department of Computer Engineering, National Institute

More information

SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS

SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS 1 WAHYU KUSUMA R., 2 PRINCE BRAVE GUHYAPATI V 1 Computer Laboratory Staff., Department of Information Systems, Gunadarma University,

More information

Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System

Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System Performance Analysiss of Speech Enhancement Algorithm for Robust Speech Recognition System C.GANESH BABU 1, Dr.P..T.VANATHI 2 R.RAMACHANDRAN 3, M.SENTHIL RAJAA 3, R.VENGATESH 3 1 Research Scholar (PSGCT)

More information

Performance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment

Performance analysis of voice activity detection algorithm for robust speech recognition system under different noisy environment BABU et al: VOICE ACTIVITY DETECTION ALGORITHM FOR ROBUST SPEECH RECOGNITION SYSTEM Journal of Scientific & Industrial Research Vol. 69, July 2010, pp. 515-522 515 Performance analysis of voice activity

More information

SPEECH ENHANCEMENT USING PITCH DETECTION APPROACH FOR NOISY ENVIRONMENT

SPEECH ENHANCEMENT USING PITCH DETECTION APPROACH FOR NOISY ENVIRONMENT SPEECH ENHANCEMENT USING PITCH DETECTION APPROACH FOR NOISY ENVIRONMENT RASHMI MAKHIJANI Department of CSE, G. H. R.C.E., Near CRPF Campus,Hingna Road, Nagpur, Maharashtra, India rashmi.makhijani2002@gmail.com

More information

Using RASTA in task independent TANDEM feature extraction

Using RASTA in task independent TANDEM feature extraction R E S E A R C H R E P O R T I D I A P Using RASTA in task independent TANDEM feature extraction Guillermo Aradilla a John Dines a Sunil Sivadas a b IDIAP RR 04-22 April 2004 D a l l e M o l l e I n s t

More information

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING. Department of Signal Theory and Communications. c/ Gran Capitán s/n, Campus Nord, Edificio D5

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING. Department of Signal Theory and Communications. c/ Gran Capitán s/n, Campus Nord, Edificio D5 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING Javier Hernando Department of Signal Theory and Communications Polytechnical University of Catalonia c/ Gran Capitán s/n, Campus Nord, Edificio D5 08034

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

Voice based Control Signal Generation for Intelligent Patient Vehicle

Voice based Control Signal Generation for Intelligent Patient Vehicle International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 12 (2014), pp. 1229-1235 International Research Publications House http://www. irphouse.com Voice based Control

More information

IDENTIFICATION OF SIGNATURES TRANSMITTED OVER RAYLEIGH FADING CHANNEL BY USING HMM AND RLE

IDENTIFICATION OF SIGNATURES TRANSMITTED OVER RAYLEIGH FADING CHANNEL BY USING HMM AND RLE International Journal of Technology (2011) 1: 56 64 ISSN 2086 9614 IJTech 2011 IDENTIFICATION OF SIGNATURES TRANSMITTED OVER RAYLEIGH FADING CHANNEL BY USING HMM AND RLE Djamhari Sirat 1, Arman D. Diponegoro

More information

LPSO-WNN DENOISING ALGORITHM FOR SPEECH RECOGNITION IN HIGH BACKGROUND NOISE

LPSO-WNN DENOISING ALGORITHM FOR SPEECH RECOGNITION IN HIGH BACKGROUND NOISE LPSO-WNN DENOISING ALGORITHM FOR SPEECH RECOGNITION IN HIGH BACKGROUND NOISE LONGFU ZHOU 1,2, YONGHE HU 1,2,3, SHIYI XIAHOU 3, WEI ZHANG 3, CHAOQUN ZHANG 2 ZHENG LI 2, DAPENG HAO 2 1,The Department of

More information

Speech Enhancement Based On Noise Reduction

Speech Enhancement Based On Noise Reduction Speech Enhancement Based On Noise Reduction Kundan Kumar Singh Electrical Engineering Department University Of Rochester ksingh11@z.rochester.edu ABSTRACT This paper addresses the problem of signal distortion

More information

SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS

SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS AKSHAY CHANDRASHEKARAN ANOOP RAMAKRISHNA akshayc@cmu.edu anoopr@andrew.cmu.edu ABHISHEK JAIN GE YANG ajain2@andrew.cmu.edu younger@cmu.edu NIDHI KOHLI R

More information

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue - 8 August, 2014 Page No. 7727-7732 Performance Analysis of MFCC and LPCC Techniques in Automatic

More information

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2 Signal Processing for Speech Applications - Part 2-1 Signal Processing For Speech Applications - Part 2 May 14, 2013 Signal Processing for Speech Applications - Part 2-2 References Huang et al., Chapter

More information

VOICE CONTROLLED ROBOT FOR SURVEILLANCE AND GAS LEAKAGE DETECTION

VOICE CONTROLLED ROBOT FOR SURVEILLANCE AND GAS LEAKAGE DETECTION VOICE CONTROLLED ROBOT FOR SURVEILLANCE AND GAS LEAKAGE DETECTION Mallikarjuna Gowda.C.P 1, Raju Hajare 2, Akhil Kumar 3,Manasa.R.E 4, Ramyashree.R 5, SmithaPatil 6 1,2 Associate professor, Department

More information

Discriminative Training for Automatic Speech Recognition

Discriminative Training for Automatic Speech Recognition Discriminative Training for Automatic Speech Recognition 22 nd April 2013 Advanced Signal Processing Seminar Article Heigold, G.; Ney, H.; Schluter, R.; Wiesler, S. Signal Processing Magazine, IEEE, vol.29,

More information

Team Description Paper: HuroEvolution Humanoid Robot for Robocup 2010 Humanoid League

Team Description Paper: HuroEvolution Humanoid Robot for Robocup 2010 Humanoid League Team Description Paper: HuroEvolution Humanoid Robot for Robocup 2010 Humanoid League Chung-Hsien Kuo 1, Hung-Chyun Chou 1, Jui-Chou Chung 1, Po-Chung Chia 2, Shou-Wei Chi 1, Yu-De Lien 1 1 Department

More information

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA ECE-492/3 Senior Design Project Spring 2015 Electrical and Computer Engineering Department Volgenau

More information

Mikko Myllymäki and Tuomas Virtanen

Mikko Myllymäki and Tuomas Virtanen NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,

More information

EC 2301 Digital communication Question bank

EC 2301 Digital communication Question bank EC 2301 Digital communication Question bank UNIT I Digital communication system 2 marks 1.Draw block diagram of digital communication system. Information source and input transducer formatter Source encoder

More information

Voice Excited Lpc for Speech Compression by V/Uv Classification

Voice Excited Lpc for Speech Compression by V/Uv Classification IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 6, Issue 3, Ver. II (May. -Jun. 2016), PP 65-69 e-issn: 2319 4200, p-issn No. : 2319 4197 www.iosrjournals.org Voice Excited Lpc for Speech

More information

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A EC 6501 DIGITAL COMMUNICATION 1.What is the need of prediction filtering? UNIT - II PART A [N/D-16] Prediction filtering is used mostly in audio signal processing and speech processing for representing

More information

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

(i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods Tools and Applications Chapter Intended Learning Outcomes: (i) Understanding the basic concepts of signal modeling, correlation, maximum likelihood estimation, least squares and iterative numerical methods

More information

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects NCCT Promise for the Best Projects IEEE PROJECTS in various Domains Latest Projects, 2009-2010 ADVANCED ROBOTICS SOLUTIONS EMBEDDED SYSTEM PROJECTS Microcontrollers VLSI DSP Matlab Robotics ADVANCED ROBOTICS

More information

CS 188: Artificial Intelligence Spring Speech in an Hour

CS 188: Artificial Intelligence Spring Speech in an Hour CS 188: Artificial Intelligence Spring 2006 Lecture 19: Speech Recognition 3/23/2006 Dan Klein UC Berkeley Many slides from Dan Jurafsky Speech in an Hour Speech input is an acoustic wave form s p ee ch

More information

International Journal of Engineering and Techniques - Volume 1 Issue 6, Nov Dec 2015

International Journal of Engineering and Techniques - Volume 1 Issue 6, Nov Dec 2015 RESEARCH ARTICLE OPEN ACCESS A Comparative Study on Feature Extraction Technique for Isolated Word Speech Recognition Easwari.N 1, Ponmuthuramalingam.P 2 1,2 (PG & Research Department of Computer Science,

More information

Applications of Music Processing

Applications of Music Processing Lecture Music Processing Applications of Music Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Singing Voice Detection Important pre-requisite

More information

Research Article Compact Dual-Band Dipole Antenna with Asymmetric Arms for WLAN Applications

Research Article Compact Dual-Band Dipole Antenna with Asymmetric Arms for WLAN Applications Antennas and Propagation, Article ID 19579, pages http://dx.doi.org/1.1155/21/19579 Research Article Compact Dual-Band Dipole Antenna with Asymmetric Arms for WLAN Applications Chung-Hsiu Chiu, 1 Chun-Cheng

More information

A Novel Speech Controller for Radio Amateurs with a Vision Impairment

A Novel Speech Controller for Radio Amateurs with a Vision Impairment IEEE TRANSACTIONS ON REHABILITATION ENGINEERING, VOL. 8, NO. 1, MARCH 2000 89 A Novel Speech Controller for Radio Amateurs with a Vision Impairment Chih-Lung Lin, Bo-Ren Bai, Li-Chun Du, Cheng-Tao Hu,

More information

An Improved Voice Activity Detection Based on Deep Belief Networks

An Improved Voice Activity Detection Based on Deep Belief Networks e-issn 2455 1392 Volume 2 Issue 4, April 2016 pp. 676-683 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com An Improved Voice Activity Detection Based on Deep Belief Networks Shabeeba T. K.

More information

SOUND SOURCE RECOGNITION AND MODELING

SOUND SOURCE RECOGNITION AND MODELING SOUND SOURCE RECOGNITION AND MODELING CASA seminar, summer 2000 Antti Eronen antti.eronen@tut.fi Contents: Basics of human sound source recognition Timbre Voice recognition Recognition of environmental

More information

Research Article A Miniaturized Meandered Dipole UHF RFID Tag Antenna for Flexible Application

Research Article A Miniaturized Meandered Dipole UHF RFID Tag Antenna for Flexible Application Antennas and Propagation Volume 216, Article ID 2951659, 7 pages http://dx.doi.org/1.1155/216/2951659 Research Article A Miniaturized Meandered Dipole UHF RFID Tag Antenna for Flexible Application Xiuwei

More information

Automatic Morse Code Recognition Under Low SNR

Automatic Morse Code Recognition Under Low SNR 2nd International Conference on Mechanical, Electronic, Control and Automation Engineering (MECAE 2018) Automatic Morse Code Recognition Under Low SNR Xianyu Wanga, Qi Zhaob, Cheng Mac, * and Jianping

More information

Enhanced voice recognition to reduce fraudulence in ATM machine

Enhanced voice recognition to reduce fraudulence in ATM machine Enhanced voice recognition to reduce fraudulence in ATM machine 1 Hridya Venugopal, Hema.U, Kalaiselvi.S, Mahalakshmi.M Department of Information Technology Alpha college of Engineering Email:hridya.nbr@gmail.com,hemau5490@gmail.com,kalaika3@gmail.com,

More information

Auditory Based Feature Vectors for Speech Recognition Systems

Auditory Based Feature Vectors for Speech Recognition Systems Auditory Based Feature Vectors for Speech Recognition Systems Dr. Waleed H. Abdulla Electrical & Computer Engineering Department The University of Auckland, New Zealand [w.abdulla@auckland.ac.nz] 1 Outlines

More information

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals ISCA Journal of Engineering Sciences ISCA J. Engineering Sci. Vocoder (LPC) Analysis by Variation of Input Parameters and Signals Abstract Gupta Rajani, Mehta Alok K. and Tiwari Vebhav Truba College of

More information

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition Author Shannon, Ben, Paliwal, Kuldip Published 25 Conference Title The 8th International Symposium

More information

Wheeled Mobile Robot Kuzma I

Wheeled Mobile Robot Kuzma I Contemporary Engineering Sciences, Vol. 7, 2014, no. 18, 895-899 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ces.2014.47102 Wheeled Mobile Robot Kuzma I Andrey Sheka 1, 2 1) Department of Intelligent

More information

Design and Implementation of an Audio Classification System Based on SVM

Design and Implementation of an Audio Classification System Based on SVM Available online at www.sciencedirect.com Procedia ngineering 15 (011) 4031 4035 Advanced in Control ngineering and Information Science Design and Implementation of an Audio Classification System Based

More information

Electric Guitar Pickups Recognition

Electric Guitar Pickups Recognition Electric Guitar Pickups Recognition Warren Jonhow Lee warrenjo@stanford.edu Yi-Chun Chen yichunc@stanford.edu Abstract Electric guitar pickups convert vibration of strings to eletric signals and thus direcly

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

Open Access Partial Discharge Fault Decision and Location of 24kV Composite Porcelain Insulator based on Power Spectrum Density Algorithm

Open Access Partial Discharge Fault Decision and Location of 24kV Composite Porcelain Insulator based on Power Spectrum Density Algorithm Send Orders for Reprints to reprints@benthamscience.ae 342 The Open Electrical & Electronic Engineering Journal, 15, 9, 342-346 Open Access Partial Discharge Fault Decision and Location of 24kV Composite

More information

Implementation of Text to Speech Conversion

Implementation of Text to Speech Conversion Implementation of Text to Speech Conversion Chaw Su Thu Thu 1, Theingi Zin 2 1 Department of Electronic Engineering, Mandalay Technological University, Mandalay 2 Department of Electronic Engineering,

More information

University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005

University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 University of Washington Department of Electrical Engineering Computer Speech Processing EE516 Winter 2005 Lecture 5 Slides Jan 26 th, 2005 Outline of Today s Lecture Announcements Filter-bank analysis

More information

Speech Recognition using FIR Wiener Filter

Speech Recognition using FIR Wiener Filter Speech Recognition using FIR Wiener Filter Deepak 1, Vikas Mittal 2 1 Department of Electronics & Communication Engineering, Maharishi Markandeshwar University, Mullana (Ambala), INDIA 2 Department of

More information

A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification

A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification Wei Chu and Abeer Alwan Speech Processing and Auditory Perception Laboratory Department

More information

ADAPTIVE ACTIVE NOISE CONTROL SYSTEM FOR SECONDARY PATH FLUCTUATION PROBLEM

ADAPTIVE ACTIVE NOISE CONTROL SYSTEM FOR SECONDARY PATH FLUCTUATION PROBLEM International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 1(B), January 2012 pp. 967 976 ADAPTIVE ACTIVE NOISE CONTROL SYSTEM FOR

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP LIU Ying 1,HAN Yan-bin 2 and ZHANG Yu-lin 3 1 School of Information Science and Engineering, University of Jinan, Jinan 250022, PR China

More information

Research Article Research of Smart Car s Speed Control Based on the Internal Model Control

Research Article Research of Smart Car s Speed Control Based on the Internal Model Control Abstract and Applied Analysis, Article ID 274293, 5 pages http://dx.doi.org/.55/24/274293 Research Article Research of Smart Car s Speed Control Based on the Internal Model Control Han Yu, Hamid Reza Karimi,

More information

Advanced Signal Processing and Digital Noise Reduction

Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Advanced Signal Processing and Digital Noise Reduction Saeed V. Vaseghi Queen's University of Belfast UK ~ W I lilteubner L E Y A Partnership between

More information

Report 3. Kalman or Wiener Filters

Report 3. Kalman or Wiener Filters 1 Embedded Systems WS 2014/15 Report 3: Kalman or Wiener Filters Stefan Feilmeier Facultatea de Inginerie Hermann Oberth Master-Program Embedded Systems Advanced Digital Signal Processing Methods Winter

More information

Isolated Digit Recognition Using MFCC AND DTW

Isolated Digit Recognition Using MFCC AND DTW MarutiLimkar a, RamaRao b & VidyaSagvekar c a Terna collegeof Engineering, Department of Electronics Engineering, Mumbai University, India b Vidyalankar Institute of Technology, Department ofelectronics

More information

Cepstrum alanysis of speech signals

Cepstrum alanysis of speech signals Cepstrum alanysis of speech signals ELEC-E5520 Speech and language processing methods Spring 2016 Mikko Kurimo 1 /48 Contents Literature and other material Idea and history of cepstrum Cepstrum and LP

More information

T Automatic Speech Recognition: From Theory to Practice

T Automatic Speech Recognition: From Theory to Practice Automatic Speech Recognition: From Theory to Practice http://www.cis.hut.fi/opinnot// September 27, 2004 Prof. Bryan Pellom Department of Computer Science Center for Spoken Language Research University

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

Adaptive Filters Application of Linear Prediction

Adaptive Filters Application of Linear Prediction Adaptive Filters Application of Linear Prediction Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Technology Digital Signal Processing

More information

Voice Activity Detection

Voice Activity Detection Voice Activity Detection Speech Processing Tom Bäckström Aalto University October 2015 Introduction Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class

More information

Research Article Multiband Planar Monopole Antenna for LTE MIMO Systems

Research Article Multiband Planar Monopole Antenna for LTE MIMO Systems Antennas and Propagation Volume 1, Article ID 8975, 6 pages doi:1.1155/1/8975 Research Article Multiband Planar Monopole Antenna for LTE MIMO Systems Yuan Yao, Xing Wang, and Junsheng Yu School of Electronic

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 12 Speech Signal Processing 14/03/25 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Speech Signal Analysis

Speech Signal Analysis Speech Signal Analysis Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 2&3 14,18 January 216 ASR Lectures 2&3 Speech Signal Analysis 1 Overview Speech Signal Analysis for

More information

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged ADVANCED ROBOTICS SOLUTIONS * Intelli Mobile Robot for Multi Specialty Operations * Advanced Robotic Pick and Place Arm and Hand System * Automatic Color Sensing Robot using PC * AI Based Image Capturing

More information

Quasi-Orthogonal Space-Time Block Coding Using Polynomial Phase Modulation

Quasi-Orthogonal Space-Time Block Coding Using Polynomial Phase Modulation Florida International University FIU Digital Commons Electrical and Computer Engineering Faculty Publications College of Engineering and Computing 4-28-2011 Quasi-Orthogonal Space-Time Block Coding Using

More information

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array 2012 2nd International Conference on Computer Design and Engineering (ICCDE 2012) IPCSIT vol. 49 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V49.14 Simultaneous Recognition of Speech

More information

Implementation of a Hybrid Voice Control System for a Colony of Robots

Implementation of a Hybrid Voice Control System for a Colony of Robots The International Arab Journal of Information Technology, Vol. 6, No. 1, January 2009 67 Implementation of a Hybrid Voice Control System for a Colony of Robots Mohamed Fezari and Mounir Bousbia-Salah Department

More information

Voice Recognition Technology Using Neural Networks

Voice Recognition Technology Using Neural Networks Journal of New Technology and Materials JNTM Vol. 05, N 01 (2015)27-31 OEB Univ. Publish. Co. Voice Recognition Technology Using Neural Networks Abdelouahab Zaatri 1, Norelhouda Azzizi 2 and Fouad Lazhar

More information

Voice Recognition Based Automation System for Medical Applications and For Physically Challenged Patients

Voice Recognition Based Automation System for Medical Applications and For Physically Challenged Patients Voice Recognition Based Automation System for Medical Applications and For Physically Challenged Patients Sanu Kumar Das 1, Vitthal Rathod 2, Akhilesh Yadav.B 3 1Sanu Kumar Das, Dept. Of Electronics &

More information

MATLAB based Voice Controlled Wheelchair using Back-Propagation Neural Network Anjaneyulu.D 1 Ajanta Reddy.B 2 Prasanth Varma.D 3

MATLAB based Voice Controlled Wheelchair using Back-Propagation Neural Network Anjaneyulu.D 1 Ajanta Reddy.B 2 Prasanth Varma.D 3 IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 05, 2015 ISSN (online): 2321-0613 MATLAB based Voice Controlled Wheelchair using Back-Propagation Neural Network Anjaneyulu.D

More information

Power Normalized Cepstral Coefficient for Speaker Diarization and Acoustic Echo Cancellation

Power Normalized Cepstral Coefficient for Speaker Diarization and Acoustic Echo Cancellation Power Normalized Cepstral Coefficient for Speaker Diarization and Acoustic Echo Cancellation Sherbin Kanattil Kassim P.G Scholar, Department of ECE, Engineering College, Edathala, Ernakulam, India sherbin_kassim@yahoo.co.in

More information

SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE

SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE Paper ID: AM-01 SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE Md. Rokunuzzaman* 1, Lutfun Nahar Nipa 1, Tamanna Tasnim Moon 1, Shafiul Alam 1 1 Department of Mechanical Engineering, Rajshahi University

More information

SELF-BALANCING MOBILE ROBOT TILTER

SELF-BALANCING MOBILE ROBOT TILTER Tomislav Tomašić Andrea Demetlika Prof. dr. sc. Mladen Crneković ISSN xxx-xxxx SELF-BALANCING MOBILE ROBOT TILTER Summary UDC 007.52, 62-523.8 In this project a remote controlled self-balancing mobile

More information

Research Article Harmonic-Rejection Compact Bandpass Filter Using Defected Ground Structure for GPS Application

Research Article Harmonic-Rejection Compact Bandpass Filter Using Defected Ground Structure for GPS Application Active and Passive Electronic Components, Article ID 436964, 4 pages http://dx.doi.org/10.1155/2014/436964 Research Article Harmonic-Rejection Compact Bandpass Filter Using Defected Ground Structure for

More information

Research Article n-digit Benford Converges to Benford

Research Article n-digit Benford Converges to Benford International Mathematics and Mathematical Sciences Volume 2015, Article ID 123816, 4 pages http://dx.doi.org/10.1155/2015/123816 Research Article n-digit Benford Converges to Benford Azar Khosravani and

More information

Dimension Reduction of the Modulation Spectrogram for Speaker Verification

Dimension Reduction of the Modulation Spectrogram for Speaker Verification Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland Kong Aik Lee and

More information

Open Access Improved Frame Error Concealment Algorithm Based on Transform- Domain Mobile Audio Codec

Open Access Improved Frame Error Concealment Algorithm Based on Transform- Domain Mobile Audio Codec Send Orders for Reprints to reprints@benthamscience.ae The Open Electrical & Electronic Engineering Journal, 2014, 8, 527-535 527 Open Access Improved Frame Error Concealment Algorithm Based on Transform-

More information

WIRELESS VOICE CONTROLLED ROBOTICS ARM

WIRELESS VOICE CONTROLLED ROBOTICS ARM WIRELESS VOICE CONTROLLED ROBOTICS ARM 1 R.ASWINBALAJI, 2 A.ARUNRAJA 1 BE ECE,SRI RAMAKRISHNA ENGINEERING COLLEGE,COIMBATORE,INDIA 2 ME EST,SRI RAMAKRISHNA ENGINEERING COLLEGE,COIMBATORE,INDIA aswinbalaji94@gmail.com

More information

Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise

Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise Noha KORANY 1 Alexandria University, Egypt ABSTRACT The paper applies spectral analysis to

More information

Research Article CPW-Fed Wideband Circular Polarized Antenna for UHF RFID Applications

Research Article CPW-Fed Wideband Circular Polarized Antenna for UHF RFID Applications Hindawi International Antennas and Propagation Volume 217, Article ID 3987263, 7 pages https://doi.org/1.1155/217/3987263 Research Article CPW-Fed Wideband Circular Polarized Antenna for UHF RFID Applications

More information

Flexible RFID Location System Based on Artificial Neural Networks for Medical Care Facilities

Flexible RFID Location System Based on Artificial Neural Networks for Medical Care Facilities Flexible RFID Location System Based on Artificial Neural Networks for Medical Care Facilities Hao-Ju Wu, Yi-Hsin Chang, Min-Shiang Hwang, Iuon-Chang Lin g9729007@mail.nchu.edu.tw, mika830@gmail.com, mshwang@nchu.edu.tw,

More information

A Robust Acoustic Echo Canceller for Noisy Environment 1

A Robust Acoustic Echo Canceller for Noisy Environment 1 A Robust Acoustic Echo Canceller for Noisy Environment 1 Shenghao Qin, Sha Meng, and Jia Liu Department of Electronic Engineering, Tsinghua University, Beijing 184 {qinsh99, mengs4}@mails.tsinghua.edu.cn,

More information

Research Article Knight s Tours on Rectangular Chessboards Using External Squares

Research Article Knight s Tours on Rectangular Chessboards Using External Squares Discrete Mathematics, Article ID 210892, 9 pages http://dx.doi.org/10.1155/2014/210892 Research Article Knight s Tours on Rectangular Chessboards Using External Squares Grady Bullington, 1 Linda Eroh,

More information

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches Performance study of Text-independent Speaker identification system using & I for Telephone and Microphone Speeches Ruchi Chaudhary, National Technical Research Organization Abstract: A state-of-the-art

More information

Research Article Compressed Wideband Spectrum Sensing Based on Discrete Cosine Transform

Research Article Compressed Wideband Spectrum Sensing Based on Discrete Cosine Transform e Scientific World Journal, Article ID 464895, 5 pages http://dx.doi.org/1.1155/214/464895 Research Article Compressed Wideband Spectrum Sensing Based on Discrete Cosine Transform Yulin Wang and Gengxin

More information

KONKANI SPEECH RECOGNITION USING HILBERT-HUANG TRANSFORM

KONKANI SPEECH RECOGNITION USING HILBERT-HUANG TRANSFORM KONKANI SPEECH RECOGNITION USING HILBERT-HUANG TRANSFORM Shruthi S Prabhu 1, Nayana C G 2, Ashwini B N 3, Dr. Parameshachari B D 4 Assistant Professor, Department of Telecommunication Engineering, GSSSIETW,

More information

Announcements. Today. Speech and Language. State Path Trellis. HMMs: MLE Queries. Introduction to Artificial Intelligence. V22.

Announcements. Today. Speech and Language. State Path Trellis. HMMs: MLE Queries. Introduction to Artificial Intelligence. V22. Introduction to Artificial Intelligence Announcements V22.0472-001 Fall 2009 Lecture 19: Speech Recognition & Viterbi Decoding Rob Fergus Dept of Computer Science, Courant Institute, NYU Slides from John

More information

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals 16 3. SPEECH ANALYSIS 3.1 INTRODUCTION TO SPEECH ANALYSIS Many speech processing [22] applications exploits speech production and perception to accomplish speech analysis. By speech analysis we extract

More information

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists 3,800 6,000 0M Open access books available International authors and editors Downloads Our authors

More information

Real time noise-speech discrimination in time domain for speech recognition application

Real time noise-speech discrimination in time domain for speech recognition application University of Malaya From the SelectedWorks of Mokhtar Norrima January 4, 2011 Real time noise-speech discrimination in time domain for speech recognition application Norrima Mokhtar, University of Malaya

More information

Audio processing methods on marine mammal vocalizations

Audio processing methods on marine mammal vocalizations Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio http://labrosa.ee.columbia.edu Sound to Signal sound is pressure

More information

Audio Restoration Based on DSP Tools

Audio Restoration Based on DSP Tools Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Research Article Quadrature Oscillators Using Operational Amplifiers

Research Article Quadrature Oscillators Using Operational Amplifiers Active and Passive Electronic Components Volume 20, Article ID 320367, 4 pages doi:0.55/20/320367 Research Article Quadrature Oscillators Using Operational Amplifiers Jiun-Wei Horng Department of Electronic,

More information

Separating Voiced Segments from Music File using MFCC, ZCR and GMM

Separating Voiced Segments from Music File using MFCC, ZCR and GMM Separating Voiced Segments from Music File using MFCC, ZCR and GMM Mr. Prashant P. Zirmite 1, Mr. Mahesh K. Patil 2, Mr. Santosh P. Salgar 3,Mr. Veeresh M. Metigoudar 4 1,2,3,4Assistant Professor, Dept.

More information

Query by Singing and Humming

Query by Singing and Humming Abstract Query by Singing and Humming CHIAO-WEI LIN Music retrieval techniques have been developed in recent years since signals have been digitalized. Typically we search a song by its name or the singer

More information

Chapter 4 SPEECH ENHANCEMENT

Chapter 4 SPEECH ENHANCEMENT 44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or

More information