Autonomous Vehicle Speaker Verification System

Similar documents
SIMULATION VOICE RECOGNITION SYSTEM FOR CONTROLING ROBOTIC APPLICATIONS

Sound Recognition. ~ CSE 352 Team 3 ~ Jason Park Evan Glover. Kevin Lui Aman Rawat. Prof. Anita Wasilewska

Mel Spectrum Analysis of Speech Recognition using Single Microphone

SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Service Robots in an Intelligent House

Implementing Speaker Recognition

Dimension Reduction of the Modulation Spectrogram for Speaker Verification

Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise

VOICE CONTROLLED ROBOT FOR SURVEILLANCE AND GAS LEAKAGE DETECTION

Voice Recognition Technology Using Neural Networks

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA

SonicNet Tones t0 t1 t2 t3 t4 ~7600 Hz ~7800 Hz ~8000 Hz ~8200 Hz ~8400 Hz

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

An Overview of Biometrics. Dr. Charles C. Tappert Seidenberg School of CSIS, Pace University

Simultaneous Recognition of Speech Commands by a Robot using a Small Microphone Array

Active Noise Cancellation System Using DSP Prosessor

VECTOR QUANTIZATION-BASED SPEECH RECOGNITION SYSTEM FOR HOME APPLIANCES

SPEECH ENHANCEMENT USING PITCH DETECTION APPROACH FOR NOISY ENVIRONMENT

Introducing COVAREP: A collaborative voice analysis repository for speech technologies

Visvesvaraya Technological University, Belagavi

Speech Recognition using FIR Wiener Filter

Automatic Text-Independent. Speaker. Recognition Approaches Using Binaural Inputs

Application Areas of AI Artificial intelligence is divided into different branches which are mentioned below:

ENGR 1 Presentation. Thomas Matthews

Preliminary. Wake on Sound Piezoelectric MEMS Microphone Evaluation Module

Gammatone Cepstral Coefficient for Speaker Identification

DSP VLSI Design. DSP Systems. Byungin Moon. Yonsei University

Low Power Microphone Acquisition and Processing for Always-on Applications Based on Microcontrollers

Relative phase information for detecting human speech and spoofed speech

Performance Analysis of MFCC and LPCC Techniques in Automatic Speech Recognition

Speech Enhancement Based On Noise Reduction

Lab 4: Static & Switched Audio Equalizer

Speech Synthesis using Mel-Cepstral Coefficient Feature

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES

Voice Recognition Based Automation System for Medical Applications and For Physically Challenged Patients

VQ Source Models: Perceptual & Phase Issues

Auto Harmonizer. EEL 4924 Electrical Engineering Design (Senior Design) Final Design Report 26 April 2012

ZLS38500 Firmware for Handsfree Car Kits

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016

ARTIFICIAL INTELLIGENCE - ROBOTICS

JOURNAL OF OBJECT TECHNOLOGY

Callouts 1. INPUT 2. THRU 3. GROUND LIFT 4. BALANCED OUTPUT

Automated Portable Cradle System with Infant Crying Sound Detector

The University of Wisconsin-Platteville

Using the VM1010 Wake-on-Sound Microphone and ZeroPower Listening TM Technology

Electronics Design Laboratory Lecture #11. ECEN 2270 Electronics Design Laboratory

FAULT DETECTION AND DIAGNOSIS OF HIGH SPEED SWITCHING DEVICES IN POWER INVERTER

Microcomputer Systems 1. Introduction to DSP S

Mikko Myllymäki and Tuomas Virtanen

Auto Harmonizer. EEL 4924 Electrical Engineering Design (Senior Design) Preliminary Design Report 2 February 2012

SL Loudspeaker 52 A Conference Loudspeaker

IC-F7000. Advanced selective call and ALE make HF communication easier than ever!

High-speed Noise Cancellation with Microphone Array

Chapter 1: Introduction to audio signal processing

Overview of Signal Processing

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects

Artificial Neural Networks approach to the voltage sag classification

PM1060 / 1120 INSTRUCTION MANUAL. Thank you for choosing another quality product from Amperes Electronics.

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Speech Recognition on Robot Controller

CHAPTER-5 DESIGN OF DIRECT TORQUE CONTROLLED INDUCTION MOTOR DRIVE

SUBELEMENT T4. Amateur radio practices and station set up. 2 Exam Questions - 2 Groups

MINE 432 Industrial Automation and Robotics

User s Manual Listen Microphones

Speech/Music Change Point Detection using Sonogram and AANN

Real-Time Testing Made Easy with Simulink Real-Time

Fundamentals of Digital Audio *

AUTOMATIC MIXER AX-1OOOA

Sensor system of a small biped entertainment robot

AI Application Processing Requirements

Traffic Sign Recognition Senior Project Final Report

Observer-based Engine Cooling Control System (OBCOOL) Functional Description & System Block Diagram. Students: Andrew Fouts & Kurtis Liggett

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

Speech Enhancement using Wiener filtering

WIRELESS VOICE CONTROLLED ROBOTICS ARM

Technical Application Note #4

Design of Multi Lingual, Voice Signal Frequency Based Robotic Hand Control System

VM1010. Low-Noise Bottom Port Piezoelectric MEMS Microphone Data Sheet Vesper Technologies Inc. With Wake on Sound Feature

Speech Intelligibility Enhancement using Microphone Array via Intra-Vehicular Beamforming

Real-time Real-life Oriented DSP Lab Modules

Hearing Aid Redesign: Test Plans ELECTRICAL TESTING

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS

Overview of Digital Signal Processing

Electric Guitar Pickups Recognition

Callouts Front Rear 1. INPUT Jacks 2. THRU Jacks dB / -30dB Pad switch 4. STEREO/MONO Switch 5. GROUND LIFT Switches 6. BALANCED OUTPUT Jacks

Artificial Neural Network based Mobile Robot Navigation

Audio in ecall and Cluster. Clancy Soehren MSA Applications FAE Summit 2016

Understanding the Mechanism of Sonzai-Kan

WIND SPEED ESTIMATION AND WIND-INDUCED NOISE REDUCTION USING A 2-CHANNEL SMALL MICROPHONE ARRAY

Separation and Recognition of multiple sound source using Pulsed Neuron Model

CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM

NAVIGATION SECURITY MODULE WITH REAL-TIME VOICE COMMAND RECOGNITION SYSTEM

Microphone Array project in MSR: approach and results

PM7400MKII. Instruction manual

XAP GWARE 119 M A T R I X. Acoustic Echo Canceller

Roofing Filters, Transmitted BW and Receiver Performance

Enabling New Speech Driven Services for Mobile Devices: An overview of the ETSI standards activities for Distributed Speech Recognition Front-ends

Transcription:

Autonomous Vehicle Speaker Verification System Functional Requirements List and Performance Specifications Aaron Pfalzgraf Christopher Sullivan Project Advisor: Dr. Jose Sanchez 4 November 2013

AVSVS 2 Introduction: This project involves developing an intelligent control system for an autonomous vehicle. The autonomous vehicle is to be controlled by voice commands from the operator, and the system must recognize the difference between an authorized and unauthorized user. A speaker verification system will be used to accomplish this task. Goals: Develop a system that accepts commands only from a specific list of users Integrate this system into a speech recognition-based vehicle control system Control the vehicle using existing systems from previous senior project o Use existing hardware controlled through I2C System Block Diagram: Figure 1 shows the overall system block diagram. The operator speaks a command into the microphone. The data from the microphone is passed into the digital signal processing system (DSP). Pre-processing is performed to remove noise; the filtered signal is passed into the feature extraction block to generate a series of feature vectors that describe the signal. These feature vectors are passed into a neural network where a comparison is made between the current audio sample and the known model of the authorized operator. The known model of the operator is determined through a set of training data that is passed into the neural network. The weights of the network are updated using this training data through the backpropagation algorithm. If the current sample is similar enough (within a to be determined threshold), then the system will accept the command from the operator and transmit the corresponding motor control signals to the autonomous vehicle control system. The control system either starts or stops the robot's motor depending on the command given by the user. Figure 2 details the software functionality of the speaker verification system implemented on the DSP.

AVSVS 3 Fig. 1, Hardware Connections Hardware Requirements: Microphone Fig. 2, High level software block diagram Omni-directional pick-up pattern for successful voice control regardless of the operator's position in the room Signal output via standard 1/8 in. cable, may be accomplished with an adaptor Capable of 16 khz sampling frequency (digital output on microphone) Nearly flat 0dB frequency response on the range 100 Hz to 8 khz Passive dynamic microphone, cannot require +48V phantom power (required for active circuitry in some microphones)

AVSVS 4 Digital Signal Processor The DSP used shall be the TI5505 ezdsp Motor Control System The existing autonomous vehicle and MCU from last year's autonomous vehicle senior project shall be used Interfacing shall be accomplished with I2C protocol Software Requirements: Basic control commands: STOP, GO, LEFT, RIGHT More may be added, such as autonomous routines (e.g. figure 8) Speech signal divided into 20 ms - 40 ms frames via Hamming windowing with 33% to 50% overlap System shall function properly in a mildly noisy environment Maximum operator-to-vehicle distance for proper functionality shall be greater than 10 feet (minimum) Operator rejection error shall be minimized to under 1% for safety reasons Imposter acceptance error is desired to be under 2% but may be modified to accomplish the desired operator rejection error percentage DSP will be programmed in the C language using the Code Composer Studio integrated design environment Authorized operator speech models may be generated in MATLAB and hardcoded in C depending on their complexity (online vs. offline training) Artificial neural network shall be used to compute the similarity between the current speaker and the authorized operator speech model Network shall be trained using the back-propagation algorithm to update the weights of the system Feature extraction shall be carried out by Mel cepstral coefficients 10 or more coefficients per frame of speech Speaker verification shall be accomplished without delay time perceptible to the operator (t 100ms)

AVSVS 5 References: [1] J. P. Cambell Jr., Speaker Recognition: A Tutorial, NSA, Ft. Mead, MD, Sep. 1997. [2] F. K. Soong et al., A Vector Quantization Approach to Speaker Recognition, AT&T, Murray Hill, NJ, 1985. [3] T. Kinnunen et al., Comparison of Clustering Algorithms in Speaker Identification, Univ. of Joensuu, Joensuu, Finland.