Interfacing with the Machine

Size: px
Start display at page:

Download "Interfacing with the Machine"

Transcription

1 Interfacing with the Machine Jay Desloge SENS Corporation Sumit Basu Microsoft Research

2 They (We) Are Better Than We Think! Machine source separation, localization, and recognition are not as distant as they may seem. There are, in fact, already systems that achieve limited success in these areas. These machines provide many opportunities to investigate the interaction of machines with the human operator.

3 Consider: Hearing Aids Directional microphones can yield target-location (in front of wearer) intelligibility-weighted SNR improvements of up to 5-6 db. Adaptive directional capability can yield higher SNR improvements (on the order of 8-12 db). FM capability allows aid to receive signals from remote sources (TVs, remote microphones). (Phonak Persio)

4 Consider: Tele/Video Conferencing Directional microphones used to identify and extract the sources from the environment. IW SNR improvements 5-6 db on average. Active speaker is determined by microphone input. Voice-tracking capability can focus video camera on an active source within the environment. RMS loc. error < 10 deg. (Polycom Soundpoint)

5 Consider: ASR State of the Art Type Characteristics WER Meeting Room (16kHz) Business Spontaneous Task oriented, but includes true meetings collected in uncontrolled conditions Far-talking, but also have close-talking (head-mounted) for comparison 30% (headmounted) 50% (distant) Switchboard (Telephone) Polite Spontaneous Close-talking, relatively free of noise These are real people (with a slight bias toward females housewives and higher education), who don t know each other and have some conversation on some topic. Real data, but instrumented Conditions 15% Broadcast News Planned speech Found data (exists in nature, not artificially collected) Spoken by professional speakers; not read, but speakers know what they are going to say in advance, and possibly Practice 9% WSJ (Dictation) Read speech High-quality microphones, professional speakers, Wall Street Journal sentences (ie it s a rich, but restricted domain) 3-8% String of Digits Read speech Easy task; no noise, close-talk <0.5% From Patrick Nguyen (MSR)

6 Consider: Wireless Communication, GPS Wireless communication links can connect team members (e.g., military, firefighter, police) and can provide clean, separated signals for each source. GPS can provide accurate information about the location of each source. Efforts have already been made to present these sources to the team members in a logical manner (e.g., spatialized audio).

7 What We Will Talk About Given that these and other possibilities for humanmachine interaction already exist, it is important to study how the humans and machines can interact in a manner that achieves the best possible performance. We will discuss: Machine enhancement of human capabilities (H+) Human enhancement of machine performance (M+) Design factors in human-machine interfaces

8 Machines Enhancing Human Capabilities (H+) Despite their limitations, machines can outdo what we do Vs.

9 H+: Going Beyond the Human Scale Very large arrays: Localization for low-frequencies Localization for impulsive/wideband sounds Silverman, Patterson, and Flanagan, The Huge Microphone Array, IEEE Concurrency, October, Pregliasco and Martinez, Gunshot Localization through Recorded Sound, Journal of Forensic Science, 2002.

10 H+: Augmenting Ears The strength of numbers: As a localizer or recognizer, machines may be at about half human performance With 100 sensors => 50 humans worth!! But what good is a fractional human? State of the Art in General Sound Recognition Speech detection Everybody and their Uncle Joe, My Novel Method for Speech Detection, Everything else

11 H+: Multiplying Ears because there may be too many things to listen to

12 H+: Multiplying Ears too many sounds in too many places

13 H+: Distant Ears because we can t be everywhere at once

14 H+: Replacing Ears because we may have limited hearing capabilities

15 H+: Augmenting Ears because we re not always paying attention

16 H+: The Sixth (Seventh, etc.) Sense We can apply existing techniques to frequency ranges/senses we don t have Ultrasound Microwave

17 Humans Enhancing Machine Performance (M+) Despite impressive machine computational capability, there are still certain tasks that the human can do faster and more reliably. vs.

18 M+: What Do We Optimize? Finding the right objective function is hard SNR vs. intelligibility Listening comfort Particularly true if a human will be listening to the output Example: Hearing Aids (Phonak Persio)

19 M+: System focus Where are the sources? S 2 S 1 S 3

20 M+: Environmental Conditions The human is often better at scene analysis Can drive system to optimize for varying conditions Low Reverb? High Reverb? Few, localized sources? Many sources?

21 M+: Calibration Some systems (e.g., conventional array processing) require knowledge of physical arrangement of microphones. Portable/body-mounted systems in particular must be configured and calibrated for proper operation.

22 Discussion and Teaser: Designing the Interactive System Input from the user: How can we use direct manipulation and implicit manipulation to control the machine s abilities Output to the user How do we decide what information is relevant to the user and how much they can handle? How do we consolidate information into concise visuals/auralizations? How can we display multiple auditory/visual streams to the user?

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES N. Sunil 1, K. Sahithya Reddy 2, U.N.D.L.mounika 3 1 ECE, Gurunanak Institute of Technology, (India) 2 ECE,

More information

Microphone Array project in MSR: approach and results

Microphone Array project in MSR: approach and results Microphone Array project in MSR: approach and results Ivan Tashev Microsoft Research June 2004 Agenda Microphone Array project Beamformer design algorithm Implementation and hardware designs Demo Motivation

More information

Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system

Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system Takayuki Watanabe Yamaha Commercial Audio Systems, Inc.

More information

BackDoor: Sensing Out-of-band Sounds through Channel Nonlinearity

BackDoor: Sensing Out-of-band Sounds through Channel Nonlinearity BackDoor: Sensing Out-of-band Sounds through Channel Nonlinearity Nirupam Roy ECE-420 Guest Lecture - 30 th October 2017 University of Illinois at Urbana-Champaign Microphones are everywhere Microphones

More information

BackDoor: Making Microphones Hear Inaudible Sounds

BackDoor: Making Microphones Hear Inaudible Sounds BackDoor: Making Microphones Hear Inaudible Sounds Nirupam Roy Haitham Hassanieh Romit Roy Choudhury University of Illinois at Urbana-Champaign Microphones are everywhere Microphones are everywhere Microphones

More information

Context-sensitive speech recognition for human-robot interaction

Context-sensitive speech recognition for human-robot interaction Context-sensitive speech recognition for human-robot interaction Pierre Lison Cognitive Systems @ Language Technology Lab German Research Centre for Artificial Intelligence (DFKI GmbH) Saarbrücken, Germany.

More information

The psychoacoustics of reverberation

The psychoacoustics of reverberation The psychoacoustics of reverberation Steven van de Par Steven.van.de.Par@uni-oldenburg.de July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

More information

Calibration of Microphone Arrays for Improved Speech Recognition

Calibration of Microphone Arrays for Improved Speech Recognition MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Calibration of Microphone Arrays for Improved Speech Recognition Michael L. Seltzer, Bhiksha Raj TR-2001-43 December 2001 Abstract We present

More information

You know about adding up waves, e.g. from two loudspeakers. AUDL 4007 Auditory Perception. Week 2½. Mathematical prelude: Adding up levels

You know about adding up waves, e.g. from two loudspeakers. AUDL 4007 Auditory Perception. Week 2½. Mathematical prelude: Adding up levels AUDL 47 Auditory Perception You know about adding up waves, e.g. from two loudspeakers Week 2½ Mathematical prelude: Adding up levels 2 But how do you get the total rms from the rms values of two signals

More information

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research

Improving Meetings with Microphone Array Algorithms. Ivan Tashev Microsoft Research Improving Meetings with Microphone Array Algorithms Ivan Tashev Microsoft Research Why microphone arrays? They ensure better sound quality: less noises and reverberation Provide speaker position using

More information

Robust Low-Resource Sound Localization in Correlated Noise

Robust Low-Resource Sound Localization in Correlated Noise INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem

More information

AN547 - Why you need high performance, ultra-high SNR MEMS microphones

AN547 - Why you need high performance, ultra-high SNR MEMS microphones AN547 AN547 - Why you need high performance, ultra-high SNR MEMS Table of contents 1 Abstract................................................................................1 2 Signal to Noise Ratio (SNR)..............................................................2

More information

ReSound Micro and Multi Mic

ReSound Micro and Multi Mic Tip for use of FAQ: Click on questions to go to answer. Setup & Configuration How do I pair the hearing aids to the Micro and Multi Mic?... 3 How many hearing aids can the Micro/Multi Mic be paired with?...

More information

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat Audio Transmission Technology for Multi-point Mobile Voice Chat Voice Chat Multi-channel Coding Binaural Signal Processing Audio Transmission Technology for Multi-point Mobile Voice Chat We have developed

More information

Gesture Recognition with Real World Environment using Kinect: A Review

Gesture Recognition with Real World Environment using Kinect: A Review Gesture Recognition with Real World Environment using Kinect: A Review Prakash S. Sawai 1, Prof. V. K. Shandilya 2 P.G. Student, Department of Computer Science & Engineering, Sipna COET, Amravati, Maharashtra,

More information

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position Applying the Filtered Back-Projection Method to Extract Signal at Specific Position 1 Chia-Ming Chang and Chun-Hao Peng Department of Computer Science and Engineering, Tatung University, Taipei, Taiwan

More information

Speech Intelligibility Enhancement using Microphone Array via Intra-Vehicular Beamforming

Speech Intelligibility Enhancement using Microphone Array via Intra-Vehicular Beamforming Speech Intelligibility Enhancement using Microphone Array via Intra-Vehicular Beamforming Senior Project Proposal Presentation Devin McDonald, Joseph Mesnard Advisors: Dr. Yufeng Lu, Dr. In Soo Ahn November

More information

PRINCIPLES OF COMMUNICATION SYSTEMS. Lecture 1- Introduction Elements, Modulation, Demodulation, Frequency Spectrum

PRINCIPLES OF COMMUNICATION SYSTEMS. Lecture 1- Introduction Elements, Modulation, Demodulation, Frequency Spectrum PRINCIPLES OF COMMUNICATION SYSTEMS Lecture 1- Introduction Elements, Modulation, Demodulation, Frequency Spectrum Topic covered Introduction to subject Elements of Communication system Modulation General

More information

Agenda. Wireless Defined To Zoom or Not to Zoom DuoPhone & DirectTouch De-Mystified Wireless accessories Overview Phonak CROS in depth

Agenda. Wireless Defined To Zoom or Not to Zoom DuoPhone & DirectTouch De-Mystified Wireless accessories Overview Phonak CROS in depth Agenda Wireless Defined To Zoom or Not to Zoom DuoPhone & DirectTouch De-Mystified Wireless accessories Overview Phonak CROS in depth What do wireless and binaural really mean? Coordination of settings

More information

Surround: The Current Technological Situation. David Griesinger Lexicon 3 Oak Park Bedford, MA

Surround: The Current Technological Situation. David Griesinger Lexicon 3 Oak Park Bedford, MA Surround: The Current Technological Situation David Griesinger Lexicon 3 Oak Park Bedford, MA 01730 www.world.std.com/~griesngr There are many open questions 1. What is surround sound 2. Who will listen

More information

E40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1

E40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1 E40M Sound and Music M. Horowitz, J. Plummer, R. Howe 1 LED Cube Project #3 In the next several lectures, we ll study Concepts Coding Light Sound Transforms/equalizers Devices LEDs Analog to digital converters

More information

ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY

ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY D. Nagajyothi 1 and P. Siddaiah 2 1 Department of Electronics and Communication Engineering, Vardhaman College of Engineering, Shamshabad, Telangana,

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

How Radio Works by Marshall Brain

How Radio Works by Marshall Brain How Radio Works by Marshall Brain "Radio waves" transmit music, conversations, pictures and data invisibly through the air, often over millions of miles -- it happens every day in thousands of different

More information

How Radio Works By Marshall Brain

How Radio Works By Marshall Brain How Radio Works By Marshall Brain Excerpted from the excellent resource http://electronics.howstuffworks.com/radio.htm Radio waves transmit music, conversations, pictures and data invisibly through the

More information

Eyes n Ears: A System for Attentive Teleconferencing

Eyes n Ears: A System for Attentive Teleconferencing Eyes n Ears: A System for Attentive Teleconferencing B. Kapralos 1,3, M. Jenkin 1,3, E. Milios 2,3 and J. Tsotsos 1,3 1 Department of Computer Science, York University, North York, Canada M3J 1P3 2 Department

More information

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming

Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Speech and Audio Processing Recognition and Audio Effects Part 3: Beamforming Gerhard Schmidt Christian-Albrechts-Universität zu Kiel Faculty of Engineering Electrical Engineering and Information Engineering

More information

Self Localization Using A Modulated Acoustic Chirp

Self Localization Using A Modulated Acoustic Chirp Self Localization Using A Modulated Acoustic Chirp Brian P. Flanagan The MITRE Corporation, 7515 Colshire Dr., McLean, VA 2212, USA; bflan@mitre.org ABSTRACT This paper describes a robust self localization

More information

Matti Karjalainen. TKK - Helsinki University of Technology Department of Signal Processing and Acoustics (Espoo, Finland)

Matti Karjalainen. TKK - Helsinki University of Technology Department of Signal Processing and Acoustics (Espoo, Finland) Matti Karjalainen TKK - Helsinki University of Technology Department of Signal Processing and Acoustics (Espoo, Finland) 1 Located in the city of Espoo About 10 km from the center of Helsinki www.tkk.fi

More information

Using the VM1010 Wake-on-Sound Microphone and ZeroPower Listening TM Technology

Using the VM1010 Wake-on-Sound Microphone and ZeroPower Listening TM Technology Using the VM1010 Wake-on-Sound Microphone and ZeroPower Listening TM Technology Rev1.0 Author: Tung Shen Chew Contents 1 Introduction... 4 1.1 Always-on voice-control is (almost) everywhere... 4 1.2 Introducing

More information

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches Performance study of Text-independent Speaker identification system using & I for Telephone and Microphone Speeches Ruchi Chaudhary, National Technical Research Organization Abstract: A state-of-the-art

More information

A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February :54

A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February :54 A Digital Signal Processor for Musicians and Audiophiles Published on Monday, 09 February 2009 09:54 The main focus of hearing aid research and development has been on the use of hearing aids to improve

More information

1 White Paper. Intelligibility.

1 White Paper. Intelligibility. 1 FOR YOUR INFORMATION THE LIMITATIONS OF WIDE DISPERSION White Paper Distributed sound systems are the most common approach to providing sound for background music and paging systems. Because distributed

More information

Designing Information Devices and Systems I Spring 2015 Homework 6

Designing Information Devices and Systems I Spring 2015 Homework 6 EECS 16A Designing Information Devices and Systems I Spring 2015 Homework 6 This homework is due March 19, 2015 at 5PM. Note that unless explicitly stated otherwise, you can assume that all op-amps in

More information

Monaural and Binaural Speech Separation

Monaural and Binaural Speech Separation Monaural and Binaural Speech Separation DeLiang Wang Perception & Neurodynamics Lab The Ohio State University Outline of presentation Introduction CASA approach to sound separation Ideal binary mask as

More information

Manuel Campus S FM Transmitter (TX2S)

Manuel Campus S FM Transmitter (TX2S) 1 of 5 Distribution: Montena EMC SA, CH-1728 Rossens, Mr Manfred Portman Path: Marquis:> D:/data/phonak/T_memos/Englisch/ Name Date Visum Produced by F. Marquis FMa Approved F. Marquis FMa Released F.

More information

TELIKOU Intercom System

TELIKOU Intercom System TELIKOU Intercom System DT-100 Desktop Station Instruction Manual 2006 TELIKOU Systems All Rights Reserved www.telikou.com While TELIKOU makes every attempt to maintain the accuracy of the information

More information

Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves

Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves Multiple Audio Spots Design Based on Separating Emission of Carrier and Sideband Waves Tadashi MATSUI 1 ; Daisuke IKEFUJI 1 ; Masato NAKAYAMA 2 ;Takanobu NISHIURA 2 1 Graduate School of Information Science

More information

Microphone Array Design and Beamforming

Microphone Array Design and Beamforming Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial

More information

ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES

ROOM AND CONCERT HALL ACOUSTICS MEASUREMENTS USING ARRAYS OF CAMERAS AND MICROPHONES ROOM AND CONCERT HALL ACOUSTICS The perception of sound by human listeners in a listening space, such as a room or a concert hall is a complicated function of the type of source sound (speech, oration,

More information

WTI-100 Simplex wireless Interface Instruction Manual

WTI-100 Simplex wireless Interface Instruction Manual TELIKOU Intercom System WTI-100 Simplex wireless Interface Instruction Manual 2006 TELIKOU Systems All Rights Reserved www.telikou.com While TELIKOU makes every attempt to maintain the accuracy of the

More information

overview steffen p walz, m.a.

overview steffen p walz, m.a. massively multiplayer audio reality gaming overview steffen p walz, m.a. game design researcher, ph.d. candidate Webzen Inc. 09/01/2005 2005 steffen p walz. upcoming background projects: (serious) pervasive

More information

DIGITAL SIGNAL PROCESSING. Introduction

DIGITAL SIGNAL PROCESSING. Introduction DIGITAL SIGNAL PROCESSING Introduction What is Signal? A SIGNAL is a measurement of a physical quantity of certain medium. Examples of signals: Audio patterns (voice, speech, music) Visual patterns (written

More information

Virtual Reality Calendar Tour Guide

Virtual Reality Calendar Tour Guide Technical Disclosure Commons Defensive Publications Series October 02, 2017 Virtual Reality Calendar Tour Guide Walter Ianneo Follow this and additional works at: http://www.tdcommons.org/dpubs_series

More information

Waves Nx VIRTUAL REALITY AUDIO

Waves Nx VIRTUAL REALITY AUDIO Waves Nx VIRTUAL REALITY AUDIO WAVES VIRTUAL REALITY AUDIO THE FUTURE OF AUDIO REPRODUCTION AND CREATION Today s entertainment is on a mission to recreate the real world. Just as VR makes us feel like

More information

E40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1

E40M Sound and Music. M. Horowitz, J. Plummer, R. Howe 1 E40M Sound and Music M. Horowitz, J. Plummer, R. Howe 1 LED Cube Project #3 In the next several lectures, we ll study Concepts Coding Light Sound Transforms/equalizers Devices LEDs Analog to digital converters

More information

Electronics Design Laboratory Lecture #11. ECEN 2270 Electronics Design Laboratory

Electronics Design Laboratory Lecture #11. ECEN 2270 Electronics Design Laboratory Electronics Design Laboratory Lecture # ECEN 7 Electronics Design Laboratory Project Must rely on fully functional Lab circuits, Lab circuit is optional Can re do wireless or replace it with a different

More information

Microphone Array Power Ratio for Speech Quality Assessment in Noisy Reverberant Environments 1

Microphone Array Power Ratio for Speech Quality Assessment in Noisy Reverberant Environments 1 for Speech Quality Assessment in Noisy Reverberant Environments 1 Prof. Israel Cohen Department of Electrical Engineering Technion - Israel Institute of Technology Technion City, Haifa 3200003, Israel

More information

Perception. Read: AIMA Chapter 24 & Chapter HW#8 due today. Vision

Perception. Read: AIMA Chapter 24 & Chapter HW#8 due today. Vision 11-25-2013 Perception Vision Read: AIMA Chapter 24 & Chapter 25.3 HW#8 due today visual aural haptic & tactile vestibular (balance: equilibrium, acceleration, and orientation wrt gravity) olfactory taste

More information

Sound Processing Technologies for Realistic Sensations in Teleworking

Sound Processing Technologies for Realistic Sensations in Teleworking Sound Processing Technologies for Realistic Sensations in Teleworking Takashi Yazu Makoto Morito In an office environment we usually acquire a large amount of information without any particular effort

More information

CONTACT: , ROBOTIC BASED PROJECTS

CONTACT: , ROBOTIC BASED PROJECTS ROBOTIC BASED PROJECTS 1. ADVANCED ROBOTIC PICK AND PLACE ARM AND HAND SYSTEM 2. AN ARTIFICIAL LAND MARK DESIGN BASED ON MOBILE ROBOT LOCALIZATION AND NAVIGATION 3. ANDROID PHONE ACCELEROMETER SENSOR BASED

More information

1 Publishable summary

1 Publishable summary 1 Publishable summary 1.1 Introduction The DIRHA (Distant-speech Interaction for Robust Home Applications) project was launched as STREP project FP7-288121 in the Commission s Seventh Framework Programme

More information

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

ScienceDirect. Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 122 126 International Conference on Information and Communication Technologies (ICICT 2014) Unsupervised Speech

More information

Distributed Speech Recognition Standardization Activity

Distributed Speech Recognition Standardization Activity Distributed Speech Recognition Standardization Activity Alex Sorin, Ron Hoory, Dan Chazan Telecom and Media Systems Group June 30, 2003 IBM Research Lab in Haifa Advanced Speech Enabled Services ASR App

More information

Auditory System For a Mobile Robot

Auditory System For a Mobile Robot Auditory System For a Mobile Robot PhD Thesis Jean-Marc Valin Department of Electrical Engineering and Computer Engineering Université de Sherbrooke, Québec, Canada Jean-Marc.Valin@USherbrooke.ca Motivations

More information

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas

Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor. Presented by Amir Kiperwas Emanuël A. P. Habets, Jacob Benesty, and Patrick A. Naylor Presented by Amir Kiperwas 1 M-element microphone array One desired source One undesired source Ambient noise field Signals: Broadband Mutually

More information

Robustness (cont.); End-to-end systems

Robustness (cont.); End-to-end systems Robustness (cont.); End-to-end systems Steve Renals Automatic Speech Recognition ASR Lecture 18 27 March 2017 ASR Lecture 18 Robustness (cont.); End-to-end systems 1 Robust Speech Recognition ASR Lecture

More information

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS

INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR PROPOSING A STANDARDISED TESTING ENVIRONMENT FOR BINAURAL SYSTEMS 20-21 September 2018, BULGARIA 1 Proceedings of the International Conference on Information Technologies (InfoTech-2018) 20-21 September 2018, Bulgaria INVESTIGATING BINAURAL LOCALISATION ABILITIES FOR

More information

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Voice terminal characteristics

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Voice terminal characteristics I n t e r n a t i o n a l T e l e c o m m u n i c a t i o n U n i o n ITU-T P.340 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU Amendment 1 (10/2014) SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE

More information

Meeting Corpora Hardware Overview & ASR Accuracies

Meeting Corpora Hardware Overview & ASR Accuracies Meeting Corpora Hardware Overview & ASR Accuracies George Jose (153070011) Guide : Dr. Preeti Rao Indian Institute of Technology, Bombay 22 July, 2016 1/18 Outline 1 AMI Meeting Corpora 2 3 2/18 AMI Meeting

More information

Remote Media Immersion (RMI)

Remote Media Immersion (RMI) Remote Media Immersion (RMI) University of Southern California Integrated Media Systems Center Alexander Sawchuk, Deputy Director Chris Kyriakakis, EE Roger Zimmermann, CS Christos Papadopoulos, CS Cyrus

More information

Physics in Entertainment and the Arts

Physics in Entertainment and the Arts Physics in Entertainment and the Arts Chapter IV The Fine Arts Spectra; Some Second Looks at Waves Spectra of Continuous Waves A wave s spectrum is the range of frequencies the waves cover For sound the

More information

Speech Volume Monitor for Hearing Impaired

Speech Volume Monitor for Hearing Impaired Speech Volume Monitor for Hearing Impaired R.DEEPA (Mphil Research scholar) PSGR Krishnnaml college for women. GRG School of Applied Technology Coimbatore,India Abstract Hearing impaired can be classified

More information

Factors Governing the Intelligibility of Speech Sounds

Factors Governing the Intelligibility of Speech Sounds HSR Journal Club JASA, vol(19) No(1), Jan 1947 Factors Governing the Intelligibility of Speech Sounds N. R. French and J. C. Steinberg 1. Introduction Goal: Determine a quantitative relationship between

More information

Applications: FM wireless audio, USB PC audio broadcasting, wireless microphones, maternal and child care.

Applications: FM wireless audio, USB PC audio broadcasting, wireless microphones, maternal and child care. Product Description 1: The use of advanced digital audio signal processing (DSP) technology and FM modulation phase-locked loop (PLL) to make sound more realistic, more stable performance, long working

More information

(Refer Slide Time: 2:23)

(Refer Slide Time: 2:23) Data Communications Prof. A. Pal Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture-11B Multiplexing (Contd.) Hello and welcome to today s lecture on multiplexing

More information

Integrated Vision and Sound Localization

Integrated Vision and Sound Localization Integrated Vision and Sound Localization Parham Aarabi Safwat Zaky Department of Electrical and Computer Engineering University of Toronto 10 Kings College Road, Toronto, Ontario, Canada, M5S 3G4 parham@stanford.edu

More information

TOOLS FOR DISTANCE COLLABORATION 2012 OSEP PD CONFERENCE WASHINGTON, DC

TOOLS FOR DISTANCE COLLABORATION 2012 OSEP PD CONFERENCE WASHINGTON, DC SCHOLAR INITIATIVE FULL TRANSCRIPT TOOLS FOR DISTANCE COLLABORATION 2012 OSEP PD CONFERENCE WASHINGTON, DC Mark Horney: Once you get past the contact stage and I ll tell you about my projects and you tell

More information

Enhancing Shipboard Maintenance with Augmented Reality

Enhancing Shipboard Maintenance with Augmented Reality Enhancing Shipboard Maintenance with Augmented Reality CACI Oxnard, CA Dennis Giannoni dgiannoni@caci.com (805) 288-6630 INFORMATION DEPLOYED. SOLUTIONS ADVANCED. MISSIONS ACCOMPLISHED. Agenda Virtual

More information

ID-5100 User Evaluation & Test Report

ID-5100 User Evaluation & Test Report ID-5100 User Evaluation & Test Report By Adam Farson VA7OJ/AB4OJ Iss. 1, August 13, 2014. Part I: Brief User Evaluation. Introduction: This report describes the evaluation and lab test of ID-5100 S/N 05001175.

More information

Interfacing to the SoundStation VTX 1000 TM with Vortex Devices

Interfacing to the SoundStation VTX 1000 TM with Vortex Devices Interfacing to the SoundStation VTX 1000 TM with Vortex Devices Application Note Polycom Installed Voice Business Group September 2004 Rev. F TABLE OF CONTENTS TABLE OF CONTENTS... 2 INTRODUCTION... 6

More information

Fabrication of the kinect remote-controlled cars and planning of the motion interaction courses

Fabrication of the kinect remote-controlled cars and planning of the motion interaction courses Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 174 ( 2015 ) 3102 3107 INTE 2014 Fabrication of the kinect remote-controlled cars and planning of the motion

More information

SMARTPHONE SENSOR BASED GESTURE RECOGNITION LIBRARY

SMARTPHONE SENSOR BASED GESTURE RECOGNITION LIBRARY SMARTPHONE SENSOR BASED GESTURE RECOGNITION LIBRARY Sidhesh Badrinarayan 1, Saurabh Abhale 2 1,2 Department of Information Technology, Pune Institute of Computer Technology, Pune, India ABSTRACT: Gestures

More information

Recent Advances in Acoustic Signal Extraction and Dereverberation

Recent Advances in Acoustic Signal Extraction and Dereverberation Recent Advances in Acoustic Signal Extraction and Dereverberation Emanuël Habets Erlangen Colloquium 2016 Scenario Spatial Filtering Estimated Desired Signal Undesired sound components: Sensor noise Competing

More information

Finding an Active Shooter with GNURadio

Finding an Active Shooter with GNURadio Finding an Active Shooter with GNURadio 1 Active Shooter Tactical Response Assistant Team : George Mason University Students: Ben McCall, Puja Patel, Joel Williams, Rohini Shah, Aryan Toughiry GMU Sponsors:

More information

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution PAGE 433 Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution Wenliang Lu, D. Sen, and Shuai Wang School of Electrical Engineering & Telecommunications University of New South Wales,

More information

Agilent U8903A Audio Analyzer

Agilent U8903A Audio Analyzer Agilent U8903A Audio Analyzer Fast and accurate multi-channels audio analysis made affordable (Replacement of the HP 8903B) Page 1 Agilent Restricted New U8903A Audio Analyzer Replacing the Popular HP

More information

SST Expert Testimony Common Questions and Answers

SST Expert Testimony Common Questions and Answers SST Expert Testimony Common Questions and Answers This document is a collection of questions that have commonly been asked about the ShotSpotter system during court testimony and deposition. If possible,

More information

Outline. Communications Engineering 1

Outline. Communications Engineering 1 Outline Introduction Signal, random variable, random process and spectra Analog modulation Analog to digital conversion Digital transmission through baseband channels Signal space representation Optimal

More information

ISO JTC 1 SC 24 WG9 G E R A R D J. K I M K O R E A U N I V E R S I T Y

ISO JTC 1 SC 24 WG9 G E R A R D J. K I M K O R E A U N I V E R S I T Y New Work Item Proposal: A Standard Reference Model for Generic MAR Systems ISO JTC 1 SC 24 WG9 G E R A R D J. K I M K O R E A U N I V E R S I T Y What is a Reference Model? A reference model (for a given

More information

Loudspeaker Array Case Study

Loudspeaker Array Case Study Loudspeaker Array Case Study The need for intelligibility Churches, theatres and schools are the most demanding applications for speech intelligibility. The whole point of being in these facilities is

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

RIR Estimation for Synthetic Data Acquisition

RIR Estimation for Synthetic Data Acquisition RIR Estimation for Synthetic Data Acquisition Kevin Venalainen, Philippe Moquin, Dinei Florencio Microsoft ABSTRACT - Automatic Speech Recognition (ASR) works best when the speech signal best matches the

More information

GESTURE RECOGNITION SOLUTION FOR PRESENTATION CONTROL

GESTURE RECOGNITION SOLUTION FOR PRESENTATION CONTROL GESTURE RECOGNITION SOLUTION FOR PRESENTATION CONTROL Darko Martinovikj Nevena Ackovska Faculty of Computer Science and Engineering Skopje, R. Macedonia ABSTRACT Despite the fact that there are different

More information

CSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued

CSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued CSCD 433 Network Programming Fall 2016 Lecture 5 Physical Layer Continued 1 Topics Definitions Analog Transmission of Digital Data Digital Transmission of Analog Data Multiplexing 2 Different Types of

More information

AVAL: Audio-Visual Active Locator ECE-492/3 Senior Design Project Spring 2014

AVAL: Audio-Visual Active Locator ECE-492/3 Senior Design Project Spring 2014 AVAL: Audio-Visual Active Locator ECE-492/3 Senior Design Project Spring 204 Electrical and Computer Engineering Department Volgenau School of Engineering George Mason University Fairfax, VA Team members:

More information

Development of the 2012 SJTU HVR System

Development of the 2012 SJTU HVR System Development of the 2012 SJTU HVR System Hainan Xu Shanghai Jiao Tong University 800 Dongchuan RD. Minhang Shanghai, China xhnwww@sjtu.edu.cn Yuchen Fan Shanghai Jiao Tong University 800 Dongchuan RD. Minhang

More information

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm A.T. Rajamanickam, N.P.Subiramaniyam, A.Balamurugan*,

More information

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA ECE-492/3 Senior Design Project Spring 2015 Electrical and Computer Engineering Department Volgenau

More information

Introducing evolution wireless D1

Introducing evolution wireless D1 Introducing evolution wireless D1 evolution wireless D1 is a next-generation digital wireless system. Simpler. Reliable at last. Your thing is being a singer, songwriter, instrumentalist or presenter Whatever,

More information

Autonomous Vehicle Speaker Verification System

Autonomous Vehicle Speaker Verification System Autonomous Vehicle Speaker Verification System Functional Requirements List and Performance Specifications Aaron Pfalzgraf Christopher Sullivan Project Advisor: Dr. Jose Sanchez 4 November 2013 AVSVS 2

More information

REDUCING THE NEGATIVE EFFECTS OF EAR-CANAL OCCLUSION. Samuel S. Job

REDUCING THE NEGATIVE EFFECTS OF EAR-CANAL OCCLUSION. Samuel S. Job REDUCING THE NEGATIVE EFFECTS OF EAR-CANAL OCCLUSION Samuel S. Job Department of Electrical and Computer Engineering Brigham Young University Provo, UT 84602 Abstract The negative effects of ear-canal

More information

Application Areas of AI Artificial intelligence is divided into different branches which are mentioned below:

Application Areas of AI   Artificial intelligence is divided into different branches which are mentioned below: Week 2 - o Expert Systems o Natural Language Processing (NLP) o Computer Vision o Speech Recognition And Generation o Robotics o Neural Network o Virtual Reality APPLICATION AREAS OF ARTIFICIAL INTELLIGENCE

More information

PLACEMENT BROCHURE COMMUNICATION ENGINEERING

PLACEMENT BROCHURE COMMUNICATION ENGINEERING DEPARTMENT OF ELECTRICAL ENGINEERING INDIAN INSTITUTE OF TECHNOLOGY DELHI PLACEMENT BROCHURE 2017-2018 COMMUNICATION ENGINEERING It is with great pleasure that I introduce the students of Communication

More information

Recurrent Timing Neural Networks for Joint F0-Localisation Estimation

Recurrent Timing Neural Networks for Joint F0-Localisation Estimation Recurrent Timing Neural Networks for Joint F0-Localisation Estimation Stuart N. Wrigley and Guy J. Brown Department of Computer Science, University of Sheffield Regent Court, 211 Portobello Street, Sheffield

More information

HPV Technologies LLC January 12, 2006

HPV Technologies LLC January 12, 2006 Without communication, there is chaos. Loud, annoying tones without the spoken word are just that: Annoying tones. Tones do not assist in determining intent unless the subject responds by escalating the

More information

IEEE Electromagnetic Compatibility Standards (Active & Archive) Collection: VuSpec

IEEE Electromagnetic Compatibility Standards (Active & Archive) Collection: VuSpec IEEE Electromagnetic Compatibility Standards (Active & Archive) Collection: VuSpec This value-packed VuSpec represents the most complete resource available for professional engineers looking for best practices

More information

Android Speech Interface to a Home Robot July 2012

Android Speech Interface to a Home Robot July 2012 Android Speech Interface to a Home Robot July 2012 Deya Banisakher Undergraduate, Computer Engineering dmbxt4@mail.missouri.edu Tatiana Alexenko Graduate Mentor ta7cf@mail.missouri.edu Megan Biondo Undergraduate,

More information

What is a Communications System?

What is a Communications System? Introduction to Communication Systems: An Overview James Flynn Sharlene Katz What is a Communications System? A communications system transfers an information bearing signal from a source to one or more

More information

Collection of re-transmitted data and impulse responses and remote ASR and speaker verification. Igor Szoke, Lada Mosner (et al.

Collection of re-transmitted data and impulse responses and remote ASR and speaker verification. Igor Szoke, Lada Mosner (et al. Collection of re-transmitted data and impulse responses and remote ASR and speaker verification. Igor Szoke, Lada Mosner (et al.) BUT Speech@FIT LISTEN Workshop, Bonn, 19.7.2018 Why DRAPAK project To ship

More information