Selected Research Signal & Information Processing Group

1 COST Action IC MC Meeting
Selected Research Signal & Information Processing Group
Zheng-Hua Tan, Dept. of Electronic Systems, Aalborg Univ., Denmark, zt@es.aau.dk

2 Outline
Introduction to the Signal and Information Processing Group, Department of Electronic Systems, Aalborg University
COST Action IC1206 related activities and work

3 Aalborg, Denmark

4 Aalborg University
Inaugurated in 1974 in Aalborg (population: 200,000).
20,000 students, 2,000 research personnel.
12.5% international students (most graduate programmes are taught in English).
Engineering, natural and social sciences, medicine, and humanities.
Department of Electronic Systems: 300+ employees.
Renowned for:
  Project-oriented, problem-based learning in teams
  Interdisciplinarity and cooperation with industry
Network university: campuses in Aalborg, Esbjerg, and Copenhagen.

5 Research areas of the SIP group
Speech and language processing, multimedia signal processing, machine learning, pattern recognition
Usability engineering, human computer/robot interaction
Signal processing, numerical linear algebra, statistics, compressed sensing, optimization
Reconfigurable architectures, resource optimal hardware/software co-design, computing, high performance scientific computing

6 Funding agencies and companies, plus collaboration with a dozen universities and institutes worldwide.

7 Outline
Introduction to the Signal and Information Processing Group, Department of Electronic Systems, Aalborg University
COST Action IC1206 related activities and work
  Denoising and VAD for SID; SID of disguised voice
  Age and gender identification for recommender systems
  Durable Interaction with Socially Intelligent Robots

8 On-going related projects
A Robust Audio-based Hybrid Recommendations Framework for Interactive TV. (TESCO UK using faces is in the headlines.) Bang & Olufsen A/S and The Danish Council for Technology and Innovation.
isrobot - Durable Interaction with Socially Intelligent Robots. The Danish Council for Independent Research in Technology and Production Sciences.
CoSound: A Cognitive Systems Approach to Enriched and Actionable Information from Audio Streams. Danish Strategic Research Council.
Speaker Recognition under Adverse Environments. Subproject supported by European Commission Erasmus Mobility for Life Scholarship.

9 Research topics
Speaker identification under adverse environments
  Acoustic noise (denoising, VAD)
  Disguised voice (multistyle training, multiple frame rates)
Age, gender and emotion identification
  For TV recommender systems
  For human robot interaction
Audio-visual fusion based on sensor networks

10 VAD for speaker identification
Two-pass segment-based denoising and voice activity detection (VAD)
DARPA Robust Automatic Transcription of Speech (RATS) database

11 Challenge to denoising and VAD: non-stationary noise
Burst-like noise requires special attention because it makes existing methods fail.
Zheng-Hua Tan, Mataro, Spain, 11/

12 Two-pass segment-based denoising and VAD
The method is designed around the very different characteristics of stationary and burst-like noise.
1st pass:
1) High-energy segments are detected using the a posteriori SNR weighted energy difference (SNR-dE). [Z.-H. Tan and B. Lindberg, IEEE Journal of Selected Topics in Signal Processing, 2010.]
2) Within a high-energy segment, if no pitch is found, the segment is classified as noise.
2nd pass:
Stationary noise is removed by a modified MSNE method.
The VAD approach is applied to the denoised data.
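The first pass can be illustrated with a minimal Python sketch. It is a simplified reading of the slide, not the published algorithm: the weighting formula, the threshold, the frame sizes, and every function name below are assumptions made for illustration, and the pitch decision is taken as a precomputed per-frame flag rather than a real pitch tracker.

```python
import math


def log_frame_energies(samples, frame_len=200, hop=100):
    """Log energy of each frame; frame_len and hop are illustrative values."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        # A small constant avoids log(0) on silent frames.
        energies.append(math.log(sum(x * x for x in frame) + 1e-12))
    return energies


def snr_weighted_energy_difference(log_e, log_noise):
    """Frame-to-frame log-energy change weighted by a posteriori SNR.

    Assumed simplification: the weight is max(log E(t) - log E_noise, 0),
    so changes at or below the noise floor contribute nothing.
    """
    diffs = [0.0]
    for t in range(1, len(log_e)):
        snr_post = max(log_e[t] - log_noise, 0.0)
        diffs.append(abs(log_e[t] - log_e[t - 1]) * snr_post)
    return diffs


def high_energy_segments(snr_de, threshold):
    """Group consecutive frames above the threshold into (start, end) spans."""
    segments, start = [], None
    for t, value in enumerate(snr_de):
        if value > threshold and start is None:
            start = t
        elif value <= threshold and start is not None:
            segments.append((start, t))
            start = None
    if start is not None:
        segments.append((start, len(snr_de)))
    return segments


def label_segments(segments, has_pitch):
    """A high-energy segment with no pitched frame is labelled as noise."""
    labels = []
    for start, end in segments:
        voiced = any(has_pitch[start:end])
        labels.append(((start, end), "speech candidate" if voiced else "noise"))
    return labels
```

With a toy log-energy contour such as `[0.0, 0.0, 5.0, 5.5, 0.0, 0.0]` and a noise floor of `0.0`, frames 2 and 3 form one high-energy segment, and it is labelled as noise when no frame in it carries pitch. The second pass (stationary noise removal followed by VAD) is not sketched here.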

13 Denoising and VAD results for known data
[Figure: known data, channel H]

14 Denoising and VAD results for unknown data
[Figure: unknown data, channel H]

15 Speaker ID system performance
O. Plchot, S. Matsoukas, P. Matejka, N. Dehak, J. Ma, S. Cumani, O. Glembek, H. Hermansky, S.H. Mallidi, N. Mesgarani, R. Schwartz, M. Soufifar, Z.-H. Tan, S. Thomas, B. Zhang and X. Zhou, "Developing a Speaker Identification System for the DARPA RATS project," ICASSP

16 Age and gender ID for recommender systems
Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, "Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content," IEEE Transactions on Consumer Electronics, vol. 59, no. 3, pp , August
Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt Jensen, "Demographic Recommendation by means of Group Profile Elicitation Using Speaker Age and Gender Recognition," Interspeech 2013, Lyon, France, August 25-29,

17 Project overview
A user profile is needed to make good TV recommendations.
An audio classifier, rather than manually entered data or usage patterns, is used to implicitly gather data for the user profile.
Age and gender are useful parameters for recommendations.
Current accuracy for age and gender detection (7 classes) is just over 50% (the aGender corpus).
There can be large confusion between age and gender classes.
Hypothesis: items recommended from the age-and-gender extracted profile are perceived to be better than random items.

18 Matching and recommendation
Recommendation strategy:
Group profile adaptation (if necessary) to convert an M-user group profile to an N-slot content profile.
Genetic selection algorithm (k chromosomes, where each chromosome is a sequence of items).
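The slide does not say how an M-user group profile is converted to an N-slot content profile. One plausible reading, shown here purely as an assumption, is proportional allocation with largest-remainder rounding: each age-and-gender class receives slots in proportion to how many group members belong to it. The function name `adapt_group_profile` is hypothetical.

```python
from collections import Counter


def adapt_group_profile(user_classes, n_slots):
    """Spread n_slots over the classes present in an M-user group profile.

    Assumed method: proportional shares, rounded with the largest-remainder
    rule so that the slot counts always sum to n_slots.
    """
    if not user_classes:
        raise ValueError("the group profile must contain at least one user")
    counts = Counter(user_classes)
    m = len(user_classes)
    quotas = {c: n_slots * k / m for c, k in counts.items()}
    slots = {c: int(q) for c, q in quotas.items()}
    remaining = n_slots - sum(slots.values())
    # Hand leftover slots to the classes with the largest fractional part;
    # ties are broken alphabetically to keep the result deterministic.
    by_remainder = sorted(counts, key=lambda c: (-(quotas[c] - slots[c]), c))
    for c in by_remainder[:remaining]:
        slots[c] += 1
    return slots
```

For a three-user group with two children and one senior female, `adapt_group_profile(["C", "C", "SF"], 6)` gives four child slots and two senior-female slots. The genetic selection step that fills those slots with items is not sketched here.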

19 Age and gender classification
7 age-and-gender classes: Child (C), Young Male (YM), Young Female (YF), Adult Male (AM), Adult Female (AF), Senior Male (SM) and Senior Female (SF).
A Viewer Configuration is a profile for the group, e.g. C, C, SF.
Each speaker is connected to real speaker utterances from the aGender corpus. These are classified to determine each user's age and gender profile.
Age and gender classification uses both acoustic and prosodic features.
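As a small illustration of the notation above, the seven classes and a Viewer Configuration such as "C, C, SF" can be represented as follows. The names `AgeGender` and `parse_viewer_configuration` are hypothetical and chosen only for this sketch; the classifier itself is not shown.

```python
from enum import Enum


class AgeGender(Enum):
    """The seven age-and-gender classes listed on the slide."""
    C = "Child"
    YM = "Young Male"
    YF = "Young Female"
    AM = "Adult Male"
    AF = "Adult Female"
    SM = "Senior Male"
    SF = "Senior Female"


def parse_viewer_configuration(text):
    """Turn a comma-separated configuration such as 'C, C, SF' into classes."""
    return [AgeGender[token.strip()] for token in text.split(",") if token.strip()]
```

Here `parse_viewer_configuration("C, C, SF")` yields two `AgeGender.C` members followed by `AgeGender.SF`, and an unknown abbreviation raises a `KeyError`.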

20 User study and results
The TV2 (Danish broadcaster) advertisement corpus was used.
Results: significant increase in median rating for recommended ads (7.75 as opposed to 4.25).
Conclusion: This work shows the potential of using age and gender audio classification for recommending sequences of video clips to group viewers.

21 Durable Interaction with Socially Intelligent Robots (isrobot)
Socially assistive robots:
  increase the quality of life
  decrease the expense of social care
"Robotics will be as important tomorrow as computers are today." - Aldebaran Robotics.
"I can envision a future in which robotic devices will become a nearly ubiquitous part of our day to day lives." - Bill Gates.
The global service robotics market: 2012 $20 billion, 2017 $46 billion (17.4% increase annually).

22 isrobot cont.
Objective: To enable socially assistive robots to feel and express feelings, with the ultimate goal of establishing durable social interaction.
The Danish Council for Independent Research
Challenges:
  Low signal quality, due to environmental noise and imperfect placement of sensors, that significantly degrades the robot's capability to sense.
  Lack of understanding of users and context, which makes a robot merely a pet with limited richness of expression.
Social intelligence and durable interaction require the robot to locate, recognize and feel its users and to respond with awareness.

23 Summary
Introduction to the Signal and Information Processing Group, Department of Electronic Systems, Aalborg University
COST Action IC1206 related activities and work
  Denoising and VAD for SID; SID of disguised voice
  Age and gender identification for recommender systems
  Durable Interaction with Socially Intelligent Robots
Thank you for your attention!

Improving Robustness against Environmental Sounds for Directing Attention of Social Robots

Improving Robustness against Environmental Sounds for Directing Attention of Social Robots Improving Robustness against Environmental Sounds for Directing Attention of Social Robots Nicolai B. Thomsen, Zheng-Hua Tan, Børge Lindberg, and Søren Holdt Jensen Dept. Electronic Systems, Aalborg University,

More information

AAU SUMMER SCHOOL PROGRAMMING SOCIAL ROBOTS FOR HUMAN INTERACTION LECTURE 10 MULTIMODAL HUMAN-ROBOT INTERACTION

AAU SUMMER SCHOOL PROGRAMMING SOCIAL ROBOTS FOR HUMAN INTERACTION LECTURE 10 MULTIMODAL HUMAN-ROBOT INTERACTION AAU SUMMER SCHOOL PROGRAMMING SOCIAL ROBOTS FOR HUMAN INTERACTION LECTURE 10 MULTIMODAL HUMAN-ROBOT INTERACTION COURSE OUTLINE 1. Introduction to Robot Operating System (ROS) 2. Introduction to isociobot

More information

Microphone Array Design and Beamforming

Microphone Array Design and Beamforming Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with contributions from Vladi Tourbabin and Hendrik Barfuss EUSIPCO Tutorial

More information

Multi-band long-term signal variability features for robust voice activity detection

Multi-band long-term signal variability features for robust voice activity detection INTESPEECH 3 Multi-band long-term signal variability features for robust voice activity detection Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Ghosh,MingLi, Maarten Van Segbroeck, Alexandros

More information

Using RASTA in task independent TANDEM feature extraction

Using RASTA in task independent TANDEM feature extraction R E S E A R C H R E P O R T I D I A P Using RASTA in task independent TANDEM feature extraction Guillermo Aradilla a John Dines a Sunil Sivadas a b IDIAP RR 04-22 April 2004 D a l l e M o l l e I n s t

More information

Epoch Extraction From Emotional Speech

Epoch Extraction From Emotional Speech Epoch Extraction From al Speech D Govind and S R M Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati Email:{dgovind,prasanna}@iitg.ernet.in Abstract

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

Speaker and Noise Independent Voice Activity Detection

Speaker and Noise Independent Voice Activity Detection Speaker and Noise Independent Voice Activity Detection François G. Germain, Dennis L. Sun,2, Gautham J. Mysore 3 Center for Computer Research in Music and Acoustics, Stanford University, CA 9435 2 Department

More information

Lecture 2: Sensors. Zheng-Hua Tan

Lecture 2: Sensors. Zheng-Hua Tan Instrumentation and data acquisition Spring 2010 Lecture 2: Sensors Zheng-Hua Tan Multimedia Information and Signal Processing Department of Electronic Systems Aalborg University, Denmark zt@es.aau.dk

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Speech/Music Discrimination via Energy Density Analysis

Speech/Music Discrimination via Energy Density Analysis Speech/Music Discrimination via Energy Density Analysis Stanis law Kacprzak and Mariusz Zió lko Department of Electronics, AGH University of Science and Technology al. Mickiewicza 30, Kraków, Poland {skacprza,

More information

VOICE ACTIVITY DETECTION USING NEUROGRAMS. Wissam A. Jassim and Naomi Harte

VOICE ACTIVITY DETECTION USING NEUROGRAMS. Wissam A. Jassim and Naomi Harte VOICE ACTIVITY DETECTION USING NEUROGRAMS Wissam A. Jassim and Naomi Harte Sigmedia, ADAPT Centre, School of Engineering, Trinity College Dublin, Ireland ABSTRACT Existing acoustic-signal-based algorithms

More information

Multi-Modal User Interaction

Multi-Modal User Interaction Multi-Modal User Interaction Lecture 4: Multiple Modalities Zheng-Hua Tan Department of Electronic Systems Aalborg University, Denmark zt@es.aau.dk MMUI, IV, Zheng-Hua Tan 1 Outline Multimodal interface

More information

SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE

SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE Paper ID: AM-01 SOUND SOURCE RECOGNITION FOR INTELLIGENT SURVEILLANCE Md. Rokunuzzaman* 1, Lutfun Nahar Nipa 1, Tamanna Tasnim Moon 1, Shafiul Alam 1 1 Department of Mechanical Engineering, Rajshahi University

More information

Introduction of Audio and Music

Introduction of Audio and Music 1 Introduction of Audio and Music Wei-Ta Chu 2009/12/3 Outline 2 Introduction of Audio Signals Introduction of Music 3 Introduction of Audio Signals Wei-Ta Chu 2009/12/3 Li and Drew, Fundamentals of Multimedia,

More information

arxiv: v2 [cs.sd] 15 May 2018

arxiv: v2 [cs.sd] 15 May 2018 Voices Obscured in Complex Environmental Settings (VOICES) corpus Colleen Richey 2 * and Maria A.Barrios 1 *, Zeb Armstrong 2, Chris Bartels 2, Horacio Franco 2, Martin Graciarena 2, Aaron Lawson 2, Mahesh

More information

Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives

Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives Mathew Magimai Doss Collaborators: Vinayak Abrol, Selen Hande Kabil, Hannah Muckenhirn, Dimitri

More information

Visvesvaraya Technological University, Belagavi

Visvesvaraya Technological University, Belagavi Time Table for M.TECH. Examinations, June / July 2017 M. TECH. 2010 Scheme 2011 Scheme 2012 Scheme 2014 Scheme 2016 Scheme [CBCS] Semester I II III I II III I II III I II IV I II Time Date, Day 14/06/2017,

More information

Lecture 4: Digital representation and data analysis

Lecture 4: Digital representation and data analysis Instrumentation and data acquisition Spring 010 Lecture 4: Digital representation and data analysis Zheng-Hua Tan Multimedia Information and Signal Processing Department of Electronic Systems Aalborg University,

More information

Dimension Reduction of the Modulation Spectrogram for Speaker Verification

Dimension Reduction of the Modulation Spectrogram for Speaker Verification Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland Kong Aik Lee and

More information

The Effects of Entrainment in a Tutoring Dialogue System. Huy Nguyen, Jesse Thomason CS 3710 University of Pittsburgh

The Effects of Entrainment in a Tutoring Dialogue System. Huy Nguyen, Jesse Thomason CS 3710 University of Pittsburgh The Effects of Entrainment in a Tutoring Dialogue System Huy Nguyen, Jesse Thomason CS 3710 University of Pittsburgh Outline Introduction Corpus Post-Hoc Experiment Results Summary 2 Introduction Spoken

More information

A Survey and Evaluation of Voice Activity Detection Algorithms

A Survey and Evaluation of Voice Activity Detection Algorithms A Survey and Evaluation of Voice Activity Detection Algorithms Seshashyama Sameeraj Meduri (ssme09@student.bth.se, 861003-7577) Rufus Ananth (anru09@student.bth.se, 861129-5018) Examiner: Dr. Sven Johansson

More information

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches Performance study of Text-independent Speaker identification system using & I for Telephone and Microphone Speeches Ruchi Chaudhary, National Technical Research Organization Abstract: A state-of-the-art

More information

MULTI-MICROPHONE FUSION FOR DETECTION OF SPEECH AND ACOUSTIC EVENTS IN SMART SPACES

MULTI-MICROPHONE FUSION FOR DETECTION OF SPEECH AND ACOUSTIC EVENTS IN SMART SPACES MULTI-MICROPHONE FUSION FOR DETECTION OF SPEECH AND ACOUSTIC EVENTS IN SMART SPACES Panagiotis Giannoulis 1,3, Gerasimos Potamianos 2,3, Athanasios Katsamanis 1,3, Petros Maragos 1,3 1 School of Electr.

More information

IMPROVEMENTS TO THE IBM SPEECH ACTIVITY DETECTION SYSTEM FOR THE DARPA RATS PROGRAM

IMPROVEMENTS TO THE IBM SPEECH ACTIVITY DETECTION SYSTEM FOR THE DARPA RATS PROGRAM IMPROVEMENTS TO THE IBM SPEECH ACTIVITY DETECTION SYSTEM FOR THE DARPA RATS PROGRAM Samuel Thomas 1, George Saon 1, Maarten Van Segbroeck 2 and Shrikanth S. Narayanan 2 1 IBM T.J. Watson Research Center,

More information

Progress in the BBN Keyword Search System for the DARPA RATS Program

Progress in the BBN Keyword Search System for the DARPA RATS Program INTERSPEECH 2014 Progress in the BBN Keyword Search System for the DARPA RATS Program Tim Ng 1, Roger Hsiao 1, Le Zhang 1, Damianos Karakos 1, Sri Harish Mallidi 2, Martin Karafiát 3,KarelVeselý 3, Igor

More information

ENHANCED HUMAN-AGENT INTERACTION: AUGMENTING INTERACTION MODELS WITH EMBODIED AGENTS BY SERAFIN BENTO. MASTER OF SCIENCE in INFORMATION SYSTEMS

ENHANCED HUMAN-AGENT INTERACTION: AUGMENTING INTERACTION MODELS WITH EMBODIED AGENTS BY SERAFIN BENTO. MASTER OF SCIENCE in INFORMATION SYSTEMS BY SERAFIN BENTO MASTER OF SCIENCE in INFORMATION SYSTEMS Edmonton, Alberta September, 2015 ABSTRACT The popularity of software agents demands for more comprehensive HAI design processes. The outcome of

More information

Book Chapters. Refereed Journal Publications J11

Book Chapters. Refereed Journal Publications J11 Book Chapters B2 B1 A. Mouchtaris and P. Tsakalides, Low Bitrate Coding of Spot Audio Signals for Interactive and Immersive Audio Applications, in New Directions in Intelligent Interactive Multimedia,

More information

Testing the Intelligibility of Corrupted Speech with an Automated Speech Recognition System

Testing the Intelligibility of Corrupted Speech with an Automated Speech Recognition System Testing the Intelligibility of Corrupted Speech with an Automated Speech Recognition System William T. HICKS, Brett Y. SMOLENSKI, Robert E. YANTORNO Electrical & Computer Engineering Department College

More information

SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS

SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS AKSHAY CHANDRASHEKARAN ANOOP RAMAKRISHNA akshayc@cmu.edu anoopr@andrew.cmu.edu ABHISHEK JAIN GE YANG ajain2@andrew.cmu.edu younger@cmu.edu NIDHI KOHLI R

More information

TA2 Newsletter April 2010

TA2 Newsletter April 2010 Content TA2 - making communications and engagement easier among groups of people separated in space and time... 1 The TA2 objectives... 2 Pathfinders to demonstrate and assess TA2... 3 World premiere:

More information

4-206 CST Voice: (315) (o), (315) (m) Department of EECS Fax: (315)

4-206 CST Voice: (315) (o), (315) (m) Department of EECS Fax: (315) Hao Chen Contact Information Research Interests Education 4-206 CST Voice: (315) 443-4416 (o), (315) 569-3454 (m) Department of EECS Fax: (315) 443-2583 Syracuse University E-mail: hchen21@syr.edu Syracuse,

More information

Language, Context and Location

Language, Context and Location Language, Context and Location Svenja Adolphs Language and Context Everyday communication has evolved rapidly over the past decade with an increase in the use of digital devices. Techniques for capturing

More information

Electronic disguised voice identification based on Mel- Frequency Cepstral Coefficient analysis

Electronic disguised voice identification based on Mel- Frequency Cepstral Coefficient analysis International Journal of Scientific and Research Publications, Volume 5, Issue 11, November 2015 412 Electronic disguised voice identification based on Mel- Frequency Cepstral Coefficient analysis Shalate

More information

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged ADVANCED ROBOTICS SOLUTIONS * Intelli Mobile Robot for Multi Specialty Operations * Advanced Robotic Pick and Place Arm and Hand System * Automatic Color Sensing Robot using PC * AI Based Image Capturing

More information

Audio Classification by Search of Primary Components

Audio Classification by Search of Primary Components Audio Classification by Search of Primary Components Julien PINQUIER, José ARIAS and Régine ANDRE-OBRECHT Equipe SAMOVA, IRIT, UMR 5505 CNRS INP UPS 118, route de Narbonne, 3106 Toulouse cedex 04, FRANCE

More information

Clean Water Modelling Advisory Group Autumn Conference

Clean Water Modelling Advisory Group Autumn Conference Clean Water Modelling Advisory Group Autumn Conference Day 1: Modelling the Demand for Water Thursday 23 October 2014 Facing water industry challenges Hans Jensen Topics for today What is UKWIR? What do

More information

Mikko Myllymäki and Tuomas Virtanen

Mikko Myllymäki and Tuomas Virtanen NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,

More information

A SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS. Pavlos Papadopoulos, Andreas Tsiartas, James Gibson, and Shrikanth Narayanan

A SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS. Pavlos Papadopoulos, Andreas Tsiartas, James Gibson, and Shrikanth Narayanan IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) A SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS Pavlos Papadopoulos, Andreas Tsiartas, James Gibson, and

More information

Natural Interaction with Social Robots

Natural Interaction with Social Robots Workshop: Natural Interaction with Social Robots Part of the Topig Group with the same name. http://homepages.stca.herts.ac.uk/~comqkd/tg-naturalinteractionwithsocialrobots.html organized by Kerstin Dautenhahn,

More information

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES

AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES AUTOMATIC SPEECH RECOGNITION FOR NUMERIC DIGITS USING TIME NORMALIZATION AND ENERGY ENVELOPES N. Sunil 1, K. Sahithya Reddy 2, U.N.D.L.mounika 3 1 ECE, Gurunanak Institute of Technology, (India) 2 ECE,

More information

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects NCCT Promise for the Best Projects IEEE PROJECTS in various Domains Latest Projects, 2009-2010 ADVANCED ROBOTICS SOLUTIONS EMBEDDED SYSTEM PROJECTS Microcontrollers VLSI DSP Matlab Robotics ADVANCED ROBOTICS

More information

TECHNOLOGICAL COOPERATION MISSION COMPANY PARTNER SEARCH

TECHNOLOGICAL COOPERATION MISSION COMPANY PARTNER SEARCH TECHNOLOGICAL COOPERATION MISSION COMPANY PARTNER SEARCH The information you are about to provide in this form will be distributed among GERMAN companies matching your company profile and that might be

More information

An Optimization of Audio Classification and Segmentation using GASOM Algorithm

An Optimization of Audio Classification and Segmentation using GASOM Algorithm An Optimization of Audio Classification and Segmentation using GASOM Algorithm Dabbabi Karim, Cherif Adnen Research Unity of Processing and Analysis of Electrical and Energetic Systems Faculty of Sciences

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter Extraction and Recognition of Text From Digital English Comic Image Using Median Filter S.Ranjini 1 Research Scholar,Department of Information technology Bharathiar University Coimbatore,India ranjinisengottaiyan@gmail.com

More information

CONTROL OF SENSORS FOR SEQUENTIAL DETECTION A STOCHASTIC APPROACH

CONTROL OF SENSORS FOR SEQUENTIAL DETECTION A STOCHASTIC APPROACH file://\\52zhtv-fs-725v\cstemp\adlib\input\wr_export_131127111121_237836102... Page 1 of 1 11/27/2013 AFRL-OSR-VA-TR-2013-0604 CONTROL OF SENSORS FOR SEQUENTIAL DETECTION A STOCHASTIC APPROACH VIJAY GUPTA

More information

Android Speech Interface to a Home Robot July 2012

Android Speech Interface to a Home Robot July 2012 Android Speech Interface to a Home Robot July 2012 Deya Banisakher Undergraduate, Computer Engineering dmbxt4@mail.missouri.edu Tatiana Alexenko Graduate Mentor ta7cf@mail.missouri.edu Megan Biondo Undergraduate,

More information

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Different Approaches of Spectral Subtraction Method for Speech Enhancement ISSN 2249 5460 Available online at www.internationalejournals.com International ejournals International Journal of Mathematical Sciences, Technology and Humanities 95 (2013 1056 1062 Different Approaches

More information

ROTATIONAL RESET STRATEGY FOR ONLINE SEMI-SUPERVISED NMF-BASED SPEECH ENHANCEMENT FOR LONG RECORDINGS

ROTATIONAL RESET STRATEGY FOR ONLINE SEMI-SUPERVISED NMF-BASED SPEECH ENHANCEMENT FOR LONG RECORDINGS ROTATIONAL RESET STRATEGY FOR ONLINE SEMI-SUPERVISED NMF-BASED SPEECH ENHANCEMENT FOR LONG RECORDINGS Jun Zhou Southwest University Dept. of Computer Science Beibei, Chongqing 47, China zhouj@swu.edu.cn

More information

The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection

The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection Tomi Kinnunen, University of Eastern Finland, FINLAND Md Sahidullah, University of Eastern Finland, FINLAND Héctor

More information

A multi-class method for detecting audio events in news broadcasts

A multi-class method for detecting audio events in news broadcasts A multi-class method for detecting audio events in news broadcasts Sergios Petridis, Theodoros Giannakopoulos, and Stavros Perantonis Computational Intelligence Laboratory, Institute of Informatics and

More information

Robust Low-Resource Sound Localization in Correlated Noise

Robust Low-Resource Sound Localization in Correlated Noise INTERSPEECH 2014 Robust Low-Resource Sound Localization in Correlated Noise Lorin Netsch, Jacek Stachurski Texas Instruments, Inc. netsch@ti.com, jacek@ti.com Abstract In this paper we address the problem

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

Curriculum Vitae. Petar M. Djurić

Curriculum Vitae. Petar M. Djurić Curriculum Vitae Petar M. Djurić Department of Electrical and Computer Engineering 11794 Tel: (631) 632-8423; Email: petar.djuric@stonybrook.edu http://www.ee.sunysb.edu/ djuric/home.html EDUCATION: Ph.D.,

More information

MATLAB DIGITAL IMAGE/SIGNAL PROCESSING TITLES

MATLAB DIGITAL IMAGE/SIGNAL PROCESSING TITLES MATLAB DIGITAL IMAGE/SIGNAL PROCESSING TITLES -2018 S.NO PROJECT CODE 1 ITIMP01 2 ITIMP02 3 ITIMP03 4 ITIMP04 5 ITIMP05 6 ITIMP06 7 ITIMP07 8 ITIMP08 9 ITIMP09 `10 ITIMP10 11 ITIMP11 12 ITIMP12 13 ITIMP13

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May-2013 1840 An Overview of Distributed Speech Recognition over WMN Jyoti Prakash Vengurlekar vengurlekar.jyoti13@gmai l.com

More information

CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM

CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM CO-CHANNEL SPEECH DETECTION APPROACHES USING CYCLOSTATIONARITY OR WAVELET TRANSFORM Arvind Raman Kizhanatham, Nishant Chandra, Robert E. Yantorno Temple University/ECE Dept. 2 th & Norris Streets, Philadelphia,

More information

DERIVATION OF TRAPS IN AUDITORY DOMAIN

DERIVATION OF TRAPS IN AUDITORY DOMAIN DERIVATION OF TRAPS IN AUDITORY DOMAIN Petr Motlíček, Doctoral Degree Programme (4) Dept. of Computer Graphics and Multimedia, FIT, BUT E-mail: motlicek@fit.vutbr.cz Supervised by: Dr. Jan Černocký, Prof.

More information

Biometric Recognition: How Do I Know Who You Are?

Biometric Recognition: How Do I Know Who You Are? Biometric Recognition: How Do I Know Who You Are? Anil K. Jain Department of Computer Science and Engineering, 3115 Engineering Building, Michigan State University, East Lansing, MI 48824, USA jain@cse.msu.edu

More information

PLACEMENT BROCHURE COMMUNICATION ENGINEERING

PLACEMENT BROCHURE COMMUNICATION ENGINEERING DEPARTMENT OF ELECTRICAL ENGINEERING INDIAN INSTITUTE OF TECHNOLOGY DELHI PLACEMENT BROCHURE 2017-2018 COMMUNICATION ENGINEERING It is with great pleasure that I introduce the students of Communication

More information

Speech Enhancement using Wiener filtering

Speech Enhancement using Wiener filtering Speech Enhancement using Wiener filtering S. Chirtmay and M. Tahernezhadi Department of Electrical Engineering Northern Illinois University DeKalb, IL 60115 ABSTRACT The problem of reducing the disturbing

More information

Front Digital page Strategy and Leadership

Front Digital page Strategy and Leadership Front Digital page Strategy and Leadership Who am I? Prof. Dr. Bob de Wit What concerns me? - How to best lead a firm - How to design the strategy process - How to best govern a country - How to adapt

More information

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter Ching-Ta Lu, Kun-Fu Tseng 2, Chih-Tsung Chen 2 Department of Information Communication, Asia University, Taichung, Taiwan, ROC

More information

Combining Voice Activity Detection Algorithms by Decision Fusion

Combining Voice Activity Detection Algorithms by Decision Fusion Combining Voice Activity Detection Algorithms by Decision Fusion Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti Speech and Image Processing Unit, University of Eastern Finland, Joensuu, Finland

More information

AUTOMATIC MODULATION RECOGNITION OF COMMUNICATION SIGNALS

AUTOMATIC MODULATION RECOGNITION OF COMMUNICATION SIGNALS エシアンゾロナルオフネチュラルアンドアプライヅサエニセズ ISSN: 2186-8476, ISSN: 2186-8468 Print AUTOMATIC MODULATION RECOGNITION OF COMMUNICATION SIGNALS Muazzam Ali Khan 1, Maqsood Muhammad Khan 2, Muhammad Saad Khan 3 1 Blekinge

More information

RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS

RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS Ming XING and Wushan CHENG College of Mechanical Engineering, Shanghai University of Engineering Science,

More information

This list supersedes the one published in the November 2002 issue of CR.

PERIODICALS RECEIVED. This is the current list of periodicals received for review in Reviews. International standard serial numbers (ISSNs) are provided to facilitate obtaining copies of articles or subscriptions.

Voices Obscured in Complex Environmental Settings (VOiCES) corpus

Colleen Richey 2 * and Maria A. Barrios 1 *, Zeb Armstrong 2, Chris Bartels 2, Horacio Franco 2, Martin Graciarena 2, Aaron Lawson 2, Mahesh

ENTRAINMENT IN THE SUPREME COURT

SARAH ITA LEVITAN, DR. JULIA HIRSCHBERG, COLUMBIA UNIVERSITY, DEPARTMENT OF COMPUTER SCIENCE, DREU 201

Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks

Mariam Yiwere 1 and Eun Joo Rhee 2 1 Department of Computer Engineering, Hanbat National University,

Audio Fingerprinting using Fractional Fourier Transform

Swati V. Sutar 1, D. G. Bhalke 2 1 (Department of Electronics & Telecommunication, JSPM's RSCOE college of Engineering Pune, India) 2 (Department,

INTERNATIONAL TELECOMMUNICATION UNION

ITU-T P.835, TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (11/2003). SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS. Methods

An Improved Voice Activity Detection Based on Deep Belief Networks

e-ISSN 2455-1392, Volume 2 Issue 4, April 2016, pp. 676-683. Scientific Journal Impact Factor: 3.468. http://www.ijcter.com Shabeeba T. K.

EIE 528 Power System Operation & Control (2 Units)

Department of Electrical and Information Engineering, Covenant University. 1. EIE528 1.1. EIE 528 Power System Operation & Control (2 Units): Overview of power

Autonomous Face Recognition

CymbIoT Autonomous Face Recognition. SECURITY | URBAN SOLUTIONS | RETAIL. In recent years, face recognition technology has emerged as a powerful tool for law enforcement and on-site

Chapter IV THEORY OF CELP CODING

4.1 Introduction: Waveform coders fail to produce high quality speech at bit rates lower than 16 kbps. Source coders, such as LPC vocoders,

A General Architecture for Self-Adaptive AmI Components Applied in Speech Recognition

AG Softwarearchitecture. Holger Klus, ICSE, 21.05.2006. Outline: Application

Automatic Text-Independent Speaker Recognition Approaches Using Binaural Inputs

Karim Youssef, Sylvain Argentieri and Jean-Luc Zarader. Outline: Automatic speaker recognition: introduction; Designed systems

Vocal Command Recognition Using Parallel Processing of Multiple Confidence-Weighted Algorithms in an FPGA

ECE-492/3 Senior Design Project, Spring 2015, Electrical and Computer Engineering Department, Volgenau

High-speed Noise Cancellation with Microphone Array

Noise Cancellation a Posteriori Probability, Maximum Criteria Independent Component Analysis. We propose the use of a microphone array based on independent

Vision & Industry 4.0: Towards smarter sensors. Dr. Amina Chebira, Vision Embedded Systems, CSEM SA, October 4th, 2016

Outline: Perception and vision; Smarter sensors; Recognition applications; More miniaturization,

Social Big Data. LauritzenConsulting. Content and applications. Key environments and star researchers. Potential for attracting investment

Content and applications: Greater Copenhagen displays a special strength in Social Big Data and data science. This area employs methods from data science, social sciences

FACULTY PROFILE. Total Experience : 18 Years 7 Months Academic : 18 Years 7 Months. Degree Branch / Specialization College University

Name: Dr. P. VETRIVELAN. Designation: Associate Professor. Email ID: vetrivelan.ece@srit.org, vetrivelanece@gmail.com. Area of Specialization: Signal & Image Processing. Total Experience:

Global SNR Estimation of Speech Signals for Unknown Noise Conditions using Noise Adapted Non-linear Regression

INTERSPEECH 2017, August 20-24, 2017, Stockholm, Sweden. Pavlos Papadopoulos, Ruchir Travadi,

Front Digital page Strategy and leadership

Who am I? Prof. Dr. Bob de Wit. What concerns me? - How to best lead a firm - How to design the strategy process - How to best govern a country - How to adapt

Intelligent Power Economy System (Ipes)

American Journal of Engineering Research (AJER), e-ISSN: 2320-0847, p-ISSN: 2320-0936, Volume-02, Issue-08, pp-108-114, www.ajer.org. Research Paper, Open Access. Salman

Audiovisual speech source separation: a regularization method based on visual voice activity detection

Bertrand Rivet 1,2, Laurent Girin 1, Christine Servière 2, Dinh-Tuan Pham 3, Christian Jutten 2 1,2

Voice Activity Detection

Speech Processing, Tom Bäckström, Aalto University, October 2015. Introduction: Voice activity detection (VAD) (or speech activity detection, or speech detection) refers to a class

1997 Annual Surveys of Journalism & Mass Communication Survey of Enrollments Survey of Graduates

Sponsors: AEJMC, ASJMC Council of Affiliates of AEJMC The Freedom Forum National Association of Broadcasters

I. INTRODUCTION II. LITERATURE SURVEY. International Journal of Advanced Networking & Applications (IJANA) ISSN:

A Friend Recommendation System based on Similarity Metric and Social Graphs. Rashmi. J, Dr. Asha. T, Department of Computer Science, Bangalore Institute of Technology, Bangalore, Karnataka, India rash003.j@gmail.com,

Journal Title ISSN 5. MIS QUARTERLY BRIEFINGS IN BIOINFORMATICS

List of Journals with impact factors. Date retrieved: 1 August 2009. Columns: Journal Title, ISSN, Impact Factor, 5-Year Impact Factor. 1. ACM SURVEYS, 0360-0300, 9.920, 14.672. 2. VLDB JOURNAL, 1066-8888, 6.800, 9.164. 3. IEEE

Slovak University of Technology and Planned Research in Voice De-Identification. Anna Pribilova

SLOVAK UNIVERSITY OF TECHNOLOGY IN BRATISLAVA: the oldest and the largest university of technology in Slovakia

Convolutional Neural Networks: Real Time Emotion Recognition

Bruce Nguyen, William Truong, Harsha Yeddanapudy. Motivation: Machine emotion recognition has long been a challenge and popular topic in the

The Effects of Noise on Acoustic Parameters

* 1 Turgut Özseven and 2 Muharrem Düğenci 1 Turhal Vocational School, Gaziosmanpaşa University, Turkey * 2 Faculty of Engineering, Department of Industrial Engineering

Data processing framework for decision making

Jan Larsen, Intelligent Signal Processing Group, Department of Informatics and Mathematical Modelling, Technical University of Denmark. jl@imm.dtu.dk, www.imm.dtu.dk/~jl

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

Kuldeep Kumar 1, R. K. Aggarwal 1 and Ankita Jain 2 1 Department of Computer Engineering, National Institute

A New Framework for Supervised Speech Enhancement in the Time Domain

Interspeech 2018, 2-6 September 2018, Hyderabad. Ashutosh Pandey 1 and Deliang Wang 1,2 1 Department of Computer Science and Engineering,

Proposers Day Workshop

Monday, January 23, 2017 @srcjump, #JUMPpdw. Cognitive Computing Vertical Research Center. Mandy Pant, Academic Research Director, Intel Corporation. Center Motivation: Today's deep learning

Policing in the 21st Century. Response from the Royal Academy of Engineering to the Home Affairs Select Committee

February 2008. Introduction: The Royal Academy of Engineering is pleased to contribute to
