AAU SUMMER SCHOOL PROGRAMMING SOCIAL ROBOTS FOR HUMAN INTERACTION LECTURE 10 MULTIMODAL HUMAN-ROBOT INTERACTION

Similar documents
Designing and Implementing an Interactive Social Robot from Off-the-shelf Components

Selected Research Signal & Information Processing Group

Building Perceptive Robots with INTEL Euclid Development kit

GESTURE BASED HUMAN MULTI-ROBOT INTERACTION. Gerard Canal, Cecilio Angulo, and Sergio Escalera

Booklet of teaching units

Multi-Modal User Interaction

Automatic Text-Independent. Speaker. Recognition Approaches Using Binaural Inputs

Multi-modal Human-computer Interaction

MATLAB DIGITAL IMAGE/SIGNAL PROCESSING TITLES

Vision & Industry 4.0: Towards smarter sensors. Dr. Amina Chebira Vision Embedded Systems, CSEM SA October 4 th, 2016

Recognizing Military Gestures: Developing a Gesture Recognition Interface. Jonathan Lebron

Multimodal Research at CPK, Aalborg

1 Publishable summary

HUMAN-COMPUTER INTERACTION: OVERVIEW ON STATE OF THE ART TECHNOLOGY

DEEP DIVE ON AZURE ML FOR DEVELOPERS

Computer Vision in Human-Computer Interaction

Session 2: 10 Year Vision session (11:00-12:20) - Tuesday. Session 3: Poster Highlights A (14:00-15:00) - Tuesday 20 posters (3minutes per poster)

Short Course on Computational Illumination

Post-Graduate Program in Computer Engineering (PPG-EC)

Definitions and Application Areas

Pratibha -International Journal of Computer Science information and Engg., Technologies ISSN

Student Attendance Monitoring System Via Face Detection and Recognition System

Perceptual Interfaces. Matthew Turk s (UCSB) and George G. Robertson s (Microsoft Research) slides on perceptual p interfaces

MIN-Fakultät Fachbereich Informatik. Universität Hamburg. Socially interactive robots. Christine Upadek. 29 November Christine Upadek 1

Controlling Humanoid Robot Using Head Movements

Towards Intuitive Industrial Human-Robot Collaboration

HUMAN-COMPUTER INTERACTION: OVERVIEW ON STATE OF THE ART

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

Advanced Man-Machine Interaction

Following Dirt Roads at Night-Time

Lecturers. Alessandro Vinciarelli

Understanding the Mechanism of Sonzai-Kan

INTELLIGENT WHEELCHAIRS

Contents. Part I: Images. List of contributing authors XIII Preface 1

Wheel Health Monitoring Using Onboard Sensors

Dorothy Monekosso. Paolo Remagnino Yoshinori Kuno. Editors. Intelligent Environments. Methods, Algorithms and Applications.

Sven Wachsmuth Bielefeld University

MULTIMODAL EMOTION RECOGNITION FOR ENHANCING HUMAN COMPUTER INTERACTION

Multimedia Forensics

Introduction to Talking Robots

DIGITAL SIGNAL PROCESSING. Introduction

Essential Understandings with Guiding Questions Robotics Engineering

Intro to AI. AI is a huge field. AI is a huge field 2/19/15. What is AI. One definition:

Job Description. Commitment: Must be available to work full-time hours, M-F for weeks beginning Summer of 2018.

interactive IP: Perception platform and modules

AI Frontiers. Dr. Dario Gil Vice President IBM Research

Major Project SSAD. Mentor : Raghudeep SSAD Mentor :Manish Jha Group : Group20 Members : Harshit Daga ( ) Aman Saxena ( )

Object Category Detection using Audio-visual Cues

Autonomous Vehicle Speaker Verification System

Mobile Target Tracking Using Radio Sensor Network

Visual Interpretation of Hand Gestures as a Practical Interface Modality

Announcements. HW 6: Written (not programming) assignment. Assigned today; Due Friday, Dec. 9. to me.

CAPACITIES FOR TECHNOLOGY TRANSFER

Cognitive Media Processing

S&T Stakeholders Conference

Prof. Subramanian Ramamoorthy. The University of Edinburgh, Reader at the School of Informatics

Audio data fuzzy fusion for source localization

Auditory System For a Mobile Robot

Control Design Made Easy By Ryan Gordon

Keywords Mobile Phones, Accelerometer, Gestures, Hand Writing, Voice Detection, Air Signature, HCI.

A*STAR Unveils Singapore s First Social Robots at Robocup2010

Lecture 2: Sensors. Zheng-Hua Tan

Intro to AI. AI is a huge field. AI is a huge field 2/26/16. What is AI (artificial intelligence) What is AI. One definition:

Generating Personality Character in a Face Robot through Interaction with Human

Introduction to Haptics

Dimension Reduction of the Modulation Spectrogram for Speaker Verification

Human Identification from Video: A Summary of Multimodal Approaches

BIOMETRIC IDENTIFICATION USING 3D FACE SCANS

Microphone Array Design and Beamforming

Multi-modal Human-Computer Interaction. Attila Fazekas.

Deep Learning Overview

Introduction to Talking Robots. Graham Wilcock Adjunct Professor, Docent Emeritus University of Helsinki

BRAIN COMPUTER INTERFACE (BCI) RESEARCH CENTER AT SRM UNIVERSITY

Prof Trivedi ECE253A Notes for Students only

INTRODUCTION TO DEEP LEARNING. Steve Tjoa June 2013

Reconfigurable ROS-based Resilient Reasoning Robotic Cooperating Systems. D93.62 Industrial exhibition and event

Research Seminar. Stefano CARRINO fr.ch

Performance study of Text-independent Speaker identification system using MFCC & IMFCC for Telephone and Microphone Speeches

Feel the beat: using cross-modal rhythm to integrate perception of objects, others, and self

What s Hot? The M&A and Funding Landscape for Embedded Vision Companies

ENHANCED HUMAN-AGENT INTERACTION: AUGMENTING INTERACTION MODELS WITH EMBODIED AGENTS BY SERAFIN BENTO. MASTER OF SCIENCE in INFORMATION SYSTEMS

Activity monitoring and summarization for an intelligent meeting room

Finding Lips in Unconstrained Imagery for Improved Automatic Speech Recognition

RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS

Multimodal Face Recognition using Hybrid Correlation Filters

Associated Emotion and its Expression in an Entertainment Robot QRIO

ARMY RDT&E BUDGET ITEM JUSTIFICATION (R2 Exhibit)

Reduction of Musical Residual Noise Using Harmonic- Adapted-Median Filter

Title Goes Here Algorithms for Biometric Authentication

Using Gestures to Interact with a Service Robot using Kinect 2

Office hrs: QC: Tue, 1:40pm - 2:40pm; GC: Thur: 11:15am-11:45am.or by appointment.

EFFICIENT ATTENDANCE MANAGEMENT SYSTEM USING FACE DETECTION AND RECOGNITION

ISSN Vol.02,Issue.17, November-2013, Pages:

Outline. Tracking with Unreliable Node Sequences. Abstract. Outline. Outline. Abstract 10/20/2009

Requirement Definition

The MARCS Institute for Brain, Behaviour and Development

AI for Autonomous Ships Challenges in Design and Validation

Model-Based Design for Sensor Systems

Modeling Software Systems in Experimental Robotics for Improved Reproducibility

CORC 3303 Exploring Robotics. Why Teams?

Transcription:

AAU SUMMER SCHOOL PROGRAMMING SOCIAL ROBOTS FOR HUMAN INTERACTION LECTURE 10 MULTIMODAL HUMAN-ROBOT INTERACTION COURSE OUTLINE 1. Introduction to Robot Operating System (ROS) 2. Introduction to isociobot and NAO robot, and demos 3. Social Robots and Applications 4. Machine Learning and Pattern Recognition 5. Speech Processing I: Acquisition of Speech, Feature Extraction and Speaker Localization 6. Speech Processing II: Speaker Identification and Speech Recognition 7. Image Processing I: Image Acquisition, Pre-processing and Feature Extraction 8. Image Processing II: Face Detection and Face Recognition 9. User Modelling 10. Multimodal Human-Robot Interaction 5 AUGUST 2015 AALBORG UNIVERSITY 2 1

MULTIMODAL INTERACTION WHAT? IN THE CONTEXT OF HUMAN COMPUTER INTERACTION, A MODALITY IS THE CLASSIFICATION OF A SINGLE INDEPENDENT CHANNEL OF SENSORY INPUT/OUTPUT BETWEEN A COMPUTER AND A HUMAN. A SYSTEM IS DESIGNATED UNIMODAL IF IT HAS ONLY ONE MODALITY IMPLEMENTED, AND MULTIMODAL IF IT HAS MORE THAN ONE. KARRAY, FAKHREDDINE, ET AL. "HUMAN-COMPUTER INTERACTION: OVERVIEW ON STATE OF THE ART." (2008). 5 AUGUST 2015 AALBORG UNIVERSITY 3 MULTIMODAL INTERACTION WHY? Extended functionality, e.g. we can speak to the robot instead of typing Human-Human like communication 5 AUGUST 2015 AALBORG UNIVERSITY 4 2

MULTIMODAL INTERACTION WHY? Robustness against noise Data from one modality might be very noisy, however the rest are not combine the modalities Person Identification: background music is corrupting recorded speech, however vision is unaltered. Speech Recognition: background music is corrupting recorded speech, however vision can be used to recognize lip movements and classify words How to know which modality to trust? 5 AUGUST 2015 AALBORG UNIVERSITY 5 MULTIMODAL INTERACTION WHY? Provide new information, which could not be provided by individual modalities Combination of sound + facial expression = emotion Learning: Sometimes only one modality is available, but noisy Use knowledge from one modality to re-train/adapt model in other domain Examples: Person Identification, Direction of Attention 5 AUGUST 2015 AALBORG UNIVERSITY 6 3

05/08/15 D U R A B L E I N T E R A C T I O N W I T H S O C I A L LY I N T E L L I G E N T ROBOTS isociobot Research project supported by The Danish Council for Independent R e s e a r c h Te c h n o l o g y a n d P r o d u c t i o n S c i e n c e s, M i n i s t r y o f Science, Innovation and Higher Education To m a k e r o b o t s s o c i a l l y i n t e l l i g e n t a n d c a p a b l e o f e s t a b l i s h i n g durable relationship with their users Multi-modal: speech, vision, facial expression etc. 5 AUGUST 2015 AALBORG UNIVERSITY 7 AALBORG UNIVERSITY 8 H A R D WA R E First generation 5 AUGUST 2015 4

HARDWARE First generation 5 AUGUST 2015 AALBORG UNIVERSITY 9 HARDWARE Second generation 5 AUGUST 2015 AALBORG UNIVERSITY 10 5

HARDWARE Second generation What changed? New body material and shape New ears ipad (input and output) New robot base (Pioneer P3-DX) 5 AUGUST 2015 AALBORG UNIVERSITY 11 SOFTWARE System OS: UBUNTU Robot OS: ROS A great framework for each module/function to communicate Widely used and high-quality software available Open-source Support Python or C 5 AUGUST 2015 AALBORG UNIVERSITY 12 6

SOFTWARE 5 AUGUST 2015 AALBORG UNIVERSITY 13 DEMOS The Day of Research 2014 5 AUGUST 2015 AALBORG UNIVERSITY 14 7

DEMOS Sikker 7 in Nibe 5 AUGUST 2015 AALBORG UNIVERSITY 15 DEMOS The Culture Night 2014 5 AUGUST 2015 AALBORG UNIVERSITY 16 8

DEMOS The people s meeting 2015 5 AUGUST 2015 AALBORG UNIVERSITY 17 FUTURE WORK Research User modeling Reinforcement fusion Collaboration/Application: Future Nursing Home Potential application: Playing/learning with children 5 AUGUST 2015 AALBORG UNIVERSITY 18 9

COURSE OUTLINE 1. Introduction to Robot Operating System (ROS) 2. Introduction to isociobot and NAO robot, and demos 3. Social Robots and Applications 4. Machine Learning and Pattern Recognition 5. Speech Processing I: Acquisition of Speech, Feature Extraction and Speaker Localization 6. Speech Processing II: Speaker Identification and Speech Recognition 7. Image Processing I: Image Acquisition, Pre-processing and Feature Extraction 8. Image Processing II: Face Detection and Face Recognition 9. User Modelling 10. Multimodal Human-Robot Interaction 5 AUGUST 2015 AALBORG UNIVERSITY 19 10