ロボティクスと深層学習. Robotics and Deep Learning. Keywords: robotics, deep learning, multimodal learning, end to end learning, sequence to sequence learning.

Similar documents
arxiv: v2 [cs.lg] 13 Nov 2015

Playing FPS Games with Deep Reinforcement Learning

REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING

Attention-based Multi-Encoder-Decoder Recurrent Neural Networks

Insertion of Pause in Drawing from Babbling for Robot s Developmental Imitation Learning

Playing Atari Games with Deep Reinforcement Learning

ジェスチャ併用型 Voice-to-MIDI システムの提案 第五回知識創造支援システムシンポジウム報告書 : 本著作物の著作権は著者に帰属します

Pengju

Deep learning architectures for music audio classification: a personal (re)view

Emergence of Interactive Behaviors between Two Robots by Prediction Error Minimization Mechanism

Extracting Multimodal Dynamics of Objects Using RNNPB

Deep Learning Basics Lecture 9: Recurrent Neural Networks. Princeton University COS 495 Instructor: Yingyu Liang

Powering AI Robots with Deep Learning

Deep Imitation Learning for Playing Real Time Strategy Games

車載カメラにおける信号機認識および危険運転イベント検知 Traffic Light Recognition and Detection of Dangerous Driving Events from Surveillance Video of Vehicle Camera

Unsupervised Minimax: nets that fight each other

Food Image Recognition Using Deep Convolutional Network with Pre-training and Fine-tuning

Tutorial of Reinforcement: A Special Focus on Q-Learning

Attention-based Information Fusion using Multi-Encoder-Decoder Recurrent Neural Networks

Karol Hausman Research Scientist Intern at Google DeepMind, London, UK Adviser: Prof. Martin Riedmiller

arxiv: v1 [cs.ro] 28 Feb 2017

Lecture 23 Deep Learning: Segmentation

Detection and Segmentation. Fei-Fei Li & Justin Johnson & Serena Yeung. Lecture 11 -

Online Knowledge Acquisition and General Problem Solving in a Real World by Humanoid Robots

Swing Copters AI. Monisha White and Nolan Walsh Fall 2015, CS229, Stanford University

TED コーパスを使った プレゼンにおける効果的な 英語表現の抽出

Learning to Represent Haptic Feedback for Partially-Observable Tasks

Wadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology

Audio Effects Emulation with Neural Networks

arxiv: v1 [cs.ai] 9 Oct 2017

Deep RL For Starcraft II

小川憲一 京都大学医学部附属病院放射線部 1. 方法 1-1. A System of Setting Exposure Conditions in General X-rays Using a Calculating Formula

Orchestrating Game Generation Antonios Liapis

Success Stories of Deep RL. David Silver

arxiv: v1 [cs.lg] 30 May 2016

Improvised Robotic Design with Found Objects

Playing CHIP-8 Games with Reinforcement Learning

Survivor Identification and Retrieval Robot Project Proposal

Audio Effects Emulation with Neural Networks

Playing Geometry Dash with Convolutional Neural Networks

Artificial Intelligence and Deep Learning

Recurrent neural networks Modelling sequential data. MLP Lecture 9 Recurrent Networks 1

Neural Network Part 4: Recurrent Neural Networks

Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising

Continuous Gesture Recognition Fact Sheet

Image Manipulation Detection using Convolutional Neural Network

Learning from Hints: AI for Playing Threes

A Deep Q-Learning Agent for the L-Game with Variable Batch Training

レーダー流星ヘッドエコー DB 作成グループ (murmhed at nipr.ac.jp) 本規定は レーダー流星ヘッドエコー DB 作成グループの作成した MU レーダー流星ヘッド エコーデータベース ( 以下 本データベース ) の利用方法を定めるものである

Transfer Deep Reinforcement Learning in 3D Environments: An Empirical Study

Augmenting Self-Learning In Chess Through Expert Imitation

This guide will show you the proper way to replace the Nikon Coolpix P600 Compact Digital Camera lens.

Adaptive Humanoid Robot Arm Motion Generation by Evolved Neural Controllers

Elena Corina Grigore

REAL TIME EMULATION OF PARAMETRIC GUITAR TUBE AMPLIFIER WITH LONG SHORT TERM MEMORY NEURAL NETWORK

Temporal Difference Learning for the Game Tic-Tac-Toe 3D: Applying Structure to Neural Networks

Learning the Proprioceptive and Acoustic Properties of Household Objects. Jivko Sinapov Willow Collaborators: Kaijen and Radu 6/24/2010

Hand Gesture Recognition by Means of Region- Based Convolutional Neural Networks

Prof. Sameer Singh CS 175: PROJECTS IN AI (IN MINECRAFT) WINTER April 6, 2017

Selecting Subgoals using Deep Learning in Minecraft: A Preliminary Report

Optic Flow Based Skill Learning for A Humanoid to Trap, Approach to, and Pass a Ball

Mastering the game of Go without human knowledge

Driving Using End-to-End Deep Learning

相関語句 ( 定型のようになっている語句 ) の表現 1. A is to B what C is to D. A と B の関係は C と D の関係に等しい Leaves are to the plant what lungs are to the animal.

arxiv: v4 [cs.ro] 21 Jul 2017

Constructivist Approach to Human-Robot Emotional Communication - Design of Evolutionary Function for WAMOEBA-3 -

Real-time human control of robots for robot skill synthesis (and a bit

INTELLIGENCE EXPLOSION: SCIENCE OR FICTION? Bart Selman Cornell University

Robotics at OpenAI. May 1, 2017 By Wojciech Zaremba

Omochi rabbit amigurumi pattern

Emergence of Purposive and Grounded Communication through Reinforcement Learning

Impact of Automatic Feature Extraction in Deep Learning Architecture

Birth of An Intelligent Humanoid Robot in Singapore

VISUAL ANALOGIES BETWEEN ATARI GAMES FOR STUDYING TRANSFER LEARNING IN RL

Associated Emotion and its Expression in an Entertainment Robot QRIO

Deep Reinforcement Learning for General Video Game AI

arxiv: v2 [cs.lg] 6 Mar 2018

Intermediate Conversation Material #10

Magellan Systems Japan, Inc.

Co-Creative Level Design via Machine Learning

Sim-to-Real Transfer with Neural-Augmented Robot Simulation

arxiv: v2 [cs.ne] 8 Mar 2016

Sensor system of a small biped entertainment robot

Heriot-Watt University, UK

Service Research and Innovation in Japan

11/13/18. Introduction to RNNs for NLP. About Me. Overview SHANG GAO

CURRICULUM VITAE. Faustino John Gomez March 3, 2016 CEO and Vice-President, NNAISENSE Via Zurigo 5 Lugano, CH 6900

Carnegie Mellon University, University of Pittsburgh

Vision-Based Robot Learning for Behavior Acquisition

[ 言語情報科学論 A] 統計的言語モデル,N-grams

Decisions in games Minimax algorithm α-β algorithm Tic-Tac-Toe game

Deep Learning is Evolving into the Key Technology of Artificial Intelligence. Sepp Hochreiter

A Co-worker Robot PaDY" for Automobile Assembly Line

Introduction to Machine Learning

Adversarial examples in Deep Neural Networks. Luiz Gustavo Hafemann Le Thanh Nguyen-Meidine

A Fuller Understanding of Fully Convolutional Networks. Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16

Recurrent neural networks Modelling sequential data. MLP Lecture 9 / 13 November 2018 Recurrent Neural Networks 1: Modelling sequential data 1

Supporting Communications in Global Networks. Kevin Duh & 歐陽靖民

FEATURE COMBINATION AND STACKING OF RECURRENT AND NON-RECURRENT NEURAL NETWORKS FOR LVCSR

Behavior Acquisition via Vision-Based Robot Learning

Transcription:

210 31 2 2016 3 ニューラルネットワーク研究のフロンティア ロボティクスと深層学習 Robotics and Deep Learning 尾形哲也 Tetsuya Ogata Waseda University. ogata@waseda.jp, http://ogata-lab.jp/ Keywords: robotics, deep learning, multimodal learning, end to end learning, sequence to sequence learning. 1. はじめに Deep Learning DL,,,,.DL, 10.,,,,,.,,,.,,,,DL.,. DL,,, DL.,,,,,,DL..,,,, DL., DL.,2,3,, 4,. 5. 2. 認識と状態評価,,.,, DL.,Ian Lenz,DL 4 [Lenz 14] 1. Y. Yang,DL CNN Convolutional Neural Network,YouTube, 48 6 [Yang 15]. 図 1 Baxter [Lenz 14]

211,DL. DL,Deep Q-Learning [Mnih 13].,, s a Q,.,. Q-Learning. Deep Q-Learning,DL. Atari2600 7, 84 84 pixel,4,4 18, Q.,, DL Q.,. DL,.,DL.,,., End to End Learning,DL. S. Levine,PR2, 1 CNN,.,, [Levine 15]. 2., End to End Learning,,. 1,. 3. 運動の生成 Deep Q-Learning,,.,, 4,, 6. s, a,,., DL,,,.,,.,.,. DL, 図 2 PR2 [Levine 15] S. Levine, [Noda 14].,,. CNN, Time-delay Deep Autoencoder DA,. 3.,,, DA. Aldebaran Robotics NAO,6 4. 6 10.,3 000 4 000 DA, 30.

212 31 2 2016 3 図 3 DL 図 5 Ball lift Ball roll Ball ring L 図 4 NAO, HMM DA.,30 1,1 Time delay NN.,,. 5.,, 6.,,.,,, DL,.,.,,,,3 000 図 6. a, b 図 7 PR2, 6. DL,,.,. PR2,7 7 [ 16]. 27 19 cm.,,

213,,.,DL,,,.. 4. 言語と動作,.,,.,2 Y. Yang,DL,,., Recurrent Neural Network RNN,..RNN,, DL Back Propagation Through Time BPTT. DL., RNN,MTRNN Multi Timescale RNN [Yamashita 14] LSTM Long Short Term Memory [Hochreiter 97], RNN. RNN,,. Sequence to Sequence Learning [Sutskever 14], 2 RNN., End to End Learning.,RNN DL,.,Google Vinyals CNN, RNN, Image Caption Generator [Levine 15, Vinyals 15]., Sequence to Sequence Learning,., [Yamada 15]. 8. 図 8 Sequence to Sequence Learning RNN,,,.,. 9. 1 NAO.NAO, 2, Red, Green, Blue,., 3. 3.,.,.,,,,.,,,.,,,RNN. 図 9

214 31 2 2016 3 図 10 3,5 RNN,,.,,. 1,2..,,. 10 3,5.,Red, Green, Blue,. 1,2, 3,5..,,, MTRNN LSTM,. 5. まとめ,DL,,,,,.1,DL,,.,,. DL RNN,,. DL,RNN. DL,. DL,Batch,,. DL Batch,. RNN,,.DL,.,,.DL,,,.4 RNN,, [Takahashi 15],. DL.,,,3 4,,,.,.,Pinto Baxter 5 700 [Pinto 15].,.,,. [JST CREST 15].,,,.,

215 DeepMind AlphaGo, CNN, CNN [Silver 16]. DL,.,.,,,.. 4,DL RNN, 1,2, 3,5. RNN.,,..,, DL RNN,. 謝辞,JST..,,,,,,.. 参考文献 [Hochreiter 97] Hochreiter, S. and Schmidhuber, J.: Long shortterm memory, Neural Computation, Vol. 9 No. 8, pp. 1735-1780 1997 [JST CREST 15] JST CREST, [Lenz 14] Lenz, I., Lee, H. and Saxena, A.: Deep learning for detecting robotic grasps, Int. J. Robotics Research IJRR 2014 [Levine 15] Levine, S., Finn, C., Darrell, T. and Abbeel, P.: Endto-End Training of Deep Visuomotor Policies, arxiv:1504.00702 2015 [Mnih 13] Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D. and Riedmiller, M.: Playing atari with deep reinforcement learning, Deep Learning Workshop NIPS 2013 2013 [Noda 14] Noda, K., Arie, H., Suga, Y. and Ogata, T.: Multimodal integration learning of robot behavior using deep neural networks, Robotics and Autonomous Systems, Vol. 62, No. 6, pp. 721-736 2014 [Pinto 15] Pinto, L. and Gupta, A.: Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours, arxiv:1509.06825 2015 [Silver 16] Silver, D., et al.: Mastering the game of Go with deep neural networks and tree search, Nature, Vol. 529, Issue 7587, pp. 484-489 2016 [Sutskever 14] Sutskever, I., Vinyals, O. and Le, Q. V.: Sequence to sequence learning with neural networks, NIPS 2014, pp. 3104-3112 2014 [ 16],,Gordon Cheng,, 78 2016 [Takahashi 15] Takahashi, K., Ogata, T., Yamada, H., Tjandra, H. and Sugano, S.: Effective motion learning for a flexible-joint robot using motor babbling, Proc. 2015 IEEE/RAS Int. Conf. on Intelligent Robots and Systems IROS 2015 2015 [Vinyals 15] Vinyals, O., Toshev, A., Bengio, S. and Erhan, D.: Show and Tell: A Neural Image Caption Generator, arxiv:1411.4555 2015 [Yamada 15a] Yamada, T., Murata, S., Arie, H. and Ogata, T.: Attractor representations of language-behavior structure in a recurrent neural network for human-robot interaction, Proc. 2015 IEEE/RAS Int. Conf. on Intelligent Robots and Systems IROS 2015 2015 [Yamashita 08] Yamashita, Y. and Tani, J.: Emergence of functional hierarchy in a multiple timescale neural network model: a humanoid robot experiment, PLoS Computational Biology, Vol. 4, Issue. 11, e1000220 2008 [Yang 15] Yang, Y., Li, Y., Fermüller, C. and Aloimonos, Y.: Robot learning manipulation action plans by Watching unconstrained videos from the world wide web, 28th AAAI Conf. on Artificial Intelligence 2015 2016 1 18 著者紹介 尾形哲也 1993.1997 DC2,1999,2001,2003,2005 2007,2012..2009 15 JST,2015..,,,,,IEEE.