Behaviour Patterns Evolution on Individual and Group Level. Stanislav Slušný, Roman Neruda, Petra Vidnerová. CIMMACS 07, December 14, Tenerife

Similar documents
Behavior Emergence in Autonomous Robot Control by Means of Feedforward and Recurrent Neural Networks

Implicit Fitness Functions for Evolving a Drawing Robot

Swarm Intelligence W7: Application of Machine- Learning Techniques to Automatic Control Design and Optimization

Evolving non-trivial Behaviors on Real Robots: an Autonomous Robot that Picks up Objects

EMERGENCE OF COMMUNICATION IN TEAMS OF EMBODIED AND SITUATED AGENTS

! The architecture of the robot control system! Also maybe some aspects of its body/motors/sensors

CYCLIC GENETIC ALGORITHMS FOR EVOLVING MULTI-LOOP CONTROL PROGRAMS

Evolved Neurodynamics for Robot Control

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016

Using Cyclic Genetic Algorithms to Evolve Multi-Loop Control Programs

Evolutionary Robotics. IAR Lecture 13 Barbara Webb

Evolving Spiking Neurons from Wheels to Wings

Adaptive Neuro-Fuzzy Controler With Genetic Training For Mobile Robot Control

Evolution of Acoustic Communication Between Two Cooperating Robots

Biologically Inspired Embodied Evolution of Survival

Evolving communicating agents that integrate information over time: a real robot experiment

Navigating in a dynamic world

On The Role of the Multi-Level and Multi- Scale Nature of Behaviour and Cognition

Evolving Neural Mechanisms for an Iterated Discrimination Task: A Robot Based Model

Developing Frogger Player Intelligence Using NEAT and a Score Driven Fitness Function

Breedbot: An Edutainment Robotics System to Link Digital and Real World

Subsumption Architecture in Swarm Robotics. Cuong Nguyen Viet 16/11/2015

Evolving Predator Control Programs for an Actual Hexapod Robot Predator

Learning Reactive Neurocontrollers using Simulated Annealing for Mobile Robots

INTELLIGENT CONTROL OF AUTONOMOUS SIX-LEGGED ROBOTS BY NEURAL NETWORKS

Evolving Robot Behaviour at Micro (Molecular) and Macro (Molar) Action Level

MINE 432 Industrial Automation and Robotics

Multi-Robot Learning with Particle Swarm Optimization

Learning Behaviors for Environment Modeling by Genetic Algorithm

Institute of Psychology C.N.R. - Rome. Evolving non-trivial Behaviors on Real Robots: a garbage collecting robot

RoboCup. Presented by Shane Murphy April 24, 2003

Evolutions of communication

Ezequiel Di Mario, Iñaki Navarro and Alcherio Martinoli. Background. Introduction. Particle Swarm Optimization

THE EFFECT OF CHANGE IN EVOLUTION PARAMETERS ON EVOLUTIONARY ROBOTS

Evolving Mobile Robots in Simulated and Real Environments

IMPROVING TOWER DEFENSE GAME AI (DIFFERENTIAL EVOLUTION VS EVOLUTIONARY PROGRAMMING) CHEAH KEEI YUAN

61. Evolutionary Robotics

Neuro-Fuzzy and Soft Computing: Fuzzy Sets. Chapter 1 of Neuro-Fuzzy and Soft Computing by Jang, Sun and Mizutani

Evolving CAM-Brain to control a mobile robot

Deep Neural Networks (2) Tanh & ReLU layers; Generalisation and Regularisation

Evolution of communication-based collaborative behavior in homogeneous robots

GPU Computing for Cognitive Robotics

Contents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems

Curiosity as a Survival Technique

Supplementary information accompanying the manuscript Biologically Inspired Modular Neural Control for a Leg-Wheel Hybrid Robot

Genetic Evolution of a Neural Network for the Autonomous Control of a Four-Wheeled Robot

Behaviour-Based Control. IAR Lecture 5 Barbara Webb

Holland, Jane; Griffith, Josephine; O'Riordan, Colm.

Evolution in Robotic Islands

Neural Networks for Real-time Pathfinding in Computer Games

Emergence of Purposive and Grounded Communication through Reinforcement Learning

Adaptive Humanoid Robot Arm Motion Generation by Evolved Neural Controllers

Control system of person following robot: The indoor exploration subtask. Solaiman. Shokur

Evolutionary robotics Jørgen Nordmoen

Genetic Robots Play Football. William Jeggo BSc Computing

This list supersedes the one published in the November 2002 issue of CR.

1 Introduction. w k x k (1.1)

Multi-Robot Coordination. Chapter 11

Evolving High-Dimensional, Adaptive Camera-Based Speed Sensors

GA-based Learning in Behaviour Based Robotics

Forecasting Exchange Rates using Neural Neworks

One computer theorist s view of cognitive systems

Online Interactive Neuro-evolution

Service Robots in an Intelligent House

Université Libre de Bruxelles

COMPUTATONAL INTELLIGENCE

Online Evolution for Cooperative Behavior in Group Robot Systems

CS 441/541 Artificial Intelligence Fall, Homework 6: Genetic Algorithms. Due Monday Nov. 24.

Optimization Maze Robot Using A* and Flood Fill Algorithm

Creating a Poker Playing Program Using Evolutionary Computation

The Behavior Evolving Model and Application of Virtual Robots

Embodiment from Engineer s Point of View

Non-Invasive Brain-Actuated Control of a Mobile Robot

Université Libre de Bruxelles

A COMPARISON OF ARTIFICIAL NEURAL NETWORKS AND OTHER STATISTICAL METHODS FOR ROTATING MACHINE

Behavior and Cognition as a Complex Adaptive System: Insights from Robotic Experiments

A Genetic Algorithm-Based Controller for Decentralized Multi-Agent Robotic Systems

Publication P IEEE. Reprinted with permission.

Chapter 1: Introduction to Neuro-Fuzzy (NF) and Soft Computing (SC)

Performance Comparison of Power Control Methods That Use Neural Network and Fuzzy Inference System in CDMA

Introduction to Machine Learning

Efficient Learning in Cellular Simultaneous Recurrent Neural Networks - The Case of Maze Navigation Problem

Synthetic Brains: Update

Stock Price Prediction Using Multilayer Perceptron Neural Network by Monitoring Frog Leaping Algorithm

Robots in the Loop: Supporting an Incremental Simulation-based Design Process

Push Path Improvement with Policy based Reinforcement Learning

Prof. Emil M. Petriu 17 January 2005 CEG 4392 Computer Systems Design Project (Winter 2005)

5a. Reactive Agents. COMP3411: Artificial Intelligence. Outline. History of Reactive Agents. Reactive Agents. History of Reactive Agents

Retaining Learned Behavior During Real-Time Neuroevolution

How the Body Shapes the Way We Think

The Khepera Robot and the krobot Class: A Platform for Introducing Robotics in the Undergraduate Curriculum i

Supporting the Design of Self- Organizing Ambient Intelligent Systems Through Agent-Based Simulation

COOPERATIVE STRATEGY BASED ON ADAPTIVE Q- LEARNING FOR ROBOT SOCCER SYSTEMS

The Articial Evolution of Robot Control Systems. Philip Husbands and Dave Cli and Inman Harvey. University of Sussex. Brighton, UK

Encouraging Creative Thinking in Robots Improves Their Ability to Solve Challenging Problems

Neural Models for Multi-Sensor Integration in Robotics

C - Underground Exploration

Evolution, Self-Organisation and Swarm Robotics

Body articulation Obstacle sensor00

Implementing obstacle avoidance and follower behaviors on Koala robots using Numerical P Systems

Transcription:

Behaviour Patterns Evolution on Individual and Group Level Stanislav Slušný, Roman Neruda, Petra Vidnerová Department of Theoretical Computer Science Institute of Computer Science Academy of Science of the Czech Republic {slusny,roman,petra}@cs.cas.cz CIMMACS 07, December 14, Tenerife

Outline Introduction 1 Introduction 2 3 4

Introduction Behaviour Emergence study ability of autonomous agents to develop desired behaviour learning achieved by interactions with environment Evolutionary Robotics neural networks evolutionary algorithms, genetic algorithms Khepera robots

The slow small short-sighted robot mobile robot, 70 mm in diameter, 80 g two lateral wheels (rotate in both directions) 8 active infrared light sensors Motorola 68331, 25 MHz, 512 KB RAM

Evolutionary robotics = Neural Networks + Genetic Algorithms design of intelligent agent (robot) by self-organization process based on artificial evolution reactive agent - i.e. no memory

Neural Networks Multilayer Perceptrons (MLP) feed-forward neural network neuron output: y(x) = ϕ ( n i=1 w ix i ) activation function: logistic sigmoid Other Architectures Elman s networks (recurrent) RBF networks

Genetic Algorithm Individual (genom) encodes weights of neural network real encoding Fitness function quality measure of solution Fitness Evaluation 1. initialize environment 2. place robot at random start point 3. run robot for given number of steps or until it crashes

Introduction Goal evaluate feasibility of evolutionary learning for basic tasks, such as avoiding obstacles, exploration, etc. 2 experiments: individual exploration, collective exploration Methodology YAKS (Yet Another Khepera Simulator), open source different environments for learning testing tests on real robot each experiments repeated 10 times

Individual maze exploration Task robot is placed into maze, arena 60 30 cm its goal is to fully explore the maze Learning small, quite simple maze fitness: 250 simulation steps, 4 trials

Individual Maze Exploration Fitness evaluation move and avoid obstacles: T k,j = V k,j (1 V k,j )(1 i k,j ) mean evaluation for one step S j = 250 T k,j k=1 250 bonus for reaching target zone j = 1 Fitness = 4 j=1 (S j + j ) V k,j = v l + v r 0, 1 V k,j i k,j 0, 1 sum of motor rates left and right motor difference highest sensor value

Evolved behaviour: robot in big maze (video)

Collective exploration Task team of 3 robots, one of them team leader their goal is to reach target arena Setup leader is equipped with light bulb, others can sense light all robots have 8 sensors in active and passive mode, and ground sensor (17 inputs of NN) each trial 500 simulation steps, leader situated randomly, others not far from him

Collective exploration Fitness evaluation T k,j = L k,j M 1,k,j M 2,k,j L leader - exploration behaviour: L k,j = V k,j (1 V k,j )(1 i k,j ) + Z k,j Z k,j - reward for target arena M grouping behaviour: M i,k,j = (1 D k,j (i, 0)), where D k,j (i, 0) is distance from the leader

Evolved behaviour: collective exploration

Light down! Introduction

Conclusions and future work Summary demonstrated that behavioural pattern can emerge from rather simple setup learning does not require large number of parameters (30,60 weights) Future work more complex tasks, compound behaviours, specialization and labour division incremental learning, modular control systems

THANK YOU Any questions?