from AutoMoDe to the Demiurge

Similar documents
AutoMoDe-Chocolate: automatic design of control software for robot swarms

AutoMoDe-Chocolate: automatic design of control software for robot swarms

Swarm Intelligence W7: Application of Machine- Learning Techniques to Automatic Control Design and Optimization

Socially-Mediated Negotiation for Obstacle Avoidance in Collective Transport

Evolving communicating agents that integrate information over time: a real robot experiment

Université Libre de Bruxelles

Evolving Neural Mechanisms for an Iterated Discrimination Task: A Robot Based Model

Distributed Intelligent Systems W11 Machine-Learning Methods Applied to Distributed Robotic Systems

Implicit Fitness Functions for Evolving a Drawing Robot

Swarm Robotics: A Review from the Swarm Engineering Perspective

CSCI 445 Laurent Itti. Group Robotics. Introduction to Robotics L. Itti & M. J. Mataric 1

Towards Autonomous Task Partitioning in Swarm Robotics

Enabling research on complex tasks in swarm robotics

Behaviour Patterns Evolution on Individual and Group Level. Stanislav Slušný, Roman Neruda, Petra Vidnerová. CIMMACS 07, December 14, Tenerife

The TAM: abstracting complex tasks in swarm robotics research

Socially-Mediated Negotiation for Obstacle Avoidance in Collective Transport

Self-organised Feedback in Human Swarm Interaction

Evolution of Acoustic Communication Between Two Cooperating Robots

Distributed Area Coverage Using Robot Flocks

Kilogrid: a Modular Virtualization Environment for the Kilobot Robot

Environmental factors promoting the evolution of recruitment strategies in swarms of foraging robots

Surveillance strategies for autonomous mobile robots. Nicola Basilico Department of Computer Science University of Milan

Path formation in a robot swarm

Subsumption Architecture in Swarm Robotics. Cuong Nguyen Viet 16/11/2015

A Self-Adaptive Communication Strategy for Flocking in Stationary and Non-Stationary Environments

Traffic Control for a Swarm of Robots: Avoiding Target Congestion

On The Role of the Multi-Level and Multi- Scale Nature of Behaviour and Cognition

Robots in the Loop: Supporting an Incremental Simulation-based Design Process

Human-Swarm Interaction

Online Evolution for Cooperative Behavior in Group Robot Systems

Formal methods for the design and analysis of robot swarms

Evolution of communication-based collaborative behavior in homogeneous robots

Biologically Inspired Embodied Evolution of Survival

Fault Detection in Autonomous Robots

E X O G E N O U S FA U LT D E T E C T I O N I N S WA R M R O B O T I C S Y S T E M S

Université Libre de Bruxelles

Multi robot Team Formation for Distributed Area Coverage. Raj Dasgupta Computer Science Department University of Nebraska, Omaha

Keywords Multi-Agent, Distributed, Cooperation, Fuzzy, Multi-Robot, Communication Protocol. Fig. 1. Architecture of the Robots.

Booklet of teaching units

Multi-Robot Learning with Particle Swarm Optimization

Adaptive Potential Fields Model for Solving Distributed Area Coverage Problem in Swarm Robotics

Université Libre de Bruxelles

A self-adaptive communication strategy for flocking in stationary and non-stationary environments

OFFensive Swarm-Enabled Tactics (OFFSET)

Efficient Decision-Making in a Self-Organizing Robot Swarm: On the Speed Versus Accuracy Trade-Off

Evolution, Self-Organisation and Swarm Robotics

Task Partitioning in a Robot Swarm: Object Retrieval as a Sequence of Subtasks with Direct Object Transfer

Cooperative navigation in robotic swarms

Sorting in Swarm Robots Using Communication-Based Cluster Size Estimation

Distributed Task Allocation in Swarms. of Robots

Université Libre de Bruxelles

EMERGENCE OF COMMUNICATION IN TEAMS OF EMBODIED AND SITUATED AGENTS

ARGoS: a Modular, Multi-Engine Simulator for Heterogeneous Swarm Robotics

Learning Reactive Neurocontrollers using Simulated Annealing for Mobile Robots

Towards Cooperation in a Heterogeneous Robot Swarm through Spatially Targeted Communication

Self-Organized Flocking with a Mobile Robot Swarm: a Novel Motion Control Method

Biological Inspirations for Distributed Robotics. Dr. Daisy Tang

Learning and Using Models of Kicking Motions for Legged Robots

Computational Intelligence Optimization

Jager UAVs to Locate GPS Interference

Autonomous Initialization of Robot Formations

Stanford Center for AI Safety

Multi-Robot Coordination. Chapter 11

Université Libre de Bruxelles

Holland, Jane; Griffith, Josephine; O'Riordan, Colm.

Reinforcement Learning Simulations and Robotics

CAPACITIES FOR TECHNOLOGY TRANSFER

CS 599: Distributed Intelligence in Robotics

Hybrid Control of Swarms for Resource Selection

Blockchain technology for robot swarms: A shared knowledge and reputation management system for collective estimation

SYSTEMATIC MODEL BASED AND SEARCH BASED TESTING OF CYBER PHYSICAL SYSTEMS

Evolved Neurodynamics for Robot Control

! The architecture of the robot control system! Also maybe some aspects of its body/motors/sensors

Université Libre de Bruxelles

Breedbot: An Edutainment Robotics System to Link Digital and Real World

Information Aggregation Mechanisms in Social Odometry

CS594, Section 30682:

On the Design and Implementation of an Accurate, Efficient, and Flexible Simulator for Heterogeneous Swarm Robotics Systems

Efficient Decision-Making in a Self-Organizing Robot Swarm: On the Speed Versus Accuracy Trade-Off

Learning and Using Models of Kicking Motions for Legged Robots

ARGoS: a Pluggable, Multi-Physics Engine Simulator for Heterogeneous Swarm Robotics

SWARM INTELLIGENCE. Mario Pavone Department of Mathematics & Computer Science University of Catania

DETECTION AND CLASSIFICATION OF POWER QUALITY DISTURBANCES

Université Libre de Bruxelles

Collective Transport with Obstacle Avoidance through Socially-Mediated Negotiation

Neuro-Fuzzy and Soft Computing: Fuzzy Sets. Chapter 1 of Neuro-Fuzzy and Soft Computing by Jang, Sun and Mizutani

Adaptive Sensor Selection Algorithms for Wireless Sensor Networks. Silvia Santini PhD defense October 12, 2009

arxiv: v1 [cs.ai] 31 Jan 2011

On Feature Selection, Bias-Variance, and Bagging

Real-time Systems in Tokamak Devices. A case study: the JET Tokamak May 25, 2010

Constructing K-Connected M-Dominating Sets

Transferring Technical debt to automated Production Systems (aps)

Increasing the precision of mobile sensing systems through super-sampling

Cooperation through self-assembly in multi-robot systems

NAVIGATION OF MOBILE ROBOT USING THE PSO PARTICLE SWARM OPTIMIZATION

A Bandit Approach for Tree Search

IMPLEMENTING MULTIPLE ROBOT ARCHITECTURES USING MOBILE AGENTS

Self-adapting Fitness Evaluation Times for On-line Evolution of Simulated Robots

Evolution of Sensor Suites for Complex Environments

Université Libre de Bruxelles

Transcription:

INFO-H-414: Swarm Intelligence Automatic Design of Robot Swarms from AutoMoDe to the Demiurge IRIDIA's recent and forthcoming research on the automatic design of robot swarms Mauro Birattari IRIDIA, Université libre de Bruxelles

outline introductory concepts definitions working hypothesis and idea from classical ER to AutoMoDe AutoMoDe-vanilla and AutoMoDe-chocolate towards the Demiurge

swarm robotics introductory concepts in swarm robotics a mission is entrusted to a large group of robots a swarm is autonomous and self-organized: - no leader robot or external infrastructures a swarm is highly redundant - homogeneous of heterogeneous but nobody is indispensable robots are capable of local perception and communication only - each robot interacts with few neighbours and is unaffected by swarm size robots operate in parallel on multiple tasks - they switch from task to task in an autonomous and self-organized way autonomy, self-organization, redundancy, locality, and parallel execution promote fault tolerance, scalability, and flexibility

introductory concepts state of the art and limitations self-organization is viable no general design methodology swarms are mostly designed by hand via trial and error - high costs - not predictable/repeatable - no guarantees the lack of an engineering methodology prevents real-world applications

introductory concepts the design challenge in swarm robotics design the individual to obtain desired swarm-level properties

introductory concepts evolutionary robotics robots are controlled by neural networks that map sensor readings to control actions parameters (and possibly structure) of neural network are obtained via evolution

shortcomings of ER from an engineer s perspective introductory concepts focus on research questions relevant to biology rather than engineering unclear methodology concerning role of experimenter

definitions offline automatic design design phase in simulation online automatic design vs continuous design during operation in target environment operation phase in target environment not an exhaustive taxonomy: hybrids and variants are possible

one shot design process offline automatic design iterated design process definitions specifications specifications objective function objective function vs design via optimization in simulation design via optimization in simulation assessment control software control software

definitions overall problem model class of tasks sample a task design in simulation operation in real world iterate over and over

working hypothesis and idea offline automatic design machine learning design in simulation reality gap whp training set generalization operation in real world test set

working hypothesis and idea offline automatic design machine learning reality gap whp generalization idea: handle reality gap as a generalization problem

from classical ER to AutoMoDe a bias/variance dilemma? a possible explanation of the inability to successfully cross the reality gap shown by some/many robot swarms obtained via classical ER: NN could be too powerful: low bias / high variance they could overfit idiosyncrasies of simulator

from classical ER to AutoMoDe AutoMoDe limit power of control software by injecting bias finite state machines obtained by assembling and fine-tuning preexisting parametric modules modules are task independent : defined a priori on the basis of robot capabilities (an example follows )

AutoMoDe-vanilla AutoMoDe vanilla RM1: a reference model of the e-puck (subset of sensors/actuators) sensors/actuators proximity light variables proxi [0,1] & qi with i {1,2,,8} lighti [0,1] & qi with i {1,2,,8} ground gndi {0,.5,1} with i {1,2,3} range and bearing wheels n N, rm, & bm with m {1,2,,n} vl & vr [-V,+V] with V = 0.16 m/s period of control cycle: 100 ms

AutoMoDe-vanilla AutoMoDe vanilla parametric behaviors & transition conditions behaviors exploration stop conditions black-floor gray-floor attraction = 5 black-floor β = 1 phototaxis anti-phototaxis attraction white-floor neighbor-count inverted-neighbor-count gray-floor β = 1 stop repulsion fixed-probability fixed-probability β = 0.25 example: aggregation

AutoMoDe-vanilla AutoMoDe vanilla optimization algorithm: F-Race (Birattari et al., 2002) sample candidate designs candidate designs iteratively evaluate them drop dominated ones iterations stop when one survives or evaluation budget depletes

AutoMoDe-vanilla experiments design in simulation with ARGoS tests with swarm of 20 e-puck robots two tasks: aggregation and foraging three budget levels: 10k, 50k, and 200k two design methods: vanilla and EvoStick no modification whatsoever allowed to adapt design methods to task or budget level!!!

AutoMoDe-vanilla aggregation foraging F = max(na,nb)/n F = No

results: aggregation AutoMoDe-vanilla Wide boxes: robots Narrow boxes: simulation Objective function 0.0 0.2 0.4 0.6 0.8 1.0 AutoMoDe Vanilla EvoStick 10k 50k 200k AutoMoDe Vanilla EvoStick AutoMoDe Vanilla EvoStick the higher, the better EvoStick s performance is good in simulation but drops in robot experiments

AutoMoDe-vanilla results: foraging Wide boxes: robots Narrow boxes: simulation Objective function 0 20 40 60 80 100 120 10k 50k 200k the higher, the better AutoMoDe Vanilla EvoStick AutoMoDe Vanilla EvoStick AutoMoDe Vanilla EvoStick EvoStick is excellent in simulation but poor in robot experiments. The gap grows with the budget

AutoMoDe-vanilla remarks vanilla s modules proved to be general enough to produce control software for two different tasks whether they are general enough to produce control software for any tasks that could be possibly tackled by robots described by reference model RM1 is an empirical question

AutoMoDe-vanilla experiment with humans 5 human designers PhD students at IRIDIA in swarm robotics proficient in ARGoS and acquainted with the e-puck unaware of AutoMoDe-vanilla and its functioning were asked to conceive a task compatibly with given reference model and with constraint on arena, possible obstacles, ground color, etc solve another task by assembling vanilla s modules solve yet another task by direct C++ programming

AutoMoDe-vanilla experiments design in simulation with ARGoS tests with swarm of 20 e-puck robots budget: 200k for automatic methods (~2h30) and 4h for humans 5 tasks proposed by human designers: shelter with constrained access, largest covering network, coverage with forbidden areas, surface and perimeter coverage, aggregation with ambient cues 4 design methods: vanilla, EvoStick, C Human, U Human no modification whatsoever allowed to adapt the two automatic design methods to new tasks!!!

AutoMoDe-vanilla task and results Wide boxes: robots Narrow boxes: simulation Objective fuction 0 2000 4000 6000 8000 10000 12000 the higher, the better F = T N(t) Vanilla EvoStick U Human C Human SCA shelter with constrained access: robots should aggregate in the white shelter

Vanilla

AutoMoDe-vanilla task and results Wide boxes: robots Narrow boxes: simulation Objective fuction 0.5 1.0 1.5 2.0 the higher, the better F = AC(T) Vanilla EvoStick U Human C Human LCN largest covering network: robots should cover the largest possible area while maintaining connection with one another

AutoMoDe-vanilla task and results Wide boxes: robots Narrow boxes: simulation F = E[d(T)] Objective fuction 0.20 0.22 0.24 0.26 0.28 Vanilla EvoStick U Human C Human the lower, the better CFA coverage with forbidden areas: robot should cover the arena except the forbidden black areas

AutoMoDe-vanilla task and results Wide boxes: robots Narrow boxes: simulation Objective fuction 2 4 6 8 10 12 x x x x the lower, the better F = E[da(T)]/ca + E[dp(T)]/cp Vanilla EvoStick U Human C Human SPC surface and perimeter coverage: robot should cover the surface of the white square and the perimeter of the black circle

AutoMoDe-vanilla task and results Wide boxes: robots Narrow boxes: simulation Objective fuction 0 5000 10000 15000 20000 the higher, the better F = T Nb(t) Vanilla EvoStick U Human C Human AAC aggregation with ambient cues: robot should aggregate in the black region

Vanilla

AutoMoDe-vanilla aggregate results Vanilla EvoStick U-Human C-Human analysis based on Friedman test the lower, the better 15 20 25 vanilla confirms better that EvoStick also on new tasks rank C-Human is the best: vanilla s modules are good but F Race is unable to fully exploit their potential humans experience the reality gap as well as automatic methods injecting bias (by constraining the human to the given modules) improves the ability to cross the reality gap also for humans

AutoMoDe-chocolate AutoMoDe chocolate same modules of vanilla (and C-Human), but new optimization algorithm: iterated F-Race (Balaprakash et al., 2002) sample candidate designs run F-Race sample around survivors run F-Race and iterate sample initial candidate designs sample around survivors sample around survivors iterate stop when budget depletes

AutoMoDe-Chocolate task and results Wide boxes: robots Narrow boxes: simulation F = T N(t) 0 4000 8000 Vanilla Chocolate C Human the higher, the better SCA shelter with constrained access: robots should aggregate in the white shelter

AutoMoDe-Chocolate task and results Wide boxes: robots Narrow boxes: simulation 0.5 1.0 1.5 2.0 the higher, the better F = AC(T) Vanilla Chocolate C Human LCN largest covering network: robots should cover the largest possible area while maintaining connection with one another

AutoMoDe-Chocolate task and results Wide boxes: robots Narrow boxes: simulation F = E[d(T)] 0.20 0.24 0.28 0.32 Vanilla Chocolate C Human the lower, the better CFA coverage with forbidden areas: robot should cover the arena except the forbidden black areas

AutoMoDe-Chocolate task and results Wide boxes: robots Narrow boxes: simulation 2 3 4 5 6 the lower, the better F = E[da(T)]/ca + E[dp(T)]/cp Vanilla Chocolate C Human SPC surface and perimeter coverage: robot should cover the surface of the white square and the perimeter of the black circle

AutoMoDe-Chocolate task and results Wide boxes: robots Narrow boxes: simulation F = T Nb(t) 5000 10000 15000 20000 Vanilla Chocolate C Human the higher, the better AAC aggregation with ambient cues: robot should aggregate in the black region

AutoMoDe-Chocolate aggregate results analysis based on Friedman test Vanilla C-Human Chocolate the lower, the better 20 30 40 rank chocolate improves over vanilla chocolate performs better than C-Human

conclusions automatic design of robot swarms in the light of machine learning concepts AutoMoDe is a promising approach AutoMoDe-chocolate performed better that human designers some innovative elements in the empirical studies

DEMIURGE Automatic Design of Robot Swarms

towards the Demiurge Demiurge: an intelligent system that designs robot swarms in an integrated and automatic way starting from requirements expressed in an appropriate specification language, the Demiurge designs hardware and control software Plato s Demiurge in a drawing by the English pre-roman9c poet and illustrator William Blake (1757-1827) the Demiurge does not create designs from scratch: it operates on preexisting software and hardware modules

contributors Gianpiero Francesca Manuele Brambilla Arne Brutschy Lorenzo Garattoni Roman Miletitch Gaëtan Podevijn Andreagiovanni Reina Touraj Soleymani Mattia Salvaro Carlo Pinciroli Franco Mascia Vito Trianni