HyperNEAT-GGP: A HyperNEAT-based Atari General Game Player. Matthew Hausknecht, Piyush Khandelwal, Risto Miikkulainen, Peter Stone

Size: px
Start display at page:

Download "HyperNEAT-GGP: A HyperNEAT-based Atari General Game Player. Matthew Hausknecht, Piyush Khandelwal, Risto Miikkulainen, Peter Stone"

Transcription

1 -GGP: A -based Atari General Game Player Matthew Hausknecht, Piyush Khandelwal, Risto Miikkulainen, Peter Stone

2 Motivation Create a General Video Game Playing agent which learns from visual representations Little domain specific knowledge Capable of playing different Atari 2600 games without reconfiguration

3 Introducing GVGP GGP agents given a declarative representation of the game including complete game dynamics In contrast we seek to learn from a visual representation without knowing dynamics General video game playing (GVGP) is a good challenge

4 Atari games with wildly varying dynamics Standard interface for control - 18 Actions Two player (Multi-agent) Capabilities Multiple standardized state representations Good open source emulation

5

6 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

7 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Raw Game Screen Visual Processing

8 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

9 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Continuously Valued Firings Visual Processing

10 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

11 -GGP Architecture Continuously Valued Outputs Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

12 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

13 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

14 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

15 Fitness Evaluation Atari 2600 Emulator Individual Game Score Neural Network CPPN At end of game, Score is given to the individual as fitness

16 Evolution Fitness Fitness Fitness Evolution then produces the next generation

17 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

18 Extension of NEAT Stanley, Ambrosio, Gauci A Hypercube-Based Indirect Encoding for Evolving Large-Scale Neural Networks - Kenneth Stanley, David Ambrosio, Jason Gauci. Artificial Life 2009

19

20 Input node firings are continuous valued and taken directly from the processed screen

21 Firings are propagated through the network forming continuously valued outputs

22 How to determine connection weights?

23 Evolve Evolvethe the weights! weights! NEAT Maybe we can do better...

24 X1 Y1 X2 Y2 CPPN B

25 CPPN B

26 CPPN B

27 CPPN B

28 X1 Y1 X2 Y2 B CPPN determines all connection weights

29 CPPN Evolved by NEAT Add Nodes - Gaussian, sinusoidal, sigmoid, absolute value, linear Add links X1 Y1 X2 Y2 B Change Connection Weights

30 Advantages of vs NEAT Indirect Encoding Geometrically Aware Learn a function and apply it regardless of the absolute location Ultimately allows better policies to be found more quickly

31 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

32 Visual Processing Framework Raw Game Screen: 160x210 pixels; 256 colors

33 Visual Processing Framework Blob Detection Simple 8-connectivity check Each blob assigned a velocity based on its location in previous frame

34 Visual Processing Framework Adjacent blobs with non-zero velocity are merged into objects Object Detection

35 Visual Processing Framework Class Detection Objects with sufficient pixel similarity are said to belong to the same "object class" 3 main classes: car left, car right, chicken

36 Self Detection Information gain approach to identify the object on screen most likely to be the "self" Intuitively object that responds to actions Self circled in red

37 Atari- Interface Raw screen reduced to a 16x21 grid Mapping from object classes to continuous values

38 -GGP Architecture Action Selection Action Atari 2600 Emulator Neural Network CPPN Visual Processing

39 Action Selection Examine squares adjacent to "self" Take directional action corresponding to max valued neighbor no-op if "self" square is highest valued Up action selected in this case

40 -GGP Architecture Action Selection Action Atari 2600 Neural Network CPPN Visual Processing

41 Selected Games Examine 2 Atari games - Freeway and Asterix

42 Experimental Setup Run -GGP for 250 generations with 100 individuals in each generation Individual evaluations performed in parallel Individual fitness = raw game score Compared against previously published results using Sarsa-Lambda with linear function approximation

43 Results Freeway Asterix Sarsa-Lambda BASS* Sarsa-Lambda DISCO* Sarsa-Lambda RAM* Random GGP Avg -GGP Champ *Y. Naddaf. Game-independent ai agents for playing atari 2600 console games. Master's thesis, University of Alberta, 2010.

44 Freeway Results

45 Asterix Results

46 Related Work Atari Game Playing: Y. Naddaf. Game-independent ai agents for playing atari 2600 console games. Master's thesis, University of Alberta, GGP: M. Genesereth and N. Love. General game playing: Overview of the aaai competition. AI Magazine, 26:62-72, Ms. Pac-Man: S. M. Lucas. Ms pac-man competition (screen capturemode). uk/staff/sml/pacman/cig2011results.html. Quake II: M. Parker and B. Bryant. Backpropagation without human supervision for visual control in quake ii. Proceedings of the 2009 IEEE Symposium on Computational Intelligence and Games (CIG'09), pages , Pitfall: C. Diuk, A. Cohen, and M. L. Littman. An object-oriented representation for efficient reinforcement learning. In Proceedings of 25th International Conference on Machine Learning (ICML), pages , : P. Verbancsics and K. O. Stanley. Evolving static representations for task transfer. J. Mach. Learn. Res., 1: , August J. Gauci and K. O. Stanley. A case study on the critical role of geometric regularity in machine learning. In Proceedings of the 23rd National Conference on Articial Intelligence (AAAI), D. B. D'Ambrosio and K. O. Stanley. Generative encoding for multiagent learning. In GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages , New York, NY, USA, ACM. J. Clune, B. E. Beckmann, C. Ofria, and R. T. Pennock. Evolving coordinated quadruped gaits with the hyperneat generative encoding. In Proceedings of the Eleventh conference on Congress on Evolutionary Computation, CEC'09, pages 2764{2771, Piscataway, NJ, USA, IEEE Press.

47 Conclusion Introduce -GGP, a general Atari game playing agent Learns from the game screen using to evolve policies for gameplay Performance results exceed prior work (Sarsa-Lambda) for the games Asterix and Freeway In future work extend this system to more games Represents a first step toward the challenge of general video game playing (GVGP) from visual representations

48 Questions? -GGP Code: A.L.E. Code:

49 Self Detection Algorithm actions = set of actions applicable to this game current_blobs = set of blobs in the current game frame ActionHist = Set of action at time 0...n for blob b in current_blobs do vhist_b = Set of velocities of blob b at time 0...n H_b = H(vHist_b) for action a in actions do vhist_(b a) = [vhist_b[t] forall t s.t. ActionHist[t-1] == a] H_(b a) = H(vHist_(b a)) end for InfoGain_b = H_b - sum(p_a * H_(b a)) end for return arg_max over blobs (InfoGain_b)

Evolutionary robotics, neural networks, artificial intelligence. Assistant Professor, IT University of Copenhagen, July July 2016

Evolutionary robotics, neural networks, artificial intelligence. Assistant Professor, IT University of Copenhagen, July July 2016 Joel Lehman Contact Information Assistant Professor IT University of Copenhagen WWW: www.joellehman.com E-mail: jleh@itu.dk Research Interests Academic Experience Evolutionary robotics, neural networks,

More information

Implementation and Comparison the Dynamic Pathfinding Algorithm and Two Modified A* Pathfinding Algorithms in a Car Racing Game

Implementation and Comparison the Dynamic Pathfinding Algorithm and Two Modified A* Pathfinding Algorithms in a Car Racing Game Implementation and Comparison the Dynamic Pathfinding Algorithm and Two Modified A* Pathfinding Algorithms in a Car Racing Game Jung-Ying Wang and Yong-Bin Lin Abstract For a car racing game, the most

More information

Evolutionary Computation for Creativity and Intelligence. By Darwin Johnson, Alice Quintanilla, and Isabel Tweraser

Evolutionary Computation for Creativity and Intelligence. By Darwin Johnson, Alice Quintanilla, and Isabel Tweraser Evolutionary Computation for Creativity and Intelligence By Darwin Johnson, Alice Quintanilla, and Isabel Tweraser Introduction to NEAT Stands for NeuroEvolution of Augmenting Topologies (NEAT) Evolves

More information

USING A FUZZY LOGIC CONTROL SYSTEM FOR AN XPILOT COMBAT AGENT ANDREW HUBLEY AND GARY PARKER

USING A FUZZY LOGIC CONTROL SYSTEM FOR AN XPILOT COMBAT AGENT ANDREW HUBLEY AND GARY PARKER World Automation Congress 21 TSI Press. USING A FUZZY LOGIC CONTROL SYSTEM FOR AN XPILOT COMBAT AGENT ANDREW HUBLEY AND GARY PARKER Department of Computer Science Connecticut College New London, CT {ahubley,

More information

Synthetic Brains: Update

Synthetic Brains: Update Synthetic Brains: Update Bryan Adams Computer Science and Artificial Intelligence Laboratory (CSAIL) Massachusetts Institute of Technology Project Review January 04 through April 04 Project Status Current

More information

LEARNABLE BUDDY: LEARNABLE SUPPORTIVE AI IN COMMERCIAL MMORPG

LEARNABLE BUDDY: LEARNABLE SUPPORTIVE AI IN COMMERCIAL MMORPG LEARNABLE BUDDY: LEARNABLE SUPPORTIVE AI IN COMMERCIAL MMORPG Theppatorn Rhujittawiwat and Vishnu Kotrajaras Department of Computer Engineering Chulalongkorn University, Bangkok, Thailand E-mail: g49trh@cp.eng.chula.ac.th,

More information

Quiz. Quiz. Quiz. Quiz. What is the most complex artifact in the known universe? What is the most complex artifact in the known universe?

Quiz. Quiz. Quiz. Quiz. What is the most complex artifact in the known universe? What is the most complex artifact in the known universe? Neuroevolution: Evolving Neural Networks Kenneth O. Stanley Dept. of Electrical Engineering and Computer Science University of Central Florida kstanley@eecs.ucf.edu http://www.cs.ucf.edu/~kstanley Copyright

More information

Understanding Coevolution

Understanding Coevolution Understanding Coevolution Theory and Analysis of Coevolutionary Algorithms R. Paul Wiegand Kenneth A. De Jong paul@tesseract.org kdejong@.gmu.edu ECLab Department of Computer Science George Mason University

More information

The Evolution of Multi-Layer Neural Networks for the Control of Xpilot Agents

The Evolution of Multi-Layer Neural Networks for the Control of Xpilot Agents The Evolution of Multi-Layer Neural Networks for the Control of Xpilot Agents Matt Parker Computer Science Indiana University Bloomington, IN, USA matparker@cs.indiana.edu Gary B. Parker Computer Science

More information

The Behavior Evolving Model and Application of Virtual Robots

The Behavior Evolving Model and Application of Virtual Robots The Behavior Evolving Model and Application of Virtual Robots Suchul Hwang Kyungdal Cho V. Scott Gordon Inha Tech. College Inha Tech College CSUS, Sacramento 253 Yonghyundong Namku 253 Yonghyundong Namku

More information

Sebastian Risi. Joel Lehman. David B. D Ambrosio Kenneth O. Stanley ABSTRACT

Sebastian Risi. Joel Lehman. David B. D Ambrosio Kenneth O. Stanley ABSTRACT Automatically Categorizing Procedurally Generated Content for Collecting Games In: Proceedings of the Workshop on Procedural Content Generation in Games (PCG) at the 9th International Conference on the

More information

Pareto Evolution and Co-Evolution in Cognitive Neural Agents Synthesis for Tic-Tac-Toe

Pareto Evolution and Co-Evolution in Cognitive Neural Agents Synthesis for Tic-Tac-Toe Proceedings of the 27 IEEE Symposium on Computational Intelligence and Games (CIG 27) Pareto Evolution and Co-Evolution in Cognitive Neural Agents Synthesis for Tic-Tac-Toe Yi Jack Yau, Jason Teo and Patricia

More information

Impact of Neuron Models and Network Structure on Evolving Modular Robot Neural Network Controllers

Impact of Neuron Models and Network Structure on Evolving Modular Robot Neural Network Controllers Impact of Neuron Models and Network Structure on Evolving Modular Robot Neural Network Controllers ABSTRACT Leo Cazenille TAO/LRI Univ. Paris-Sud, CNRS, INRIA F-91405 Orsay, France leo.cazenille@lri.fr

More information

arxiv: v1 [cs.ne] 3 May 2018

arxiv: v1 [cs.ne] 3 May 2018 VINE: An Open Source Interactive Data Visualization Tool for Neuroevolution Uber AI Labs San Francisco, CA 94103 {ruiwang,jeffclune,kstanley}@uber.com arxiv:1805.01141v1 [cs.ne] 3 May 2018 ABSTRACT Recent

More information

SMARTER NEAT NETS. A Thesis. presented to. the Faculty of California Polytechnic State University. San Luis Obispo. In Partial Fulfillment

SMARTER NEAT NETS. A Thesis. presented to. the Faculty of California Polytechnic State University. San Luis Obispo. In Partial Fulfillment SMARTER NEAT NETS A Thesis presented to the Faculty of California Polytechnic State University San Luis Obispo In Partial Fulfillment of the Requirements for the Degree Master of Science in Computer Science

More information

An Influence Map Model for Playing Ms. Pac-Man

An Influence Map Model for Playing Ms. Pac-Man An Influence Map Model for Playing Ms. Pac-Man Nathan Wirth and Marcus Gallagher, Member, IEEE Abstract In this paper we develop a Ms. Pac-Man playing agent based on an influence map model. The proposed

More information

Online Interactive Neuro-evolution

Online Interactive Neuro-evolution Appears in Neural Processing Letters, 1999. Online Interactive Neuro-evolution Adrian Agogino (agogino@ece.utexas.edu) Kenneth Stanley (kstanley@cs.utexas.edu) Risto Miikkulainen (risto@cs.utexas.edu)

More information

Art in the Sciences of the Artificial

Art in the Sciences of the Artificial Published in: Leonardo Volume 51, Issue 2, p. 165-172, April 2018, MIT Press (c) ISAST Art in the Sciences of the Artificial Kenneth O. Stanley (kstanley@cs.ucf.edu) Department of Computer Science University

More information

Evolving Parameters for Xpilot Combat Agents

Evolving Parameters for Xpilot Combat Agents Evolving Parameters for Xpilot Combat Agents Gary B. Parker Computer Science Connecticut College New London, CT 06320 parker@conncoll.edu Matt Parker Computer Science Indiana University Bloomington, IN,

More information

General Video Game AI: Learning from Screen Capture

General Video Game AI: Learning from Screen Capture General Video Game AI: Learning from Screen Capture Kamolwan Kunanusont University of Essex Colchester, UK Email: kkunan@essex.ac.uk Simon M. Lucas University of Essex Colchester, UK Email: sml@essex.ac.uk

More information

Artefacts: Minecraft meets Collaborative Interactive Evolution

Artefacts: Minecraft meets Collaborative Interactive Evolution Artefacts: Minecraft meets Collaborative Interactive Evolution Cristinel Patrascu Center for Computer Games Research IT University of Copenhagen Copenhagen, Denmark Email: patrascu.cristinel@gmail.com

More information

Neuroevolution. Evolving Neural Networks. Today s Main Topic. Why Neuroevolution?

Neuroevolution. Evolving Neural Networks. Today s Main Topic. Why Neuroevolution? Today s Main Topic Neuroevolution CSCE Neuroevolution slides are from Risto Miikkulainen s tutorial at the GECCO conference, with slight editing. Neuroevolution: Evolve artificial neural networks to control

More information

CYCLIC GENETIC ALGORITHMS FOR EVOLVING MULTI-LOOP CONTROL PROGRAMS

CYCLIC GENETIC ALGORITHMS FOR EVOLVING MULTI-LOOP CONTROL PROGRAMS CYCLIC GENETIC ALGORITHMS FOR EVOLVING MULTI-LOOP CONTROL PROGRAMS GARY B. PARKER, CONNECTICUT COLLEGE, USA, parker@conncoll.edu IVO I. PARASHKEVOV, CONNECTICUT COLLEGE, USA, iipar@conncoll.edu H. JOSEPH

More information

VIDEO games provide excellent test beds for artificial

VIDEO games provide excellent test beds for artificial FRIGHT: A Flexible Rule-Based Intelligent Ghost Team for Ms. Pac-Man David J. Gagne and Clare Bates Congdon, Senior Member, IEEE Abstract FRIGHT is a rule-based intelligent agent for playing the ghost

More information

Award Ellbogen Next Generation Program Sept Grant awarded to the Laramie Robotics Club, obtained as Treasurer,

Award Ellbogen Next Generation Program Sept Grant awarded to the Laramie Robotics Club, obtained as Treasurer, Name Joost Huizinga Address 3918 Fulton Street Apartment 1 San Francisco, California, 94118 Phone +1 (307) 460 1368 E-mail joost.hui@gmail.com Date of Birth 03-08-1986 Education Program Ph. D. Computer

More information

Evolving robots to play dodgeball

Evolving robots to play dodgeball Evolving robots to play dodgeball Uriel Mandujano and Daniel Redelmeier Abstract In nearly all videogames, creating smart and complex artificial agents helps ensure an enjoyable and challenging player

More information

Evolutionary Neural Networks for Non-Player Characters in Quake III

Evolutionary Neural Networks for Non-Player Characters in Quake III Evolutionary Neural Networks for Non-Player Characters in Quake III Joost Westra and Frank Dignum Abstract Designing and implementing the decisions of Non- Player Characters in first person shooter games

More information

CS 229 Final Project: Using Reinforcement Learning to Play Othello

CS 229 Final Project: Using Reinforcement Learning to Play Othello CS 229 Final Project: Using Reinforcement Learning to Play Othello Kevin Fry Frank Zheng Xianming Li ID: kfry ID: fzheng ID: xmli 16 December 2016 Abstract We built an AI that learned to play Othello.

More information

IMPROVING TOWER DEFENSE GAME AI (DIFFERENTIAL EVOLUTION VS EVOLUTIONARY PROGRAMMING) CHEAH KEEI YUAN

IMPROVING TOWER DEFENSE GAME AI (DIFFERENTIAL EVOLUTION VS EVOLUTIONARY PROGRAMMING) CHEAH KEEI YUAN IMPROVING TOWER DEFENSE GAME AI (DIFFERENTIAL EVOLUTION VS EVOLUTIONARY PROGRAMMING) CHEAH KEEI YUAN FACULTY OF COMPUTING AND INFORMATICS UNIVERSITY MALAYSIA SABAH 2014 ABSTRACT The use of Artificial Intelligence

More information

Evolutions of communication

Evolutions of communication Evolutions of communication Alex Bell, Andrew Pace, and Raul Santos May 12, 2009 Abstract In this paper a experiment is presented in which two simulated robots evolved a form of communication to allow

More information

Playing Atari Games with Deep Reinforcement Learning

Playing Atari Games with Deep Reinforcement Learning Playing Atari Games with Deep Reinforcement Learning 1 Playing Atari Games with Deep Reinforcement Learning Varsha Lalwani (varshajn@iitk.ac.in) Masare Akshay Sunil (amasare@iitk.ac.in) IIT Kanpur CS365A

More information

Encouraging Creative Thinking in Robots Improves Their Ability to Solve Challenging Problems

Encouraging Creative Thinking in Robots Improves Their Ability to Solve Challenging Problems Encouraging Creative Thinking in Robots Improves Their Ability to Solve Challenging Problems Jingyu Li Evolving AI Lab Computer Science Dept. University of Wyoming Laramie High School jingyuli@mit.edu

More information

Bachelor thesis. Influence map based Ms. Pac-Man and Ghost Controller. Johan Svensson. Abstract

Bachelor thesis. Influence map based Ms. Pac-Man and Ghost Controller. Johan Svensson. Abstract 2012-07-02 BTH-Blekinge Institute of Technology Uppsats inlämnad som del av examination i DV1446 Kandidatarbete i datavetenskap. Bachelor thesis Influence map based Ms. Pac-Man and Ghost Controller Johan

More information

Creating a Poker Playing Program Using Evolutionary Computation

Creating a Poker Playing Program Using Evolutionary Computation Creating a Poker Playing Program Using Evolutionary Computation Simon Olsen and Rob LeGrand, Ph.D. Abstract Artificial intelligence is a rapidly expanding technology. We are surrounded by technology that

More information

Hierarchical Controller for Robotic Soccer

Hierarchical Controller for Robotic Soccer Hierarchical Controller for Robotic Soccer Byron Knoll Cognitive Systems 402 April 13, 2008 ABSTRACT RoboCup is an initiative aimed at advancing Artificial Intelligence (AI) and robotics research. This

More information

Playing CHIP-8 Games with Reinforcement Learning

Playing CHIP-8 Games with Reinforcement Learning Playing CHIP-8 Games with Reinforcement Learning Niven Achenjang, Patrick DeMichele, Sam Rogers Stanford University Abstract We begin with some background in the history of CHIP-8 games and the use of

More information

Training a Neural Network for Checkers

Training a Neural Network for Checkers Training a Neural Network for Checkers Daniel Boonzaaier Supervisor: Adiel Ismail June 2017 Thesis presented in fulfilment of the requirements for the degree of Bachelor of Science in Honours at the University

More information

Neural Networks for Real-time Pathfinding in Computer Games

Neural Networks for Real-time Pathfinding in Computer Games Neural Networks for Real-time Pathfinding in Computer Games Ross Graham 1, Hugh McCabe 1 & Stephen Sheridan 1 1 School of Informatics and Engineering, Institute of Technology at Blanchardstown, Dublin

More information

Neuroevolution for RTS Micro

Neuroevolution for RTS Micro Neuroevolution for RTS Micro Aavaas Gajurel, Sushil J Louis, Daniel J Méndez and Siming Liu Department of Computer Science and Engineering, University of Nevada Reno Reno, Nevada Email: avs@nevada.unr.edu,

More information

Behaviour Patterns Evolution on Individual and Group Level. Stanislav Slušný, Roman Neruda, Petra Vidnerová. CIMMACS 07, December 14, Tenerife

Behaviour Patterns Evolution on Individual and Group Level. Stanislav Slušný, Roman Neruda, Petra Vidnerová. CIMMACS 07, December 14, Tenerife Behaviour Patterns Evolution on Individual and Group Level Stanislav Slušný, Roman Neruda, Petra Vidnerová Department of Theoretical Computer Science Institute of Computer Science Academy of Science of

More information

Neuroevolution of Multimodal Ms. Pac-Man Controllers Under Partially Observable Conditions

Neuroevolution of Multimodal Ms. Pac-Man Controllers Under Partially Observable Conditions Neuroevolution of Multimodal Ms. Pac-Man Controllers Under Partially Observable Conditions William Price 1 and Jacob Schrum 2 Abstract Ms. Pac-Man is a well-known video game used extensively in AI research.

More information

Orchestrating Game Generation Antonios Liapis

Orchestrating Game Generation Antonios Liapis Orchestrating Game Generation Antonios Liapis Institute of Digital Games University of Malta antonios.liapis@um.edu.mt http://antoniosliapis.com @SentientDesigns Orchestrating game generation Game development

More information

Feature Learning Using State Differences

Feature Learning Using State Differences Feature Learning Using State Differences Mesut Kirci and Jonathan Schaeffer and Nathan Sturtevant Department of Computing Science University of Alberta Edmonton, Alberta, Canada {kirci,nathanst,jonathan}@cs.ualberta.ca

More information

Gameplay as On-Line Mediation Search

Gameplay as On-Line Mediation Search Gameplay as On-Line Mediation Search Justus Robertson and R. Michael Young Liquid Narrative Group Department of Computer Science North Carolina State University Raleigh, NC 27695 jjrobert@ncsu.edu, young@csc.ncsu.edu

More information

The Effects of Supervised Learning on Neuro-evolution in StarCraft

The Effects of Supervised Learning on Neuro-evolution in StarCraft The Effects of Supervised Learning on Neuro-evolution in StarCraft Tobias Laupsa Nilsen Master of Science in Computer Science Submission date: Januar 2013 Supervisor: Keith Downing, IDI Norwegian University

More information

FreeCiv Learner: A Machine Learning Project Utilizing Genetic Algorithms

FreeCiv Learner: A Machine Learning Project Utilizing Genetic Algorithms FreeCiv Learner: A Machine Learning Project Utilizing Genetic Algorithms Felix Arnold, Bryan Horvat, Albert Sacks Department of Computer Science Georgia Institute of Technology Atlanta, GA 30318 farnold3@gatech.edu

More information

Retaining Learned Behavior During Real-Time Neuroevolution

Retaining Learned Behavior During Real-Time Neuroevolution Retaining Learned Behavior During Real-Time Neuroevolution Thomas D Silva, Roy Janik, Michael Chrien, Kenneth O. Stanley and Risto Miikkulainen Department of Computer Sciences University of Texas at Austin

More information

Neuroevolution of Content Layout in the PCG: Angry Bots Video Game

Neuroevolution of Content Layout in the PCG: Angry Bots Video Game 2013 IEEE Congress on Evolutionary Computation June 20-23, Cancún, México Neuroevolution of Content Layout in the PCG: Angry Bots Video Game Abstract This paper demonstrates an approach to arranging content

More information

Backpropagation without Human Supervision for Visual Control in Quake II

Backpropagation without Human Supervision for Visual Control in Quake II Backpropagation without Human Supervision for Visual Control in Quake II Matt Parker and Bobby D. Bryant Abstract Backpropagation and neuroevolution are used in a Lamarckian evolution process to train

More information

Tutorial of Reinforcement: A Special Focus on Q-Learning

Tutorial of Reinforcement: A Special Focus on Q-Learning Tutorial of Reinforcement: A Special Focus on Q-Learning TINGWU WANG, MACHINE LEARNING GROUP, UNIVERSITY OF TORONTO Contents 1. Introduction 1. Discrete Domain vs. Continous Domain 2. Model Based vs. Model

More information

THE WORLD video game market in 2002 was valued

THE WORLD video game market in 2002 was valued IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, VOL. 9, NO. 6, DECEMBER 2005 653 Real-Time Neuroevolution in the NERO Video Game Kenneth O. Stanley, Bobby D. Bryant, Student Member, IEEE, and Risto Miikkulainen

More information

Controller for TORCS created by imitation

Controller for TORCS created by imitation Controller for TORCS created by imitation Jorge Muñoz, German Gutierrez, Araceli Sanchis Abstract This paper is an initial approach to create a controller for the game TORCS by learning how another controller

More information

A Hybrid Method of Dijkstra Algorithm and Evolutionary Neural Network for Optimal Ms. Pac-Man Agent

A Hybrid Method of Dijkstra Algorithm and Evolutionary Neural Network for Optimal Ms. Pac-Man Agent A Hybrid Method of Dijkstra Algorithm and Evolutionary Neural Network for Optimal Ms. Pac-Man Agent Keunhyun Oh Sung-Bae Cho Department of Computer Science Yonsei University Seoul, Republic of Korea ocworld@sclab.yonsei.ac.kr

More information

Creating a Dominion AI Using Genetic Algorithms

Creating a Dominion AI Using Genetic Algorithms Creating a Dominion AI Using Genetic Algorithms Abstract Mok Ming Foong Dominion is a deck-building card game. It allows for complex strategies, has an aspect of randomness in card drawing, and no obvious

More information

A CONCRETE WORK OF ABSTRACT GENIUS

A CONCRETE WORK OF ABSTRACT GENIUS A CONCRETE WORK OF ABSTRACT GENIUS A Dissertation Presented by John Doe to The Faculty of the Graduate College of The University of Vermont In Partial Fullfillment of the Requirements for the Degree of

More information

Efficient Evaluation Functions for Multi-Rover Systems

Efficient Evaluation Functions for Multi-Rover Systems Efficient Evaluation Functions for Multi-Rover Systems Adrian Agogino 1 and Kagan Tumer 2 1 University of California Santa Cruz, NASA Ames Research Center, Mailstop 269-3, Moffett Field CA 94035, USA,

More information

The Importance of Look-Ahead Depth in Evolutionary Checkers

The Importance of Look-Ahead Depth in Evolutionary Checkers The Importance of Look-Ahead Depth in Evolutionary Checkers Belal Al-Khateeb School of Computer Science The University of Nottingham Nottingham, UK bxk@cs.nott.ac.uk Abstract Intuitively it would seem

More information

Creating Intelligent Agents in Games

Creating Intelligent Agents in Games Creating Intelligent Agents in Games Risto Miikkulainen The University of Texas at Austin Abstract Game playing has long been a central topic in artificial intelligence. Whereas early research focused

More information

Population Initialization Techniques for RHEA in GVGP

Population Initialization Techniques for RHEA in GVGP Population Initialization Techniques for RHEA in GVGP Raluca D. Gaina, Simon M. Lucas, Diego Perez-Liebana Introduction Rolling Horizon Evolutionary Algorithms (RHEA) show promise in General Video Game

More information

TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS

TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS Thong B. Trinh, Anwer S. Bashi, Nikhil Deshpande Department of Electrical Engineering University of New Orleans New Orleans, LA 70148 Tel: (504) 280-7383 Fax:

More information

arxiv: v1 [cs.ai] 24 Apr 2017

arxiv: v1 [cs.ai] 24 Apr 2017 Analysis of Vanilla Rolling Horizon Evolution Parameters in General Video Game Playing Raluca D. Gaina, Jialin Liu, Simon M. Lucas, Diego Pérez-Liébana School of Computer Science and Electronic Engineering,

More information

Neuro-Fuzzy and Soft Computing: Fuzzy Sets. Chapter 1 of Neuro-Fuzzy and Soft Computing by Jang, Sun and Mizutani

Neuro-Fuzzy and Soft Computing: Fuzzy Sets. Chapter 1 of Neuro-Fuzzy and Soft Computing by Jang, Sun and Mizutani Chapter 1 of Neuro-Fuzzy and Soft Computing by Jang, Sun and Mizutani Outline Introduction Soft Computing (SC) vs. Conventional Artificial Intelligence (AI) Neuro-Fuzzy (NF) and SC Characteristics 2 Introduction

More information

Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente

Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente Valentijn Muijrers 3275183 Valentijn.Muijrers@phil.uu.nl Supervisor: Gerard Vreeswijk 7,5 ECTS

More information

Integrating Learning in a Multi-Scale Agent

Integrating Learning in a Multi-Scale Agent Integrating Learning in a Multi-Scale Agent Ben Weber Dissertation Defense May 18, 2012 Introduction AI has a long history of using games to advance the state of the field [Shannon 1950] Real-Time Strategy

More information

CURRICULUM VITAE. Doctor of Philosophy in Computer Science, The University of Texas at Austin, August 2004.

CURRICULUM VITAE. Doctor of Philosophy in Computer Science, The University of Texas at Austin, August 2004. CURRICULUM VITAE Dr. Kenneth O. Stanley Professor Dept. of Computer Science The University of Central Florida Orlando, FL 32816 Degrees kstanley@cs.ucf.edu voice (407) 473-0072 fax (407)823-5419 http://www.cs.ucf.edu/

More information

Reinforcement Learning in Games Autonomous Learning Systems Seminar

Reinforcement Learning in Games Autonomous Learning Systems Seminar Reinforcement Learning in Games Autonomous Learning Systems Seminar Matthias Zöllner Intelligent Autonomous Systems TU-Darmstadt zoellner@rbg.informatik.tu-darmstadt.de Betreuer: Gerhard Neumann Abstract

More information

Learning to Play 2D Video Games

Learning to Play 2D Video Games Learning to Play 2D Video Games Justin Johnson jcjohns@stanford.edu Mike Roberts mlrobert@stanford.edu Matt Fisher mdfisher@stanford.edu Abstract Our goal in this project is to implement a machine learning

More information

Learning Behaviors for Environment Modeling by Genetic Algorithm

Learning Behaviors for Environment Modeling by Genetic Algorithm Learning Behaviors for Environment Modeling by Genetic Algorithm Seiji Yamada Department of Computational Intelligence and Systems Science Interdisciplinary Graduate School of Science and Engineering Tokyo

More information

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016 Artificial Neural Networks Artificial Intelligence Santa Clara, 2016 Simulate the functioning of the brain Can simulate actual neurons: Computational neuroscience Can introduce simplified neurons: Neural

More information

Super Mario Evolution

Super Mario Evolution Super Mario Evolution Julian Togelius, Sergey Karakovskiy, Jan Koutník and Jürgen Schmidhuber Abstract We introduce a new reinforcement learning benchmark based on the classic platform game Super Mario

More information

Learning Artificial Intelligence in Large-Scale Video Games

Learning Artificial Intelligence in Large-Scale Video Games Learning Artificial Intelligence in Large-Scale Video Games A First Case Study with Hearthstone: Heroes of WarCraft Master Thesis Submitted for the Degree of MSc in Computer Science & Engineering Author

More information

Evolving Opponent Models for Texas Hold Em

Evolving Opponent Models for Texas Hold Em Evolving Opponent Models for Texas Hold Em Alan J. Lockett and Risto Miikkulainen Abstract Opponent models allow software agents to assess a multi-agent environment more accurately and therefore improve

More information

Influence Map-based Controllers for Ms. PacMan and the Ghosts

Influence Map-based Controllers for Ms. PacMan and the Ghosts Influence Map-based Controllers for Ms. PacMan and the Ghosts Johan Svensson Student member, IEEE and Stefan J. Johansson, Member, IEEE Abstract Ms. Pac-Man, one of the classic arcade games has recently

More information

Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go

Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go Farhad Haqiqat and Martin Müller University of Alberta Edmonton, Canada Contents Motivation and research goals Feature Knowledge

More information

Extending the STRADA Framework to Design an AI for ORTS

Extending the STRADA Framework to Design an AI for ORTS Extending the STRADA Framework to Design an AI for ORTS Laurent Navarro and Vincent Corruble Laboratoire d Informatique de Paris 6 Université Pierre et Marie Curie (Paris 6) CNRS 4, Place Jussieu 75252

More information

Evolutionary Othello Players Boosted by Opening Knowledge

Evolutionary Othello Players Boosted by Opening Knowledge 26 IEEE Congress on Evolutionary Computation Sheraton Vancouver Wall Centre Hotel, Vancouver, BC, Canada July 16-21, 26 Evolutionary Othello Players Boosted by Opening Knowledge Kyung-Joong Kim and Sung-Bae

More information

Developing Frogger Player Intelligence Using NEAT and a Score Driven Fitness Function

Developing Frogger Player Intelligence Using NEAT and a Score Driven Fitness Function Developing Frogger Player Intelligence Using NEAT and a Score Driven Fitness Function Davis Ancona and Jake Weiner Abstract In this report, we examine the plausibility of implementing a NEAT-based solution

More information

Enhancements for Monte-Carlo Tree Search in Ms Pac-Man

Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Tom Pepels June 19, 2012 Abstract In this paper enhancements for the Monte-Carlo Tree Search (MCTS) framework are investigated to play Ms Pac-Man.

More information

Game State Evaluation Heuristics in General Video Game Playing

Game State Evaluation Heuristics in General Video Game Playing Game State Evaluation Heuristics in General Video Game Playing Bruno S. Santos, Heder S. Bernardino Departament of Computer Science Universidade Federal de Juiz de Fora - UFJF Juiz de Fora, MG, Brasil

More information

Model-Based Reinforcement Learning in Atari 2600 Games

Model-Based Reinforcement Learning in Atari 2600 Games Model-Based Reinforcement Learning in Atari 2600 Games Daniel John Foley Research Adviser: Erik Talvitie A thesis presented for honors within Computer Science on May 15 th, 2017 Franklin & Marshall College

More information

The Evolution of Blackjack Strategies

The Evolution of Blackjack Strategies The Evolution of Blackjack Strategies Graham Kendall University of Nottingham School of Computer Science & IT Jubilee Campus, Nottingham, NG8 BB, UK gxk@cs.nott.ac.uk Craig Smith University of Nottingham

More information

AI for Autonomous Ships Challenges in Design and Validation

AI for Autonomous Ships Challenges in Design and Validation VTT TECHNICAL RESEARCH CENTRE OF FINLAND LTD AI for Autonomous Ships Challenges in Design and Validation ISSAV 2018 Eetu Heikkilä Autonomous ships - activities in VTT Autonomous ship systems Unmanned engine

More information

Endless forms (of regression models) James McDermott

Endless forms (of regression models) James McDermott Endless forms (of regression models) Darwinian approaches to free-form numerical modelling James McDermott UCD Complex and Adaptive Systems Lab UCD Lochlann Quinn School of Business 1 / 54 Copyright 2015,

More information

Anca ANDREICA Producția științifică

Anca ANDREICA Producția științifică Anca ANDREICA Producția științifică Lucrări categoriile A, B și C Lucrări categoriile A și B puncte 9 puncte Lucrări categoria A A. Agapie, A. Andreica, M. Giuclea, Probabilistic Cellular Automata, Journal

More information

Ensemble Approaches in Evolutionary Game Strategies: A Case Study in Othello

Ensemble Approaches in Evolutionary Game Strategies: A Case Study in Othello Ensemble Approaches in Evolutionary Game Strategies: A Case Study in Othello Kyung-Joong Kim and Sung-Bae Cho Abstract In pattern recognition area, an ensemble approach is one of promising methods to increase

More information

International Journal of Modern Trends in Engineering and Research. Optimizing Search Space of Othello Using Hybrid Approach

International Journal of Modern Trends in Engineering and Research. Optimizing Search Space of Othello Using Hybrid Approach International Journal of Modern Trends in Engineering and Research www.ijmter.com Optimizing Search Space of Othello Using Hybrid Approach Chetan Chudasama 1, Pramod Tripathi 2, keyur Prajapati 3 1 Computer

More information

Hybrid of Evolution and Reinforcement Learning for Othello Players

Hybrid of Evolution and Reinforcement Learning for Othello Players Hybrid of Evolution and Reinforcement Learning for Othello Players Kyung-Joong Kim, Heejin Choi and Sung-Bae Cho Dept. of Computer Science, Yonsei University 134 Shinchon-dong, Sudaemoon-ku, Seoul 12-749,

More information

Representations for search-based methods

Representations for search-based methods Chapter 9 Representations for search-based methods Dan Ashlock, Sebastian Risi, and Julian Togelius Abstract One of the key considerations in search-based PCG is how to represent the game content. There

More information

Experiments with Learning for NPCs in 2D shooter

Experiments with Learning for NPCs in 2D shooter 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Tree depth influence in Genetic Programming for generation of competitive agents for RTS games

Tree depth influence in Genetic Programming for generation of competitive agents for RTS games Tree depth influence in Genetic Programming for generation of competitive agents for RTS games P. García-Sánchez, A. Fernández-Ares, A. M. Mora, P. A. Castillo, J. González and J.J. Merelo Dept. of Computer

More information

Cooperative Behavior Acquisition in A Multiple Mobile Robot Environment by Co-evolution

Cooperative Behavior Acquisition in A Multiple Mobile Robot Environment by Co-evolution Cooperative Behavior Acquisition in A Multiple Mobile Robot Environment by Co-evolution Eiji Uchibe, Masateru Nakamura, Minoru Asada Dept. of Adaptive Machine Systems, Graduate School of Eng., Osaka University,

More information

Co-evolution for Communication: An EHW Approach

Co-evolution for Communication: An EHW Approach Journal of Universal Computer Science, vol. 13, no. 9 (2007), 1300-1308 submitted: 12/6/06, accepted: 24/10/06, appeared: 28/9/07 J.UCS Co-evolution for Communication: An EHW Approach Yasser Baleghi Damavandi,

More information

Enhancements for Monte-Carlo Tree Search in Ms Pac-Man

Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Enhancements for Monte-Carlo Tree Search in Ms Pac-Man Tom Pepels Mark H.M. Winands Abstract In this paper enhancements for the Monte-Carlo Tree Search (MCTS) framework are investigated to play Ms Pac-Man.

More information

INTRODUCTION. a complex system, that using new information technologies (software & hardware) combined

INTRODUCTION. a complex system, that using new information technologies (software & hardware) combined COMPUTATIONAL INTELLIGENCE & APPLICATIONS INTRODUCTION What is an INTELLIGENT SYSTEM? a complex system, that using new information technologies (software & hardware) combined with communication technologies,

More information

This is a postprint version of the following published document:

This is a postprint version of the following published document: This is a postprint version of the following published document: Alejandro Baldominos, Yago Saez, Gustavo Recio, and Javier Calle (2015). "Learning Levels of Mario AI Using Genetic Algorithms". In Advances

More information

Reactive Control of Ms. Pac Man using Information Retrieval based on Genetic Programming

Reactive Control of Ms. Pac Man using Information Retrieval based on Genetic Programming Reactive Control of Ms. Pac Man using Information Retrieval based on Genetic Programming Matthias F. Brandstetter Centre for Computational Intelligence De Montfort University United Kingdom, Leicester

More information

Neural Network Principles By Robert L. Harvey

Neural Network Principles By Robert L. Harvey Neural Network Principles By Robert L. Harvey 0130633305 - Neural Network Principles by Harvey, - Neural Network Principles by Robert L. Harvey and a great selection of similar Used, New and Collectible

More information

Creating an Agent of Doom: A Visual Reinforcement Learning Approach

Creating an Agent of Doom: A Visual Reinforcement Learning Approach Creating an Agent of Doom: A Visual Reinforcement Learning Approach Michael Lowney Department of Electrical Engineering Stanford University mlowney@stanford.edu Robert Mahieu Department of Electrical Engineering

More information

Comparison of Monte Carlo Tree Search Methods in the Imperfect Information Card Game Cribbage

Comparison of Monte Carlo Tree Search Methods in the Imperfect Information Card Game Cribbage Comparison of Monte Carlo Tree Search Methods in the Imperfect Information Card Game Cribbage Richard Kelly and David Churchill Computer Science Faculty of Science Memorial University {richard.kelly, dchurchill}@mun.ca

More information

UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces

UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces Jacob Schrum, Igor Karpov, and Risto Miikkulainen {schrum2,ikarpov,risto}@cs.utexas.edu Our Approach: UT^2 Evolve

More information