Prof. Sameer Singh CS 175: PROJECTS IN AI (IN MINECRAFT) WINTER April 6, 2017

Size: px
Start display at page:

Download "Prof. Sameer Singh CS 175: PROJECTS IN AI (IN MINECRAFT) WINTER April 6, 2017"

Transcription

1 Prof. Sameer Singh CS 175: PROJECTS IN AI (IN MINECRAFT) WINTER 2017 April 6, 2017

2 Upcoming Misc. Check out course webpage and schedule Check out Canvas, especially for deadlines Do the survey by tomorrow, April 7, 2017 Homework Homework 1 will be up soon Meanwhile, install and get Malmo working Due: April 14, 2017 Project Teams are due April 17, 2017, Proposals April 21, 2017 Start assembling teams now! (use Piazza) Start thinking of project ideas CS 175: PROJECTS IN AI (SPRING 2017) 2

3 Projects in AI in Minecraft Project Overview Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 3

4 Projects in AI in Minecraft Project Overview Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 4

5 What is AI? "Artificial intelligence is anything computers can't do yet." - Douglas Hofstadter CS 175: PROJECTS IN AI (SPRING 2017) 5

6 What can a project be? Research Do difficult things automatically, Minecraft is just a testbed Help players do things that are otherwise time-consuming Practical Tool Art Just cool! Use AI/ML to create stuff in the world CS 175: PROJECTS IN AI (SPRING 2017) 6

7 Technical Solution Use Artificial Intelligence or Machine Learning algorithms Artificial Intelligence Machine Learning Heuristic/Adversarial/Local Search Supervised Learning Logic Planning Bayesian Networks Unsupervised Learning Reinforcement Learning Natural Language Processing Computer Vision Recommendation Systems Computer Vision Constraint Satisfaction Time Series Modeling Deep Learning CS 175: PROJECTS IN AI (SPRING 2017) 7

8 Evaluation How would YOU define that your project was a success? Quantitative Evaluation Numerical Metrics: Accuracy, F1, AUC, Time to run, time to train Baselines: What would be currently used? What are reasonable simpler methods? By how much amount? We hope to improve the METRIC by AMOUNT over BASELINE! (I won t hold you to it, just want you to think about it) CS 175: PROJECTS IN AI (SPRING 2017) 8

9 Evaluation How would YOU define that your project was a success? Qualitative Evaluation Simple Example Cases: What are examples that your idea will definitely work on? What is the expected output on these? Error Analysis and Introspection: Are there plots/figures to verify the behavior? If it doesn t work, how will you improve it? The Super-Impressive Example What is the best example? awesome if it works E.g. something that perfectly captures your idea! CS 175: PROJECTS IN AI (SPRING 2017) 9

10 You will have doubts! Is it too simple? Is there data to train my classifier? Is it too ambitious? Is there a different algorithm I should use? Is my evaluation inappropriate? Can I only use off-the-shelf code? Every team has to meet me during Week 4. Use Piazza! Discussion will cover many simple situations Both TA and me are available for appointments CS 175: PROJECTS IN AI (SPRING 2017) 10

11 Projects in AI in Minecraft Project Overview Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 11

12 Projects in AI in Minecraft Course Information Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 12

13 Reinforcement Learning Agent learns to do things by trying things, and succeeding/failing Navigation Explore the map without dying Solve mazes Learn the best way home from anywhere Get to the highest hill in the map Learn Recipes Figure out best way to make items Without any knowledge of the recipes Combat Learn to hide/find shelter Learn to fight, example paper CS 175: PROJECTS IN AI (SPRING 2017) 13

14 Reinforcement Learning Agent learns to do things by trying things, and succeeding/failing Observation What the agent sees Action What the agent can do Reward What the agent likes/dislikes New Item++ No Item- Goal++ Died--- CS 175: PROJECTS IN AI (SPRING 2017) 14

15 Reinforcement Learning Next few lectures will go into details (and more ideas) For now, let s look at non-rl ideas CS 175: PROJECTS IN AI (SPRING 2017) 15

16 Describe the Scene Houses and a pig on a grassy field during the day. Pig staring at me in a village. CS 175: PROJECTS IN AI (SPRING 2017) 16

17 Live Commentator Hit a rabbit CS 175: PROJECTS IN AI (SPRING 2017) 17

18 How is this even possible? 3 block in a line Grass blocks as floor Daylight, clear weather Malmo Training Signal 3 block in a line Deep Learning, CNN + LSTM Machine Learning CS 175: PROJECTS IN AI (SPRING 2017) 18

19 Many Variations of These Label x1000 Your code Agent/World in Malmo x Render x Label Machine Learning object objects action depth of pixel object detection ~caption generation ~action detection, commentary ~stereoscopy, depth/distance prediction CS 175: PROJECTS IN AI (SPRING 2017) 19

20 Captions to Speech Why are you making me read? Pig staring at me in a village. CS 175: PROJECTS IN AI (SPRING 2017) 20

21 Natural Language Navigation Quite Difficult! > Go forward till you hit a wall > Go to the pig > Go to the house on the right > Go behind the house trivial hardest CS 175: PROJECTS IN AI (SPRING 2017) 21

22 Natural Language Interface Quite Difficult! > Choose steel pickaxe and dig > Go and destroy that window > Put the blue block on the closest wall > Find a tree and chop it trivial hardest CS 175: PROJECTS IN AI (SPRING 2017) 22

23 SHRDLU (from 1970!) CS 175: PROJECTS IN AI (SPRING 2017) 23

24 Natural Speech to Commands Why are you making me type? Off the shelf Speech to Text systems Online Speech to Text APIs CS 175: PROJECTS IN AI (SPRING 2017) 24

25 Photo to Minecraft Character Photo of a person Minecraft Skin Your Project Need to label data? Can you use existing classifiers, like Visual QA? CS 175: PROJECTS IN AI (SPRING 2017) 25

26 Recipe Planners Inventory Need (s) Steps > Get 2 wood planks > Make a stick > Get 2 diamonds > Make diamond sword CS 175: PROJECTS IN AI (SPRING 2017) 26

27 Lots of other possibilities Many other games in Minecraft Create AI for those? One AI that works for all of those? CS 175: PROJECTS IN AI (SPRING 2017) 27

28 Projects in AI in Minecraft Course Information Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 28

29 Projects in AI in Minecraft Course Information Some Project Ideas Introduction to Reinforcement Learning Based on slides by David Silver CS 175: PROJECTS IN AI (SPRING 2017) 29

30 Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 30

31 What makes it different? No direct supervision, only rewards Feedback is delayed, not instantaneous Time really matters, i.e. data is sequential Agent s actions affect what data it will receive Examples Fly stunt maneuvers in a helicopter Defeat the world champion at Backgammon Manage an investment portfolio Control a power station Make a humanoid robot walk Play many different Atari games better than humans Beat the world champion in Go CS 175: PROJECTS IN AI (SPRING 2017) 31

32 Agent-Environment Interface Agent decides on an action receives next observation receives next reward Environment executes the action computes next observation computes next reward CS 175: PROJECTS IN AI (SPRING 2017) 32

33 Reward, R t How well the agent is doing +, positive (Good) -, negative (Bad) Nothing about WHY it is doing well, could have little to do with A t-1 Agent is trying to maximize its cumulative reward CS 175: PROJECTS IN AI (SPRING 2017) 33

34 Example of Rewards Fly stunt maneuvers in a helicopter +ve reward for following desired trajectory ve reward for crashing Defeat the world champion at Backgammon +/ ve reward for winning/losing a game Manage an investment portfolio +ve reward for each $ in bank Control a power station +ve reward for producing power ve reward for exceeding safety thresholds Make a humanoid robot walk +ve reward for forward motion ve reward for falling over Play many different Atari games better than humans +/ ve reward for increasing/decreasing score CS 175: PROJECTS IN AI (SPRING 2017) 34

35 Sequential Decision Making Actions have long term consequences Rewards may be delayed May be better to sacrifice short term reward for long term benefit Examples A financial investment (may take months to mature) Refuelling a helicopter (might prevent a crash later) Blocking opponent moves (might eventually help win) Spend a lot of money and go to college (earn more later) Don t commit crimes (rewarded by not going to jail) Get started on Malmo/project soon (make it an easy quarter) A key aspect of intelligence, how far ahead are you able to plan? CS 175: PROJECTS IN AI (SPRING 2017) 35

36 Reinforcement Learning Given an environment (produces observations and rewards) Reinforcement Learning Automated agent that selects actions to maximize total rewards in the environment CS 175: PROJECTS IN AI (SPRING 2017) 36

37 Let s look at the Agent What does the choice of action depend on? Can you ignore O t completely? Is just O t enough? Or (O t,a t )? Is it last few observations? Is it all observations so far? CS 175: PROJECTS IN AI (SPRING 2017) 37

38 Agent State, S t History: everything that happened so far H t = O 1 R 1 A 1 O 2 R 2 A 2 O 3 R 3,,A t-1 O t R t State, S t can be O t O t R t A t-1 O t R t O t-3 O t-2 O t-1 O t In general, S t = f(h t ) You, as AI designer, specify this function CS 175: PROJECTS IN AI (SPRING 2017) 38

39 Agent Policy, π Current state S t π Next action A t Deterministic Policy: A # = π S # Stochastic Policy: π a s = P(A # = a S # = s) Good policy: Leads to larger cumulative reward Bad policy: Leads to worse cumulative reward (we will explore this more in the next week) CS 175: PROJECTS IN AI (SPRING 2017) 39

40 Example: Atari Rules are unknown What makes the score increase? Dynamics are unknown How do actions change pixels? CS 175: PROJECTS IN AI (SPRING 2017) 40

41 Video Time! CS 175: PROJECTS IN AI (SPRING 2017) 41

42 Example: Robotic Soccer CS 175: PROJECTS IN AI (SPRING 2017) 42

43 AlphaGo CS 175: PROJECTS IN AI (SPRING 2017) 43

44 Projects in AI in Minecraft Course Information Some Project Ideas Introduction to Reinforcement Learning CS 175: PROJECTS IN AI (SPRING 2017) 44

CS 730/830: Intro AI. Prof. Wheeler Ruml. TA Bence Cserna. Thinking inside the box. 5 handouts: course info, project info, schedule, slides, asst 1

CS 730/830: Intro AI. Prof. Wheeler Ruml. TA Bence Cserna. Thinking inside the box. 5 handouts: course info, project info, schedule, slides, asst 1 CS 730/830: Intro AI Prof. Wheeler Ruml TA Bence Cserna Thinking inside the box. 5 handouts: course info, project info, schedule, slides, asst 1 Wheeler Ruml (UNH) Lecture 1, CS 730 1 / 23 My Definition

More information

COS 402 Machine Learning and Artificial Intelligence Fall Lecture 1: Intro

COS 402 Machine Learning and Artificial Intelligence Fall Lecture 1: Intro COS 402 Machine Learning and Artificial Intelligence Fall 2016 Lecture 1: Intro Sanjeev Arora Elad Hazan Today s Agenda Defining intelligence and AI state-of-the-art, goals Course outline AI by introspection

More information

Game Artificial Intelligence ( CS 4731/7632 )

Game Artificial Intelligence ( CS 4731/7632 ) Game Artificial Intelligence ( CS 4731/7632 ) Instructor: Stephen Lee-Urban http://www.cc.gatech.edu/~surban6/2018-gameai/ (soon) Piazza T-square What s this all about? Industry standard approaches to

More information

CS440/ECE448 Lecture 11: Stochastic Games, Stochastic Search, and Learned Evaluation Functions

CS440/ECE448 Lecture 11: Stochastic Games, Stochastic Search, and Learned Evaluation Functions CS440/ECE448 Lecture 11: Stochastic Games, Stochastic Search, and Learned Evaluation Functions Slides by Svetlana Lazebnik, 9/2016 Modified by Mark Hasegawa Johnson, 9/2017 Types of game environments Perfect

More information

Google DeepMind s AlphaGo vs. world Go champion Lee Sedol

Google DeepMind s AlphaGo vs. world Go champion Lee Sedol Google DeepMind s AlphaGo vs. world Go champion Lee Sedol Review of Nature paper: Mastering the game of Go with Deep Neural Networks & Tree Search Tapani Raiko Thanks to Antti Tarvainen for some slides

More information

Hierarchical Controller for Robotic Soccer

Hierarchical Controller for Robotic Soccer Hierarchical Controller for Robotic Soccer Byron Knoll Cognitive Systems 402 April 13, 2008 ABSTRACT RoboCup is an initiative aimed at advancing Artificial Intelligence (AI) and robotics research. This

More information

CS343 Introduction to Artificial Intelligence Spring 2012

CS343 Introduction to Artificial Intelligence Spring 2012 CS343 Introduction to Artificial Intelligence Spring 2012 Prof: TA: Daniel Urieli Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Welcome to a fun, but challenging

More information

Reinforcement Learning for CPS Safety Engineering. Sam Green, Çetin Kaya Koç, Jieliang Luo University of California, Santa Barbara

Reinforcement Learning for CPS Safety Engineering. Sam Green, Çetin Kaya Koç, Jieliang Luo University of California, Santa Barbara Reinforcement Learning for CPS Safety Engineering Sam Green, Çetin Kaya Koç, Jieliang Luo University of California, Santa Barbara Motivations Safety-critical duties desired by CPS? Autonomous vehicle control:

More information

CMSC 372 Artificial Intelligence. Fall Administrivia

CMSC 372 Artificial Intelligence. Fall Administrivia CMSC 372 Artificial Intelligence Fall 2017 Administrivia Instructor: Deepak Kumar Lectures: Mon& Wed 10:10a to 11:30a Labs: Fridays 10:10a to 11:30a Pre requisites: CMSC B206 or H106 and CMSC B231 or permission

More information

CS221 Project Final Report Automatic Flappy Bird Player

CS221 Project Final Report Automatic Flappy Bird Player 1 CS221 Project Final Report Automatic Flappy Bird Player Minh-An Quinn, Guilherme Reis Introduction Flappy Bird is a notoriously difficult and addicting game - so much so that its creator even removed

More information

CS 188: Artificial Intelligence Spring Announcements

CS 188: Artificial Intelligence Spring Announcements CS 188: Artificial Intelligence Spring 2011 Lecture 7: Minimax and Alpha-Beta Search 2/9/2011 Pieter Abbeel UC Berkeley Many slides adapted from Dan Klein 1 Announcements W1 out and due Monday 4:59pm P2

More information

CS6700: The Emergence of Intelligent Machines. Prof. Carla Gomes Prof. Bart Selman Cornell University

CS6700: The Emergence of Intelligent Machines. Prof. Carla Gomes Prof. Bart Selman Cornell University EMERGENCE OF INTELLIGENT MACHINES: CHALLENGES AND OPPORTUNITIES CS6700: The Emergence of Intelligent Machines Prof. Carla Gomes Prof. Bart Selman Cornell University Artificial Intelligence After a distinguished

More information

CS325 Artificial Intelligence Ch. 5, Games!

CS325 Artificial Intelligence Ch. 5, Games! CS325 Artificial Intelligence Ch. 5, Games! Cengiz Günay, Emory Univ. vs. Spring 2013 Günay Ch. 5, Games! Spring 2013 1 / 19 AI in Games A lot of work is done on it. Why? Günay Ch. 5, Games! Spring 2013

More information

CS 309: Autonomous Intelligent Robotics FRI I. Instructor: Justin Hart.

CS 309: Autonomous Intelligent Robotics FRI I. Instructor: Justin Hart. CS 309: Autonomous Intelligent Robotics FRI I Instructor: Justin Hart http://justinhart.net/teaching/2017_fall_cs378/ Today Basic Information, Preliminaries FRI Autonomous Robots Overview Panel with the

More information

Unit 12: Artificial Intelligence CS 101, Fall 2018

Unit 12: Artificial Intelligence CS 101, Fall 2018 Unit 12: Artificial Intelligence CS 101, Fall 2018 Learning Objectives After completing this unit, you should be able to: Explain the difference between procedural and declarative knowledge. Describe the

More information

Swing Copters AI. Monisha White and Nolan Walsh Fall 2015, CS229, Stanford University

Swing Copters AI. Monisha White and Nolan Walsh  Fall 2015, CS229, Stanford University Swing Copters AI Monisha White and Nolan Walsh mewhite@stanford.edu njwalsh@stanford.edu Fall 2015, CS229, Stanford University 1. Introduction For our project we created an autonomous player for the game

More information

Announcements. CS 188: Artificial Intelligence Spring Game Playing State-of-the-Art. Overview. Game Playing. GamesCrafters

Announcements. CS 188: Artificial Intelligence Spring Game Playing State-of-the-Art. Overview. Game Playing. GamesCrafters CS 188: Artificial Intelligence Spring 2011 Announcements W1 out and due Monday 4:59pm P2 out and due next week Friday 4:59pm Lecture 7: Mini and Alpha-Beta Search 2/9/2011 Pieter Abbeel UC Berkeley Many

More information

AlphaGo and Artificial Intelligence GUEST LECTURE IN THE GAME OF GO AND SOCIETY

AlphaGo and Artificial Intelligence GUEST LECTURE IN THE GAME OF GO AND SOCIETY AlphaGo and Artificial Intelligence HUCK BENNET T (NORTHWESTERN UNIVERSITY) GUEST LECTURE IN THE GAME OF GO AND SOCIETY AT OCCIDENTAL COLLEGE, 10/29/2018 The Game of Go A game for aliens, presidents, and

More information

Introduction to Computer Science with MakeCode for Minecraft

Introduction to Computer Science with MakeCode for Minecraft Introduction to Computer Science with MakeCode for Minecraft Lesson 9: Artificial Intelligence In this chapter, we ll dive into the popular field of Artificial Intelligence, or AI. From driverless cars,

More information

CS343 Introduction to Artificial Intelligence Spring 2010

CS343 Introduction to Artificial Intelligence Spring 2010 CS343 Introduction to Artificial Intelligence Spring 2010 Prof: TA: Daniel Urieli Department of Computer Science The University of Texas at Austin Good Afternoon, Colleagues Welcome to a fun, but challenging

More information

Artificial Intelligence. Minimax and alpha-beta pruning

Artificial Intelligence. Minimax and alpha-beta pruning Artificial Intelligence Minimax and alpha-beta pruning In which we examine the problems that arise when we try to plan ahead to get the best result in a world that includes a hostile agent (other agent

More information

COMP219: Artificial Intelligence. Lecture 2: AI Problems and Applications

COMP219: Artificial Intelligence. Lecture 2: AI Problems and Applications COMP219: Artificial Intelligence Lecture 2: AI Problems and Applications 1 Introduction Last time General module information Characterisation of AI and what it is about Today Overview of some common AI

More information

Poker AI: Equilibrium, Online Resolving, Deep Learning and Reinforcement Learning

Poker AI: Equilibrium, Online Resolving, Deep Learning and Reinforcement Learning Poker AI: Equilibrium, Online Resolving, Deep Learning and Reinforcement Learning Nikolai Yakovenko NVidia ADLR Group -- Santa Clara CA Columbia University Deep Learning Seminar April 2017 Poker is a Turn-Based

More information

CSC321 Lecture 23: Go

CSC321 Lecture 23: Go CSC321 Lecture 23: Go Roger Grosse Roger Grosse CSC321 Lecture 23: Go 1 / 21 Final Exam Friday, April 20, 9am-noon Last names A Y: Clara Benson Building (BN) 2N Last names Z: Clara Benson Building (BN)

More information

an AI for Slither.io

an AI for Slither.io an AI for Slither.io Jackie Yang(jackiey) Introduction Game playing is a very interesting topic area in Artificial Intelligence today. Most of the recent emerging AI are for turn-based game, like the very

More information

CS 188: Artificial Intelligence

CS 188: Artificial Intelligence CS 188: Artificial Intelligence Adversarial Search Prof. Scott Niekum The University of Texas at Austin [These slides are based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.

More information

LECTURE 26: GAME THEORY 1

LECTURE 26: GAME THEORY 1 15-382 COLLECTIVE INTELLIGENCE S18 LECTURE 26: GAME THEORY 1 INSTRUCTOR: GIANNI A. DI CARO ICE-CREAM WARS http://youtu.be/jilgxenbk_8 2 GAME THEORY Game theory is the formal study of conflict and cooperation

More information

CSE 40171: Artificial Intelligence. Adversarial Search: Games and Optimality

CSE 40171: Artificial Intelligence. Adversarial Search: Games and Optimality CSE 40171: Artificial Intelligence Adversarial Search: Games and Optimality 1 What is a game? Game Playing State-of-the-Art Checkers: 1950: First computer player. 1994: First computer champion: Chinook

More information

LECTURE 1: OVERVIEW. CS 4100: Foundations of AI. Instructor: Robert Platt. (some slides from Chris Amato, Magy Seif El-Nasr, and Stacy Marsella)

LECTURE 1: OVERVIEW. CS 4100: Foundations of AI. Instructor: Robert Platt. (some slides from Chris Amato, Magy Seif El-Nasr, and Stacy Marsella) LECTURE 1: OVERVIEW CS 4100: Foundations of AI Instructor: Robert Platt (some slides from Chris Amato, Magy Seif El-Nasr, and Stacy Marsella) SOME LOGISTICS Class webpage: http://www.ccs.neu.edu/home/rplatt/cs4100_spring2018/index.html

More information

ECE 517: Reinforcement Learning in Artificial Intelligence

ECE 517: Reinforcement Learning in Artificial Intelligence ECE 517: Reinforcement Learning in Artificial Intelligence Lecture 17: Case Studies and Gradient Policy October 29, 2015 Dr. Itamar Arel College of Engineering Department of Electrical Engineering and

More information

SDS PODCAST EPISODE 110 ALPHAGO ZERO

SDS PODCAST EPISODE 110 ALPHAGO ZERO SDS PODCAST EPISODE 110 ALPHAGO ZERO Show Notes: http://www.superdatascience.com/110 1 Kirill: This is episode number 110, AlphaGo Zero. Welcome back ladies and gentlemen to the SuperDataSceince podcast.

More information

CS 188: Artificial Intelligence Spring Game Playing in Practice

CS 188: Artificial Intelligence Spring Game Playing in Practice CS 188: Artificial Intelligence Spring 2006 Lecture 23: Games 4/18/2006 Dan Klein UC Berkeley Game Playing in Practice Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in 1994.

More information

Game-Playing & Adversarial Search

Game-Playing & Adversarial Search Game-Playing & Adversarial Search This lecture topic: Game-Playing & Adversarial Search (two lectures) Chapter 5.1-5.5 Next lecture topic: Constraint Satisfaction Problems (two lectures) Chapter 6.1-6.4,

More information

CS 188: Artificial Intelligence

CS 188: Artificial Intelligence CS 188: Artificial Intelligence Adversarial Search Instructor: Stuart Russell University of California, Berkeley Game Playing State-of-the-Art Checkers: 1950: First computer player. 1959: Samuel s self-taught

More information

COMP9414/ 9814/ 3411: Artificial Intelligence. Week 2. Classifying AI Tasks

COMP9414/ 9814/ 3411: Artificial Intelligence. Week 2. Classifying AI Tasks COMP9414/ 9814/ 3411: Artificial Intelligence Week 2. Classifying AI Tasks Russell & Norvig, Chapter 2. COMP9414/9814/3411 18s1 Tasks & Agent Types 1 Examples of AI Tasks Week 2: Wumpus World, Robocup

More information

Tutorial of Reinforcement: A Special Focus on Q-Learning

Tutorial of Reinforcement: A Special Focus on Q-Learning Tutorial of Reinforcement: A Special Focus on Q-Learning TINGWU WANG, MACHINE LEARNING GROUP, UNIVERSITY OF TORONTO Contents 1. Introduction 1. Discrete Domain vs. Continous Domain 2. Model Based vs. Model

More information

Learning Artificial Intelligence in Large-Scale Video Games

Learning Artificial Intelligence in Large-Scale Video Games Learning Artificial Intelligence in Large-Scale Video Games A First Case Study with Hearthstone: Heroes of WarCraft Master Thesis Submitted for the Degree of MSc in Computer Science & Engineering Author

More information

RISTO MIIKKULAINEN, SENTIENT (HTTP://VENTUREBEAT.COM/AUTHOR/RISTO-MIIKKULAINEN- SATIENT/) APRIL 3, :23 PM

RISTO MIIKKULAINEN, SENTIENT (HTTP://VENTUREBEAT.COM/AUTHOR/RISTO-MIIKKULAINEN- SATIENT/) APRIL 3, :23 PM 1,2 Guest Machines are becoming more creative than humans RISTO MIIKKULAINEN, SENTIENT (HTTP://VENTUREBEAT.COM/AUTHOR/RISTO-MIIKKULAINEN- SATIENT/) APRIL 3, 2016 12:23 PM TAGS: ARTIFICIAL INTELLIGENCE

More information

Artificial Intelligence Adversarial Search

Artificial Intelligence Adversarial Search Artificial Intelligence Adversarial Search Adversarial Search Adversarial search problems games They occur in multiagent competitive environments There is an opponent we can t control planning again us!

More information

CS 4700: Foundations of Artificial Intelligence

CS 4700: Foundations of Artificial Intelligence CS 4700: Foundations of Artificial Intelligence Bart Selman Reinforcement Learning R&N Chapter 21 Note: in the next two parts of RL, some of the figure/section numbers refer to an earlier edition of R&N

More information

What is Artificial Intelligence? Alternate Definitions (Russell + Norvig) Human intelligence

What is Artificial Intelligence? Alternate Definitions (Russell + Norvig) Human intelligence CSE 3401: Intro to Artificial Intelligence & Logic Programming Introduction Required Readings: Russell & Norvig Chapters 1 & 2. Lecture slides adapted from those of Fahiem Bacchus. What is AI? What is

More information

CMSC 671 Project Report- Google AI Challenge: Planet Wars

CMSC 671 Project Report- Google AI Challenge: Planet Wars 1. Introduction Purpose The purpose of the project is to apply relevant AI techniques learned during the course with a view to develop an intelligent game playing bot for the game of Planet Wars. Planet

More information

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro CMU 15-781 Lecture 22: Game Theory I Teachers: Gianni A. Di Caro GAME THEORY Game theory is the formal study of conflict and cooperation in (rational) multi-agent systems Decision-making where several

More information

Decision Making in Multiplayer Environments Application in Backgammon Variants

Decision Making in Multiplayer Environments Application in Backgammon Variants Decision Making in Multiplayer Environments Application in Backgammon Variants PhD Thesis by Nikolaos Papahristou AI researcher Department of Applied Informatics Thessaloniki, Greece Contributions Expert

More information

Administrivia. CS 188: Artificial Intelligence Spring Agents and Environments. Today. Vacuum-Cleaner World. A Reflex Vacuum-Cleaner

Administrivia. CS 188: Artificial Intelligence Spring Agents and Environments. Today. Vacuum-Cleaner World. A Reflex Vacuum-Cleaner CS 188: Artificial Intelligence Spring 2006 Lecture 2: Agents 1/19/2006 Administrivia Reminder: Drop-in Python/Unix lab Friday 1-4pm, 275 Soda Hall Optional, but recommended Accommodation issues Project

More information

Playful AI Education. Todd W. Neller Gettysburg College

Playful AI Education. Todd W. Neller Gettysburg College Playful AI Education Todd W. Neller Gettysburg College Introduction Teachers teach best when sharing from the core of their enjoyment of the material. E.g. Those with enthusiasm for graphics should use

More information

Jane Li. Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute

Jane Li. Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute Jane Li Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute (2 pts) How to avoid obstacles when reproducing a trajectory using a learned DMP?

More information

CS325 Artificial Intelligence Robotics I Autonomous Robots (Ch. 25)

CS325 Artificial Intelligence Robotics I Autonomous Robots (Ch. 25) CS325 Artificial Intelligence Robotics I Autonomous Robots (Ch. 25) Dr. Cengiz Günay, Emory Univ. Günay Robotics I Autonomous Robots (Ch. 25) Spring 2013 1 / 15 Robots As Killers? The word robot coined

More information

Lab 11: GoFirst and Nim 12:00 PM, Nov 19, 2017

Lab 11: GoFirst and Nim 12:00 PM, Nov 19, 2017 CS17 Integrated Introduction to Computer Science Hughes Contents Lab 11: GoFirst and Nim 12:00 PM, Nov 19, 2017 1 Prologue 1 2 Game Theory 1 3 Game Signature 2 4 GoFirst, A Game Module 3 5 Nim, A Game

More information

CMU-Q Lecture 20:

CMU-Q Lecture 20: CMU-Q 15-381 Lecture 20: Game Theory I Teacher: Gianni A. Di Caro ICE-CREAM WARS http://youtu.be/jilgxenbk_8 2 GAME THEORY Game theory is the formal study of conflict and cooperation in (rational) multi-agent

More information

Computer Science Faculty Publications

Computer Science Faculty Publications Computer Science Faculty Publications Computer Science 2-4-2017 Playful AI Education Todd W. Neller Gettysburg College Follow this and additional works at: https://cupola.gettysburg.edu/csfac Part of the

More information

CS10 The Beauty and Joy of Computing

CS10 The Beauty and Joy of Computing CS10 The Beauty and Joy of Computing Lecture #15 Artificial Intelligence UC Berkeley EECS Lecturer SOE Dan Garcia 2011-10-24 The PRIMER-V2 robot is capable of starting from a stopped position, start riding,

More information

Programming Project 1: Pacman (Due )

Programming Project 1: Pacman (Due ) Programming Project 1: Pacman (Due 8.2.18) Registration to the exams 521495A: Artificial Intelligence Adversarial Search (Min-Max) Lectured by Abdenour Hadid Adjunct Professor, CMVS, University of Oulu

More information

Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go

Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go Farhad Haqiqat and Martin Müller University of Alberta Edmonton, Canada Contents Motivation and research goals Feature Knowledge

More information

CS 229 Final Project: Using Reinforcement Learning to Play Othello

CS 229 Final Project: Using Reinforcement Learning to Play Othello CS 229 Final Project: Using Reinforcement Learning to Play Othello Kevin Fry Frank Zheng Xianming Li ID: kfry ID: fzheng ID: xmli 16 December 2016 Abstract We built an AI that learned to play Othello.

More information

Minecraft IRL (In Real Life) Ages: I held this program as a middle school afterschool event. You could easily scale the difficulty up or down.

Minecraft IRL (In Real Life) Ages: I held this program as a middle school afterschool event. You could easily scale the difficulty up or down. Minecraft IRL (In Real Life) Ages: I held this program as a middle school afterschool event. You could easily scale the difficulty up or down. Size: I planned for a group of 15. Time: My program lasted

More information

CS 188: Artificial Intelligence. Overview

CS 188: Artificial Intelligence. Overview CS 188: Artificial Intelligence Lecture 6 and 7: Search for Games Pieter Abbeel UC Berkeley Many slides adapted from Dan Klein 1 Overview Deterministic zero-sum games Minimax Limited depth and evaluation

More information

Adversarial Search. Hal Daumé III. Computer Science University of Maryland CS 421: Introduction to Artificial Intelligence 9 Feb 2012

Adversarial Search. Hal Daumé III. Computer Science University of Maryland CS 421: Introduction to Artificial Intelligence 9 Feb 2012 1 Hal Daumé III (me@hal3.name) Adversarial Search Hal Daumé III Computer Science University of Maryland me@hal3.name CS 421: Introduction to Artificial Intelligence 9 Feb 2012 Many slides courtesy of Dan

More information

46.1 Introduction. Foundations of Artificial Intelligence Introduction MCTS in AlphaGo Neural Networks. 46.

46.1 Introduction. Foundations of Artificial Intelligence Introduction MCTS in AlphaGo Neural Networks. 46. Foundations of Artificial Intelligence May 30, 2016 46. AlphaGo and Outlook Foundations of Artificial Intelligence 46. AlphaGo and Outlook Thomas Keller Universität Basel May 30, 2016 46.1 Introduction

More information

Artificial Intelligence ( CS 365 ) IMPLEMENTATION OF AI SCRIPT GENERATOR USING DYNAMIC SCRIPTING FOR AOE2 GAME

Artificial Intelligence ( CS 365 ) IMPLEMENTATION OF AI SCRIPT GENERATOR USING DYNAMIC SCRIPTING FOR AOE2 GAME Artificial Intelligence ( CS 365 ) IMPLEMENTATION OF AI SCRIPT GENERATOR USING DYNAMIC SCRIPTING FOR AOE2 GAME Author: Saurabh Chatterjee Guided by: Dr. Amitabha Mukherjee Abstract: I have implemented

More information

How to Survive Your First Night in Minecraft

How to Survive Your First Night in Minecraft How to Survive Your First Night in Minecraft Survival Mode Lizza Igoe, Colleen Griffiths, Becky Hayes Objectives 1. 2. 3. 4. 5. Explore Your Surroundings Gather Resources Build a Shelter Craft Tools Avoid

More information

COMP219: COMP219: Artificial Intelligence Artificial Intelligence Dr. Annabel Latham Lecture 12: Game Playing Overview Games and Search

COMP219: COMP219: Artificial Intelligence Artificial Intelligence Dr. Annabel Latham Lecture 12: Game Playing Overview Games and Search COMP19: Artificial Intelligence COMP19: Artificial Intelligence Dr. Annabel Latham Room.05 Ashton Building Department of Computer Science University of Liverpool Lecture 1: Game Playing 1 Overview Last

More information

Game Playing for a Variant of Mancala Board Game (Pallanguzhi)

Game Playing for a Variant of Mancala Board Game (Pallanguzhi) Game Playing for a Variant of Mancala Board Game (Pallanguzhi) Varsha Sankar (SUNet ID: svarsha) 1. INTRODUCTION Game playing is a very interesting area in the field of Artificial Intelligence presently.

More information

REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING

REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING RIKA ANTONOVA ANTONOVA@KTH.SE ALI GHADIRZADEH ALGH@KTH.SE RL: What We Know So Far Formulate the problem as an MDP (or POMDP) State space captures

More information

History and Philosophical Underpinnings

History and Philosophical Underpinnings History and Philosophical Underpinnings Last Class Recap game-theory why normal search won t work minimax algorithm brute-force traversal of game tree for best move alpha-beta pruning how to improve on

More information

Introduction and History of AI

Introduction and History of AI 15-780 Introduction and History of AI J. Zico Kolter January 13, 2014 1 What is AI? 2 Some classic definitions Buildings computers that... Think like humans Act like humans Think rationally Act rationally

More information

Humanoid Robot NAO: Developing Behaviors for Football Humanoid Robots

Humanoid Robot NAO: Developing Behaviors for Football Humanoid Robots Humanoid Robot NAO: Developing Behaviors for Football Humanoid Robots State of the Art Presentation Luís Miranda Cruz Supervisors: Prof. Luis Paulo Reis Prof. Armando Sousa Outline 1. Context 1.1. Robocup

More information

CS10 The Beauty and Joy of Computing

CS10 The Beauty and Joy of Computing CS10 The Beauty and Joy of Computing Lecture #21 Artificial Intelligence UC Berkeley EECS Lecturer SOE Dan Garcia 2011-04-13 IBM s Watson is being used by researchers in Canada to provide early warnings

More information

Mastering Chess and Shogi by Self- Play with a General Reinforcement Learning Algorithm

Mastering Chess and Shogi by Self- Play with a General Reinforcement Learning Algorithm Mastering Chess and Shogi by Self- Play with a General Reinforcement Learning Algorithm by Silver et al Published by Google Deepmind Presented by Kira Selby Background u In March 2016, Deepmind s AlphaGo

More information

CS 771 Artificial Intelligence. Adversarial Search

CS 771 Artificial Intelligence. Adversarial Search CS 771 Artificial Intelligence Adversarial Search Typical assumptions Two agents whose actions alternate Utility values for each agent are the opposite of the other This creates the adversarial situation

More information

Artificial Intelligence: Definition

Artificial Intelligence: Definition Lecture Notes Artificial Intelligence: Definition Dae-Won Kim School of Computer Science & Engineering Chung-Ang University What are AI Systems? Deep Blue defeated the world chess champion Garry Kasparov

More information

CS 4700: Foundations of Artificial Intelligence

CS 4700: Foundations of Artificial Intelligence CS 4700: Foundations of Artificial Intelligence selman@cs.cornell.edu Module: Adversarial Search R&N: Chapter 5 1 Outline Adversarial Search Optimal decisions Minimax α-β pruning Case study: Deep Blue

More information

Monte Carlo Tree Search

Monte Carlo Tree Search Monte Carlo Tree Search 1 By the end, you will know Why we use Monte Carlo Search Trees The pros and cons of MCTS How it is applied to Super Mario Brothers and Alpha Go 2 Outline I. Pre-MCTS Algorithms

More information

CRYPTOSHOOTER MULTI AGENT BASED SECRET COMMUNICATION IN AUGMENTED VIRTUALITY

CRYPTOSHOOTER MULTI AGENT BASED SECRET COMMUNICATION IN AUGMENTED VIRTUALITY CRYPTOSHOOTER MULTI AGENT BASED SECRET COMMUNICATION IN AUGMENTED VIRTUALITY Submitted By: Sahil Narang, Sarah J Andrabi PROJECT IDEA The main idea for the project is to create a pursuit and evade crowd

More information

Artificial Intelligence for Engineers. EE 562 Winter 2015

Artificial Intelligence for Engineers. EE 562 Winter 2015 Artificial Intelligence for Engineers EE 562 Winter 2015 1 Administrative Details Instructor: Linda Shapiro, 634 CSE, shapiro@cs.washington.edu TA: ½ time Bilge Soran, bilge@cs.washington.edu Course Home

More information

Outline. Game Playing. Game Problems. Game Problems. Types of games Playing a perfect game. Playing an imperfect game

Outline. Game Playing. Game Problems. Game Problems. Types of games Playing a perfect game. Playing an imperfect game Outline Game Playing ECE457 Applied Artificial Intelligence Fall 2007 Lecture #5 Types of games Playing a perfect game Minimax search Alpha-beta pruning Playing an imperfect game Real-time Imperfect information

More information

COMP9414: Artificial Intelligence Adversarial Search

COMP9414: Artificial Intelligence Adversarial Search CMP9414, Wednesday 4 March, 004 CMP9414: Artificial Intelligence In many problems especially game playing you re are pitted against an opponent This means that certain operators are beyond your control

More information

Human Computation and Crowdsourcing Systems

Human Computation and Crowdsourcing Systems Human Computation and Crowdsourcing Systems Walter S. Lasecki EECS 598, Fall 2015 Who am I? http://wslasecki.com New to UMich! Prof in CSE, SI BS, Virginia Tech, CS/Math PhD, University of Rochester, CS

More information

Games (adversarial search problems)

Games (adversarial search problems) Mustafa Jarrar: Lecture Notes on Games, Birzeit University, Palestine Fall Semester, 204 Artificial Intelligence Chapter 6 Games (adversarial search problems) Dr. Mustafa Jarrar Sina Institute, University

More information

CS343 Artificial Intelligence

CS343 Artificial Intelligence CS343 Artificial Intelligence Prof: Department of Computer Science The University of Texas at Austin Good Morning, Colleagues Good Morning, Colleagues Are there any questions? Logistics Questions about

More information

Announcements. Homework 1. Project 1. Due tonight at 11:59pm. Due Friday 2/8 at 4:00pm. Electronic HW1 Written HW1

Announcements. Homework 1. Project 1. Due tonight at 11:59pm. Due Friday 2/8 at 4:00pm. Electronic HW1 Written HW1 Announcements Homework 1 Due tonight at 11:59pm Project 1 Electronic HW1 Written HW1 Due Friday 2/8 at 4:00pm CS 188: Artificial Intelligence Adversarial Search and Game Trees Instructors: Sergey Levine

More information

CS510 \ Lecture Ariel Stolerman

CS510 \ Lecture Ariel Stolerman CS510 \ Lecture04 2012-10-15 1 Ariel Stolerman Administration Assignment 2: just a programming assignment. Midterm: posted by next week (5), will cover: o Lectures o Readings A midterm review sheet will

More information

Using Artificial intelligent to solve the game of 2048

Using Artificial intelligent to solve the game of 2048 Using Artificial intelligent to solve the game of 2048 Ho Shing Hin (20343288) WONG, Ngo Yin (20355097) Lam Ka Wing (20280151) Abstract The report presents the solver of the game 2048 base on artificial

More information

Foundations of Artificial Intelligence Introduction State of the Art Summary. classification: Board Games: Overview

Foundations of Artificial Intelligence Introduction State of the Art Summary. classification: Board Games: Overview Foundations of Artificial Intelligence May 14, 2018 40. Board Games: Introduction and State of the Art Foundations of Artificial Intelligence 40. Board Games: Introduction and State of the Art 40.1 Introduction

More information

6. Games. COMP9414/ 9814/ 3411: Artificial Intelligence. Outline. Mechanical Turk. Origins. origins. motivation. minimax search

6. Games. COMP9414/ 9814/ 3411: Artificial Intelligence. Outline. Mechanical Turk. Origins. origins. motivation. minimax search COMP9414/9814/3411 16s1 Games 1 COMP9414/ 9814/ 3411: Artificial Intelligence 6. Games Outline origins motivation Russell & Norvig, Chapter 5. minimax search resource limits and heuristic evaluation α-β

More information

POKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011

POKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011 POKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011 Motivation Classic environment properties of MAS Stochastic behavior (agents and environment) Incomplete information Uncertainty Application Examples

More information

CS 380: ARTIFICIAL INTELLIGENCE MONTE CARLO SEARCH. Santiago Ontañón

CS 380: ARTIFICIAL INTELLIGENCE MONTE CARLO SEARCH. Santiago Ontañón CS 380: ARTIFICIAL INTELLIGENCE MONTE CARLO SEARCH Santiago Ontañón so367@drexel.edu Recall: Adversarial Search Idea: When there is only one agent in the world, we can solve problems using DFS, BFS, ID,

More information

DeepMind Self-Learning Atari Agent

DeepMind Self-Learning Atari Agent DeepMind Self-Learning Atari Agent Human-level control through deep reinforcement learning Nature Vol 518, Feb 26, 2015 The Deep Mind of Demis Hassabis Backchannel / Medium.com interview with David Levy

More information

Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente

Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente Training a Back-Propagation Network with Temporal Difference Learning and a database for the board game Pente Valentijn Muijrers 3275183 Valentijn.Muijrers@phil.uu.nl Supervisor: Gerard Vreeswijk 7,5 ECTS

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Adversarial Search Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]

More information

Board Game AIs. With a Focus on Othello. Julian Panetta March 3, 2010

Board Game AIs. With a Focus on Othello. Julian Panetta March 3, 2010 Board Game AIs With a Focus on Othello Julian Panetta March 3, 2010 1 Practical Issues Bug fix for TimeoutException at player init Not an issue for everyone Download updated project files from CS2 course

More information

CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes.

CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes. CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes. Artificial Intelligence A branch of Computer Science. Examines how we can achieve intelligent

More information

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Adversarial Search Instructors: David Suter and Qince Li Course Delivered @ Harbin Institute of Technology [Many slides adapted from those created by Dan Klein and Pieter Abbeel

More information

Reinforcement Learning Agent for Scrolling Shooter Game

Reinforcement Learning Agent for Scrolling Shooter Game Reinforcement Learning Agent for Scrolling Shooter Game Peng Yuan (pengy@stanford.edu) Yangxin Zhong (yangxin@stanford.edu) Zibo Gong (zibo@stanford.edu) 1 Introduction and Task Definition 1.1 Game Agent

More information

Adversarial Search. Human-aware Robotics. 2018/01/25 Chapter 5 in R&N 3rd Ø Announcement: Slides for this lecture are here:

Adversarial Search. Human-aware Robotics. 2018/01/25 Chapter 5 in R&N 3rd Ø Announcement: Slides for this lecture are here: Adversarial Search 2018/01/25 Chapter 5 in R&N 3rd Ø Announcement: q Slides for this lecture are here: http://www.public.asu.edu/~yzhan442/teaching/cse471/lectures/adversarial.pdf Slides are largely based

More information

Experiments with Tensor Flow Roman Weber (Geschäftsführer) Richard Schmid (Senior Consultant)

Experiments with Tensor Flow Roman Weber (Geschäftsführer) Richard Schmid (Senior Consultant) Experiments with Tensor Flow 23.05.2017 Roman Weber (Geschäftsführer) Richard Schmid (Senior Consultant) WEBGATE CONSULTING Gegründet Mitarbeiter CH Inhaber geführt IT Anbieter Partner 2001 Ex 29 Beratung

More information

Spring 19 Planning Techniques for Robotics Introduction; What is Planning for Robotics?

Spring 19 Planning Techniques for Robotics Introduction; What is Planning for Robotics? 16-350 Spring 19 Planning Techniques for Robotics Introduction; What is Planning for Robotics? Maxim Likhachev Robotics Institute Carnegie Mellon University About Me My Research Interests: - Planning,

More information

Welcome to CompSci 171 Fall 2010 Introduction to AI.

Welcome to CompSci 171 Fall 2010 Introduction to AI. Welcome to CompSci 171 Fall 2010 Introduction to AI. http://www.ics.uci.edu/~welling/teaching/ics171spring07/ics171fall09.html Instructor: Max Welling, welling@ics.uci.edu Office hours: Wed. 4-5pm in BH

More information

Chapter 4: Internal Economy. Hamzah Asyrani Sulaiman

Chapter 4: Internal Economy. Hamzah Asyrani Sulaiman Chapter 4: Internal Economy Hamzah Asyrani Sulaiman in games, the internal economy can include all sorts of resources that are not part of a reallife economy. In games, things like health, experience,

More information

CSSE220 BomberMan programming assignment Team Project

CSSE220 BomberMan programming assignment Team Project CSSE220 BomberMan programming assignment Team Project You will write a game that is patterned off the 1980 s BomberMan game. You can find a description of the game, and much more information here: http://strategywiki.org/wiki/bomberman

More information