CS343 Artificial Intelligence

Similar documents
CS 730/830: Intro AI. Prof. Wheeler Ruml. TA Bence Cserna. Thinking inside the box. 5 handouts: course info, project info, schedule, slides, asst 1

CS343 Introduction to Artificial Intelligence Spring 2012

CPS331 Lecture: Intelligent Agents last revised July 25, 2018

CS343 Introduction to Artificial Intelligence Spring 2010

Outline. Introduction to AI. Artificial Intelligence. What is an AI? What is an AI? Agents Environments

Agent. Pengju Ren. Institute of Artificial Intelligence and Robotics

HIT3002: Introduction to Artificial Intelligence

CPS331 Lecture: Agents and Robots last revised April 27, 2012

CS 486/686 Artificial Intelligence

Course Info. CS 486/686 Artificial Intelligence. Outline. Artificial Intelligence (AI)

Unit 12: Artificial Intelligence CS 101, Fall 2018

COMP9414/ 9814/ 3411: Artificial Intelligence. 2. Environment Types. UNSW c Alan Blair,

CPS331 Lecture: Agents and Robots last revised November 18, 2016

Intelligent Agents p.1/25. Intelligent Agents. Chapter 2

Administrivia. CS 188: Artificial Intelligence Spring Agents and Environments. Today. Vacuum-Cleaner World. A Reflex Vacuum-Cleaner

COMP9414/ 9814/ 3411: Artificial Intelligence. Week 2. Classifying AI Tasks

Outline. Agents and environments Rationality PEAS (Performance measure, Environment, Actuators, Sensors) Environment types Agent types

2. Environment Types. COMP9414/ 9814/ 3411: Artificial Intelligence. Agent Model. Agents as functions. The PEAS model of an Agent

LECTURE 26: GAME THEORY 1

Inf2D 01: Intelligent Agents and their Environments

CS 380: ARTIFICIAL INTELLIGENCE RATIONAL AGENTS. Santiago Ontañón

Intelligent Agents & Search Problem Formulation. AIMA, Chapters 2,

Overview Agents, environments, typical components

Introduction to Multi-Agent Systems. Michal Pechoucek & Branislav Bošanský AE4M36MAS Autumn Lect. 1

Structure of Intelligent Agents. Examples of Agents 1. Examples of Agents 2. Intelligent Agents and their Environments. An agent:

Today s Assignment. Outline. Course Objective 1: Agent Architectures. Agent Architecture (Objective 1) Types of Agents (Objective 1)

Section Marks Agents / 8. Search / 10. Games / 13. Logic / 15. Total / 46

CS325 Artificial Intelligence Ch. 5, Games!

Artificial Intelligence

Reinforcement Learning Simulations and Robotics

Informatics 2D: Tutorial 1 (Solutions)

Plan for the 2nd hour. What is AI. Acting humanly: The Turing test. EDAF70: Applied Artificial Intelligence Agents (Chapter 2 of AIMA)

Introduction and History of AI

CSE 473 Artificial Intelligence (AI) Outline

CS 380: ARTIFICIAL INTELLIGENCE

Google DeepMind s AlphaGo vs. world Go champion Lee Sedol

Multi-Robot Teamwork Cooperative Multi-Robot Systems

Prof. Sameer Singh CS 175: PROJECTS IN AI (IN MINECRAFT) WINTER April 6, 2017

CS 380: ARTIFICIAL INTELLIGENCE INTRODUCTION. Santiago Ontañón

CS 343H: Artificial Intelligence. Week 1a: Introduction

CS494/594: Software for Intelligent Robotics

Outline. Game Playing. Game Problems. Game Problems. Types of games Playing a perfect game. Playing an imperfect game

Lecture Overview. c D. Poole and A. Mackworth 2017 Artificial Intelligence, Lecture 1.1, Page 1 1 / 15

CMU-Q Lecture 20:

Multiple Agents. Why can t we all just get along? (Rodney King)

CS 380: ARTIFICIAL INTELLIGENCE

Learning Artificial Intelligence in Large-Scale Video Games

CMSC 372 Artificial Intelligence What is AI? Thinking Like Acting Like Humans Humans Thought Processes Behaviors

Game Artificial Intelligence ( CS 4731/7632 )

In cooperative robotics, the group of robots have the same goals, and thus it is

Integrating Learning in a Multi-Scale Agent

Introduction. Ioannis Rekleitis

Artificial Intelligence (Introduction to)

Introduction To Cognitive Robots

Artificial Intelligence Adversarial Search

Enterprise ISEA of the Future a Technology Vision for Fleet Support

Application of Artificial Neural Networks in Autonomous Mission Planning for Planetary Rovers

CS 309: Autonomous Intelligent Robotics. Instructor: Jivko Sinapov

[31] S. Koenig, C. Tovey, and W. Halliburton. Greedy mapping of terrain.

5.1 State-Space Search Problems

CS 188: Artificial Intelligence

Introduction to Multiagent Systems

CS221 Final Project Report Learn to Play Texas hold em

introduction to the course course structure topics

3.1 Agents. Foundations of Artificial Intelligence. 3.1 Agents. 3.2 Rationality. 3.3 Summary. Introduction: Overview. 3. Introduction: Rational Agents

Artificial Intelligence CS365. Amitabha Mukerjee

Unit-III Chap-II Adversarial Search. Created by: Ashish Shah 1

Outline. What is AI? A brief history of AI State of the art

Our 2-course meal for this evening

Instructor. Artificial Intelligence (Introduction to) What is AI? Introduction. Dr Sergio Tessaris

Autonomous Agents and MultiAgent Systems* Lecture 2

Elements of Artificial Intelligence and Expert Systems

COMP5121 Mobile Robots

COMP219: Artificial Intelligence. Lecture 2: AI Problems and Applications

COMP9414: Artificial Intelligence Problem Solving and Search

MEM455/800 Robotics II/Advance Robotics Winter 2009

Artificial Intelligence: Definition

CS8678_L1. Course Introduction. CS 8678 Introduction to Robotics & AI Dr. Ken Hoganson. Start Momentarily

Lecture 1 What is AI? EECS 348 Intro to Artificial Intelligence Doug Downey

RoboCup. Presented by Shane Murphy April 24, 2003

and : Principles of Autonomy and Decision Making. Prof Brian Williams, Prof Emilio Frazzoli and Sertac Karaman September, 8 th, 2010

CS510 \ Lecture Ariel Stolerman

Hierarchical Controller for Robotic Soccer

Artificial Intelligence

CMU Lecture 22: Game Theory I. Teachers: Gianni A. Di Caro

Artificial Intelligence

CompSci 101 Data into Information and Knowledge. CompSci 101 Introduction to Computer Science

Introduction to Multi-Agent Programming

CS 1571 Introduction to AI Lecture 1. Course overview. CS 1571 Intro to AI. Course administrivia

Modeling Security Decisions as Games

Foundations of Artificial Intelligence Introduction State of the Art Summary. classification: Board Games: Overview

CS 188: Artificial Intelligence Spring Announcements

Announcements. CS 188: Artificial Intelligence Spring Game Playing State-of-the-Art. Overview. Game Playing. GamesCrafters

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Adversarial Search. CS 486/686: Introduction to Artificial Intelligence

Reinforcement Learning for CPS Safety Engineering. Sam Green, Çetin Kaya Koç, Jieliang Luo University of California, Santa Barbara

Introduction Week 1, Lecture 1

POKER AGENTS LD Miller & Adam Eck April 14 & 19, 2011

Introduction to Artificial Intelligence

Multiagent Systems: Intro to Game Theory. CS 486/686: Introduction to Artificial Intelligence

Transcription:

CS343 Artificial Intelligence Prof: Department of Computer Science The University of Texas at Austin

Good Morning, Colleagues

Good Morning, Colleagues Are there any questions?

Logistics Questions about the syllabus?

Logistics Questions about the syllabus? Class registration

Logistics Questions about the syllabus? Class registration Problems with the assignment?

Logistics Questions about the syllabus? Class registration Problems with the assignment? Piazza useful discussion yesterday

Logistics Questions about the syllabus? Class registration Problems with the assignment? Piazza useful discussion yesterday CC Kim (houck@cs), and me on everything

Logistics Questions about the syllabus? Class registration Problems with the assignment? Piazza useful discussion yesterday CC Kim (houck@cs), and me on everything Assignments up through week 3

Logistics Questions about the syllabus? Class registration Problems with the assignment? Piazza useful discussion yesterday CC Kim (houck@cs), and me on everything Assignments up through week 3

Example Intelligent (autonomous) Agents Autonomous robot

Example Intelligent (autonomous) Agents Autonomous robot Information gathering agent Find me the cheapest?

Example Intelligent (autonomous) Agents Autonomous robot Information gathering agent Find me the cheapest? E-commerce agents Decides what to buy/sell and does it

Example Intelligent (autonomous) Agents Autonomous robot Information gathering agent Find me the cheapest? E-commerce agents Decides what to buy/sell and does it Air-traffic controller

Example Intelligent (autonomous) Agents Autonomous robot Information gathering agent Find me the cheapest? E-commerce agents Decides what to buy/sell and does it Air-traffic controller Meeting scheduler

Example Intelligent (autonomous) Agents Autonomous robot Information gathering agent Find me the cheapest? E-commerce agents Decides what to buy/sell and does it Air-traffic controller Meeting scheduler Computer-game-playing agent

Not Intelligent Agents Thermostat Telephone Answering machine Pencil Java object

Environments Environment = sensations, actions

Environments Environment = sensations, actions fully observable vs. partially observable (accessible)

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic)

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic discrete vs. continuous

Environments Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic discrete vs. continuous known vs. unknown

Student Examples game bot robot waiter bowling robot, ping pong player kiva robots, Mars rover, robot suturing agent Wall-E Words with friends word checker thermostat trading agent Siri Briggo piano playing agent unhappiness agent

BE a learning agent

BE a learning agent You, as a group, act as a learning agent

BE a learning agent You, as a group, act as a learning agent Actions: Wave, Stand, Clap

BE a learning agent You, as a group, act as a learning agent Actions: Wave, Stand, Clap Observations: colors, reward

BE a learning agent You, as a group, act as a learning agent Actions: Wave, Stand, Clap Observations: colors, reward Goal: Find an optimal policy

BE a learning agent You, as a group, act as a learning agent Actions: Wave, Stand, Clap Observations: colors, reward Goal: Find an optimal policy Way of selecting actions that gets you the most reward

How did you do it?

How did you do it? What is your policy? What does the world look like?

How did you do it? What is your policy? What does the world look like? +1 1 Stand +10 3 Clap +2 1 Wave 1 1

Formalizing what Just Happened Knowns:

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,...

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,... Unknowns:

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,... Unknowns: S = 4x3 grid R : S A IR P = S O T : S A S

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,... Unknowns: S = 4x3 grid R : S A IR P = S O T : S A S o i = P(s i )

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,... Unknowns: S = 4x3 grid R : S A IR P = S O T : S A S o i = P(s i ) r i = R(s i, a i )

Formalizing what Just Happened Knowns: O = {Blue, Red, Green, Black,...} Rewards in IR A = {W ave, Clap, Stand} o 0, a 0, r 0, o 1, a 1, r 1, o 2,... Unknowns: S = 4x3 grid R : S A IR P = S O T : S A S o i = P(s i ) r i = R(s i, a i ) s i+1 = T (s i, a i )

Describe the environment Environment = sensations, actions

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible)

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic)

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic discrete vs. continuous

Describe the environment Environment = sensations, actions fully observable vs. partially observable (accessible) single-agent vs. multiagent deterministic vs. non-deterministic (stochastic) episodic vs. sequential static vs. dynamic discrete vs. continuous known vs. unknown

Next week: Search Textbook readings Responses both Monday and Wednesday Python tutorial due