CS6550 Computer Vision

Class Meeting: M7M8 (3:30pm-5:20pm), R6 (2:20pm-3:10pm), Rm. 106, Delta Bldg.
Instructor: Prof. Shang-Hong Lai (賴尚宏), Rm. 636, Delta Bldg., Tel: ext. 42958, Email: lai@cs.nthu.edu.tw, URL: http://www.cs.nthu.edu.tw/~lai
Office Hours: R7R8 or by appointment
Teaching Assistants: 李東穎, 蘇宏任, Rm. 720 and 721, Delta Bldg. (CV lab), Email: d9562818@oz.nthu.edu.tw; suhongren@gmail.com

Prerequisites
Linear Algebra
Probability and Statistics
Basic Programming

Course Description
This course provides an introductory background in computer vision so that graduate students can start research in the field. Lectures focus on representative computer vision algorithms. You will implement some of these algorithms as programs for the homework assignments and the final project.

Course Contents
1. Image Formation (1 week)
2. Image Features (2 weeks)
3. Image Segmentation (2 weeks)
4. Camera Calibration (1 week)
5. Two-View Geometry (1 week)
6. Stereo Reconstruction (1 week)
7. Image Matching (1 week)
8. Motion Analysis (1 week)
9. Object Recognition (1 week)
10. Augmented Reality (1 week)

Textbooks
Primary:
Computer Vision: Algorithms and Applications, by Richard Szeliski, draft (9/3/2010 version), http://szeliski.org/book/
Secondary:
Computer Vision: A Modern Approach, by David Forsyth and Jean Ponce, Prentice Hall, 2003.
Image Processing, Analysis, and Machine Vision, by M. Sonka, V. Hlavac, R. Boyle, Thomson Engineering, 3rd Edition, 2007(8).
Lecture slides distributed in class.

Sample Contents

Grading
Midterm Exam (11/26): 30%
Final Project: 20%
Homework (4 assignments): 40%
Class Participation: 5%
Quizzes: 5%
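As a worked example of the grading weights, the following minimal Python sketch combines component scores into a course total. The weights come from the slide; the component names and the 0-100 scale for each component are assumptions made for illustration.

```python
# Weights from the grading slide; together they account for 100% of the grade.
WEIGHTS = {
    "midterm": 0.30,
    "final_project": 0.20,
    "homework": 0.40,
    "participation": 0.05,
    "quizzes": 0.05,
}


def final_score(scores):
    """Return the weighted course score.

    `scores` maps each component name to a score on an assumed 0-100 scale.
    """
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


if __name__ == "__main__":
    # Hypothetical student scores, for illustration only.
    example = {
        "midterm": 80,
        "final_project": 90,
        "homework": 85,
        "participation": 100,
        "quizzes": 70,
    }
    print(final_score(example))
```

Because homework carries the largest weight, a 10-point change in the homework average moves the total by 4 points, more than the same change in any other component.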

Homework Policy
Discussion of homework is encouraged, but you must write up your own work. The no-copying rule is strictly enforced.
Homework must be submitted before the announced due time, normally before the lecture.
Late homework is penalized 25% per day.
No make-up homework is accepted later than 4 days after the deadline.
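The late-homework rule can be illustrated with a short Python sketch. The slide gives only the 25% per day rate and the 4-day limit; this sketch assumes the penalty is a linear deduction from the earned score and that a partial day counts as a full day. Under those assumptions the two rules agree, since four days of 25% deductions already reduce the score to zero.

```python
import math

PENALTY_PER_DAY = 0.25  # from the homework policy slide


def late_adjusted_score(raw_score, days_late):
    """Return the score after the late penalty.

    Assumptions (not stated in the source): the deduction is linear in
    the number of days late, and any partial day counts as a full day.
    """
    if days_late < 0:
        raise ValueError("days_late must be non-negative")
    whole_days = math.ceil(days_late)
    # The factor reaches 0 at 4 days late and never goes negative.
    factor = max(0.0, 1.0 - PENALTY_PER_DAY * whole_days)
    return raw_score * factor


if __name__ == "__main__":
    for days in (0, 1, 2, 4):
        print(days, late_adjusted_score(80, days))
```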

Course Webpage
http://cv.cs.nthu.edu.tw/courses.php
The page contains the course slides, basic course information, and class announcements. Important course announcements will also be posted there.

Class Participation
Class attendance is required and is the basic requirement for class participation.
Asking questions is strongly encouraged.
Extra credit will be given for finding mistakes or asking questions.

CS 6550 Classroom Rules
No eating is permitted in class.
No sleeping is allowed in class.
Disturbance to others in class should be minimized.
Cell phones should be turned off during class.

Computer Vision Make computers understand images and video. What kind of scene? Where are the cars? How far is the building?

What is Computer Vision?
To extract useful information about real physical objects and scenes from sensed images/video.
3D reconstruction from images
Object detection/recognition
Automatic understanding of images and video
Computing properties of the 3D world from visual data (measurement)
Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities (perception and interpretation)

Vision for measurement
Real-time stereo (NASA Mars Rover)
Structure from motion (Pollefeys et al.)
Multi-view stereo for community photo collections (Goesele et al.)
Slide credit: L. Lazebnik

Vision for perception, interpretation
[Figure: annotated amusement park photo with labels such as The Wicked Twister, Lake Erie, Cedar Point, Ferris wheel, carousel, maxair, ride, deck, sky, tree, bench, water, umbrellas, pedestrians, people waiting in line, people sitting on ride]
Objects, Activities, Scenes, Locations, Text / writing, Faces, Gestures, Motions, Emotions

Related Disciplines
Computer vision is related to: graphics, image processing, artificial intelligence, algorithms, machine learning, cognitive science.

Vision and Graphics
Vision: images to model. Graphics: model to images.
Inverse problems: analysis and synthesis.

Why computer vision?
As image sources multiply, so do applications:
Relieve humans of boring, easy tasks
Enhance human abilities: human-computer interaction, visualization
Perception for robotics / autonomous agents
Organize and give access to visual content

Why computer vision?
Images and videos are everywhere!
Personal photo albums
Movies, news, sports
Surveillance and security
Medical and scientific images
Slide credit: L. Lazebnik

Why does computer vision matter?
Safety, Health, Security, Comfort, Fun, Access

Again, what is computer vision? Mathematics of geometry of image formation? Statistics of the natural world? Models for neuroscience? Engineering methods for matching images? Science Fiction?

Very brief history of computer vision
1966: Minsky assigns computer vision as an undergrad summer project
1960s: interpretation of synthetic worlds
1970s: some progress on interpreting selected images
1980s: ANNs come and go; shift toward geometry and increased mathematical rigor
1990s: face recognition; statistical analysis in vogue
2000s: broader recognition; large annotated datasets available; computational photography starts
Figures: Guzman '68; Ohta and Kanade '78; Turk and Pentland '91

Applications of Computer Vision
Robot Vision / Autonomous Vehicles
Biometric Identification / Recognition
Industrial Inspection
Video Surveillance
Digital Camera
Medical Image Analysis/Processing
Remote Sensing
Multimedia Retrieval
Augmented Reality

Consumer Applications
(a) image stitching: merging different views (Szeliski and Shum 1997)
(b) exposure bracketing: merging different exposures

Real-Time Stereo Camera
Point Grey Research makes the Bumblebee, a video-rate stereo camera (640 x 480 at 30 fps).
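To give a feel for what a video-rate stereo camera has to compute, the sketch below works out the pixel throughput implied by 640 x 480 at 30 fps and applies the standard rectified-stereo relation Z = f * B / d (depth equals focal length times baseline divided by disparity). The focal length, baseline, and disparity values are hypothetical placeholders, not Bumblebee specifications.

```python
def pixel_rate(width, height, fps, cameras=2):
    """Pixels per second that a rig with `cameras` sensors must process."""
    return width * height * fps * cameras


def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Depth in meters for a rectified stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive (zero means infinitely far)")
    return focal_px * baseline_m / disparity_px


if __name__ == "__main__":
    # Resolution and frame rate are taken from the slide.
    print("pixels per second:", pixel_rate(640, 480, 30))

    # Hypothetical calibration values, for illustration only.
    focal_px = 800.0    # focal length in pixels (assumed)
    baseline_m = 0.12   # distance between the two lenses in meters (assumed)
    for d in (4, 16, 64):
        print(d, "px ->", depth_from_disparity(d, focal_px, baseline_m), "m")
```

Depth is inversely proportional to disparity, so nearby objects produce large disparities and distant ones produce small disparities that are harder to measure accurately.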

3D Reconstruction from Images

Earth viewers (3D modeling)
Image from Microsoft's Virtual Earth (see also: Google Earth)

Photosynth http://photosynth.net/

Object Detection

Optical Character Recognition (OCR)
Technology to convert scanned documents to text. If you have a scanner, it probably came with OCR software.
Digit recognition, AT&T labs: http://www.research.att.com/~yann/
License plate readers: http://en.wikipedia.org/wiki/automatic_number_plate_recognition

Face Detection
Many new digital cameras now detect faces (Canon, Sony, Fuji, and others).

Smile detection? Sony Cyber-shot T70 Digital Still Camera

Face Detection and Recognition
Face detection algorithms, coupled with color-based clothing and hair detection algorithms, can locate and recognize the individuals in this image (Sivic, Zitnick, and Szeliski 2006).

Biometric Recognition Who is she?

Vision-based Biometrics How the Afghan Girl was Identified by Her Iris Patterns? http://www.cl.cam.ac.uk/~jgd1000/afghan.html

Login without a password
Fingerprint scanners on many new laptops, other devices
Face recognition systems now beginning to appear more widely: http://www.sensiblevision.com/

Object Recognition (in mobile phones)
This is becoming real: Google Goggles; Nokia Point & Find

Special effects: shape capture The Matrix movies, ESC Entertainment, XYZRGB, NRC

Sports Augmented Reality
Sportvision first down line
Nice explanation on www.howstuffworks.com

Google Street View

Smart Cars
Mobileye: vision systems currently in high-end BMW, GM, Volvo models

Assisted Driving
Pedestrian and car detection
Lane detection
Collision warning systems with adaptive cruise control
Lane departure warning systems
Rear object detection systems

Google Autonomous Car The U.S. state of Nevada passed a law in June 2011 concerning the operation of driverless cars in Nevada. The Google Driverless Car combines information gathered from Google Street View, video cameras inside the car, a LIDAR sensor on top of the vehicle, radar sensors on the front of the vehicle and a position sensor attached to one of the rear wheels.

Vision-based Interaction
Nintendo Wii has camera-based IR tracking built in.
Control games with your own body motions and gestures, and create immersive experiences by compositing a 3D image of the player into the game scene.

Vision in Space
NASA's Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.
Vision systems (JPL) used for several tasks:
Panorama stitching
3D terrain modeling
Obstacle detection, position tracking

Robotics
NASA's Mars Spirit Rover: http://en.wikipedia.org/wiki/spirit_rover
DARPA's Robotics Challenge: http://www.darpa.mil/our_work/tto/programs/darpa_robotics_challenge.aspx

Medical Imaging
3D imaging: MRI, CT
Image guided surgery: Grimson et al., MIT

Augmented Reality
AR allows the user to see the real world, with virtual objects superimposed upon or composited with the real world. Therefore, AR supplements reality, rather than completely replacing it.
Google's Project Glass is a research and development program to develop an augmented reality head-mounted display (HMD).
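The idea of compositing virtual objects with the real world can be sketched as per-pixel alpha blending: out = alpha * virtual + (1 - alpha) * real. This pure-Python sketch assumes tiny RGB images stored as nested lists and a hypothetical per-pixel alpha mask. A real AR system also has to track the camera pose to decide where the virtual object belongs, which is omitted here.

```python
def composite(real, virtual, alpha):
    """Blend a virtual image over a real one, pixel by pixel.

    real, virtual: images as lists of rows of (R, G, B) tuples (0-255).
    alpha: mask as a list of rows of floats in [0, 1]; 1 shows only the
    virtual pixel and 0 shows only the real pixel.
    All three inputs are assumed to have the same dimensions.
    """
    out = []
    for real_row, virtual_row, alpha_row in zip(real, virtual, alpha):
        out_row = []
        for real_px, virtual_px, a in zip(real_row, virtual_row, alpha_row):
            blended = tuple(
                round(a * v + (1.0 - a) * r)
                for r, v in zip(real_px, virtual_px)
            )
            out_row.append(blended)
        out.append(out_row)
    return out


if __name__ == "__main__":
    # Hypothetical 1 x 3 images: a gray camera frame and a red virtual object.
    real = [[(100, 100, 100)] * 3]
    virtual = [[(200, 0, 0)] * 3]
    alpha = [[0.0, 0.5, 1.0]]  # background, soft edge, fully virtual
    print(composite(real, virtual, alpha))
```

Where the mask is 0 the camera image passes through unchanged, which matches the slide's point that AR supplements reality rather than replacing it.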

Virtual Dressing Room

Things to Do
Read Chap. 1 (Szeliski)
Next classes:
Introduction to MATLAB programming
Image formation (Chap. 2, Szeliski)