Today I t n d ro ucti tion to computer vision Course overview Course requirements

Similar documents
COMP 776: Computer Vision

Introduction. Visual data acquisition devices. The goal of computer vision. The goal of computer vision. Vision as measurement device

Introduction. BIL719 Computer Vision Pinar Duygulu Hacettepe University

CSE 408 Multimedia Information System

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2014

Spring 2018 CS543 / ECE549 Computer Vision. Course webpage URL:

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2015

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2018

CENG 595 Selected Topics in Computer Engineering Computer Vision. Zafer ARICAN, PhD

Introduction to Computer Vision

CSE 455: Computer Vision

CS6550 Computer Vision

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu

Computer Vision Lecture 1

COMP 9517 Computer Vision. Introduc<on

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Image Analysis & Searching

Recognition problems. Object Recognition. Readings. What is recognition?

Lecture 19: Depth Cameras. Kayvon Fatahalian CMU : Graphics and Imaging Architectures (Fall 2011)

CS 131 Lecture 1: Course introduction

Computer Vision. Thursday, August 30

OBJECTIVE OF THE BOOK ORGANIZATION OF THE BOOK

DIGITAL IMAGE PROCESSING

Computer Vision for HCI. Introduction. Machines That See? Science fiction. HAL, Terminator, Star Wars, I-Robot, etc.

High Level Computer Vision. Introduction - April 16, Bernt Schiele & Mario Fritz MPI Informatics and Saarland University, Saarbrücken, Germany

Chapter 12 Image Processing

Digital image processing vs. computer vision Higher-level anchoring

Today. CS 395T Visual Recognition. Course content. Administration. Expectations. Paper reviews

Computer vision, wearable computing and the future of transportation

CS6670: Computer Vision

Video Registration: Key Challenges. Richard Szeliski Microsoft Research

Book Cover Recognition Project

Introduction. Ioannis Rekleitis

CS 534: Computer Vision

Occlusion. Atmospheric Perspective. Height in the Field of View. Seeing Depth The Cue Approach. Monocular/Pictorial

Manipulation. Manipulation. Better Vision through Manipulation. Giorgio Metta Paul Fitzpatrick. Humanoid Robotics Group.

Practical Image and Video Processing Using MATLAB

CSE 527: Introduction to Computer Vision

Computer Vision Slides curtesy of Professor Gregory Dudek

Image stitching. Image stitching. Video summarization. Applications of image stitching. Stitching = alignment + blending. geometrical registration

Computer Vision Introduction or

(15-862): Computational Photography

CSE Tue 10/09. Nadir Weibel

Light-Field Database Creation and Depth Estimation

Transportation Informatics Group, ALPEN-ADRIA University of Klagenfurt. Transportation Informatics Group University of Klagenfurt 3/10/2009 1

CSCE 763: Digital Image Processing

CPSC 425: Computer Vision

Ant? Bird? Dog? Human -SURE

(15-862): Computational Photography

Telling What-Is-What in Video. Gerard Medioni

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

Curriculum Vitae. Computer Vision, Image Processing, Biometrics. Computer Vision, Vision Rehabilitation, Vision Science

- Basics of informatics - Computer network - Software engineering - Intelligent media processing - Human interface. Professor. Professor.

Detection and Tracking of the Vanishing Point on a Horizon for Automotive Applications

Image Processing Based Vehicle Detection And Tracking System

6.869 Advances in Computer Vision Spring 2010, A. Torralba

Recommended Text. Logistics. Course Logistics. Intelligent Robotic Systems

Colour correction for panoramic imaging

ELE 882: Introduction to Digital Image Processing (DIP)

ROAD TO THE BEST ALPR IMAGES

Image Filtering in Spatial domain. Computer Vision Jia-Bin Huang, Virginia Tech

Intelligent Identification System Research

Panoramic Vision System for an Intelligent Vehicle using. a Laser Sensor and Cameras

Machine Vision for the Life Sciences

Implementation of License Plate Recognition System in ARM Cortex A8 Board

Recognition Of Vehicle Number Plate Using MATLAB

A NEW NEUROMORPHIC STRATEGY FOR THE FUTURE OF VISION FOR MACHINES June Xavier Lagorce Head of Computer Vision & Systems

CIS 849: Autonomous Robot Vision

ARTIFICIAL INTELLIGENCE - ROBOTICS

Driver Assistance for "Keeping Hands on the Wheel and Eyes on the Road"

Exercise questions for Machine vision

Aerial photography: Principles. Frame capture sensors: Analog film and digital cameras

Image formation - Cameras. Grading & Project. About the course. Tentative Schedule. Course Content. Students introduction

CS6670: Computer Vision Noah Snavely. Administrivia. Administrivia. Reading. Last time: Convolution. Last time: Cross correlation 9/8/2009

Prof. Feng Liu. Winter /10/2019

CSCI 1290: Comp Photo

Computer Vision Lesson Plan

Computational and Biological Vision

FOCAL LENGTH CHANGE COMPENSATION FOR MONOCULAR SLAM

(15-862): Computational Photography

MAV-ID card processing using camera images

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices

CymbIoT Visual Analytics

Introduction to Computer Vision

Stereo-based Hand Gesture Tracking and Recognition in Immersive Stereoscopic Displays. Habib Abi-Rached Thursday 17 February 2005.

Insights into High-level Visual Perception

MATLAB 및 Simulink 를이용한운전자지원시스템개발

ASPECT RATIO. Aspect ratio is the relationship of the width of a picture or sensor in relation to the height.

Next Classes. Spatial frequency Fourier transform and frequency domain. Reminder: Textbook. Frequency view of filtering Hybrid images Sampling

Choosing the Optimum Mix of Sensors for Driver Assistance and Autonomous Vehicles

COMP371 COMPUTER GRAPHICS SESSION 1 COURSE OVERVIEW - SYLLABUS

CSE 473/573 Computer Vision and Image Processing (CVIP)

06: Thinking in Frequencies. CS 5840: Computer Vision Instructor: Jonathan Ventura

Applications of Flash and No-Flash Image Pairs in Mobile Phone Photography

(15-862): Computational Photography

COS Lecture 1 Autonomous Robot Navigation

Perception platform and fusion modules results. Angelos Amditis - ICCS and Lali Ghosh - DEL interactive final event

Unit 12: Artificial Intelligence CS 101, Fall 2018

APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE

Virtual Worlds for the Perception and Control of Self-Driving Vehicles

Transcription:

COMP 776: Computer Vision

Today Introduction ti to computer vision i Course overview Course requirements

The goal of computer vision To extract t meaning from pixels What we see What a computer sees Source: S. Narasimhan

The goal of computer vision To extract t meaning from pixels Humans are remarkably good at this Source: 80 million tiny images by Torralba et al.

What kind of information can be extracted from an image? Metric 3D information Semantic information

Vision as measurement device Real-time stereo Structure from motion Reconstruction from Internet photo collections NASA Mars Rover Pollefeys et al. Goesele et al.

Vision as a source of semantic information slide credit: Fei-Fei, Fergus & Torralba

Object categorization sky building flag banner bus face street lamp bus wall cars slide credit: Fei-Fei, Fergus & Torralba

Scene and context categorization outdoor city traffic slide credit: Fei-Fei, Fergus & Torralba

Qualitative spatial information slanted non-rigid moving object vertical rigid moving object horizontal rigid moving object slide credit: Fei-Fei, Fergus & Torralba

Why study computer vision? Vision is useful: Images and video are everywhere! Personal photo albums Movies, news, sports Surveillance and security Medical and scientific images

Why study computer vision? Vision is useful Vision is interesting Vision is difficult Half of primate cerebral cortex is devoted to visual processing Achieving human-level visual perception is probably AI-complete

Why is computer vision difficult?

Challenges: viewpoint variation Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

Challenges: illumination image credit: J. Koenderink

Challenges: scale slide credit: Fei-Fei, Fergus & Torralba

Challenges: deformation Xu, Beihong 1943 slide credit: Fei-Fei, Fergus & Torralba

Challenges: occlusion Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

Challenges: background clutter

Challenges: Motion

Challenges: object intra-class variation slide credit: Fei-Fei, Fergus & Torralba

Challenges: local ambiguity slide credit: Fei-Fei, Fergus & Torralba

Challenges: local ambiguity Source: Rob Fergus and Antonio Torralba

Challenges: local ambiguity Source: Rob Fergus and Antonio Torralba

Challenges or opportunities? Images are confusing, but they also reveal the structure t of the world through numerous cues Our job is to interpret the cues! Image source: J. Koenderink

Depth cues: Linear perspective

Depth cues: Aerial perspective

Depth ordering cues: Occlusion Source: J. Koenderink

Shape cues: Texture gradient

Shape and lighting cues: Shading Source: J. Koenderink

Position and lighting cues: Cast shadows Source: J. Koenderink

Grouping cues: Similarity (color, texture, proximity)

Grouping cues: Common fate Image credit: Arthus-Bertrand (via F. Durand)

Inherent ambiguity of the problem M diff t 3D ld h i i t Many different 3D scenes could have given rise to a particular 2D picture

Inherent ambiguity of the problem Many different 3D scenes could have given rise to a particular 2D picture Possible solutions Bring in more constraints (more images) Use prior knowledge about the structure of the world Need a combination of geometric and statistical methods

Connections to other disciplines Artificial Intelligence Robotics Machine Learning Computer Vision Computer Graphics Cognitive science Neuroscience Image Processing

Origins of computer vision L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963.

Successes of computer vision to date

Optical character recognition (OCR) Digit recognition yann.lecun.com License plate readers http://en.wikipedia.org/wiki/automatic_number_plate_recognition Sudoku grabber http://sudokugrab.blogspot.com/ Automatic check processing Source: S. Seitz, N. Snavely

Biometrics Fingerprint scanners on many new laptops, other devices Face recognition systems now beginning to appear more widely http://www.sensiblevision.com/ i i / Source: S. Seitz

Biometrics How the Afghan Girl was Identified by Her Iris Patterns Source: S. Seitz

Mobile visual search: Google Goggles

Face detection Many new digital cameras now detect faces Canon, Sony, Fuji, Source: S. Seitz

Smile detection Sony Cyber-shot T70 Digital Still Camera Source: S. Seitz

Face recognition: Apple iphoto software http://www.apple.com/ilife/iphoto/

Automotive safety Mobileye: Vision systems in high-end BMW, GM, Volvo models Pedestrian collision warning Forward collision warning Lane departure warning Headway monitoring and warning Source: A. Shashua, S. Seitz

Vision-based interaction: Xbox Kinect http://blogs.howstuffworks.com/2010/11/05/how-microsoft- kinect-works-an-amazing-use-of-infrared-light/ http://www.xbox.com/en-us/live/engineeringblog/122910- HowYouBecometheController http://electronics.howstuffworks.com/microsoft-kinect.htm http://www.ismashphone.com/2010/12/kinect-hacks-moreinteresting-than-the-devices-original-intention.html

Special effects: shape and motion capture Source: S. Seitz

3D visualization: Microsoft Photosynth http://photosynth.net Source: S. Seitz

Vision for robotics, space exploration NASA'S Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007. Vision systems (JPL) used for several tasks Panorama stitching 3D terrain modeling Obstacle detection, position tracking For more, read Computer Vision on Mars by Matthies et al. Source: S. Seitz

The computer vision industry Ali list of companies here: http://www.cs.ubc.ca/spider/lowe/vision.html p

Basic Info Instructor: Svetlana Lazebnik (lazebnik@cs.unc.edu) edu) Office hours: By appointment, FB 244 Class webpage: http://www.cs.unc.edu/~lazebnik/spring11 Textbooks (suggested): Forsyth & Ponce, Computer Vision: i A Modern Approach Richard Szeliski, Computer Vision: Algorithms and Applications (available online)

Course requirements Philosophy: computer vision is best experienced hands-on Programming assignments: 50% About four assignments Expect the first one in the next couple of classes Brush up on your MATLAB skills (see web page for tutorial) Final assignment: 30% Recognition competition Winner gets a prize! Participation: 20% Come to class regularly Ask questions Answer questions

Collaboration policy Feel free to discuss assignments with each other, but coding must be done individually Feel free to incorporate code or tips you find on the Web, provided this doesn t make the assignment trivial and you explicitly acknowledge your sources Remember: I can Google too (and I have the copies of g ( p everybody s assignments from the last three years this class was offered)

Course overview I. Early vision: i Image formation and processing II. Mid-level vision: Grouping and fitting III. Multi-view geometry IV. Recognition V. Advanced topics

I. Early vision Basic image formation and processing * = Cameras and sensors Light and color Linear filtering Edge detection Feature extraction: corner and blob detection

Fitting and grouping II. Mid-level vision Alignment Fitting: Least squares Hough transform RANSAC

III. Multi-view geometry Stereo Epipolar geometry Tomasi & Kanade (1993) Affine structure from motion Projective structure from motion

IV. Recognition Patch description and matching Clustering and visual vocabularies Bag-of-features features models Classification Sources: D. Lowe, L. Fei-Fei

V. Advanced Topics Time permitting Segmentation Face detection Articulated models Motion and tracking