Computer Vision Introduction

Similar documents
CS 534: Computer Vision

Introduction

Digital Image Processing and Machine Vision Fundamentals

ARTIFICIAL INTELLIGENCE - ROBOTICS

Choosing the Optimum Mix of Sensors for Driver Assistance and Autonomous Vehicles

Digital image processing vs. computer vision Higher-level anchoring

OBJECTIVE OF THE BOOK ORGANIZATION OF THE BOOK

Digital Image Processing

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

MATLAB DIGITAL IMAGE/SIGNAL PROCESSING TITLES

Paper on: Optical Camouflage

Computational and Biological Vision

Computer Vision for HCI. Introduction. Machines That See? Science fiction. HAL, Terminator, Star Wars, I-Robot, etc.

Lecture 4 Foundations and Cognitive Processes in Visual Perception From the Retina to the Visual Cortex

Embedding Artificial Intelligence into Our Lives

Humans used a web interface to say same person or different person for a large set of faces. Several computer programs made the same comparisons

Multiple Choice Questions Collecting, Storing & Sharing Information

What is AI? AI is the reproduction of human reasoning and intelligent behavior by computational methods. an attempt of. Intelligent behavior Computer

Digital Image Processing COSC 6380/4393

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2015

Introduction. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year

CS6550 Computer Vision

A NEW NEUROMORPHIC STRATEGY FOR THE FUTURE OF VISION FOR MACHINES June Xavier Lagorce Head of Computer Vision & Systems

Name: Date: Block: Light Unit Study Guide Matching Match the correct definition to each term. 1. Waves

Digital Image Processing. Lecture 1 (Introduction) Bu-Ali Sina University Computer Engineering Dep. Fall 2011

Object Perception. 23 August PSY Object & Scene 1

CS 131 Lecture 1: Course introduction

VIRTUAL REALITY Introduction. Emil M. Petriu SITE, University of Ottawa

EE631 Cooperating Autonomous Mobile Robots. Lecture 1: Introduction. Prof. Yi Guo ECE Department

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged

Introduction. Visual data acquisition devices. The goal of computer vision. The goal of computer vision. Vision as measurement device

Transportation Informatics Group, ALPEN-ADRIA University of Klagenfurt. Transportation Informatics Group University of Klagenfurt 3/10/2009 1

IMAGE PROCESSING PAPER PRESENTATION ON IMAGE PROCESSING

KÜNSTLICHE INTELLIGENZ JOBKILLER VON MORGEN?

Chapter 1 Virtual World Fundamentals

On the WEB. Digital Image Processing ECE 178. B. S. MANJUNATH RM 3157 ENGR I Tel:

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects

Keywords: Data Compression, Image Processing, Image Enhancement, Image Restoration, Image Rcognition.

Machine Vision Beyond the Factory. Jeff Burnstein President October 18, 2012 Beijing

ROBOTICS ENG YOUSEF A. SHATNAWI INTRODUCTION

R (2) Controlling System Application with hands by identifying movements through Camera

CS 730/830: Intro AI. Prof. Wheeler Ruml. TA Bence Cserna. Thinking inside the box. 5 handouts: course info, project info, schedule, slides, asst 1

Introduction to Computer Vision

Digital Image Processing ECE 178 Winter 2003

Digital Image Processing ECE 178 Winter On the WEB. Class list/discussion sessions. Today: Jan About this course.

Logic Programming. Dr. : Mohamed Mostafa

Perceptual Interfaces. Matthew Turk s (UCSB) and George G. Robertson s (Microsoft Research) slides on perceptual p interfaces

III: Vision. Objectives:

Global Image Sensor Market with Focus on Automotive CMOS Sensors: Industry Analysis & Outlook ( )

PERCEIVING SCENES. Visual Perception

TECHNOLOGY DEVELOPMENT AREAS IN AAWA

Introduction. BIL719 Computer Vision Pinar Duygulu Hacettepe University

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2018

Automotive Applications ofartificial Intelligence

Geo/SAT 2 INTRODUCTION TO REMOTE SENSING

Enhancing Shipboard Maintenance with Augmented Reality

Don t miss surprising. facts about the way we see

WHITE PAPER Need for Gesture Recognition. April 2014

Our focus is innovating security where you need it most. Smoother traffic flow - Better image quality - Higher efficiency

Fun with visual illusions. Professor William Ayliffe Gresham Professor of Physic

Vision. Definition. Sensing of objects by the light reflected off the objects into our eyes

Practical Image and Video Processing Using MATLAB

ITS Radiocommunications in Japan Progress report and future directions

Today I t n d ro ucti tion to computer vision Course overview Course requirements

Uses of Electromagnetic Waves

Chapter 9: Light, Colour and Radiant Energy. Passed a beam of white light through a prism.

Intro to AI. AI is a huge field. AI is a huge field 2/19/15. What is AI. One definition:

Automatics Vehicle License Plate Recognition using MATLAB

INTELLIGENT UNMANNED GROUND VEHICLES Autonomous Navigation Research at Carnegie Mellon

UNIT 2 TOPICS IN COMPUTER SCIENCE. Emerging Technologies and Society

Introduction. Ioannis Rekleitis

P1.4. Light has to go where it is needed: Future Light Based Driver Assistance Systems

Digital Image Processing COSC 6380/4393

Optical Illusions ONLINE EYE CHECKUP TEST LINKS:

Gesture Recognition with Real World Environment using Kinect: A Review

Imaging with hyperspectral sensors: the right design for your application

Introduction to AI. What is Artificial Intelligence?

Perception. Read: AIMA Chapter 24 & Chapter HW#8 due today. Vision

VSI Labs The Build Up of Automated Driving

Making Vehicles Smarter and Safer with Diode Laser-Based 3D Sensing

3D Virtual Training Systems Architecture

Situational Awareness A Missing DP Sensor output

ELECTROMAGNETIC SPECTRUM ELECTROMAGNETIC SPECTRUM

dr hab. Michał Strzelecki tel , room 216 cons. hours: Wednesday 14-15, Thursday P. Strumillo, M.

10.2 Color and Vision

Artificial Intelligence and Robotics Getting More Human

Electromagnetic Waves & the Electromagnetic Spectrum

Jeff Bezos, CEO and Founder Amazon

UNIT1. Keywords page 13-14

Short Course on Computational Illumination

& Medical Tourism. DIHTF - Dubai 20 th -21 st Feb 2018 V S Venkatesh -India

An Introduction to Geomatics. Prepared by: Dr. Maher A. El-Hallaq خاص بطلبة مساق مقدمة في علم. Associate Professor of Surveying IUG

The five senses of Artificial Intelligence

Minority Report Assignment

Machine Learning for Intelligent Transportation Systems

Lecture 8. Human Information Processing (1) CENG 412-Human Factors in Engineering May

Home Inspection Leak and Poor Insulation Detection

Introduction. Lighting

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

Transcription:

Computer Vision Introduction Ahmed Elgammal Dept of Computer Science Rutgers University Outlines Vision What and Why? Human vision Computer vision General computer vision applications Course Outlines Administrative A. Elgammal 1

What is vision? What does it mean to see? The plain man s answer (and Aristotle s too) would be, to know what is where by looking. In other words, vision is the process of discovering from images what is present in the world, and where it is David Marr, Vision 1982 What is vision? Recognize objects people we know things we own Locate objects in space to pick them up to navigate through them Track objects in motion catching a baseball avoiding collisions with cars on the road Recognize actions walking, running, pushing A. Elgammal 2

Vision is Deceivingly easy Deceptive Computationally demanding Critical to many applications Vision is deceivingly easy We see effortlessly seeing seems simpler than thinking we can all see but only select gifted people can solve hard problems like chess we use nearly 70% of our brains for visual perception! All creatures see frogs see birds see snakes see but they do not see alike A. Elgammal 3

Vision is deceptive Vision is an exceptionally strong sensation vision is immediate we perceive the visual world as external to ourselves, but it is a reconstruction within our brains we regard how we see as reflecting the world as it is; but human vision is subject to illusions quantitatively imprecise limited to a narrow range of frequencies of radiation passive Vision is deceptive Human vision is subject to illusions quantitatively imprecise Müller-Lyer illusion A. Elgammal 4

Zollner's illusion Delboenf's illusion A. Elgammal 5

More illusion Subjective contours More illusion Subjective contours Figure completion A. Elgammal 6

Necker cube: The human visual system picks an interpretation of each part that makes the whole consistent. A. Elgammal 7

More illusion Subjective contours Depth, reversibility, Figure completion More illusion Depth, reversibility, Do the cubes shift independently or as a unit A. Elgammal 8

More illusion The Hermann grid illusion: Illusory dark spots appear at all the intersections of the white stripes except the one on which you are currently fixated More illusion We can see impossible figures A. Elgammal 9

Spectral limitations of human vision We see only a small part of the energy spectrum of sunlight we don t see ultraviolet or lower frequencies of light we don t see infrared or higher frequencies of light we see less than.1% of the energy that reaches our eyes But objects in the world reflect and emit energy in these and other parts of the spectrum Non-human vision Infrared vision Polarization vision navigation for birds Ultrasound vision X-ray vision! RADAR vision A. Elgammal 10

Infrared vision Vision systems exist that can see reflected and emitted infrared light visual system of the pit viper infrared cameras used for night vision Why don t we see the infrared? we would see the blood flow through the capillaries in the eye Human vision is passive It relies on external energy sources (sunlight, light bulbs, fires) providing light that reflects off of objects to our eyes Vision systems can be active - carry their own energy sources Radars Bat acoustic imaging systems A. Elgammal 11

According to Marr: Vision is an information-processing task But not just a process Our brain must somehow be capable of representing this information. vision study not only the study of how to extract from images the various aspects of the world that are useful to us, but also an inquiry into the nature of the internal representations by which we capture this information and thus make it available as a basis for decisions about our thoughts and actions Representation + Processing if vision is an information-processing task, then I should be able to make my computer do it, provided that it has sufficient power, memory, and some way of being connected to a home television camera. We wants to know how to program vision. A. Elgammal 12

Computer Vision Understanding the content of images and videos Vision is deceivingly easy = Computer Vision is hard The M.I.T. summer vision program summer of 1965 point TV camera at stack of blocks locate individual blocks recognize them from small database of blocks describe physical structure of the scene support relationships Formally ended in 1985 The first great revelation was that the problems are difficult. Of course, these days this fact is a commonplace. But in the 1960s almost no one realized that machine vision was difficult. The field had to go through the same experience as the machine translation field did in its fiascoes of the 1950 s before it was at least realized that here were some problems that had to be taken seriously. D. Marr, Vision, 1982. A. Elgammal 13

Understanding and Recognition People draw distinctions between what is seen Object recognition This could mean is this a fish or a bicycle? It could mean is this George Washington? It could mean is this poisonous or not? It could mean is this slippery or not? It could mean will this support my weight? Great mystery How to build programs that can draw useful distinctions based on image properties Generic Object Recognition Variations in scale, orientation and visibility Variability within Specificity Object of interest has to be recognized in context of multiple other objects and cluttered background A. Elgammal 14

What are the problems in recognition? Which bits of image should be recognized together? Segmentation. How can objects be recognized without focusing on detail? Abstraction. How can objects with many free parameters be recognized? No popular name, but it s a crucial problem anyhow. How do we structure very large modelbases? again, no popular name; abstraction and learning come into this Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications building representations of the 3D world from pictures automated surveillance (who s doing what) movie post-processing face finding Various deep and attractive scientific mysteries how does object recognition work? Greater understanding of human vision A. Elgammal 15

2 D image 2 D image Image Processing 2 D image Computer Vision 3 D objects 3 D model, objects Computer Graphics 2 D image Related Fields: AI, pattern recognition, machine learning, signal processing, neural networks, cognitive vision. Critical to many applications in Manufacturing Communications Medicine Transportation Entertainment Agriculture Defense A. Elgammal 16

Manufacturing Visual inspection for quality control during the manufacture of parts in the automotive industry inspection of semiconductors Visual control of robots during assembly of parts from pieces during calibration of robot control systems Communications Smart document readers character recognition discrimination of text from graphics and images reading cursive script language recognition Virtual teleconferencing Virtual reality A. Elgammal 17

Medicine Diagnosis radiology - read X rays, CAT scans pathology - read biopsies Remote and tele-medicine Virtual reality surgical assistance project images onto head during brain surgery MRI CTI NMI USI Reprinted from Image and Vision Computing, v. 13, N. Ayache, Medical computer vision, virtual reality and robotics, Page 296, copyright, (1995), with permission from Elsevier Science A. Elgammal 18

Transportation Traffic safety and control detection and ticketing of speeding vehicles vehicle counting for flow control Robot drivers convoys Advanced automobiles autonomous parallel parking road sign detectors and driver alerts collision avoidance smart cruise control Pittsburgh to San Diego! Entertainment Acquisition of 3D computer models for graphical manipulation Control of animation through vision Indexing tools for video databases A. Elgammal 19

Detect ground plan in video and introduce pictures on them Images and videos from: SYMAH VISION, Easily Virtual www.symah-vision.fr Applications Tracking Baseball Pitches for Broadcast Television K Zone: system developed by Sportvision for ESPN. The system is used by ESPN for its Major League Baseball broadcast. The system draw a representation of the strike zone on the TV screen superimposed over the replayed broadcast video. The system would determine electronically whether the each pitch qualified as a strike or a ball. Copyright(C) 2001 Andre Gueziec and Sportvision LLC. All Rights Reserved. A. Elgammal 20

Agriculture Safety and quality inspection sorting by size - peaches sorting by shape - potatoes identifying defects - blemishes on fruit, rot in potatoes disease monitoring - chickens Robotic farming equipment robotic harvesters - apple pickers, orange pickers A. Elgammal 21

Defense Automatic target recognition systems cruise missiles air to surface smart missiles Reconnaissance monitoring strategic sites Simulation acquisition of terrain models from imagery model acquisition of buildings, roads, etc. Looking at People Human detection Human tracking Human recognition and biometrics Face recognition Gait recognition Iris, etc. Gesture recognition Facial expression recognition Activity recognition Duchenne de Boulogne, C.-B. (1862) The Mechanism of Human Facial Expression A. Elgammal 22

Applications Human Computer Interaction Keyboard and mouse are restrictive Driver Assistance, Autonomous driving Pedestrian detection Traffic signs detection/recognition Lane detection Occupant detection Motion Capture Video editing, archival and retrieval. Surveillance: security, safety, resource mangement Visual Surveillance Consider a visual surveillance system State of the art: archive huge volumes of video for eventual off-line human inspection Goal : Automatic understanding of events happening in the site. Efficient archiving Automatic Annotation Direct human attention Reduce bandwidth required for video transmission and storage. A. Elgammal 23

.. Introduction to Imaging and Multimedia Face Detection Motion Capture 2D Localization Detection and Tracking 3D localization Prediction Videos by: Dr. Thanarat Horprasert A. Elgammal 24