High Level Computer Vision. Introduction - April 16, Bernt Schiele & Mario Fritz MPI Informatics and Saarland University, Saarbrücken, Germany

Similar documents
Computer Vision Lecture 1

Today I t n d ro ucti tion to computer vision Course overview Course requirements

Introduction. Visual data acquisition devices. The goal of computer vision. The goal of computer vision. Vision as measurement device

Spring 2018 CS543 / ECE549 Computer Vision. Course webpage URL:

Color Space 1: RGB Color Space. Color Space 2: HSV. RGB Cube Easy for devices But not perceptual Where do the grays live? Where is hue and saturation?

CSCE 763: Digital Image Processing

Today. CS 395T Visual Recognition. Course content. Administration. Expectations. Paper reviews

CSE 408 Multimedia Information System

COMP 776: Computer Vision

Introduction. BIL719 Computer Vision Pinar Duygulu Hacettepe University

Image Processing Based Vehicle Detection And Tracking System

Computer Vision. Howie Choset Introduction to Robotics

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

PERCEIVING MOVEMENT. Ways to create movement

Digital image processing vs. computer vision Higher-level anchoring

Computer Vision Lesson Plan

Prof. Riyadh Al_Azzawi F.R.C.Psych

Automatics Vehicle License Plate Recognition using MATLAB

Telling What-Is-What in Video. Gerard Medioni

ECC419 IMAGE PROCESSING

Last Lecture. photomatix.com

P rcep e t p i t on n a s a s u n u c n ons n c s ious u s i nf n e f renc n e L ctur u e 4 : Recogni n t i io i n

Computational Vision and Picture. Plan. Computational Vision and Picture. Distal vs. proximal stimulus. Vision as an inverse problem

Computational Approaches to Cameras

Last Lecture. photomatix.com

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2014

Perception. What We Will Cover in This Section. Perception. How we interpret the information our senses receive. Overview Perception

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2018

CS 376b Computer Vision

Introduction. Ioannis Rekleitis

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

Digital Image Processing. Lecture 1 (Introduction) Bu-Ali Sina University Computer Engineering Dep. Fall 2011

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2015

An Improved Bernsen Algorithm Approaches For License Plate Recognition

Module 2. Lecture-1. Understanding basic principles of perception including depth and its representation.

Computer Vision Slides curtesy of Professor Gregory Dudek

Practical Image and Video Processing Using MATLAB

6.098 Digital and Computational Photography Advanced Computational Photography. Bill Freeman Frédo Durand MIT - EECS

ELEC Dr Reji Mathew Electrical Engineering UNSW

02/02/10. Image Filtering. Computer Vision CS 543 / ECE 549 University of Illinois. Derek Hoiem

COPYRIGHTED MATERIAL. Overview

Image Extraction using Image Mining Technique

COPYRIGHTED MATERIAL OVERVIEW 1

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

CS6670: Computer Vision

Chapter 17. Shape-Based Operations

Lane Detection in Automotive

Computer Vision. Thursday, August 30

Computational Photography and Video. Prof. Marc Pollefeys

Techniques. Introduction to Drawing Final Exam Study Guide

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

MICA at ImageClef 2013 Plant Identification Task

Super resolution with Epitomes

Cameras. CSE 455, Winter 2010 January 25, 2010

Object Perception. 23 August PSY Object & Scene 1

Vision. Definition. Sensing of objects by the light reflected off the objects into our eyes

Image Analysis & Searching

Human Vision and Human-Computer Interaction. Much content from Jeff Johnson, UI Wizards, Inc.

Computational and Biological Vision

Vision Review: Image Processing. Course web page:

Ant? Bird? Dog? Human -SURE

DIGITAL IMAGE PROCESSING

Geog183: Cartographic Design and Geovisualization Spring Quarter 2018 Lecture 2: The human vision system

Frequencies and Color

Lecture 8. Human Information Processing (1) CENG 412-Human Factors in Engineering May

Carmen Alonso Montes 23rd-27th November 2015

Image Filtering in Spatial domain. Computer Vision Jia-Bin Huang, Virginia Tech

EC-433 Digital Image Processing

3D Interaction using Hand Motion Tracking. Srinath Sridhar Antti Oulasvirta

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

Chapter 12 Image Processing

PRACTICAL IMAGE AND VIDEO PROCESSING USING MATLAB

License Plate Localisation based on Morphological Operations

What you see is not what you get. Grade Level: 3-12 Presentation time: minutes, depending on which activities are chosen

Image processing for gesture recognition: from theory to practice. Michela Goffredo University Roma TRE

Physics 2310 Lab #5: Thin Lenses and Concave Mirrors Dr. Michael Pierce (Univ. of Wyoming)

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

Machine Learning for Intelligent Transportation Systems

Content Based Image Retrieval Using Color Histogram

Prof Trivedi ECE253A Notes for Students only

Image Processing & Projective geometry

Unit IV: Sensation & Perception. Module 19 Vision Organization & Interpretation

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL

CPSC 425: Computer Vision

Morphological Image Processing Approach of Vehicle Detection for Real-Time Traffic Analysis

Lecture # 01. Introduction

CSE 473/573 Computer Vision and Image Processing (CVIP)

FACE DETECTION. Sahar Noor Abdal ID: Mashook Mujib Chowdhury ID:

Fixing the Gaussian Blur : the Bilateral Filter

Midterm Examination CS 534: Computational Photography

Computer Vision. The Pinhole Camera Model

Main Subject Detection of Image by Cropping Specific Sharp Area

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition

the dimensionality of the world Travelling through Space and Time Learning Outcomes Johannes M. Zanker

CSC321 Lecture 11: Convolutional Networks

IV: Visual Organization and Interpretation

Human Vision. Human Vision - Perception

Haptics CS327A

An Automatic System for Detecting the Vehicle Registration Plate from Video in Foggy and Rainy Environments using Restoration Technique

Teaching Scheme. Credits Assigned (hrs/week) Theory Practical Tutorial Theory Oral & Tutorial Total

Transcription:

Perceptual and Sensory Augmented Computing High Level Computer Vision Introduction - April 16, 2014 MPI Informatics and Saarland University, Saarbrücken, Germany http://www.d2.mpi-inf.mpg.de/cv

Computer Vision Lecturer: Bernt Schiele (schiele@mpi-inf.mpg.de) Mario Fritz (mfritz@mpi-inf.mpg.de) Assistants: Fabio Galasso (galasso@mpi-inf.mpg.de) Zeynep Akata (akata@mpi-inf.mpg.de) Language: English Webpage: http://www.d2.mpi-inf.mpg.de/cv mailing list for announcements etc. use link on webpage to enroll in mailing list High Level Computer Vision - April 16, 2o14 2

Lecture & Exercise Officially: 2V (lecture) + 2Ü (exercise) Lecture: Wed: 14:15-16:00 (room 024) Exercise: Thu: 12:15-14:00 (room 024) typically 1 exercise sheet every 1-2 weeks part of the final grade pencil and paper, as well as matlab-based exercise, reading assignment (research papers, overview papers, etc.) & larger project at end of lecture we/you propose project, mentoring, final presentation Exam planned as oral exam after the SS - there will be proposed dates High Level Computer Vision - April 16, 2o14 3

Material For part of the lecture: http://szeliski.org/book/ available online High Level Computer Vision - April 16, 2o14 4

Why Study Computer Vision Science Foundations of perception. How do WE see? computer vision to explore computational model of human vision High Level Computer Vision - April 16, 2o14 5

Why Study Computer Vision Science Foundations of perception. How do WE see? computer vision to explore computational model of human vision Engineering How do we build systems that perceive the world computer vision to solve real-world problems: cars to detect pedestrians High Level Computer Vision - April 16, 2o14 6

Why Study Computer Vision Science Foundations of perception. How do WE see? computer vision to explore computational model of human vision Engineering How do we build systems that perceive the world computer vision to solve real-world problems: cars to detect pedestrians Applications medical imaging (computer vision to support medical diagnosis, visualization) surveillance (to follow/track people at the airport, train-station,...) entertainment (vision-based interfaces for games) graphics (image-based rendering, vision to support realistic graphics) car-industry (lane-keeping, pre-crash intervention, ) High Level Computer Vision - April 16, 2o14 7

Some Applications License Plate Recognition London Congestion Charge http://www.cclondon.com/ imagingandcameras.html http://en.wikipedia.org/wiki/ London_congestion_charge Surveillance Face Recognition Airport Security (People Tracking) Medical Imaging (Semi-)automatic segmentation and measurements Robotics Driver assistance High Level Computer Vision - April 16, 2o14 8

More Applications Microsoft High Level Computer Vision - April 16, 2o14 9

Goals of today s lecture First intuitions about What is computer vision? What does it mean to see and how do we (as humans) do it? How can we make this computational? Applications & Appetizers 2 case studies: Recovery of 3D structure - slides taken from Michael Black @ Brown University / MPI Intelligent Systems Object Recognition - intuition from human vision... High Level Computer Vision - April 16, 2o14 10

Perceptual and Sensory Augmented Computing Applications & Appetizers... work from our group

Detection & Recognition of Visual Categories Challenges: multi-scale multi-view multi-class varying illumination occlusion cluttered background articulation high intraclass variance low interclass variance High Level Computer Vision - April 16, 2o14 12

Challenges of Visual Categorization low inter-class variation high intra-class variation High Level Computer Vision - April 16, 2o14 13

Sample Category: Motorbikes High Level Computer Vision - April 16, 2o14 14

Basic Idea global I know where the Eiffel Tower is local High Level Computer Vision - April 16, 2o14 15

Large Scale Object Class Recognition Learning Shape Models from 3D CAD Data 3D Computer Aided Design (CAD) Models for - computer graphics, game design - polygonal meshes + texture descriptions - semantic part annotations (may) exist Learning Object Class Model directly from 3D CAD-data: Michael Stark High Level Computer Vision - April 16, 2o14 16

Video... High Level Computer Vision - April 16, 2o14 18

Articulation Model Assume uniform position prior for the whole body Learn the conditional relation between part position and body center from data: 400 annotated training images High Level Computer Vision - April 16, 2o14 19

Modeling Body Dynamics Visualization of the hierarchical Gaussian process latent variable model (hgplvm) High Level Computer Vision - April 16, 2o14 21

High Level Computer Vision - April 16, 2o14 23

High Level Computer Vision - April 16, 2o14 24

Complete 3D Scene Modeling Goal: Infer consistent 3D world hypothesis from 2D image sequences with a moving monocular camera Tracking 3D Scene Model Integrate SoA object detectors, scene labeling Efficiently leverage domain knowledge [Wojek et.al.@eccv10] High Level Computer Vision - April 16, 2o14 29

[Wojek et.al.@eccv10] System sample video (pedestrians) ETH-Loewenplatz sequence: By courtesy of ETH Zürich [Ess et al., PAMI 09] High Level Computer Vision - April 16, 2o14 31

[Wojek et.al.@eccv10] System sample video (vehicles) High Level Computer Vision - April 16, 2o14 32

Sequential Model Update for Scene Labeling (Fritz,Levinkov) High Level Computer Vision - April 16, 2o14 33

Sequential Model Update for Scene Labeling (Fritz,Levinkov) High Level Computer Vision - April 16, 2o14 34

Perception for Manipulation High Level Computer Vision - April 16, 2o14 35

Perception for Manipulation High Level Computer Vision - April 16, 2o14 36

Multi-Class Video Co-Segmentation (Fritz, Chiu) High Level Computer Vision - April 16, 2o14 37

Multi-Class Video Co-Segmentation (Fritz, Chiu) High Level Computer Vision - April 16, 2o14 38

Efficient Object Detection with Shared Representations High Level Computer Vision 2014 39

Perceptual and Sensory Augmented Computing Basic Concepts and Terminology Computer Vision vs. Computer Graphics

Pinhole Camera (Model) (simple) standard and abstract model today box with a small hole in it High Level Computer Vision - April 16, 2o14 50

Camera Obscura around 1519, Leonardo da Vinci (1452-1519) http://www.acmi.net.au/aic/camera_obscura.html when images of illuminated objects penetrate through a small hole into a very dark room you will see [on the opposite wall] these objects in their proper form and color, reduced in size in a reversed position owing to the intersection of the rays High Level Computer Vision - April 16, 2o14 51

Principle of pinhole......used by artists (e.g. Vermeer 17th century, dutch) and scientists High Level Computer Vision - April 16, 2o14 52

Digital Images Imaging Process: (pinhole) camera model digitizer to obtain digital image High Level Computer Vision - April 16, 2o14 53

(Grayscale) Image Goals of Computer Vision how can we recognize fruits from an array of (gray-scale) numbers? how can we perceive depth from an array of (gray-scale) numbers? Goals of Graphics how can we generate an array of (gray-scale) numbers that looks like fruits? how can we generate an array of (gray-scale) numbers so that the human observer perceives depth? computer vision = the problem of inverse graphics? High Level Computer Vision - April 16, 2o14 54

Perceptual and Sensory Augmented Computing Visual Cues for Image Analysis... in art and visual illusions

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 57

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 58

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 59

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 60

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 61

1. Case Study: Human & Art - Recovery of 3D Structure High Level Computer Vision - April 16, 2o14 62

1. Case Study Computer Vision - Recovery of 3D Structure take all the cues of artists and turn them around exploit these cues to infer the structure of the world need mathematical and computational models of these cues sometimes called inverse graphics High Level Computer Vision - April 16, 2o14 63

A trompe l oeil depth-perception movement of ball stays the same location/trace of shadow changes High Level Computer Vision - April 16, 2o14 64

Another trompe l oeil illusory motion only shadows changes square is stationary High Level Computer Vision - April 16, 2o14 65

Color & Shading High Level Computer Vision - April 16, 2o14 66

Color & Shading High Level Computer Vision - April 16, 2o14 67

High Level Computer Vision - April 16, 2o14 68

High Level Computer Vision - April 16, 2o14 69

High Level Computer Vision - April 16, 2o14 70

High Level Computer Vision - April 16, 2o14 71

High Level Computer Vision - April 16, 2o14 72

Do you still think you see the world? High Level Computer Vision - April 16, 2o14 73

Do you still believe what you see? Experiment carefully point flash light into your eye from one corner don t hurt yourself! Observation you ll see your own blood vessels they are actually in front of the retina we ve adapted to their usual shadow High Level Computer Vision - April 16, 2o14 75

2. Case Study: Computer Vision & Object Recognition is it more than inverse graphics? how do you recognize the banana? the glass? the towel? how can we make computers to do this? ill posed problem: missing data ambiguities multiple possible explanations High Level Computer Vision - April 16, 2o14 76

Image Analysis vs. Synthesis from: Object Perception as Bayesian Inference Kersten 2003 High Level Computer Vision - April 16, 2o14 78

Complexity of Recognition High Level Computer Vision - April 16, 2o14 79

Complexity of Recognition High Level Computer Vision - April 16, 2o14 80

Complexity of Recognition High Level Computer Vision - April 16, 2o14 81

Complexity of Recognition High Level Computer Vision - April 16, 2o14 82

Complexity of Recognition High Level Computer Vision - April 16, 2o14 83

Recognition: the Role of Context Antonio Torralba High Level Computer Vision - April 16, 2o14 84

Recognition: the role of Prior Expectation Guiseppe Arcimboldo High Level Computer Vision - April 16, 2o14 85

Complexity of Recognition High Level Computer Vision - April 16, 2o14 86

Complexity of Recognition High Level Computer Vision - April 16, 2o14 87

One or Two Faces? High Level Computer Vision - April 16, 2o14 88

Class of Models: Pictorial Structure Fischler & Elschlager 1973 Model has two components parts (2D image fragments) structure (configuration of parts) High Level Computer Vision - April 16, 2o14 89

Deformations High Level Computer Vision - April 16, 2o14 90

Clutter High Level Computer Vision - April 16, 2o14 91

Example High Level Computer Vision - April 16, 2o14 92

Perceptual and Sensory Augmented Computing Recognition, Localization, and Segmentation a few terms let s briefly define what we mean by that

Object Recognition: First part of this Computer Vision class Different Types of Recognition Problems: Object Identification - recognize your pencil, your dog, your car Object Classification - recognize any pencil, any dog, any car - also called: generic object recognition, object categorization, Recognition and Segmentation: separate pixels belonging to the foreground (object) and the background Localization/Detection: position of the object in the scene, pose estimate (orientation, size/scale, 3D position) High Level Computer Vision - April 16, 2o14 94

Object Recognition: First part of this Computer Vision class Different Types of Recognition Problems: Object Identification - recognize your apple, your cup, your dog Object Classification - recognize any apple, any cup, any dog - also called: generic object recognition, object categorization, - typical definition: basic level category High Level Computer Vision - April 16, 2o14 95

Which Level is right for Object Classes? Basic-Level Categories the highest level at which category members have similar perceived shape the highest level at which a single mental image can reflect the entire category the highest level at which a person uses similar motor actions to interact with category members the level at which human subjects are usually fastest at identifying category members the first level named and understood by children (while the definition of basic-level categories depends on culture there exist a remarkable consistency across cultures...) Most recent work in object recognition has focused on this problem we will discuss several of the most successful methods in the lecture :-) High Level Computer Vision - April 16, 2o14 96

Object Recognition: First part of this Computer Vision class Recognition and Segmentation: separate pixels belonging to the foreground (object) and the background High Level Computer Vision - April 16, 2o14 97

Object Recognition: First part of this Computer Vision class Recognition and Localization: to position the object in the scene, estimate the object s pose (orientation, size/scale, 3D position) Example from David Lowe: High Level Computer Vision - April 16, 2o14 98

Localization: Example Video 1 High Level Computer Vision - April 16, 2o14 99

Localization: Example Video 2 High Level Computer Vision - April 16, 2o14 100

Object Recognition: First part of this Computer Vision class Different Types of Recognition Problems: Object Identification - recognize your pencil, your dog, your car Object Classification - recognize any pencil, any dog, any car - also called: generic object recognition, object categorization, Recognition and Segmentation: separate pixels belonging to the foreground (object) and the background Localization: position the object in the scene, estimate pose of the object (orientation, size/scale, 3D position) High Level Computer Vision - April 16, 2o14 101

Perceptual and Sensory Augmented Computing Basic Filtering

Computer Vision and Fundamental Components computer vision: reverse the imaging process 2D (2-dimensional) digital image processing pattern recognition / 3D image analysis image understanding High Level Computer Vision - April 16, 2o14 104

Digital Image Processing Some Basics (digital signal processing, FFT, ) Image Filtering - (taken from a class by Bill Freeman @MIT) Image Filtering take some local image patch (e.g. 3x3 block) image filtering: apply some function to local image patch High Level Computer Vision - April 16, 2o14 105

Image Filtering Some Examples: what assumptions are you making to infer the center value? Goals of Image Filtering: reduce noise fill-in missing values/information extract image features (e.g.edges/corners)... 3 or 4 High Level Computer Vision - April 16, 2o14 106

Image Filtering simplest case: linear filtering: replace each pixel by a linear combination of its neighbors the prescription for the linear combination is called the convolution kernel High Level Computer Vision - April 16, 2o14 107

2D signals and convolution Components of convolution : Image: - continuous: I(x,y) - discrete: I[k,l] or I k,l filter kernel : g[k,l] filtered image: f[m,n] 2D convolution (discrete): special case: convolution (discrete) of a 2D-image with a 1D-filter High Level Computer Vision - April 16, 2o14 108

Linear Filtering (warm-up slide) High Level Computer Vision - April 16, 2o14 109

Linear Filtering (warm-up slide) High Level Computer Vision - April 16, 2o14 110

Try it out in GIMP You can try out linear filter kernels in the free image manipulation tool GIMP - availble at gimp.org open image from the menu pick: Filters - Generic Convolution Matrix... enter filter kernel in Matrix press ok to apply High Level Computer Vision - April 16, 2o14 111

Linear Filtering High Level Computer Vision - April 16, 2o14 112

Linear Filtering High Level Computer Vision - April 16, 2o14 113

Linear Filtering High Level Computer Vision - April 16, 2o14 114

Blurring High Level Computer Vision - April 16, 2o14 115

Blurring Examples High Level Computer Vision - April 16, 2o14 116

Linear Filtering (warm-up slide) High Level Computer Vision - April 16, 2o14 117

Linear Filtering (warm-up slide) High Level Computer Vision - April 16, 2o14 118

Linear Filtering High Level Computer Vision - April 16, 2o14 119

(remember blurring) High Level Computer Vision - April 16, 2o14 120

Sharpening High Level Computer Vision - April 16, 2o14 121

Sharpening Example High Level Computer Vision - April 16, 2o14 122

Sharpening High Level Computer Vision - April 16, 2o14 123