Recognition problems. Object Recognition. Readings. What is recognition?

Size: px

Start display at page:

Download "Recognition problems. Object Recognition. Readings. What is recognition?"

Preston Fleming
5 years ago
Views:

1 Recognition problems Object Recognition Computer Vision CSE576, Spring 2008 Richard Szeliski What is it? Object and scene recognition Who is it? Identity recognition Where is it? Object detection What are they doing? Activities All of these are classification problems Choose one class from a list of possible candidates CSE 576, Spring 2008 Object recognition 2 What is recognition? A different taxonomy from [Csurka et al. 2006]: Recognition Where is this particular object? Categorization What kind of object(s) is(are) present? Content-based image retrieval Find me something that looks similar Detection Locate all instances of a given class CSE 576, Spring 2008 Object recognition 3 Readings Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition Fergus, R., Perona, P. and Zisserman, A. International Journal of Computer Vision, Vol. 71(3), , March 2007 CSE 576, Spring 2008 Object recognition 4

Sources Steve Seitz, CSE 455/576, previous quarters Fei-Fei, Fergus, Torralba, CVPR 2007 course Efros, CMU 16-721 Learning in Vision Freeman, MIT 6.

2 Sources Steve Seitz, CSE 455/576, previous quarters Fei-Fei, Fergus, Torralba, CVPR 2007 course Efros, CMU Learning in Vision Freeman, MIT Computer Vision: Learning Linda Shapiro, CSE 576, Spring 2007 CSE 576, Spring 2008 Object recognition 5 CSE 576, Spring 2008 Object recognition 6 CVPR 2007 Minneapolis, Short Course, June 17 Recognizing and Learning Object Categories: Year 2007 Li Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT (see other slide deck) CSE 576, Spring 2008 Object recognition 8

Single object recognition Single object recognition Lowe, et al.

2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 CSE

recognition 10 Planar object recognition [Lowe] Use SIFT features Verify

[Lowe] Use SIFT features Verify affine (or homography) geometric

3 Single object recognition Single object recognition Lowe, et al. 1999, 2003 Mahamud and Herbert, 2000 Ferrari, Tuytelaars, and Van Gool, 2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 CSE 576, Spring 2008 Object recognition 9 CSE 576, Spring 2008 Object recognition 10 Planar object recognition [Lowe] Use SIFT features Verify affine (or homography) geometric alignment Planar object recognition [Lowe] Use SIFT features Verify affine (or homography) geometric alignment CSE 576, Spring 2008 Object recognition 11 CSE 576, Spring 2008 Object recognition 12

3D object recognition [Lowe] Extract object outlines with background subtraction 3D object recognition [Lowe]

Object recognition 13 CSE 576, Spring 2008 Object recognition 14 Feature-based recognition How can we scale to

Answer: quantize features into words [Csurka et al.

4 3D object recognition [Lowe] Extract object outlines with background subtraction 3D object recognition [Lowe] Use 3 matches to recognize Use additional matches for verification Tolerant to occlusions CSE 576, Spring 2008 Object recognition 13 CSE 576, Spring 2008 Object recognition 14 Feature-based recognition How can we scale to millions of objects? Comparison to all stored objects/features is infeasible. Answer: quantize features into words [Csurka et al. 04] use information retrieval (inverted index) use metric tree for faster quantization (NN) [Nister & Stewenius 05] CSE 576, Spring 2008 Object recognition 15 CSE 576, Spring 2008 Object recognition 16

CVPR 2007 Minneapolis, Short Course, June 17 (see other slide deck) Part 1: Bag-of-words models by Li Fei-Fei (Princeton) CSE 576, Spring 2008 Object recognition 18 How to scale to 10 6 s of images?

5 CVPR 2007 Minneapolis, Short Course, June 17 (see other slide deck) Part 1: Bag-of-words models by Li Fei-Fei (Princeton) CSE 576, Spring 2008 Object recognition 18 How to scale to 10 6 s of images? Make word generation even more efficient: Vocabulary tree Scalable Recognition with a Vocabulary Tree David Nistér, Henrik Stewénius CSE 576, Spring 2008 Object recognition 19 CSE 576, Spring 2008 Object recognition 20

6 Vocabulary Tree CSE 576, Spring 2008 Object recognition 21 CSE 576, Spring 2008 Object recognition 22 Performance CSE 576, Spring 2008 Object recognition 23 CSE 576, Spring 2008 Object recognition 24

7 Location Recognition Can we apply this to recognizing your location from a cell-phone photo? City-Scale Location Recognition Grant Schindler, Matthew Brown, and Richard Szeliski CVPR 2007 CSE 576, Spring 2008 Object recognition 25 The Problem Main idea Find N-best matches in vocabulary tree CSE 576, Spring 2008 Object recognition 27 CSE 576, Spring 2008 Object recognition 28

Other ideas Use only informative features (ignore trees ) Integrate matches with adjacent (streetside) neighbors CSE 576, Spring 2008 Object recognition 29 CSE 576, Spring 2008

8 Other ideas Use only informative features (ignore trees ) Integrate matches with adjacent (streetside) neighbors CSE 576, Spring 2008 Object recognition 29 CSE 576, Spring 2008 Object recognition 30 CVPR 2007 Minneapolis, Short Course, June 17 (see other slide deck) Part 2: part-based models by Rob Fergus (MIT) CSE 576, Spring 2008 Object recognition 32

CVPR 2007 Minneapolis, Short Course, June 17 Aim Given

Category Model Segmentation Part 4: Combined

Image Segmentation should (ideally) be shaped like the

cow-like obtained efficiently in an unsupervised manner

Spring 2008 Object recognition 34 Slide from Kumar 05

Points Matched Codebook Entries Probabilistic Voting

Space (continuous) Refined Hypotheses (uniform

Maxima CSE 576, Spring 2008 Object recognition 35

9 CVPR 2007 Minneapolis, Short Course, June 17 Aim Given an image and object category, segment the object Object Category Model Segmentation Part 4: Combined segmentation and recognition by Rob Fergus (MIT) Cow Image Segmentation should (ideally) be shaped like the object e.g. cow-like obtained efficiently in an unsupervised manner able to handle self-occlusion Segmented Cow CSE 576, Spring 2008 Object recognition 34 Slide from Kumar 05 Implicit Shape Model - Liebe and Schiele, 2003 Interest Points Matched Codebook Entries Probabilistic Voting Other topics: context (scenes) Segmentation Voting Space (continuous) Refined Hypotheses (uniform sampling) Backprojected Hypotheses Backprojection of Maxima CSE 576, Spring 2008 Object recognition 35 Antonio Torralba, Contextual Priming for Object Detection, IJCV(53), No. 2, July 2003, pp CSE 576, Spring 2008 Object recognition 36

CSE 576, Spring 2008 Object recognition 37 Summary of object

10 CVPR 2007 Minneapolis, Short Course, June 17 New work: tiny images (see other slide deck) Datasets and object collections CSE 576, Spring 2008 Object recognition 37 Summary of object recognition Context and scenes CSE 576, Spring 2008 Object recognition 39

Today I t n d ro ucti tion to computer vision Course overview Course requirements

Today I t n d ro ucti tion to computer vision Course overview Course requirements COMP 776: Computer Vision Today Introduction ti to computer vision i Course overview Course requirements The goal of computer vision To extract t meaning from pixels What we see What a computer sees Source: