Today. CS 395T Visual Recognition. Course content. Administration. Expectations. Paper reviews

Similar documents
Computer Vision Lecture 1

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu

Face detection, face alignment, and face image parsing

Computer Vision. Thursday, August 30

Today I t n d ro ucti tion to computer vision Course overview Course requirements

High Level Computer Vision. Introduction - April 16, Bernt Schiele & Mario Fritz MPI Informatics and Saarland University, Saarbrücken, Germany

An Un-awarely Collected Real World Face Database: The ISL-Door Face Database

EFFICIENT ATTENDANCE MANAGEMENT SYSTEM USING FACE DETECTION AND RECOGNITION

A Comparison of Histogram and Template Matching for Face Verification

Spring 2018 CS543 / ECE549 Computer Vision. Course webpage URL:

Book Cover Recognition Project

Graz University of Technology (Austria)

Semantic Localization of Indoor Places. Lukas Kuster

CS6550 Computer Vision

Recognition problems. Object Recognition. Readings. What is recognition?

Real-Time Tracking via On-line Boosting Helmut Grabner, Michael Grabner, Horst Bischof

Multiple Kernels for Object Detection. Andrea Vedaldi Varun Gulshan Manik Varma Andrew Zisserman

A VIDEO CAMERA ROAD SIGN SYSTEM OF THE EARLY WARNING FROM COLLISION WITH THE WILD ANIMALS

Visual Search using Principal Component Analysis

Practical Image and Video Processing Using MATLAB

Object Recognition + Gesture Recognition

Multi-modal Human-computer Interaction

A Proposal for Security Oversight at Automated Teller Machine System

Image Processing Based Vehicle Detection And Tracking System

INTAIRACT: Joint Hand Gesture and Fingertip Classification for Touchless Interaction

Automatic understanding of the visual world

An Efficient Approach to Face Recognition Using a Modified Center-Symmetric Local Binary Pattern (MCS-LBP)

Telling What-Is-What in Video. Gerard Medioni

CS 7643: Deep Learning

The Distributed Camera

Colour correction for panoramic imaging

COMP 776: Computer Vision

Real-Time Face Detection and Tracking for High Resolution Smart Camera System

Applying Vision to Intelligent Human-Computer Interaction

3D Interaction using Hand Motion Tracking. Srinath Sridhar Antti Oulasvirta

DIGITAL IMAGE PROCESSING

A TWO-PART PREDICTIVE CODER FOR MULTITASK SIGNAL COMPRESSION. Scott Deeann Chen and Pierre Moulin

Face Detection using 3-D Time-of-Flight and Colour Cameras

CSE Tue 10/09. Nadir Weibel

Digital Image Processing. Lecture # 6 Corner Detection & Color Processing

CSE 527: Introduction to Computer Vision

Image Analysis & Searching

Pose Invariant Face Recognition

Object Category Detection using Audio-visual Cues

Comparing Computer-predicted Fixations to Human Gaze

Face Detection System on Ada boost Algorithm Using Haar Classifiers

Perception. Introduction to HRI Simmons & Nourbakhsh Spring 2015

Introduction. BIL719 Computer Vision Pinar Duygulu Hacettepe University

Study Impact of Architectural Style and Partial View on Landmark Recognition

Lecture 23 Deep Learning: Segmentation

A Fuller Understanding of Fully Convolutional Networks. Evan Shelhamer* Jonathan Long* Trevor Darrell UC Berkeley in CVPR'15, PAMI'16

Sketching Interface. Larry Rudolph April 24, Pervasive Computing MIT SMA 5508 Spring 2006 Larry Rudolph

Domain Adaptation & Transfer: All You Need to Use Simulation for Real

Intelligent Traffic Sign Detector: Adaptive Learning Based on Online Gathering of Training Samples

To Infinity And Beyond. Computer Vision for Astronomy

Sketching Interface. Motivation

Improved SIFT Matching for Image Pairs with a Scale Difference

A Real Time Static & Dynamic Hand Gesture Recognition System

Evaluation of Image Segmentation Based on Histograms

Vehicle Detection using Images from Traffic Security Camera

Stereo-based Hand Gesture Tracking and Recognition in Immersive Stereoscopic Displays. Habib Abi-Rached Thursday 17 February 2005.

Student Attendance Monitoring System Via Face Detection and Recognition System

Computer Vision Lesson Plan

Visual Interpretation of Hand Gestures as a Practical Interface Modality

Wadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology

ELE 882: Introduction to Digital Image Processing (DIP)

SCIENCE & TECHNOLOGY

Finding people in repeated shots of the same scene

Session 2: 10 Year Vision session (11:00-12:20) - Tuesday. Session 3: Poster Highlights A (14:00-15:00) - Tuesday 20 posters (3minutes per poster)

International Journal of Advanced Research in Computer Science and Software Engineering

Controlling Humanoid Robot Using Head Movements

Real Time Word to Picture Translation for Chinese Restaurant Menus

CS4670 / 5670: Computer Vision Noah Snavely

Evaluating Context-Aware Saliency Detection Method

OBJECTIVE OF THE BOOK ORGANIZATION OF THE BOOK

Detection and Segmentation. Fei-Fei Li & Justin Johnson & Serena Yeung. Lecture 11 -

Face Detection: A Literature Review

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

Main Subject Detection of Image by Cropping Specific Sharp Area

CSE 408 Multimedia Information System

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen

CPSC 340: Machine Learning and Data Mining. Convolutional Neural Networks Fall 2018

ME 6406 MACHINE VISION. Georgia Institute of Technology

VICs: A Modular Vision-Based HCI Framework

Evaluating the stability of SIFT keypoints across cameras

Colour Based People Search in Surveillance

Introduction. Ioannis Rekleitis

Research Seminar. Stefano CARRINO fr.ch

Autocomplete Sketch Tool

Effects of the Unscented Kalman Filter Process for High Performance Face Detector

Computer Vision Slides curtesy of Professor Gregory Dudek

Low-level global features for vision-based localization

Taking Great Pictures (Automatically)

Personal Driving Diary: Constructing a Video Archive of Everyday Driving Events

Cafeteria Vision. Identification and Amount Measurement of Foods in a Plate. Ting-Fan Wu. Abstract

Lesson Plans 5/13/13 5/17/13. Art 1: Perspective & Surreal Interiors Painting & Drawing: Perspective Paintings & Drawings

Machine Learning and Decision Making for Sustainability

Multi-Image Deblurring For Real-Time Face Recognition System

Lecture 1 Introduction to Computer Vision. Lin ZHANG, PhD School of Software Engineering, Tongji University Spring 2015

Transcription:

Today CS 395T Visual Recognition Course logistics Overview Volunteers, prep for next week Thursday, January 18 Administration Class: Tues / Thurs 12:30-2 PM Instructor: Kristen Grauman grauman at cs.utexas.edu TAY 4.118 Office hours: Tues / Thurs 3-4 PM Class page: link from http://www.cs.utexas.edu/~grauman ** Check for updates to schedule. Course content Focus on current research in visual category and object recognition High-level vision and learning problems We will not spend much time on low-level image processing, video-based techniques, particular vision systems, human vision system. Expectations Discussions will center on recent papers in the field Paper reviews, prepared discussion points Student presentations Paper content, demos (extra credit) Projects Research-oriented Paper reviews For each class, choose one of the 2-3 papers we are covering to review Reviews due via email to me before class Posted for our class (anonymously) Also, prepare (write down) a few discussion points to have on hand in class about all of the papers you read. 1

Paper review guidelines Brief (2-3 sentences) summary Main contribution Strengths? Weaknesses? How convincing are the experiments? Suggestions to improve them? Extensions? Additional comments. More is not necessarily more. Presentation guidelines Approx. 25 minutes Clear overview of the paper Consider: Main problem, motivation Assumptions Technical approach: high level and intuition Important technical details Experiments Connections to other papers Demo guidelines Implement/download code for a main idea in the paper and show us toy example(s): Experiment with different types of (mini) training/testing data sets Evaluate sensitivity to parameter settings Show (on a small scale) an example in practice that highlights a strength/weakness of the approach Projects Possibilities: Extend a technique studied in class Empirical evaluation and analysis of a few related techniques Design and evaluate a novel approach May be possible to tie it into your research Work in pairs Proposal due at midterm (March 8) Short presentation at end of term, paper What is visual recognition? Perception of familiar objects Given image data, determine what s in it, and where Detection, categorization, identification 2

ride deck sky The Wicked Twister Lake Erie tree tree bench water amusement park Cedar Point Ferris wheel tree ride 12 E Categories Instances Activities Scenes Locations Text / writing Faces Gestures Emotions ride people waiting in line people sitting on ride umbrellas maxair carousel tree pedestrians Our focus in this class (primarily) Categories butterfly butterfly building building Specific Wild card Tower Bridge Bevo Some recognition applications Why recognition? Fundamental problem in computer vision Area is rich with very challenging questions Applications 3

Key challenges: robustness Key challenges: efficiency Illumination Object pose Clutter Thousands to millions of pixels in an image 3,000-30,000 human recognizable object categories 30+ degrees of freedom in the pose of articulated objects (humans) Billions of images indexed by Google Image Search 18 billion+ prints produced from digital camera images in 2004 295.5 million camera phones sold in 2005 Occlusions Intra-class appearance Viewpoint Disciplines Recognition (and vision in general) draws from Machine learning Probability Geometry, physics AI Algorithms Image processing Cognitive science Recognition research Representation Algorithms Contributions in recognition Evaluation paradigms Machine learning Systems Global image representations Local image representations Describe component regions or patches separately Map image to a single vector based on overall characteristics vector of pixel intensities grayscale / color histogram bank of filter responses SIFT [Lowe] Salient regions [Kadir et al.] Shape context [Belongie et al.] Harris-Affine [Schmid et al.] Superpixels [Ren et al.] Spin images [Johnson and Hebert] Maximally Stable Extremal Regions [Matas et al.] Geometric Blur [Berg et al.] 4

Representing object shape, geometry Algorithms optimization indexing 3-D models View-based models Parts + structure Bag of features Image from Sigal et al. Images from ETH-80 Image from Fischler and Elschlager Image from ICCV short course graphs matching distances ASM image from T. Cootes, Graph image from S. Yu Learning What defines a category/class? What distinguishes classes from one another? How to understand the connection between the real world and what we observe? What features are most informative? What can we do without human intervention? Does previous learning experience help learn the next category? Inputs/outputs/assumptions What input is available? Static grayscale image 3D range data Video sequence Multiple calibrated cameras Segmented data, unsegmented data Labeled data, unlabeled data, partially labeled data What is the goal? Say yes/no as to whether an object present in image Categorize all objects Forced choice from pool of categories Bounding box on object Full segmentation Build a model of an object category Less Spectrum of supervision More Category recognition: state-of-the-art What s possible now? What s difficult now? One way to measure: benchmark data sets. 5

Category recognition: state-of-the-art PASCAL Visual Object Classes Challenge 2006 10 categories Unsegmented, realistic images Supervised setup Classification task: For each class, predict presence/absence of an example of that class in the test image. 26 teams/methods competed Bicycles Images thanks to Mark Everingham Bicycles Cats Images thanks to Mark Everingham Images thanks to Mark Everingham Cats Cars Images thanks to Mark Everingham 6

Cars Cows Cows Person Person Category recognition: state-of-the-art Caltech-101 Database 101 categories Wide appearance variation Images fairly centered and scaled similarly Supervised setup Classification task: predict class for test images Around 12 methods tested in literature 7

Example images from the Caltech-101 database ant sunflower tick fan cougar cup 2004 6/05 12/05 3/06 6/06 Time of publication Topics Through readings in recent vision literature part-based models for recognition invariant local features bags of features and feature vocabularies spatial constraints and geometry shape descriptors and matching learning similarity measures fast indexing methods recognition with text and images the role of context in recognition unsupervised category discovery Goals of this course Understand current approaches Analyze Identify interesting research questions Coming up For tomorrow: Send me 4-5 paper preferences for presentations up to spring break For Tuesday Jan 23 Face Recognition Using Eigenfaces by Turk and Pentland Face Recognition Using Active Appearance Models by Edwards et al. Bring discussion points Review one of the papers (email to grauman@cs by Tuesday 12:30) Demos preferences For Thursday Jan 25 Rapid Object Detection Using a Boosted Cascade of Simple Features by Viola and Jones Face Recognition by Humans by Sinha et al. Bring discussion points Review on Viola and Jones 8

Coming up Presentation volunteers Quick survey 9