Computer Vision. Howie Choset Introduction to Robotics

Similar documents
8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

Digital Image Processing Lec.(3) 4 th class

Computing for Engineers in Python

Study guide for Graduate Computer Vision

Images and Graphics. 4. Images and Graphics - Copyright Denis Hamelin - Ryerson University

Computer Vision Slides curtesy of Professor Gregory Dudek

Digital Image Processing. Lecture # 3 Image Enhancement

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

CMVision and Color Segmentation. CSE398/498 Robocup 19 Jan 05

CS534 Introduction to Computer Vision. Linear Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University

Exercise questions for Machine vision

Chapter 17. Shape-Based Operations

Image compression with multipixels

Image Processing for feature extraction

Digital Image Processing

Introduction to DSP ECE-S352 Fall Quarter 2000 Matlab Project 1

Digital Image Processing. Lecture # 4 Image Enhancement (Histogram)

Image Formation. World Optics Sensor Signal. Computer Vision. Introduction to. Light (Energy) Source. Surface Imaging Plane. Pinhole Lens.

ME 6406 MACHINE VISION. Georgia Institute of Technology

Vision Review: Image Processing. Course web page:

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

A Structured Light Range Imaging System Using a Moving Correlation Code

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

Prof. Feng Liu. Fall /04/2018

IMAGE PROCESSING: POINT PROCESSES

Lane Detection in Automotive

IMAGE FORMATION. Light source properties. Sensor characteristics Surface. Surface reflectance properties. Optics

ROBOT VISION. Dr.M.Madhavi, MED, MVSREC

Assistant Lecturer Sama S. Samaan

Improved SIFT Matching for Image Pairs with a Scale Difference

Overview. Pinhole camera model Projective geometry Vanishing points and lines Projection matrix Cameras with Lenses Color Digital image

APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE

CS 445 HW#2 Solutions

Image processing for gesture recognition: from theory to practice. Michela Goffredo University Roma TRE

Image Filtering in Spatial domain. Computer Vision Jia-Bin Huang, Virginia Tech

DIGITAL IMAGE PROCESSING

DECISION NUMBER FOURTEEN TO THE TREATY ON OPEN SKIES

IMAGE PROCESSING PAPER PRESENTATION ON IMAGE PROCESSING

EC-433 Digital Image Processing

The ultimate camera. Computational Photography. Creating the ultimate camera. The ultimate camera. What does it do?

Image Filtering. Median Filtering

CoE4TN4 Image Processing. Chapter 3: Intensity Transformation and Spatial Filtering

Fundamentals of Multimedia

On spatial resolution

Digital Image Processing 3/e

Chapter 9 Image Compression Standards

Sensors and Sensing Cameras and Camera Calibration

What is an image? Images and Displays. Representative display technologies. An image is:

Design of Temporally Dithered Codes for Increased Depth of Field in Structured Light Systems

Chapter 8. Representing Multimedia Digitally

PLazeR. a planar laser rangefinder. Robert Ying (ry2242) Derek Xingzhou He (xh2187) Peiqian Li (pl2521) Minh Trang Nguyen (mnn2108)

Chapter 12 Image Processing

DESIGN OF A LASER DISTANCE SENSOR WITH A WEB CAMERA FOR A MOBILE ROBOT

SYLLABUS CHAPTER - 2 : INTENSITY TRANSFORMATIONS. Some Basic Intensity Transformation Functions, Histogram Processing.

Digitizing Color. Place Value in a Decimal Number. Place Value in a Binary Number. Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally

Midterm Examination CS 534: Computational Photography

Computer and Machine Vision

PSEUDO HDR VIDEO USING INVERSE TONE MAPPING

Solution for Image & Video Processing

Image analysis. CS/CME/BIOPHYS/BMI 279 Fall 2015 Ron Dror

CSC 320 H1S CSC320 Exam Study Guide (Last updated: April 2, 2015) Winter 2015

Linear Gaussian Method to Detect Blurry Digital Images using SIFT

CS 376b Computer Vision

5/17/2009. Digitizing Color. Place Value in a Binary Number. Place Value in a Decimal Number. Place Value in a Binary Number

Preparing Remote Sensing Data for Natural Resources Mapping (image enhancement, rectifications )

CATEGORY SKILL SET REF. TASK ITEM

Image Processing for Mechatronics Engineering For senior undergraduate students Academic Year 2017/2018, Winter Semester

Modeling and Synthesis of Aperture Effects in Cameras

Images and Displays. Lecture Steve Marschner 1

Physics 2310 Lab #5: Thin Lenses and Concave Mirrors Dr. Michael Pierce (Univ. of Wyoming)

Lane Detection in Automotive

Image Enhancement in Spatial Domain

Sampling and Reconstruction. Today: Color Theory. Color Theory COMP575

Digital Image Processing Introduction

Computer Vision, Lecture 3

Histogram equalization

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Gonzales & Woods, Emmanuel Agu Suleyman Tosun

APPLICATIONS FOR TELECENTRIC LIGHTING

Checkerboard Tracker for Camera Calibration. Andrew DeKelaita EE368

Color Image Processing

CS101 Lecture 19: Digital Images. John Magee 18 July 2013 Some material copyright Jones and Bartlett. Overview/Questions

Color. Used heavily in human vision. Color is a pixel property, making some recognition problems easy

Image analysis. CS/CME/BioE/Biophys/BMI 279 Oct. 31 and Nov. 2, 2017 Ron Dror

A Primer on Human Vision: Insights and Inspiration for Computer Vision

Robot Visual Mapper. Hung Dang, Jasdeep Hundal and Ramu Nachiappan. Fig. 1: A typical image of Rovio s environment

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Image Processing Lecture 4

Be aware that there is no universal notation for the various quantities.

INSTITUTIONEN FÖR SYSTEMTEKNIK LULEÅ TEKNISKA UNIVERSITET

Image acquisition. Midterm Review. Digitization, line of image. Digitization, whole image. Geometric transformations. Interpolation 10/26/2016

Image Capture and Problems

NEW HIERARCHICAL NOISE REDUCTION 1

from: Point Operations (Single Operands)

Digital Image Fundamentals and Image Enhancement in the Spatial Domain

Achim J. Lilienthal Mobile Robotics and Olfaction Lab, AASS, Örebro University

Removing Temporal Stationary Blur in Route Panoramas

Applications of Flash and No-Flash Image Pairs in Mobile Phone Photography

CSE 527: Introduction to Computer Vision

Goal: Label Skin Pixels in an Image. Their Application. Background/Previous Work. Understanding Skin Albedo. Measuring Spectral Albedo of Skin

A Fast Algorithm of Extracting Rail Profile Base on the Structured Light

Transcription:

Computer Vision Howie Choset http://www.cs.cmu.edu.edu/~choset Introduction to Robotics http://generalrobotics.org

What is vision?

What is computer vision?

Edge Detection

Edge Detection

Interest points and descriptors (SIFT,..)

Segmentation

Segmentation

Recognizing objects and understanding scenes

Recognition

Tracking & Estimating motion

Motion

Reconstructing 3D

Multi-Camera Geometry: Stereo

Multi-Camera Geometry: Stereo

Multi-Camera Geometry

Understanding videos

Goal of Vision: Recover Projection P: R 3 -> R 2 or P: R 3 -> Z 2 Recover third dimension or just infer stuff

Optics Focal length Length f of projection through lens on image plane Inversion Object Projection on image plane is inverted Image plane f

Perspective 1 Point Perspective Using similar triangles, it is possible to determine the relative sizes of objects in an image Given a calibrated camera (predetermine a mathematical relationship between size on the image plane and the actual object) Object Image plane f

Projection on the image plane Size of an image on the image plane is inversely proportional to the distance from the focal point Image plane h d Focal point f h h d h f By conceptually moving the image plane, we can eliminate the negative sign h d Image plane h f Focal point h d h f

Move to three dimensions x x x f y y z z x x f, y z f y z z Image plane (x,y,z) y (x,y,z ) f x

Stereo

Images Discrete representation of a continuous function Pixel: Picture Element cell of constant color in a digital image An image is a two dimensional array of pixels Pixel: numeric value representing a uniform portion of an image Resolution Number of pixels across in horizontal Number of pixels in the vertical Number of layers used for color Often measured in bits per pixel (bpp) where each color uses 8 bits of data Ex: 640x480x24bpp

Images Binary images: Two color image Pixel is only one byte of information Indicates if the intensity of color is above or below some nominal value Thresholding Grayscale All pixels represent the intensity of light in an image, be it red, green, blue, or another color Like holding a piece of transparent colored plastic over your eyes Intensity of light in a pixel is stored as a number, generally 0..255 inclusive Color Three grayscale images layered on top of eachother with each layer indicating the intensity of a specific color light, generally red, green, and blue (RGB) Third dimension in a digital image

Grayscale vs. Binary image Grayscale Binary threshold

Thresholding Purpose Trying to find areas of high color intensity Highlights locations of different features of the image (notice Mona s eyes) Image compression, use fewer bits to encode a pixel How done Decide on a value Scan every pixel in the image If it is greater than, make it 255 If it is less than, make it 0 Picking a good Often 128 is a good value to start with Use a histogram to determine values based on color frequency features

Histogram Measure the number of pixels of different values in an image. Yields information such as the brightness of an image, important color features, possibilities of color elimination for compression

Mona s Histogram 0 255

How to Choose a Threshold Probability that a pixel has a gray value z Following Copied from Robot Motion and Control By Spong, Hutchinson, Vidyasagar

Probability and Mean (con t) H i z histogram for ith object z = 0 is background The mean for the ith object is μ i = N 1 z=0 z H i (z) N 1 H i z z=0 Conditional mean

Variance What is the mean for an image Half pixels 127, half pixels 128 Half pixel 0, half pixels 255 127.5 Same mean Different images Average deviation or variance σ 2 = N 1 z=0 z μ 2 P(z) σ i 2 N 1 = z μ i 2 H i z N 1 H i (z) z=0 z=0 Conditional variance

Automatic Threshold Selection i = 0,1 Less than z t More than z t Approach: Pick a threshold z t that minimizes the variance of the two resulting groups

More threshold selection Two groups, Background below Foreground - above

More threshold selection We could pick z t to minimize sum of BAD, because assumes equal number of pixels in back and foreground

More threshold selection Two groups, Background below Foreground - above We could pick z t to minimize sum of BAD, because assumes equal number of pixels in back and foreground

Copied from Robot Motion and Control By Spong, Hutchinson, Vidyasagar Example

Gaussian Masks Used to smooth images and for noise reduction Use before edge detection to avoid spurious edges Johann Carl Friedrich Gauss April 30, 1777 Feb 23, 1855 number theory, statistics, analysis, differential geometry, geodesy, electrostatics, astronomy, and optics. 17 heptadecagon

Connectivity Two conventions on considering two pixels next to each other 8 point connectivity All pixels sharing a side or corner are considered adjacent 4 point connectivity Only pixels sharing a side are considered adjacent To eliminate the ambiguity, we could define the shape of a pixel to be a hexagon