A Primer on Human Vision: Insights and Inspiration for Computer Vision

Similar documents
A Primer on Human Vision: Insights and Inspiration for Computer Vision

Spatial Vision: Primary Visual Cortex (Chapter 3, part 1)

Lecture 5. The Visual Cortex. Cortical Visual Processing

Fundamentals of Computer Vision

Probing sensory representations with metameric stimuli

Chapter Six Chapter Six

Parvocellular layers (3-6) Magnocellular layers (1 & 2)

CS 534: Computer Vision

Maps in the Brain Introduction

The Visual System. Computing and the Brain. Visual Illusions. Give us clues as to how the visual system works

Outline. The visual pathway. The Visual system part I. A large part of the brain is dedicated for vision

Spatial Vision: Primary Visual Cortex (Chapter 3, part 1)

Cortical sensory systems

Introduction to Visual Perception

Retina. Convergence. Early visual processing: retina & LGN. Visual Photoreptors: rods and cones. Visual Photoreptors: rods and cones.

Biological Vision. Ahmed Elgammal Dept of Computer Science Rutgers University

The Physiology of the Senses Lecture 3: Visual Perception of Objects

Psych 333, Winter 2008, Instructor Boynton, Exam 1

Computer Vision. Howie Choset Introduction to Robotics

Vision III. How We See Things (short version) Overview of Topics. From Early Processing to Object Perception

TSBB15 Computer Vision

Lecture 4 Foundations and Cognitive Processes in Visual Perception From the Retina to the Visual Cortex

Chapter 2: Starting from the very beginning

A Model of Feedback to the Lateral Geniculate Nucleus

1/21/2019. to see : to know what is where by looking. -Aristotle. The Anatomy of Visual Pathways: Anatomy and Function are Linked

Object Perception. 23 August PSY Object & Scene 1

Modeling cortical maps with Topographica

9.01 Introduction to Neuroscience Fall 2007

Neuromorphic Implementation of Orientation Hypercolumns. Thomas Yu Wing Choi, Paul A. Merolla, John V. Arthur, Kwabena A. Boahen, and Bertram E.

258 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 33, NO. 2, APRIL 2003

Neuromorphic Implementation of Orientation Hypercolumns

Vision V Perceiving Movement

SPATIAL VISION. ICS 280: Visual Perception. ICS 280: Visual Perception. Spatial Frequency Theory. Spatial Frequency Theory

Vision V Perceiving Movement

Color and perception Christian Miller CS Fall 2011

Spectral colors. What is colour? 11/23/17. Colour Vision 1 - receptoral. Colour Vision I: The receptoral basis of colour vision

Neural basis of pattern vision

AS Psychology Activity 4

The eye, displays and visual effects

7Motion Perception. 7 Motion Perception. 7 Computation of Visual Motion. Chapter 7

Color. Color. Colorfull world IFT3350. Victor Ostromoukhov Université de Montréal. Victor Ostromoukhov - Université de Montréal

Outline 2/21/2013. The Retina

The visual and oculomotor systems. Peter H. Schiller, year The visual cortex

iris pupil cornea ciliary muscles accommodation Retina Fovea blind spot

Early Visual Processing: Receptive Fields & Retinal Processing (Chapter 2, part 2)

Achromatic and chromatic vision, rods and cones.

3 THE VISUAL BRAIN. No Thing to See. Copyright Worth Publishers 2013 NOT FOR REPRODUCTION

Spatial coding: scaling, magnification & sampling

VISUAL NEURAL SIMULATOR

Yokohama City University lecture INTRODUCTION TO HUMAN VISION Presentation notes 7/10/14

A specialized face-processing network consistent with the representational geometry of monkey face patches

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

Visual Perception. Martin Čadík. Czech Technical University in Prague, Czech Republic

Lecture 4. Opponent Colors. Hue Cancellation Experiment HUV Color Space

Limulus eye: a filter cascade. Limulus 9/23/2011. Dynamic Response to Step Increase in Light Intensity

MITOCW watch?v=awyzdtetqbe

Reverse Engineering the Human Vision System

AP PSYCH Unit 4.2 Vision 1. How does the eye transform light energy into neural messages? 2. How does the brain process visual information? 3.

Victor Ostromoukhov Université de Montréal. Victor Ostromoukhov - Université de Montréal

Image Processing Final Test

Domain-Specificity versus Expertise in Face Processing

A Biological Model of Object Recognition with Feature Learning

Night-time pedestrian detection via Neuromorphic approach

Don t twinkle, little star!

vertical horizonta fovea Figure by MIT OCW.

Review, the visual and oculomotor systems

Thesis: Bio-Inspired Vision Model Implementation In Compressed Surveillance Videos by. Saman Poursoltan. Thesis submitted for the degree of

VISUAL NEURAL SIMULATOR

A Biological Model of Object Recognition with Feature Learning. Jennifer Louie

Digital Image Processing Lec 02 - Image Formation - Color Space

Biologically Inspired Computation

Sensing and Perception

The eye* The eye is a slightly asymmetrical globe, about an inch in diameter. The front part of the eye (the part you see in the mirror) includes:

Human Vision. Human Vision - Perception

Visual System I Eye and Retina

Psy 627: Advanced Topics in Visual Perception

Lecture 15 End Chap. 6 Optical Instruments (2 slides) Begin Chap. 7 Visual Perception

Sunderland, NE England

Large Scale Imaging of the Retina. 1. The Retina a Biological Pixel Detector 2. Probing the Retina

CMVision and Color Segmentation. CSE398/498 Robocup 19 Jan 05

2 The First Steps in Vision

Human Vision and Human-Computer Interaction. Much content from Jeff Johnson, UI Wizards, Inc.

CS510: Image Computation. Ross Beveridge Jan 16, 2018

PERCEIVING MOVEMENT. Ways to create movement

Visual Rules. Why are they necessary?

Structure and Measurement of the brain lecture notes

Sensation and Perception. Sensation. Sensory Receptors. Sensation. General Properties of Sensory Systems

Our visual system always has to compute a solid object given definite limitations in the evidence that the eye is able to obtain from the world, by

Simple Measures of Visual Encoding. vs. Information Theory

Brain Computer Interfaces Lecture 2: Current State of the Art in BCIs

PERCEIVING MOTION CHAPTER 8

Invariant Object Recognition in the Visual System with Novel Views of 3D Objects

Decoding Natural Signals from the Peripheral Retina

Adaptive pyramid model for the Traveling Salesman Problem

P rcep e t p i t on n a s a s u n u c n ons n c s ious u s i nf n e f renc n e L ctur u e 4 : Recogni n t i io i n

An Engineer s Perspective on of the Retina. Steve Collins Department of Engineering Science University of Oxford

Chapter 73. Two-Stroke Apparent Motion. George Mather

Motion Detector Using High Level Feature Extraction

Sensation, Part 4 Gleitman et al. (2011), Chapter 4

Templates and Image Pyramids

Transcription:

A Primer on Human Vision: Insights and Inspiration for Computer Vision Guest&Lecture:&Marius&Cătălin&Iordan&& CS&131&8&Computer&Vision:&Foundations&and&Applications& 27&October&2014

detection recognition segmentation! visual! understanding

1. The Mammalian Visual System anatomy and processing pathways from processing to perception 2. Early Computation first stages of information processing edges: a basis for representing the visual world 3. Object Recognition in the Human Visual System building invariance: pooling and transformation untangling object representations Roadmap 23

The Camera : Retinal Projection Mammalian Visual System anatomy and processing pathways 34

Neuron Mammalian Visual System anatomy and processing pathways 35

figure courtesy of A. Alahi Mammalian Visual System anatomy and processing pathways 36

From Retina to Cortex world retina (compression) LGN V1 visual cortex (expansion) Mammalian Visual System anatomy and processing pathways 7

From Retina to Cortex size of! representation world retina (compression) LGN V1 visual cortex (expansion) Mammalian Visual System anatomy and processing pathways 7

Cortex: At Least Two Dozen Visual Areas plus more: motion areas, functional regions, etc. IT IT Weiner & Grill-Spector (2012) Mammalian Visual System anatomy and processing pathways 8

Visual Processing vs. Perception perception involves integration and higher functions IT IT Weiner & Grill-Spector (2012) Mammalian Visual System from processing to perception 9

Mammalian Visual System visual processing is done in stages but it is not synonymous with perception DiCarlo & Cox (2007) Mammalian Visual System key points 10

2. Early Computation local information processing Early Computation 11 3

Retinal Projection picture is inverted, but spatial relationships are preserved Early Computation first stages of processing 12 3

Receptive Fields extract similar features at each position in the visual field receptive fields figure adapted from Ebner & Hameroff (2011) Early Computation first stages of processing 13 3

Receptive Fields extract similar features at each position in the visual field ganglion cells center-surround receptive fields figure adapted from Ebner & Hameroff (2011) Early Computation first stages of processing 14 3

Receptive Fields extract similar features at each position in the visual field ON ganglion cells Early Computation first stages of processing 15 3

Receptive Fields extract similar features at each position in the visual field ON ganglion cells Early Computation first stages of processing 15 3

DoG Threshold figure courtesy of A. Alahi Early Computation first stages of processing 16 3

From Retina to Cortex world retina (compression) LGN V1 visual cortex (expansion) Early Computation first stages of processing 17 3

Single Electrode Recording Early Computation first stages of processing 18 3

Single Electrode Recording https://www.youtube.com/watch?v=8vdff3egwfg Hubel & Wiesel (1962) Early Computation first stages of processing 19 3

Types of V1 cells simple cells Hubel & Wiesel (1962) Early Computation first stages of processing 20

Types of V1 cells simple cells Hubel & Wiesel (1962) Early Computation first stages of processing 21

Types of V1 cells simple cells Hubel & Wiesel (1962) Early Computation first stages of processing 21

Types of V1 cells complex cells Hubel & Wiesel (1962) Early Computation first stages of processing 22

Types of V1 cells simple and complex cells are sensitive to: center-surround (difference of gaussians!) edges (symmetrical and asymmetrical) rectangles of various elongation, visual half fields V1 also has cells that are sensitive to: motion (in fact, it s a separate processing stream!) color (groups of cells called color blobs ) other stuff Hubel & Wiesel (1962) Early Computation first stages of processing 23

Why Are Edges Special? image = neurons * weights? 0? 0? 0? 1? 0? 0? 0? 0? 0? 0? 0? 1? 0? 2? 0? 0 Olshausen & Field (1996) Early Computation edges: a basis for natural images 24 3

Why Are Edges Special? image = neurons * weights 0 0 0 1 0 0 0 0 0 0 0 1 0 2 0 0 Olshausen & Field (1996) Early Computation edges: a basis for natural images 24 3

Why Are Edges Special? each pixel activated independently INPUT BASIS FOUND Olshausen & Field (1996) Early Computation edges: a basis for natural images 25 3

Why Are Edges Special? images are sums of independent gratings INPUT BASIS FOUND Olshausen & Field (1996) Early Computation edges: a basis for natural images 26 3

Why Are Edges Special? patches of real-world images Olshausen & Field (1996) Early Computation edges: a basis for natural images 27 3

Early Computation V1 cells encode edge orientation and position across visual field: simple and complex receptive fields edges are a sufficient basis for real-world images! Hubel & Wiesel (1962), Olshausen & Field (1996) Early Computation key points 28 3

3. Object Recognition in the Human Visual System sequential transformations Object Recognition 29 3

Object Recognition building invariance 30 3

The Flow of Information IT IT Weiner & Grill-Spector (2012) Object Recognition building invariance 31 3

V1 Retina Van Essen (1991) Object Recognition building invariance 32 3

Specialization: What and Where Pathways monkey lesion studies lesion where pathway: difficulty in spatial reasoning lesion what pathway: difficulty in object recognition Mishkin & Ungerleider 1982 Object Recognition building invariance 33 3

Specialization: What and Where Pathways monkey lesion studies lesion where pathway: difficulty in spatial reasoning lesion what pathway: difficulty in object recognition Mishkin & Ungerleider 1982 Object Recognition building invariance 33 3

Object Recognition: The What Pathway DiCarlo & Cox (2007) Object Recognition building invariance 34 3

Object Recognition: The What Pathway V1 IT Freeman & Simoncelli (2011), Tanaka (1997) Object Recognition building invariance 35 3

Object as Manifolds in High Dimensional Space DiCarlo & Cox (2007) Object Recognition untangling object representation 36 3

Untangling Object Manifolds DiCarlo & Cox (2007) Object Recognition untangling object representation 37 3

Object Recognition: A Step Towards Visual Understanding in the human visual system, invariance is built gradually across many successive transformations human and computer vision both strive to achieve invariant representations Freeman & Simoncelli (2011), DiCarlo & Cox (2007) Object Recognition key points 38 3

Discussion human visual invariance vs. CV features incremental invariance vs. all-at-once should we build detectors invariant to everything?

Mammalian Visual System visual processing is done in stages but it is not synonymous with perception DiCarlo & Cox (2007) Conclusion #1 40

Early Computation V1 cells encode edge orientation and position across visual field: simple and complex receptive fields edges are a sufficient basis for real-world images! Hubel & Wiesel (1962), Olshausen & Field (1996) Conclusion #2 41 3

Object Recognition: A Step Towards Visual Understanding in the human visual system, invariance is built gradually across many successive transformations human and computer vision both strive to achieve invariant representations Freeman & Simoncelli (2011), DiCarlo & Cox (2007) Conclusion #3 42 3

Mid-Quarter Feedback Forms