physiology of vision: a swift overview 16-721: learning-based methods in vision a. efros, cmu,...

63
Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Visi A. Efros, CMU, Spring 20 Some figures from Steve Palmer Thalamus LGN Striate cortex (V1) Dorsal Stream Parietal visual cortex Temporal visual cortex Eye Optic nerve Extrastriate cortex Ventral Stream

Upload: gavin-anthony

Post on 31-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Physiology of Vision: a swift overview

16-721: Learning-Based Methods in VisionA. Efros, CMU, Spring 2009

Some figures from Steve Palmer

ThalamusLGN

Striatecortex(V1)

DorsalStream

Parietalvisualcortex

Temporalvisualcortex

Eye Opticnerve

Extrastriatecortex

VentralStream

Page 2: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Image Formation

Digital Camera

The Eye

Film

Page 3: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Monocular Visual Field: 160 deg (w) X 135 deg (h)Binocular Visual Field: 200 deg (w) X 135 deg (h)

Page 4: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Point of observation

Figures © Stephen E. Palmer, 2002

What do we see?

3D world 2D image

Page 5: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Point of observation

What do we see?

3D world 2D image

Painted backdrop

Page 6: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Plenoptic Function

Q: What is the set of all things that we can ever see?

A: The Plenoptic Function (Adelson & Bergen)

Let’s start with a stationary person and try to parameterize everything that he can see…

Figure by Leonard McMillan

Page 7: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Grayscale snapshot

is intensity of light • Seen from a single view point

• At a single time

• Averaged over the wavelengths of the visible spectrum

(can also do P(x,y), but spherical coordinate are nicer)

P()

Page 8: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Color snapshot

is intensity of light • Seen from a single view point

• At a single time

• As a function of wavelength

P()

Page 9: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Spherical Panorama

All light rays through a point form a ponorama

Totally captured in a 2D array -- P()Where is the geometry???

See also: 2003 New Years Eve

http://www.panoramas.dk/fullscreen3/f1.html

Page 10: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

A movie

is intensity of light • Seen from a single view point

• Over time

• As a function of wavelength

P(,t)

Page 11: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Space-time images

x

y

t

Page 12: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Holographic movie

is intensity of light • Seen from ANY viewpoint

• Over time

• As a function of wavelength

P(,t,VX,VY,VZ)

Page 13: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Plenoptic Function

• Can reconstruct every possible view, at every moment, from every position, at every wavelength

• Contains every photograph, every movie, everything that anyone has ever seen! it completely captures our visual reality! Not bad for a function…

P(,t,VX,VY,VZ)

Page 14: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Eye is a camera

The human eye is a camera!• Iris - colored annulus with radial muscles

• Pupil - the hole (aperture) whose size is controlled by the iris

• What’s the “film”?– photoreceptor cells (rods and cones) in the retina

Page 15: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Retina

Cross-section of eye

Ganglion cell layer

Bipolar cell layer

Receptor layer

Pigmentedepithelium

Ganglion axons

Cross section of retina

Page 16: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Retina up-close

Light

Page 17: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

© Stephen E. Palmer, 2002

Cones cone-shaped less sensitive operate in high light color vision

Two types of light-sensitive receptors

cone

rod

Rods rod-shaped highly sensitive operate at night gray-scale vision

Page 18: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Rod / Cone sensitivity

The famous sock-matching problem…

Page 19: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

© Stephen E. Palmer, 2002

Distribution of Rods and Cones.

0

150,000

100,000

50,000

020 40 60 8020406080

Visual Angle (degrees from fovea)

Rods

Cones Cones

Rods

FoveaBlindSpot

# R

ecep

tors

/mm

2

Night Sky: why are there more stars off-center?

Page 20: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Electromagnetic Spectrum

http://www.yorku.ca/eye/photopik.htm

Human Luminance Sensitivity Function

Page 21: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Why do we see light of these wavelengths?

© Stephen E. Palmer, 2002

.

0 1000 2000 3000

En

erg

y

Wavelength (nm)

400 700

700 C

2000 C

5000 C

10000 C

VisibleRegion

…because that’s where theSun radiates EM energy

Visible Light

Page 22: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Physics of Light

Any patch of light can be completely describedphysically by its spectrum: the number of photons (per time unit) at each wavelength 400 - 700 nm.

400 500 600 700

Wavelength (nm.)

# Photons(per ms.)

© Stephen E. Palmer, 2002

Page 23: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Physics of Light

.

# P

ho

ton

s

D. Normal Daylight

Wavelength (nm.)

B. Gallium Phosphide Crystal

400 500 600 700

# P

ho

ton

s

Wavelength (nm.)

A. Ruby Laser

400 500 600 700

400 500 600 700

# P

ho

ton

s

C. Tungsten Lightbulb

400 500 600 700

# P

ho

ton

s

Some examples of the spectra of light sources

© Stephen E. Palmer, 2002

Page 24: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Physics of Light

Some examples of the reflectance spectra of surfaces

Wavelength (nm)

% P

hoto

ns R

efle

cted

Red

400 700

Yellow

400 700

Blue

400 700

Purple

400 700

© Stephen E. Palmer, 2002

Page 25: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Psychophysical Correspondence

There is no simple functional description for the perceivedcolor of all lights under all viewing conditions, but …...

A helpful constraint: Consider only physical spectra with normal distributions

area

Wavelength (nm.)

# Photons

400 700500 600

mean

variance

© Stephen E. Palmer, 2002

Page 26: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Psychophysical Correspondence

Mean Hue

yellowgreenblue

# P

hoto

ns

Wavelength

© Stephen E. Palmer, 2002

Page 27: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Psychophysical Correspondence

Variance Saturation

Wavelength

high

medium

low

hi.

med.

low# P

hoto

ns

© Stephen E. Palmer, 2002

Page 28: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

The Psychophysical Correspondence

Area Brightness#

Pho

tons

Wavelength

B. Area Lightness

bright

dark

© Stephen E. Palmer, 2002

Page 29: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

© Stephen E. Palmer, 2002

.

400 450 500 550 600 650

RE

LAT

IVE

AB

SO

RB

AN

CE

(%

)

WAVELENGTH (nm.)

100

50

440

S

530 560 nm.

M L

Three kinds of cones:

Physiology of Color Vision

• Why are M and L cones so close?

Page 30: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Retinal Processing

© Stephen E. Palmer, 2002

Page 31: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Single Cell Recording

Microelectrode

Amplifier

Time

Electrical response(action potentials)

mV

© Stephen E. Palmer, 2002

Page 32: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Single Cell Recording

© Stephen E. Palmer, 2002

Page 33: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Retinal Receptive Fields

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

© Stephen E. Palmer, 2002

Page 34: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 35: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 36: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 37: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 38: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in ganglion cells:On-center Off-surround

Stimulus condition Electrical response

Time

Response

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 39: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

RF of On-center Off-surround cells

Receptive FieldNeural Response

Center

Surround

On Off

Response Profile

on-center

off-surround

Horizontal Position

FiringRate

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 40: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

RF of Off-center On-surround cells

Receptive Field

Horizontal Position

on-surround

off-center

Response Profile

FiringRate

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Center

Surround

On Off

Neural Response

Surround

Center

Page 41: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Retinal Receptive Fields

Page 42: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in bipolar cells

Light

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 43: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Receptive field structure in bipolar cells

Receptors

Bipolar Cell

A. WIRING DIAGRAM

HorizontalCells

Direct excitatory component (D)

B. RECEPTIVE FIELD PROFILES

LIGHT

Direct Path

Indirect Path

Indirectinhibitory

component (I)

D + I

Retinal Receptive Fields

© Stephen E. Palmer, 2002

Page 44: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

© Stephen E. Palmer, 2002

Visual CortexVisual Cortex

aka:Primary visual cortexStriate cortexBrodman’s area 17

Cortical Area V1

ThalamusLGN

Striatecortex(V1)

DorsalStream

Parietalvisualcortex

Temporalvisualcortex

Eye Opticnerve

Extrastriatecortex

VentralStream

Page 45: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Single-cell recording from visual cortex

David Hubel & Thorston Wiesel© Stephen E. Palmer, 2002

Page 46: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Single-cell recording from visual cortex

TimeTime

© Stephen E. Palmer, 2002

Page 47: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Three classes of cells in V1

Simple cells

Complex cells

Hypercomplex cells

© Stephen E. Palmer, 2002

Page 48: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Simple Cells: “Line Detectors”

A. Light Line Detector

Horizontal Position

FiringRate

B. Dark Line Detector

Horizontal Position

FiringRate

© Stephen E. Palmer, 2002

Page 49: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Simple Cells: “Edge Detectors”

C. Dark-to-light Edge Detector

Horizontal Position

FiringRate

D. Light-to-dark Edge Detector

Horizontal Position

FiringRate

© Stephen E. Palmer, 2002

Page 50: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Constructing a line detector

Receptive Fields

Retina LGN

Center-Surround Cells

Simple Cell

CorticalArea V1

© Stephen E. Palmer, 2002

Page 51: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Complex Cells

STIMULUS NEURAL RESPONSE

Time

00o

© Stephen E. Palmer, 2002

Page 52: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Complex Cells

STIMULUS NEURAL RESPONSE

Time

060o

© Stephen E. Palmer, 2002

Page 53: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Complex Cells

STIMULUS NEURAL RESPONSE

Time

090o

© Stephen E. Palmer, 2002

Page 54: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Complex Cells

STIMULUS NEURAL RESPONSE

Time

0120o

© Stephen E. Palmer, 2002

Page 55: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Constructing a Complex Cell

Simple Cells

Cortical Area V1

Complex CellReceptive Fields

Retina

© Stephen E. Palmer, 2002

Page 56: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Hypercomplex Cells

Time

STIMULUS NEURAL RESPONSE

© Stephen E. Palmer, 2002

Page 57: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Hypercomplex Cells

Time

STIMULUS NEURAL RESPONSE

© Stephen E. Palmer, 2002

Page 58: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Hypercomplex Cells

Time

STIMULUS NEURAL RESPONSE

© Stephen E. Palmer, 2002

Page 59: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Hypercomplex Cells

Time

STIMULUS NEURAL RESPONSE

“End-stopped” Cells© Stephen E. Palmer, 2002

Page 60: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Time

STIMULUS NEURAL RESPONSE

“End-stopped” Simple Cells

© Stephen E. Palmer, 2002

Page 61: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Cortical Receptive Fields

Constructing a Hypercomplex Cell

Receptive Fields

RETINA CORTICAL AREA V1

Complex Cell End-stopped Cell

© Stephen E. Palmer, 2002

Page 62: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Mapping from Retina to V1Mapping from Retina to V1

Page 63: Physiology of Vision: a swift overview 16-721: Learning-Based Methods in Vision A. Efros, CMU, Spring 2009 Some figures from Steve Palmer

Why edges?

So, why “edge-like” structures in the Plenoptic Function?