Course Overview
Knowledge
Ch. 3: Vision. How are objects recognized? It looks easy, but it’s not.
The Brain
Acquisition (perception)
Ch. 4: Attention.
Use
Object Recognition
• Visual System Recognizes:
– Object Identities (man, woman, child, oar, lake)
– Spatial Layout of Scene
– Properties of Surfaces (color, texture, etc.)
• Are these real psychological dimensions? (What about small vs. big? Alive vs. inanimate? Tool vs. non-tool? Soft vs. hard?)
• Are these dimensions processed independently by the mind/brain? (After all, our subjective experience is that of an integrated scene.)
Specific disorders suggest functionally independent types of visual analysis
1. Visual Agnosia (object identity):
• loss of ability to recognize objects
• preserved ability to navigate, reach
• bilateral damage to occipital/temporal areas
• Prosopagnosia (specific to face recognition)
2. Balint’s Syndrome (spatial layout):
• inability to navigate, reach
• preserved ability to recognize objects
• damage to parietal areas
3. Cerebral Achromatopsia (color):
• inability to discriminate colors
• distinct from color blindness: color blindness results from abnormalities in the photoreceptors of the eye, whereas cerebral achromatopsia results from damage to posterior visual areas in the brain
fMRI studies have confirmed the anatomical segregation of these functions
1. Object recognition (lateral occipital cortex)
• face recognition (medio-temporal)
2. Spatial layout and spatial attention
• parietal areas
3. Color
• medio-temporal
What is difficult about object recognition?
1. The visual system must carry out “image segmentation”, but object boundaries are not easily determined
2. Objects may appear anywhere on the retina, in any size.
[Figure: the letter A shown at many positions and sizes]
Perhaps the brain is able to represent these objects in a way that is “translationally invariant” and “size invariant”.
3. Objects may occlude each other, or they may be superimposed on each other.
4. Same object category (‘e’) may have different shapes
[Figure: the letter E drawn in many different shapes]
Perceptual constancy: Shape
5. The same object (‘this door’) may have different shapes
The many shapes of Hillary…
6. Objects may be abnormally oriented
Letters & Digits: Small Effects
Words: Moderate Effects
How is object recognition accomplished?
Simplest idea: “Template Model”:
Store in brain a copy of what every possible input will look like.
Match observed object to the proper image in memory
[Figure: a perceptual representation compared with memory representations]
Template Theory
Problems with Template Theory:
(1) Massive numbers of templates are required (remember all those E’s?).
(2) Predicts no transfer to novel views of the same object (unlikely for different retinal positions)
(3) Objects are often obstructed (remember the baby?)
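The first two problems can be illustrated with a minimal sketch, assuming a toy pixel-overlap matcher. The 5x5 grids, the `overlap` score, and all function names below are hypothetical illustrations and do not come from the slides. A stored template matches its own image perfectly, but the same letter shifted two pixels scores poorly, so a pure template system would need a separate template for every position.

```python
# Hypothetical 5x5 binary images: "1" = ink, "." = background.
TEMPLATES = {
    "E": ["111..", "1....", "11...", "1....", "111.."],
    "F": ["111..", "1....", "11...", "1....", "1...."],
}

def overlap(image, template):
    """Fraction of pixels on which the image and the template agree."""
    cells = [a == b
             for row_i, row_t in zip(image, template)
             for a, b in zip(row_i, row_t)]
    return sum(cells) / len(cells)

def recognize(image):
    """Return the label of the best-matching template and its score."""
    scores = {label: overlap(image, t) for label, t in TEMPLATES.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

def shift_right(image, k):
    """Translate the image k pixels to the right, padding with background."""
    return ["." * k + row[: len(row) - k] for row in image]

exact = recognize(TEMPLATES["E"])                      # same position as the template
shifted = recognize(shift_right(TEMPLATES["E"], 2))    # same letter, new position

print("exact match:  ", exact)
print("shifted match:", shifted)
```

The match score for the shifted letter falls well below the exact match, even though the shape itself has not changed.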
Feature Analysis Theory
A fixed set of elementary properties is analyzed independently and in parallel across the visual field.
Possible examples:
Line Orientations: +45 deg., -10 deg.
Different Sizes
Curvature
Free line endings
Colors
A Simple Version of Feature Theory
Perceptual Representation (input E): 3 horizontal lines, 1 vertical line, 4 right angles
Memory Representations:
E: 3 horizontal lines, 1 vertical line, 4 right angles
F: 2 horizontal lines, 1 vertical line, 3 right angles
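The feature-list comparison above can be sketched in a few lines. The feature counts for E and F are taken from the slide; the `mismatch` score and the function names are assumptions made for illustration only. Because the description is a list of feature counts and not a picture, it does not change when the letter moves or changes size.

```python
from collections import Counter

# Memory representations: feature counts as given on the slide.
MEMORY = {
    "E": Counter(horizontal=3, vertical=1, right_angle=4),
    "F": Counter(horizontal=2, vertical=1, right_angle=3),
}

def mismatch(percept, stored):
    """Total difference in feature counts (hypothetical scoring rule)."""
    keys = set(percept) | set(stored)
    return sum(abs(percept[k] - stored[k]) for k in keys)

def recognize(percept):
    """Return the stored letter whose feature list is closest to the percept."""
    return min(MEMORY, key=lambda label: mismatch(percept, MEMORY[label]))

# Perceptual representation of the input: 3 horizontal, 1 vertical, 4 right angles.
percept = Counter(horizontal=3, vertical=1, right_angle=4)
result = recognize(percept)
print(result)
```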
Evidence that Features are really basic elements of visual processing
• Physiological Evidence
– Individual neurons respond preferentially to different kinds of simple visual features
• simple cells--respond best to lines or angles of a specific orientation and retinal position
• complex cells--fire maximally to lines or angles regardless of location. They often also have a preference for stimuli moving in certain directions.
• hypercomplex cells--have even more complicated sets of requirements for maximal firing (e.g., corners, notches, etc.)
Adaptation Effects are indicators of elementary visual features...
• Your eyes are always moving, even when “fixated” (microsaccades).
• An image held completely still on the retina will slowly fade from view, because individual feature detectors become habituated (fatigued) with prolonged, sustained stimulation.
• The retinal stabilization procedure allows direct demonstration of this phenomenon.
More Evidence for Features...
In a Visual Search Task:
(1) Targets defined by a single feature are easy to detect (the red item; the square).
(2) Targets defined by a combination of features are difficult to detect (the red square).
Let’s try it out
Call out “now!” when you see the horizontal line.
Typical Results for “Feature Search”
[Figure: reaction time (msec) plotted against # of items in display (2, 4, 6, 10, 20, 30), with separate “Yes” and “No” curves]
These results suggest parallel analysis and detection of simple visual features.
Conjunction Search
When targets are defined by:
• Combination of features (e.g., red AND horizontal)
• Spatial arrangements of features (e.g., black above white)
Call out “Now!” when you find the black square above the white square:
Treisman’s Results for “Conjunction Search”
[Figure: reaction time (msec) plotted against # of items in display (2, 4, 6, 10, 20, 30), with separate “Yes” and “No” curves]
When higher order analysis or integration of multiple features is required, search is much harder, and reaction time rises with number of distractors.
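The contrast between the two search patterns can be sketched with a toy reaction-time model. This is an illustration under stated assumptions, not the analysis behind the plotted data: the base time and per-item cost are made-up values, and the serial self-terminating rule for conjunction search is one common interpretation that the slides do not state.

```python
BASE_MS = 400.0      # assumed time for everything except the search itself
PER_ITEM_MS = 50.0   # assumed cost of inspecting one item

def feature_search_rt(n_items):
    """Parallel search: predicted RT does not depend on display size."""
    return BASE_MS

def conjunction_search_rt(n_items, target_present):
    """Serial self-terminating search (an assumption): on average half the
    items are checked when the target is present, all of them when absent."""
    checked = (n_items + 1) / 2 if target_present else n_items
    return BASE_MS + PER_ITEM_MS * checked

for n in (2, 4, 6, 10, 20, 30):
    print(n,
          feature_search_rt(n),
          conjunction_search_rt(n, target_present=True),
          conjunction_search_rt(n, target_present=False))
```

Under these assumptions the feature-search line is flat, while the conjunction-search lines rise with display size, and the “No” line rises about twice as steeply as the “Yes” line.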
Detecting absence of a feature
Look for the circle missing the free line ending among circles that have one.
Detecting presence of a feature
Look for the circle with the free line ending among circles that lack one.
How do we make an object out of a pile of features?
Word Recognition: A Case Study
• Frequency Effects
– Word Frequency: Frequent words are recognized more easily
– Repetition Priming: Words seen recently are perceived more easily
• Context Effects
– Word Superiority: Individual letters are easiest to identify when they are part of a word (work vs. orwk)
– Well-formedness: Individual letters are recognized more easily as part of “word-like” stimuli than in random strings of letters (lipe vs. lpei).
w o r d
An item will appear
k
A letter will appear
Word superiority
WORD: _ _ _ _
D: _
WORD, WORK, WORM, WORE, WORN
Guess = 1/5 correct (letter in a word)
Guess = 1/26 correct (letter alone)
[Demonstration: WORD or the scrambled string RWOD is shown, then masked with XXXX; the viewer chooses between K and D]
This “forced-choice” procedure controls for the effects of guessing, and confirms the validity of the word superiority effect.
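The guessing argument can be worked through with the numbers from the earlier slide. This is a small sketch; the function name is hypothetical, and the forced-choice case assumes that both alternatives (D and K) are equally plausible in either condition, as in WORD vs. WORK.

```python
from fractions import Fraction

def guess_rate(n_alternatives):
    """Chance of a correct pure guess among equally likely alternatives."""
    return Fraction(1, n_alternatives)

# Free report: a word context narrows the options, a lone letter does not.
free_report_word = guess_rate(5)      # WORD, WORK, WORM, WORE, WORN
free_report_letter = guess_rate(26)   # any letter of the alphabet

# Forced choice between D and K: the same guessing rate in both conditions.
forced_choice_word = guess_rate(2)
forced_choice_letter = guess_rate(2)

advantage_from_guessing = free_report_word / free_report_letter
print(free_report_word, free_report_letter, advantage_from_guessing)
print(forced_choice_word == forced_choice_letter)
```

With free report, guessing alone favors the word condition by a factor of 26/5. With a forced choice the guessing rates are equal, so any remaining advantage for letters in words cannot be attributed to guessing.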
[Demonstration: WORD or the nonword ZORD is shown, then masked with XXXX; the viewer chooses between K and D]
Interactive Activation Model explains the word superiority effect
• Letters in words benefit from bottom-up and top-down activation
• But letters alone receive only bottom-up activation.
[Figure: activation flow from features to letters (D, K) to words (WORD, WORK), compared with a letter presented alone]
Spare slides
A simple “feature net” model of word recognition
•baseline activation: a detector’s activation level in the absence of direct stimulation. This is sensitive to recency and frequency of stimulation.
•activation level: how active is a particular detector at a given moment
•response threshold: how much excitatory input does a detector require before it “fires” and sends excitatory input further downstream.
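The three terms above can be tied together in a minimal sketch of a single detector. The class, the example words, and every numeric value are hypothetical and chosen only for illustration. A detector with a high baseline (a frequent or recently seen word) reaches its threshold with weaker input than a detector with a low baseline.

```python
from dataclasses import dataclass

@dataclass
class Detector:
    name: str
    baseline: float          # resting level; higher for frequent or recent items
    threshold: float = 1.0   # activation needed before the detector fires
    activation: float = 0.0  # current activation level

    def __post_init__(self):
        # With no stimulation, the detector sits at its baseline.
        self.activation = self.baseline

    def receive(self, excitation):
        """Add excitatory input; return True if the detector now fires."""
        self.activation += excitation
        return self.activation >= self.threshold

frequent = Detector("THE", baseline=0.6)   # hypothetical frequent word
rare = Detector("THY", baseline=0.1)       # hypothetical rare word

weak_input = 0.5  # the same brief, degraded stimulus for both detectors
fired = {d.name: d.receive(weak_input) for d in (frequent, rare)}
print(fired)
```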
TORQUE vs. CQRN
Interactive Activation Model (IAM) (McClelland & Rumelhart):
Previous models posited a bottom-up flow of information (from features to letters to words).
IAM also posits a top-down flow of information.
In addition, detectors at the same level interact with each other.
Another important aspect of this theory is the presence of inhibitory connections between detectors that are inconsistent with each other.
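The interplay of bottom-up, top-down, and inhibitory connections can be sketched with a toy two-layer network. This is a heavily simplified illustration, not the published model: the weights, the evidence values, the update rule, and the restriction to two words and two letters are all assumptions. It compares the activation of the final letter D when presented alone and when presented after W, O, R.

```python
def clamp(x):
    """Keep activations between 0 and 1."""
    return max(0.0, min(1.0, x))

WORDS = {"WORD": "D", "WORK": "K"}    # each word and its final letter
EVIDENCE = {"D": 0.2, "K": 0.05}      # assumed bottom-up feature evidence per step
UP, DOWN, INHIBIT = 0.1, 0.2, 0.5     # assumed connection weights

def final_letter_activation(in_word_context, steps=3):
    letters = {"D": 0.0, "K": 0.0}
    words = {w: 0.0 for w in WORDS}
    for _ in range(steps):
        new_words = {}
        for w, last in WORDS.items():
            if in_word_context:
                # Inhibition from the rival word at the same level.
                rival = sum(a for other, a in words.items() if other != w)
                # Bottom-up: three clearly seen letters (W, O, R) plus the final letter.
                new_words[w] = clamp(words[w] + UP * (3 + letters[last]) - INHIBIT * rival)
            else:
                new_words[w] = 0.0  # no word-level support for a letter alone
        new_letters = {}
        for w, last in WORDS.items():
            # Bottom-up feature evidence plus top-down feedback from the word.
            new_letters[last] = clamp(letters[last] + EVIDENCE[last] + DOWN * words[w])
        letters, words = new_letters, new_words
    return letters

alone = final_letter_activation(in_word_context=False)
in_word = final_letter_activation(in_word_context=True)
print("D alone:  ", alone["D"])
print("D in word:", in_word["D"])
```

In this sketch D ends up more active in the word context because it receives feedback from the WORD detector in addition to its own feature evidence.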
Interactive Activation Model
RAP