the ring model (1) the ring model (2)homepages.inf.ed.ac.uk/pseries/ccn09/ccn-wk4-2.pdf · the...

9
Models of networks - continued Readings: D&A, chapter 7. Network models - summary Network models: to understand the implications of connectivity in terms of computation and dynamics. 2 Main strategies: Spiking vs Firing rate models. The issue of the emergence of orientation selectivity as a model problem, extensively studied theoretically and experimentally. - Two main models: feed-forward and recurrent. - Detailed spiking models have been constructed which can be directly compared to electrophysiology - The same problem is also investigated with a firing rate model, a.k.a. the !ring model". The Ring Model (1) The Ring Model (2) τ r dv(θ ) dt = -v(θ ) + h(θ ) + π/2 -π/2 dθ π ( -λ 0 + λ 1 cos(2(θ - θ )) ) v(θ ) + (7.36) v(θ ) θ N neurons, with preferred angle, ,evenly distributed between and Neurons receive thalamic inputs h. + recurrent connections, with excitatory weights between nearby cells and inhibitory weights between cells that are further apart (mexican-hat profile) θ i -π/2 π/2 !!" " !" !#"" !$!" !$"" !!" " !" %&’()*+*’%) ! -(’./*

Upload: doantuyen

Post on 14-Jun-2019

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

Models of networks - continued

Readings: D&A, chapter 7.

Network models - summary

• Network models: to understand the implications of connectivity in

terms of computation and dynamics.

• 2 Main strategies: Spiking vs Firing rate models.

• The issue of the emergence of orientation selectivity as a model

problem, extensively studied theoretically and experimentally.

- Two main models: feed-forward and recurrent.

- Detailed spiking models have been constructed which can be directly

compared to electrophysiology

- The same problem is also investigated with a firing rate model, a.k.a.

the !ring model".

The Ring Model (1) The Ring Model (2)

7.4 Recurrent Networks 25

1.9. This value, being larger than one, would lead to an unstable networkin the linear case. While nonlinear networks can also be unstable, the re-striction to eigenvalues less than one is no longer the relevant condition.

In a nonlinear network, the Fourier analysis of the input and output re-sponses is no longer as informative as it is for a linear network. Due tothe rectification, the ! = 0,1, and 2 Fourier components are all amplified(figure 7.9D) compared to their input values (figure 7.9C). Nevertheless,except for rectification, the nonlinear recurrent network amplifies the in-put signal selectively in a similar manner as the linear network.

A Recurrent Model of Simple Cells in Primary Visual Cortex

In chapter 2, we discussed a feedforward model in which the elongatedreceptive fields of simple cells in primary visual cortex were formed bysumming the inputs from lateral geniculate (LGN) neurons with their re-ceptive fields arranged in alternating rows of ON andOFF cells. While thismodel quite successfully accounts for a number of features of simple cells,such as orientation tuning, it is difficult to reconcile with the anatomy andcircuitry of the cerebral cortex. By far the majority of the synapses ontoany cortical neuron arise from other cortical neurons, not from thalamicafferents. Therefore, feedforward models account for the response prop-erties of cortical neurons while ignoring the inputs that are numericallymost prominent. The large number of intracortical connections suggests,instead, that recurrent circuitry might play an important role in shapingthe responses of neurons in primary visual cortex.

Ben-Yishai, Bar-Or, and Sompolinsky (1995) developed a model at theother extreme, for which recurrent connections are the primary determin-ers of orientation tuning. The model is similar in structure to the modelof equations 7.35 and 7.33, except that it includes a global inhibitory inter-action. In addition, because orientation angles are defined over the rangefrom !"/2 to "/2, rather than over the full 2" range, the cosine functionsin the model have extra factors of 2 in them. The basic equation of themodel, as we implement it, is

#rdv($)

dt= !v($) +

!

h($) +" "/2

!"/2

d$"

"

#

!%0 + %1 cos(2($ ! $"))$

v($")

%

+(7.36)

where v($) is the firing rate of a neuron with preferred orientation $.

The input to the model represents the orientation-tuned feedforward in-put arising from ON-center and OFF-center LGN cells responding to anoriented image. As a function of preferred orientation, the input for animage with orientation angle& = 0 is

h($) = Ac (1! ' + ' cos(2$)) (7.37)

Draft: December 19, 2000 Theoretical Neuroscience

• N neurons, with preferred angle, ,evenly distributed

between and

• Neurons receive thalamic inputs h.

+ recurrent connections, with excitatory weights between

nearby cells and inhibitory weights between cells that are

further apart (mexican-hat profile)

!i

!!/2 !/2

!!" " !"!#""

!$!"

!$""

!!"

"

!"

%&'()*+*'%),!

-('./*

Page 2: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

The Ring Model (3)

• h is input, can be tuned (Hubel Wiesel

scenario) or very broadly tuned.

!!" " !""

"#$

"#%

"#&

"#'

"#!

()*+,-.-*(,/!

01!2

/

/

"$

"%

• The steady-state can be solved analytically.

Model analyzed like a physical system.

• Model achieves i) orientation selectivity; ii) contrast invariance of tuning, even

if input is very broad.

• The width of orientation selectivity depends on the shape of the mexican-hat,

but is independent of the width of the input.

• Symmetry breaking /Attractor dynamics.

h(!) = c[1! " + " " cos(2!)]

The Ring Model (4)

7.4 Recurrent Networks 27

firing r

ate

(H

z)

80%

40%

20%

10%

80

60

40

20

0180 200 220 240

(deg)

A B C80

60

40

20

0

v (H

z)

! (deg)

30

20

10

0

h (H

z)

-40 -20 0 20 40

! (deg)-40 -20 0 20 40

"

Figure 7.10: The effect of contrast on orientation tuning. A) The feedforward in-put as a function of preferred orientation. The four curves, from top to bottom,correspond to contrasts of 80%, 40%, 20%, and 10%. B) The output firing ratesin response to different levels of contrast as a function of orientation preference.These are also the response tuning curves of a single neuron with preferred orien-tation zero. As in A, the four curves, from top to bottom, correspond to contrastsof 80%, 40%, 20%, and 10%. The recurrent model had !0 = 7.3, !1 = 11, A = 40Hz, and " = 0.1. C) Tuning curves measure experimentally at four contrast levelsas indicated in the legend. (C adapted from Sompolinsky and Shapley, 1997; basedon data from Sclar and Freeman, 1982.)

A Recurrent Model of Complex Cells in Primary Visual Cortex

In the model of orientation tuning discussed in the previous section, recur-rent amplification enhances selectivity. If the pattern of network connec-tivity amplifies nonselective rather than selective responses, recurrent in-teractions can also decrease selectivity. Recall from chapter 2 that neuronsin the primary visual cortex are classified as simple or complex depend-ing on their sensitivity to the spatial phase of a grating stimulus. Simplecells respond maximally when the spatial positioning of the light and darkregions of a grating matches the locations of the ON and OFF regions oftheir receptive fields. Complex cells do not have distinct ON and OFF re-gions in their receptive fields and respond to gratings of the appropriateorientation and spatial frequency relatively independently of where theirlight and dark stripes fall. In other words, complex cells are insensitive tospatial phase.

Chance, Nelson, and Abbott (1999) showed that complex cell responsescould be generated from simple cell responses by a recurrent network. Asin chapter 2, we label spatial phase preferences by the angle #. The feed-forward input h(#) in the model is set equal to the rectified response ofa simple cell with preferred spatial phase # (figure 7.11A). Each neuronin the network is labeled by the spatial phase preference of its feedfor-ward input. The network neurons also receive recurrent input given bythe weight function M(# ! #") = !1/(2$%#) that is the same for all con-

Draft: December 19, 2000 Theoretical Neuroscience

Attractor Networks

•Attractor network : a network of neurons, usually recurrently connected, whose time

dynamics settle to a stable pattern.

• That pattern may be stationary (fixed points), time-varying (e.g. cyclic), or even

stochastic-looking (e.g., chaotic).

• The particular pattern a network settles to is called its !attractor".

•The ring model is called a line (or ring) attractor network. Its stable states are also

sometimes referred to as !bump attractors".

Point AttractorLine Attractor

8

• Reduction of the spiking model of Somers et al 1995 to rate model

Page 3: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

9

• Model was tested with stimuli containing more

than 1 orientation (crosses)

• Model fails to distinguish angles separated by 30

deg, overestimates larger angles

• spurious attractors with noise

matlab code available online : http://www.carandinilab.net/publications

The Ring Model (5): Sustained Activity

• If recurrent connections are strong enough, the pattern of population

activity once established can become independent of the structure of the

input. It can persists when input is removed.

• A model of working memory ?

Attention

Readings:

• Maunsell & Cook, the role of attention in visual processing, 2002.

• Itti & Koch, Computational modeling of visual attention, Nat Rev Neurosci, 2001.

M. C. Escher, Balcony, 1945

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

Context

When neurons and perception change

Page 4: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

• The process by which certain information is selected for further processing

and other information is discarded

• The assumption is that the brain does not have the capacity to fully process

all the information it receives. Attention as a filter, or bottleneck in processing.

• 2 types: spatial attention, feature-based attention.

• Spatial attention as a spotlight, highlighting locations, moving (search),

zooming in or out.

In vision, the locus of the spotlight can be the same as eye fixation (overt

attention) but it doesn"t have to (covert attention)

- limit to the metaphor : attention can be split in 2 or more locations (up to 4).

- exogeneous attention: - unvoluntary vs endogeneous : voluntary

Attention

• focus visual attention to an area

by using a cue, e.g. brief flash at

given location, or arrow

(endogeneous attention)

• measure reaction time to detect

target when :

i) observer doesn"t know where

item will appear (neutral cue)

ii) observer is cued to where item

will appear (valid cue)

iii) observer is wrongly cued

(invalid cue)

Posner’s Task (1980)

• Detection is faster for valid

targets

• Detection is slower for

invalid cues (inhibition of

return)

• effect can begin 300 ms

after cue and last for seconds

afterwards

Posner’s Task (1980)

Posner, Nissen, & Ogden (1978)

• task of detecting the presence or absence of a specified object (target) in

the middle of other objects (distractors)

• feature search: find the red cross, find the circle

Visual Search Tasks

• conjunction search : find

the orange square

which is easier?

POP-OUT

Page 5: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

• In conjunction search, the binding

problem has to be solved.

• Interpretation: 2 processes:

- a first pre-attentive stage, processes

that are fast, parallel and involuntary --

Pop out.

- a second stage, attentive, serial.

volitional deployment of attention.

• Later shown that these represents 2

extremes in a continuum of search

difficulty (Wolfe 1996)

Feature Integration theory (FIT) - A. Treisman, early 1980s

Feature Search

Conjunction Search

Re

actio

n T

ime

s

Display Size

• In Feature Search: RTs are independent of display size. idea of feature maps,

parallel search different feature maps.• How does the spotlight know where to go?

• Models postulate existence of a saliency map, which would represent

topographically the relevance of the different parts of the visual field.

• could be built based on bottom-up information (eg. how different is a

given stimulus compared to its neighborhood?) or could include task

dependent (top-down) information.

• unclear whether such a functional saliency map is implemented in a

distributed manner across different cortical and subcortical areas, or

whether saliency is implemented directly into the individual cortical

feature maps.

• There is some evidence that neurons in LIP could encode saliency

[Gottlieb et al 1998].

The idea of a saliency map

• change blindness - unless we attend to an object, we are unlikely to perceive consciously it in any detail and detect when it is altereddemos: http://www.psych.ubc.ca/~rensink/flicker/download/

• But we are not totally blind outside the focus of attention (e.g. negative priming)

• attentional blink: inability to report a target 150 to 450 msec after a first one.http://www.cs.kent.ac.uk/people/rpg/pc52/AB_Webscript/instr.html

Are we blind outside the focus of attention?

• Spatial attention improves detection rate, and reaction times,

and also discrimination performances [Lee et al 1999].

• Attention enhances apparent contrast [Carrasco et al 2004],

perceived motion coherence [Liu et al 2006], spatial frequency

[Gobell & Carrasco, 2006]

Attention changes perceived size [Treue and colleagues, 2008]

Makes moving objects appear to move faster [Turatto et al 2007] ...

• Thus, attention not only enhances perception, it also distorts our

representation of the visual scene according to the behavioral

relevance of its components.

Does attention change appearance?

Page 6: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

21

Neural basis : Questions

• Where does attention !come from" ?

• Which parts of the brain does it affect ?

• What does it change in the neural responses ?

- amplitude?

- tuning?

- baseline?

- noise?

- temporal properties?

• Can we explain the perceptual changes based on the neural

changes?

• Lots of (recent) data, but picture is not clear yet.22

Attention changes in neural responses depends on ...

• cortical area

• neurons

• task demands

1064 J. H. R. Maunsell and E. P. Cook Attention, neuronal response and behaviour

60

0 0.3 1 3

nu

mb

er o

f n

euro

ns

modulation by attention(attended/unattended)

Figure 1. Typical distribution of attentional modulation forvisual cortex. Responses were recorded from 197 neurons inarea V4 while monkeys performed a task that directed theirattention towards or away from a stimulus in the receptive!eld of the neuron being recorded. Most neurons respondedmore strongly when the animal paid attention to thestimulus in the receptive !eld (average increase 26%,indicated by arrow). However, the distribution was broad,and some neurons responded more strongly when the animaldirected its attention away from the receptive !eld. Responsechanges that were statistically signi!cant are shown in black.(Data from McAdams & Maunsell (1999a).)

of attentional modulation been shown to correlate withother properties of the neurons.

Attentional modulation also differs between visualareas. Modulation by attention is typically weakest in theearliest stages of visual cortex, and strongest in the lateststages. This increase has not been studied extensively, butit is apparent in comparing results from different reports.The best data come from individual studies that haverecorded from different cortical areas in the same animalswhile they performed a given task. Figure 2 shows theaverage attentional modulation in different cortical areasthat were examined in this way in our laboratory. Averageattentional modulation is plotted as a function of the levelof cortical processing, as de!ned by the hierarchy of Felle-man & Van Essen (1991). In each study, areas at laterstages had stronger average attentional modulation. Theseresults were based on extracellular recording of individualaction potentials, but stronger modulation at later stagesof visual cortex has also been seen by using current sourcedensity measurements (Mehta et al. 2000).

It is not known why attentional modulations arestronger in later cortical levels. While it is tempting toimagine that neuronal activity related to attentionaccumulates in successive levels of processing, this seemsunlikely given that sensory responses arriving from differ-ent sources do not appear to accumulate in this way. Simi-larly, the notion that neuronal signals related to attentionare inserted at the latest stages of processing and diminishas they are fed back to earlier stages seems unlikely. Itseems probable that the degree of modulation is optimizedfor each cortical level. There is little reason to believe thatthe cerebral cortex would be unable to prevent undesirableaccumulation or diminution of signals that altered sen-sory responses.

In addition to the range of attentional modulation seenwithin and between visual areas, individual neurons canshow different degrees of attentional modulationdepending on task demands. Task demand can be affectedeither by the number of relevant items or by the com-

Phil. Trans. R. Soc. Lond. B (2002)

100

50

0

aver

age

resp

on

seen

han

cem

ent

by

atte

nti

on (

%)

V1 MTV4

VIPMST

7a

cortical hierarchy levels

Figure 2. Average attentional modulation in visual corticalareas. The average response enhancement reported fromstudies that measured the effects of attention in two or morecortical areas in the same subjects while they performed agiven task are shown. Positions on the x-axis are assignedaccording to the hierarchical levels de!ned by Felleman &Van Essen (1991). More attention modulation is found inlater stages of cortical processing (squares, McAdams &Maunsell (1999a); crosses, Treue & Maunsell (1999);circles, Ferrera et al. (1994); triangles, Cook & Maunsell(2002)). (Based on Cook & Maunsell (2002).)

plexity of the processing to be performed on the relevantitems (Lavie & Tsal 1994; Urbach & Spitzer 1995; Sade &Spitzer 1998). Mountcastle et al. (1981, 1987) showedthat most neurons in area 7a and neighbouring regionswere far more responsive when an animal was engagedin a visual task than when the same retinal stimuli werepresented during periods of alert wakefulness when no!xation was required. Neuronal responses in inferotempo-ral cortex have also been found to be progressivelystronger as a !xating animal goes from a situation wherethe stimulus is irrelevant, to monitoring the stimulus todetect its dimming, to discriminating the shape or textureof the stimulus (Spitzer & Richmond 1991). Similarly, theresponses of V4 neurons are stronger when an animal per-forms a more dif!cult version of an orientation changedetection task (Spitzer et al. 1988). The effects of taskdemand may be related to arousal or vigilance, but theycan affect spatial attention. In some circumstances, modu-lation by spatial attention can be more than twice as strongduring a dif!cult task (Boudreau & Maunsell 2001).

The attentional modulation of the responses of individ-ual neurons can vary not only between tasks but alsowithin trials. Motter (1994) provided a clear example ofthe dynamics of attention. He trained monkeys to do atask in which an instruction redirected their spatial atten-tion in the middle of some trials. Neurons in V4 changedtheir responses within a few hundred milliseconds of thenew instruction. Other studies have shown that attentionalmodulation can vary systematically during the course oftask trials (e.g. McAdams & Maunsell 1999a; Reynoldset al. 2000). These observations indicate that attentionalmodulation of individual neurons is dynamic, and variesover a time-course of no more than a few hundred millise-conds.

In summary, attentional modulation of neuronal signals

1064 J. H. R. Maunsell and E. P. Cook Attention, neuronal response and behaviour

60

0 0.3 1 3

nu

mb

er o

f n

euro

ns

modulation by attention(attended/unattended)

Figure 1. Typical distribution of attentional modulation forvisual cortex. Responses were recorded from 197 neurons inarea V4 while monkeys performed a task that directed theirattention towards or away from a stimulus in the receptive!eld of the neuron being recorded. Most neurons respondedmore strongly when the animal paid attention to thestimulus in the receptive !eld (average increase 26%,indicated by arrow). However, the distribution was broad,and some neurons responded more strongly when the animaldirected its attention away from the receptive !eld. Responsechanges that were statistically signi!cant are shown in black.(Data from McAdams & Maunsell (1999a).)

of attentional modulation been shown to correlate withother properties of the neurons.

Attentional modulation also differs between visualareas. Modulation by attention is typically weakest in theearliest stages of visual cortex, and strongest in the lateststages. This increase has not been studied extensively, butit is apparent in comparing results from different reports.The best data come from individual studies that haverecorded from different cortical areas in the same animalswhile they performed a given task. Figure 2 shows theaverage attentional modulation in different cortical areasthat were examined in this way in our laboratory. Averageattentional modulation is plotted as a function of the levelof cortical processing, as de!ned by the hierarchy of Felle-man & Van Essen (1991). In each study, areas at laterstages had stronger average attentional modulation. Theseresults were based on extracellular recording of individualaction potentials, but stronger modulation at later stagesof visual cortex has also been seen by using current sourcedensity measurements (Mehta et al. 2000).

It is not known why attentional modulations arestronger in later cortical levels. While it is tempting toimagine that neuronal activity related to attentionaccumulates in successive levels of processing, this seemsunlikely given that sensory responses arriving from differ-ent sources do not appear to accumulate in this way. Simi-larly, the notion that neuronal signals related to attentionare inserted at the latest stages of processing and diminishas they are fed back to earlier stages seems unlikely. Itseems probable that the degree of modulation is optimizedfor each cortical level. There is little reason to believe thatthe cerebral cortex would be unable to prevent undesirableaccumulation or diminution of signals that altered sen-sory responses.

In addition to the range of attentional modulation seenwithin and between visual areas, individual neurons canshow different degrees of attentional modulationdepending on task demands. Task demand can be affectedeither by the number of relevant items or by the com-

Phil. Trans. R. Soc. Lond. B (2002)

100

50

0

aver

age

resp

on

seen

han

cem

ent

by

atte

nti

on (

%)

V1 MTV4

VIPMST

7a

cortical hierarchy levels

Figure 2. Average attentional modulation in visual corticalareas. The average response enhancement reported fromstudies that measured the effects of attention in two or morecortical areas in the same subjects while they performed agiven task are shown. Positions on the x-axis are assignedaccording to the hierarchical levels de!ned by Felleman &Van Essen (1991). More attention modulation is found inlater stages of cortical processing (squares, McAdams &Maunsell (1999a); crosses, Treue & Maunsell (1999);circles, Ferrera et al. (1994); triangles, Cook & Maunsell(2002)). (Based on Cook & Maunsell (2002).)

plexity of the processing to be performed on the relevantitems (Lavie & Tsal 1994; Urbach & Spitzer 1995; Sade &Spitzer 1998). Mountcastle et al. (1981, 1987) showedthat most neurons in area 7a and neighbouring regionswere far more responsive when an animal was engagedin a visual task than when the same retinal stimuli werepresented during periods of alert wakefulness when no!xation was required. Neuronal responses in inferotempo-ral cortex have also been found to be progressivelystronger as a !xating animal goes from a situation wherethe stimulus is irrelevant, to monitoring the stimulus todetect its dimming, to discriminating the shape or textureof the stimulus (Spitzer & Richmond 1991). Similarly, theresponses of V4 neurons are stronger when an animal per-forms a more dif!cult version of an orientation changedetection task (Spitzer et al. 1988). The effects of taskdemand may be related to arousal or vigilance, but theycan affect spatial attention. In some circumstances, modu-lation by spatial attention can be more than twice as strongduring a dif!cult task (Boudreau & Maunsell 2001).

The attentional modulation of the responses of individ-ual neurons can vary not only between tasks but alsowithin trials. Motter (1994) provided a clear example ofthe dynamics of attention. He trained monkeys to do atask in which an instruction redirected their spatial atten-tion in the middle of some trials. Neurons in V4 changedtheir responses within a few hundred milliseconds of thenew instruction. Other studies have shown that attentionalmodulation can vary systematically during the course oftask trials (e.g. McAdams & Maunsell 1999a; Reynoldset al. 2000). These observations indicate that attentionalmodulation of individual neurons is dynamic, and variesover a time-course of no more than a few hundred millise-conds.

In summary, attentional modulation of neuronal signals23

13 May 2004 14:13 AR AR217-NE27-22.tex AR217-NE27-22.sgm LaTeX2e(2002/01/18) P1: IKH

ATTENTION EFFECTS ON VISUAL PROCESSING 613

attended location versus an unattended location (Carrasco et al. 2000, Lu et al. 2000,Lu & Dosher 1998), and there is recent evidence that attention increases perceivedstimulus contrast (Carrasco et al. 2004). This signal enhancement is reflected ingreater stimulus-evoked neuronal activity, as measured by scalp potentials (e.g.,Luck et al. 1994, reviewed by Hillyard & Anllo-Vento 1998), and brain imaging(e.g., Brefczynski & DeYoe 1999, Heinze et al. 1994; see Pessoa et al. 2003, Yantis& Serences 2003 for recent reviews).

Consistent with these observations, single-unit recording studies in monkeystrained to perform attention-demanding tasks have found that spatial attentionoften enhances neuronal responses evoked by a single stimulus appearing withinthe receptive field, an effect observed in neurons throughout the visual system(Ito & Gilbert 1999, McAdams & Maunsell 1999a, Motter 1993, Mountcastleet al. 1987, Roelfsema & Spekreijse 2001, Spitzer et al. 1988, Treue & Maunsell1996). An example of this attention-dependent response facilitation is illustrated inFigure 1, which shows data recorded by Reynolds et al. (2000). The dashed line in

Figure 1 Responses of an example area V4 neuron as a function of attention andstimulus contrast. The contrast of the stimulus in the receptive field varied from 5%(bottom panel) to 10% (middle panel) to 80% (upper panel). On any given trial, atten-tion was directed to either the location of the stimulus inside the receptive field (solidline) or a location far away from the receptive field (dotted line). The animal’s taskwas to detect a target grating at the attended location. Attention reduced the thresholdlevel of contrast required to elicit a response without causing a measurable change inresponse at saturation contrast (80%). Adapted from Reynolds et al. (2000).

spatial attention enhances neuronal

responses and decreases threshold

(Ito & Gilbert 1999, McAdams &

Maunsell 1999a, Motter 1993,

Mountcastle et al. 1987, Roelfsema &

Spekreijse 2001, Spitzer et al. 1988,

Treue & Maunsell 1996).

Attention decreases contrast response threshold

24

Attention decreases contrast response threshold

• It is often proposed that Attention acts like an increase in stimulus

contrast [Reynold and Chelazzi, 2001]

• Unlike increases in contrast, attention doesn"t reduce response

latency in V4 [Lee et al 2007]

Page 7: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

• Multiple Stimuli in the visual field

activate population of neurons that

engage in competition

• Moran and Desimone (1985) - V4 -

2 stimuli in receptive field. After

some time delay, the pair response is

driven almost entirely by attended

stimulus.

• ~ change in tuning, shrink

response around attended feature

Attention as Biased Competition

13 May 2004 14:13 AR AR217-NE27-22.tex AR217-NE27-22.sgm LaTeX2e(2002/01/18) P1: IKH

616 REYNOLDS ! CHELAZZI

[Chelazzi et al 2001], V4, average of 76 neurons

13 May 2004 14:13 AR AR217-NE27-22.tex AR217-NE27-22.sgm LaTeX2e(2002/01/18) P1: IKH

630 REYNOLDS ! CHELAZZI

0.5

030-30 0

Nor

mal

ized

res

pons

e

Relative orientation (degrees)60-90

1.0

90-60

Figure 7 Attention increases tuning curves by a gain factor. Average normalizedorientation tuning curves computed across a population of area V4 neurons while themonkey attended either to the location of a grating stimulus inside the receptive field( filled squares) or to a location in the opposite hemifield (empty circles). The uppercurve is approximately a multiplicative version of the lower curve. Adapted fromMcAdams & Maunsell (1999a).

for the two stimuli. As mentioned earlier, this pattern has been observed in sev-eral single-unit recording studies of attention (Chelazzi et al. 1998, 2001; Lucket al. 1997; Martınez-Trujillo & Treue 2002; Reynolds et al. 1999; Reynolds &Desimone 2003; Treue & Martınez-Trujillo 1999; Treue & Maunsell 1996).Figure 8, adapted from a study conducted by Treue & Martınez-Trujillo, illus-trates attention-dependent increases and decreases in firing rate. Two patterns ofdots appeared within the classical receptive field of an MT neuron. One of them(“pattern A” in Figure 8A) moved in the null direction for the neuron, and the other(“pattern B”) moved in one of twelve directions of motion, selected at random oneach trial. The monkey either attended to the fixation point to detect a change inits luminance or, on separate trials, attended to one of the two patterns of dotsto detect a change in the direction or speed of its motion. The neuronal responsewhen attention was directed outside the receptive field is indicated by the middlecurve in Figure 8B, which shows responses averaged over 56 neurons (sensoryresponse), aligned on the each neuron’s preferred direction of motion. Becausepattern A moved in each neuron’s null direction, pattern B was the more preferredstimulus over a range of contrasts. Over this range, attending to pattern B elevatedthe response, with the magnitude of this increase growing in proportion to theneuron’s selectivity for the two stimuli. When attention was directed to the nullstimulus, the response was reduced, again, with changes growing in proportion toselectivity.

A final prediction that follows from the idea that attention is equivalent toan increase in contrast is illustrated in Figure 5D. As noted above, increasing

Spatial Attention as Gain Modulation

sponses of the animal, and recorded the behavioral and neuronal data.Stimuli were presented on a color video monitor, positioned 70 cm fromthe animal.

The animals performed a delayed match-to-sample task (Fig. 1). Thetrial began when the animal looked at the fixation point and depressed alever. Both animals were required to keep their gaze within 0.7° of thecenter of the fixation point throughout the trial. After 500 msec, samplestimuli appeared at two different locations. Only one location was be-haviorally relevant on a given trial. After a sample presentation of 500msec, the stimuli were removed for a 500 msec delay period. Then twotest stimuli appeared, at which point the animal had to indicate whetherthe test stimulus at the relevant location matched the sample stimuluspreviously presented at that location. If the test stimulus matched, theanimal had to release a lever within 500 msec of its onset to receive areward. If the test stimulus did not match the previously presentedsample, the animal had to keep the lever depressed for 750–1000 msec,after which time he received a reward.

The animal was trained to attend only to stimuli in one location on atrial. The relevant location was cued using instruction trials in whichstimuli were only presented in one location. After the animal performedtwo instruction trials correctly the stimuli at the second location werereintroduced. The stimuli at one location were Gabors. The Gabors wereconstructed by multiplying a sinusoidal grating and a two-dimensionalGaussian. The contrast of these stimuli was modulated sinusoidally intime at 4 Hz, although the mean luminance of the stimulus averaged overspace and time was the same as the background. The animal had toreport whether the sample and test Gabor orientations matched wheninstructed to attend to that location. The stimuli at the other locationwere isoluminant colored patches whose saturation varied with a two-dimensional Gaussian profile. The animal had to report whether thesample and test colors matched when instructed to attend to that loca-tion. Matches and nonmatches at the two locations were uncorrelated sothat the animal could get no advantage from attending to the wronglocation. Animals were instructed to shift their attention from one

location to the other after correctly completing 24 trials using onelocation. The first animal, A, had no problems remembering whichlocation he was supposed to attend to, and he only received two instruc-tion trials for each shift in location. The second animal, B, was moreeasily distracted and occasionally extra instruction trials were requiredwithin a block. A single instruction trial was given to him whenever hemissed or ignored three trials in a row.

By presenting the same visual stimuli to the animal when he wasperforming the color matching task and when he was performing theorientation matching task, differences in neuronal responses occurringbetween the two types of trials could be attributed to differences inbehavioral state between the tasks. The Gabors were always placed insidethe receptive field of the neuron being recorded. The Gaussian patcheswere placed outside the receptive field of the neuron, diametricallyopposed at the same eccentricity. Although the animal was attending tosomething in both task modes, we define an attention difference withrespect to whether the neuronal signals we recorded were relevant orirrelevant to the current task. Thus, when the animal performed theorientation matching task, the neuron from which we recorded wasresponding to the relevant stimulus, so we refer to this mode as the“attended” mode. When the animal performed the color matching task,the neuron was still responding to the Gabor, but because this stimuluswas now irrelevant to the animal’s task, we refer to this mode as the“unattended” mode.

Attention effects in area V4 have been attributed to both spatialattention (Moran and Desimone, 1985; Motter, 1993; Connor et al., 1996,1997) and to feature-directed attention (Maunsell et al., 1991; Motter,1994a,b). The attentional modulation that we measured with this taskdesign could have been caused by either of these forms of attentionbecause the attended and unattended stimuli differed in both locationand relevant dimension (orientation or color). We chose this design toincrease the chances of encountering attentional modulations in area V4.

Neuronal recording and data collection. In both animals data werecollected from V4, with additional recordings made in V1 for compari-son. After the animals were trained, a recording chamber (20 mmdiameter) was implanted on intact skull overlying the operculum of V1.When recordings were completed, this chamber was removed, and a newchamber was positioned over V4. Animal A received a second V4chamber, so that data were collected from three hemispheres in twoanimals. Recordings were usually made daily during a 3–5 week session.At the start of each session, a 5– 8 mm craniotomy was made inside thechamber leaving the dura mater intact. Two or three craniotomies weremade in each chamber. Each day a hydraulic microdrive was mounted onthe recording chamber, which was then filled with sterile mineral oil andsealed. Transdural recordings were made using Pt/Ir recording elec-trodes of 1–2 M! at 1 kHz (Wolbarsht et al., 1960). A small fraction ofthe data (30 of 262 cells) was recorded from the parts of V4 in thesuperior temporal sulcus using guide tube recordings with similar elec-trodes. Signals from the microelectrode were amplified, filtered, andmonitored on an oscilloscope and audio monitor using conventionalequipment. Recordings were made only from cortex within 3 mm of thesurface using the transdural electrodes and up to "6 mm from thesurface using guide tubes in one chamber.

The animal performed the match-to-sample task while we searched forresponses. Units were isolated on the basis of waveform, with the re-quirement that the peak of the action potential be at least three times thebackground noise. When a unit was isolated, its receptive field wasmapped with a bar moved by hand while the animal fixated a small spotof light. The receptive fields of the neurons were between 1 and 5°eccentric. The Gabor stimuli were then adjusted in spatial frequency,color, and size to yield the best response using the match-to-sample task,as judged by listening to the audio monitor. The spatial frequencies usedranged from 1 to 5 cycles/°. The size of the stimuli were taken as the SDof the Gabor and ranged from 0.6 to 2.2°. The spatial frequency and SDwere varied independently; the range of the ratios of spatial frequency toSD was 0.8–8.3 cycles/degree 2 with a median of 2.2 cycles/degree 2.Colors for the Gabors were selected from five options (black/white,blue/yellow, red/green, cyan/red, and magenta/green). Perhaps one infive neurons did not have any obvious orientation tuning or could not bedriven well and was not examined further. For "15% of the neuronsrecorded, the animal would not work using the best stimulus we couldfind for that neuron. These stimuli were typically high spatial frequenciesor small for their eccentricity. For those neurons, a suboptimal stimuluswas used, provided that the neuron remained selective for orientation.

Figure 1. Schematic representation of the delayed match-to-sample task.Each f rame represents the display at a different point in a trial, with thefixation spot in the center and the receptive field of the neuron indicatedby a dashed oval. The fixation, sample, and delay periods were each 500msec. The test period could last 1000 msec, but ended when the animalreleased the lever. The monkey was required to bring his gaze to thefixation spot and depress a lever to begin the trial. A Gabor and a coloredGaussian were presented in the sample period. The monkey attended toonly one of these stimuli in each trial, based on previous instruction trialsin which only one stimulus appeared. In the attended mode, the monkeywas required to pay attention to the orientation of the stimulus in thereceptive field. In the other mode, the monkey was required to payattention to the color of the stimulus outside the receptive field. Bothstimuli were removed during the delay period. In the test period, theanimal had to report whether the test stimulus at the attended locationmatched the sample stimulus previously presented there. In the caseillustrated, if the animal had been instructed to pay attention to theoriented stimuli, the animal would be required to keep the lever depressedto receive a juice reward because the orientations do not match. Con-versely, if the animal had been instructed to pay attention to the coloredstimuli, the animal would be required to release the lever within 500 msecof the test stimulus onset to receive juice because the colors match.

432 J. Neurosci., January 1, 1999, 19(1):431–441 McAdams and Maunsell • Effects of Attention on Orientation Tuning

monkey performs

either color task or

orientation task

(match to sample)

A change in gain, not in selectivity (sharpness)

Feature-Based Attention as Gain Modulation

Current Biology748

Figure 4. Experiment 2. Example Neuron

(A) The average firing rate of an MT neuron (ordinate) in the attend-same (open circles and dashed line) and attend-fixation (opensquares and solid line) conditions was plotted as a function of thedirection of the pattern inside the RF (abscissa). The error barsrepresent standard errors. The light-gray shaded area between thetwo curves represents the region of response enhancement, and thedark-gray shaded area represents the area of response inhibition.(B) Modulation ratios between the responses in the two conditionsshown in (A) (open circles). The abscissa represents the angulardistance between the direction of the RDPs and the cells’ preferreddirection, and the ordinate represents the modulation ratio. Notethat the 12 directions tested were collapsed into seven data pointsbecause directions that were angled clockwise and counterclock-

Figure 3. Experiment 1. Population Data wise by the same angular distance from the preferred direction have(A) Average modulation ratios between the different conditions been pooled and averaged. The line represents the best linear modelfor the responses to the preferred and anti-preferred directions for fitted to the data (intercept: 1.45 ! 0.18, slope: "0.0054 ! 0.0016).the sample of 135 MT neurons. The circle represents the ratio for The linear correlation coefficient (r # "0.95) is indicated.the responses to the preferred, the squares for the responses to theanti-preferred direction. The abscissa represents the attentional condi-tion and the ordinate the modulation ratios. The error bars represent attentional modulation of the neutral responses is athe 95% confidence intervals for the mean. The symbols are the monotonic function of the similarity between the at-same as in Figure 2. The ratios attend-same/attend-opposite corre- tended direction and the cell’s preferred direction.sponding to the responses to the preferred (open circle) and anti-

The same analysis was repeated for every neuron.preferred (filled square) are significantly different from each otherFor each of the seven angular difference values, we(p$0.001, Wilcoxon test). The ratios attend-preferred/attend-anti-averaged the modulation ratios across neurons to obtainpreferred for the responses to the preferred (open circle) and anti-

preferred (open square) directions do not differ significantly (p%0.9, an average ratio for every direction. Figure 5 shows that,Wilcoxon test). just like for the individual cell in Figure 4B, the slope of(B) Average normalized responses to the anti-preferred and pre- the linear fit through the data points is significantly lowerferred direction across the population of neurons, with sensory (at-

than zero (p $ 0.05, t test). The correlation coefficienttend-fixation) responses to the anti-preferred direction (gray arrow("0.93) indicates a highly significant negative correla-on the ordinate) being higher than the baseline response (dashedtion between the response modulation and the angularline), plotted as in Figure 2C.

(C) Average modulation for the data in (B) plotted as in (A). See text distance. The maximum response increase occurred atfor details. 0& (significant mean increase of 7% for attention to the

preferred direction, p $ 0.05, paired t test), no modula-tion occurred around 90&, and the maximum decrease

Current Biology746

Figure 1. Experimental Design

(A) Experiment 1: the panels in the two col-umns represent the two attentional condi-tions (attend-same and attend-opposite) forthe two motion directions (preferred and anti-preferred) of the RDP shown inside the RF.The arrows indicate the directions of the dots.RF: receptive field, FC: fixation cross.(B) Experiment 2: the top panels representthe conditions in which the monkey attendedto the same motion direction presented insidethe RF (attend-same). The bottom panels rep-resent the condition in which the animals ig-nored both stimuli and attended to a colorsquare on the fixation point (attend-fixation).The allocation of attention and the differentcomparisons are indicated. The black dotsindicate that similar conditions existed for di-rections in between.

tions inside the RF, were calculated and then averaged rate. If this is the case, the feature-matching hypothesismight be able to account for the reduced response whenacross units so that average modulation ratios were

obtained for the entire sample. Figure 3A shows the two anti-preferred-stimuli were present in our experi-ments by suggesting that this reflects an increased inhi-ratios, which exhibit the same pattern as the example

cell in Figure 2. Both attentional hypotheses predict the bition. Note that the modulation observed in the examplecell in Figure 2 cannot be accounted for in this wayratio computed at the left of the panel to be more than

1 (open circle, p ! 0.05, paired t test; average response because that cell shows an enhanced response even tothe anti-preferred direction presented during the attend-ratio: 1.12). The pattern of the two ratios on the right is

consistent only with the feature-similarity hypothesis; fixation condition (gray arrow on the ordinate of all thepanels in Figure 2). Nevertheless, the population datait indicates an enhancement only for attention to the

preferred direction (open square, p ! 0.05, paired t test; in Figure 3A might reflect the contribution of the propor-tion of cells that were suppressed by the anti-preferredaverage response ratio: 1.13), not for attention to the

same direction (filled square, p ! 0.05 paired t test; direction in the receptive field in our sample.We therefore repeated the analysis shown in Figuresaverage response ratio: 0.87).

Moreover, the modulation ratios between the attend- 2C and 3A in a subset of 13 of our neurons whosesensory responses to the anti-preferred direction insidepreferred and the attend-anti-preferred conditions for

both the preferred and the anti-preferred directions were their RFs (gray arrow in the ordinate of Figure 3B) werehigher than their baseline responses (dashed line in Fig-not different from each other (1.12 versus 1.13, respec-

tively; p " 0.7, paired t test). This represents a similar ure 3B), i.e., by removing the suppressed cells. The aver-age responses in the attend-same (circles) and attend-multiplicative modulation of responses to both dis-

tractor directions when attention switches between anti- opposite (squares) conditions are illustrated in panel 3B.The responses for each neuron were normalized to thepreferred and preferred target directions.

The modulation ratios plotted in Figure 3A are aver- response to the preferred direction in the attend-samecondition and then averaged across neurons. Matchingages across cells, including those (such as the example

in Figure 2) that show an enhanced response even to the result depicted in Figure 2C, responses to the anti-preferred direction were lower in the “attend-same” con-the anti-preferred-direction as well as those that are

suppressed by the appearance of the anti-preferred- dition relative to the “attend-opposite” condition (filledsquare), whereas responses to the preferred directiondirection. It is conceivable that, rather than modulating

a cortical neuron’s overall response (which can only be showed the reversed pattern (open circle). In agreementwith the predictions of the feature-similarity gain model,positive), attentional modulation influences those re-

sponse components (inhibitory or excitatory) that cause the respective ratios, plotted in Figure 3C, were essen-tially the same as those for the complete data set (Figurethe cell’s response to change from its baseline firing

Martinez Trujillo & Treue, 1999

The change in gain depends on

similarity between preferred direction

and attended direction.

Not a change in Noise

Neuron768

Figure 4. Population Response Variance Func-tions

These response variance functions were con-structed by fitting power functions to the re-sponse variance data from all of the V4 neu-rons. The two functions are not significantlydifferent (power: attended, 1.11; unattended,1.12; coefficient: attended, 1.22; unattended,1.26).

in the attended mode tuning curve results in a corre- Figure 6 plots the best discriminability in the attendedcondition against the best discriminability in the unat-spondingly greater slope.tended condition for all V4 neurons recorded. If attentionThe variability of the responses also affects the abilityhad no systemic effect on the ability of neurons to signalof the neuron to signal a particular orientation and there-orientation changes, the points would be symmetricallyfore to signal differences between orientations. Becausedistributed about the diagonal. More points fall belowvariance is typically proportional to mean response, thethe diagonal, showing that attention improves the abilitystandard deviation (SD), which is the square root ofof these neurons to discriminate orientations (Wilcoxonvariance, increases approximately as the square root ofsigned rank test, p ! 0.001), albeit modestly. The medianthe response. In Figure 5C, the relationship between SDdiscriminability value for the attended mode is 20.4",and the response is shown, calculated as the squareand the median discriminability value for the unattendedroot of the best-fitting response variance function. Thismode is 26.5".neuron, like most neurons recorded, did not show any

We also used receiver operating characteristic (ROC)significant difference in the fitted response varianceanalysis to assess the effects of attention on orientationfunction for the attended mode compared to the unat-discriminability (see Experimental Procedures). ROCtended mode. The data have therefore been fit with aanalysis is a direct comparison of the probability thatsingle curve. The logarithmic increase of SD (noise) withtwo response distributions overlap. We used this analy-mean response (signal) causes the neuron’s signal-to-sis to confirm the results of the previous analysis, whichnoise ratio to improve at higher firing rates. This resultdepended on modeling both the tuning functions andhas been shown previously in experiments in anesthe-the response variance functions. Although the data fromtized animals using different stimulus parameters to alterspecific cells was slightly different depending on theresponse rate (Tolhurst et al., 1983; Snowden et al.,analysis method, the overall population data was very1992). To relate the effects of mean response and itssimilar. For the population, the median ROC discrimina-SD to orientation tuning curves, the SDs of the responsebility values were 14.5" in the attended mode and 19.6"to different orientations are plotted in Figure 5D for thein the unattended mode (Wilcoxon signed rank test, p !attended and unattended conditions. The curves in this0.001). Thus, the two analysis techniques producedplot are derived from the two fitted curves in Figure 5Avery similar results with a median 6" improvement inand the single function in Figure 5C. orientation discriminability using the first method and a

The ability of a neuron to discriminate changes around median 5" improvement using ROC analysis.a given orientation can be estimated by a signal-to-noise ratio that relates the slope of the orientation tuning

Population Summarycurve (“signal” about orientation difference, Figure 5B) In the previous section, we reported that attentionto the SD of responses (“noise,” Figure 5D). This ratio, causes small improvements in the orientation discrimi-a d# measure (see Experimental Procedures), is plotted nability of single cells. However, visual information canas a function of orientation in Figure 5E. This function be integrated by pooling the responses from many cells.corresponds to the difference in orientation needed to We assessed the potential effects of pooling by examin-make this neuron produce responses that differed by ing how the number of neurons whose discriminabilityone SD of the response. This value varies considerably performance reached a particular criterion varied withwith orientation. Smaller d# values indicate a better abil- attention.ity to discriminate. As noted by Scobey and Gabor In Figure 7, a cumulative plot of the proportion of cells(1989), this function has a long, broad minimum that whose best orientation discriminability was at or belowcovers much of the flank of the tuning function. It rises the particular orientation discriminability values on thesharply where the slope of the tuning function ap- x is are shown for the attended and unattended modes.proaches zero. We defined the best discriminability of The two curves diverge as the proportion of cells in-the neuron for each task mode as the minimum of this creases. This suggests that the effects of attention onfunction. For the responses shown in Figure 5E, the best orientation discriminability are independent of the selec-discriminability was 10" in the attended mode and 14" tivity of the cell for orientation, consistent with our previ-

ous report on the tuning of these cells (McAdams andin the unattended mode.

[McAdams and

Maunsell 1999b]

Neuron768

Figure 4. Population Response Variance Func-tions

These response variance functions were con-structed by fitting power functions to the re-sponse variance data from all of the V4 neu-rons. The two functions are not significantlydifferent (power: attended, 1.11; unattended,1.12; coefficient: attended, 1.22; unattended,1.26).

in the attended mode tuning curve results in a corre- Figure 6 plots the best discriminability in the attendedcondition against the best discriminability in the unat-spondingly greater slope.tended condition for all V4 neurons recorded. If attentionThe variability of the responses also affects the abilityhad no systemic effect on the ability of neurons to signalof the neuron to signal a particular orientation and there-orientation changes, the points would be symmetricallyfore to signal differences between orientations. Becausedistributed about the diagonal. More points fall belowvariance is typically proportional to mean response, thethe diagonal, showing that attention improves the abilitystandard deviation (SD), which is the square root ofof these neurons to discriminate orientations (Wilcoxonvariance, increases approximately as the square root ofsigned rank test, p ! 0.001), albeit modestly. The medianthe response. In Figure 5C, the relationship between SDdiscriminability value for the attended mode is 20.4",and the response is shown, calculated as the squareand the median discriminability value for the unattendedroot of the best-fitting response variance function. Thismode is 26.5".neuron, like most neurons recorded, did not show any

We also used receiver operating characteristic (ROC)significant difference in the fitted response varianceanalysis to assess the effects of attention on orientationfunction for the attended mode compared to the unat-discriminability (see Experimental Procedures). ROCtended mode. The data have therefore been fit with aanalysis is a direct comparison of the probability thatsingle curve. The logarithmic increase of SD (noise) withtwo response distributions overlap. We used this analy-mean response (signal) causes the neuron’s signal-to-sis to confirm the results of the previous analysis, whichnoise ratio to improve at higher firing rates. This resultdepended on modeling both the tuning functions andhas been shown previously in experiments in anesthe-the response variance functions. Although the data fromtized animals using different stimulus parameters to alterspecific cells was slightly different depending on theresponse rate (Tolhurst et al., 1983; Snowden et al.,analysis method, the overall population data was very1992). To relate the effects of mean response and itssimilar. For the population, the median ROC discrimina-SD to orientation tuning curves, the SDs of the responsebility values were 14.5" in the attended mode and 19.6"to different orientations are plotted in Figure 5D for thein the unattended mode (Wilcoxon signed rank test, p !attended and unattended conditions. The curves in this0.001). Thus, the two analysis techniques producedplot are derived from the two fitted curves in Figure 5Avery similar results with a median 6" improvement inand the single function in Figure 5C. orientation discriminability using the first method and a

The ability of a neuron to discriminate changes around median 5" improvement using ROC analysis.a given orientation can be estimated by a signal-to-noise ratio that relates the slope of the orientation tuning

Population Summarycurve (“signal” about orientation difference, Figure 5B) In the previous section, we reported that attentionto the SD of responses (“noise,” Figure 5D). This ratio, causes small improvements in the orientation discrimi-a d# measure (see Experimental Procedures), is plotted nability of single cells. However, visual information canas a function of orientation in Figure 5E. This function be integrated by pooling the responses from many cells.corresponds to the difference in orientation needed to We assessed the potential effects of pooling by examin-make this neuron produce responses that differed by ing how the number of neurons whose discriminabilityone SD of the response. This value varies considerably performance reached a particular criterion varied withwith orientation. Smaller d# values indicate a better abil- attention.ity to discriminate. As noted by Scobey and Gabor In Figure 7, a cumulative plot of the proportion of cells(1989), this function has a long, broad minimum that whose best orientation discriminability was at or belowcovers much of the flank of the tuning function. It rises the particular orientation discriminability values on thesharply where the slope of the tuning function ap- x is are shown for the attended and unattended modes.proaches zero. We defined the best discriminability of The two curves diverge as the proportion of cells in-the neuron for each task mode as the minimum of this creases. This suggests that the effects of attention onfunction. For the responses shown in Figure 5E, the best orientation discriminability are independent of the selec-discriminability was 10" in the attended mode and 14" tivity of the cell for orientation, consistent with our previ-

ous report on the tuning of these cells (McAdams andin the unattended mode.

Page 8: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

What about the baseline?

Figure 4 Sensory suppression and attentional modulation in human visual

cortex. (A) Brain areas activated by the complex images compared with blank

presentations. Coronal brain slices of a single subject at a distance of 25 mm

(left) and 40 mm (right) from the posterior pole. The complex images activat-

ed the upper visual field representations of areas V1, V2, VP, V4, and TEO of

the left hemisphere. R, right hemisphere. (B) Sensory suppression in V1 and

V4. As shown by the time series of functional magnetic resonance imaging

signals, simultaneously presented stimuli (SIM) evoked less activity than

sequentially presented stimuli (SEQ) in V4 but not in V1. This finding sug-

gests that sensory suppressive interactions were scaled to the receptive field

size of neurons in visual cortex. Presentation blocks were 18 s. (C) Attentional

modulation of sensory suppression. The sensory suppression effect in V4 was

replicated in the unattended condition of this experiment, when the subjects’

attention was directed away from the stimulus display (unshaded). Spatially

directed attention (blue) increased responses to simultaneously presented stim-

uli to a larger degree than to sequentially presented ones in V4. Presentation

blocks were 15 s. (From Kastner et al 1998a.)

C-2 KASTNER ! UNGERLEIDER

• Some studies have reported an

increase in baseline.

• V2 and V4, 30-40% increase in

spontaneous activity [Luck et al

1997], LIP [Colby et al 1996]

before the stimulus was presented

at cued location.

• fMRI [Kastner 1999], increases

found in all visual areas, but

stronger in V4.

30

1

Science, Volume 291, Number 5508, Issue of 23 Feb 2001, pp. 1506-1507.

Copyright © 2001 by The American Association for the Advancement of

Science.

NEUROSCIENCE:

Drums Keep Pounding a Rhythm in the Brain

Michael P. Stryker*

The rhythmic activity of neurons in the brain has fascinated neuroscientists ever since electrical

potentials were first recorded from the human scalp more than 70 years ago. The rhythms of

electrical activity in sensory neurons that encode visual information are known to vary markedly

with attention. How does neuronal encoding differ for a visual stimulus that is the center of attention

compared with one that is ignored? To answer this question, Fries et al. (1) simultaneously recorded

electrical activity from several clusters of neurons in the V4 region of the visual cortex of macaque

monkeys that were shown behaviorally relevant and distracter objects (see the figure). On page 1560

of this issue, they report a rapid increase in the synchronization of electrical activity in the gamma

frequency range (35 to 90 Hz) in V4 neurons activated by the attended stimulus (that is, the stimulus

on which attention is focused) but not in V4 neurons activated by distracter objects (1).

SOURCE FOR BRAIN: CARIN CAIN

The benefits of paying attention. A halo of attention surrounds one of the two physically

similar stimuli (vertical and horizontal stripes) that the monkey can see while his eyes

fixate on a point between them. The attended stimulus has a more powerful

representation in the V4 area of the visual cortex because the neurons that respond to it

tend to fire rhythmically in synchrony with one another, as illustrated by the wiggly trace

to the right of the stimulus. V4 neurons that respond to the other (distracter) stimulus fire

at similar rates but not in synchrony. Synchronized firing provides the attended stimulus

with a more powerful representation, illustrated by the greater clarity of the mental

image.

phoretically labeled cells are mature (innervated) re-ceptor cells. Furthermore, the incidence of bitter-responsive dye-labeled cells (18%) is remarkablyclose to the incidence of cells expressing candidatebitter receptors [15 to 20% (1)], suggesting thatdye-labeled cells are representative taste receptorcells.

11. Ca2! signals presumably indicate physiological acti-vation and presage neurotransmitter release. Howev-er, the absence of Ca2! signals does not necessarilyindicate that the cell has not been affected (forexample, if a stimulus inhibits the taste cell).

12. C. P. Richter, K. H. Clisby, Am. J. Physiol. 134, 157(1941).

13. H. D. Patton, T. C. Ruch, J. Comp. Psychol. 37, 35(1944).

14. S. D. Koh, P. Teitelbaum, J. Comp. Physiol. Psychol.54, 223 (1961).

15. E. Tobach, J. S. Bellin, D. K. Das, Behav. Genet. 4, 405(1974).

16. K. Iwasaki, M. Sato, Chem. Senses 6, 119 (1981).17. J. I. Glendinning, Physiol. Behav. 56, 1217 (1994).18. A. K. Thaw, Chem. Senses 21, 189 (1996).19. N. K. Dess, Physiol. Behav. 69, 247 (2000).

20. Bath application of cycloheximide (0.1 to 300 "M),denatonium benzoate (3 to 3000 "M), quinine HCl(10 to 3000 "M), SOA (10 to 1000 "M), and PTC(10 to 1000 "M) induced transient Ca2! responsesin taste cells in 83% of foliate taste buds tested(43 out of 52 taste buds). At the concentrationsused, none of the compounds interfered with flu-orescence intensity.

21. Lowering extracellular Ca2! appeared to increaseintracellular Ca2! transiently in many cells (Fig. 2D),consistent with previous findings in catfish taste cells[M. M. Zviman, D. Restrepo, J. H. Teeter, J. Membr.Biol. 149, 81 (1996)].

22. Quinine- and denatonium-responsive taste cells (n #2 for each) also did not show Ca2! increases whendepolarized by 50 mM KCl [see also (25)].

23. P. M. Hwang, A. Verma, D. S. Bredt, S. H. Snyder, Proc.Natl. Acad. Sci. U.S.A. 87, 7395 (1990).

24. A. I. Spielman et al., Am. J. Physiol. 270, 926 (1996).25. M. H. Akabas, J. Dodd, Q. Al-Awqati, Science 242,1047 (1988).

26. T. Ogura, A. Mackay-Sim, S. C. Kinnamon, J. Neurosci.17, 3580 (1997).

27. The response thresholds versus the behavioral

thresholds, respectively, were as follows: 0.1 to 0.3"M (Fig. 2F) versus 0.2 to 2 "M for cycloheximide(15); 10 to 30 "M (red circles, Fig. 3C) versus 8 to 20"M for quinine (13, 14, 17, 18); 3 to 10 "M (graytriangles, Fig. 3C) versus $1 "M for denatonium (16,17); 100 to 300 "M (blue triangles, Fig. 3C) versus 20to 600 "M for PTC (12, 15); and 30 to 100 "M (greensquares, Fig. 3C) versus 100 to 200 "M for SOA (19).

28. S. Bernhardt, M. Naim, U. Zehavi, B. Lindemann,J. Physiol. 490, 325 (1996).

29. J. F. Delwiche, Z. Buletic, P. A. S. Breslin, paper pre-sented at the XII International Symposium on Olfac-tion and Taste/XIV Biennial Congress of the EuropeanChemoreception Research Organisation, Brighton,UK, 20 to 24 July 2000.

30. T. Leinders-Zufall, G. M. Shepherd, C. A. Greer, F.Zufall, J. Neurosci. 18, 5630 (1998).

31. This work was supported by NIH grants DC00374(S.D.R.) and DC04525-01 (A.C.) from the NationalInstitute on Deafness and Other CommunicationDisorders.

17 October 2000; accepted 22 January 2001

Modulation of OscillatoryNeuronal Synchronization bySelective Visual AttentionPascal Fries,1* John H. Reynolds,1,2 Alan E. Rorie,1

Robert Desimone1

In crowded visual scenes, attention is needed to select relevant stimuli. To studythe underlying mechanisms, we recorded neurons in cortical area V4 whilemacaque monkeys attended to behaviorally relevant stimuli and ignored dis-tracters. Neurons activated by the attended stimulus showed increased gamma-frequency (35 to 90 hertz) synchronization but reduced low-frequency (%17hertz) synchronization compared with neurons at nearby V4 sites activated bydistracters. Because postsynaptic integration times are short, these localizedchanges in synchronization may serve to amplify behaviorally relevant signalsin the cortex.

Visual scenes typically contain multiple stimulicompeting for control over behavior, and atten-tion biases this competition in favor of the mostrelevant stimulus (1). Correspondingly, if twocompeting stimuli are contained within the re-ceptive field (RF) of an extrastriate neuron, andone of them is attended, the neuron responds asthough only the attended stimulus is present(2–6). Thus, inputs from attended stimuli musthave an advantage over inputs from unattendedstimuli (6). This is apparently not alwaysachieved by a simple increase in firing rates toan attended stimulus, however, because firingrates to a single, high-contrast stimulus in theRF are often not increased with attention (2, 5,

7). As an alternative to increases in firing rate,one potential “amplifier” of selected neural sig-nals is gamma-frequency synchronization (8–17). Small changes in gamma-frequency syn-chronization with attention might lead to pro-nounced firing-rate changes at subsequent stag-es (10, 18). Indeed, it was recently reported thatneurons in monkey somatosensory cortexshowed stronger synchronization during a tac-tile task than during a visual task, which waspresumably caused by increased attention to thetactile stimulus in the tactile task (19). Howev-er, it is not clear whether the enhanced synchro-nization was present throughout the somatosen-sory system or whether it was restricted to thoseneurons processing the relevant tactile stimuli.To be useful in selective visual attention, en-hanced synchronization would need to be con-fined to neurons activated by the features ofattended stimuli, sparing neurons activated bydistracters.

We recorded both spikes from small clus-ters of neurons (multi-unit activity) and localfield potentials (LFPs) simultaneously from

multiple V4 sites with overlapping receptivefields (RFs) (20). The monkey fixated a cen-tral spot, and after a short delay, two stimuliwere presented at equal eccentricity, one in-side and one outside the RFs (Fig. 1C). Onseparate trials, the monkey’s attention wasdirected to either stimulus location (21), andwe compared neuronal activity between thetwo attention conditions. We refer to the con-dition with attention into the RF as “withattention,” always implicitly comparing withidentical sensory conditions but with atten-tion outside the RF.

One example pair of recording sites isshown in Fig. 1. The response histograms (Fig.1D) show stimulus-evoked responses but noclear effect of attention, either during the pre-stimulus delay or during the stimulus period. Toexamine the effect of attention on synchroniza-tion, we calculated spike-triggered averages(STAs) of the LFP (11, 14, 22). The STAsrevealed oscillatory synchronization betweenspikes and LFP from two separate electrodes,both during the delay (Fig. 1, E and F) andthe stimulus period (Fig. 1, H and I). Duringthe delay, the power spectra of the STAs (Fig.1G) were dominated by frequencies below 17Hz. With attention, this low-frequency syn-chronization was reduced (23). During thestimulus period, there were two distinctbands in the power spectrum of the STAs(Fig. 1J), one below 10 Hz and another at 35to 60 Hz. With attention, the reduction inlow-frequency synchronization was main-tained and, conversely, gamma-frequencysynchronization was increased.

To determine whether these changes insynchronization were precisely localizedwithin V4, we made additional recordingswith the stimulus outside the RF very close tothe RF border (Fig. 2). Even with closelyspaced stimuli, we found the same attentionalmodulation of synchronization as with thesecond stimulus far away (Fig. 2, C to E). In

1Laboratory of Neuropsychology, National Instituteof Mental Health, National Institutes of Health, Build-ing 49, Room 1B80, 9000 Rockville Pike, Bethesda, MD20892–4415, USA. 2Systems Neurobiology Laborato-ry, The Salk Institute for Biological Studies, 10010North Torrey Pines Road, La Jolla, CA 92037–1099,USA.

*To whom correspondence should be addressed. E-mail: [email protected]

R E P O R T S

23 FEBRUARY 2001 VOL 291 SCIENCE www.sciencemag.org1560

31

addition to these changes in synchronization,firing rates to the RF stimulus were alsomoderately suppressed when attention wasdirected to the surround stimulus (Fig. 2B),consistent with previous studies of competi-tive interactions between stimuli in V4 RFs(2, 5, 6). Large firing-rate changes with at-tention occurred only with a competing stim-ulus very near to the RF border.

Across the set of recordings, attentionalmodulations of oscillatory synchronizationwere similar to the presented examples. Wequantified the STA modulation by calculatingthe spike-field coherence (SFC) (14), whichmeasures phase synchronization betweenspikes and LFP oscillations as a function offrequency. The SFC is normalized for spikerate and spectral power of the LFP and is

therefore immune to changes in these param-eters. The SFC ranges from 0 (complete lackof synchronization) to 1 (perfect phase syn-chronization). Computing the coherence be-tween a point process (spikes) and an analogsignal (LFP) is a special case, and thereforedetailed information is given as supplemen-tary material (24). We pooled data for thestimulus configurations in which the distract-ers were near to and far from the RF.

For the delay period (Fig. 3, A and B),low-frequency SFC was reduced by a me-dian of 51% with attention (160 decreases,23 increases; P ! 10"6) (25). The delay-period STAs did not show clear gamma-frequency modulations (Fig. 1, E to G).However, statistically, the gamma-bandSFC (35 to 60 Hz) increased by a median of10% with attention (106 increases, 77 de-creases; P ! 0.02). Delay-period firingrates were nonsignificantly increased by amedian of 5% with attention (35 increases,26 decreases; P # 0.13). During the stim-ulus period (Fig. 3, C and D), low-frequen-cy SFC was reduced by a median of 23%with attention (142 decreases, 65 increases;P ! 10"6), whereas gamma-frequencySFC increased by a median of 19% (167increases, 40 decreases; P ! 10"6). Firingrates were enhanced by a median of 16%with attention (68 increases, one decrease;P ! 10"6). Attention affected the normal-ized power spectrum of the raw LFP essen-tially in the same way as the SFC.

The above analysis of the sustained re-

Fig. 1. Attentional mod-ulation of oscillatory syn-chronization betweenspikes and LFP from twoseparate electrodes. Rawstimulus–driven LFP andmulti-unit activity withattention outside the RF(A) and into the RF (B).(C) RFs (not visible tomonkey; green: spike re-cording site, yellow: LFPrecording site); fixationpoint and grating stimuliare to scale. The RFs forboth recording sites weredetermined from themulti-unit activity andincluded only one of thetwo stimuli. In separatetrials, this stimulus waseither attended or ig-nored. Data are from 300correct trials per atten-tion condition. (D) Firing-rate histograms. Verticallines indicate stimulusonset and 300 ms afterstimulus onset. Delay pe-riod was the 1-s intervalbefore stimulus onset,and stimulus period wasfrom 300 ms after stim-ulus onset until one ofthe stimuli changed itscolor. Delay-period STAsfor attention outside theRF (E) and into the RF (F)and the respective powerspectra (G). Stimulus-pe-riod STAs for attentionoutside the RF (H) andinto the RF (I) and the respective power spectra (J).

Fig. 2. Attentional modu-lation of synchronizationhas high spatial resolutionin the cortex. Conventionsare as for Fig. 1 exceptthat the stimulus outsidethe RF is only 1.5° fromthe RF border. Spikes andLFP are from two separateelectrodes. Data are from125 correct trials per at-tention condition. (A) RFs,fixation point, and gratingstimuli. (B) Firing-rate his-tograms. (C and D) STAsfor stimulus period and(E) the respective pow-er spectra.

Fig. 3. Population measures of attentional ef-fects on the SFC. Scatter plots compare atten-tional effects on low- and gamma-frequencySFC and on firing rates. Each dot represents onepair of recording sites. The x- and y-axis valuesare attentional indices defined as AI(P) #[P(in) " P(out)]/[P(in) $ P(out)], with P beingone of the three parameters under study: low-frequency synchronization (L), gamma-fre-quency synchronization (G), and firing rates (R).P(in) is the value of the parameter with atten-tion directed into the RF, and P(out), withattention directed outside the RF. (A and B)Activity from the 1-s delay period before stim-ulus onset. (C and D) Activity from the stimulusperiod.

R E P O R T S

www.sciencemag.org SCIENCE VOL 291 23 FEBRUARY 2001 1561

Attention increases Gamma synchrony

• By increasing impact on post-synaptic neurons, increase in synchrony could be

a powerful way to amplify the signal

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

!180 !90 0 90 1800

25

50

!test

Resp

on

ses

A ! Tuning Curves

!180 !90 0 90 180

0

25

50

!test

B ! Population Response

Context

When neurons and perception change

Page 9: The Ring Model (1) The Ring Model (2)homepages.inf.ed.ac.uk/pseries/CCN09/CCN-wk4-2.pdf · the recti Þ cation, the ! = 0,1, and 2 F ourier components are all ampli Þ ed (Þ gur

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

A- 'Unaware'

Fixed !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

B- 'Aware'

Adaptive !!

Decoder

Adaptation!

State

Population !

Response

Encoder!!! r

s s

s s

!180 !90 0 90 1800

25

50

!test

Resp

on

ses

A ! Tuning Curves

!180 !90 0 90 180

0

25

50

!test

B ! Population Response

Context

When neurons and perception change