an examination of goal-directed ... - academic commons
TRANSCRIPT
![Page 1: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/1.jpg)
An Examination of Goal-Directed Motivation in Mice: The Role of
Dopamine D2 and Serotonin 2C Receptors
Matthew R. Bailey
Submitter in partial fulfillment of the
requirements for the degree of
Doctor of Philosophy
in the Graduate School of Arts and Sciences
COLUMBIA UNIVERSITY
2017
![Page 2: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/2.jpg)
© 2017
Matthew R. Bailey
All rights reserved
![Page 3: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/3.jpg)
Abstract
An Examination of Goal-Directed Motivation in Mice: The Role of
Dopamine D2 and Serotonin 2C Receptors
Matthew R. Bailey
Motivation has been defined as a set of processes which enables organisms to overcome
obstacles by energizing behavior in the pursuit of a goal. There are several important
observations about motivated behavior which provide insight into the neural mechanisms
underlying goal-directed motivation. First, motivation serves two important functions, as it both
energizes behavior and also directs it toward or away from specific stimuli. Many of the
behavioral tasks used to assay motivation in laboratory rodents do not specifically aim to
measure these two distinct aspects of motivation. A second feature of goal-directed motivation is
that it is sensitive to both costs and benefits of a given situation, enabling animals to make cost-
benefit decisions. Again, many of the behavioral tasks which study cost-benefit decision making
do not specifically aim to independently measure the impact of cost manipulations and benefit
manipulations in an isolated manner. Here, I first develop behavioral measures which aim to
specifically dissociate activational and directional effects of motivation. By characterizing a
novel behavioral measure known as a Progressive Hold Down (PHD) task, and using this task in
parallel with a more traditionally used Progressive Ratio (PR) task, I show that
methamphetamine robustly enhances activational effects of motivation, leading to increased
response rates in both the PHD and PR task, but mice are not more goal-directed in the PHD
task. I next develop and characterize two novel behavioral assays which are specifically used to
examine effort and value contributions to cost-benefit decision making. The Concurrent Effort
![Page 4: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/4.jpg)
Choice (CEC) task measures how changes in effort levels impact decision making whereas the
Concurrent Value Choice (CVC) task measure how changes in reward value impact decision
making. Using these novel assays to examine specific processes important for goal-directed
motivation, I carefully examine the role of manipulation of the Dopamine D2 receptor (D2R) in a
mouse model which over-expresses the D2R within the striatum (D2R-OE), and the role of
pharmacological manipulation of the Serotonin 2C receptor (5-HT2CR) with the functionally
selective ligand SB242084. Whereas D2R-OE specifically impacts sensitivity to changes in
effort levels which decrease overall levels of goal-directed motivation, selective modulation of
the 5-HT2CR via treatment with SB242084 increases response vigor through enhanced
dopamine release in the dorsomedial striatum, but this increase in response vigor does not alter
sensitivity to effort or value changes when working for rewards. Together, these studies
demonstrate the benefits of developing a more nuanced understanding of how specific
manipulations impact motivated behavior by examining the specific underlying processes being
altered.
![Page 5: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/5.jpg)
i
TABLE OF CONTENTS
………………………………………………
List of Figures and Tables ..........................….………………………………………………..... iii
Acknowledgements ..……………………….………………………………………………......... v
Dedication ..…………………….……………………………………………………………...... xi
Chapter 1: Introduction …………………………………………………………...................... 1
6. References ...................................……………………………………………………… 6
Chapter 2: Neural Substrates Underlying Effort, Time, and Risk Based Decision Making in
Motivated Behavior …………………………………………………………………………... 10
Abstract ……………………………………………………………………………….... 10
1. Introduction ………………………………………………………………………….. 10
2. Evidence of animals processing information about costs …………………………… 17
3. Activational components of motivation ……………………………………………... 20
4. Neurobiology of cost-benefit decision making ……………………………………… 37
5. Summary and future directions ……………………………………………………… 61
6. References ................................……………………………………………………… 71
Chapter 3: A Novel Strategy for Dissecting Goal-Directed Action
And Arousal Components of Motivation ………………………….…………………............ 87
Abstract ………………………………………………………………………………… 87
1. Introduction ………………………………………………………………………….. 88
2. Methods ……………………………………………………………………………… 91
3. Results ……………………………………………………………………………….. 96
4. Discussion ………………………………………………………………………….. 105
5. References ..............................……………………………………………………… 110
Chapter 4: Novel Strategies for Isolating the Effects of Effort and Value on Cost-Benefit
Decision Making in Mice …………………………………………......................................... 116
Abstract ……………………………………………………………………………….. 116
1. Introduction ………………………………………………………………………… 117
2. Methods …………………………………………………………………………...... 120
3. Results ……………………………………………………………………………… 125
4. Discussion ………………………………………………………………………….. 141
5. References ..............................……………………………………………………… 152
Chapter 5: The Selective involvement of Striatal Dopamine D2 Receptors in Effort-Based
vs Value-Based Decision Making ...…………………………………..................................... 157
![Page 6: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/6.jpg)
ii
Abstract ……………………………………………………………………………….. 157
1. Introduction ……………………………………………………………………….... 168
2. Methods …………………………………………………………………………...... 160
3. Results ……………………………………………………………………………… 166
4. Discussion ………………………………………………………………………….. 182
5. References ..............................……………………………………………………… 189
Chapter 6: The Effects of Pharmacological Modulation of the Serotonin 2c Receptor on
Goal-Directed Behavior in Mice .….…………………………………................................... 192
Abstract ……………………………………………………………………………….. 192
1. Introduction ………………………………………………………………………… 193
2. Methods …………………………………………………………………………….. 195
3. Results ……………………………………………………………………………… 199
4. Discussion ………………………………………………………………………….. 207
5. References ..............................……………………………………………………… 211
Chapter 7: A functional interaction between serotonin receptor signaling and striatal
dopamine release enhances goal-directed activity in mice ......………................................. 217
Abstract ……………………………………………………………………………….. 217
1. Introduction ………………………………………………………………………… 217
2. Methods .…………………………………………………………………………..... 220
3. Results .……………………………………………………………………………... 226
4. Discussion .…………………………………………………………………………. 238
5. References ..............................……………………………………………………… 243
Chapter 8: Conclusion .…………………………………………………………………........ 247
1. References ..............................……………………………………………………… 255
![Page 7: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/7.jpg)
iii
LIST OF FIGURES
Chapter 2 Figure 2.1 – Schematic diagram of a model of motivation .............................................. 16
Figure 2.2 – Effects of methamphetamine in a PR and PHD task …………................... 24
Figure 2.3 – Diagram of brain regions involved in PR and effort based choice ............ 36
Figure 2.4 – Diagram of brain regions involved in delay and risk based choice ……... 55
Figure 2.5 – Summary of brain regions involved in cost-benefit decision making ……. 66
Chapter 3 Figure 3.1 – Relationship between PR performance and locomotor activity…………... 90
Figure 3.2 – Characterization of baseline behavior in the PHD task ............................. 96
Figure 3.3 – Food restriction effects on PHD performance……………......................... 98
Figure 3.4 – Reward value effects on PHD performance………………......................... 99
Figure 3.5 – Methamphetamine increases responding in a PR……………………….. 100
Figure 3.6 – Methamphetamine increases lever pressing in a PHD task ..................... 102
Figure 3.7 – Methamphetamine increases hyperactivity in a PHD task ……………... 104
Chapter 4 Figure 4.1– Schematic Representation of the CEC task …............................................ 126
Figure 4.2 – Effort-based choice in the CEC task ……................................................. 127
Figure 4.3 – Response number costs impact latency to work ....................................... 129
Figure 4.4 – Response duration costs impact latency to work ........………………….. 131
Figure 4.5 – Temporal variables are predictive of PSE in the CEC task ……............. 134
Figure 4.6 – Schematic representation of the CVC task ............................................... 135
Figure 4.7 – Reward magnitude alters value-based choice in the CVC task ............... 136
Figure 4.8 – Reward magnitude modulates latency to work ........................................ 137
Figure 4.9 – Reward magnitude modulates the vigor of responding ........................... 139
Figure 4.10 – Reward devaluation in the CVC task ..................................................... 140
Figure 4.11 – Reward devaluation modulates vigor of responding ............................. 141
Supplementary Figure 4.1S – Effort choice functions of individual subjects ………... 149
Supplementary Figure 4.2S – Value Choice functions of individual subjects………... 150
Supplementary Figure 4.3S – Value Choice functions following pellet devaluation… 151
Chapter 5 Figure 5.1 – D2R-OE mice are less willing to work for preferred rewards ……......... 168
Figure 5.2 – D2R-OE mice have altered effort sensitivity in a CEC task …................. 171
Figure 5.3 – D2R-OE mice have deficits when repeatedly executing actions ………... 173
Figure 5.4 – D2R-OE mice have altered temporal dynamics of lever pressing …........ 174
Figure 5.5 – D2R-OE mice quit working with a response number cost …………........ 176
Figure 5.6 – D2R-OE mice maintain persistence with a response duration cost…....... 177
Figure 5.7 – Reward magnitude manipulation effects on choice in the CVC task ........ 179
Figure 5.9 – D2R-OE mice are sensitive to reward magnitude in a CVC task ............. 180
Figure 5.8 – D2R-OE mice are sensitive to reward devaluation in a CVC task ........... 181
![Page 8: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/8.jpg)
iv
Chapter 6 Figure 6.1 – SB242084 increases responding in a PR .................................................. 200
Figure 6.2 – SB242084 increases responding for a preferred reward .......................... 202
Figure 6.3 – SB242084 increases goal-directed action in a PHD task ......................... 203
Figure 6.4 – SB242084 doesn’t enhance non-goal-directed hyperactivity ................... 204
Figure 6.5 – The acute effects of SB242084 can be reinstated repeatedly .................... 206
Chapter 7 Figure 7.1 – SB242084 treatment does not alter reward value sensitivity ................… 227
Figure 7.2 – SB242084 Does Not Alter the Encoding of Rewards or Reward Cues …. 229
Figure 7.3 – SB242084 Treatment Does Not Alter Effort Sensitivity ………................ 231
Figure 7.4 – SB242084 Enhances Goal-Directed Persistence in a PR …..................... 232
Figure 7.5 – SB242084 Enhances Response Vigor in a PR ……………….................. 233
Figure 7.6 – SB242084 does not alter sensitivity to extinction ……............................. 234
Figure 7.7 – SB242084 increases vigor in FR schedules .............................................. 236
Figure 7.8 – SB242084 potentiates dopamine release in the DMS ............................... 238
LIST OF TABLES
Chapter 2 Table 2.1 – Regional brain manipulations that modulate PR behavior .......................... 26
Table 2.2 – Studies of effort-based decision making ....................................................... 39
Table 2.3 – Brain regions which modulate delay based choice ...................................... 49
Table 2.4 – Brain regions which modulate risk based choice ......................................... 57
Chapter 4 Table 4.1 – Effort & value variables altered in effort-related choice tasks .................. 119
Table 4.2 – Results from linear regression of temporal variables ................................ 133
![Page 9: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/9.jpg)
v
Acknowledgements
I would like to start by thanking the members of my dissertation committee, Peter
Balsam, Eleanor Simpson, James Curley, Rae Silver, and Jon Horvitz for the time and effort they
put into guiding me throughout my graduate education, and the role they played in helping me to
complete the work involved in this dissertation.
I have been incredibly fortunate to have had so many supportive and influential people
impact me in different ways throughout my life, and I will do my best to thank as many of them
here as I can, though I will inevitably end up having to leave some people out.
My family has always been a tremendous source of support, and I first want to thank my
parents, Ann and Mike Bailey, for the nearly incomprehensible amount of love, encouragement,
and unconditional support they have shown me throughout my life. You have supported me in all
of my various endeavors throughout life, from shuttling me all over the place to hockey
practices, games, and tournements, to always being there watching and cheering me on at hockey
games, track meets, and tennis matches, to supporting me and enabaling me to figure out what on
earth I was going to do with myself once I started college, and encouraging me to pursue my
interest in Psychology as an undergraduate at the University of Minnesota once I figured out that
was what I wanted to do, and most recently through your continuous support while I was in
graduate school. I would not be the same person I am today had it not been for the gift of having
had such loving and supportive parents throughout my life.
I also want to thank my brother, Michael, for the impact he had on my life as I was very
fortunate to have grown up with an older brother looking out for me, teaching me how to do any
and everything he happened to be doing, and for letting me do my best to keep up with whatever
the older kids were up to. I also want to thank my sister, Melanie, for having been a wonderful
![Page 10: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/10.jpg)
vi
younger sister who put up with me often acting like “one of her four parents!” The amount of
passion and tenacity with which you pursue any and every thing you do has always made me a
very proud older brother. I want to thank my brother Michael and my sister-in-law Stacy for
always being so accomodating and understadning with their time, as they always allow me to
sporadically show up and get to spend time with Elaina, Hadley, and Alex. Your family has been
a tremendous source of joy and inspiration to me over these last five years. And Finally, I want
to thank my grandparents for the role they have played in my life. My grandfather, Richard
Caldecott, promised me he would live to see me get my PhD and he kept that promise. I will
always remember and cherrish our many lengthy lunchtime discussions about science and about
life. My grandmother, Lucille Caldecott, who was an inspiration to me through her passion for
education, and her generosity in seeing that her family was able to puruse their educational
aspirations as well. And my grandfather Bill Bailey and my grandmother Loretta Bailey who will
always remind me of the extraordinary amount of good that one can bring into this world through
our actions and decisions to help others.
I would also like to say thank you to some people who have been friends of mine for
many years. Lincoln Hughes, my “Bobby Orr: Lightnigh on Ice” partner in crime, the guy who
got me on the invite list to BugBee Hive seven years straight, and the person who I’ve respected
ever since the day I saw he was able to raise the puck off the ice before me. Tony Tulp, the
friend I know would be there for me for better or worse, through think and thin, with whom even
an ordinary night of sitting around can turn into an epic, philosophical, guitar filled night I’ll
likely never forget. Scottie O’Donnell, who has always been a generous friend I can count on to
offer me a spare couch for the night, meet me for a quick luncheon at the Cahill Dinner, get in a
round of golf (and probably lose), or play one more late night game of cribbage with an E&J
![Page 11: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/11.jpg)
vii
nightcap. Thank you all very much for being the kind of friends who have supported me
throughout all of my different phases in life, and people who have offered unwavering support
during my most successful times and my least successful times in life.
My time in graduate school and my experience in New York would not have been nearly
as meaningful, or fun, had it not been for the incredible group of people I am lucky enough to
call my friends. I will be forever grateful to Greg Jensen for the time he spent helping me during
my first years of graduate school, and our endless discussions about science, statistics,
programming, and thinking about data in new ways. I also want to thank Chris Mezias, for being
an incredibly supportive friend, for helping me to remember not take myself and my work too
seriously, and for helping me to re-discover the importance of keeping a healthy dose of fun in
life with impromtu restraunt booth kareeokee sessions, rooftop excurtions, spur of the moment
road trips, and middle of the night paddle boat rides! I also want to thank, Emalick Njie, my
good friend with whom many of our stories begin with, “A Minnesotan and a Gambian walk into
a bar...” Emalick has been an incredibly supportive friend that has been there for me when I
needed it most, and has always had an amazing gift of knowing exactly what to say or do at
exactly the right time.
This one is a lengthy thank you, but bare with me. One afternoon in the spring semester
of my first year at Columbia I was sitting in a lecture for a course called Neurobiology II:
Developmental and Systems Neurosience. I noticed one day that a guy with curly black hair one
row in front of me was asking some pretty interesting questions, the kind of stuff even the
professor didn’t know how to answer. As I was leaving the building following the lecture on that
particular day I saw that same guy with curly black hair talking to a guy with a beard and
flowing shoulder length brown hair. Based on a gut feeling I walked up to them, told them my
![Page 12: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/12.jpg)
viii
opion as to how I would answer one of the more interesting questions Mr. curly black hair had
asked, and the three of use proceeded to discuss ideas for the next 30 minutes. These two guys
were Ben Shababo and Sasha Rayshubskiy, and the three of us would meet after class outside the
building after nearly every lecture to discuss ideas about sciecne and life. As the semester was
coming to an end we realized that we wouldn’t have an excue to meet and have our
conversations any longer, so we decided that we would all meet up on Wednesday nights to keep
discussing ideas under the guise of a journal club. This “rouge” journal club became known as
NeuroStorm, and early on Ben Shababo, Sasha Rayshubskiy and I met Emalick Njie and
Chrisophe Dupre and the rest was history. For five and a half years, on any given Wednesday
night at 7:00 PM in room 1014 of Fairchild Hall you could find an incredible group of some of
the most wonderful people I’ve ever known discussing ideas and having fun. When I walked up
to Ben and Sasha that first day they welcomed me, and listened to my ideas with enthusiasm and
made me feel entirely welcomed, and for the first time since I had moved to New York, I felt
truly accepted. From the very first day of NeuroStorm’s existence, I made it my goal to create an
atmosphere and an enviornment which welcomed everyone with open arms, and a place that was
open to any and every idea people had no matter how crazy or unconventional it was. Over the
years many people have explained feeling accepted when they joined our group. I want to thank
every single person who was a part of this group and this experience with me, as it has
undeniably made me a better person. Ben Shababo, Sasha Rayshubskiy, Emalick Njie, Chrisophe
Dupre, Greg Jensen, Matti Vuorre, Raphael Gerraty, Chris Mezias, Elke Schipanni, Emily
Feierman, Paul Frazel, Cooo Pete, Ben Joffe, Olivia Goldman, Nuri Jeong, Joud Hijazi, Dave
Barack, Brittany DeFeis, Judy Xu, Narmine, Terese, and because I am inevitably forgetting some
![Page 13: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/13.jpg)
ix
people, everyone on the [email protected] list, thank you so incredibly much for
having been a part of this life shaping experience with me!
I would also like to thank the many people who were a part of the Balsam Lab who
contributed in so many different ways to the many different projects I was involved with. Not a
single one of the experiemnts I have done could have been done alone and I was fortunate to
have so many awesome partners in crime over the years. Greg Jensen, was instrumental in
teaching me many of the necessary basics of programming in MedPC and Matlab to get started,
and Kathleen Taylor who showed me the ropes when it came to Med Associates equiptment, and
taught me the basics of running an experiment and data collection for my first few operant
experiments. I want to thank Chris Mezias and Cait Williamson, my TA partner in crime many
times, who helped me in every single aspect of my work and experimnets during my 2nd and 3rd
years of graduate school, especially with the early work with the Progressive Hold Down task
and SB242084 pharmacology experiments. I could not have learned to juggle multiple
simultaneuos experiments had it not been for all of their help! I also want to thank Elke Schipani,
and I am extremley fortunate to have had her helping me with experiments around the time I was
first developing the Concurrent Effort Choice and Concurrent Value Choice tasks, as her project
management and organizational skills undoubtedly provided much needed organization and
precision to those complicated experiemnts at every single level! I also want to thank Eileen
Chun for her help in running many experiments which are in this thesis, and Talia Malenka and
Ben Gersten for their help in running experiements. And last, but certaitnly not least, I am very
thankful that Peter Balsam gave me the opportunity to work in his lab, and that he allowed me to
run experiments with the freedom to do things in new and creative ways, and to think differently
about these questions. Some of the most fun times I had thinking about science in graduate
![Page 14: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/14.jpg)
x
school came while Peter and I were discussing how we could measure certain aspects of
behavior and how to analyze data to try to understand what on earth was going on inside the
mind of a mouse casuing it to behave in the way it did.
I would also like to thank Eleanor Simpson and several members of her lab who have
been instrumental in helping me with various experiemnts throughout the years. Vannessa
Winiger who helped out with a large amount of the daily behavioral running of mice for many
many different experiments. Olivia Goldmann was instrumental in carrying out all of the
microdialysis experiments which became such an important piece to understanding what
SB242084 is doing to alter motivation. Estefania Ballo was instrumental in carrying out all of the
cyclic voltametry studies which helped us to definitivley show that SB242084 is not impacting
the encoding of value. In both of these cases, by instrumental, I mean 100% responsible! And
finally, Eleanor Simpson was an incredible mentor to me over the last 5 years. Eleanor taught me
much of what I know today about conducting neuroscience research, and helped me
tremendously in developing my abilities to think about science. What I will remember most,
however, is how much fun I had thinking about data, planning experiemnts, and analyzing data
with Eleanor!
Finally, I would like to thank Anindya Bagchi, my previous mentor at the University of
Minnesota, for teaching me to believe in myself, and always encouraging outside the box
thinking. Anindya was the person who convinced me to apply to Columbia in the first place, and
provided me with the push I needed to take a risk in order to discover something new.
Thank you all very very much!
![Page 15: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/15.jpg)
xi
Dedicated to:
My mom and dad,
Ann and Mike Bailey
![Page 16: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/16.jpg)
1
Chapter 1
Introduction
1.1 General Overview
Motivation has been defined as a set of processes which enable organisms to overcome
obstacles by energizing behavior in the pursuit of a goal. Goal-directed motivation is essential to
survival, as most natural environments have limited natural resources, and effort often must be
expended to obtain these resources. Moreover, the conditions in most environments are rarely
stable for long periods of time, so animals must be able to adapt their behavioral strategies based
on current conditions. Given the importance of motivational processes to survival, it is not
surprising that evolution has co-opted multiple distinct neural processes that operate in
cooperation to give rise to what we recognize as motivated behavior, as sensory, motor,
cognitive, and emotional processes are all involved (Pezzulo and Castelfranchi, 2009; Salamone,
2010). Various observations over the years have helped to shed light on some of the different
processes involved in motivated behavior. For example, motivation both energizes behavior and
also directs it toward a specific goal. Moreover, in response to constantly changing
environmental conditions, motivational processes enable organism to adapt their behavior in
response to both costs and benefits of a given situation, allowing animals to engage in cost-
benefit decision making in response to environmental conditions. Each of these concepts is
reviewed in Chapter 2.
One important consequence of relying on multiple interacting sub-processes working in
unison for goal-directed action is that there are many different ways in which motivated behavior
could be altered. A manipulation could alter the extent to which an animal’s behavior can be
![Page 17: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/17.jpg)
2
energized, either by increasing or decreasing the vigor of behavior. Secondly, a manipulation
could alter the directional function of motivation altering an animal’s selection of specific classes
of actions in different situations. Third, a manipulation could alter an animal’s sensitivity to
changing costs or effort requirements which would disrupt cost-benefit decision making. Fourth,
a manipulation could alter an animal’s reward value sensitivity which would also disrupt cost-
benefit decision making.
In this thesis, I employ an approach to the study of motivation that recognizes these
various underlying processes working in parallel to give rise to motivated behavior. To this end,
I first carry out experiments which develop behavioral measures to isolate sub-processes
important for motivation. I first develop a method for dissociating the activational or energizing
from directional effects that specifically alter goal-directed behavior (Chapter 3). I next develop
methods for examining how effort and value specifically impact cost-benefit decision making
when these variables are manipulated in independently (Chapter 4). After developing these
measures, I carry out an examination of the impact of two different manipulations that have
previously been shown to alter motivated behavior, with the goal of identifying the
neurobiological mechanisms that regulate sub-processes impacting motivation. First, I examine
impact of chronic developmental over-expression of the Dopamine D2R (D2R-OE), which
results in decreased goal-directed motivation (Chapter 5). I next examine the impact of acute
pharmacological modulation of the Serotonin 2C receptor (5-HT2CR) via the functionally
selective ligand SB242084, which has been shown to enhance motivated behavior (Chapter 6-7).
Finally, I discuss the insights about motivation gained as a result of this work, and present future
questions that will be important for future work investigating these topics (Chapter 8).
1.2 Activational vs Directional effects of motivation
![Page 18: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/18.jpg)
3
Motivation has been recognized as serving the important functions of both energizing
behavior, and directing it towards positive goals such as food and water, and away from
unpleasant experiences such as pain or resource deprivation (Duffy, 1957; Hebb, 1955;
Salamone et al., 2015; Simpson et al., 2016).
The activational component of motivation enables an organisms to overcome obstacles
standing between itself and a goal by invigorating the execution of behavioral responses.
Activation influences motivated behavior through increases in the likelihood of initiating
behaviors, the speed or vigor with which the behavior is performed, and the level of persistence
of action in the face of distractions or failures (Floresco, 2015; Salamone, 1992; Salamone et al.,
2002).
The directional component of motivation guides animals toward stimuli and outcomes
relevant to survival at any given time. As an early example, animals approach pleasurable or
positive stimuli (e.g. food, water, sex, etc.), and avoid or escape negative stimuli (e.g. painful
conditions, predators, stress) (Salamone et al., 2016). More recently, researchers have developed
far more specific definitions of directional aspects of motivation by examining the specific
neural circuits, and physiological and environmental conditions involved in selecting specific
actions such feeding, drinking, mating (Kelley, Baldo, Pratt, & Will, 2005; Oka, Ye, & Zuker,
2015).
1.3 Cost-Benefit Decision making processes underlying motivated behavior
Observations that specific deprivation states (i.e. water vs food) direct behavior to seek
out the deprived nutrient lend support to the importance of the directing influence motivation has
on behavior. At any given time, the different durations of deprivation of specific resources
![Page 19: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/19.jpg)
4
impact how important that resource is for an animal. As such, the relative value of resources
fluctuates in time and circumstance. Additionally, because environments create differential costs
for procuring a given resource, animals must be able to make adaptive cost-benefit decisions on a
regular basis.
There is considerable evidence, that in many circumstances both humans (Croxson et al.,
2009) and other animals (Atalayer et al., 2009; Collier et al., 1997) engage in cost benefit
decision making processes. This area of research is increasingly gaining attention as we
recognize that effective cost-benefit decision making processes go awry in psychiatric conditions
known to impair motivation, such as Schizophrenia and Depression (Barch et al., 2016; Reddy et
al., 2016; Treadway, 2016), as well as excessive motivation seen in Drug and Gambling
Addiction (Meyer et al., 2016; Peter W. Kalivas et al., 2005). Currently, nearly all studies
investigating cost-benefit decision making in both human subjects and experimental organisms
employ behavioral choice paradigms that simultaneously manipulate effort and value variables
(Gold et al., 2015). Methods to study the impact of these two variables in isolation are developed
in Chapter 4.
1.4 Dopamine D2 Receptors and motivated behavior
Dopamine signaling has long been implicated in various aspects of motivated behavior
(Lyon and Robbins 1975; Evenden and Robbins 1983; Taylor and Robbins 1984, 1986;
Ljungberg and Enquist 1987). Disruptions of mesolimbic dopamine signaling decreases operant
responding in progressive ratio tasks and shifts effort related choice behavior away from options
that require high effort. Cell body lesions or inactivation of NAc (Bezzina et al., 2008; Ghods-
Sharifi et al., 2010; Hauber et al., 2009), neurotoxic lesions of mesoacumbal dopamine terminals
![Page 20: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/20.jpg)
5
(Hamill et al., 1999; Salamone, 1994; Salamone et al., 1999), and blockade of dopamine D1 or
D2 receptors within the NAc (Bari et al., 2005; Nowend et al., 2001) have all been found to
reduce effortful behavioral output in both types of behavioral paradigms.
There have been several genetic mouse models developed which manipulate dopamine
receptors and other dopamine related proteins which have also been found to impact motivated
behavior (Cagniard et al., 2006; Palmiter, 2008). Here, I examine a model which over-expresses
the dopamine D2 receptor within the striatum (D2R-OE) (Kellendonk et al., 2006), developed to
mimick the increased D2R density found in the striatum of some patients with schizophrenia
(Wong et al., 1986; Laruelle et al., 1996; Breier et al., 1997; Laruelle, 1998; Abi-Dargham et al.,
2000; Seeman and Kapur, 2000). The D2R-OE mouse model has been previously shown to have
a deficit in goal-directed motivation (Drew et al., 2007; Simpson et al., 2011; Ward et al., 2012).
In this thesis, I conduct a detailed analysis of the specific motivational sub-processes impacted in
this D2R-OE model (Chapter 5).
1.5 Serotonin 2C Receptors and motivated behavior
There have been a number of different neurotransmitter systems identified as modifying
dopamine signaling indirectly, making these additional targets by which motivation can be
modulated. One candidate which has recently been shown to interact with dopamine signaling is
the serotonin 2C receptor (5-HT2CR). This receptor is expressed in brain regions which are
known to modulate motivated behavior, including the Ventral Tegmental Area (VTA),
Substantial Niagra (SN), the Dorsal Striatum (DS, and Nucleus Accumbens (Nac) (Bubar, M. J.
et al., 2007; Bubar, Marcy J. et al., 2011; Eberle-Wang et al., 1997; Pasqualetti et al., 1999).
While serotonergic activation of 5-HT2CR’s and 5-HT2CR agonists have been found to exert an
![Page 21: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/21.jpg)
6
inhibitory effect on dopamine neuron firing rates in the VTA and SN, thereby decreasing
dopamine release at the terminal sites of the mesolimbic, and nigrostriatal dopamine pathways,
5-HT2CR antagonists and inverse agonists have been shown to increase the firing rates of DA
neurons and enhance DA efflux in the terminal targets of these neurons (Alex et al., 2007; Di
Matteo et al., 2008).
There have been several behavioral effects of drugs which act at the 5-HT2CR as well.
Selective 5-HT2CR agonists lorcaserin, Ro-600175, and CP-809101 all impair motivation on
operant tasks for food rewards (Bezzina et al., 2015; Fletcher et al., 2009; Higgins et al., 2013),
whereas the selective 5-HT2CR antagonist-like ligand SB242084 have been shown to increase
motivation (Simpson et al., 2011). In this thesis (Chapters 6-7), I carry out a detailed analysis of
the specific motivational sub-processes impacted through pharmacological modulation of the 5-
HT2CR via the functionally selective ligand SB242084.
References
Alex, K. D., & Pehek, E. A. (2007). Pharmacologic mechanisms of serotonergic regulation of
dopamine neurotransmission. Pharmacology & Therapeutics, 113(2), 296-320.
Atalayer, D., & Rowland, N. E. (2009). Meal patterns of mice under systematically varying
approach and unit costs for food in a closed economy. Physiology & Behavior, 98(1-2),
85-93.
Barch, D. M., Pagliaccio, D., & Luking, K. (2016). Mechanisms Underlying Motivational
Deficits in Psychopathology: Similarities and Differences in Depression and
Schizophrenia. In E. H. Simpson & P. D. Balsam (Eds.), Behavioral Neuroscience of
Motivation (pp. 411-449). Cham: Springer International Publishing.
Bari, A. A., & Pierce, R. C. (2005). D1-like and D2 dopamine receptor antagonists administered
into the shell subregion of the rat nucleus accumbens decrease cocaine, but not food,
reinforcement. Neuroscience, 135(3), 959-968.
Bezzina, G., Body, S., Cheung, T. H. C., Hampson, C. L., Bradshaw, C. M., Glennon, J. C., &
Szabadi, E. (2015). Evidence for a role of 5-HT2C receptors in the motor aspects of
performance, but not the efficacy of food reinforcers, in a progressive ratio schedule.
Psychopharmacology, 232(4), 699-711.
![Page 22: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/22.jpg)
7
Bezzina, G., Body, S., Cheung, T. H. C., Hampson, C. L., Deakin, J. F. W., Anderson, I. M.,
Bradshaw, C. M. (2008). Effect of quinolinic acid-induced lesions of the nucleus
accumbens core on performance on a progressive ratio schedule of reinforcement:
implications for inter-temporal choice. Psychopharmacology, 197(2), 339-350.
Bubar, M. J., & Cunningham, K. A. (2007). Distribution of serotonin 5-HT2C receptors in the
ventral tegmental area. Neuroscience, 146(1), 286-297.
Bubar, M. J., Stutz, S. J., & Cunningham, K. A. (2011). 5-HT2C Receptors Localize to
Dopamine and GABA Neurons in the Rat Mesoaccumbens Pathway. Plos One, 6(6),
e20508.
Cagniard, B., Balsam, P. D., Brunner, D., & Zhuang, X. X. (2006). Mice with chronically
elevated dopamine exhibit enhanced motivation, but not learning, for a food reward.
Neuropsychopharmacology, 31(7), 1362-1370.
Collier, G., & Johnson, D. F. (1997). Who is in Charge? Animal vs Experimenter Control.
Appetite, 29(2), 159-180.
Croxson, P. L., Walton, M. E., O'Reilly, J. X., Behrens, T. E. J., & Rushworth, M. F. S. (2009).
Effort-Based Cost-Benefit Valuation and the Human Brain. Journal of Neuroscience,
29(14), 4531-4541.
Di Matteo, V., Di Giovanni, G., Pierucci, M., & Esposito, E. (2008). Serotonin control of central
dopaminergic function: focus on in vivo microdialysis studies. Serotonin-Dopamine
Interaction: Experimental Evidence and Therapeutic Relevance, 172, 7-44.
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2007). Transient Overexpression of Striatal D2 Receptors Impairs Operant
Motivation and Interval Timing. The Journal of Neuroscience, 27(29), 7731-7739.
Duffy, E. (1957). The psychological significance of the concept of "arousal" or "activation.".
Psychological Review, 64(5), 265-275.
Eberle-Wang, K., Mikeladze, Z., Uryu, K., & Chesselet, M. F. (1997). Pattern of expression of
the serotonin2C receptor messenger RNA in the basal ganglia of adult rats. The Journal
of Comparative Neurology, 384(2), 233-247.
Fletcher, P. J., Tampakeras, M., Sinyard, J., Slassi, A., Isaac, M., & Higgins, G. A. (2009).
Characterizing the effects of 5-HT2C receptor ligands on motor activity and feeding
behaviour in 5-HT2C receptor knockout mice. Neuropharmacology, 57(3), 259-267.
Floresco, S. B. (2015). The Nucleus Accumbens: An Interface Between Cognition, Emotion, and
Action. Annual Review of Psychology, Vol 66, (66).
Ghods-Sharifi, S., & Floresco, S. B. (2010). Differential Effects on Effort Discounting Induced
by Inactivations of the Nucleus Accumbens Core or Shell. Behavioral Neuroscience,
124(2), 179-191.
![Page 23: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/23.jpg)
8
Gold, J. M., Waltz, J. A., & Frank, M. J. (2015). Effort cost computation in schizophrenia: a
commentary on the recent literature. Biol Psychiatry, 78(11), 747-753.
Hamill, S., Trevitt, J. T., Nowend, K. L., Carlson, B. B., & Salamone, J. D. (1999). Nucleus
accumbens dopamine depletions and time-constrained progressive ratio performance:
Effects of different ratio requirements. Pharmacology Biochemistry and Behavior, 64(1),
21-27.
Hauber, W., & Sommer, S. (2009). Prefrontostriatal Circuitry Regulates Effort-Related Decision
Making. Cerebral Cortex, 19(10), 2240-2247.
Hebb, D. O. (1955). Drives and the C. N. S. (conceptual nervous system). Psychological Review,
62(4), 243-254.
Higgins, G. A., Silenieks, L. B., Lau, W., de Lannoy, I. A. M., Lee, D. K. H., Izhakova, J.,
Fletcher, P. J. (2013). Evaluation of chemically diverse 5-HT2C receptor agonists on
behaviours motivated by food and nicotine and on side effect profiles.
Psychopharmacology, 226(3), 475-490.
Kellendonk, C., Simpson, E. H., Polan, H. J., Malleret, G., Vronskaya, S., Winiger, V., Kandel,
E. R. (2006). Transient and selective overexpression of dopamine D2 receptors in the
striatum causes persistent abnormalities in prefrontal cortex functioning. Neuron, 49(4),
603-615.
Meyer, P. J., King, C. P., & Ferrario, C. R. (2016). Motivational Processes Underlying Substance
Abuse Disorder. In E. H. Simpson & P. D. Balsam (Eds.), Behavioral Neuroscience of
Motivation (pp. 473-506). Cham: Springer International Publishing.
Nowend, K. L., Arizzi, M., Carlson, B. B., & Salamone, J. D. (2001). D1 or D2 antagonism in
nucleus accumbens core or dorsomedial shell suppresses lever pressing for food but leads
to compensatory increases in chow consumption. Pharmacology Biochemistry and
Behavior, 69(3-4), 373-382.
Palmiter, R. D. (2008). Dopamine signaling in the dorsal striatum is essential for motivated
behaviors - Lessons from dopamine-deficient mice. Molecular and Biophysical
Mechanisms of Arousal, Alertness, and Attention, 1129, 35-46.
Pasqualetti, M., Ori, M., Castagna, M., Marazziti, D., Cassano, G. B., & Nardi, I. (1999).
Distribution and cellular localization of the serotonin type 2C receptor messenger RNA in
human brain. Neuroscience, 92(2), 601-611.
Peter W. Kalivas, & Nora D. Volkow. (2005). The Neural Basis of Addiction: A Pathology of
Motivation and Choice. American Journal of Psychiatry, 162(8), 1403-1413.
Reddy, L. F., Horan, W. P., & Green, M. F. (2016). Motivational Deficits and Negative
Symptoms in Schizophrenia: Concepts and Assessments. In E. H. Simpson & P. D.
Balsam (Eds.), Behavioral Neuroscience of Motivation (pp. 357-373). Cham: Springer
International Publishing.
![Page 24: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/24.jpg)
9
Salamone, J. D. (1992). COMPLEX MOTOR AND SENSORIMOTOR FUNCTIONS OF
STRIATAL AND ACCUMBENS DOPAMINE - INVOLVEMENT IN
INSTRUMENTAL BEHAVIOR PROCESSES. Psychopharmacology, 107(2-3), 160-
174.
Salamone, J. D. (1994). THE INVOLVEMENT OF NUCLEUS-ACCUMBENS DOPAMINE IN
APPETITIVE AND AVERSIVE MOTIVATION. Behavioural Brain Research, 61(2),
117-133.
Salamone, J. D., Aberman, J. E., Sokolowski, J. D., & Cousins, M. S. (1999). Nucleus
accumbens dopamine and rate of responding: Neurochemical and behavioral studies.
Psychobiology, 27(2), 236-247.
Salamone, J. D., & Correa, M. (2002). Motivational views of reinforcement: implications for
understanding the behavioral functions of nucleus accumbens dopamine. Behavioural
Brain Research, 137(1-2), 3-25.
Salamone, J. D., Koychev, I., Correa, M., & McGuire, P. (2015). Neurobiological basis of
motivational deficits in psychopathology. European Neuropsychopharmacology, 25(8),
1225-1238.
Salamone, J. D., Pardo, M., Yohn, S. E., López-Cruz, L., SanMiguel, N., & Correa, M. (2016).
Mesolimbic Dopamine and the Regulation of Motivated Behavior. In E. H. Simpson & P.
D. Balsam (Eds.), Behavioral Neuroscience of Motivation (pp. 231-257). Cham: Springer
International Publishing.
Simpson, E. H., & Balsam, P. D. (2016). The Behavioral Neuroscience of Motivation: An
Overview of Concepts, Measures, and Translational Applications (pp. 1-12). Berlin,
Heidelberg: Springer Berlin Heidelberg.
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S., Balsam,
P. D. (2011). Pharmacologic Rescue of Motivational Deficit in an Animal Model of the
Negative Symptoms of Schizophrenia. Biological Psychiatry, 69(10), 928-935.
Treadway, M. T. (2016). The Neurobiology of Motivational Deficits in Depression—An Update
on Candidate Pathomechanisms. In E. H. Simpson & P. D. Balsam (Eds.), Behavioral
Neuroscience of Motivation (pp. 337-355). Cham: Springer International Publishing.
Ward, R. D., Simpson, E. H., Richards, V. L., Deo, G., Taylor, K., Glendinning, J. I., . . .
Balsam, P. D. (2012). Dissociation of Hedonic Reaction to Reward and Incentive
Motivation in an Animal Model of the Negative Symptoms of Schizophrenia.
Neuropsychopharmacology, 37(7), 1699-1707.
![Page 25: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/25.jpg)
10
Chapter 2
Neural Substrates Underlying Effort, Time, and Risk Based
Decision Making in Motivated Behavior
ABSTRACT
All mobile organisms rely on adaptive motivated behavior to overcome the challenges of
living in an environment in which essential resources may be limited. A variety of influences
ranging from an organism’s environment, experiential history, and physiological state all
influence a cost-benefit analysis which allows motivation to energize behavior and direct it
toward specific goals. Here we review the substantial amount of research aimed at discovering
the interconnected neural circuits which allow organisms to carry-out the cost-benefit
computations which allow them to behave in adaptive ways. We specifically focus on how the
brain deals with different types of costs, including effort requirements, delays to reward and
payoff riskiness. An examination of this broad literature highlights the importance of the
extended neural circuits which enable organisms to make decisions about these different types of
costs. This involves Cortical Structures, including the Anterior Cingulate Cortex (ACC), the
Orbital Frontal Cortex (OFC), the Infralimbic Cortex (IL), and prelimbic Cortex (PL), as well as
the Baso-Lateral Amygdala (BLA), the Nucleus Accumbens (NAcc), the Ventral Pallidal (VP),
the Sub Thalamic Nucleus (STN) among others. Some regions are involved in multiple aspects
of cost-benefit computations while the involvement of other regions is restricted to information
relating to specific types of costs.
1. Introduction
1.1. Historical/background information
![Page 26: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/26.jpg)
11
Some of the earliest laboratory studies of motivated behavior led researchers to observe
that most complex behavior tends to occur in bouts and that specific behaviors such as feeding or
grooming can be characterized by their frequency, intensity, temporal distribution and direction
towards or away from a particular stimulus. One of the prominent researchers of the day went so
far as to say that identifying the factors responsible for the initiation and termination of these
specific bouts of behavior would be the central problem for experimental psychologists to
understand (Richter, 1927). Over the years there have been numerous theories of motivation put
forth (Bolles & Moot, 1972; Hebb, 1955; Hull, 1943; Young, 1961), each of which has been
influential in stimulating what has been a continuous stream of experiments and research on this
topic. There exist excellent reviews of many of these theories and concepts (Berridge, 2004).
Almost a century later, researchers from numerous fields including psychology,
psychiatry, and neurobiology are still actively studying goal-directed motivation, which is the
name that has been given to the set of biological and psychological processes which guides
behavior in pursuit of a goal. Research in this realm of behavioral neuroscience has come a long
way toward understanding the wide array of factors which come together to modulate goal-
directed action. Neurobiologists are uncovering the widely distributed collection of neural
circuits which underlie the various aspects of goal-directed motivation. This has led to the
identification of limbic and midbrain regions including the Ventral Tegmental Area (VTA),
Nucleus Accumbens (NAcc), and Ventral Pallidum (VP) which appear to be critical for
invigorating effortful behavior. Additionally, cortical regions such as the Anterior Cingulate
Cortex (ACC) and medial Prefrontal Cortex (mPFC) are crucial for comparing costs and benefits
which becomes important when one is faced with several potential response choices. In addition
to the basic work being done in animal models, clinicians and psychiatrists using modern brain
![Page 27: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/27.jpg)
12
imaging methods have started to uncover some of the neurobiological correlates of impairments
in goal-directed motivation commonly seen in many forms of psychopathology, including
schizophrenia and depression. Currently, the unprecedented technical arsenal of neuroscience
tools available to researchers makes it an extremely exciting and fruitful time to be studying a
question which has captivated researchers for nearly a century.
1.2. Motivation: Energizing and directing behavior toward specific goals
All mobile organisms are faced with the universal challenge of living in a world in which
the resources needed for survival may be limited in number and unevenly dispersed throughout
the environment. Obtaining essential resources often requires one to overcome obstacles which
inherently contain many different kinds of costs to the organism. When seeking food, water, or
potential mates, one might be faced with any number of these costs, including: a physical
distance one must traverse, the height of an obstacle one must climb, the number of responses
one must make, or the commitment of time one must invest. Goal-directed motivation represents
the set of processes which allows an organism to weigh these costs against potential benefits of
obtaining a goal. It has been recognized by researcher for a long time that motivation serves two
important functions, as it provides both a directional influence on behavior and also has an
activational or energizing effect as well as (Duffy, 1957; Hebb, 1955); and more recent work has
started to describe the underlying neurobiological substrates of both the directional processes
(Kim, Lee, & Jung, 2013; Kimchi & Laubach, 2009) as well as activational processes (Anaclet et
al., 2009; Pfaff, Martin, & Faber, 2012) and whereas the directional component of motivation
guides behavior toward a specific goal and away from competing actions (Dickinson & Balleine,
1994), the activational component of motivation provides the energy or vigor needed to
overcome the physical costs standing between the animal and its goal. This activational influence
![Page 28: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/28.jpg)
13
on motivation is reflected in the likelihood of initiation, and the speed, vigor and persistence of
an action (Floresco, 2015; Salamone, 1992; Salamone & Correa, 2002; Salamone, Correa,
Nunes, Randall, & Pardo, 2012).
1.2.1. Directional effects of motivation
The most general way in which the concept of directional motivation is used is to say that
animals pursue positive stimuli (e.g. food, water, sex, etc.) and avoid negative stimuli (e.g.
painful conditions, predators, stress) (Salamone, Yohn, López-Cruz, San Miguel, & Correa,
2016). A more specific definition of the concept of directional motivation is the processes which
cause animals to choose one specific class of behavior to engage in at a given time over all others
(i.e. Feeding, Drinking, Mating, Aggressive Behavior, etc.). This concept proves useful in that it
allows researchers to attempt to figure out the physiological and environmental variables which
influence animals to engage in one class of behaviors over another (e.g. feeding as opposed to
drinking). This usage helps to explain observations such as when animals choose to pursue food
following a long period of food deprivation, as it is the directional influence of motivation which
leads the animal to pursue food while forgoing pursuits of other behaviors. This is unsurprising
as there are distinct neural circuits which control food seeking as opposed to something like
thirst (Kelley, Baldo, Pratt, & Will, 2005; Oka, Ye, & Zuker, 2015). There has been an extensive
amount of research aimed at understanding what circulating hormones and brain regions are
responsible for directional motivational effects for feeding (Belgardt, Okamura, & Brüning,
2009), thirst (Johnson & Thunhorst, 1997), as well as sexual behavior (Davidson, 1966), and
other social behaviors (Hong, Kim, & Anderson, 2014; Wang, Kessels, & Hu, 2014). We point
readers to recent reviews of this literature (Sternson, 2013), as an extensive discussion of these
directional effects are beyond the scope of the present review. In the present review, we focus on
![Page 29: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/29.jpg)
14
situations in which subjects are food restricted and working for food rewards (i.e. experimentally
manipulated to be directed towards food), and we examine how different types of costs a subject
must overcome to obtain the food reward alters both activational aspects of behavior and the
choice of what specific action to take to obtain reward.
1.2.2. Activational motivation effects
As animals are deprived of necessary resources their behavior changes in a number of
ways: (a) there is often an increase in general locomotor activity, (b) an increase the likelihood of
performing actions known to lead to that deprived resource, (c) and an increase in the speed,
vigor, and the persistence of these goal directed actions (Floresco, 2015; Salamone, 1992;
Salamone & Correa, 2002; Salamone, Correa, Nunes, Randall, & Pardo, 2012). These changes in
behavior are thought to reflect changes in the activational or energizing effects of motivation. It
is this activational or energizing influence of motivation which allows animals to overcome the
costs standing between them and the goal for which they are working. In this review, we focus
specifically on what is known about the neural substrates that influence how the costs of
responding affect the activational aspects of motivated behavior. We also examine what is
known about the neural machinery involved in processing information about different types of
costs that enter into the cost-benefit computation that guides choices about how to allocate effort
in situations in which there is more than one response option that could lead to the desired
resource.
1.3. Cost-benefit computations underlying motivated behavior
How does motivation properly guide an organism through the environment to overcome
obstacles and meet needs necessary for survival? Current theories suggest that animals
incorporate information from many different levels and perform cost-benefit computations which
![Page 30: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/30.jpg)
15
allow for adaptive decision making. A typical laboratory experiment in which a rat has learned to
press a lever for a food reward serves as an excellent example of how this might work. A fully
sated rat will make a very small number of lever presses for food. The few lever presses it does
make will be made slowly with many pauses in between presses, and the rat will spend a
substantial amount of time engaging in other behaviors such as exploring the chamber and
grooming itself. The same animal’s behavior will look very different when its access to food has
been restricted. Both the number of lever presses made as well as the rate/vigor of those
responses are highly correlated with the percent body weight loss induced by the food restriction
(Collier, 1969; Collier & Levitsky, 1967; Marwine & Collier, 1971). In these two scenarios the
cost of responding is constant (i.e. the same number of lever presses is required in both
situations), but the benefit or value of the food differs greatly. The difference between the cost
and the benefit of pressing in each particular condition determines the direction of behavior
(lever pressing and not exploring or grooming/etc.) as well as the intensity or vigor (response
rate of the lever presses) with which the behaviors are executed.
Research over that last 5 decades shows that there are many factors which influence the
cost-benefit decision making processes. These factors include environment factors (such as local
food availability, time of day, or temperature), an animal’s experiential history (whether it was
trained on a continuous or intermittent schedule of reinforcement), physiology (circulating
hormone levels) and internal biological clocks (e.g. location in a circadian rhythm (Antle &
Silver, 2015). Fig. 1 illustrates a conceptual model of how all of these factors might act in
concert in a hierarchical manner to modulate goal-directed motivation by influencing the
underlying cost-benefit decision making processes, and provides examples of these different
factors influencing motivation (Simpson & Balsam, 2016). As shown in this figure, this model
![Page 31: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/31.jpg)
16
posits that the physiological state of the organism, the environment, and past history/learning of
the organism interact to influence the representation of costs and benefits that determine the
specific types of behavior at any given time. Moreover, the information about the costs and
benefits are compared in a cost-benefit computation which then influences the selection and
vigor of behavior. We present Fig. 1 to suggest one possible model of how goal-directed
motivation may work, and to provide a context in which to place this review. We do not attempt
to state which brain regions are definitively involved in specific stages of the Cost/Benefit
computation process, rather we examine an array of studies which focus on the cost input to this
computation. In doing so we compare 3 different kinds of costs: Effort, Time, and Risk. We
bring together these three separate lines of investigation to identify both the overlapping and
distinct neurobiological substrates for processing these costs.
Fig. 1. Hypothetical model of factors influencing motivational cost-benefit decision making
processes. A hypothetical model of how motivation is influenced by physiological state,
environment, and past history to modulate an underlying cost benefit decision making
computation which gives direction and vigor to goal directed behavior.
1.4. Scope and purpose of the review
![Page 32: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/32.jpg)
17
The purpose of this review is to summarize and synthesize a number of varying studies
which examine different types of motivated behavior through the framework of motivated
behavior as relying on a cost-benefit computation to give rise to both the direction and vigor of
behavior. The direction and vigor of behavior represent the final behavioral output which one
can measure, and a number of studies are reviewed which have been performed to understand the
neural locations at which manipulations to the region impact either directional or activational
aspects of behavior. Additionally, we give a primary focus to studies which have examined
motivated behavior through various forms of cost-benefit decision making. In this review, we
systematically focus on studies which have manipulated one of the factors which goes into the
cost-benefit calculation: cost, as this represents one critical side of the cost-benefit computations
that guide motivated behavior. We first provide a summary of the behavioral data that
demonstrates animals’ ability to process information related to various types of costs. We then
discuss the more recent work examining the neurobiology of the activational effects of
motivation. We finish by reviewing an array of studies aimed at understanding the
neurobiological underpinnings of cost-benefit decision making by specifically focusing on
studies which employed manipulations of three types of response costs: (1) effort, (2) time, and
(3) risk. In doing so we describe studies which have employed neural manipulations such as
various types of lesions, as well as locally delivered pharmacological manipulations. While we
also discuss a number of results from systemic pharmacology studies, we have limited this to
results which further inform our understanding of the neural circuits underlying the different
behavioral processes discussed in the review.
2. Evidence of animals processing and using information about the costs going into cost-
benefit computations underlying motivation
![Page 33: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/33.jpg)
18
Motivation activates and directs behavior allowing organisms to overcome response costs
to obtain specific goals. The decision to continue exerting effort in pursuit of a goal while
neglecting other available response options is thought to be influenced by an underlying cost-
benefit decision making process. During this process, the organism is thought to use information
and knowledge of the costs of the current situation and weigh them against the anticipated
benefit the effort will ultimately result in. There is a rich history of studies from experimental
psychology in which various specific parameters of cost and benefit are manipulated which
generally show that animals can make adaptive decisions in the face of changing costs and
benefits (Atalayer & Rowland, 2009; Collier & Johnson, 1997). Additionally, there is evidence
that animals can process and use information related to different response costs, including:
distance, number, time, height, force and vigor. While an extensive literature exists on animals
cognition of distance (Gallistel, 1989), and their sensitivity to manipulations of force required in
a lever press (Ettenberg, 1989; Fowler, 1999) here, we limit the discussion to number of
responses, time, and vigor/rate of responding as they are the most commonly used manipulations
of cost in the studies covered in this review.
2.1. Number of responses as a cost
Several elegant experiments demonstrated that animals are aware of the number of
responses they have made, and that they are not only able to process this information but can also
dynamically use it to guide behavior. In these experiments subjects were trained to make lever
presses to earn rewards. In the testing phase of these experiments, subjects made lever presses on
fixed ratio schedules, but the rewards were delivered without being cued when the criterion was
reached. In two variants on this procedure, rats then had to either switch from Lever A and make
1 response on lever B to check if they received a reward (Mechner, 1958), or simply make a head
![Page 34: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/34.jpg)
19
entry to the receptacle when they thought the reward would be present (Platt & Johnson, 1971).
Rats were not only able to estimate the minimum number of responses they needed to emit
before checking for the reward, but they were actually able to use this information to guide
behavior as they were shown to be sensitive to the consequences of their errors in either direction
(checking after too few or too many presses) and were able to adjust their estimates to either
overestimate or underestimate when they had done enough depending on the contingencies of the
given situation (Platt & Johnson, 1971), reviewed in Gallistel and Gelman (1992).
Given that subjects have an awareness of how many presses they have made since the
beginning of a bout of responding, it is then perhaps unsurprising that rodents can use this
information when given a choice between working on two levers paying off after different
numbers of presses. When given a choice on two different levers with different press
requirements (whether on a Fixed Ratio or Random Ratio) subjects will allocate their responding
in a manner which matched the relative payoff between the two levers (McDowell, 2013).
2.2. Time
Animals are also sensitive to time and the temporal distribution of events (Balsam, Drew,
& Gallistel, 2010; Balsam & Gallistel, 2009). When rewards are delivered following a response
occurring after a fixed duration of time, as in a fixed-interval schedule of reinforcement, animals
are most likely to respond around the time that reward is expected (Dews, 1978). Increasing
motivation levels by increasing the probability of reinforcement on any given trial increases how
precisely animals estimate this interval (Roberts, 1981; Ward et al., 2009). Additionally, when
asked to discriminate between durations many studies have shown that a 15–20% change in
duration is easily discriminated (Gibbon et al., 1984). Thus it is not surprising that choice is
allocated based on payoff rates (McDowell, 2013) or that the relative delay to reward has a
![Page 35: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/35.jpg)
20
strong influence on response selection (Evenden & Ryan, 1996). Since all action occurs in time it
is worth noting that manipulations of response number, response duration or distance to obtain a
goal generally also involve changes in the time to reach that outcome.
2.3. Rate or vigor
Rate or vigor of responding is modulated by motivational factors such as deprivation
level and reward magnitude, e.g. response speed tends to increase as a function of reward
magnitude, whereas it decreases with increasing delays to reward, reviewed in Bitterman and
Schoel (1970). Rats are able to process information about how vigorously they are responding
and can subsequently modulate their levels of vigor when the magnitude of reward is made
dependent on response vigor. Rats taking longer to run down a runway when reward size is
increased contingent on increasing latency to reach the goal box (Logan, 1966). Similar results
have been observed with lever pressing. When reward is contingent on response speed the vigor
of the action can be both raised (Girolami, Kahng, Hilker, & Girolami, 2009; Tanno, Silberberg,
& Sakagami, 2012) and lowered (Pizzo, Kirkpatrick, & Blundell, 2009; Tanno & Silberberg,
2014).
3. Activational components of motivation
3.1. Activational component of motivation can be observed through measures of response
vigor/persistence
There are a number of different tasks which have allowed researcher to quantify changes
in response vigor/persistence. Many of the tasks which have been used involve having animals
make responses of a single type to obtain the goal (i.e. running down a runway, or responding on
a single lever). The activational component of motivation is readily observed in runway tasks as
animals run faster for a food reward as a function of the duration that they have been deprived of
![Page 36: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/36.jpg)
21
food, or as a function of the magnitude of food reward/concentration of sucrose awaiting them in
the goal-box (Bitterman & Schoel, 1970; Bower & Trapold, 1959; Goodrich, 1960; Kintsch,
1962; Knarr & Collier, 1962). Similar results have been observed in rates of lever pressing
(Collier, 1969; Collier & Levitsky, 1967; Marwine & Collier, 1971), and rates of licking for
varied sucrose concentrations (Beer & Trumble, 1965; Vogel, Mikulka, & Spear, 1968; Ward et
al., 2012). Much of the subsequent work which has examined the neurobiology and
pharmacology of activational components of motivation has been done using a lever pressing
tasks in which response cost is manipulated by varying the numbers of responses required to
produce a reward. One commonly used task is called the Progressive Ratio (PR) (Hodos, 1961).
In a PR schedule of reinforcement, the required number of responses can either be increased
within a single session from one reinforcer to the next (Hodos, 1961; Hodos & Kalman, 1963) or
can be changed between sessions over days (Czachowski & Samson, 1999), with the former
being the most widely used. In PR schedules, subjects make an increasing number of responses
until eventually they reach a breakpoint (BP), a point at which the number of lever presses is too
high for the animal to continue making responses (Hodos, 1961; Hodos & Kalman, 1963). The
breakpoint is directly related to deprivation level and incentive value/reward magnitude (Cheeta,
Brooks, & Willner, 1995; Covarrubias & Aparicio, 2008; Ferguson & Paule, 1995; Hodos, 1961;
Rickard, Body, Zhang, Bradshaw, & Szabadi, 2009; Skjoldager, Pierre, & Mittleman, 1993).
Many variants of PR have been used, demonstrating that the breakpoint is influenced by both the
absolute response requirement (Aberman & Salamone, 1999; Skjoldager et al., 1993) as well as
the step size of the ratio increase (Covarrubias & Aparicio, 2008).
3.2. The challenge of dissociating activational motivation effects from locomotor effects
![Page 37: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/37.jpg)
22
While PR schedules have been used extensively to study motivated behavior, use of this
task alone has made it challenging to discern whether increases or decreases in breakpoints
represent changes in activational motivation OR an increase in non-goal directed general
activity. If an animal is more hyperactive and makes all types of motor responses more rapidly
this may also lead them to make many more lever presses in a similar amount of time.
Conversely, if a manipulation has caused locomotor slowing and an animal makes all types of
motor responses more slowly this could lead to making fewer responses purely due to a motor
deficit. Many of the drug treatments and genetic manipulations which have been shown to
increase or decrease breakpoints in a PR schedule also lead to a corresponding increase or
decrease in locomotor activity in an open field test (Aberman & Salamone, 1999; Antoniou,
2005; Cagniard, Balsam, Brunner, & Zhuang, 2006; Hall, Stanis, Avila, & Gulley, 2008;
Kellendonk et al., 2006; Mayorga, Popke, Fogle, & Paule, 2000; Randall et al., 2012; Sanders,
Hussain, Hen, & Zhuang, 2007; Simpson et al., 2011; Simón et al., 2000; Zhuang et al., 2001).
This correlation between PR performance and locomotor activity points out the challenge of
being able to distinguish activational motivation effects from locomotor effects when using just
one measure of motivated behavior. This challenge as one investigator put it is making, ‘‘The
distinction between motor deficits (wants to but cannot) and motivation deficits (can but does not
want to)” (Wise, 2008). To this, we add the opposite problem of no change in motivation (wants
the reward to the same degree), but an animal is in a general hyperactive state which leads to
making all types of behaviors (those which are goal directed as well as those which are not) at a
faster rate which may make the animal appear to want the reward more. While there is no perfect
solution to this challenge to date, we attempted to address the issue by developing methods for
![Page 38: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/38.jpg)
23
studying motivated behavior by altering the type of work requirements making rate of initiation
unrelated to the level of wanting the reward.
3.3. A strategy for dissociating changes in non-goal specific locomotor output from changes
in activational motivation and the willingness to perform goal-directed work
To address the challenge presented when trying to distinguish motivational changes from
general locomotor changes in behavior our lab developed a novel task known as the progressive
hold down (PHD) task (Bailey et al., 2015), which was specifically designed to make
hyperactive motor behavior incompatible with increased willingness to work. In the classic PR
task, subjects must make more lever presses in order to earn each subsequent reward (Fig. 2A).
Unlike the classic PR task where the increasing work requirement is an increasing number of
responses, in the PHD task the increasing work requirement is the duration of time a subject is
required to maintain the lever in the depressed position during a single lever press. Thus, subjects
are required to make single lever holds (maintaining the lever in the depressed position) for
increasing durations of time in the PHD task in order to keep earning rewards (Fig. 2B). This
task intentionally makes increased goal-directed action and increased general locomotor arousal
incompatible with one another as hyperactive lever pressing will continually reset the duration of
each rapidly emitted press.
In an examination of this novel method, we first tested the manipulations of food
deprivation and reward magnitude to see how these variables influenced behavior in the PHD
task. Hungry mice worked for more rewards and reached higher breakpoints before quitting. The
increased breakpoints in this task meant that hungry subjects were making lever holds of
substantially longer durations. In a similar manner, subject’s willingness to work for rewards and
breakpoints increased as a function of reward magnitude when working for sucrose solutions of
![Page 39: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/39.jpg)
24
increasing concentration. The observation that increasing food deprivation levels and increasing
reward magnitude led to increases in BP’s in both the classic PR schedules (Skjoldager et al.,
1993), as well as the BP in our PHD task (Bailey et al., 2015), suggests that these manipulations
are impacting some central motivational mechanism which makes animals more willing to work
for rewards regardless of the specific modality of the work (i.e. pressing versus holding).
Fig. 2. Effects of methamphetamine in a PR and PHD Task. (A and B) A schematic
representation of the Progressive Ratio Task (A) and the Progressive Hold Down Task (B). The
yellow bars represent lever presses in (A), and lever holds in (B). The red arrows signify
![Page 40: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/40.jpg)
25
rewards. (C). IP administration of 1.0 mg/kg of methamphetamine leads to significant increases
in lever pressing and breakpoint in a progressive ratio schedule of reinforcement. (D). IP
administration of 1.0 mg/kg of methamphetamine leads to significant increase in the number of
hold attempts in a PHD task, but does not lead to a significant increase in the highest duration
requirement completed (BP). (E). Data from a single subject treated with methamphetamine or
vehicle in the PHD task demonstrates the increased number of responses which occur while on
the drug, but the responses are inefficient and of shorted durations that required by the schedule.
Data in (C–E) from Bailey et al., 2015. (For interpretation of the references to colour in this
figure legend, the reader is referred to the web version of this article.)
As a test of this strategy of examining behavior in both a classic lever pressing PR
alongside the lever holding PHD, we tested subjects who had been treated with
methamphetamine (Meth) in both tasks. As shown in Fig. 2C–E, Meth treated subjects made
more lever presses and had a higher breakpoint in the classic PR task. The subjects tested
following treatment with Meth in the PHD task, however, did not show increases in long
duration goal-directed presses, but showed an increase in rapidly initiated short duration hold
attempts which were ineffective in the PHD test (Bailey et al., 2015). Unlike manipulations such
as food deprivation and increasing the reward magnitude, Meth only increased the BP in the
classic PR. We interpret the increase in ineffective short duration responses in the PHD task as
reflecting Meth’s ability to enhance hyperactive motor output (which is important for initiating
repeated numbers of responses). We also interpret the results to mean that Meth is not acting on a
central motivation mechanism which would increase willingness to perform any type of work as
is the case when animals are hungry vs sated. Thus, the PR appears to be a good measure of
arousal, but cannot by itself dissociate goal-directed action from arousal or increases in general
motor activity. Additional experiments have recently shown that selective inactivation of the
dopamine D2 receptor expressing neurons in the indirect pathway of the striatum results in a
similar increase in arousal and activation in the PR and overall locomotor activity in an open
![Page 41: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/41.jpg)
26
field, but at the cost of decreased efficiency as a result of bursts of short duration rapid responses
in the PHD (Carvalho Poyraz et al., 2016).
3.4. Neurobiology of activational components of motivation
There have been a large number of studies which have examined different brain regions
and neurotransmitters involved in vigorous effortful responding in operant lever pressing tasks.
Many of these have used the PR to assess vigor or persistence in responding (Table 1).
3.4.1. The NAcc and mesolimbic dopamine
A wealth of evidence implicates mesolimbic dopamine pathway, which consists of the
dopamine neurons located within the VTA which project to the NAcc, in behavioral activation
and energy expenditure (Salamone, 1992; Salamone et al., 2012). Specifically, reducing
dopamine levels in the mesolimbic pathway suppresses general locomotor activity (Maldonado-
Irizarry & Kelley, 1994; Wu, Brudzynski, & Mogenson, 1993), as well as novelty-induced
locomotion (Baldo, Sadeghian, Basso, & Kelley, 2002; Cousins, Sokolowski, & Salamone, 1993;
Koob, Riley, Smith, & Robbins, 1978). The effects of dopamine antagonist within the NAcc also
![Page 42: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/42.jpg)
27
impacts goal-directed locomotion as intra-NAcc dopamine antagonists lead to both increased
latency to run down a runway maze and reach a goal box containing food reward (slowing of
reward approach) as well as reductions in spontaneous locomotion in the start box (Ikemoto &
Panksepp, 1996). Moreover, the readily observed increases in numerous different types of
activity which develop following the scheduled presentation of food (excessive drinking,
voluntary wheel running, and locomotion) are all correlated with increases in mesolimbic
dopamine signaling (McCullough & Salamone, 1992), and NAcc dopamine depletions suppress
these behaviors (McCullough & Salamone, 1992; Robbins & Koob, 1980; Wallace, Singer,
Finlay, & Gibson, 1983). These observations resulted in the development of a number of
different genetic models which alter dopamine signaling. A dopamine transporter knockdown
mouse (DAT KD) shows elevated open field activity (Cagniard et al., 2006), and cell type
specific loss of D1/D2 receptors have been shown to induce hypoactivity or hyperactivity with
numerous different manipulations of these cell types (Kreitzer & Berke, 2011).
In addition to the studies on the locomotor activating effects of the mesolimbic dopamine
pathway, there has been a specific focus of the role of this pathway on motivated responding in
tasks which offer a single response choice and provide a measure of vigor or behavioral
activation. Dopamine neurons which project to the NAcc have been found to be important for
effortful responding. Early observations indicated that when a rat was lever pressing for food on
a fixed ratio-1 (FR-1), levels of dopamine and DOPAC increased within the NAcc (McCullough,
Cousins, & Salamone, 1993).
Subsequent studies showing lesions of dopamine neurons with 6-hydroxydopamine (6-
OHDA) projecting to either the NAcc Core or NAcc Shell had little impact on a behavior in an
FR-1 schedule, a schedule with a low effort requirement (Salamone & Correa, 2002; Salamone,
![Page 43: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/43.jpg)
28
Correa, Mingote, & Weber, 2005). Furthermore, disruption of dopamine in either the NAcc Core
or NAcc Shell also had no impact in a VI-30 schedule, which requires subjects to wait an
average of 30 s before making a reinforced press (Sokolowski & Salamone, 1998). However,
when the response cost was increased to an FR-05 schedule disruption of dopamine signaling to
the NAcc Core, was found to impair responding, but dopamine depletion in the NAcc Shell did
not have any effect (Sokolowski & Salamone, 1998). Additionally, there was a correlation
between the number of presses made and the amount of dopamine present in the NAcc Core, but
not the shell (Sokolowski & Salamone, 1998). Subsequent studies further explored the impact of
dopamine depletions in the NAcc Core across several different fixed ratio schedules (FR-01, 05,
10, 16, 32). In these studies, NAcc Core dopamine depletions reduced the amount of responding,
and this reduction was greater in the higher FR schedules (Aberman & Salamone, 1999). This
schedule dependent decrease in responding differs from that seen following pre-feeding
manipulations, as prefeeding leads to reductions in responding across all schedules, not just the
more demanding ones (Aberman & Salamone, 1999). Further studies tested the effects of NAcc
dopamine depletion in a time constrained PR and showed that DA depletions decreased
breakpoints at both a PR+1 and PR+5, with the impairments being more marked in the more
demanding schedule (Hamill, Trevitt, Nowend, Carlson, & Salamone, 1999).
Taken together, these studies indicate that dopamine depletion appears to affect an
animal’s willingness to expend effort to earn a reward. The recognition of the specific
involvement of dopamine signaling within the NAcc Core spurred lots of research on the
dopamine receptor subtypes important for effort expenditure in these tasks. Numerous studies
demonstrated that dopamine D1 or D2 receptor antagonists reduce responding in a PR (Aberman,
Ward, & Salamone, 1998; Caul & Brindle, 2001; Cheeta et al., 1995; Olarte-Sanchez, Valencia-
![Page 44: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/44.jpg)
29
Torres, Cassaday, Bradshaw, & Szabadi, 2013), whereas drugs which can increase synaptic
dopamine levels, such as amphetamine, increase breakpoints (Bailey et al., 2015; Mayorga et al.,
2000; Sommer et al., 2014). Local administration of either the D1 antagonist (SCH-23390) or D2
antagonist (eticlopride) into the NAcc Core decreased lever presses for food in a PR schedule,
but neither drug had any impact when infused into the NAcc Shell (Bari & Pierce, 2005). Both
the dopamine depletion and localized drug infusion studies suggest that the activating effects of
mesolimbic dopamine signaling appear to be quite specific to the NAcc Core. Greater
understanding of the nature of the deficit induced by NAcc Core dopamine manipulations has
been revealed by more careful examination of the within session data for tasks in which
dopamine depletion or antagonisms has an impact on behavior. The impact of NAcc Core lesions
in the FR5 task was primarily seen through slower responding, which resulted from longer
interresponse- times (IRT’s) in the FR-05 schedule (Sokolowski & Salamone, 1998), and slower
response rates and longer post reinforcement pauses in a PR schedule (Bezzina, Body, Cheung,
Hampson, Bradshaw et al., 2008). Nicola (2010) conducted a detailed behavioral analysis of the
effects of intra-NAcc dopamine antagonism on different types of behaviors which further
elucidate the nature of the within session behavioral changes. In a task which cues rats to make
either 1 lever press or 8 lever presses for a reward (cued FR1 and cued FR8), it was shown that
dopamine D1 and D2 antagonists both impair subjects ability to earn rewards, and that the
primary influence of the drugs is to increase latencies to begin lever pressing when the animals
are currently in a non-responding state (Nicola, 2010). Additionally, the subjects are more likely
to be engaged in non-task related behaviors, and the latency to make a lever press (i.e. reengage
in task related behavior) is independent of the class of responses subjects are engaged in
(immobile resting, random locomoting, or grooming), which suggests that dopamine disruption
![Page 45: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/45.jpg)
30
in the NAcc Core may be impacting motivation by disrupting the initiation of ‘‘flexible approach
behavior”.
3.4.2. Ventral tegmental area
Much like the locomotor effects induced by blocking NAcc dopamine, more recent
studies which have used DREADD (designer receptors exclusively activated by designer drugs)
methods to inactivate VTA dopamine neurons showed that this lead to suppression of general
locomotor activity (Marchant et al., 2016). Far fewer studies have looked at the influence of the
VTA in PR responding to see how this area impacts arousal/vigor processes of motivation. It is
known, however, that both dopamine D1 and D2 receptors are important for the VTA’s influence
on motivated responding. In one study, it was found that localized infusions of the D1 receptor
antagonist (SCH 23390) into the VTA lead to a decreased breakpoint in a PR (Sharf, Lee, &
Ranaldi, 2005). In another study, reducing the expression of D2 receptors in the VTA via shRNA
knockdown lead to increased breakpoints for food in a PR, but did not impact baseline locomotor
activity, fixed ratio responding, or responding in extinction (de Jong et al., 2015). The
observation that decreasing D2 receptor levels within the VTA enhances motivation is in line
with the finding that food deprived rats have lower levels of D2 receptor expression in the VTA
relative to ad lib fed rats (Skibicka et al., 2013).
There have also been a number of other receptors on neurons within the VTA which have
been examined. While an extensive discussion of all of these is beyond the scope of this review
we highlight the role of ghrelin in the VTA, as it’s effects on motivated responding appear to be
directly modulated through NAcc dopamine signaling. Ghrelin is a circulating hormone which
promotes both food intake as well as motivated responding for food. Studies have shown that
both systemic injections of ghrelin or intra-VTA ghrelin enhance food responding in a PR
![Page 46: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/46.jpg)
31
(Naleid, Grace, Cummings, & Levine, 2005; Perello et al., 2010; Skibicka, Hansson, Alvarez-
Crespo, Friberg, & Dickson, 2011; Skibicka, Shirazi, Hansson, & Dickson, 2012). This effect of
ghrelin has been shown to act by modulating the VTA’s dopamine output to the NAcc. Lesion of
VTA dopamine neurons via 6-OHDA, suppresses ghrelin’s ability to increase responding on a
PR (Weinberg, Nicholson, & Currie, 2011). Moreover, pretreatment with either a D1 or D2
receptor antagonist in the NAcc blocks intra VTA ghrelin’s ability to increase BP in a PR for
food rewards (Skibicka et al., 2013).
3.4.3. The dorsal striatum and nigrostriatal dopamine
Another dopaminergic pathway in the brain, known as the nigrostriatal pathway, consists
of dopaminergic neurons in the Substantia Nigra (SN) and projects to the dorsal striatum. While
the role of the NAcc and mesolimbic dopamine signaling in PR schedules has been extensively
studied, there has been a smaller amount of work examining the dorsal striatum and nigrostriatal
pathways. An early study lesioned cell bodies within the Dorsomedial (DMS) and Dorsolateral
Striatum (DLS) via quinolinic acid observed that lesions to both regions failed to alter
breakpoints in a PR schedule, but destruction of these regions did have some impact on other
aspects of motor performance in the PR (Eagle, Humby, Dunnett, & Robbins, 1999). There was
an increase in the number of preservative presses as well as the latency to get to the food hopper
when a reward was delivered. Worth noting is that these lesions in the dorsal striatum destroyed
cell bodies via quinolinic acid. We are not aware of any studies which examined dopamine
specific depletion in the dorsal striatum via 6-OHDA as was done in the studies mentioned above
that focused on the NAcc.
Investigators have also examined the influence of the Substantia Nigra in motivated
behavior. One study looked at the effect of inactivating the Substantia Nigra pars reticulata (SNr)
![Page 47: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/47.jpg)
32
during FR- 05 responding and found that infusions of the GABAA antagonist bicuculline
resulted in a dose-related decrease in lever pressing. Additionally, GABA levels within the
region were higher during the lever pressing than during baseline periods before the operant
responding (Correa, Mingote, Betz, Wisniecki, & Salamone, 2003). Another study looked at the
Substantia Niagra pars Compacta (SNc) on motivated behavior. This study induced partial
lesions to the SNc which didn’t disrupt overall locomotor behavior resulted in decreased lever
pressing in a PR for sucrose rewards, but these same effects were not observed with partial
lesions of the VTA (Drui et al., 2013). Additional evidence for the role of the Nigrostriatal DA
system in motivated behavior comes from a recent study using a novel operant joystick based
task. Subjects were head fixed and required to move the joystick at a given rate to earn a reward.
MitoPark mice, which have progressive loss of SN to DA dopamine neurons (Ekstrand et al.,
2007), show impairments in this task, and optogenetic inhibition of the nigrostriatal pathway
induced similar impairments such that subjects took longer to complete a criterion number of
trials. Electrophysiological recording from the neurons in this region showed that DMS neurons
appear to be both representing and controlling movement vigor in this task (Panigrahi et al.,
2015). This is in line with another recent study which used a self-paced nose poking paradigm to
demonstrate that overall reward payoff expectancy as well as response vigor appear to be
represented in the DS (Wang, Miura, & Uchida, 2013).
3.4.4. Genetically induced dopamine receptor manipulations
There have been a number of genetically modified mouse lines which have allowed
researchers to examine the role of specific dopamine receptors in different brain regions. These
studies have examined the effects of alterations in the levels of expression of dopamine
receptors.
![Page 48: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/48.jpg)
33
It has been shown that developmental overexpression of the dopamine D2 receptor (D2R-
OE) within the striatum (Kellendonk et al., 2006) leads to an impairment in PR responding
(Drew et al., 2007; Simpson et al., 2011; Ward et al., 2012). This genetic model is developmental
and D2R overexpression continues into adulthood. In contrast, viral vector mediated
manipulations were developed which allow D2 receptors to be expressed at much higher levels
selectively in adulthood (Trifilieff et al., 2013). While viral over-expression of the D2 receptor in
the NAcc led to increased PR responding, this same effect was not observed when the over-
expression was in the dorsal striatum (Trifilieff et al., 2013). As well as the difference in D2R
overexpression during development, another important difference between the two models is that
the viral D2 receptor over-expression is not restricted to the MSN’s (as it is in the developmental
D2R-OE model). The dopamine D3 receptor also appears to be involved in the activational
aspects of motivation as a developmental genetic model of dopamine D3 receptor overexpression
which is restricted to the striatum also showed decreases in lever press behavior in a PR
(Simpson et al., 2014).
3.4.5. Other brain regions modulating response vigor in a progressive ratio
There have been a number of brain regions in addition to the striatum and midbrain
which have been implicated in motivation (McGinty et al., 2011), and have been investigated to
determine their contribution to responding in a PR schedule. We briefly describe several of these
other areas that are also involved in other motivational processes to be discussed later in the
review.
3.4.6. Ventral pallidum
Another brain region which is thought to be involved in activational aspects of
motivation is the Ventral Pallidum (VP), a region which receives GABAergic projections from
![Page 49: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/49.jpg)
34
the NAcc (Root, Melendez, Zaborszky, & Napier, 2015). While we are unware of any studies
which have manipulated the VP and specifically looked at PR responding, a number of studies
have established the role of the VP in the motivation to eat and drink as damage to this region
leads to a failure to voluntarily consume food and water, reviewed in Root et al. (2015) and
Smith, Tindell, Aldridge, and Berridge (2009).
3.4.7. Cortical structures
Lesions studies of the prefrontal cortex have demonstrated that different sub-regions of
the PFC contribute to PR performance. One study lesioned several cortical structures and found
dissociations between a number of these regions. Lesions to the pre-limbic cortex (PL) were
shown to decrease breakpoints in a PR schedule, and the same was found with lesions of the
lateral orbitofrontal cortex (lOFC). Lesions of the medial orbitofrontal cortex (mOFC) lead to
increased responding and increased breakpoints in one study (Gourley, Lee, Howell, Pittenger, &
Taylor, 2010), but not another (Kheramin et al., 2005). Studies in which dopamine antagonists
are locally infused into the mOFC suggests that this region is indeed involved in modulating PR
performance as infusions of either the D1-receptor antagonist (SCH23390) or the DA D2-
receptor antagonist (sulpiride) lead to reductions in the breakpoint in a PR while leaving food
preference and consumption unchanged (Cetin, Freudenberg, Fuchtemeier, & Koch, 2004). The
ACC is a region which also receives dopaminergic projections from the VTA, and has reciprocal
connections with the NAcc, however an experiment which lesioned the ACC did not find any
effect of the lesions on the breakpoint in a PR (Judith Schweimer, Saft, & Hauber, 2005).
3.4.8. Hippocampus
The hippocampus is another region which has received a small amount of attention for its
role in motivated behavior assessed with a PR schedule. Lesions to the ventral hippocampus, (an
![Page 50: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/50.jpg)
35
area which projects to the mOFC) was shown to increase BP in the PR. In another study,
neonatal ventral hippocampal lesions were shown to increase the BP’s of rats when tested in a
PR in adulthood (Chambers & Self, 2002).
3.4.9. Sub thalamic nucleus
The sub thalamic nucleus (STN) is a basal ganglia nucleus which sends glutamate
projections most densely to the pallidal complex and the SNr, and less dense connections to the
striatum and SNc (Parent & Hazrati, 1995). Rats with lesions to the STN showed higher
breakpoints in a PR (Baunez, Amalric, & Robbins, 2002; Bezzina, Body, Cheung, Hampson,
Bradshaw et al., 2008), and another study found that discrete lesions of the STN increased
responding for liquid sucrose rewards in a PR, but greatly decreased the motivation of rats for
cocaine (Baunez, Dias, Cador, & Amalric, 2005).
3.4.10. Summary
There are a number of different structures which regulate behavioral activation and
locomotor output in goal-directed responding in a progressive ratio task (Fig. 3A). The
mesoaccumbal dopamine system (VTA and NAcc), the nigrostrial dopamine system (SN and
DS), as well as the subthalamic nucleus, ventral hippocampus, and a number of prefrontal
cortical regions (PL and mOFC), all modulate behavior in a PR schedule. It is also clear that the
NAcc core, innervated by dopaminergic neurons from the VTA, plays an important role in the
activational aspects of motivation (Bari & Pierce, 2005; Bezzina, Body, Cheung, Hampson,
Deakin, et al., 2008; Hamill et al., 1999). The NAcc shell, however, does not appear to be
important for this activational process (Bari & Pierce, 2005; Sokolowski & Salamone, 1998).
Additionally, the SN, and DS also appear to be involved in some aspects of this activational
component of motivated behavior (Drui et al., 2014). Specifically, recent experiments which
![Page 51: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/51.jpg)
36
monitor in vivo activity within the dorsal striatum in awake behaving animals performing
motivated operant behavioral tasks have demonstrated that this region appears to be important
for representing and modulating response vigor (Panigrahi et al., 2015; Wang et al., 2013).
Finally, the role of, various PFC structures (Cetin et al., 2004; Gourley et al., 2010), the
hippocampus (Chambers & Self, 2002; Gourley et al., 2010), and the STN (Baunez et al., 2002;
Bezzina, den Boon et al., 2008) appear to contribute to the activational aspects of motivated
responding.
Fig. 3. Brain regions which impact performance on a progressive ratio and effort based choice.
(A) Brain regions which have been studied to examine their involvement in PR performance
through lesion, inactivation, or localized drug infusion studies which have been shown to
modulate PR behavior (Orange), have no effect on PR behavior (Grey), or have yet to be
examined (White). Areas which have been shown to modulate PR behavior include: the VTA, the
Ventral hippocampus, the SN pars reticulata and SN par compata, the NAcc Core, the OFC, and
the PR/IL cortex. Areas which have been studies, but damage or inactivation had no impact on
PR performance include: ACC, and NAcc Shell. All other areas have not been studied with a PR
task: VP, DS. IL, Hipp. (B) Brain regions which have been studied to examine their involvement
![Page 52: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/52.jpg)
37
in effort based choice performance through lesion, inactivation, or localized drug infusion studies
which have been shown to modulate effort choice behavior (Green), have no effect (Grey), or
have yet to be examined (White). Areas which have been shown to modulate effort based choice
include: the VTA, NAcc Core, NAcc Shell, VP, ACC. Areas which have been studies, but
damage or inactivation had no impact include: OFC, PL, IL. Areas which have yet to be studied
include: DS, STN, SN, Hipp, vSyb. (For interpretation of the references to colour in this figure
legend, the reader is referred to the web version of this article.)
4. Neurobiology of cost-benefit decision making: Manipulations of different costs
Over the last several decades a substantial amount of progress has been made toward
understanding of the neural circuits involved in various aspects of cost-benefit decision making
(Table 2). Many of the studies involved tasks which require subjects to make a choice between
different response options. In these paradigms, animals have been faced with alternatives
associated with different costs - differences in the effort requirements, the time delay from
response choice to reward delivery, and the probability of reward of each option. As will be
described in more detail below, what has emerged as a result of this work is an increasing
understanding of the key neural structures and neurotransmitters involved in different forms of
cost-benefit decision making. Within this distributed neural circuitry, several neurotransmitters
have been found to be involved in the modulation of different types of decision making. Below,
we discuss the specific brain regions and neurotransmitters involved in the cost-benefit decision
making process in studies that manipulate effort requirements, time delays, and/or probability of
reward associated with different choice alternatives.
4.1. The choice between two effort options
There have been a number of tasks which have been developed to study effort based
decision making which give animals a choice between 2 effort alternatives (high vs low) for 2
different types or amount of reward (high vs low). The development of these tasks has been
important because it allows researchers to determine whether the critical functioning of
![Page 53: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/53.jpg)
38
dopamine in the NAcc Core and associated circuits are involved in processes related to effort
expenditure and not the result of a dopamine related motor effects which enhance or impair an
animal’s capacity to make a particular response. Below, we provide a summary of the different
tasks employed and the findings which each have allowed researchers to make.
4.1.1. Concurrent lever pressing/chow feeding task
One behavioral task which was developed to study effort based decision making is an
operant lever pressing task often referred to as either a Concurrent Lever Pressing/Chow Feeding
task or Effort- Based Choice Task (EBCT) (Salamone, 1991). We will hereafter refer to the task
as the EBCT. In the EBCT testing sessions, subjects make a choice between lever pressing on a
given schedule to earn a preferred reward (i.e. sucrose pellets or evaporated milk) or consume a
freely available, but less preferred home cage chow. Rats and mice will earn most of their food
in the task by lever pressing for the preferred reward (Salamone, 1991). As the effort to earn the
preferred reward is increased subjects choose to press the lever less frequently and consume
more of the freely available chow. This task has been useful in assessing willingness to expend
effort for a preferred reward because pre-feeding subjects or giving them appetite suppressants
leads to decreases in both lever pressing and chow consumptions (Randall et al., 2012, 2014;
Salamone, 1991; Salamone, Arizzi, Sandoval, Cervone, & Aberman, 2002; Sink, Vemuri,
Olszewska, Makriyannis, & Salamone, 2008), whereas a reduction in willingness to work is
reflected in less lever pressing and more consumption of the freely available choice.
![Page 54: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/54.jpg)
39
4.1.1.1. NAcc dopamine and extended circuitry.
Dopamine signaling in the NAcc is important in the EBCT, as 6-OHDA lesions in the
NAcc Core decreased the number of lever presses made in the EBCT and lead to an increase in
chow consumption, whereas the same lesions in the NAcc shell had a smaller impact (Salamone,
1991; Sokolowski & Salamone, 1998). Subsequent work has shown that both dopamine D1
(SCH 23390, SKF83566, and Ecopipam) and D2 (Haloperidol, cis-flupenthixol, raclopride,
eticlopride) receptor antagonists produce similar shifts from lever pressing to consuming the less
preferred freely available chow when injected systemically, or directly into the NAcc Core or
Shell (Cousins & Salamone, 1994; Farrar et al., 2010; Koch, Schmid, & Schnitzler, 2000;
![Page 55: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/55.jpg)
40
Nowend, Arizzi, Carlson, & Salamone, 2001; Salamone, 1991; Salamone et al., 2002; Sink et al.,
2008; Worden et al., 2009).
A series of experiments subsequently demonstrated the importance of the connection
between the NAcc Core and Ventral Pallidum (VP) in regulating effort-based choice. Injections
of the GABAA receptor agonist muscimol into the VP decreases lever pressing and increases
consumption of the freely available chow in the same manner as NAcc Core dopamine depletion
(Farrar et al., 2008). Retrograde tracers injected into the same region of the VP in which
muscimol caused a shift in effort-based choice behavior confirmed that NAcc Core was an input
to the VP. While the VP also received input from the DS, it was previously shown that dopamine
depletion in this region did not impact lever pressing or chow consumption (Farrar et al., 2008).
Further studies revealed that both systemic treatment and intra-NAcc infusions of the Adenosine
2A receptor (A2aR) agonist (CGS 2168) could produce a decrease in lever pressing and increase
in free chow consumption (Font et al., 2008). The effect of the A2aR agonist drug occurs
through the GABA-ergic pathway between the NAcc and VP as CSG 2168 leads to an increase
in GABA release in the VP (Mingote et al., 2008). Finally, the importance of the NAcc-VP
projection was demonstrated in a circuit disruption experiment in which CSG 2168 was infused
unilaterally into the NAcc and muscimol infused into the contralateral VP and resulted in
decrease in lever pressing and an increase in chow consumption (Mingote et al., 2008) though
ipsilateral infusions did not.
Following up on the observation that the A2aR agonist CSG 2168 could reduce lever
pressing and increase chow consumption, it was later found that systemic treatment with an
A2aR antagonist rescued a D1/D2R antagonist induced decrease in the choice of an effortful
response (Salamone & Correa, 2009). Moreover, this effect appears to be selective to the A2A
![Page 56: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/56.jpg)
41
receptor as opposed to the A1a receptor as both A2a selective antagonists and Caffeine (a
nonspecific A2a and A1a antagonist) rescue the dopamine antagonist impairment, whereas an
A1A selective antagonist does not (Salamone & Correa, 2009; Worden et al., 2009).
4.1.2. Operant effort discounting
Another task used to assess effort-based choice is known as the Effort Discounting task
(Floresco, Tse, & Ghods-Sharifi, 2008). In this task, subjects have the option to make lever press
responses on a Low-Effort/Low-Reward lever (LR) (e.g.1 press leads to 2 pellets or a High-
Effort/High-Reward lever (HR) (e.g. 5, 10, 20, or 40 presses leads to 3 pellets). The requirement
on the HR lever is increased over the course of a session. In this paradigm, well trained rats tend
to earn all of their rewards on the HR lever when the requirement is 5 presses, and make fewer of
the higher effort lever choices as the cost requirement is increased throughout the session.
4.1.2.1. NAcc dopamine, the basolateral amygdala, and the anterior cingulate cortex.
A number of systemic pharmacology studies have implicated dopamine’s involvement in
the Effort-Discounting task, similar to that which is seen in the EBCT. The non-selective
dopamine receptor antagonist flupenthixol reduced choice on the HR lever in an Effort-
Discounting task (Floresco et al., 2008). Moreover, the D1 (SCH23390) and D2 (Eticlopride)
receptor antagonists were also shown to decrease the number of choices of the HR lever (Jay G.
Hosking, Floresco, & Winstanley, 2015; Randall et al., 2012, 2014). Using a variant of this
procedure in which rats could lever press on a high effort lever (FR12) for high reward (4
pellets) vs a low effort lever (FR4) for low reward (2 pellets), it was shown that the D2 receptor
antagonists (haloperidol), caused rats to shift to the low effort lever (Walton et al., 2009).
Whereas dopamine antagonists reliably cause subjects to shift to make more responses on the
low effort/low reward lever, amphetamine was found to exert a bi-phasic dose dependent effect
![Page 57: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/57.jpg)
42
on effort choice. At low doses subjects made more responses on the high effort large reward
lever, whereas at high doses subjects made fewer responses on the high effort large reward
compared to vehicle treated subjects (Floresco et al., 2008). These effects of dopaminergic drugs
appear to be mediated via the NAcc Core sub region as local blockade of GABAA and B
receptors decreases the selection of the HR lever under both standard and equivalent delay
conditions, whereas the same effect was not seen when the blockade occurred in the NAcc Shell
(Ghods-Sharifi & Floresco, 2010). A control experiment demonstrated that the inactivation of the
NAcc Core did not alter the preference for 4 vs 2 pellets when the press requirement for each is
equivalent. Interestingly, dopamine’s impact on effort appear to be specific to physical as
compared to cognitive effort, as Eticlopride and SCH23390 decreased willingness to expend
physical effort, but had no effect on cognitive effort in a novel rodent cognitive effort task which
allowed subjects to choose between an easy and difficult discrimination for small vs larger
rewards (Hosking et al., 2015).
4.1.2.2. Basolateral amygdala.
Using the operant effort-discounting paradigm, it has also been shown that the BLA is
also involved in effort based decision making. Infusions of the GABAB agonist baclofen and the
GABAA agonist muscimol combined into the BLA increased effort discounting, reducing the
preference for the HR lever, even in conditions in which the delays to reward delivery were
equalized across response conditions (Ghods-Sharifi, Onge, & Floresco, 2009). Additional
evidence of the BLA’s involvement in effortful behavior comes from a study by Simmons et al.
(2009) which showed that bilateral inactivation of the BLA with muscimol reduced lever
pressing on an FR15 schedule while leaving consumption of food in a separate free consumption
test unchanged (Simmons & Neill, 2009). This study found that the connection between the BLA
![Page 58: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/58.jpg)
43
and the NAcc Core appears to be important as inactivation of the NAcc Core as well as a
contralateral inactivation procedure of the BLA and NAcc Core reduced lever pressing and left
food consumption unaltered in a separate chow consumption test (Simmons & Neill, 2009).
4.1.2.3. Anterior cingulate cortex.
The ACC was lesioned and subjects were tested in the EBCT, but lesioning of ACC had
no effect on either the number of lever presses made or the amount of chow consumed
(Schweimer & Hauber, 2005). In a different experiment, when rats were given the choice to lever
press on a high effort lever (FR12) for high reward (4 pellets) vs a low effort lever (FR4) for low
reward (2 pellets) lesions to the ACC caused a shift in responding from the high effort lever to
the low effort lever (Walton et al., 2009). Moreover, lesions to the ACC were also shown to
decrease willingness to expend cognitive effort in the variant of the task which allowed subjects
to choose between an easy and difficult discrimination for small vs larger rewards (Hosking,
Cocker, & Winstanley, 2014).
4.1.3. T-arm barrier maze
The effort based choice paradigms which have been discussed so far have both involved
continuous availability of choices in which at least one alternative involved operant lever
pressing. Consequently, there is a specific motor element to these tasks as subjects must be able
to repeatedly initiate responses for the high effort option and the exact times at which the options
are being compared is unknown. To evaluate cost-benefit decision making with a different motor
response in a task which isolated the decision point to a single action, a task was developed
known as the TArm barrier maze (Salamone, Cousins, & Bucher, 1994). In this task, subjects are
required to navigate down a T Maze, and choose to go either to the right arm or left arm in order
to obtain a reward. In this paradigm, there is a high reward and low reward arm (e.g. 2 pellets vs
![Page 59: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/59.jpg)
44
4 pellets; respectively). In the absence of any barriers, subjects will choose the high reward arm
almost all of the time. When there is a barrier placed in the high reward arm that requires
additional effort to reach the reward, subjects will still choose this high reward arm about 80% of
the time.
4.1.3.1. Nucleus accumbens dopamine.
Much like the studies of effort based choice in the operant lever pressing paradigms,
dopamine appears to be involved in the processes underlying effort based decision making in the
T-Arm Barrier Maze as well. Dopamine depletion in the NAcc via 6-OHDA and dopamine
receptor blockade via systemic treatment with dopamine D1 and D2 antagonists decrease the
likelihood of choosing the high-effort/highreward arm (Bardgett, Depenbrock, Downs, Points, &
Green, 2009; Cousins, Atherton, Turner, & Salamone, 1996; Mott et al., 2009; Salamone et al.,
1994). In contrast, increasing dopamine levels with systemic treatment of amphetamine increase
the likelihood of choosing the high-effort/high-reward arm (Bardgett et al., 2009). This effect
seems to be specific to the effort requirement of climbing over the barrier in the high reward arm
and not due to a change in the relative value of the high and low rewards, as subjects will choose
the high reward arm in the absence of any barrier (Salamone et al., 1994), and subjects will
choose the high reward arm with the barrier present when the other arm contains no pellets
(Cousins et al., 1996). Just as with the NAcc dopamine depletion and dopamine receptor
antagonism impairment seen in the EBCT, the impairments caused in the T-arm barrier maze can
be reversed by systemic administration of A2a antagonists (Mott et al., 2009). Again, as in the
operant effort based choice task, this rescue is receptor subtype specific, as the A1a receptor does
not rescue the impaired behavior (Mott et al., 2009).
![Page 60: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/60.jpg)
45
While the NAcc dopamine depletion effect on the T-arm barrier maze demonstrates that
dopamine in the NAcc influences the effort based decisions, systemic dopamine manipulations
may be acting in multiple sites. The ACC receives dopaminergic projections from the VTA
(Hoover & Vertes, 2007; Lindvall, Björklund, & Divac, 1978). Two initial studies which
lesioned dopamine neurons within the ACC via 6-OHDA showed mixed results as impairments
in choosing the high-effort/high-reward arm were observed following dopamine depletions in
one study (Schweimer & Hauber, 2005), but did not impair the behavior in another (Walton,
Croxson, Rushworth, & Bannerman, 2005). Support for the hypothesis that dopamine does act in
the ACC for effort based decision making in the T-Arm Barrier Maze comes from finding that
local- ized infusions of the D1 Antagonists into the ACC disrupt higheffort/ high-reward arm
choices, whereas D2-Antagonists administered here do not (Schweimer & Hauber, 2006). The
studies of dopamine which implicate the ACC as being involved in effortbased decisions fits
with a number of other studies examining the requirement of ACC function choice in the T-arm
barrier maze described in the next section.
4.1.3.2. The prefrontal cortex and basolateral amygdala.
One of the early studies examining the role of the prefrontal cortex in effort based
decision making using the T-Arm Barrier Maze looked at the effects of a broad, non-region
specific lesion of the medialprefrontal cortex (mPFC), encompassing several sub-regions of the
mPFC including: the Infra-Limbic Cortex (IL), the Prelimbic Cortex (PL), and the Anterior
Cingulate Cortex (ACC). These nonspecific lesions which damaged all 3 sub-regions impaired
effort based decision making, as subjects chose the low effort-low reward arm a higher
percentage of the time (Walton, Bannerman, & Rushworth, 2002). Subsequent work revealed a
functional specialization within the sub-regions of the mPFC as lesions to the ACC alone were
![Page 61: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/61.jpg)
46
sufficient to produce the impaired effort based decision making behavior, whereas lesions to the
both the IL and PL were no different from the sham lesioned control group (Walton, Bannerman,
Alterescu, & Rushworth, 2003).
Later studies went on to show that while bilateral destruction of the ACC was sufficient
to impair effort based decision making, this same impairment could also be produced by
damaging brain regions which directly project to the ACC. It was shown that bilateral
inactivation of the Basao-Lateral Amygdala (BLA) with Bupivacaine lead to impairments in
choosing the high-effort/high-reward arm (Floresco & Ghods-Sharifi, 2007). In a similar manner,
bilateral lesions of the NAcc Core impaired choices of the high-effort/highreward arm (Hauber
& Sommer, 2009). Importantly, however, the common denominator between these two studies
appears to be the connection between these nuclei and the ACC. It was shown that a functional
connection between the BLA and ACC is involved in cost benefit decision making in the T-arm
barrier maze as bilateral inactivation of this circuit leads to impaired responding identical to
bilateral inactivation of either the ACC or BLA alone (Floresco, 2007). Similarly, bilateral
inactivation of the NAcc – ACC circuit also disrupts the choices of the high-effort/highreward
arm (Hauber & Sommer, 2009).
Convergent evidence for an important role for ACC in effort based decision making
comes from in vivo electrophysiological recording studies in behaving animals. Hillman and
Bilkey (2010) employed a spatial cost benefit decision making paradigm, similar to the T-arm
Barrier Maze task. Rats could choose to navigate to earn 6 rewards vs 2 rewards depending on
which arm they chose. When a barrier was present in front of the 6 pellet arm, a substantial
portion of ACC neurons (63%) exhibited significantly higher firing for one goal trajectory versus
the other; for 94% of these cells, higher firing was associated with the arm with a barrier and 6
![Page 62: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/62.jpg)
47
pellets. In intersession and intra-session manipulations involving at least one barrier, ACC
activity rapidly adapted to the changing conditions and was consistently biased toward the low
effort option relative to the configuration. Interestingly, when no barrier was present and the only
difference between the 2 arms was the reward magnitude, the high reward arm was chosen on
84% of trials and ACC activity was minimal and nonbiased (Hillman & Bilkey, 2010). Together,
these observations demonstrated that the High effort/HR bias was not simply attributable to the
larger reward, the barrier, or behavioral preference.
4.1.3.3. Summary
The results from these three different effort based choice tasks reveals some converging
observations implicating certain brain regions which appear to be involved in making decisions
about effortful choice across different types of tasks (Fig. 3B). These include the NAcc Core,
VP, BLA, and the ACC. The NAcc core appears to be critical for effortful choice behavior, as it
is was shown to be involved in all three tasks. The GABAergic connection between the NAcc
Core and the VP has been shown to be important in the EBCT (Farrar et al., 2008; Font et al.,
2008; Mingote et al., 2008), but to our knowledge has yet to be examined in other effort based
choice paradigms, but such studies may be fruitful for future investigators as the VP is thought to
be implicated in motivational processes (McGinty et al., 2011).
Brain targets which have direct connections with the NAcc Core are also important in
effort based decision making. Specifically, the ACC, as well as its connection with the NAcc
appear to modulate effort based choice behavior, as lesions/inactivation of the ACC and
disconnection of the ACC - NAcc decreases willingness to choose the high effort option in the
operant effort discounting task and the T-arm barrier maze task (Hauber & Sommer, 2009).
Worth consideration, however, is the notion that while the ACC is involved in some types of
![Page 63: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/63.jpg)
48
effort-based decision making, this structure may not be universally involved in all situations
requiring effort based decision making as there are some tasks in which lesions of this structure
do not impact effortful choice behavior (Schweimer & Hauber, 2005). Moreover, some studies
report that the deficits observed in effort based choice following lesions to the ACC are only
transient which may suggest that other brain regions can compensate for the loss of this region in
these situations.
The BLA is another region which is connected to the NAcc Core and has a role in effort
based choice processes. In both the operant effort discounting procedure and the T-arm barrier
maze, lesions of the BLA decreased the willingness to choose the high effort option for larger
rewards (Floresco & Ghods-Sharifi, 2007; Ghods-Sharifi et al., 2009). While we are unaware of
any studies which have specifically examined the BLA in the EBCT, the observation that BLA
inactivation decreases responding in an FR-16 (Simmons & Neill, 2009) suggests the BLA may
be involved in this behavior as well.
4.2. Choice between two reward delays
Yet another cost which one can incur in a decision making task is the cost of waiting for a
reward. The investigators who have studied behavior by systematically manipulating delay to
reward have conceptualized this paradigm as reflecting one’s level of impulsive choice
(summarized in Table 3). The central idea behind such tasks is as follows: how do subjects
choose between a small reward delivered immediately versus a larger reward delivered after
some delay, as a function of increasing durations of delay. Of note, is that these tasks are all
similarly controlling effort by having equal physical effort requirements to initiate delays to the
next reward.
![Page 64: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/64.jpg)
49
4.2.1. Operant delay discounting
There have been a few different variants of paradigms developed to study delay-
discounting. In an operant delay-discounting task, subjects have a choice between pressing a
lever which will deliver a small reward (2 pellets) immediately or pressing a different lever
which will result in a larger reward (4 pellets) which is delivered after some delay. The delay is
usually increased over the course of a session. Under baseline conditions, subjects will make
more choices on the long delay/high reward when the delays are relatively short and decrease
their percent of long delay choices as a function of the reward delay (Floresco et al., 2008).
A number of studies have examined the impact of systemic administration of
dopaminergic drugs on delay-discounting. The results of treatment with amphetamine on delay
based choice have been mixed, and appear to depend on a variety of procedural factors. On the
one hand, there have been numerous studies which have shown that amphetamine increases the
number of large reward-long delay choices, which is interpreted as a decrease in impulsive
choice (Barbelivien, Billy, Lazarus, Kelche, & Majchrzak, 2008; Floresco et al., 2008; van
Gaalen, van Koten, Schoffelmeer, & Vanderschuren, 2006; Wade, de Wit, & Richards, 2000;
![Page 65: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/65.jpg)
50
Winstanley, Dalley, Theobald, & Robbins, 2003; Winstanley, Theobald, Dalley, & Robbins,
2005). In these studies, amphetamine appears to increase subject’s indifferent point for delay,
meaning subjects are willing to wait longer to get the larger reward (Wade et al., 2000).
Amphetamine treatment also leads to decreases in choice latency and increases the number of
trials subjects will complete. Treatment with methylphenidate produces the same effect on
indifference point as amphetamine (van Gaalen et al., 2006). These effects are mediated by the
increases in extracellular dopamine levels as the increased choice of the large delayed reward
were mimicked by the selective dopamine reuptake inhibitor GBR 12909 but not by the
noradrenaline reuptake inhibitor desipramine (van Gaalen et al., 2006). As previously alluded to,
however, there have also been some studies which have shown either no influence of
amphetamine on delay based choice, or a decrease in preference for the delayed but larger
reward (Koffarnus, Newman, Grundt, Rice, & Woods, 2011; Slezak & Anderson, 2009; Stanis,
Marquez Avila, White, & Gulley, 2008; Tanno, Maguire, Henson, & France, 2014). The
existence of these mixed results suggests that drugs like amphetamine do not uniformly increase
preference for larger, long delay rewards in all situations.
A number of studies have found that dopamine antagonists increase the number of small
reward immediate choices associated with more impulsive choice (Floresco et al., 2008; van
Gaalen et al., 2006; Wade et al., 2000). Specifically, the non-selective dopamine receptor
antagonist flupenthixol (25, 50, and 100 lg/kg) and the D2 antagonist raclopride (40, 80, and 120
lg/kg), both decreased subject’s indifference point of delays suggesting subjects treat a shorter
delay as being equivalent to the immediate reward delivery option as compared to vehicle treated
subjects (Wade et al., 2000). The D1R antagonist SCH 23390 (5, 10, and 20 lg/kg), however, did
not affect the indifference point.
![Page 66: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/66.jpg)
51
Furthermore, treatment with the adenosine A2aR agonist (clonidine) was also shown to
increase the selection of the low reward-low delayed option, whereas the A1aR agonist
phenylephrine did not affect behavior in the delay-discounting task (van Gaalen et al., 2006).
Interestingly, this is a similar pattern of results observed in the EBCT – A2aR, but not A1aR
agonists acting like D1/D2R antagonists.
It does not appear as though these dopamine drugs are acting within the NAcc, as
dopamine depletions via intra-NAcc 6-OHDA injections, which decreases DA and NA levels by
70–75%, had no impact on delay-discounting behavior alone (Winstanley, Theobald et al.,
2005), although this did transiently potentiate the D-amphetamine-induced decrease in impulsive
choice of large reward-delayed choices (Winstanley, Baunez, Theobald, & Robbins, 2005).
Dopamine signaling to the OFC appears to be important as lesioning dopamine neurons with 6-
OHDA led to an increased indifference point (Kheramin et al., 2004). Further studies showed
that levels of the dopamine metabolite DOPAC increased in the OFC when animals were
performing the delayed discounting task compared to a yoked control condition (Winstanley,
Theobald, Dalley, Cardinal, & Robbins, 2006).
4.2.1.1. The prefrontal cortex: importance of the OFC.
A number of studies have examined different prefrontal structures involvement in delay
discounting. The results on the involvement of the OFC were initially mixed, as it was shown
that bilateral lesions for the OFC led to a decrease in the number of choices of largerdelayed
reward in one study (Mobini et al., 2002), but another study found that it increased the choices of
the larger delayed rewards (Winstanley, Theobald, Cardinal, & Robbins, 2004). Subsequent
studies went on to discover that the role of the OFC in delaydiscounting appears to be dependent
![Page 67: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/67.jpg)
52
both, on whether the delays are cued or not, as well as the baseline levels of impulsivity shown in
subjects, which explains the discrepancy in the results between Mobini et al. (2002) and
Winstanley et al. (2004). Inactivation of the OFC was shown to increase impulsive choice of the
small reward-small delay lever when the delay was cued, but only in rats low in baseline
impulsivity, whereas the same OFC lesion decreases the number of small reward – small delay
choices in an un-cued condition, but only in highly impulsive rats (Zeeb, Floresco, &
Winstanley, 2010).
Several lines of evidence implicate dopamine signaling within the prefrontal cortex in
delay based decision making. Results from a gene expression study found a positive correlation
between baseline levels of impulsive choice and the transcript levels of the dopamine D1 and D5
receptor as well as the D1 receptor interacting protein Calcyon (Loos et al., 2010). Moreover,
local mPFC infusions of the D1/D5 receptor antagonist SCH 23390 and the D1/D5 partial
agonist SKF 38393 increased impulsive choice, which supports the notion that endogenous
receptor D1/D5 signaling in the mPFC is involved in making choices about delayed rewards
(Loos et al., 2010). As was observed with studies employing lesions in the OFC, whether or not
the delayed duration was cued or not also appears to be important for dopaminergic
manipulations in the OFC. Specifically, intra-OFC infusions of D2 antagonist (eticlopride) and
D1 antagonist (SCH23390) do not alter delayed discounting in a delay-discounting procedure
which does not cue the beginning of the delay duration, but both D2 antagonist (eticlopride) and
D1 antagonist (SCH23390) decrease choice of large rewards with long delays when the delay
duration is cued at the beginning of the delay (Zeeb et al., 2010).
4.2.1.2. The basolateral amygdala.
![Page 68: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/68.jpg)
53
One study looked at the role of the BLA and found that inactivation of the BLA leads to
an increased the number of small immediate reward choices (Winstanley et al., 2004). 4.2.1.3.
The sub thalamic nucleus. There have been a small number of studies which have investigated
the influence of the STN on delay based decision making. In one study, it was found that lesions
to the STN decrease impulsive choice, leading to a higher percent of choices on the large reward
lever with a long delay (Winstanley, Baunez et al., 2005), a result which was replicated in
another study (Uslaner & Robinson, 2006). In a third study which manipulated both the duration
of the large delay as well as the small delay, this same pattern of results was not observed
(Bezzina et al., 2009). One key difference between the studies which found that STN lesions lead
to more choices of the large reward lever with long delay (Uslaner & Robinson, 2006;
Winstanley, Baunez et al., 2005) and the Bezzina et al. (2009) was that the former studies
produced the lesions after the subjects were already trained to baseline on the task, whereas the
later induced the lesions before any training. This is likely important as the effect of STN lesions
on delay discounting appear to be most marked immediately after the lesion is made, and animals
seems to compensate in some way after more time passes, a pattern of results which fits with
observations made by Uslaner & Robinson, (2006).
4.2.2. T-arm delay maze
Another paradigm which has been used to study decision making with delays to reward is
the T-Arm maze with delay. In this task, subjects can choose to either go left or right to choose
either a high or low reward. In the low reward arm subjects were able to get access to 1 pellet
immediately. In the high reward arm subjects were enclosed into a gated waiting area for 15 s,
after which the gate to the reward area opened and the subject had access to 15 food pellets. In a
study using this T-Arm Delay maze version of delay discounting, both the ACC and OFC were
![Page 69: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/69.jpg)
54
lesioned. Subjects which received OFC lesions chose the low reward arm far more often than the
sham control group. In the group with lesions to the ACC, delay based decision making was
unaffected as subjects didn’t differ from the sham control group (Rudebeck, Walton, Smyth,
Bannerman, & Rushworth, 2006). Another study investigated the involvement of the OFC,
mPFC and BLA in delay based choice using a T-arm delay procedure. In this study, bilateral
inactivation of the OFC did not impact delay based choice, whereas bilateral inactivation of both
the mPFC and BLA both lead to a decreased preference for the larger delayed reward
(Churchwell, Morris, Heurtelou, & Kesner, 2009). In this study, it was demonstrated that
contralateral disconnection of the mPFC – BLA circuit also reduced preference for the larger but
delayed reward.
4.2.2.1. Summary.
There appear to be some consistent findings in the literature on the brain regions and
pharmacological manipulations which influence delay based decision making across behavioral
paradigms (Fig. 4A). The brain region which has been most reliably implicated in delay based
decision making is the OFC. Both studies using an operant delay discounting procedure and a T-
Arm Delay task have found that lesions to the OFC decrease the number of choices of the large
reward lever with long delays (Mobini et al., 2002; Rudebeck et al., 2006; Zeeb et al., 2010). In
one study inactivation of OFC did not change delay based choice behavior (Churchwell et al.,
2009), though both the mPFC and BLA were found to be involved in delay based choice as
manipulations of these regions as well as the connection between them can alter delay based
choice behavior. While there is a large amount of converging evidence implicating the OFC in
delay based choice, a comprehensive understanding of its involvement and the involvement of
other regions has yet to be fully worked out.
![Page 70: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/70.jpg)
55
Fig. 4. Brain regions involved in delay and risky choice. (A) Brain regions which have been
studied to examine their involvement in delay based choice through lesions, inactivation, or
localized drug infusions which have been shown to modulate delay based choice (Blue), have no
effect on delay based choice (Grey), or have not yet been examine (White). Areas which
influence delay based choice include: the PL, IL, OFC, NAcc Core, BLA, STN, and VTA. Areas
which do not influence delay based choice include: the ACC, and NAcc Shell. Areas which have
not been studied include: VP, DS, Hipp, SN, vSub. (B) Brain regions which have been studied to
examine their involvement in risk based choice through lesions, inactivation, or localized drug
infusions which have been shown to modulate delay based choice (Red), have no effect on delay
based choice (Grey), or have not yet been examine (White). Areas which influence delay based
choice include: the PL, NAcc Shell, BLA, and VTA. Areas which do not influence delay based
choice include: the ACC, OFC, and NAcc Core. Areas which have not been studied include: IL,
VP, DS, Hipp, STN, SN, vSub. (For interpretation of the references to colour in this figure
legend, the reader is referred to the web version of this article.)
Systemic treatment with drugs acting on the dopamine system produce consistent patterns
of results across a number of studies. Drugs which increase synaptic levels of dopamine, like
amphetamine, lead to an increase in choices of the large reward lever with long delays, whereas
![Page 71: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/71.jpg)
56
antagonists of the D1 and D2 receptor both lead to increased choices of the low reward lever
with no delay. Systemic treatment with drugs altering dopamine signaling are not likely acting
within the NAcc to impact delay discounting, however, as NAcc dopamine depletion does not
alter delay based decision making. Dopamine does appear to be acting within the OFC, however,
as D1 and D2 antagonists decrease the number of choices of the large reward lever with long
delays (Loos et al., 2010; Zeeb et al., 2010). Finally, lesions to the STN have been found to lead
to an increase in this type of choice (Uslaner & Robinson, 2006; Winstanley, Baunez et al.,
2005).
4.3. Choice between two probabilities
The studies of choice between two costs up until this point have all been studies which
systematically manipulated the effort requirements of the tasks (e.g. amount of work or length of
delay to reward) to see what brain regions and neurotransmitter systems are involved in guiding
behavior in the face of different costs. Another type of cost that can influence the choice between
two options is the likelihood of each option paying off. This type of task is thought to represent
risky decision making as when one chooses an option with a lower probability of paying off, but
with the possibility of obtaining a larger reward they are taking a risk. There have been a number
of studies examining the role of different brain regions in this risk based decision making
behavior (summarized in Table 4).
4.3.1. Operant risk discounting
A variant of the Effort-Discounting task was developed to study Risk-Discounting. In this
paradigm, the response cost remains fixed at 1 response, but the probability that the response will
be rewarded is systematically manipulated. These risk-discounting paradigms give subjects a
choice between a high-reward (4 pellets) low probability lever or a low reward (2 pellet) high
![Page 72: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/72.jpg)
57
probability lever which pays off 100 percent of the time. The probability of the high-reward low
probability lever is varied over the course of the session in ten trial blocks (i.e. 4 pellets delivered
with probability of - 100, 50, 25, 12.5%), and can be performed with either decreasing or
increasing probabilities as the session progresses. When both levers have equal payoff
probabilities, rats will choose the high reward lever most of the time and these high response
choices become less likely as a function of the decreasing probability.
4.3.1.1. Dopaminergic influence on risk based choice.
Numerous studies have found that administration of amphetamine increases the
preference for the large risky reward, whereas dopamine antagonists such as Flupenthixol
decreased preference for the large risky option (St. Onge, Chiu, & Floresco, 2010). Interestingly,
the effects of systemic amphetamine are seen through an increased preference for the large risky
options, but this is only observed when the probability decreased over the session, whereas the
preference is actually reduced when the probabilities start low and get larger throughout the
![Page 73: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/73.jpg)
58
session (St. Onge et al., 2010). The effects of amphetamine can be blocked or attenuated with
either systemic D1R (SCH23390) or D2R (SKF81297) antagonists, and are therefore not
mediated by specific receptor type (St Onge & Floresco, 2009). Additionally, blockade of D1 or
D2 receptors alone induced risk aversion (St Onge & Floresco, 2009). Other studies have
examined the involvement of the Dopamine D3 receptors and have found that the D3 antagonist
(PD 128,907) reduced the number of choices on the large/risky lever, whereas the D3 antagonist
(nafadotride) potentiated the amphetamine-induced risky choice, but didn’t alter risk-based
choice when administered alone (St Onge & Floresco, 2009). Finally, blockade (L745) or
stimulation (PD168) of D4 receptors did not alter behavior (St Onge & Floresco, 2009).
Having found that systemic dopamine manipulations impact risk based decision making,
subsequent studies then went on to more specifically examine the role of the NAcc and NAcc
dopamine on risky behavior. Inactivation of the entire NAcc with a mixture of the GABAA and
B agonists mucsimol and baclofen, lead to a decreased preference for the high reward - risky
option (Stopper & Floresco, 2011). Moreover, the sub regions of the NAcc appear to be
differentially involved in aspects of risk-based decision making, as inactivation of the NAcc
Shell impacted the percent of choices on the high reward – risky lever, but did not have any
impact on the latency to make the choice (Stopper, Khayambashi, Kelly, & Floresco, 2010).
Inactivation of the NAcc Core, on the other hand, impacted the latency to respond, but did not
affect the percentage of high reward – risky choices.
Studies examining the role of the NAcc in risky choice also looked at the impact of
dopaminergic drugs infused directly into the NAcc. NAcc infusions of the D1 receptor antagonist
(SCH 23390) found that this NAcc D1 blockade decreased preference for the large/uncertain
rewards, which occurred because of an enhanced negative-feedback sensitivity - reflected in the
![Page 74: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/74.jpg)
59
increased tendency to choose the smaller but more certain option immediately after an
unsuccessful attempt on the large reward-high risk lever (Stopper, Khayambashi, & Floresco,
2013). In contrast, NAcc infusion of the D1 receptor agonist (SKF 81297) had the opposite
effect, increasing choice of high risky option when the risky lever probability was high and
decreased preference when the risky lever had lower probabilities, suggesting an increased
sensitivity to the probability of payoff (Stopper et al., 2013). In contrast to the bidirectional
effects of the D1 receptors on risk-based decision making, neither D2 antagonists (eticlopride)
nor agonists (quinpirole or bromocriptine) in the NAcc influenced risky choice (Stopper et al.,
2013). Finally, the D3-preffering agonist (PD 128 907) decreased risky choice and subjects were
more likely to shift to the low risk lever after a successful high risk outcome (Stopper et al.,
2013).
In a manner similar to the results observed with Effort- Discounting in the T-arm Barrier
maze, dopamine appears to be acting to influence risk based decision making not only within the
NAcc but also within the prefrontal cortex. Studies which locally administered the D1 antagonist
(SCH23390) into the medial PFC found a decreased preference for the large/risky option,
whereas infusion of the D1 agonist (SKF81297) caused a slight, nonsignificant increased in
preference for the large risky lever (St. Onge, Abhari, & Floresco, 2011). Dopamine D2 receptor
antagonists (eticlopride) infused into the mPFC reduced risk discounting and increased the
percent of risky choices, whereas D2 agonist (quinpirole) induced an impairment in risk based
decision making, as subjects were less likely to choose the high risk option, showing a flattening
of the discounting function overall (St. Onge et al., 2011).
4.3.1.2. Prefrontal cortex and basolateral amygdala.
![Page 75: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/75.jpg)
60
Within the Pre Frontal Cortex, inactivation of the PL cortex of the mPFC through
infusions of the GABAA and B agonists (muscimol and baclofen) increased risky choice when
the probability on the risky lever was decreased over the course of the session, but this same
inactivation lead to decreases in risky choice when the large/risky reward probability increases
over a session (St. Onge & Floresco, 2009). Control experiments demonstrated that the results
following PL cortex inactivation could not be explained by a more general disruptions in flexible
behavior (reversal learning) or judgments about the relative value of probabilistic rewards (St.
Onge & Floresco, 2009). In contrast to the results of inactivation of the PL cortex, inactivation of
the OFC through infusions of muscimol and baclofen increased response latencies, but did not
have any effect on risky choice (St. Onge & Floresco, 2009). Further studies demonstrated that
inactivation of the medial OFC increased risky choice on risk-discounting task in either
ascending or descending probability conditions (Stopper, Green, & Floresco, 2014). This
increased risky choice was associated with enhancement in win-stay behavior as rats showed a
tendency to choose the risky option again following a rewarded risky trial.
In contrast to the PL cortex and OFC, inactivation of the ACC via infusions of
muscimo/baclofen did not affect risky choice or latency to respond (St. Onge & Floresco, 2009).
Finally, infusions of the GABAA and B agonists (muscimol and/ baclofen) into the BLA
disrupted risk discounting, inducing a risk averse pattern of choice, as there were observed
increases in response latencies as well as trial omissions, with these effects appearing most
prominently in cases with the greatest amounts of uncertainty (Ghods-Sharifi et al., 2009). A set
of studies was performed which assessed the role of the functional connection between the BLA
and mPFC, as both of these areas were shown to alter risk based choice when bilaterally
inactivated. It was shown that functional disconnection of the BLA-mPFC connection altered
![Page 76: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/76.jpg)
61
risk based choice behavior such that subjects chose the risky option more often, which resulted
from decreased sensitivity to negative feedback following an unsuccessful risky choice (St Onge,
Stopper, Zahm, & Floresco, 2012). Moreover, it is specifically the top-down pathway by which
the mPFC – BLA connection is involved in risky choice, as specific disconnection of the mPFC
to BLA circuit altered risk based choice, whereas inactivation of the BLA to mPFC connection
did not (St Onge et al., 2012).
4.3.1.3. Summary.
There have been a number of consistent findings from studies which have looked at
choice behavior involving different probability of payoff (Fig. 4B). Drugs which increase
synaptic levels of dopamine such as amphetamine increase the number of choices of the risky
large reward, dopamine antagonists decrease these choices leading to more selections of smaller
more certain rewards (St Onge & Floresco, 2009). Inactivation of the NAcc Shell and the BLA
both decrease the number of choices of the high reward lever/lower probabilities (Ghods-Sharifi
et al., 2009; Stopper & Floresco, 2011). The effects of inactivation of the PL cortex of the mPFC
appear to be dependent on whether the probabilities increase or decrease over the course of the
session (St Onge & Floresco, 2010b). Inactivation of this region decreases risky choice when the
probabilities decrease over the course of the session, but increase risky choice when the
probabilities increase over the course of the session. Finally, inactivation of the NAcc Core, the
OFC, and the ACC do not specifically impact risk based choice (St Onge & Floresco, 2010b;
Stopper & Floresco, 2011).
5. Summary and future directions
We have summarized a wide range of studies which address the question: how does the
brain process and use information related to different types of response costs underlying
![Page 77: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/77.jpg)
62
motivated behavior? We focused on cost manipulations of effort, time delays, and
risk/probability and found that a number of brain regions seem to appear to be important in many
different types of tasks, whereas others appear to be highly specific to some tasks but not others.
5.1. Dopamine influences response vigor, as well as cost, delay, and risk based decision
making
Systemic dopamine treatments influenced performance in all of the different types of
tasks covered in the review. Systemic treatment with drugs which increased synaptic dopamine
levels, like amphetamine, leads to increased activation in a PR schedule (Bailey et al., 2015;
Mayorga et al., 2000; Sommer et al., 2014), increases in high effort/high reward choice (Bardgett
et al., 2009; Floresco et al., 2008), increases in long delay/large reward choice (Barbelivien et al.,
2008; Floresco et al., 2008; van Gaalen et al., 2006; Wade et al., 2000), and increases in risky
choice for large rewards (St. Onge et al., 2010). Systemic treatment with dopamine antagonists
drugs decreased these behaviors in a PR (Aberman & Salamone, 1999; Caul & Brindle, 2001;
Cheeta et al., 1995; Olarte-Sanchez et al., 2013), effort choice tasks (Cousins & Salamone, 1994;
Farrar et al., 2010; Koch et al., 2000; Nowend et al., 2001; Salamone, 1991; Salamone et al.,
2002; Sink et al., 2008; Worden et al., 2009), delay choice tasks (Floresco et al., 2008; van
Gaalen et al., 2006; Wade et al., 2000), and risk choice tasks (St. Onge et al., 2010). The site of
action of these dopaminergic drugs differs for different behavioral processes. The NAcc Core is
the important site of action of dopaminergic drugs for modulating behavior in the PR task ().
Both the NAcc Core and Shell are sites of action of dopamine antagonists in two different effort
based choice tasks (EBCT and T-Arm barrier maze) Sokolowski & Salamone, 1998, but only the
NAcc Core was important in an operant effort discounting task (Ghods-Sharifi & Floresco,
2010). In tasks which require subjects to make choices about delays to reward, both the OFC and
![Page 78: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/78.jpg)
63
the mPFC (PL and IL) are sites where dopamine acts, whereas NAcc dopamine has no impact on
delay based decision making (Winstanley, Baunez et al., 2005). For risk based choice, both the
IL and PL cortex as well as the NAcc appear to be sites of action of dopamine (St. Onge et al.,
2011), but only the NAcc shell appears to be modulating risk based decision making (Stopper &
Floresco, 2011).
5.2. Neural substrates involved in effort, delay, and risk costs
There appear to be a number of different brain regions which are involved in multiple
types of response cost related behaviors, but there are also some regions in which there appears
to be selectivity in the types of costs they are required for making decisions about (Fig. 5).
5.2.1. ACC
The ACC is a region which appears to be specifically involved in decisions about effort
based choice. Lesions to the ACC did not impact PR responding (Schweimer & Hauber, 2005),
which suggests the region by itself cannot influence activational aspects of motivation. This is
supported by a study which found that lesioning the ACC did not alter locomotor activity in an
open field (Li et al., 2012). In decision making situations involving manipulations of delay to
reward and risk/probability, lesions to the ACC also do not seem to have any effect (Rudebeck et
al., 2006; St. Onge et al., 2010). On the other hand, the ACC is involved in effort based decisions
as lesions to the ACC lead to more choices of low effort options (Walton et al., 2003, 2009).
While the majority of the studies done on the ACC’s involvement in effort based choice utilized
the T-arm barrier maze (which all found the region is required for normal decision making), one
study using the EBCT found that lesions to the ACC did not have any effect (Schweimer &
Hauber, 2005), whereas a study employing a variant of the operant effort discounting (FR-14 for
4 pellets vs FR-4 for 2 pellets), found that lesions to the ACC lead to a decreased selection of the
![Page 79: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/79.jpg)
64
high effort lever (Walton et al., 2009). Together, these results suggest that the ACC plays a
specific role in the computation of whether the value of an outcome offsets the effort needed to
obtain it.
5.2.2. OFC
Lesions to the OFC were shown to increase subject’s BP in a PR task (Gourley et al.,
2010), which suggests that the region may be modulating vigor via activational influences of
motivation. In line with this idea is the observation that lesions to the OFC in rats leads to
increased locomotor activity in an open field test (De Bruin et al., 1983). Worth noting is the fact
that in a PR schedule two things are systemically increasing together: the number of responses
required for the next reward and the time the required bout of work will require to obtaining that
reward. Thus, both the effort required and delay to reward are simultaneously manipulated in this
task. While there have been many studies done which have found the OFC is involved in delay
based choice behavior there has only been one study done to our knowledge which has looked at
effort based choice (Rudebeck et al., 2006).
Of the numerous studies which have been done examine the influence of the OFC on
delay based choice, a consensus is that lesioning or inactivation of the OFC leads to altered delay
based choices such that subjects prefer smaller more immediate rewards and are averse to delays
(Mobini et al., 2002; Rudebeck et al., 2006; Zeeb et al., 2010). Whereas many of the early
studies did not specifically distinguish between sub regions within the OFC, recently the
important functional distinctions between the medial OFC (mOFC) and lateral OFC (lOFC) has
been recognized. Specifically, lesions or inactivation of the lateral OFC induces an impairment
in delay based choice (Rudebeck et al., 2006; Winstanley et al., 2004), whereas inactivation
restricted to the mOFC does not (Stopper et al., 2014).
![Page 80: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/80.jpg)
65
Finally, while initial studies seemed to suggest that the OFC is not involved in decision
making about risk or probabilities (St. Onge et al., 2010), these studies did not specifically
disentangle the contribution of the medial and lateral OFC. Specific inactivation of the medial
OFC leads to alterations in risk based choice (Stopper et al., 2014), whereas inactivation of the
lateral OFC does not (St Onge & Floresco, 2010a). The apparent lack of an effect of the OFC in
the effort based choice behaviors, along with the regional specification of involvement in delay
and risk based choice seems to suggests that the sub regions of the OFC may play a specific role
in processing certain types of information going into the computation of whether the benefit of a
specific outcome is worth the cost of obtaining that outcome. Future studies will be needed to
further understand the regional distinctions of this structure as well as whether differential input
and output targets can be identified based on their connectivity within the sub regions of the
OFC.
5.2.3. The IL and PL cortex
Many of the studies which have made lesions to the mPFC have targeted the IL and PL
cortex. The PL cortex appears to be involved in activational influences of motivation as lesions
to this region decrease BP’s in a PR (Gourley et al., 2010), though in the one study to examine
this it is likely that the IL was also damaged based on the reported histology. While damage to
the PL can enhance response vigor in a PR, neither the IL nor PL appear to be involved in effort
based choice (Walton et al., 2002). The PL and IL both appear to be involved in delay based
choice, as local dopamine antagonist infusions into this region can lead to decrease selection of
larger rewards with longer delays (Loos et al., 2010). Finally, the PL cortex appears to be
involved in risk based decision making (St. Onge et al., 2010). Thus these regions of mPFC seem
to play a role in assessing the costs of both delays and risks but not effort. It is possible that an
![Page 81: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/81.jpg)
66
assessment of delay mediates all these results as the average delay to risky outcomes is greater
than the delay of less risky outcomes in the studies reviewed here.
Fig. 5. Summary brain regions and types of costs. An overall summary of the brain regions which
have been shown to modulate different aspects of motivated behavior and overcoming response
costs through lesion studies, chemical inactivation, and pharmacological manipulations. (A) Brain
regions which have been shown to be modulate vigor in a PR (orange) have no effect on PR
(grey), and have not been studied (white). (B) Brain regions which have been shown to be
involved in effort based choice behavior (green), have no effect on effort based choice (grey), and
have not be studied (white). (C) Brain regions which have been shown to be involved in delay
based choice behavior (blue), have no effect on delay based choice (grey), and have not been
studied (white). (D) Brain regions which have been shown to be involved in risk based choice
behavior (red), have no effect on risk based choice (grey), and have not been studied (white). (E)
Provides an overall summary of the different types of behavioral tasks each of the different brain
regions has been shown to be involved in. (For interpretation of the references to colour in this
figure legend, the reader is referred to the web version of this article.)
![Page 82: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/82.jpg)
67
5.2.4. NAcc Core
The NAcc Core appears to be involved in activational aspects of motivation. Cell body
lesions to this region enhance BP’s in a PR, whereas dopamine depletion and dopamine
antagonist drugs lead to reductions in BP’s (Bezzina, Body, Cheung, Hampson, Bradshaw et al.,
2008; Bezzina, Body, Cheung, Hampson, Deakin, et al., 2008; Bezzina, den Boon et al., 2008;
Hamill et al., 1999; Bari & Pierce, 2005). The NAcc Core is also involved in effort based choice,
as dopamine depletion and dopamine antagonist drugs in this region leads to reduction of
choosing high effort options in all three of the discussed measures of effort choice: EBCT,
operant effort discounting, and the T-arm barrier maze (Salamone et al., 1991; Ghods- Sharifi &
Floresco, 2010; Hauber & Sommer, 2009). The NAcc Core is also involved in delay based
choice as lesions of the NAcc Core lead to increased selection of the smaller more immediate
reward (Cardinal, Pennicott, Sugathapala, Robbins, & Everitt, 2001). This is different from the
effects of dopamine depletion to the region, however, as this has been shown to not have any
effect on delay based choice (Winstanley, Theobald et al., 2005), and intra NAcc Core D1 and
D2 receptor antagonists do not impair the ability to wait for reward in a cued progressive delay
procedure (Wakabayashi, Fields, & Nicola, 2004).
Excitotoxic lesions of the NAcc lead to a risk averse pattern of responding in risk based
choice task, but this does not seem to be specific to the NAcc Core because inactivation of the
NAcc Core alone doesn’t alter risk based choice (Stopper & Floresco, 2011). Thus the NAcc
Core appears to process information about the effort and risk costs of an alternative but not be
involved in processing delay costs.
5.2.5. NAcc Shell
![Page 83: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/83.jpg)
68
The NAcc Shell is distinct from the NAcc Core in a number of ways. First, whereas cell
body lesions within the NAcc Shell can increase responding in a PR, dopaminergic depletion
within the shell and D1 and D2R antagonists within the shell do not increase response vigor in
this task (Bari & Pierce, 2005). Additionally, while all 3 of the effort based choice tasks found
the NAcc Core to be important for effort based choice, only the EBCT found that DA depletion
and D1 and D2 antagonists could alter this behavior (Sokolowski & Salamone, 1998), although
to a lesser extent that within the Core. A study which examined NAcc DA involvement in delay
based choice found that intra-accumbal infusions of 6-OHDA did not have any impact on delay
based choice (Winstanley, Theobald et al., 2005), but did impact risk based choice as
inactivation of this region decreased risky choice behavior (Stopper & Floresco, 2011). In sum, it
appears that the NAcc Shell may play a very similar computational role to that played by the
core, but it is not directly responsible for activational effects of dopamine as measured in a PR
and is involved in risky choice.
5.2.6. VP
To our knowledge, the effect of lesioning the VP has not been examined in PR tasks,
delay choice tasks, or risk based choice tasks. There is however, evidence that the VP is involved
in effort based choice tasks as inactivation of this region lead to a decrease in willingness to
choose high effort options in the EBCT (Farrar et al., 2008). Given the direct connection with the
NAcc Core, it seems like the VP may likely be involved in other behaviors which the NAcc Core
is involved in and future studies may benefit from examining this region more closely.
5.2.7. STN
The STN also appears to be involved in activational aspects of motivation as lesions to
this region increase BP’s in a PR (Baunez et al., 2002; Bezzina, Body, Cheung, Hampson,
![Page 84: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/84.jpg)
69
Bradshaw et al., 2008; Bezzina, Body, Cheung, Hampson, Deakin, et al., 2008; Bezzina, den
Boon et al., 2008), and it also appears to be involved in delay based choice as lesions to the STN
alter delay based choice behavior as subjects choose more of the large rewards with long delays
(Winstanley, Theobald et al., 2005; Uslaner and Robinson (2006)). We are unaware of studies
which have examined the impact of lesioning the STN in either effort based or risk based choice.
Interestingly, the pattern of results observed with STN lesions are highly similar to those of OFC
lesions (increased BP, increased delay based choice).
5.2.8. BLA
The BLA is another brain region which appears to be involved in many aspects of
behavior covered in the review, as it influences behavior effort based choice (Floresco & Ghods-
Sharifi, 2007; Ghods-Sharifi et al., 2009), delay based choice (Winstanley et al., 2004), and risk
based choice (Ghods-Sharifi et al., 2009). While we could not a find a specific study which used
a PR, one which used a FR-16 found that inactivation of the BLA decreased responding
(Simmons & Neill, 2009). Thus we hypothesize that the BLA may be processing information
about the net costs and benefits of different behavioral options.
5.2.9. VTA
As discussed in the earlier section on the effects of dopamine in the various behaviors,
the VTA appears to be involved in all of the behaviors discussed: PR, effort choice, delay choice,
and risk choice. This is due to the fact that the VTA sends dopaminergic neurons to other brain
regions for which dopamine is important for modulating these behaviors: including NAcc, OFC,
BLA, IL, PL, and the ACC. D1 antagonists injected directly into the VTA leads to a decrease in
BP, whereas over expression of D2 receptors in this region leads to increases in BP (Sharf et al.,
2005). That VTA is involved in activational aspects of motivation is well known, and studies
![Page 85: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/85.jpg)
70
have shown that the area can modulate general locomotor activity, as injections of the GABAA
antagonist picrotoxin into the VTA increase locomotor activity in an open field (Mogenson et al.
1980). This set of data reflect the key role that dopamine plays in modulating the widely
distributed network involved in these cost-benefit computations.
5.2.10. Hippocampus
While lesions to the ventral hippocampus were shown increase BP’s in a PR task
(Gourley et al., 2010; Chambers & Self, 2002), we are unaware of any studies which have
specifically looked at the role of the ventral hippocampus in effort, delay, or risk based choice
procedures. Given the direct connection between the OFC and the ventral hippocampus, it may
be interesting to see if delay based choice requires ventral hippocampal functioning.
5.3. Conclusion/future directions
The studies covered in this review suggest that there is a widely distributed network
engaged during motivated action. Some of that network seems to energize behavior while other
structures in the network are involved in more specific computations about different kinds of
costs. It also seems likely that this network is involved in computing different kinds of benefits
as well. When examining the summary of studies which have been done in Fig. 5E, it becomes
apparent that there are a number of different brain areas which have yet to be studied for certain
types of processes. It may be helpful in developing a more complete picture of this distributed
circuit to more fully understand what each region does with relation to each other. We believe
that to understand motivation the field must continue to dissecting this network and map it to
specific behavioral functions. The arsenal of new techniques which exist for cell type specific
manipulation of circuits with high levels of temporal control should aid this continued quest to
![Page 86: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/86.jpg)
71
better understand the neural circuits guiding and directing behavior to overcome obstacles in the
environment.
References
Aberman, J. E., & Salamone, J. D. (1999). Nucleus accumbens dopamine depletions make rats
more sensitive to high ratio requirements but do not impair primary food reinforcement.
Neuroscience, 92(2), 545-552.
Aberman, J. E., Ward, S. J., & Salamone, J. D. (1998). Effects of dopamine antagonists and
accumbens dopamine depletions on time-constrained progressive-ratio performance.
Pharmacology Biochemistry and Behavior, 61(4), 341-348. doi: 10.1016/s0091-
3057(98)00112-9
Anaclet, C., Parmentier, R., Ouk, K., Guidon, G., Buda, C., Sastre, J.-P., . . . Lin, J.-S. (2009).
Orexin/Hypocretin and Histamine: Distinct Roles in the Control of Wakefulness
Demonstrated Using Knock-Out Mouse Models. The Journal of Neuroscience, 29(46),
14423-14438.
Antle, M. C., & Silver, R. (2015). Circadian Insights into Motivated Behavior (pp. 1-33). Berlin,
Heidelberg: Springer Berlin Heidelberg.
Antoniou, K. (2005). A detailed behavioral analysis of the acute motor effects of caffeine in the
rat: involvement of adenosine A1 and A2A receptors. Psychopharmacology (Berlin,
Germany), 183(2), 154-162.
Atalayer, D., & Rowland, N. E. (2009). Meal patterns of mice under systematically varying
approach and unit costs for food in a closed economy. Physiology & Behavior, 98(1-2),
85-93.
Bailey, M. R., Jensen, G., Taylor, K., Mezias, C., Williamson, C., Silver, R., . . . Balsam, P. D.
(2015). A novel strategy for dissecting goal-directed action and arousal components of
motivated behavior with a progressive hold-down task. Behavioral Neuroscience, 129(3),
269-280. doi: 10.1037/bne0000060
Baldo, B. A., Sadeghian, K., Basso, A. M., & Kelley, A. E. (2002). Effects of selective dopamine
D1 or D2 receptor blockade within nucleus accumbens subregions on ingestive behavior
and associated motor activity. Behavioural Brain Research, 137(1-2), 165-177. doi:
10.1016/s0166-4328(02)00293-0
Balsam, P. D., Drew, M. R., & Gallistel, C. R. (2010). Time and Associative Learning.
Comparative cognition & behavior reviews, 5, 1-22.
Balsam, P. D., & Gallistel, C. R. (2009). Temporal maps and informativeness in associative
learning. Trends in Neurosciences, 32(2), 73-78.
![Page 87: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/87.jpg)
72
Barbelivien, A., Billy, E., Lazarus, C., Kelche, C., & Majchrzak, M. (2008). Rats with different
profiles of impulsive choice behavior exhibit differences in responses to caffeine and d-
amphetamine and in medial prefrontal cortex 5-HT utilization. Behavioural Brain
Research, 187(2), 273-283. doi: http://dx.doi.org/10.1016/j.bbr.2007.09.020
Bardgett, M. E., Depenbrock, M., Downs, N., Points, M., & Green, L. (2009). Dopamine
Modulates Effort-Based Decision Making in Rats. Behavioral Neuroscience, 123(2), 242-
251. doi: 10.1037/a0014625
Bari, A. A., & Pierce, R. C. (2005). D1-like and D2 dopamine receptor antagonists administered
into the shell subregion of the rat nucleus accumbens decrease cocaine, but not food,
reinforcement. Neuroscience, 135(3), 959-968. doi: 10.1016/j.neuroscience.2005.06.048
Baunez, C., Amalric, M., & Robbins, T. W. (2002). Enhanced food-related motivation after
bilateral lesions of the subthalamic nucleus. Journal of Neuroscience, 22(2), 562-568.
Baunez, C., Dias, C., Cador, M., & Amalric, M. (2005). The subthalamic nucleus exerts opposite
control on cocaine and 'natural' rewards. Nature Neuroscience, 8(4), 484-489. doi:
10.1038/nn1429
Beer, B., & Trumble, G. (1965). Timing behavior as a function of amount of reinforcement.
Psychonomic Science, 2(3), 71-72.
Bezzina, G., Body, S., Cheung, T. H. C., Hampson, C. L., Bradshaw, C. M., Szabadi, E., . . .
Deakin, J. F. W. (2008). Effect of disconnecting the orbital prefrontal cortex from the
nucleus accumbens core on inter-temporal choice behaviour: A quantitative analysis.
Behavioural Brain Research, 191(2), 272-279. doi: 10.1016/j.bbr.2008.03.041
Bezzina, G., Body, S., Cheung, T. H. C., Hampson, C. L., Deakin, J. F. W., Anderson, I. M., . . .
Bradshaw, C. M. (2008). Effect of quinolinic acid-induced lesions of the nucleus
accumbens core on performance on a progressive ratio schedule of reinforcement:
implications for inter-temporal choice. Psychopharmacology, 197(2), 339-350.
Bezzina, G., Cheung, T. H. C., Body, S., Deakin, J. F. W., Anderson, I. M., Bradshaw, C. M., &
Szabadi, E. (2009). Quantitative analysis of the effect of lesions of the subthalamic
nucleus on intertemporal choice: further evidence for enhancement of the incentive value
of food reinforcers. Behavioural Pharmacology, 20(5-6), 437-446.
Bezzina, G., den Boon, F. S., Hampson, C. L., Cheung, T. H. C., Body, S., Bradshaw, C. M., . . .
Deakin, J. F. W. (2008). Effect of quinolinic acid-induced lesions of the subthalamic
nucleus on performance on a progressive-ratio schedule of reinforcement: A quantitative
analysis. Behavioural Brain Research, 195(2), 223-230. doi: 10.1016/j.bbr.2008.09.005
Bitterman, M. E., & Schoel, W. M. (1970). Instrumental learning in animals: Parameters of
reinforcement. Annual Review of Psychology, 21, 367-436. doi:
10.1146/annurev.ps.21.020170.002055
![Page 88: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/88.jpg)
73
Bower, G. H., & Trapold, M. A. (1959). Reward magnitude and learning in a single-presentation
discrimination. Journal of Comparative and Physiological Psychology, 52(6), 727-729.
doi: 10.1037/h0039617
Cagniard, B., Balsam, P. D., Brunner, D., & Zhuang, X. X. (2006). Mice with chronically
elevated dopamine exhibit enhanced motivation, but not learning, for a food reward.
Neuropsychopharmacology, 31(7), 1362-1370. doi: 10.1038/sj.npp.1300966
Caul, W. F., & Brindle, N. A. (2001). Schedule-dependent effects of haloperidol and
amphetamine: multiple-schedule task shows within-subject effects. Pharmacology
Biochemistry and Behavior, 68(1), 53-63. doi: 10.1016/s0091-3057(00)00431-7
Cetin, T., Freudenberg, F., Fuchtemeier, M., & Koch, M. (2004). Dopamine in the orbitofrontal
cortex regulates operant responding under a progressive ratio of reinforcement in rats.
Neuroscience Letters, 370(2-3), 114-117. doi: 10.1016/j.neulet.2004.08.002
Chambers, R. A., & Self, D. W. (2002). Motivational responses to natural and drug rewards in
rats with neonatal ventral hippocampal lesions: An animal model of dual diagnosis
schizophrenia. Neuropsychopharmacology, 27(6), 889-905. doi: 10.1016/s0893-
133x(02)00365-2
Cheeta, S., Brooks, S., & Willner, P. (1995). EFFECTS OF REINFORCER SWEETNESS AND
THE D2/D3 ANTAGONIST RACLOPRIDE ON PROGRESSIVE RATIO OPERANT
PERFORMANCE. Behavioural Pharmacology, 6(2), 127-132.
Collier, G. (1969). Body weight loss as a measure of motivation in hunger and thirst. Annals of
the New York Academy of Sciences, 157(2), 594-608.
Collier, G., & Johnson, D. F. (1997). Who is in Charge? Animal vs Experimenter Control.
Appetite, 29(2), 159-180. doi: http://dx.doi.org/10.1006/appe.1997.0124
Collier, G., & Levitsky, D. (1967). DEFENSE OF WATER BALANCE IN RATS:
BEHAVIORAL AND PHYSIOLOGICAL RESPONSES TO DEPLETION. Journal of
Comparative and Physiological Psychology, 64(1), 59-67. doi: 10.1037/h0024800
Correa, M., Mingote, S., Betz, A., Wisniecki, A., & Salamone, J. D. (2003). Substantia nigra
pars reticulata GABA is involved in the regulation of operant lever pressing:
pharmacological and microdialysis studies. Neuroscience, 119(3), 759-766. doi:
http://dx.doi.org/10.1016/S0306-4522(03)00117-9
Cousins, M. S., Atherton, A., Turner, L., & Salamone, J. D. (1996). Nucleus accumbens
dopamine depletions alter relative response allocation in a T-maze cost/benefit task.
Behavioural Brain Research, 74(1-2), 189-197. doi: 10.1016/0166-4328(95)00151-4
Cousins, M. S., & Salamone, J. D. (1994). NUCLEUS-ACCUMBENS DOPAMINE
DEPLETIONS IN RATS AFFECT RELATIVE RESPONSE ALLOCATION IN A
NOVEL COST/BENEFIT PROCEDURE. Pharmacology Biochemistry and Behavior,
49(1), 85-91. doi: 10.1016/0091-3057(94)90460-x
![Page 89: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/89.jpg)
74
Cousins, M. S., Sokolowski, J. D., & Salamone, J. D. (1993). Different effects of nucleus
accumbens and ventrolateral striatal dopamine depletions on instrumental response
selection in the rat. Pharmacology Biochemistry and Behavior, 46(4), 943-951. doi:
http://dx.doi.org/10.1016/0091-3057(93)90226-J
Covarrubias, P., & Aparicio, C. F. (2008). Effects of reinforcer quality and step size on rats'
performance under progressive ratio schedules. Behavioural Processes, 78(2), 246-252.
doi: 10.1016/j.beproc.2008.02.001
Czachowski, C. L., & Samson, H. H. (1999). Breakpoint Determination and Ethanol Self-
Administration Using an Across-Session Progressive Ratio Procedure in the Rat.
Alcoholism: Clinical and Experimental Research, 23(10), 1580-1586. doi:
10.1111/j.1530-0277.1999.tb04047.x
de Jong, J. W., Roelofs, T. J. M., Mol, F. M. U., Hillen, A. E. J., Meijboom, K. E., Luijendijk,
M. C. M., . . . Adan, R. A. H. (2015). Reducing Ventral Tegmental Dopamine D2
Receptor Expression Selectively Boosts Incentive Motivation.
Neuropsychopharmacology, 40(9), 2085-2095. doi: 10.1038/npp.2015.60
Dews, P. B. (1978). STUDIES ON RESPONDING UNDER FIXED-INTERVAL SCHEDULES
OF REINFORCEMENT: II. THE SCALLOPED PATTERN OF THE CUMULATIVE
RECORD. Journal of the Experimental Analysis of Behavior, 29(1), 67-75. doi:
10.1901/jeab.1978.29-67
Dickinson, A., & Balleine, B. (1994). MOTIVATIONAL CONTROL OF GOAL-DIRECTED
ACTION. Animal Learning & Behavior, 22(1), 1-18. doi: 10.3758/bf03199951
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S., . . .
Balsam, P. D. (2007). Transient Overexpression of Striatal D2 Receptors Impairs Operant
Motivation and Interval Timing. The Journal of Neuroscience, 27(29), 7731-7739. doi:
10.1523/jneurosci.1736-07.2007
Drui, G., Carnicella, S., Carcenac, C., Favier, M., Bertrand, A., Boulet, S., & Savasta, M. (2013).
Loss of dopaminergic nigrostriatal neurons accounts for the motivational and affective
deficits in Parkinson's disease. Molecular Psychiatry, epub ahead of print.
Drui, G., Carnicella, S., Carcenac, C., Favier, M., Bertrand, A., Boulet, S., & Savasta, M. (2014).
Loss of dopaminergic nigrostriatal neurons accounts for the motivational and affective
deficits in Parkinson's disease. Molecular Psychiatry, 19(3), 358-367.
Duffy, E. (1957). The psychological significance of the concept of "arousal" or "activation.".
Psychological Review, 64(5), 265-275. doi: 10.1037/h0048837
Eagle, D. M., Humby, T., Dunnett, S. B., & Robbins, T. W. (1999). Effects of regional striatal
lesions on motor, motivational, and executive aspects of progressive-ratio performance in
rats. Behavioral Neuroscience, 113(4), 718-731. doi: 10.1037//0735-7044.113.4.718
![Page 90: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/90.jpg)
75
Ekstrand, M. I., Terzioglu, M., Galter, D., Zhu, S., Hofstetter, C., Lindqvist, E., . . . Larsson, N.-
G. (2007). Progressive parkinsonism in mice with respiratory-chain-deficient dopamine
neurons. Proceedings of the National Academy of Sciences, 104(4), 1325-1330. doi:
10.1073/pnas.0605208103
Ettenberg, A. (1989). Dopamine, neuroleptics and reinforced behavior. Neuroscience &
Biobehavioral Reviews, 13(2–3), 105-111. doi: http://dx.doi.org/10.1016/S0149-
7634(89)80018-1
Evenden, J. L., & Ryan, C. N. (1996). The pharmacology of impulsive behaviour in rats: the
effects of drugs on response choice with varying delays of reinforcement.
Psychopharmacology, 128(2), 161.
Farrar, A. M., Font, L., Pereira, M., Mingote, S., Bunce, J. G., Chrobak, J. J., & Salamone, J. D.
(2008). Forebrain circuitry involved in effort-related choice: Injections of the GABA(A)
agonist muscimol into ventral pallidum alter response allocation in food-seeking
behavior. Neuroscience, 152(2), 321-330. doi: 10.1016/j.neuroscience.2007.12.034
Farrar, A. M., Segovia, K. N., Randall, P. A., Nunes, E. J., Collins, L. E., Stopper, C. M., . . .
Salamone, J. D. (2010). NUCLEUS ACCUMBENS AND EFFORT-RELATED
FUNCTIONS: BEHAVIORAL AND NEURAL MARKERS OF THE INTERACTIONS
BETWEEN ADENOSINE A(2A) AND DOPAMINE D-2 RECEPTORS. Neuroscience,
166(4), 1056-1067. doi: 10.1016/j.neuroscience.2009.12.056
Ferguson, S. A., & Paule, M. G. (1995). LACK OF EFFECT OF PREFEEDING ON FOOD-
REINFORCED TEMPORAL RESPONSE DIFFERENTIATION AND PROGRESSIVE
RATIO RESPONDING. Behavioural Processes, 34(2), 153-160.
Floresco, S. B. (2007). Dopaminergic regulation of limbic-striatal interplay. Journal of
Psychiatry & Neuroscience, 32(6), 400-411.
Floresco, S. B. (2015). The Nucleus Accumbens: An Interface Between Cognition, Emotion, and
Action. In S. T. Fiske (Ed.), Annual Review of Psychology, Vol 66 (Vol. 66, pp. 25-52).
Floresco, S. B., & Ghods-Sharifi, S. (2007). Amygdala-Prefrontal Cortical Circuitry Regulates
Effort-Based Decision Making. Cerebral Cortex, 17(2), 251-260.
Floresco, S. B., Tse, M. T. L., & Ghods-Sharifi, S. (2008). Dopaminergic and glutamatergic
regulation of effort- and delay-based decision making. Neuropsychopharmacology, 33(8),
1966-1979.
Font, L., Mingote, S., Farrar, A. M., Pereira, M., Worden, L., Stopper, C., . . . Salamone, J. D.
(2008). Intra-accumbens injections of the adenosine A(2A) agonist CGS 21680 affect
effort-related choice behavior in rats. Psychopharmacology, 199(4), 515-526. doi:
10.1007/s00213-008-1174-z
![Page 91: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/91.jpg)
76
Fowler, S. C. (1999). Microbehavioral methods for measuring the motor and associative effects
of dopamine receptor antagonists and related drugs in rodents. Psychopharmacology,
147(1), 8-10. doi: 10.1007/s002130051129
Gallistel, C. R. (1989). Animal cognition: The representation of space, time and number. Annual
Review of Psychology, 40, 155-189. doi: 10.1146/annurev.ps.40.020189.001103
Gallistel, C. R., & Gelman, R. (1992). Preverbal and verbal counting and computation.
Cognition, 44(1?2), 43-74. doi: http://dx.doi.org/10.1016/0010-0277(92)90050-R
Ghods-Sharifi, S., & Floresco, S. B. (2010). Differential Effects on Effort Discounting Induced
by Inactivations of the Nucleus Accumbens Core or Shell. Behavioral Neuroscience,
124(2), 179-191. doi: 10.1037/a0018932
Ghods-Sharifi, S., Onge, J. R. S., & Floresco, S. B. (2009). Fundamental Contribution by the
Basolateral Amygdala to Different Forms of Decision Making. Journal of Neuroscience,
29(16), 5251-5259. doi: 10.1523/jneurosci.0315-09.2009
Girolami, K. M., Kahng, S., Hilker, K. A., & Girolami, P. A. (2009). Differential reinforcement
of high rate behavior to increase the pace of self-feeding. Behavioral Interventions, 24(1),
17-22. doi: 10.1002/bin.273
Goodrich, K. P. (1960). Running speed and drinking rate as functions of sucrose concentration
and amount of consummatory activity. Journal of Comparative and Physiological
Psychology, 53(3), 245-250. doi: 10.1037/h0045984
Gourley, S. L., Lee, A. S., Howell, J. L., Pittenger, C., & Taylor, J. R. (2010). Dissociable
regulation of instrumental action within mouse prefrontal cortex. European Journal of
Neuroscience, 32(10), 1726-1734. doi: 10.1111/j.1460-9568.2010.07438.x
Hall, D. A., Stanis, J. J., Avila, H. M., & Gulley, J. M. (2008). A comparison of amphetamine-
and methamphetamine-induced locomotor activity in rats: Evidence for qualitative
differences in behavior. Psychopharmacology, 195(4), 469-478.
Hamill, S., Trevitt, J. T., Nowend, K. L., Carlson, B. B., & Salamone, J. D. (1999). Nucleus
accumbens dopamine depletions and time-constrained progressive ratio performance:
Effects of different ratio requirements. Pharmacology Biochemistry and Behavior, 64(1),
21-27. doi: 10.1016/s0091-3057(99)00092-1
Hauber, W., & Sommer, S. (2009). Prefrontostriatal Circuitry Regulates Effort-Related Decision
Making. Cerebral Cortex, 19(10), 2240-2247. doi: 10.1093/cercor/bhn241
Hebb, D. O. (1955). Drives and the C. N. S. (conceptual nervous system). Psychological Review,
62(4), 243-254. doi: 10.1037/h0041823
Hillman, K. L., & Bilkey, D. K. (2010). Neurons in the Rat Anterior Cingulate Cortex
Dynamically Encode Cost-Benefit in a Spatial Decision-Making Task. Journal of
Neuroscience, 30(22), 7705-7713. doi: 10.1523/jneurosci.1273-10.2010
![Page 92: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/92.jpg)
77
Hodos, W. (1961). PROGRESSIVE RATION AS A MEASURE OF REWARD STRENGTH.
Science, 134(348), 943-&. doi: 10.1126/science.134.3483.943
Hodos, W., & Kalman, G. (1963). EFFECTS OF INCREMENT SIZE AND REINFORCER
VOLUME ON PROGRESSIVE RATIO PERFORMANCE. Journal of the Experimental
Analysis of Behavior, 6(3), 387-&. doi: 10.1901/jeab.1963.6-387
Hoover, W. B., & Vertes, R. P. (2007). Anatomical analysis of afferent projections to the medial
prefrontal cortex in the rat. Brain Structure & Function, 212(2), 149-179. doi:
10.1007/s00429-007-0150-4
Hosking, J. G., Cocker, P. J., & Winstanley, C. A. (2014). Dissociable Contributions of Anterior
Cingulate Cortex and Basolateral Amygdala on a Rodent Cost/Benefit Decision-Making
Task of Cognitive Effort. Neuropsychopharmacology, 39(7), 1558-1567.
Hosking, J. G., Floresco, S. B., & Winstanley, C. A. (2015). Dopamine Antagonism Decreases
Willingness to Expend Physical, But Not Cognitive, Effort: A Comparison of Two
Rodent Cost/Benefit Decision-Making Tasks. Neuropsychopharmacology, 40(4), 1005-
1015. doi: 10.1038/npp.2014.285
Ikemoto, S., & Panksepp, J. (1996). Dissociations between appetitive and consummatory
responses by pharmacological manipulations of reward-relevant brain regions.
Behavioral Neuroscience, 110(2), 331-345. doi: 10.1037/0735-7044.110.2.331
Kellendonk, C., Simpson, E. H., Polan, H. J., Malleret, G., Vronskaya, S., Winiger, V., . . .
Kandel, E. R. (2006). Transient and Selective Overexpression of Dopamine D2 Receptors
in the Striatum Causes Persistent Abnormalities in Prefrontal Cortex Functioning.
Neuron, 49(4), 603-615. doi: http://dx.doi.org/10.1016/j.neuron.2006.01.023
Kheramin, S., Body, S., Herrera, F. M., Bradshaw, C. M., Szabadi, E., Deakin, J., & Anderson, I.
M. (2005). The effect of orbital prefrontal cortex lesions on performance on a progressive
ratio schedule: implications for models of inter-temporal choice. Behavioural Brain
Research, 156(1), 145-152. doi: 10.1016/j.bbr.2004.05.017
Kheramin, S., Body, S., Ho, M. Y., Velázquez-Martinez, D. N., Bradshaw, C. M., Szabadi, E., . .
. Anderson, I. M. (2004). Effects of orbital prefrontal cortex dopamine depletion on inter-
temporal choice: a quantitative analysis. Psychopharmacology, 175(2), 206-214. doi:
http://dx.doi.org/10.1007/s00213-004-1813-y
Kim, H., Lee, D., & Jung, M. W. (2013). Signals for Previous Goal Choice Persist in the
Dorsomedial, but Not Dorsolateral Striatum of Rats. The Journal of Neuroscience, 33(1),
52-63. doi: 10.1523/jneurosci.2422-12.2013
Kimchi, E. Y., & Laubach, M. (2009). Dynamic Encoding of Action Selection by the Medial
Striatum. The Journal of Neuroscience, 29(10), 3148-3159.
![Page 93: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/93.jpg)
78
Kintsch, W. (1962). Runway performance as function of drive strength and magnitude of
reinforcement. Journal of Comparative and Physiological Psychology, 55(5), 882-887.
doi: 10.1037/h0043377
Knarr, F. A., & Collier, G. (1962). Taste and consummatory activity in amount and gradient of
reinforcement functions. Journal of Experimental Psychology, 63(6), 579-588. doi:
10.1037/h0045175
Koch, M., Schmid, A., & Schnitzler, H. U. (2000). Role of nucleus accumbens dopamine D-1
and D-2 receptors in instrumental and Pavlovian paradigms of conditioned reward.
Psychopharmacology, 152(1), 67-73. doi: 10.1007/s002130000505
Koob, G. F., Riley, S. J., Smith, S. C., & Robbins, T. W. (1978). EFFECTS OF 6-
HYDROXYDOPAMINE LESIONS OF NUCLEUS ACCUMBENS SEPTI AND
OLFACTORY TUBERCLE ON FEEDING, LOCOMOTOR-ACTIVITY, AND
AMPHETAMINE ANOREXIA IN RAT. Journal of Comparative and Physiological
Psychology, 92(5), 917-927. doi: 10.1037/h0077542
Kreitzer, A. C., & Berke, J. D. (2011). Investigating striatal function through cell-type-specific
manipulations. Neuroscience, 198, 19-26. doi:
http://dx.doi.org/10.1016/j.neuroscience.2011.08.018
Lindvall, O., Björklund, A., & Divac, I. (1978). Organization of catecholamine neurons
projecting to the frontal cortex in the rat. Brain Research, 142(1), 1-24. doi:
http://dx.doi.org/10.1016/0006-8993(78)90173-7
Logan, F. A. (1966). Continuously negatively correlated amount of reinforcement. Journal of
Comparative and Physiological Psychology, 62(1), 31-34. doi: 10.1037/h0023486
Loos, M., Pattij, T., Janssen, M. C. W., Counotte, D. S., Schoffelmeer, A. N. M., Smit, A. B., . . .
van Gaalen, M. M. (2010). Dopamine Receptor D1/D5 Gene Expression in the Medial
Prefrontal Cortex Predicts Impulsive Choice in Rats. Cerebral Cortex, 20(5), 1064-1070.
doi: 10.1093/cercor/bhp167
Maldonado-Irizarry, C. S., & Kelley, A. E. (1994). Differential behavioral effects following
microinjection of an NMDA antagonist into nucleus accumbens subregions.
Psychopharmacology, 116(1), 65-72. doi: 10.1007/BF02244872
Marchant, N. J., Whitaker, L. R., Bossert, J. M., Harvey, B. K., Hope, B. T., Kaganovsky, K., . . .
Shaham, Y. (2016). Behavioral and Physiological Effects of a Novel Kappa-Opioid
Receptor-Based DREADD in Rats. Neuropsychopharmacology, 41(2), 402-409. doi:
10.1038/npp.2015.149
Marwine, A. G., & Collier, G. (1971). Instrumental and consummatory behavior as a function of
rate of weight loss and weight maintenance schedule. Journal of Comparative and
Physiological Psychology, 74(3), 441-447. doi: 10.1037/h0030577
![Page 94: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/94.jpg)
79
Mayorga, A. J., Popke, E. J., Fogle, C. M., & Paule, M. G. (2000). Similar effects of
amphetamine and methylphenidate on the performance of complex operant tasks in rats.
Behavioural Brain Research, 109(1), 59-68. doi: 10.1016/s0166-4328(99)00165-5
McCullough, L. D., Cousins, M. S., & Salamone, J. D. (1993). THE ROLE OF NUCLEUS-
ACCUMBENS DOPAMINE IN RESPONDING ON A CONTINUOUS
REINFORCEMENT OPERANT SCHEDULE - A NEUROCHEMICAL AND
BEHAVIORAL-STUDY. Pharmacology Biochemistry and Behavior, 46(3), 581-586.
McCullough, L. D., & Salamone, J. D. (1992). Involvement of nucleus accumbens dopamine in
the motor activity induced by periodic food presentation: a microdialysis and behavioral
study. Brain Research, 592(1), 29-36.
McDowell, J. J. (2013). On the theoretical and empirical status of the matching law and
matching theory. Psychological Bulletin, 139(5), 1000-1028. doi: 10.1037/a0029924
McGinty, V. B., Hayden, B. Y., Heilbronner, S. R., Dumont, E. C., Graves, S. M., Mirrione, M.
M., . . . Haber, S. (2011). Emerging, reemerging, and forgotten brain areas of the reward
circuit: Notes from the 2010 Motivational Neural Networks conference. Behavioural
Brain Research, 225(1), 348-357. doi: 10.1016/j.bbr.2011.07.036
Mechner, F. (1958). PROBABILITY RELATIONS WITHIN RESPONSE SEQUENCES
UNDER RATIO REINFORCEMENT1. Journal of the Experimental Analysis of
Behavior, 1(2), 109-121. doi: 10.1901/jeab.1958.1-109
Mingote, S., Font, L., Farrar, A. M., Vontell, R., Worden, L. T., Stopper, C. M., . . . Salamone, J.
D. (2008). Nucleus accumbens adenosine A(2A) receptors regulate exertion of effort by
acting on the ventral striatopallidal pathway. Journal of Neuroscience, 28(36), 9037-
9046. doi: 10.1523/jneurosci.1525-08.2008
Mobini, S., Body, S., Ho, M. Y., Bradshaw, C. M., Szabadi, E., Deakin, J. F. W., & Anderson, I.
M. (2002). Effects of lesions of the orbitofrontal cortex on sensitivity to delayed and
probabilistic reinforcement. Psychopharmacology, 160(3), 290-298. doi:
10.1007/s00213-001-0983-0
Mott, A. M., Nunes, E. J., Collins, L. E., Port, R. G., Sink, K. S., Hockemeyer, J., . . . Salamone,
J. D. (2009). The adenosine A(2A) antagonist MSX-3 reverses the effects of the
dopamine antagonist haloperidol on effort-related decision making in a T-maze
cost/benefit procedure. Psychopharmacology, 204(1), 103-112.
Naleid, A. M., Grace, M. K., Cummings, D. E., & Levine, A. S. (2005). Ghrelin induces feeding
in the mesolimbic reward pathway between the ventral tegmental area and the nucleus
accumbens. Peptides, 26(11), 2274-2279.
Nowend, K. L., Arizzi, M., Carlson, B. B., & Salamone, J. D. (2001). D1 or D2 antagonism in
nucleus accumbens core or dorsomedial shell suppresses lever pressing for food but leads
to compensatory increases in chow consumption. Pharmacology Biochemistry and
Behavior, 69(3-4), 373-382. doi: 10.1016/s0091-3057(01)00524-x
![Page 95: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/95.jpg)
80
Olarte-Sanchez, C. M., Valencia-Torres, L., Cassaday, H. J., Bradshaw, C. M., & Szabadi, E.
(2013). Effects of SKF-83566 and haloperidol on performance on progressive ratio
schedules maintained by sucrose and corn oil reinforcement: quantitative analysis using a
new model derived from the Mathematical Principles of Reinforcement (MPR).
Psychopharmacology, 230(4), 617-630. doi: 10.1007/s00213-013-3189-3
Panigrahi, B., Martin, K. A., Li, Y., Graves, A. R., Vollmer, A., Olson, L., . . . Dudman, J. T.
(2015). Dopamine Is Required for the Neural Representation and Control of Movement
Vigor. Cell, 162(6), 1418-1430. doi: 10.1016/j.cell.2015.08.014
Parent, A., & Hazrati, L.-N. (1995). Functional anatomy of the basal ganglia. II. The place of
subthalamic nucleus and external pallidium in basal ganglia circuitry. Brain Research
Reviews, 20(1), 128-154. doi: http://dx.doi.org/10.1016/0165-0173(94)00008-D
Perello, M., Sakata, I., Birnbaum, S., Chuang, J.-C., Osborne-Lawrence, S., Rovinsky, S. A., . . .
Zigman, J. M. (2010). Ghrelin Increases the Rewarding Value of High-Fat Diet in an
Orexin-Dependent Manner. Biological Psychiatry, 67(9), 880-886. doi:
http://dx.doi.org/10.1016/j.biopsych.2009.10.030
Pfaff, D. W., Martin, E. M., & Faber, D. (2012). Origins of arousal: roles for medullary reticular
neurons. Trends in Neurosciences, 35(8), 468-476. doi:
http://dx.doi.org/10.1016/j.tins.2012.04.008
Pizzo, M. J., Kirkpatrick, K., & Blundell, P. J. (2009). THE EFFECT OF CHANGES IN
CRITERION VALUE ON DIFFERENTIAL REINFORCEMENT OF LOW RATE
SCHEDULE PERFORMANCE. Journal of the Experimental Analysis of Behavior,
92(2), 181-198. doi: 10.1901/jeab.2009.92-181
Platt, J. R., & Johnson, D. M. (1971). Localization of position within a homogeneous behavior
chain: Effects of error contingencies. Learning and Motivation, 2(4), 386-414. doi:
http://dx.doi.org/10.1016/0023-9690(71)90020-8
Randall, P. A., Lee, C. A., Nunes, E. J., Yohn, S. E., Nowak, V., Khan, B., . . . Salamone, J. D.
(2014). The VMAT-2 Inhibitor Tetrabenazine Affects Effort-Related Decision Making in
a Progressive Ratio/Chow Feeding Choice Task: Reversal with Antidepressant Drugs.
Plos One, 9(6). doi: 10.1371/journal.pone.0099320
Randall, P. A., Pardo, M., Nunes, E. J., Cruz, L. L., Vemuri, V. K., Makriyannis, A., . . .
Salamone, J. D. (2012). Dopaminergic Modulation of Effort-Related Choice Behavior as
Assessed by a Progressive Ratio Chow Feeding Choice Task: Pharmacological Studies
and the Role of Individual Differences. Plos One, 7(10).
Richter, C. P. (1927). Animal Behavior and Internal Drives. The Quarterly Review of Biology,
2(3), 307-343.
Rickard, J. F., Body, S., Zhang, Z., Bradshaw, C. M., & Szabadi, E. (2009). EFFECT OF
REINFORCER MAGNITUDE ON PERFORMANCE MAINTAINED BY
![Page 96: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/96.jpg)
81
PROGRESSIVE-RATIO SCHEDULES. Journal of the Experimental Analysis of
Behavior, 91(1), 75-87. doi: 10.1901/jeab.2009.91-75
Robbins, T. W., & Koob, G. F. (1980). Selective disruption of displacement behaviour by lesions
of the mesolimbic dopamine system. Nature, 285(5764), 409-412.
Roberts, S. (1981). Isolation of an internal clock. Journal of Experimental Psychology: Animal
Behavior Processes, 7(3), 242-268. doi: 10.1037/0097-7403.7.3.242
Rudebeck, P. H., Walton, M. E., Smyth, A. N., Bannerman, D. M., & Rushworth, M. F. S.
(2006). Separate neural pathways process different decision costs. Nature Neuroscience,
9(9), 1161-1168. doi: 10.1038/nn1756
Salamone, J. D. (1991). BEHAVIORAL PHARMACOLOGY OF DOPAMINE SYSTEMS - A
NEW SYNTHESIS.
Salamone, J. D. (1992). COMPLEX MOTOR AND SENSORIMOTRO FUNCTIONS OF STRIATAL AND
ACCUMBENS DOPAMINE - INVOLVEMENT IN INSTRUMENTAL BEHAVIOR PROCESSES.
Psychopharmacology, 107(2-3), 160- 174.
Salamone, J. D., Arizzi, M. N., Sandoval, M. D., Cervone, K. M., & Aberman, J. E. (2002).
Dopamine antagonists alter response allocation but do not suppress appetite for food in
rats: contrast between the effects of SKF 83566, raclopride, and fenfluramine on a
concurrent choice task. Psychopharmacology, 160(4), 371-380.
Salamone, J. D., & Correa, M. (2002). Motivational views of reinforcement: implications for
understanding the behavioral functions of nucleus accumbens dopamine. Behavioural
Brain Research, 137(1-2), 3-25. doi: 10.1016/s0166-4328(02)00282-6
Salamone, J. D., & Correa, M. (2009). Dopamine/adenosine interactions involved in effort-
related aspects of food motivation. Appetite, 53(3), 422-425.
Salamone, J. D., Correa, M., Mingote, S. M., & Weber, S. M. (2005). Beyond the reward
hypothesis: alternative functions of nucleus accumbens dopamine. Current Opinion in
Pharmacology, 5(1), 34-41. doi: 10.1016/j.coph.2004.09.004
Salamone, J. D., Correa, M., Nunes, E. J., Randall, P. A., & Pardo, M. (2012). THE
BEHAVIORAL PHARMACOLOGY OF EFFORT-RELATED CHOICE BEHAVIOR:
DOPAMINE, ADENOSINE AND BEYOND. Journal of the Experimental Analysis of
Behavior, 97(1), 125-146. doi: 10.1901/jeab.2012.97-125
Salamone, J. D., Cousins, M. S., & Bucher, S. (1994). ANHEDONIA OR ANERGIA -
EFFECTS OF HALOPERIDOL AND NUCLEUS-ACCUMBENS DOPAMINE
DEPLETION ON INSTRUMENTAL RESPONSE SELECTION IN A T-MAZE COST-
BENEFIT PROCEDURE. Behavioural Brain Research, 65(2), 221-229. doi:
10.1016/0166-4328(94)90108-2
![Page 97: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/97.jpg)
82
Salamone, J. D., Pardo, M., Yohn, S. E., López-Cruz, L., SanMiguel, N., & Correa, M. (2015).
Mesolimbic Dopamine and the Regulation of Motivated Behavior. Current topics in
behavioral neurosciences.
Sanders, A. C., Hussain, A. J., Hen, R., & Zhuang, X. X. (2007). Chronic blockade or
constitutive deletion of the serotonin transporter reduces operant responding for food
reward. Neuropsychopharmacology, 32(11), 2321-2329. doi: 10.1038/sj.npp.1301368
Schweimer, J., & Hauber, W. (2005). Involvement of the rat anterior cingulate cortex in control
of instrumental responses guided by reward expectancy. Learning & Memory, 12(3),
334-342. doi: 10.1101/lm.90605
Schweimer, J., & Hauber, W. (2006). Dopamine D1 receptors in the anterior cingulate cortex
regulate effort-based decision making. Learning & Memory, 13(6), 777-782. doi:
10.1101/lm.409306
Schweimer, J., Saft, S., & Hauber, W. (2005). Involvement of catecholamine neurotransmission
in the rat anterior cingulate in effort-related decision making. Behavioral Neuroscience,
119(6), 1687-1692. doi: 10.1037/0735-7044.119.6.1687
Sharf, R., Lee, D. Y., & Ranaldi, R. (2005). Microinjections of SCH 23390 in the ventral
tegmental area reduce operant responding under a progressive ratio schedule of food
reinforcement in rats. Brain Research, 1033(2), 179-185.
Simmons, D. A., & Neill, D. B. (2009). FUNCTIONAL INTERACTION BETWEEN THE
BASOLATERAL AMYGDALA AND THE NUCLEUS ACCUMBENS UNDERLIES
INCENTIVE MOTIVATION FOR FOOD REWARD ON A FIXED RATIO
SCHEDULE. Neuroscience, 159(4), 1264-1273. doi: 10.1016/j.neuroscience.2009.01.026
Simón, V. M., Parra, A., Miñarro, J., Arenas, M. C., Vinader-Caerols, C., & Aguilar, M. A.
(2000). Predicting how equipotent doses of chlorpromazine, haloperidol, sulpiride,
raclopride and clozapine reduce locomotor activity in mice. European
Neuropsychopharmacology, 10(3), 159-164.
Simpson, E. H., & Balsam, P. D. (2015). The Behavioral Neuroscience of Motivation: An
Overview of Concepts, Measures, and Translational Applications (pp. 1-12). Berlin,
Heidelberg: Springer Berlin Heidelberg.
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S., . . .
Balsam, P. D. (2011). Pharmacologic Rescue of Motivational Deficit in an Animal Model
of the Negative Symptoms of Schizophrenia. Biological Psychiatry, 69(10), 928-935. doi:
http://dx.doi.org/10.1016/j.biopsych.2011.01.012
Simpson, E. H., Winiger, V., Biezonski, D. K., Haq, I., Kandel, E. R., & Kellendonk, C. (2014).
Selective Overexpression of Dopamine D3 Receptors in the Striatum Disrupts Motivation
but not Cognition. Biological Psychiatry, 76(10), 823-831. doi:
http://dx.doi.org/10.1016/j.biopsych.2013.11.023
![Page 98: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/98.jpg)
83
Sink, K. S., Vemuri, V. K., Olszewska, T., Makriyannis, A., & Salamone, J. D. (2008).
Cannabinoid CB1 antagonists and dopamine antagonists produce different effects on a
task involving response allocation and effort-related choice in food-seeking behavior.
Psychopharmacology, 196(4), 565-574. doi: 10.1007/s00213-007-0988-4
Skibicka, K. P., Hansson, C., Alvarez-Crespo, M., Friberg, P. A., & Dickson, S. L. (2011).
GHRELIN DIRECTLY TARGETS THE VENTRAL TEGMENTAL AREA TO
INCREASE FOOD MOTIVATION. Neuroscience, 180, 129-137.
Skibicka, K. P., Shirazi, R. H., Hansson, C., & Dickson, S. L. (2012). Ghrelin Interacts with
Neuropeptide Y Y1 and Opioid Receptors to Increase Food Reward. Endocrinology,
153(3), 1194-1205. doi: 10.1210/en.2011-1606
Skibicka, K. P., Shirazi, R. H., Rabasa-Papio, C., Alvarez-Crespo, M., Neuber, C., Vogel, H., &
Dickson, S. L. (2013). Divergent circuitry underlying food reward and intake effects of
ghrelin: Dopaminergic VTA-accumbens projection mediates ghrelin's effect on food
reward but not food intake. Neuropharmacology, 73, 274-283.
Skjoldager, P., Pierre, P. J., & Mittleman, G. (1993). REINFORCER MAGNITUDE AND
PROGRESSIVE RATIO RESPONDING IN THE RAT - EFFECTS OF INCREASED
EFFORT, PREFEEDING, AND EXTINCTION. Learning and Motivation, 24(3), 303-
343. doi: 10.1006/lmot.1993.1019
Sokolowski, J. D., & Salamone, J. D. (1998). The role of accumbens dopamine in lever pressing
and response allocation: Effects of 6-OHDA injected into core and dorsomedial shell.
Pharmacology Biochemistry and Behavior, 59(3), 557-566. doi: 10.1016/s0091-
3057(97)00544-3
Sommer, S., Danysz, W., Russ, H., Valastro, B., Flik, G., & Hauber, W. (2014). The dopamine
reuptake inhibitor MRZ-9547 increases progressive ratio responding in rats. International
Journal of Neuropsychopharmacology, 17(12), 2045-2056.
St Onge, J. R., Ahn, S., Phillips, A. G., & Floresco, S. B. (2011). Dynamic and dissociable
fluctuations in prefrontal and ventral striatal dopamine efflux during risk-based decision
making. Society for Neuroscience Abstract Viewer and Itinerary Planner, 41.
St Onge, J. R., & Floresco, S. B. (2009). Dopaminergic Modulation of Risk-Based Decision
Making. Neuropsychopharmacology, 34(3), 681-697. doi: 10.1038/npp.2008.121
St Onge, J. R., & Floresco, S. B. (2010). Prefrontal Cortical Contribution to Risk-Based Decision
Making. Cerebral Cortex, 20(8), 1816-1828. doi: 10.1093/cercor/bhp250
St. Onge, J. R., Chiu, Y. C., & Floresco, S. B. (2010). Differential effects of dopaminergic
manipulations on risky choice. Psychopharmacology, 211(2), 209-221. doi:
10.1007/s00213-010-1883-y
![Page 99: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/99.jpg)
84
Stopper, C. M., & Floresco, S. B. (2011). Contributions of the nucleus accumbens and its
subregions to different aspects of risk-based decision making. Cognitive Affective &
Behavioral Neuroscience, 11(1), 97-112. doi: 10.3758/s13415-010-0015-9
Stopper, C. M., Green, E. B., & Floresco, S. B. (2014). Selective Involvement by the Medial
Orbitofrontal Cortex in Biasing Risky, But Not Impulsive, Choice. Cerebral Cortex,
24(1), 154-162. doi: 10.1093/cercor/bhs297
Stopper, C. M., Khayambashi, S., & Floresco, S. B. (2013). Receptor-Specific Modulation of
Risk-Based Decision Making by Nucleus Accumbens Dopamine.
Neuropsychopharmacology, 38(5), 715-728. doi: 10.1038/npp.2012.240
Stopper, C. M., Khayambashi, S., Kelly, D., & Floresco, S. B. (2010). Regulation of risk-based
decision making by dopamine D1 and D2 receptors in the nucleus accumbens. Society for
Neuroscience Abstract Viewer and Itinerary Planner, 40.
Tanno, T., & Silberberg, A. (2014). Inter-response-time reinforcement and relative reinforcer
frequency control choice. Learning & Behavior, 43(1), 54-71.
Tanno, T., Silberberg, A., & Sakagami, T. (2012). DISCRIMINATION OF VARIABLE
SCHEDULES IS CONTROLLED BY INTERRESPONSE TIMES PROXIMAL TO
REINFORCEMENT. Journal of the Experimental Analysis of Behavior, 98(3), 341-354.
doi: 10.1901/jeab.2012.98-341
Trifilieff, P., Feng, B., Urizar, E., Winiger, V., Ward, R. D., Taylor, K. M., . . . Javitch, J. A.
(2013). Increasing dopamine D2 receptor expression in the adult nucleus accumbens
enhances motivation. Mol Psychiatry, 18(9), 1025-1033. doi: 10.1038/mp.2013.57
Uslaner, J. M., & Robinson, T. E. (2006). Subthalamic nucleus lesions increase impulsive action
and decrease impulsive choice − mediation by enhanced incentive motivation? European
Journal of Neuroscience, 24(8), 2345-2354. doi: 10.1111/j.1460-9568.2006.05117.x
van Gaalen, M. M., van Koten, R., Schoffelmeer, A. N. M., & Vanderschuren, L. J. M. J. (2006).
Critical Involvement of Dopaminergic Neurotransmission in Impulsive Decision Making.
Biological Psychiatry, 60(1), 66-73.
Vogel, J. R., Mikulka, P. J., & Spear, N. E. (1968). Effects of shifts in sucrose and saccharine
concentrations on licking behavior in the rat. Journal of Comparative and Physiological
Psychology, 66(3, Pt.1), 661-666. doi: 10.1037/h0026556
Wade, T. R., de Wit, H., & Richards, J. B. (2000). Effects of dopaminergic drugs on delayed
reward as a measure of impulsive behavior in rats. Psychopharmacology, 150(1), 90-101.
doi: 10.1007/s002130000402
Wallace, M., Singer, G., Finlay, J., & Gibson, S. (1983). The effect of 6-OHDA lesions of the
nucleus accumbens septum on schedule-induced drinking, wheelrunning and
corticosterone levels in the rat. Pharmacology Biochemistry and Behavior, 18(1), 129-
136. doi: http://dx.doi.org/10.1016/0091-3057(83)90262-9
![Page 100: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/100.jpg)
85
Walton, M. E., Bannerman, D. M., Alterescu, K., & Rushworth, M. F. S. (2003). Functional
Specialization within Medial Frontal Cortex of the Anterior Cingulate for Evaluating
Effort-Related Decisions. The Journal of Neuroscience, 23(16), 6475-6479.
Walton, M. E., Bannerman, D. M., & Rushworth, M. F. S. (2002). The Role of Rat Medial
Frontal Cortex in Effort-Based Decision Making. The Journal of Neuroscience, 22(24),
10996-11003.
Walton, M. E., Croxson, P. L., Rushworth, M. F. S., & Bannerman, D. M. (2005). The
Mesocortical Dopamine Projection to Anterior Cingulate Cortex Plays No Role in
Guiding Effort-Related Decisions. Behavioral Neuroscience, 119(1), 323-328. doi:
10.1037/0735-7044.119.1.323
Walton, M. E., Groves, J., Jennings, K. A., Croxson, P. L., Sharp, T., Rushworth, M. F. S., &
Bannerman, D. M. (2009). Comparing the role of the anterior cingulate cortex and 6-
hydroxydopamine nucleus accumbens lesions on operant effort-based decision making.
European Journal of Neuroscience, 29(8), 1678-1691.
Wang, A. Y., Miura, K., & Uchida, N. (2013). The dorsomedial striatum encodes net expected
return, critical for energizing performance vigor. Nature Neuroscience, 16(5), 639-+. doi:
10.1038/nn.3377
Ward, R. D., Kellendonk, C., Simpson, E. H., Lipatova, O., Drew, M. R., Fairhurst, S., . . .
Balsam, P. D. (2009). Impaired timing precision produced by striatal D2 receptor
overexpression is mediated by cognitive and motivational deficits. Behavioral
Neuroscience, 123(4), 720-730. doi: 10.1037/a0016503
Ward, R. D., Simpson, E. H., Richards, V. L., Deo, G., Taylor, K., Glendinning, J. I., . . .
Balsam, P. D. (2012). Dissociation of Hedonic Reaction to Reward and Incentive
Motivation in an Animal Model of the Negative Symptoms of Schizophrenia.
Neuropsychopharmacology, 37(7), 1699-1707.
Weinberg, Z. Y., Nicholson, M. L., & Currie, P. J. (2011). 6-Hydroxydopamine lesions of the
ventral tegmental area suppress ghrelin's ability to elicit food-reinforced behavior.
Neuroscience Letters, 499(2), 70-73. doi: 10.1016/j.neulet.2011.05.034
Winstanley, C. A., Baunez, C., Theobald, D. E. H., & Robbins, T. W. (2005). Lesions to the
subthalamic nucleus decrease impulsive choice but impair autoshaping in rats: the
importance of the basal ganglia in Pavlovian conditioning and impulse control. European
Journal of Neuroscience, 21(11), 3107-3116. doi: 10.1111/j.1460-9568.2005.04143.x
Winstanley, C. A., Dalley, J. W., Theobald, D. E. H., & Robbins, T. W. (2003). Global 5-HT
depletion attenuates the ability of amphetamine to decrease impulsive choice on a delay-
discounting task in rats. Psychopharmacology, 170(3), 320-331. doi: 10.1007/s00213-
003-1546-3
![Page 101: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/101.jpg)
86
Winstanley, C. A., Theobald, D. E. H., Cardinal, R. N., & Robbins, T. W. (2004). Contrasting
Roles of Basolateral Amygdala and Orbitofrontal Cortex in Impulsive Choice. The
Journal of Neuroscience, 24(20), 4718-4722. doi: 10.1523/jneurosci.5606-03.2004
Winstanley, C. A., Theobald, D. E. H., Cardinal, R. N., & Robbins, T. W. (2004). Contrasting
roles of basolateral amygdala and orbitofrontal cortex in impulsive choice. Journal of
Neuroscience, 24(20), 4718-4722. doi: 10.1523/jneurosci.5606-03.2004
Winstanley, C. A., Theobald, D. E. H., Dalley, J. W., Cardinal, R. N., & Robbins, T. W. (2006).
Double dissociation between serotonergic and dopaminergic modulation of medial
prefrontal and orbitofrontal cortex during a test of impulsive choice. Cerebral Cortex,
16(1), 106-114. doi: 10.1093/cercor/bhi088
Winstanley, C. A., Theobald, D. E. H., Dalley, J. W., & Robbins, T. W. (2005). Interactions
between serotonin and dopamine in the control of impulsive choice in rats: Therapeutic
implications for impulse control disorders. Neuropsychopharmacology, 30(4), 669-682.
doi: 10.1038/sj.npp.1300610
Worden, L. T., Shahriari, M., Farrar, A. M., Sink, K. S., Hockemeyer, J., Muller, C. E., &
Salamone, J. D. (2009). The adenosine A(2A) antagonist MSX-3 reverses the effort-
related effects of dopamine blockade: differential interaction with D1 and D2 family
antagonists. Psychopharmacology, 203(3), 489-499. doi: 10.1007/s00213-008-1396-0
Wu, M., Brudzynski, S. M., & Mogenson, G. J. (1993). Functional interaction of dopamine and
glutamate in the nucleus accumbens in the regulation of locomotion. Canadian Journal of
Physiology and Pharmacology, 71(5-6), 407-413. doi: 10.1139/y93-061
Zeeb, F. D., Floresco, S. B., & Winstanley, C. A. (2010). Contributions of the orbitofrontal
cortex to impulsive choice: interactions with basal levels of impulsivity, dopamine
signalling, and reward-related cues. Psychopharmacology, 211(1), 87-98.
Zhuang, X., Oosting, R. S., Jones, S. R., Gainetdinov, R. R., Miller, G. W., Caron, M. G., &
Hen, R. (2001). Hyperactivity and impaired response habituation in hyperdopaminergic
mice. Proceedings of the National Academy of Sciences, 98(4), 1982-1987. doi:
10.1073/pnas.98.4.1982
![Page 102: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/102.jpg)
87
Chapter 3
A Novel Strategy for Dissecting Goal-Directed Action and Arousal
Components of Motivated Behavior With a Progressive Hold-Down Task
Abstract
Motivation serves 2 important functions: It guides actions to be goal-directed, and it
provides the energy and vigor required to perform the work necessary to meet those goals.
Dissociating these 2 processes with existing behavioral assays has been a challenge. In this
article, we report a novel experimental strategy to distinguish the 2 processes in mice. First, we
characterize a novel motivation assay in which animals must hold down a lever for progressively
longer intervals to earn each subsequent reward; we call this the progressive hold-down (PHD)
task. We find that performance on the PHD task is sensitive to both food deprivation level and
reward value. Next, we use a dose of methamphetamine (METH) 1.0 mg/kg, to evaluate
behavior in both the progressive ratio (PR) and PHD tasks. Treatment with METH leads to more
persistent lever pressing for food rewards in the PR. In the PHD task, we found that METH
increased arousal, which leads to numerous bouts of hyperactive responding but neither increases
nor impairs goal-directed action. The results demonstrate that these tools enable a more precise
understanding of the underlying processes being altered in manipulations that alter motivated
behavior.
![Page 103: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/103.jpg)
88
Introduction
Motivation drives us to execute actions and provides the vigor needed to overcome
obstacles and achieve goals. Decades of research has led to the recognition that motivated
behavior is complex, consisting of multiple interacting components that can be dissociated at
both the behavioral and neural levels (Berridge & Robinson, 2003; Kelley, 2004; Salamone &
Correa, 2002; Zhang et al., 2003). One long-recognized example is that motivation consists of
both a goaldirected, directional component and an increased arousal, activational component
(Duffy, 1957; Hebb, 1955; Salamone, 1988). Despite several recent advances in understanding
the neurobiology of motivation (Gore et al., 2013; Trifilieff et al., 2013), most studies do not
dissociate the directional and activational components of motivated behavior. Additionally, the
progress that has been made studying goal-directed action selection (Kim et al., 2013; Kimchi &
Laubach, 2009) and general arousal (Anaclet et al., 2009; Pfaff et al., 2012) has largely been
made studying these processes in isolation. Thus, experimentally dissociating between goal-
directed and activational processes remains a challenge. A more comprehensive understanding of
motivation would be possible with a strategy of implementing behavioral assays that can detect
changes in goal-directed behavior, alterations in arousal/response vigor, or changes in both of
these processes.
The progressive ratio (PR) schedule is frequently used to assay motivation (Bradshaw &
Killeen, 2012). In this task, subjects must increase the number of responses made to earn
subsequent rewards. The point at which a subject quits working for rewards is called the
breakpoint (BP) and serves as an index of motivation (Aberman et al., 1998; Hodos, 1961).
Assays like the PR, however, do not clearly indicate which of the two processes of motivation
are altered because they are not designed to distinguish increases in goal-directed responses from
![Page 104: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/104.jpg)
89
those arising as a consequence of enhanced arousal. For example, mice with reduced expression
of the dopamine transporter (DAT-knockdown) have chronically elevated extracellular dopamine
levels. These DAT-knockdown mice make more lever presses and reach higher BPs in a PR for
food rewards (Cagniard et al., 2006) but also show increased locomotor activity in a novel
environment (Zhuang et al., 2001). The higher BP could be the result of increased goal-directed
action, increased arousal, or some interaction between the two processes.
The effects of psychostimulants, like amphetamines, represent another example of the
challenge of measuring motivation. In a PR for food rewards, amphetamine increases subjects’
BPs (Mayorga et al., 2000; Olausson et al., 2006) and also increases arousal across multiple
measures, including locomotor activity (Hall et al., 2008; McNamara et al., 1993), wakefulness
(Berridge, 2006), and the activity of neurons that promote arousal (Estabrooke et al., 2001).
Numerous pharmacological and genetic manipulations can alter motivation in the PR and overall
levels of locomotor activity in an open field test (see Figure 1). Developing a behavioral assay to
separate these components would facilitate the neurobehavioral analysis of motivation, as well as
have important practical implications for the treatment of motivational disorders. Impairments in
motivation related to behavioral activation and the willingness to expend effort to accomplish
goals are a clinically recognized problem for patients with schizophrenia and some affective
disorders (Demyttenaere et al., 2005; Salamone et al., 2006; Stahl, 2002; Treadway & Zald,
2011; Tylee et al., 1999). Despite this awareness, at the present time no effective
pharmacological interventions exist for these specific symptoms (Chase, 2011; Levy &
Czernecki, 2006).
We here report a new strategy to differentiate behavior into goal-directed action and
arousal. The strategy involves using the PR task along with a novel procedure that involves
![Page 105: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/105.jpg)
90
making a sustained response. Whereas increased goal-directed motivation increases responding
in both tasks, hyperactivity would increase responding in the PR but make behavior much less
efficient when a long-duration sustained response is required. We call this task the progressive
hold-down (PHD) task because subjects are required to hold a lever down for progressively
longer durations to earn subsequent rewards.
Figure 1. Relationship between progressive ratio (PR) performance and locomotor activity. Data
for both PR performance and locomotor activity is expressed as a log ratio of treatment condition
(transgenic or drug treated) divided by control condition (Wildtype or Vehicle), [e.g,. log (DAT
KD PR performance/WT PR performance)]. Bold = genetic manipulation; Italics =
pharmacological manipulation. Data for PR performance replotted from dopamine transporter
knockdown (DAT KD; Cagniard et al., 2006), SERT KD (Sanders et al., 2007), Fluoxetine
(Sanders et al., 2007), Amphetamine (Mayorga et al., 2000), D2R-OE (Simpson et al., 2011),
Haloperidol (Aberman et al., 1998), Raclopride (Aberman et al., 1998), MSX-3 (Randall et al.,
2012). Data for locomotor activity replotted from DAT KD (Zhuang et al., 2001), SERT KD
(Sanders et al., 2007), Fluoxetine (Sanders et al., 2007), Amphetamine (Hall et al., 2008), D2R-
OE (Kellendonk et al., 2006), Haloperidol (Simón et al., 2000), Raclopride (Simón et al., 2000),
MSX-3 (Antoniou et al., 2005).
We first validate the PHD task as a measure of motivation by showing that it is sensitive to both
levels of food deprivation and reward magnitude. Next, to evaluate the effectiveness of this
strategy, we look at how performance on both PR and PHD task is affected by a dose of
![Page 106: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/106.jpg)
91
methamphetamine (METH; 1 mg/kg) that is known to induce hyperactivity. If METH increases
goal-directed motivation, then performance on both PR and PHD should be enhanced; but if
METH alters general arousal– hyperactivity then we would expect to see increased responding
on the PR but impaired performance on the PHD task.
METHOD
Subjects
Subjects were C57BL/6J:129SvEvTac F1 hybrid female mice 120 days of age and
weighing 24 g to 29 g at the start of the experiment. Mice were limited to 1.5 hr of food made
available 1 hr after each behavioral testing session to motivate them to earn rewards of
evaporated liquid milk. The one exception to this was when we ran the PHD task on a cohort of
mice undergoing a restricted diet, during which mice received a set amount of food each day to
be maintained at 85% of their ad lib body weight. Water was available ad libitum in home cages
throughout the entire experiment, and subjects were maintained in a 12:12 light– dark schedule
and tested during the light phase.
Separate groups of mice were used for the METH PR experiment (METH–PR, n = 8), the
PHD under restricted feeding and reward magnitude experiments (PHD manipulations, n = 12),
and the METH PHD experiment (METH–PHD, n = 13). All animal procedures were performed
in accordance with Columbia University’s animal care committee’s regulations.
Apparatus
Experimental chambers (ENV-307w; Med Associates, St. Albans, VT) equipped with
liquid dippers were used in the experiment. Unless otherwise noted, the apparatus was identical
to that used by Drew and colleagues (2007). Two retractable levers were mounted on either side
of a feeding trough, and a house light (Model 1820; Med Associates, St. Albans, VT) located at
![Page 107: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/107.jpg)
92
the top of the chamber was used to illuminate the chamber during the sessions. Rewards
consisted of evaporated milk (.01 ml) delivered by raising a dipper located inside the feeder
trough.
Behavioral Procedures
Subjects in the PHD experiments were trained to press levers for milk rewards using the
procedure described by Drew and colleagues (2007). Once proficient at earning rewards on a
continuous reinforcement schedule, subjects were then trained to hold the lever down.
Lever hold-down procedures.
Subjects in all PHD experiments were exposed to two different hold-down procedures:
variable interval hold down (VIH) and PHD. In both schedules, a required hold duration was
assigned prior to the start of each trial. This was the duration of time the subject was required to
hold the lever in the depressed position to receive a reward. An individual trial in either schedule
followed a similar procedure: At the start of each trial, the house light was illuminated and a
lever was extended. As soon as the mouse depressed the lever, a timer began counting how long
the lever was in the depressed position. This timer stopped and was reset to 0.0s if the mouse
ended the lever press before the required time was reached. If the lever was depressed as long as
the required duration, the trial ended, and the subject received a reward. A tone (2 s) sounded and
the house light was shut off to signal the presentation of the dipper (5 s).
VIH training.
As in the PR experiment, all subjects were given initial lever press training, as described
by Drew and colleagues (2007). Next, subjects were trained using the VIH task. At the beginning
of each trial, the required hold duration was drawn randomly from a truncated exponential
distribution. This hold requirement remained in place until the subject was reinforced for
![Page 108: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/108.jpg)
93
completing the trial, at which time the next trial’s required hold duration was randomly
determined. During the first session, the distribution of required hold durations had a mean of 0.5
s; (minimum = .01 s; maximum = 2.44 s). When a mouse earned 40 rewards on 3 consecutive
days, the required hold durations for the subsequent session were drawn from an exponential
distribution with a higher mean (1 s, 2 s, 3 s, 4 s, 5 s, 8 s, 10 s). Thus, during the final session of
VIH training, subjects were required to hold down the lever for intervals that averaged 10 s but
could be as long as 18.8 s.
Progressive hold-down testing.
Once all mice earned 40 rewards on VIH-10 for 5 consecutive days, they moved on to the
PHD task. In the PHD task, the first required hold duration was fixed, and the requirement for
subsequent hold durations was increased by a multiplicative amount. We used a PHD schedule
of (2.0 s × 1.13), meaning the first required hold duration was 2 s and was multiplied by 1.13 on
each trial thereafter. Thus, the requirement
in trial number (t) was (2.0 s × 1.13t-1) such that the first four requirements were 2.00 s, 2.26 s,
2.55 s, and 2.88 s, and then 6.00 s for the 10th trial, 20.4 s for the 20th trial, and so forth.
Sessions ended after 2 hr elapsed or following 15 min without a single lever press, whichever
came first.
Food Deprivation in the PHD Task
Subjects were tested on the PHD (2.0 s × 1.13) task under highand low-motivational
states through two different feeding conditions. In the high motivation condition, subject’s daily
access to food was restricted to maintain their bodyweight at either 85% of their ad libitum
baseline bodyweights. In the low-motivation condition, subjects were given 24 hr access to home
![Page 109: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/109.jpg)
94
cage chow to allow them to maintain 100% of their baseline bodyweight. Subjects were tested on
the PHD task in each condition for 3 consecutive days.
Reward Value in the PHD Task
Subjects were tested in the PHD (2.0 s × 1.13) schedule, and the reward consisted of a
sucrose solution of different concentrations on different days (i.e., 5%, 10%, 20%, and 40%
sucrose solutions). Subjects were first tested on consecutive days with the sucrose percentage
changing in ascending order (5% to 10% to 20% to 40%) from day to day and were subsequently
tested the following week on consecutive days with the sucrose percentage changing in a
descending order (40% to 20% to 10% to 5%). Data was averaged over the 2 days of testing at
each percentage. Sessions lasted 1 hr or until subjects failed to make a press for 15 min,
whichever came first.
Progressive Ratio
Subjects in the METH PR experiment were trained to press levers for milk rewards using
the procedure described by Drew and colleagues (2007). Once proficient at earning rewards,
subjects were rewarded for pressing according to a variable interval (VI) schedule. In a VI
schedule, no lever presses were reinforced until an uncertain interval had elapsed; the first press
following this interval was reinforced. The duration of each interval was drawnrandomly from an
exponential distribution following reinforcement. Subjects were trained on VI-3 (mean interval
duration of 3s) for 2 days, followed by VI-10 for 2 days, and VI-20 for 4 days before moving on
to PR testing.
In PR testing the lever was extended at the start of the session. Once the mouse made a
criterion number of lever presses, a reward was delivered. The criterion was set at four lever
presses for the first trial and was multiplied by 1.18 thereafter and rounded to the nearest integer.
![Page 110: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/110.jpg)
95
This is subsequently denoted as PR (4 × 1.18). Thus, the requirement at trial (t) was (4 × 1.18t-1;
i.e., 6 on Trial 5, 31 on Trial 15, 160 on Trial 25, and so forth). The session ended after 2 hr or
after 3 min had elapsed without a lever press.
Subjects were tested in the PR following intraperitoneal (IP) injections of vehicle for 3
days to establish a behavioral baseline. Next, methamphetamine hydrochloride (Sigma Aldrich,
St. Louis, Missouri) was dissolved in .9% saline and IP injected at 1.0 mg/kg in a volume of 0.01
ml/g prior to being tested on the PR schedule for 3 consecutive days. All injections were
performed 20 min before the start of the behavioral session.
Methamphetamine in the PHD Task
Subjects were tested on the same PHD (2.0 s × 1.13) schedule used in the other
experiments with one important difference: An inactive lever was extended in addition to the
normal active lever at the start of each trial. This inactive lever was the lever opposite to that
which each subject was trained to press, and responses made to it never yielded rewards. This
lever was included to measure nongoal-directed hyperactive responses. Subjects were tested on
the PHD task and received IP injections of vehicle for 4 days followed by 4 days of 1.0 mg/kg of
METH. The methamphetamine hydrochloride was prepared as described in the PR experiment.
All injections were performed 20 min before the start of the behavioral session.
Data Analysis
All data were analyzed using two-tailed Student t tests without assuming equal variance
or, where appropriate, repeated measure analysis of variance (ANOVA). In all experiments, data
were averaged across all days of a specific treatment type (e.g., vehicle or METH) with the
number of days provided in the figure legend. Planned comparisons are reported in the main text
and significant post hoc analyses are reported in the figure legends.
![Page 111: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/111.jpg)
96
Results
Measuring Motivation With the Progressive Hold Down Task
Baseline performance on the PHD task.
We tested subjects in a PHD task under various conditions to characterize the PHD task
as an assay of motivational behavior. Figure 2A shows performance from a representative
session of one individual subject in the PHD task. Subjects were able to hold the lever down for
longer durations, meeting the required hold duration set by the schedule on each trial. Successful
presses (held long enough to meet the required duration) resulted in the delivery of a reward.
Failed presses (those not held down long enough to meet the required hold duration for that trial)
tended to occur toward the end of the session prior to the point when the subject stops pressing
altogether.
Figure 2. Characterization of baseline behavior in the progressive hold-down (PHD) task. (A)
Representative performance of pressing for a single subject throughout a single PHD session.
Shows both successful presses (white) and unsuccessful presses (gray). (B–C) There is a positive
![Page 112: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/112.jpg)
97
linear relationship between the number of presses made on the correct lever (B) and session
duration (C) with the number of rewards earned. In (B–C), each point represents a single
subject’s average over 5 days of baseline progressive hold down testing.
Similar to behavior in the PR, the number of lever presses made in a PHD session is
related to the number of rewards a subject earns (see Figure 2B), showing a significant positive
correlation, r(11) = 0.697, p < .05. There is also a significant positive correlation between the
session duration and the number of rewards a subject earns (see Figure 2C), r(11) = 0.748, p < .05.
This implies that under baseline conditions the amount and the duration of goal-directed
behavior in this task are related to how motivated a subject is to earn rewards, as has been
demonstrated repeatedly for behavior in the PR.
Performance in the PHD task is modulated by level of food deprivation and reward value.
To validate that the PHD task was sensitive to differences in motivation, we manipulated
the level of two parameters known to alter motivated responding: level of food deprivation and
reward value. In the deprivation experiment, subjects performed the PHD task under a high
motivation condition (restricted feeding: subjects maintained at 85% of baseline body weight)
and low motivation condition (ad lib feeding: subjects maintained at 100% baseline body
weight). Food deprived subjects made more lever presses, t(11) = 7.037, p < .0001 (see Figure
3A), continued working for longer durations (Vehicle M = 62.8 min + 4.44; METH M = 112.5
min + 2.49), t(11) = 16.77, p < .0001, and consequently earned more rewards, t(11) = 9.826, p <
.0001 (see Figure 3B). To distinguish between hyperactive and goal-directed responding, we
looked at two different measures of the lever-holding behavior. To estimate the amount of goal-
directed responding, we calculated the mean duration of all of the holds that were greater than 2
s. We chose 2 s as the cutoff because it was the shortest duration requirement on the very first
trial, and presses shorter than this could not possibly result in the goal. This measure is strongly
![Page 113: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/113.jpg)
98
correlated with the number of rewards subjects earn, r(11) = 0.697, p < .05. There was a
significant increase in the mean of all holds greater than 2 s in the restricted feeding condition,
t(11) = 5.466, p = .0002 (see Figure 3C), reflecting an increased amount of goal-directed behavior.
To estimate the amount of hyperactive nongoal-directed responding a subject produced, we
calculated the total number of presses made which were less than 2 s in duration. There was a
small increase in the number of short < 2 s presses made in the restricted feeding group
compared with the ad lib feeding group, t(11) = 2.562, p = .0264 (see Figure 3D). Thus, food
restriction leads to a large increase in goal-directed behavior and a small increase in rapid,
hyperactive responding that does not earn rewards.
Figure 3. Food restriction effects on progressive hold-down (PHD) task performance. (A) Mean
(+ SEM) lever presses in the PHD task under ad lib and restricted feeding conditions. (B) Mean
(+ SEM) number of rewards earned in the PHD task under ad lib and restricted feeding
conditions. (C) Mean (+ SEM) duration (s) of all lever presses longer than 2 s under ad lib and
restricted feeding conditions. (D) Mean (+ SEM) number of lever presses made that were shorter
than 2 s under ad lib and restricted feeding conditions. In (A–D), each point represents a subjects
performance averaged over 3 days of ad lib and restricted feeding conditions. * p < 0.5. ** p <
.01.
To further examine the sensitivity of the PHD task to a subject’s motivation to obtain
rewards, we next tested subjects using sucrose solutions of different concentrations as the
reward. Pilot studies showed that mice prefer sucrose solutions much less than evaporated milk
![Page 114: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/114.jpg)
99
so test session of 1 hr were used in this experiment. Subjects were more motivated to work for
the higher reward value as they made more lever presses for higher percentage sucrose solutions,
F(3, 27) = 6.47, p = .015 (see Figure 4A), and earned more rewards, F(3, 27) = 20.2, p < .001 (see
Figure 4B), as subjects made holds of significantly longer durations when they were working for
higher sucrose concentrations. We again used the mean hold times of all presses longer than 2 s
to estimate goal-directedness and the number of presses less than 2 s to estimate hyperactivity or
arousal. There was a significant effect of reward value on the amount of goaldirected behavior,
F(3, 27) = 14.98, p < .001 (see Figure 4C), as the mean hold durations were longer for the higher
sucrose concentrations. The reward value did not have a significant effect on the number of short
presses (< 2 s) made, F(3, 27) = 0.612, p = .439 (see Figure 4D).
Figure 4. Reward value effects on progressive hold-down (PHD) task performance. (A) Mean (+
SEM) number of lever presses for different sucrose concentrations in the PHD task; Tukey’s HSD
test. (B) Mean (+ SEM) number of rewards earned for different sucrose concentrations in the
PHD task; Tukey’s HSD test. (C) Mean (+ SEM) duration (s) of all lever presses longer than 2 s
for different sucrose concentrations in the PHD task; Tukey’s HSD test. (D) Mean (+ SEM)
number of lever presses made which were shorter than 2 s for different sucrose concentrations in
the PHD task. In (A-D) each point represents a subjects performance averaged over 2 days of
PHD testing at each sucrose concentration. ** p < .01.
![Page 115: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/115.jpg)
100
Methamphetamine leads to greater persistence in the progressive ratio task.
We tested mice using a PR schedule of reinforcement to determine whether METH
would lead to an increase in performance similar to that reported for amphetamine in rats
(Olausson et al., 2006; Poncelet et al., 1983; Mayorga et al., 2000) and in the DAT KD mouse
(Cagniard et al., 2006). Figure 5A shows the progression of requirements in a PR (4 × 1.18),
indicating the number of presses required to earn each reward. Treatment with METH led to a
significant increase in the number or lever presses made, t(7) = 4.538, p = .0027 (see Figure 5B).
Treatment with METH also caused subjects to continue working for longer durations before
quitting (Vehicle M = 109.0 s + 5.67, METH M = 120.0 s + .001), t(7) = 2.504, p = .047. As a
result of making more lever presses and continuing to work for longer durations before quitting,
subjects earned significantly more rewards when given METH, as compared to vehicle (Vehicle:
M = 10.94 presses/min + 1.59; METH: M = 16.08 presses/min + 2.21), t(7) = 5.396, p < .0010.
Figure 5. Methamphetamine (METH) increases responding in a progressive ratio (PR). (A)
Relationship between number of presses required and reward number for the PR 4 × 1.18
schedule. (B) Mean (+ SEM) number of lever presses in the PR. (C) Mean (+ SEM) number of
rewards earned in the PR. (D) Mean (+ SEM) session duration (min) in the PR. For (B–D), each
![Page 116: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/116.jpg)
101
point represents a single subject’s performance averaged over 4 days of vehicle or METH
treatment. ** p < .01.
To analyze the effect that treatment with METH had on the rate or vigor of behavior in
the PR, we looked at the response rate (presses per min) and the duration of each single response
(time from the lever being pressed down to the time when it is let backup). METH led to an
increase in the response rate, t(7) = 3.084, p = .0177 (see Figure 5D), as well as a shorter average
response duration (Vehicle M = 0.72 s + 0.05; METH M = 0.53s + 0.05), t(7) = 3.390, p = .0095.
Thus, METH responses were made more often, and each response was executed more quickly.
Although both measures could indicate that METH may result in increased arousal or
hyperactive responding, this cannot be definitively determined with the PR as every response
counts toward the next reward and is considered goal directed. There was no significant
difference in the number of missed rewards (i.e., rewards that the subject failed to collect)
between vehicle and METH conditions (Vehicle M = 0.54 + 0.19; METH M = 0.45 + 0.14), t(7) =
0.290, p = .779, suggesting that METH treatment did not interfere with subjects motivation to
collect the rewards.
Methamphetamine leads to inefficient performance in the PHD task.
Having confirmed that the PHD task is sensitive to changes in a subject’s motivational
state, we next tested mice following treatment with METH to determine whether subjects would
work harder and longer, as was shown in PR testing. Subjects were tested 20 min after receiving
an IP injection of 1 mg/kg of METH, a dose known to induce a robust increase in activity (Hall
et al., 2008). In addition, we included a non-reinforced “inactive lever” to provide a measure of
activity that was not related to the goal of the task.
Figure 6A depicts the change in PHD task performance following administration of
METH, using a representative press record of an individual subject during vehicle and METH
treatment sessions. The number of lever presses made on the active lever was significantly
![Page 117: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/117.jpg)
102
higher on METH, t(12) = 9.175, p < .0001 (see Figure 6B). Treatment with METH also lead to a
significant increase in how long subjects continued working (Vehicle M = 62.8 min + 4.44;
METH M = 112.5 min + 2.49), t(12) = 6.743, p < .0001, as 75% of the sessions on METH went
the full 2 hr compared with 11% during vehicle treatment. Despite increasing the number of
presses on the active lever and the length of time that subjects continued pressing, mice did not
earn significantly more rewards during the METH sessions, t(12) = 1.647, p = .1254 (see Figure
6C). Moreover, there was not a significant difference in the average hold duration of presses
greater than 2 s, t(12) = 0.4647, p = .6505 (see Figure 6D), suggesting that METH treatment did
not enhance goal-directed behavior.
Figure 6. Methamphetamine (METH) increases amount lever pressing in the progressive hold-
down (PHD) task. (A) Shows a representative press record of the first 100 presses for a single
subject tested on the PHD task following treatment with vehicle (white) and METH (grey). (B)
Mean (+ SEM) number of lever presses on the correct lever. (C) Mean (+ SEM), number of
![Page 118: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/118.jpg)
103
rewards earned. (D) Mean (+ SEM) duration (s) of all lever presses longer than 2 s. (E–F) There
is a significant positive relationship between the number of presses and the number of rewards
earned during vehicle treatment (E), whereas there is a negative relationship between these
measures during METH treatment (F). (G) Shows the distribution of the durations of
unsuccessful holds, Mean (+ SEM). (G) Mean (+ SEM) number of lever presses made which
were shorter than 2 s. In (B–E, G), each point represents a single subject’s performance averaged
across 4 days of each treatment condition. ** p < .01.
During baseline vehicle treatment there was a significant positive relationship between
the number of lever presses made on the active lever and the number of rewards earned, r(12) =
0.585, p = .05 (see Figure 6E). During METH treatment, however, there was a nonsignificant
negative relationship between the number of lever presses made and the number of rewards
earned, r(12) = 0.395, p = .10 (see Figure 6F), as there was a significant increase in the total
number of failed press attempts on the correct lever (Vehicle M = 30.9 + 1.77; METH M =
167.26 + 10.2), t(12) = 10.16, p < 0.001. This is evident from the distribution of the duration of
unsuccessful press attempts (see Figure 6G), there was a significant increase in the number of
presses made which were shorter than 2 s in duration during METH treatment, t(12) = 6.903, p <
.0001 (see Figure 6H), suggestive of an increase in hyperactive responding.
Behavioral Changes in Progressive Hold-Down Task Induced by Methamphetamine Are
the Result of Increased Arousal
We next analyzed how the increase in hyperactive presses following METH treatment
affected various measures of performance in the PHD task. Treatment with METH led to a
significant increase in the rate of lever pressing (Vehicle: M = 1.18 responses/min + 0.08;
METH: M = 2.10 + 0.15), t(12) = 6.503, p < .0001. We computed each subject’s efficiency,
defined as the proportion of lever presses made that were rewarded, from the first press to the
last rewarded press. Following METH treatment, there was a significant decrease in the
efficiency of responding, t(12) = 10.55, p < .0001 (see Figure 7A). This decrease in efficiency can
be seen from the beginning of the session, as there is a decrease in efficiency at every single hold
![Page 119: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/119.jpg)
104
requirement (see Figure 7B). Additionally, as a consequence of the increased hyperactive
responding on METH, it took subjects longer to complete each hold requirement, even the short
ones at the very beginning of the session (see Figure 7C). Thus, METH appears to increase
arousal and hyperactive responding at the expense of efficient, goal-directed action.
Figure 7. Methamphetamine (METH) increases hyperactive responses in the progressive hold-down
(PHD) task. (A) The mean (+ SEM) efficiency (proportion of successful responses on the active lever
until the last rewarded press). (B) Mean efficiency as a function of the required hold duration (s) in the
PHD session. (C) Mean time (s) it takes subjects to complete a required hold duration (s). (D–E) The
mean cumulative number of failed presses on the active and inactive lever as a function of the trial
number during vehicle (D) and METH (E) treatment. (F) The mean (+ SEM) difference in the number of
trials between the first failed press on the active lever and the first press on the inactive lever. In (A, F),
each point represents a single subject’s performance averaged across the 4 days of each treatment
condition. In (B–C), each point represents the treatment condition average of all subjects at each given
hold requirement. In (D–E), each point represents the treatment condition average of all subjects for each
trial number. ** p < .01.
![Page 120: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/120.jpg)
105
We additionally looked at the number of lever presses made on the inactive lever as this
is thought to measure hyperactive nongoal-directed responding. Although the average number of
inactive presses was higher during METH treatment (Vehicle M = 11.46 + 7.4; METH M =
35.46 + 20.7), this measure was strongly skewed such that most subjects made few presses on
the inactive lever and a few subjects made many. A Mann–Whitney test of a difference in the
medians did not detect a significantly difference in the number of inactive lever presses made, U
(12) = 52.50, p = .106. Further evidence that METH did not increase goal-directed action comes
from the within session pattern of responding on the inactive lever. As can be seen in Figure
7(D–E), the number of presses on the inactive lever did not begin to increase until there was a
rise in the number of failed press attempts on the active lever in both vehicle (see Figure 7D) and
METH (see Figure 7E) conditions. Previous studies suggest that when subjects are no longer
rewarded for goal-directed action (e.g., during extinction), behavior becomes more variable, and
subjects make responses that were not previously reinforced (Rick et al., 2006; Neuringer et al.,
2001; Antonitis, 1951). Our data show that subjects wait to switch to the inactive lever only after
they have exceeded a certain number of failed goal-directed attempts. It is interesting to note that
there was no difference in the number of failed, long goal-directed attempts between vehicle and
METH, t(12) = 0.1779, p = .8617 (Figure 7F), which further suggests that METH does not change
goal-directed motivation because subjects are not treating the short hyperactive responses in the
same way they treat failed goal-directed attempts.
Discussion
Motivation has long been known to consist of several underlying processes, two
important ones being a directional process, steering behavior toward a specific goal, and an
activation process, providing energy and vigor to behavior by increasing arousal. Our novel
![Page 121: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/121.jpg)
106
experimental strategy elucidates which of these two processes is altered by evaluating subjects in
both the PR and the novel PHD task. In the PHD task, we demonstrate that established
motivation manipulations of food deprivation and reward value lead to an increase in goal-
directed behavior, but have little effect on hyperactive nonrewarded responding. To test the
efficacy of our novel strategy, we use a dose of a drug known to increase overall levels of
general locomotor activity (1.0 mg/kg of METH) as a tool to compare the behavioral profiles of
mice when tested on the PR and PHD task. In the PR, treatment with METH leads to increases in
the number or responses and amount of time subjects continue working in the task. In the PHD
task, treatment with METH leads to a large increase in responding, making overall performance
less efficient but neither increases nor impairs goal-directed motivation. As elaborated in the
following paragraphs, the results of the current experiment demonstrate the efficacy of using the
PR and PHD tasks jointly to differentiate the components of motivation.
The PHD Task Measures Both Goal-Directed Behavior and Hyperactive Responding
One of the main reasons that the PR has become a popular measure of motivated
behavior is that the number of lever presses made and the amount of time spent working are
directly related to the number of rewards earned, giving the task substantial face validity
(Bradshaw & Killeen, 2012). Under baseline conditions in the PHD task, the number of presses
subjects make and the amount of time they spend working are also strongly positively correlated
with the number of rewards earned. The main distinction between the PR and PHD tasks,
however, is that these three measures are not always proxies for one another. In the PHD task, if
a subject keeps prolonging the duration of their lever presses, then the relationships will remain.
If, however, subjects make presses of a short duration, as one would expect in the case of high
![Page 122: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/122.jpg)
107
arousal or hyperactivity, this dilutes the relationship between the number of presses made, the
amount of time spent working, and the number of rewards earned.
We found the PHD task to be sensitive to both changes in goal-directed responding as
well as high arousal hyperactive responding by examining the effects of manipulating the value
of rewards in two different ways. First, we tested mice on a restricted diet, in which they are
more motivated to earn food rewards. Second, we increased the value of the rewards that could
be earned in the task. We show that being on a restricted diet resulted in increased rewards
earned, amount of lever pressing, and time spent working. It is important to note that this
manipulation also led to a large increase in the maximal hold durations, as would be expected
with increased amounts of goal-directed behavior in this task. There was a small increase in the
number of short duration responses made, but this is not surprising as food deprived subjects do
show elevated levels of arousal compared with ad lib maintained subjects (Harrison & Archer,
1987; Heiderstadt et al., 2000). Similarly, subjects are sensitive to the value of the reward, and
subjects made more presses and held the lever down longer for higher concentrations of a
sucrose reward. Increasing reward value did not, however, lead to an increase in the amount of
short duration (hyperactive) responses made. Thus, performance in the PHD task reflects the
goal-directed motivation of subjects.
We further demonstrated the sensitivity of PHD task to changes in general arousal–
hyperactivity by examining the effects of a dose of METH known to increase this aspect of
motivation. Administration of METH in PR led to an increased number of lever presses, number
of rewards earned, and total duration of time spent working for food rewards. However, the
effect of METH on PHD performance suggests that the increases observed in the PR are driven
mainly by the activational process of motivation rather than the directional process. As in the PR,
![Page 123: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/123.jpg)
108
METH led to a greater number of responses, but the main increase in behavior seen with METH
was with presses of short durations (< 2 s), which led to inefficient PHD performance. In contrast
to PR results, METH did not result in subjects earning an increased number of rewards in the
PHD task. These results are consistent with several studies documenting METH’s ability to
increase arousal across a variety of behavioral measures (Cruickshank & Dyer, 2009; Estabrooke
et al., 2001; Hart et al., 2008) and suggests that this dose of METH does not lead to increased
goal-directed motivation. Please note that this study was not intended to be a comprehensive
examination of the psychopharmacology of methamphetamine. Rather, it is intended to be a
demonstration of the additional nuanced information one can gain by using the PHD in
conjunction with the PR. We only examined one dose that we knew would increase general
activity. Thus, whether and how amphetamines might affect motivation at different doses is a
remaining question that future studies could address.
Implications for Studying the Neurobiology of Motivation
The present results indicate that we cannot rely on any single behavioral assay to study
complex behavioral processes like motivation. An increase in responding on the PR may reflect
increased goal-directed motivation in studies that employed a wide range of techniques,
including behavioral manipulations (Barr & Phillips, 1999; Bowman & Brown, 1998; Ferguson
& Paule, 1997; Hodos, 1961; Hodos, & Kalman, 1963), pharmacological manipulations
(Aberman et al., 1998; Randall, 2012; Simpson et al., 2011), and genetic manipulations
(Cagniard et al., 2006; Drew et al., 2007; Gore & Zweifel, 2013; Sanders et al., 2007; Trifilieff et
al., 2013). The current results suggest that these different manipulations may affect motivated
behavior by affecting different underlying processes.
![Page 124: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/124.jpg)
109
Use of both the PR and the PHD tasks allows understanding of how drug or genetic
manipulations influence the processes underlying motivated behavior and reduces the risk
drawing incorrect conclusions based on a single assay. By understanding the results of
manipulations across these two tasks, a deeper understanding can be achieved. For example, if
either improved or impaired performance occurred in both tasks it would strongly suggest an
increase or decrease in goal-directed motivation, respectively. In contrast, increased performance
in the PR and impaired or unaltered performance in the PHD task (as demonstrated with METH)
is indicative of increased arousal or hyperkinesia. Moreover, no difference in PR and an increase
in PHD might reflect intact goal-directed motivation and impaired arousal, whereas a decrease in
PR performance and an increase in PHD performance may reflect psychomotor slowing or
bradykinesia, either of which would facilitate holding behavior but disrupt continuous and fluid
initiation of the behaviors required for success in the PR. Finally, we note that improvements in
the ability to measure complex behavior in laboratory animals may have a large impact on the
development of new treatments for psychiatric disease. Impaired goal-directed motivation or
apathy is a problematic symptom common to several psychiatric diseases, including
schizophrenia (Kiang, Christensen, Remington, & Kapur, 2003; Roth, Flashman, Saykin,
McAllister, & Vidaver 2004; Faerden et al., 2009) and some affective disorders (Feil, Razani,
Boone, & Lesser, 2003; Marin, Razani, Boone, & Lesser, 2003). There are, however, currently
no effective treatments for this aspect of impaired motivation (Chase, 2011; Levy & Czernecki,
2006), representing a major gap in the current treatment repertoire. The methods developed here
enable the identification of mechanisms and factors that specifically enhance goal-directed
responding in preclinical research.
![Page 125: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/125.jpg)
110
References
Aberman, J. E., Ward, S. J., & Salamone, J. D. (1998). Effects of dopamine antagonists and
accumbens dopamine depletions on time-constrained progressive-ratio performance.
Pharmacology, Biochemistry, and Behavior, 61, 341–348.
http://dx.doi.org/10.1016/S0091-3057(98)00112-9
Anaclet, C., Parmentier, R., Ouk, K., Guidon, G., Buda, C., Sastre, J.-P., . . . Lin, J.-S. (2009).
Orexin/hypocretin and histamine: Distinct roles in the control of wakefulness
demonstrated using knock-out mouse models. The Journal of Neuroscience, 29, 14423–
14438. http://dx.doi.org/ 10.1523/JNEUROSCI.2604-09.2009
Antoniou, K., Papadopoulou-Daifoti, Z., Hyphantis, T., Papathanasiou, G., Bekris, E., Marselos,
M., Ferré, S. (2005). A detailed behavioral analysis of the acute motor effects of
caffeine in the rat: Involvement of adenosine A1 and A2A receptors.
Psychopharmacology, 183, 154–162. http://dx.doi.org/10.1007/s00213-005-0173-6
Antonitis, J. J. (1951). Response variability in the white rat during conditioning, extinction, and
reconditioning. Journal of Experimental Psychology, 42, 273–281.
http://dx.doi.org/10.1037/h0060407
Barr, A. M., & Phillips, A. G. (1999). Withdrawal following repeated exposure to d-
amphetamine decreases responding for a sucrose solution as measured by a progressive
ratio schedule of reinforcement. Psychopharmacology, 141, 99–106.
http://dx.doi.org/10.1007/s002130050812
Berridge, C. W. (2006). Neural substrates of psychostimulant-induced arousal.
Neuropsychopharmacology, 31, 2332–2340. http://dx.doi.org/ 10.1038/sj.npp.1301159
Berridge, K. C., & Robinson, T. E. (2003). Parsing reward. Trends in Neurosciences, 26, 507–
513. http://dx.doi.org/10.1016/S0166-2236 (03)00233-9
Bowman, E. M., & Brown, V. J. (1998). Effects of excitotoxic lesions of the rat ventral striatum
on the perception of reward cost. Experimental Brain Research, 123, 439 – 448.
http://dx.doi.org/10.1007/ s002210050588
Bradshaw, C. M., & Killeen, P. R. (2012). A theory of behaviour on progressive ratio schedules,
with applications in behavioural pharmacology. Psychopharmacology, 222, 549–564.
http://dx.doi.org/10.1007/ s00213-012-2771-4
Cagniard, B., Balsam, P. D., Brunner, D., & Zhuang, X. (2006). Mice with chronically elevated
dopamine exhibit enhanced motivation, but not learning, for a food reward.
Neuropsychopharmacology, 31, 1362–1370. http://dx.doi.org/10.1038/sj.npp.1300966
![Page 126: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/126.jpg)
111
Chase, T. N. (2011). Apathy in neuropsychiatric disease: Diagnosis, pathophysiology, and
treatment. Neurotoxicity Research, 19, 266–278.
Cruickshank, C. C., & Dyer, K. R. (2009). A review of the clinical pharmacology of
methamphetamine. Addiction, 104, 1085–1099. http:// dx.doi.org/10.1111/j.1360-
0443.2009.02564.x
Demyttenaere, K., De Fruyt, J., & Stahl, S. M. (2005). The many faces of fatigue in major
depressive disorder. International Journal of Neuropsychopharmacology, 8, 93, –105.
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S., . . .
Balsam, P. D. (2007). Transient overexpression of striatal D2 receptors impairs operant
motivation and interval timing. The Journal of Neuroscience, 27, 7731–7739.
http://dx.doi.org/10.1523/ JNEUROSCI.1736-07.2007
Duffy, E. (1957). The psychological significance of the concept of arousal or activation.
Psychological Review, 64, 265–275. http://dx.doi.org/ 10.1037/h0048837
Estabrooke, I. V., McCarthy, M. T., Ko, E., Chou, T. C., Chemelli, R. M., Yanagisawa, M., . . .
Scammell, T. E. (2001). Fos expression in orexin neurons varies with behavioral state.
The Journal of Neuroscience, 21, 1656–1662.
Faerden, A., Vaskinn, A., Finset, A., Agartz, I., Ann Barrett, E., Friis, S., . . . Melle, I. (2009).
Apathy is associated with executive functioning in first episode psychosis. BMC
Psychiatry, 9, 1. http://dx.doi.org/10.1186/ 1471-244X-9-1
Feil, D., Razani, J., Boone, K., & Lesser, I. (2003). Apathy and cognitive performance in older
adults with depression. International Journal of Geriatric Psychiatry, 18, 479–485.
http://dx.doi.org/10.1002/gps.869
Ferguson, S. A., & Paule, M. G. (1997). Progressive ratio performance varies with body weight
in rats. Behavioural Processes, 40, 177–182. http://dx.doi.org/10.1016/S0376-
6357(97)00786-9
Gore, B. B., & Zweifel, L. S. (2013). Genetic reconstruction of dopamine D1 receptor signaling
in the nucleus accumbens facilitates natural and drug reward responses. The Journal of
Neuroscience, 33, 8640–8649. http://dx.doi.org/10.1523/JNEUROSCI.5532-12.2013
Hall, D. A., Stanis, J. J., Marquez Avila, H., & Gulley, J. M. (2008). A comparison of
amphetamine- and methamphetamine-induced locomotor activity in rats: Evidence for
qualitative differences in behavior. Psychopharmacology, 195, 469 – 478.
http://dx.doi.org/10.1007/s00213- 007-0923-8
Harrison, D. E., & Archer, J. R. (1987). Genetic differences in effects of food restriction on
aging in mice. The Journal of Nutrition, 117, 376– 382.
![Page 127: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/127.jpg)
112
Hart, C. L., Gunderson, E. W., Perez, A., Kirkpatrick, M. G., Thurmond, A., Comer, S. D., &
Foltin, R. W. (2008). Acute physiological and behavioral effects of intranasal
methamphetamine in humans. Neuropsychopharmacology, 33, 1847–1855.
http://dx.doi.org/10.1038/sj.npp .1301578
Hebb, D. O. (1955). Drives and the C.N.S. (conceptual nervous system). Psychological Review,
62, 243–254. http://dx.doi.org/10.1037/h0041823
Heiderstadt, K. M., McLaughlin, R. M., Wright, D. C., Walker, S. E., & Gomez-Sanchez, C. E.
(2000). The effect of chronic food and water restriction on open-field behaviour and
serum corticosterone levels in rats. Laboratory Animals, 34, 20 –28.
http://dx.doi.org/10.1258/ 002367700780578028
Hodos, W. (1961). Progressive ratio as a measure of reward strength. Science, 134, 943–944.
http://dx.doi.org/10.1126/science.134.3483.943
Hodos, W., & Kalman, G. (1963). Effects of increment size and reinforce volume on progressive
ratio performance. Journal of the Experimental Analysis of Behavior, 6, 387–392.
http://dx.doi.org/10.1901/jeab.1963 .6-387
Kellendonk, C., Simpson, E. H., Polan, H. J., Malleret, G., Vronskaya, S., Winiger, V., . . .
Kandel, E. R. (2006). Transient and selective overexpression of dopamine D2 receptors
in the striatum causes persistent abnormalities in prefrontal cortex functioning. Neuron,
49, 603–615. http://dx.doi.org/10.1016/j.neuron.2006.01.023
Kelley, A. E. (2004). Ventral striatal control of appetitive motivation: Role in ingestive behavior
and reward-related learning. Neuroscience and Biobehavioral Reviews, 27, 765–776.
http://dx.doi.org/10.1016/j .neubiorev.2003.11.015
Kiang, M., Christensen, B. K., Remington, G., & Kapur, S. (2003). Apathy in schizophrenia:
Clinical correlates and association with functional outcome. Schizophrenia Research, 63,
79 – 88. http://dx.doi.org/ 10.1016/S0920-9964(02)00433-4
Kim, H., Lee, D., & Jung, M. W. (2013). Signals for previous goal choice persist in the
dorsomedial, but not dorsolateral striatum of rats. The Journal of Neuroscience, 33, 52–
63. http://dx.doi.org/10.1523/ JNEUROSCI.2422-12.2013
Kimchi, E. Y., & Laubach, M. (2009). Dynamic encoding of action selection by the medial
striatum. The Journal of Neuroscience, 29, 3148–3159.
http://dx.doi.org/10.1523/JNEUROSCI.5206-08.2009
Levy, R., & Czernecki, V. (2006). Apathy and the basal ganglia. Journal of Neurology,
253(Suppl. 7), VII54 –VII61. http://dx.doi.org/10.1007/ s00415-006-7012-5
![Page 128: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/128.jpg)
113
Marin, R. S., Butters, M. A., Mulsant, B. H., Pollock, B. G., & Reynolds, C. F., III. (2003).
Apathy and executive function in depressed elderly. Journal of Geriatric Psychiatry and
Neurology, 16, 112–116. http://dx .doi.org/10.1177/0891988703016002009
Mayorga, A. J., Popke, E. J., Fogle, C. M., & Paule, M. G. (2000). Similar effects of
amphetamine and methylphenidate on the performance of complex operant tasks in rats.
Behavioural Brain Research, 109, 59–68. http://dx.doi.org/10.1016/S0166-
4328(99)00165-5
McNamara, C. G., Davidson, E. S., & Schenk, S. (1993). A comparison of the motor-activating
effects of acute and chronic exposure to amphetamine and methylphenidate.
Pharmacology, Biochemistry, and Behavior, 45, 729–732.
http://dx.doi.org/10.1016/0091-3057(93)90532-X
Neuringer, A., Kornell, N., & Olufs, M. (2001). Stability and variability in extinction. Journal of
Experimental Psychology: Animal Behavior Processes, 27, 79–94.
http://dx.doi.org/10.1037/0097-7403.27.1.79
Olausson, P., Jentsch, J. D., Tronson, N., Neve, R. L., Nestler, E. J., & Taylor, J. R. (2006).
DeltaFosB in the nucleus accumbens regulates food-reinforced instrumental behavior and
motivation. The Journal of Neuroscience, 26, 9196–9204.
http://dx.doi.org/10.1523/JNEUROSCI .1124-06.2006
Pfaff, D. W., Martin, E. M., & Faber, D. (2012). Origins of arousal: Roles for medullary reticular
neurons. Trends in Neurosciences, 35, 468–476.
http://dx.doi.org/10.1016/j.tins.2012.04.008
Poncelet, M., Chermat, R., Soubrie, P., & Simon, P. (1983). The progressive ratio schedule as a
model for studying the psychomotor stimulant activity of drugs in the rat.
Psychopharmacology, 80, 184–189. http:// dx.doi.org/10.1007/BF00427967
Randall, P. A., Pardo, M., Nunes, E. J., López Cruz, L., Vemuri, V. K., Makriyannis, A., . . .
Salamone, J. D. (2012). Dopaminergic modulation of effort-related choice behavior as
assessed by a progressive ratio chow feeding choice task: Pharmacological studies and
the role of individual differences. PLoS ONE, 7, e47934.
http://dx.doi.org/10.1371/journal .pone.0047934
Rick, J. H., Horvitz, J. C., & Balsam, P. D. (2006). Dopamine receptor blockade and extinction
differentially affect behavioral variability. Behavioral Neuroscience, 120, 488–492.
http://dx.doi.org/10.1037/0735- 7044.120.2.488
Roth, R. M., Flashman, L. A., Saykin, A. J., McAllister, T. W., & Vidaver, R. (2004). Apathy in
schizophrenia: Reduced frontal lobe volume and neuropsychological deficits. The
American Journal of Psychiatry, 161, 157–159.
http://dx.doi.org/10.1176/appi.ajp.161.1.157
![Page 129: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/129.jpg)
114
Salamone, J. D. (1988). Dopaminergic involvement in activational aspects of motivation—
Effects of haloperidol on schedule-induced activity, feeding, and foraging in rats.
Psychobiology, 16, 196–206.
Salamone, J. D., & Correa, M. (2002). Motivational views of reinforcement: Implications for
understanding the behavioral functions of nucleus accumbens dopamine. Behavioural
Brain Research, 137, 3–25. http:// dx.doi.org/10.1016/S0166-4328(02)00282-6
Salamone, J. D., Correa, M., Mingote, S. M., Weber, S. M., & Farrar, A. M. (2006). Nucleus
accumbens dopamine and the forebrain circuitry involved in behavioral activation and
effort-related decision making: Implications for understanding anergia and psychomotor
slowing in depression. Current Psychiatry Reviews, 2, 267–280. http://dx.doi.org/
10.2174/157340006776875914
Sanders, A. C., Hussain, A. J., Hen, R., & Zhuang, X. (2007). Chronic blockade or constitutive
deletion of the serotonin transporter reduces operant responding for food reward.
Neuropsychopharmacology, 32, 2321–2329. http://dx.doi.org/10.1038/sj.npp.1301368
Simón, V. M., Parra, A., Miñarro, J., Arenas, M. C., Vinader-Caerols, C., & Aguilar, M. A.
(2000). Predicting how equipotent doses of chlorpromazine, haloperidol, sulpiride,
raclopride and clozapine reduce locomotor activity in mice. European
Neuropsychopharmacology, 10, 159–164. http://dx.doi.org/10.1016/S0924-
977X(00)00070-5
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S., . . .
Balsam, P. D. (2011). Pharmacologic rescue of motivational deficit in an animal model of
the negative symptoms of schizophrenia. Biological Psychiatry, 69, 928–935.
http://dx.doi.org/ 10.1016/j.biopsych.2011.01.012
Stahl, S. M. (2002). Does depression hurt? The Journal of Clinical Psychiatry, 63, 273–274.
http://dx.doi.org/10.4088/JCP.v63n0401 Treadway, M. T., & Zald, D. H. (2011).
Reconsidering anhedonia in depression: Lessons from translational neuroscience.
Neuroscience and Biobehavioral Reviews, 35, 537–555.
Trifilieff, P., Feng, B., Urizar, E., Winiger, V., Ward, R. D., Taylor, K. M., . . . Javitch, J. A.
(2013). Increasing dopamine D2 receptor expression in the adult nucleus accumbens
enhances motivation. Molecular Psychiatry, 18, 1025–1033.
Tylee, A., Gastpar, M., Lépine, J. P., & Mendlewicz, J., & the DEPRES Steering Committee.
(1999). DEPRES II (Depression Research in European Society II): A patient survey of
the symptoms, disability and current management of depression in the community.
International Clinical Psychology, 14, 139–151.
![Page 130: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/130.jpg)
115
Zhang, M., Balmadrid, C., & Kelley, A. E. (2003). Nucleus accumbens opioid, GABaergic, and
dopaminergic modulation of palatable food motivation: Contrasting effects revealed by a
progressive ratio study in the rat. Behavioral Neuroscience, 117, 202–211.
Zhuang, X., Oosting, R. S., Jones, S. R., Gainetdinov, R. R., Miller, G. W., Caron, M. G., &
Hen, R. (2001). Hyperactivity and impaired response habituation in hyperdopaminergic
mice. Proceedings of the National Academy of Sciences of the United States of America,
98, 1982–1987. http://dx.doi.org/10.1073/pnas.98.4.1982
![Page 131: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/131.jpg)
116
Chapter 4
Novel Strategies for Isolating the Effects of Effort and Value on Cost-Benefit
Decision Making in Mice
Abstract
The ability to engage in cost-benefit decision making is essential for organisms to adapt
to the ever-changing conditions they encounter within their environment. Because we cannot
directly measure an animal’s perception of different levels of effort or alternative reward values,
we must rely on making inferences about these experiences based on behavior in different
conditions. Many of the behavioral measures currently used to study cost-benefit decision
making simultaneously alter effort and value variables, making it challenging to identify which
specific aspect of the cost-benefit decision making process is affected: (1) estimating the effort
required in future actions, (2) estimating value of future outcomes, or (3) comparing estimates of
future effort with future value. Here, we characterize two tasks which are specifically designed to
isolate the impacts of effort and value manipulations on cost-benefit decision making,
independently of one another. In the Concurrent Effort Choice (CEC) task, we show that mice
are highly sensitive to the effort manipulations of number of responses and duration of a
response, and can use expectations about these costs to guide their behavior when given a
choice. Importantly, reward value is held constant in this task, so differences in behavior must be
related to the effort manipulations. In another task, called the Concurrent Value Choice (CVC)
task, we show that mice are highly sensitive to manipulations of reward value, such as
concentration of a sucrose solution (5% vs 20%), and devaluation of an outcome. Mice readily
use value information to guide decision making when given a choice, and we measure how
![Page 132: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/132.jpg)
117
reward value modulates the vigor of action. Together, these tasks will allow for a more sensitive
ability to examine cost-benefit decision making in laboratory rodents.
Introduction
The ability to process information about the consequences of our actions and the
outcomes they produce allows humans (Croxson, Walton, O'Reilly, Behrens, & Rushworth,
2009) and other animals (Atalayer & Rowland, 2009; Collier & Johnson, 1997) to engage in
cost-benefit decision making. Performing cost-benefit computations is a critical function of the
central nervous system, which enables organisms to behave in adaptable and flexible ways when
both internal state and environmental conditions change. In order to engage in cost-benefit
analyses, organisms must be able to perform three critical processes: (1) accurately estimate the
effort required to obtain a goal, (2) accurately estimate the value or benefit of the goal to be
obtained, and (3) compare the estimated effort with the estimated value to determine if the
outcome is worth pursuing (Rangel & Hare, 2010; Simpson & Balsam, 2016). A major challenge
in studying such processes in experimental animals results from not being able to directly
measure how effortful an animal perceives an action to be, or how valued one outcome is relative
to another. Investigators must instead make inferences about these things based on observable
behavior under different conditions which subjects are exposed to in behavioral tasks.
Decades of basic research in laboratory rodents have shown that animals are sensitive to
various types of costs, including number (McDowell, 2013; Mechner, 1958; Platt & Johnson,
1971), time (Balsam, Drew, & Gallistel, 2010; Balsam & Gallistel, 2009; Dews, 1978), and the
opportunity costs associated with the uncertainty of an actions outcome, or risk (Winstanley &
Floresco, 2016). Many studies have specifically focused on animals’ sensitivity to different effort
demands, reviewed in (Bailey, Simpson, & Balsam, 2016), such as the number of responses
![Page 133: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/133.jpg)
118
required in different reinforcement schedules (Baron & Derenne, 2000; Covarrubias & Aparicio,
2008; Floresco, Tse, & Ghods-Sharifi, 2008; Hodos, 1961; P. R. Killeen, Posadas-Sanchez,
Johansen, & Thrailkill, 2009), the height of a barrier a subjects must climb (Hauber & Sommer,
2009; J. D. Salamone, Cousins, & Bucher, 1994; Schweimer & Hauber, 2005), and the force
which must be exerted to press a lever (Fouriezos, Bielajew, & Pagotto, 1990; Fowler,
Morgenst.C, & Notterma.Jm, 1972; Ishiwari, Weber, Mingote, Correa, & Salamone, 2004;
Notterman & Mintz, 1965).
In addition to the work on different effort costs, many studies have examined sensitivity
to different kinds of benefits. These include reward value manipulations of magnitude (i.e.
number of pellets, volume of a sucrose solution), the concentration of a sweet solution (Flaherty,
Turovsky, & Krauss, 1994; Glendinning, Gresack, & Spector, 2002) and qualitatively different
outcomes (Flaherty, 1982). Reward value manipulations modulates aspects of instrumental
behavior for future outcomes as well, as expected reward value influences how long a subject
continues to work for an outcome (Bailey et al., 2015; Covarrubias & Aparicio, 2008; Floresco et
al., 2008; Skjoldager, Pierre, & Mittleman, 1993), as well as the rate/vigor of responding (Hutsell
& Newland, 2013).
These studies represent only a fraction of the body of work demonstrating sensitivity to
changes in effort requirements and reward value. There has been a large amount of progress
made studying cost-benefit decision making in laboratory rodents, reviewed in (Bailey et al.,
2016; Miller, Thome, & Cowen, 2013; Winstanley & Floresco, 2016). In many of the studies
conducted to date, however, behavioral test are designed such that effort and reward variables are
simultaneously manipulated (Gold, Waltz, & Frank, 2015). By using behavioral tasks that isolate
and manipulate the variables of effort and value independently of one another, one can more
![Page 134: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/134.jpg)
119
directly assess an animal’s capacity to (1) accurately estimate future effort requirements and (2)
accurately estimate the value of future outcomes.
In the present set of studies, we developed two independent behavioral measures to study
cost-benefit decision making. The first task, known as the Concurrent Effort Choice (CEC) task,
manipulates only the effort requirement while holding the value variable constant. This measures
how sensitively subjects can sense or estimate different effort levels to guide decision making.
This task is conceptually similar to previously measures which have been used to study effort-
related choice, including a Concurrent Lever Press/Free Choice Consumption task (Cousins &
Salamone, 1994; J. D. Salamone et al., 1991), a the T-arm Barrier Maze (J. D. Salamone et al.,
1994), and an operant “effort-discounting” task (Floresco et al., 2008; Shafiei, Gray, Viau, &
Floresco, 2012). All of these tasks provide a choice between a High Effort → Large Reward, and
a Low Effort → Small Reward response option. The key difference with the CEC task is that it
parametrically alters 2 work options, but always has subjects working for the same reward, as
opposed to choosing between a high and low reward option (see table 1).
Table 4.1. Effort and Value Variables Being Manipulated in Effort-Related Choice Tasks.
Task Effort Choice Value Choice Effort Value High Low High Low Change? By Change? By
T-Arm Barrier Maze Climb Walk 4 pellets 2 pellets Yes Climb/ No
Climb
Yes # Pellets
Concurrent Lever
Press/Free Chow
Consumption
5 Lever
Presses
0 Lever
Presses
Pellet Chow Yes Press/ No
Press
Yes Type of
food
Operant Effort
Discounting
2-20 Lever
Presses
1 Lever
Press
4 Pellets 1 Pellet Yes # of
Presses
Yes # of Pellets
The second task, known as the Concurrent Value Choice (CVC) task, manipulates the value
variable, while controlling for the variable of effort. This measures how sensitively subjects can
sense or estimate different value levels to guide decision making. In the present set of
![Page 135: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/135.jpg)
120
experiments, we perform a detailed analysis of behavior in the CEC and CVC tasks, to
characterize this approach to studying cost-benefit decision making in animal models.
Methods
Subjects
Experiments used male C57BL/6J mice (The Jackson Laboratory, Bar Harbor, ME, USA)
which were 10 weeks old at the start of the experiments. All subjects were maintained at 85% of
their ad libitum bodyweight in order to motivate them to work for food rewards in operant
procedures. All experiments and animal care protocols were in accordance with the Columbia
University and NYSPI Institutional Animal Care and Use Committees and Animal Welfare
regulations. The number of subjects used is indicated in the description of each experiment
below.
Apparatus
Experimental chambers (ENV-307w; Med Associates, St. Albans, VT) equipped with
liquid dippers and pellet dispensers were used in the experiment. Unless otherwise noted, the
apparatus was identical to that used by Drew and colleagues, (2007). Two retractable levers were
mounted on either side of a feeding trough, and a house light (model 1820; Med Associates)
located at the top of the chamber was used to illuminate the chamber during the sessions. The
specific reward outcomes used is detailed for each experiment.
Behavioral Procedures
Lever Press Training Procedures. Subjects in the CEC experiments were trained to
press levers for milk rewards using the procedure described by Drew and colleagues (2007).
Briefly, mice were first trained to consume the liquid milk reward from the feeding trough when
![Page 136: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/136.jpg)
121
a dipper containing milk was presented. To do this, mice were first placed into the operant
chamber with the dipper in the accessible, raised position. From the time mice first made a head
entry into the feeding trough, the dipper remained raised for another 10 seconds and then was
lowered. Following a variable inter-trial interval (ITI), (mean = 40s) a new trial began, which
was identical to the first trial. These sessions lasted 30 minutes or until the mice consumed 20
rewards. On the next day, the dipper began in the lowered position, but was raised while at the
same time a 0.5 second tone was played. The dipper then remained in the raised position for 8
seconds and then was lowered. A variable ITI interval (mean = 40s) was followed by an identical
trial through the remainder of the session. This was done to teach the mice to quickly get to the
feeding trough to consume the reward when it was presented. Sessions of this type lasted 30
minutes or until mice earned 30 rewards. Mice were exposed to sessions of this type until they
earned 30 rewards on 2 consecutive days. Once all subjects reached this criterion they were then
moved on to the training phase to learn to make lever presses for milk rewards.
In the second phase of basic lever press training mice were taught to press levers to earn
the rewards. In these continuous reinforcement sessions (CRF), mice were rewarded with a 5
second presentation of a dipper following every lever press. At the beginning of the session the
lever was extended into the chamber, remained extended for 2 rewards, and then retracted. After
a variable ITI (mean = 40s), the lever was re-extended and remained so for the next two rewards.
Sessions continued like this for 1 hour or until 40 rewards were earned. Mice were trained on
CRF until they earned 40 rewards on 3 consecutive days, and were then trained on random ratio
(RR) schedules. They were only moved on to a higher RR after they reached this same criterion.
In RR schedule, subjects were required to make some variable number of lever presses in each
trial, with the number required being drawn from an exponential distribution with a given mean.
![Page 137: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/137.jpg)
122
Subjects were tested in RR05, RR10, and RR20 schedules until they earned 40 rewards on 3
consecutive days. Once this criterion was reached, subjects were then moved on to the lever
hold-down training phase of the experiment.
Lever Hold-Down Training Procedures. Subjects in all experiments were exposed to
two different “hold down” procedures (Bailey et al, 2015): the Variable Interval Hold Down
(VIH) and the Progressive Hold Down (PHD). In both schedules, a required hold duration was
assigned prior to the start of each trial. This was the duration of time the subject was required to
hold the lever in the depressed position in order to receive a reward. Each individual trial in
either schedules followed a similar procedure: at the start of each trial, the house light was
illuminated and a lever was extended. As soon as the mouse depressed the lever, a timer began
counting how long the lever was in the depressed position. This timer stopped and reset to 0.0 if
the mouse ended the lever press before the required time was reached. If the lever was depressed
for as long as the required duration, the trial ended, and the subject received a reward. A tone (2
s) sounded and the house light was shut off until the start of the next trial to signal the
presentation of the dipper (5 s).
Variable Interval Hold training. Following initial lever press training (described
above), subjects were then trained to make lever holds using the VIH task. At the beginning of
each trial, the required hold duration was drawn randomly from a truncated exponential
distribution. This hold requirement remained in place until the subject was reinforced for
completing the trial, at which time the next trial’s required hold duration was randomly
determined. During the first session, the distribution of required hold durations had a mean = 0.5
s; (min = .01 s; max = 2.44 s). When a mouse earned 40 rewards on three consecutive days, the
required hold durations for the subsequent session were drawn from an exponential distribution
![Page 138: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/138.jpg)
123
with a higher mean (1 s, 2 s, 3 s, 4 s, 5 s, 8 s, 10 s). Thus, during the final session of VIH
training, subjects were required to hold down the lever for intervals that averaged 10 s, but could
be as long as 18.8 s.
Concurrent Effort Choice Task
All subjects were first trained to press a lever using the basic lever press training
procedure for a rewards of evaporated milk (.01 ml) delivered by raising a dipper located inside
the feeder trough for 5 seconds. Subjects were then exposed to increasing RR schedules on a
single lever. The progression was as follows: 3 days on CRF, 3 days on RR05, 3days on RR10,
and 3 days on RR20. Subjects were next trained to make lever holds on the opposite lever. This
was done using the VIH training program. Subjects were exposed to 3 days of each of the
following VIH programs in increasing order (0.5, 1.0, 2.0, 4.0, 6.0, 8.0, 10.0 sec).
Once subjects were trained to make presses on lever A and holds on lever B they were
then moved to the CEC task. In this task, subject could lever press on one lever, or lever hold on
the opposite lever to obtain a reward. The session consisted of two separate phases. First, there
were 10 forced choice trials in which either the press lever or the hold lever was presented (each
lever was presented 5 times). Once subjects completed all 10 of the single lever forced choice
trials, they were then presented with 40 free choice trials. In this phase of the session subjects
were presented with both levers and they were able to choose which one they worked on to
obtain the reward.
In this experiment, we first maintained the hold duration required at a constant value (5
seconds), and varied the press requirement over days. We used the following press requirements
(1, 5, 10, 20, 40, 80, and 160). After running through this progression 2 times, we then changed
![Page 139: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/139.jpg)
124
the hold requirement to 10 seconds and ran through the same progression of different press
requirements.
Concurrent Value Choice Task
All subjects were first trained to press a lever on each side of the operant chamber using
the basic lever press training procedure described above. The lever on one side of the chamber
delivered a liquid sucrose solution, while the opposite lever delivered a 14mg sucrose pellet.
Subjects were then exposed to increasing RR schedules on each lever for the different outcomes,
receiving one session for a single outcome per day. The progression was as follows: 3 days on
CRF, 3 days on RR05, 3days on RR10, and 3 days on RR20.
Once subjects were trained to make presses on lever A for liquid sucrose and lever
presses on lever B for a sucrose pellet they were then moved to the CVC task. Each CVC session
consisted of two separate phases. First, there were 10 single lever forced choice trials in which
either the sucrose lever or the pellet lever was presented. Subjects then had to complete the press
requirement on that lever to obtain the reward and gain experience with the cost of that particular
outcome on a given day. Each lever was presented 5 times in a semi-random order. Once
subjects completed all 10 of the forced choice trials they were then presented with 20 free choice
trials in which subjects were presented with both levers and they were able to choose which one
they worked on to obtain the reward.
Sucrose Concentration Manipulation. In this experiment, the cost of liquid sucrose was
fixed at 5 presses, whereas the cost for a pellet varied over days (10, 20, 40, or 80 presses).
Subjects were first tested in the CVC with 20% liquid sucrose and were exposed to each of the
different pellet costs in ascending order 2 times. Following this, subjects were then tested in the
![Page 140: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/140.jpg)
125
CVC with a 5% sucrose solution and were exposed to each of the different pellet costs in
ascending order 2 time. The average of these two rounds of 2 exposure to each of the different
pellet costs was taken for each sucrose concentration condition.
Outcome Devaluation Manipulation. In this experiment, the sucrose concentration was
kept at 5% and the cost of a sucrose reward was fixed at 5 presses. The cost of a pellet varied
over days (10, 20, 40, or 80 presses). Subjects were tested in 3 consecutive weeks in this
experiment. In week 1 (Valued-1), subjects were tested on the CVC schedule as described above.
In week 2 (Devalued), subjects received a 30 minute exposure to pellets prior to the testing
session, and then were tested in the CVC task. In week 3 (valued-2), subjects were tested on the
CVC without pre-session exposure to pellets.
Data Analysis
Planned statistical analyses are reported in the text, and post-hoc comparisons are reported in the
figure legends.
Results
Concurrent Effort Choice (CEC) task to measure effort-related choice
Mice were trained to make lever presses on the Press Lever (PL) and make lever holds on
the Hold Lever (HL). Once subjects were proficient at making the different types of responses on
the two different levers they were tested in the CEC task. In this task, subjects get 10 single lever
Forced Choice trials during which either the PL or the HL is presented. Subjects had to make the
required number of presses or hold for the required amount of time to earn rewards in the forced
trials. These trials served to teach the subjects the two different requirements on any given day,
because the hold duration and number of lever presses required to earn rewards changed over
![Page 141: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/141.jpg)
126
days. After completing the 10 forced trials subjects are then presented with 40 choice trials in
which both levers were presented and the subject can choose which lever to work on to earn
rewards (see Fig 4.1 for a schematic representation).
Figure 4.1. Schematic Representation of the CEC task. Session begins with 10 single levers trials
in which either the Press Lever (PL) or Hold Lever (HL) is presented. The number of presses
required on the PL varies over days (1, 5, 10, 20, 40, 80, or 160), as does the hold requirement on
the HL (5 or 10 sec). Upon completing the 10 single lever trials, subjects then received 40 choice
trials in which both levers were presented for subjects to choose which lever to work on to obtain
the milk reward.
Both Response Number and Hold Time Modulate Effort-Related Choice
In order to examine subject’s sensitivity to the different effort requirements of response
number and response duration, we performed an on the proportion of hold choices subjects made
during the choice trials, defined as a trial in which holding lead to reward (Fig 4.2A). The curve
for the 10 second condition displayed a rightward shift relative to the 5 second hold, indicating
subjects were willing to continue making lever presses to a higher extent when the alternative
was a 10 second as compared to a 5 second hold. For both hold durations subjects became more
likely to choose the hold option as the number of responses required to earn reward increased.
There was also a main effect of press requirement (F(3,15) = 223.495, p < .0001), a main effect of
hold duration (F(1,15) = 73.040, p < .0001), but no significant interaction (F(1,6) = 1.887, p =
.0815). To better capture the effect of the hold duration requirement on subjects choice we
![Page 142: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/142.jpg)
127
computed each subject’s point of subjective equality (PSE) by extrapolating the number of
presses at which the choice behavior would be equal to 0.5, which gave us an estimate of the
number of presses a subject considered equally effortful to a given hold duration. Subject’s PSE
was significantly affected by the hold duration condition (t (15) = 5.143, p < .0001; Fig 4.2B,
Left). All subjects were sensitive to this effort manipulation as indicated by both individual
subjects PSE’s (Fig 4.2B, Right), as well as individual subject’s choice functions (Supplemental
Figure 4.1S).
Figure 4.2. Effort-based choice in the CEC task. (A). Shows the mean + (SEM) proportion of
hold choices in the CEC task in 2 different hold duration conditions. (B). Left, shows the mean +
(SEM) point of subjective equality (PSE) for subjects in the 2 different hold duration conditions.
Right, shows individual subject’s shift in PSE. (C). Shows the mean + (SEM) number of switch
trials in the CEC task in 2 different hold duration conditions. (D). Shows the main effect of press
requirement for number of switch trials in the CEC task. n = 16; *p < .05; **p < .01
![Page 143: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/143.jpg)
128
From our analysis of each subject’s choice function (Fig 4.2B), we were able to
determine how effortful mice found different amounts of lever pressing to be relative to lever
holding. To explore a finer-grain analysis of choice behavior we examined whether decisions
came in a binary fashion, or whether mice started off pressing and then decided to switch over to
holding in the middle of a trial. We analyzed how frequently switch trials occurred, which we
defined as trials in which subjects began working on one lever, and then switched to the other
lever to complete obtain the reward in that trial. Overall, subjects switched work types in the
middle of a choice trial less than 20% of the time across all of the different conditions (Fig 4.2C).
We found a significant main effect of Press Requirement on the proportion of Switch Trials
made (Fig 4.2D; F(6,210) = 2.348, p = .0324), but no main effect of Hold Duration or any
interactions.
The Effort Requirement of Number of Presses Impacts the Latency to Begin Working
The data from the choice trials demonstrated subject’s sensitivity to different effort
requirements and their ability to adapt their behavior in response to changing costs. We next
wanted to see if there were any other indices of subject’s sensitivity to different costs in addition
to their choice behavior. To explore this, we examined behavior while subjects performed the
two different types of work (pressing vs holding). Since subjects completed different numbers of
presses and holds during the choice trials we examined behavior in the 10 single lever forced
trials at the beginning of each session, as all subjects would have completed equivalent numbers
of such trials. We first examined behavior in the press trials by looking at the total time it took
subjects to complete the different press requirements. Unsurprisingly, it took subjects a longer
total time to complete larger press requirements (Fig 4.3A), as we detected a significant main
effect of the press requirement (F(6, 90) = 198.2, p < .0001), a significant main effect of the hold
![Page 144: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/144.jpg)
129
duration (F(1, 15) = 14.46, p = .0017), and a significant press requirement × hold duration
interaction (F(6, 90) = 3.639, p = .0028), as subjects took longer to complete t the larger
requirements of 80 and 160 presses in the 5s hold condition (Fig 4.3A).
Figure 4.3. Response number costs impact latency to work. (A). Shows mean + (SEM) total
times to complete press trials for the Press Lever. (B). Schematic representing how the single
lever press trials were divided into latency to begin working (time from lever extension to first
response), and work time (time from first response to completion of the requirement). (C). Shows
mean + (SEM) latencies to begin working on the Press Lever. (D). Shows mean + (SEM) work
times on the Press Lever. (E). Shows mean + (SEM) latency to begin pressing as a function of the
trial number. n = 16; *p < .05; **p < .01
![Page 145: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/145.jpg)
130
We further parsed the data in lever pressing trials into 2 separate components: latency to
start working (amount of time that elapsed from when the lever was extended to start the trial
until the time the subject made the first press or hold), and work time (time from first response to
the delivery of the reward), (Fig 4.3B). We found that the latency to begin working increased as
a function of the ratio requirement (Fig 4.3C), as indicated by a significant main effect of the
press requirement (F(6, 90) = 21.39, p < .0001), but found no significant effect of the hold duration
(F(1, 15) = 2.474, p = .1366) on latency to begin working on the press bar. The increases in latency
as a function of the press requirement indicates that this is another measure which reflects
subjects sensitivity to different effort requirements, an observation which has been made in
previous studies as well (Capehart, Eckerman, Guilkey, & Shull, 1980; P. Killeen, 1969). We
next examined the work times on the press lever (Fig 4.3D) and found that work times also
significantly increased as a function of the number of presses required (F(6, 90) = 60.25, p < .001),
but this measure again was not impacted by the hold duration (F(1, 15) = 0.18, p = 0.6774).
The increasing latency to begin pressing as a function of the press requirement indicates
that this measure also reflects the subjects knowledge about what is required to obtain reward.
We looked at the latencies on a trial by trial basis each day to see how quickly this knowledge
developed with each new daily requirement. Figure 4.3E shows the latency to press as a function
of both the press requirement and trial number. We detected a significant main effect of Press
Requirement (F(6, 1057) = 45.966; p < 0.001), as well as a significant main effect of the trial
number (F(4, 1057) = 6.019; p < 0.001), and a significant press requirement × trial number
interaction (F(24, 1057) = 4.115; p < 0.001), as the impact of trial number was observed only at the
highest press requirements (Fig 4.3E).
The Effort Requirement of Hold Duration Impacts the Latency to Begin Working
![Page 146: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/146.jpg)
131
We next analyzed the total time to complete hold trials, the latency to begin hold trials,
and the hold work times to see how the effort manipulation of hold duration impacted these three
measures. We found that the total time to complete hold requirements increased for the longer
hold duration (Fig 4.4A), as we detected a significant main effect of hold duration (F (1, 15) =
61.69, p < .0001), as well as a significant main effect of press requirements (F (6, 90) = 2.789, p =
.0156), but no interaction was detected.
Figure 4.4. Response duration costs impact latency to work. (A). Shows mean + (SEM) total
times to complete hold trials for the Hold Lever. (B). Schematic representing how the single lever
hold trials were divided into latency to begin working (time from lever extension to first
response), and work time (time from first response to completion of the requirement). (C). Shows
mean + (SEM) latencies to begin working on the Hold Lever. (D). Shows mean + (SEM) work
times on the Hold Lever. (E). Shows the correlation between a subjects PSE in choice trials for
the different hold conditions with various temporal variables.
Further breaking this data into latencies and work times (Fig 4.4B) revealed that the
latency to begin working on the Hold Lever was significantly longer for the 10 second
requirement (Fig 4.4C; F (1, 15) = 14.45, p = 0.0017), but this was not significantly impacted by
the press requirement. The work times on the hold lever also significantly increased in the 10s
![Page 147: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/147.jpg)
132
requirement (Fig 4.4D); (F (1, 15) = 65.39, p < 0.0001), but there was no significant main effect of
press requirement or a hold duration by press requirement interaction. Collectively, the data on
latencies to begin working on either the press or the hold lever demonstrated that subjects are
sensitive to the increasing cost manipulations of both number and time.
Temporal Demands of Lever Pressing and Lever Holding are Predictive of Effort Choice
Sensitivity
We identified two independent measures which reflect a subject’s sensitivity to changing
effort demands of response number and response duration: the PSE derived from the proportion
of hold choices during choice trials, and the latency to begin working in the different work
options during the single lever trials. We next wanted to explore if any of these temporal
measures of responding (total time, latency, and work time) were factored into decisions about
how effortful a given work requirement is, and whether these temporal measures are predictive
of effort choice behavior.
To explore this, we looked at the relationship between a subject’s PSE and various
temporal variables. We first looked at 3 temporal measures related to lever pressing (Total Press
Times, Press Worktimes, and Press Latency). Because these measures systematically varied as a
function of the press requirement (Fig 4.2A, C-D), we fit each subject’s temporal response data
(i.e. Total Press Times), using simple linear regression (y = a + bx) where y is the value estimate,
(a) is the intercept, and (b) is the slope of the function for a given press requirement x. This
yielded high quality of fits, and the slope parameter (b) for Total Press Times, Press Work
Times, and Latency Press all significantly differed from 0 (see Table 4.2).
![Page 148: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/148.jpg)
133
Table 2. Results from linear regression of temporal variables
Intercept (a) Slope (b) Does Slope (b) Differ from 0?
Variable Hold
Duration R2 Mean SD Mean SD df t p-value
Press Worktime 5 sec 0.97 -2.94 3.41 0.81 0.29 15.00 10.99 < 0.001**
10 sec 0.97 -3.51 3.07 0.71 0.27 15.00 10.36 < 0.001**
Latency to Press 5 sec 0.81 1.05 1.12 0.13 0.11 15.00 4.86 < 0.001**
10 sec 0.71 0.80 1.26 0.10 0.10 15.00 4.17 < 0.001**
Press Total Time 5 sec 0.97 -2.46 3.05 0.80 0.24 15.00 13.51 < 0.001**
10 sec 0.97 -1.98 5.06 0.94 0.22 15.00 17.15 < 0.001**
Mean IRT 5 sec 0.39 0.27 0.09 0.00 0.00 15.00 -1.80 0.09
10 sec 0.51 0.23 0.09 0.00 0.00 15.00 -1.66 0.12
Mean Response Duration 5 sec 0.35 0.24 0.05 0.00 0.00 15.00 3.15 0.01
10 sec 0.16 0.21 0.08 0.00 0.00 15.00 3.59 < 0.001
Mean Hold Worktime 5 sec 0.24 8.02 4.13 -0.01 0.01 15.00 -2.38 0.03
10 sec 0.26 23.80 8.91 -0.03 0.07 15.00 -1.76 0.10
Mean Latency to Hold 5 sec 0.28 4.22 2.19 -0.01 0.02 15.00 -1.13 0.28
10 sec 0.20 6.93 3.91 -0.01 0.03 15.00 -1.19 0.25
Mean Hold Total Time 5 sec 0.33 12.23 5.12 -0.02 0.03 15.00 -2.23 0.04
10 sec 0.23 30.73 11.54 -0.04 0.09 15.00 -1.85 0.08
We examined the relationship of the slope (b) for each variable to each subjects PSE, and
found a negative relationship between the slope of these variables and PSE (Fig 4.5). The slope
of the function of increasing Press Total Time showed the strongest negative correlation with the
PSE (5s: r(15) = -0. 57; 10s: r(15) = -0.56). While less extreme that Press Total Time, there was
also a negative relationship between the slope parameter of Press Work Time (5s: r(15) = -0. 39;
10s: r(15) = -0.45) and the slope parameter for Latency to Press (5s: r(15) = -0. 38; 10s: r(15) = -
0.48).
Because these temporal variables related to lever pressing were negatively related to PSE
we also examined the correlation between the PSE and parameters related more directly to the
execution of individual lever presses: average response duration (time it took to execute each
lever press), and their average inter-response-time (IRT: time elapsing between the end of a press
and the beginning of the next press). Because neither response duration nor IRT was fit well by a
linear function as a function of press time we took the mean duration of this parameter for each
![Page 149: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/149.jpg)
134
subject. We found these variables to be less correlated with PSE than many of the other variables
(Fig 4.5), but there was a small negative correlation between PSE and response duration (5s: r(15)
= -0. 29; 10s: r(15) = -0.32), and to a less extent IRT (5s: r(15) = -0. 03; 10s: r(15) = -0.25).
Figure 4.5. Temporal variables are predictive of PSE in the CEC task. Shows the correlation
between a subjects PSE for the different hold conditions with various temporal variables.
We next looked at 3 temporal measures related to lever holding (total hold times, hold
work time, and latency to hold). Because these measures did not systematically vary as a
function of the press requirement (Fig 4.3A, C-D), and linear fits of these variables as a function
of press requirements did not produce good fits (Table 4.2), we used each subject’s average
duration for each of these temporal variables and examined the correlation with the PSE (Fig
4.5). We found that the average Hold Work Time was most strongly correlated with the PSE (5s:
r(15) = 0.54; 10s: r(15) = 0.51), and the Hold Total Time showed a similar positive correlation with
the PSE (5s: r(15) = 0.47; 10s: r(15) = 0.51). The correlation between Hold Latency and PSE was
not as strong of a relationship (5s: r(15) = 0.14; 10s: r(15) = 0.35).
![Page 150: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/150.jpg)
135
Taken together, these results suggest that one factor which likely goes into a subject’s
calculation of effort is the time it takes that subject to complete the different types of work.
Subjects who take longer to complete hold durations have higher PSE’s. On the flip side, the
longer it takes subjects to complete increasing press requirements is predictive of lower PSE’s
and switching from pressing to holding at lower press requirements.
Concurrent Value Choice (CVC) task to measure value-related choice
Mice were trained to earn liquid sucrose solutions by pressing on one lever and sucrose
pellets by pressing on the opposite lever. All subjects learned to press for these different
outcomes through a series of increasingly demanding random ratio schedules (RR5, RR10, and
RR20). We first examined the impact of reward value changes on subject’s choice behavior in
the CVC task by altering the concentration of a liquid sucrose solution (5% vs 20% sucrose
solution). In the CVC task subjects were able to work for either liquid sucrose, which always
required 5 lever presses, or they could work for a sucrose pellet which required 10, 20, 40, or 80
presses on consecutive days. At the start of each session, subjects were exposed to 10 single
lever forced choice trials in which either the sucrose lever or the pellet lever was extended. This
was done so subjects would learn the cost of the two options each day. Subjects were then given
20 choice trials in which both of the levers were extended (Fig 4.6).
Figure 4.6. Schematic representation of the CVC task. Session begins with 10 single levers trials
in which either the Sucrose Lever or Pellet Lever is presented. The number of presses required on
![Page 151: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/151.jpg)
136
the Pellet Lever varied over days (10, 20, 40, and 80), whereas the # of presses required on the
Sucrose Lever was always 5 presses. Upon completing the 10 single lever trials subjects then
received 20 choice trials in which both levers were presented for subjects to choose which lever
to work on to obtain reward.
Subjects are Sensitive to Sucrose Concentration in a Value Choice Task
To examine subject’s choice behavior we computed the proportion of liquid sucrose
choices during the 20 choice trials for 5 and 20% sucrose solution (Fig 4.7A). Choice behavior
was modulated by both the ratio value (Cost of the Pellets; F (3, 45) = 87.71, p < 0.0001), as well
as the sucrose concentration (F (1, 15) = 64.16, p < 0.0001).
Figure 4.7. Reward magnitude alters value-based choice in the CVC task. (A). Shows the mean
+ (SEM) proportion of hold choices in the CVC task in 2 different sucrose concentration
conditions. (B). Left, shows the mean + (SEM) point of subjective equality (PSE) for subjects in
the 2 different sucrose concentration conditions. Right, shows individual subject PSE’s. n = 16;
*p < .05; **p < .01
To further analyze the extent to which sucrose concentration impacted choice behavior
we determined the PSE for each subject, which reflects the point at which subjects are choosing
both the sucrose reward and the pellet reward with equal probability. The sucrose concentration
led to a significant difference in the PSE (Fig 4.7B, Left) as the 5% PSE was significantly higher
![Page 152: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/152.jpg)
137
than the 20% PSE (t (15) = 7.946, p < 0.0001), and 15 out of 16 subjects exhibited this change in
PSE (Fig 4.7B, Right) and in their choice function (Supplemental Fig 4.2S).
Reward Value Influences both Latency to Work and how Fast Subjects Work
We next wanted to see what other aspects of behavior were modulated by value. Since
subjects completed different numbers of requirements on the different levers during the choice
trials we analyzed the first 10 single lever forced trials, as subjects completed equivalent
numbers of these trial types. We examined latency to begin responding, as well as work time.
We first examined the sucrose lever trials across the two different sucrose concentration
conditions. The sucrose concentration impacted worktimes for sucrose (Fig 4.8A), as sucrose
concentration was the only factor having a significant main effect (F (1, 15) = 11.5, p = 0.0040),
indicating that even though the sucrose only required 5 presses, subjects were slower to complete
those 5 presses when it was for 05% sucrose. The sucrose concentration also significantly
influenced latency to begin pressing for sucrose (Fig 4.8B), as there was a main effect of ratio (F
(3, 45) = 4.038, p = 0.0126), sucrose concentration (F (1, 15) = 4.66, p = 0.0475), and a ratio ×
sucrose concentration interaction (F (3, 45) = 3.258, p = 0.0301).
Figure 4.8. Reward magnitude modulates latency to work. (A). Shows mean + (SEM) sucrose
work times on the sucrose lever in the 2 different sucrose concentration conditions. (B). Shows
mean + (SEM) sucrose trial latencies to begin working on the sucrose lever in the 2 different
![Page 153: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/153.jpg)
138
sucrose concentration conditions. (C). Shows mean + (SEM) sucrose work times on the pellet
lever in the 2 different sucrose concentration conditions. (D). Shows mean + (SEM) sucrose trial
latencies to begin working on the pellet lever in the 2 different sucrose concentration conditions.
We next examined the impact of sucrose concentration on work time and latency for
pellets in the pellet lever trials. While the sucrose manipulation did not directly influence the
pellet value, there was a difference in the relative value between the two rewards. This difference
appeared to matter as sucrose concentration modulated how quickly subjects worked for pellets
(Fig 4.8C), indicated by a significant main effect of ratio (F (3, 45) = 77.99, p < 0.0001), a main
effect of sucrose concentration (F (1, 15) = 7.209, p = 0.0170), and a ratio by sucrose concentration
interaction (F (3, 45) = 3.327, p = 0.0278). This suggests that the value of pellet relative to the
sucrose modulated the rate of responding for the pellet reward. We next examined the latency to
begin working for a pellet (Fig 4.8D) and found this to only be modulated by the pellet cost (F (3,
45) = 2.506, p = 0.0710).
Reward Value Influences Bout and Pause Dynamics
The reward magnitude had an impact on how vigorously subjects worked for the pellet
rewards, so we further analyzed the lever press for the pellet reward to better characterize the
way in which the increased vigor manifested. We began by determining the average bout length
(defined as the number of consecutive responses occurring without a 2 second inter-response-
time between any two responses) and the number of pauses (defined as inter-response-times
lasting longer that 2 sec) at each of the pellet ratio requirements (Fig 4.9A). The average bout
length was altered by the reward value manipulation (Fig 4.9B), as there was a main effect of
ratio (F (3, 45) = 46.32, p < 0.0001) and sucrose concentration (F (1, 15) = 12.91, p = 0.0027).
Similarly, the number of pauses subjects took was influenced by the reward value manipulation
![Page 154: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/154.jpg)
139
(Fig 4.9C), indicated through a main effect of ratio (F (3, 45) = 25.5, p < 0.0001) and sucrose
concentration (F (1, 15) = 20.34, p = 0.0004).
Figure 4.9. Reward magnitude modulates the vigor of responding for pellets. (A). Schematic
representation of the defining of a Bout (number of consecutive presses made without a 2 second
inter-response-time occurring), and Pause (a duration of greater than 2 seconds elapsing without a
press being made). (B). Shows mean + (SEM) Bout Length in the 2 different sucrose
concentration conditions. (C). Shows mean + (SEM) # of Pauses in the 2 different sucrose
concentration conditions. n = 16; *p < .05; **p < .01
Reward Devaluation impacts value based choice in the CVC task
The CVC task is able to detect changes in subject’s reward based choice behavior when
the quality or magnitude of the reward is altered as demonstrated by using 20% vs 5% liquid
sucrose concentration. Additionally, this reward magnitude manipulation impacts the latency and
vigor with which subjects will work for both the specific reward being altered (liquid sucrose), as
well as the alternative outcome (pellet). We next wanted to assess whether the CVC task would
be sensitive to a manipulation which devalues one of the rewards as this is another method used
to study whether behavior is sensitive to reward value. In this manipulation, subjects experienced
one week of normal testing in the CVC task (Valued-1), followed by a week in which the pellet
![Page 155: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/155.jpg)
140
reward was devalued by giving subjects 30 minutes of free access to pellets prior to the session
(Devalued), followed by a second normal week of testing in the CVC task (Valued-2). The
proportion of sucrose lever choices generally increased when the pellets were devalued (Fig
4.10A), and an ANOVA detected a significant main effect of devaluation condition (F(1, 15) =
143.2, p < 0.0001), the pellet cost (F(3, 45) = 88.47, p < 0.0001), and a pellet cost by devaluation
interaction (F(3, 45) = 9.044, p < 0.0001).
Figure 4.10. Reward devaluation in the CVC task. (A). Shows the mean + (SEM) proportion of
hold choices in the CVC task for the different devaluation conditions. (B). Left, Shows the mean
+ (SEM) point of subjective equality (PSE) for subjects in the different devaluation conditions.
Right, Shows individual subjects PSE’s.
The PSE was also significantly impacted by the devaluation condition (t (15) = 12.06, p <
0.0001; Fig 4.10B, Left), indicating that subjects were willing to pay a smaller cost for pellets
when they had been devalued. This was true for 100% of subjects, as indicated by both the PSE
(Fig 4.10B, Right), as well as their overall choice functions (Supplementary Fig 4.3S).
Reward Devaluation Modulates Latency and Vigor of Work
We next examined work times and latency in the single lever force choice trials at the
beginning of the session to examine whether devaluing the pellet reward modulated these aspects
![Page 156: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/156.jpg)
141
of behavior. The pellet work times (Fig 4.11A) were significantly impacted by both the pellet
cost (F (3, 45) = 70.25, p < 0.0001), and devaluation (F (2, 30) = 8.405, p = 0.0013). Devaluing the
pellet reward also lead to a significant increase in the latency to begin working for pellets (Fig
4.11B), as an ANOVA detected a main effect of pellet cost (F (3, 45) = 4.652, p = 0.0065), a
significant main effect of devaluation (F (2, 30) = 16.21, p < 0.0001), and a significant pellet cost ×
devaluation interaction (F (6, 90) = 7.081, p < 0.0001).
Figure 4.11. Reward devaluation modulates vigor of responding. (A). Shows the mean + (SEM)
Work Time for Pellets in the different devaluation conditions. (B). Shows the mean + (SEM)
Latency to begin working for pellets in the different devaluation conditions. (C). Shows the mean
+ (SEM) Bout Length when working for pellets in the different devaluation conditions. (D).
Shows the mean + (SEM) Number of Pauses taken when working for pellets in the different
devaluation conditions. n = 16; *p < .05; **p < .01
To further characterize the nature of this change in the rate in which subjects worked we
looked at the dynamics of each work bout within a trial. The devaluation impacted bout lengths
(Fig 4.11C), and an ANOVA detected a main effects of pellet cost (F (3, 45) = 60.07, p < 0.0001),
a significant main effect of devaluation (F (2, 30) = 28.81, p < 0.0001), and a significant pellet cost
by devaluation interaction (F (6, 90) = 11.27, p < 0.0001). Additionally, the number of pauses (2
seconds or greater) increased as a function of the devaluation manipulation (Fig 4.11D), and an
ANOVA detected a significant main effect of pellet cost (F (3, 45) = 15.85, p < 0.0001), a
![Page 157: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/157.jpg)
142
significant main effect of devaluation (F (2, 30) = 17.07, p < 0.0001), and a significant pellet cost
by devaluation interaction (F (6, 90) = 3.26, p = 0.0060).
Discussion
How the brain processes information about effort and value in order to allow animals to
engage in effective cost-benefit decision making is a question of interest for many fields. In order
to perform cost-benefit computations animals must be able to (1) estimate the effort that will be
required to obtain a particular goal, (2) estimate the value or benefit of the goal to be obtained,
and (3) compare the anticipated effort against the estimated value (Rangel & Hare, 2010).
Because we cannot directly measure a subject’s experience of how effortful a task was, or how
rewarding an outcome was directly, we must rely on behavioral readouts to infer these things in
non-verbal organisms. In order to assess which of the 3 processes needed for cost-benefit
decision making is impacted by any experimental manipulation, it is sometimes critical to have
behavioral tasks which do not confound effort and value. In the present set of experiments we
have developed and characterized two behavioral tasks known as the Concurrent Effort Choice
(CEC) task, and the Concurrent Value Choice (CVC) task to isolate the effects of effort and
value in cost-benefit decision making.
The Concurrent Effort Choice Task Measures Subject’s Sensitivity to Effort Changes
Independent of Changes in Reward Value
In the CEC task, our subjects can make the choice between lever pressing to earn a milk
reward, or holding a lever down for a required duration of time to earn the same milk reward.
Previous work has demonstrated that effort manipulations of hold duration are sensitive to the
same motivational factors as the more traditionally used effort manipulation of number of
![Page 158: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/158.jpg)
143
responses (Bailey et al., 2015). In the choice trials of the CEC task, the choice of which work
requirement is chosen is sensitive to both the effort manipulation of number of responses and
duration of hold time. When choosing between pressing and holding for either 5 or 10 seconds,
all subjects choose to make lever presses when the requirement is low and switch over to holding
as the press requirement increases. When the press requirement gets large enough subjects
choose to hold for all of their rewards. This sigmoidal function indicates that subjects are
sensitive to the number of responses effort manipulation. The fact that the entire choice function
shows a rightward shift when comparing choice between the 5 and 10 second hold requirement
indicates that subjects are sensitive to the effort manipulation of time. We further demonstrated
this by showing that subjects point of subjective equality, or the number of presses for which
they choose to hold and press at equal rates, shifts as the hold requirement is increased from 5 to
10 seconds.
The Concurrent Value Choice task measures subject’s sensitivity to value changes
In the CVC task, subjects choose to work for either a liquid sucrose solution or a sucrose
pellet. By altering the reward value of these two outcomes, either through titrating the sucrose
concentration of the liquid sucrose solution, or by devaluing the pellet outcome with a pre-
feeding procedure, we are able to see how these changes in reward value impact subject’s choice
behavior. In the choice trials of the CVC task, subjects were sensitive to both sucrose
concentration manipulation changes (5 vs 20% sucrose), as well as devaluation. We capture this
shift in the value of these outcomes by looking at the point of subjective equality, defined as the
number of presses for which subjects will equally choose the pellets of the sucrose. When the
sucrose concentration is 20%, subjects will choose the liquid sucrose much more often than
when the sucrose concentration is 5%.
![Page 159: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/159.jpg)
144
The CEC task reveals differences in effort judgements through changes in latency
In addition to the behavioral differences in subject’s choice behavior in the CEC task we
were also able to identify two more measures which were sensitive to both number and time
manipulations of effort. First, the work time, defined as the time from the first response until the
subject earned a reward, for both lever pressing and lever holding increased as a function of
press requirement and hold duration. This makes sense as it will necessarily take more time to
execute more presses as the requirement increases, and it takes 5 seconds longer to make a 5 vs
10 second hold. Despite this, we propose this is a useful variable to analyze when studying effort
based choice. If work time greatly increases following a manipulation, this indicates that the
impact of the manipulation is likely impacting motor behavior, which may or may not lead to
changes in choice behavior.
Whereas the time to complete the work requirement must necessarily increase as work
requirements are increased, the latency to being working does not necessarily have to increase as
well. Despite this, we observed that the latency to begin working was modulated by the effort
requirement, as this went up both for the manipulation of number of presses as well as for the
hold duration. Latency to begin working is therefore another potential measure which reflects a
subject’s sensitivity to different effort magnitudes in addition to their choice behavior.
Value and Effort affect the dynamics of responding differently
In addition to choice behavior, we were able to detect other measures of behavior which
were sensitive to reward value changes as well. Subject’s worked at a faster rate for 20% sucrose
as compared to 5% sucrose. They also began working for the 20% sucrose more rapidly than 5%
sucrose as the latency to begin working was lower as well. We also observed a similar sensitivity
![Page 160: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/160.jpg)
145
to reward value through the pellet devaluation manipulation, as subjects worked faster for pellets
and had shorter latencies when they were valued, but the work times were slower and the
latencies were longer when the pellet was devalued. These two manipulations (sucrose
concentration and devaluation) both reflect that the rate and likelihood of starting to engage in
goal-directed action are modulated by reward value, and that the CVC task can measure this,
without the confound of differences in effort requirements for the different reward values.
Importantly, these two measures of behavior directly demonstrate that subjects are able use
expectations about reward value to guide their behavior, working faster for the better sucrose
solution. This is an important distinction from responding at a faster rate when engaging in the
consumption of a reward. It has been nicely demonstrated that subjects will increase their rates of
licking for higher sucrose concentrations when the liquids are readily available to subjects
(Glendinning et al. 2002). While lick rates in such tasks show that reward magnitude can
modulate rates of consumption, the data in the CVC task importantly demonstrates that reward
value can guide the vigor of goal-directed behavior.
Advantages of CEC and CVC tasks
As Winstanely & Floresco point out, paying close attention to small experimental details
is necessary to fully appreciate exactly what a behavior task can really teach us (Winstanley and
Floresco, 2016). The change in choice of the press lever when press requirement changes
demonstrates subjects are sensitive to effort manipulations of number of responses is similar to
that observed in an operant effort discounting task (Floresco et al., 2008). In the operant effort
discounting task, subjects can make a large number of responses (2, 5, 10, or 20) for 4 pellets, or
they can make a single lever press for 2 pellets. The important difference between the operant
effort discounting task and the CEC is that the reward value does not differ for the two different
![Page 161: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/161.jpg)
146
work options of lever pressing or lever holding. Similarly in procedures (Salamone, 1991) have
employed animals are given a choice between making an effortful response for a preferred
reward versus a less effortful response for a less preferred reward. A manipulation which causes
a change in choice behavior in this type of task is useful in differentiating whether a change in
willingness to work for the preferred reward has to do with satiation. However, such changes
could be due to due to changes in (1) estimate of effort (2, 5, 10, or 20 vs 1 press), (2) estimate of
value or benefit of the goal (4 pellets vs 2 pellets), or (3) the comparison of the anticipated effort
against the estimated value. A change in the behavior in the CEC task, on the other hand could
only result from a change in (1) estimate of effort (evaluating press number relative to hold
duration), or (3) the comparison of the anticipated effort against the estimated value, but could
not be explained by (2) estimate of value or benefit of the goal, as both options offer the same
reward.
Similarly, the CVC task offers some advantages over some of the previously used
measures of value or reward sensitivity. One such method which is known as a Preference
Assessment gives a subject free access to either a liquid sucrose solution or water (Muscat and
Willner 1989), uses a subject’s development of a preference for the liquid sucrose over water to
reflect the capacity to experience the hedonic pleasure associated and reward sensitivity (Ward,
2015). While this method does reflect a subject’s ability to make choices based on reward value,
it does not explicitly require subjects to use the reward value estimations to make decisions about
how much effort they are willing to expend to gain access to that reward. The CVC task requires
subjects to use information about reward value to guide on-going, goal-directed responding at
different effort requirements.
![Page 162: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/162.jpg)
147
Another method which has been previously used to study an animal’s sensitivity to
reward value is an outcome devaluation procedure. Subjects first learn to make two different
types of responses (e.g. pressing a lever or pulling a chain), which lead to two different reward
outcomes (sucrose solution or a pellet). After instrumental training for these two responses is
performed one of the two outcomes is devalued before subjects are tested in a session in which
they have the opportunity to work for the two different outcomes. When one outcome is
devalued before the start of the testing session (either through satiation or pairing the outcome
with an illness producing agent), the animal will be less likely to make responses for that
outcome in the subsequent testing session (Corbit & Balleine, 2005; Yin, Ostlund, Knowlton, &
Balleine, 2005), demonstrating that the current reward value of the outcomes is guiding the
instrumental behavior. This effect has been shown in humans (Klossek, Russell, & Dickinson,
2008; Valentin, Dickinson, & O'Doherty, 2007), monkeys (West, DesJardin, Gale, & Malkova,
2011), rats (B. Balleine & Dickinson, 1991; Bernard Balleine & Dickinson, 1992), and mice
(Crombag, Johnson, Zimmer, Zimmer, & Holland, 2010; Hilário, Clouse, Yin, & Costa, 2007).
This outcome devaluation task has helped identify the important role of the BLA, the Insular
cortex, and the NAcc in outcome devaluation (Parkes and Balleine, 2013; Parkes et al., 2015).
This procedure asks subjects to use reward value information in a similar way to what is asked of
subjects in the CVC task, so it would be interesting to know whether these structures are
involved in actively processing information about reward value while subjects are working in the
CVC task. An advantage offered from the CVC task is that subjects can be tested on this task
continuously, and behavior before the manipulations looks indistinguishable from behavior after
the reward value has been returned to the original level, as is demonstrated in the CVC pellet
devaluation experiment.
![Page 163: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/163.jpg)
148
Conclusions and Future Directions
There is a large interested in understanding how the central nervous system processes
information about effort and value, and how it is able to use this information to enable subjects to
engage in cost-benefit decision making. Human neuroimaging studies are attempting to identify
brain circuits involved in these processes in humans (Croxson et al., 2009; Prévost, Pessiglione,
Météreau, Cléry-Melin, & Dreher, 2010), clinical researchers and psychiatrists are attempting to
understand how symptoms from a number of different disorders may be manifesting as a result
of impairments in decision making (Fervaha, Foussias, Agid, & Remington, 2013; G. Fervaha et
al., 2013; Gold et al., 2015; John D. Salamone, Koychev, Correa, & McGuire, 2015), and basic
researchers aiming to further elucidate and dissect the circuits involved in these types of
information processing (Hillman & Bilkey, 2010; Wang, Shi, & Li, 2017). The commonality
across all of these research endeavors is that one must ultimately rely on some type of behavioral
readout to draw inferences about how a human participant or experimental model organisms is
processing information about costs and reward values and subsequently using this information in
a cost-benefit decision-making computation. We have developed the CEC and CVC tasks with
the hope that using these or similar methods will ultimately aid in future endeavors to more
precisely investigate the behavioral, neural, and pharmacological mechanisms which influence
cost-benefit decision making so that knowledge gained in experimental organisms may help to
better understand and more effectively treat human disorders for which similar cost-benefit
decision making processes have gone awry.
![Page 164: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/164.jpg)
149
Supplemental Figures
Figure 4.S1. Individual Subject Choice Functions in CEC task. Shows the proportion hold
choices made for every subject tested in the CEC task as a function of the lever press requirement
for both the 5s Hold Condition (White) and 10s Hold Condition (Black).
![Page 165: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/165.jpg)
150
Figure 4.S2. Individual Subject Choice Functions in CVC task for 5 vs 20% Sucrose. Shows the
proportion of sucrose choices made for every subject tested in the CVC task as a function of the
pellet cost for both the 5% sucrose concentration (White) and 20% sucrose concentration (Black).
![Page 166: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/166.jpg)
151
Figure 4.S3. Individual Subject Choice Functions in CVC task with Pellet Devaluation. Shows
the proportion of sucrose choices made for every subject tested in the CVC task as a function of
the pellet cost in the 3 different conditions: Valued-1 (White), Devalued (Blue), and Valued-2
(Grey).
![Page 167: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/167.jpg)
152
References
Atalayer, D., & Rowland, N. E. (2009). Meal patterns of mice under systematically varying
approach and unit costs for food in a closed economy. Physiology & Behavior, 98(1-2),
85-93. doi: 10.1016/j.physbeh.2009.04.016
Bailey, M. R., Jensen, G., Taylor, K., Mezias, C., Williamson, C., Silver, R., . . . Balsam, P. D.
(2015). A novel strategy for dissecting goal-directed action and arousal components of
motivated behavior with a progressive hold-down task. Behavioral Neuroscience, 129(3),
269-280. doi: 10.1037/bne0000060
Bailey, M. R., Simpson, E. H., & Balsam, P. D. (2016). Neural substrates underlying effort, time,
and risk-based decision making in motivated behavior. Neurobiology of Learning and
Memory, 133, 233-256. doi: http://dx.doi.org/10.1016/j.nlm.2016.07.015
Balleine, B., & Dickinson, A. (1991). INSTRUMENTAL PERFORMANCE FOLLOWING
REINFORCER DEVALUATION DEPENDS UPON INCENTIVE LEARNING.
Quarterly Journal of Experimental Psychology Section B-Comparative and Physiological
Psychology, 43(3), 279-296.
Balleine, B., & Dickinson, A. (1992). Signalling and incentive processes in instrumental
reinforcer devaluation. The Quarterly Journal of Experimental Psychology Section B,
45(4), 285-301. doi: 10.1080/14640749208401007
Balsam, P. D., Drew, M. R., & Gallistel, C. R. (2010). Time and Associative Learning.
Comparative Cognition & Behavior Reviews, 5, 1-22. doi: 10.3819/ccbr.2010.50001
Balsam, P. D., & Gallistel, C. R. (2009). Temporal maps and informativeness in associative
learning. Trends in Neurosciences, 32(2), 73-78. doi:
http://dx.doi.org/10.1016/j.tins.2008.10.004
Baron, A., & Derenne, A. (2000). Progressive-ratio schedules: Effects of later schedule
requirements on earlier performances. Journal of the Experimental Analysis of Behavior,
73(3), 291-304. doi: 10.1901/jeab.2000.73-291
Capehart, G. W., Eckerman, D. A., Guilkey, M., & Shull, R. L. (1980). A comparison of ratio
and interval reinforcement schedules with comparable interreinforcement times. Journal
of the Experimental Analysis of Behavior, 34(1), 61-76. doi: 10.1901/jeab.1980.34-61
Collier, G., & Johnson, D. F. (1997). Who is in Charge? Animal vs Experimenter Control.
Appetite, 29(2), 159-180. doi: http://dx.doi.org/10.1006/appe.1997.0124
Corbit, L. H., & Balleine, B. W. (2005). Double dissociation of basolateral and central amygdala
lesions on the general and outcome-specific forms of pavlovian-instrumental transfer.
Journal of Neuroscience, 25(4), 962-970. doi:10.1523/jneurosci.4507-04.2005
Cousins, M. S., & Salamone, J. D. (1994). NUCLEUS-ACCUMBENS DOPAMINE
DEPLETIONS IN RATS AFFECT RELATIVE RESPONSE ALLOCATION IN A
![Page 168: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/168.jpg)
153
NOVEL COST/BENEFIT PROCEDURE. Pharmacology Biochemistry and Behavior,
49(1), 85-91. doi: 10.1016/0091-3057(94)90460-x
Covarrubias, P., & Aparicio, C. F. (2008). Effects of reinforcer quality and step size on rats?
performance under progressive ratio schedules. Behavioural Processes, 78(2), 246-252.
doi: http://dx.doi.org/10.1016/j.beproc.2008.02.001
Crombag, H. S., Johnson, A. W., Zimmer, A. M., Zimmer, A., & Holland, P. C. (2010). Deficits
in sensory-specific devaluation task performance following genetic deletions of
cannabinoid (CB1) receptor. Learning & Memory, 17(1), 18-22. doi:
10.1101/lm.1610510
Croxson, P. L., Walton, M. E., O'Reilly, J. X., Behrens, T. E. J., & Rushworth, M. F. S. (2009).
Effort-Based Cost-Benefit Valuation and the Human Brain. Journal of Neuroscience,
29(14), 4531-4541. doi: 10.1523/jneurosci.4515-08.2009
Dews, P. B. (1978). STUDIES ON RESPONDING UNDER FIXED-INTERVAL SCHEDULES
OF REINFORCEMENT: II. THE SCALLOPED PATTERN OF THE CUMULATIVE
RECORD. Journal of the Experimental Analysis of Behavior, 29(1), 67-75. doi:
10.1901/jeab.1978.29-67
Fervaha, G., Foussias, G., Agid, O., & Remington, G. (2013). Neural substrates underlying effort
computation in schizophrenia. Neuroscience & Biobehavioral Reviews, 37(10, Part 2),
2649-2665. doi: http://dx.doi.org/10.1016/j.neubiorev.2013.09.001
Fervaha, G., Graff-Guerrero, A., Zakzanis, K. K., Foussias, G., Agid, O., & Remington, G.
(2013). Incentive motivation deficits in schizophrenia reflect effort computation
impairments during cost-benefit decision-making. J Psychiatr Res, 47(11), 1590- 1596.
doi: 10.1016/j.jpsychires.2013.08.003
Flaherty, C. F. (1982). Incentive contrast: A review of behavioral changes following shifts in
reward. Animal Learning & Behavior, 10(4), 409-440.
Flaherty, C. F., Turovsky, J., & Krauss, K. L. (1994). Relative hedonic value modulates
anticipatory contrast. Physiology & Behavior, 55(6), 1047-1054. doi:
http://dx.doi.org/10.1016/0031-9384(94)90386-7
Floresco, S. B., Tse, M. T. L., & Ghods-Sharifi, S. (2008). Dopaminergic and glutamatergic
regulation of effort- and delay-based decision making. Neuropsychopharmacology, 33(8),
1966-1979. doi: 10.1038/sj.npp.1301565
Fouriezos, G., Bielajew, C., & Pagotto, W. (1990). Task difficulty increases thresholds of
rewarding brain stimulation. Behavioural Brain Research, 37(1), 1-7. doi:
http://dx.doi.org/10.1016/0166-4328(90)90066-N
Fowler, S. C., Morgenst.C, & Notterma.Jm. (1972). SPECTRAL ANALYSIS OF
VARIATIONS IN FORCE DURING A BAR-PRESSING TIME DISCRIMINATION.
Science, 176(4039), 1126-&. doi: 10.1126/science.176.4039.1126
![Page 169: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/169.jpg)
154
Glendinning, J. I., Gresack, J., & Spector, A. C. (2002). A High-throughput Screening Procedure
for Identifying Mice with Aberrant Taste and Oromotor Function. Chemical Senses,
27(5), 461-474. doi: 10.1093/chemse/27.5.461
Gold, J. M., Waltz, J. A., & Frank, M. J. (2015). Effort cost computation in schizophrenia: a
commentary on the recent literature. Biol Psychiatry, 78(11), 747-753. doi:
10.1016/j.biopsych.2015.05.005
Hauber, W., & Sommer, S. (2009). Prefrontostriatal Circuitry Regulates Effort-Related Decision
Making. Cerebral Cortex, 19(10), 2240-2247. doi: 10.1093/cercor/bhn241
Hilário, M., Clouse, E., Yin, H., & Costa, R. (2007). Endocannabinoid signaling is critical for
habit formation. Frontiers in Integrative Neuroscience, 1(6). doi:
10.3389/neuro.07.006.2007
Hillman, K. L., & Bilkey, D. K. (2010). Neurons in the Rat Anterior Cingulate Cortex
Dynamically Encode Cost-Benefit in a Spatial Decision-Making Task. Journal of
Neuroscience, 30(22), 7705-7713. doi: 10.1523/jneurosci.1273-10.2010
Hodos, W. (1961). PROGRESSIVE RATION AS A MEASURE OF REWARD STRENGTH.
Science, 134(348), 943-&. doi: 10.1126/science.134.3483.943
Hutsell, B. A., & Newland, M. C. (2013). A quantitative analysis of the effects of qualitatively
different reinforcers on fixed ratio responding in inbred strains of mice. Neurobiology of
Learning and Memory, 101, 85-93. doi: http://dx.doi.org/10.1016/j.nlm.2013.01.005
Ishiwari, K., Weber, S. M., Mingote, S., Correa, M., & Salamone, J. D. (2004). Accumbens
dopamine and the regulation of effort in food-seeking behavior: modulation of work
output by different ratio or force requirements. Behavioural Brain Research, 151(1–2),
83-91. doi: http://dx.doi.org/10.1016/j.bbr.2003.08.007
Killeen, P. (1969). Reinforcement frequency and contingency as factors in fixed-ratio behavior.
Journal of the Experimental Analysis of Behavior, 12(3), 391-395. doi:
10.1901/jeab.1969.12-391
Killeen, P. R., Posadas-Sanchez, D., Johansen, E. B., & Thrailkill, E. A. (2009). Progressive
Ratio Schedules of Reinforcement. Journal of Experimental Psychology-Animal
Behavior Processes, 35(1), 35-50. doi: 10.1037/a0012497
Klossek, U. M. H., Russell, J., & Dickinson, A. (2008). The control of instrumental action
following outcome devaluation in young children aged between 1 and 4 years. Journal of
Experimental Psychology: General, 137(1), 39-51. doi: 10.1037/0096-3445.137.1.39
McDowell, J. J. (2013). On the theoretical and empirical status of the matching law and
matching theory. Psychological Bulletin, 139(5), 1000-1028. doi: 10.1037/a0029924
Mechner, F. (1958). PROBABILITY RELATIONS WITHIN RESPONSE SEQUENCES
UNDER RATIO REINFORCEMENT1. Journal of the Experimental Analysis of
Behavior, 1(2), 109-121. doi: 10.1901/jeab.1958.1-109
![Page 170: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/170.jpg)
155
Miller, M., Thome, A., & Cowen, S. (2013). Intersection of Effort and Risk: Ethological and
Neurobiological Perspectives. Frontiers in Neuroscience, 7(208). doi:
10.3389/fnins.2013.00208
Notterman, J. M., & Mintz, D. E. (1965). Dynamics of response: Wiley.
Platt, J. R., & Johnson, D. M. (1971). Localization of position within a homogeneous behavior
chain: Effects of error contingencies. Learning and Motivation, 2(4), 386-414. doi:
http://dx.doi.org/10.1016/0023-9690(71)90020-8
Prévost, C., Pessiglione, M., Météreau, E., Cléry-Melin, M.-L., & Dreher, J.-C. (2010). Separate
Valuation Subsystems for Delay and Effort Decision Costs. The Journal of Neuroscience,
30(42), 14080-14090. doi: 10.1523/jneurosci.2752-10.2010
Rangel, A., & Hare, T. (2010). Neural computations associated with goal-directed choice.
Current Opinion in Neurobiology, 20(2), 262-270. doi:
http://dx.doi.org/10.1016/j.conb.2010.03.001
Salamone, J. D., Cousins, M. S., & Bucher, S. (1994). ANHEDONIA OR ANERGIA -
EFFECTS OF HALOPERIDOL AND NUCLEUS-ACCUMBENS DOPAMINE
DEPLETION ON INSTRUMENTAL RESPONSE SELECTION IN A T-MAZE COST-
BENEFIT PROCEDURE. Behavioural Brain Research, 65(2), 221-229. doi:
10.1016/0166-4328(94)90108-2
Salamone, J. D., Koychev, I., Correa, M., & McGuire, P. (2015). Neurobiological basis of
motivational deficits in psychopathology. European Neuropsychopharmacology, 25(8),
1225-1238. doi: http://dx.doi.org/10.1016/j.euroneuro.2014.08.014
Salamone, J. D., Steinpreis, R. E., McCullough, L. D., Smith, P., Grebel, D., & Mahan, K.
(1991). HALOPERIDOL AND NUCLEUS-ACCUMBENS DOPAMINE DEPLETION
SUPPRESS LEVER PRESSING FOR FOOD BUT INCREASE FREE FOOD-
CONSUMPTION IN A NOVEL FOOD CHOICE PROCEDURE. Psychopharmacology,
104(4), 515-521. doi: 10.1007/bf02245659
Schweimer, J., & Hauber, W. (2005). Involvement of the rat anterior cingulate cortex in control
of instrumental responses guided by reward expectancy. Learning & Memory, 12(3),
334-342. doi: 10.1101/lm.90605
Shafiei, N., Gray, M., Viau, V., & Floresco, S. B. (2012). Acute Stress Induces Selective
Alterations in Cost/Benefit Decision-Making. Neuropsychopharmacology, 37(10), 2194-
2209. doi: 10.1038/npp.2012.69
Simpson, E. H., & Balsam, P. D. (2016). The Behavioral Neuroscience of Motivation: An
Overview of Concepts, Measures, and Translational Applications (pp. 1-12). Berlin,
Heidelberg: Springer Berlin Heidelberg.
Skjoldager, P., Pierre, P. J., & Mittleman, G. (1993). REINFORCER MAGNITUDE AND
PROGRESSIVE RATIO RESPONDING IN THE RAT - EFFECTS OF INCREASED
![Page 171: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/171.jpg)
156
EFFORT, PREFEEDING, AND EXTINCTION. Learning and Motivation, 24(3), 303-
343. doi: 10.1006/lmot.1993.1019
Valentin, V. V., Dickinson, A., & O'Doherty, J. P. (2007). Determining the Neural Substrates of
Goal-Directed Learning in the Human Brain. The Journal of Neuroscience, 27(15),
4019-4026. doi: 10.1523/jneurosci.0564-07.2007
Wang, S., Shi, Y., & Li, B.-M. (2017). Neural representation of cost–benefit selections in rat
anterior cingulate cortex in self-paced decision making. Neurobiology of Learning and
Memory, 139, 1-10. doi: http://dx.doi.org/10.1016/j.nlm.2016.12.003
West, E. A., DesJardin, J. T., Gale, K., & Malkova, L. (2011). Transient Inactivation of
Orbitofrontal Cortex Blocks Reinforcer Devaluation in Macaques. The Journal of
Neuroscience, 31(42), 15128-15135. doi: 10.1523/jneurosci.3295-11.2011
Winstanley, C. A., & Floresco, S. B. (2016). Deciphering Decision Making: Variation in Animal
Models of Effort- and Uncertainty-Based Choice Reveals Distinct Neural Circuitries
Underlying Core Cognitive Processes. The Journal of Neuroscience, 36(48), 12069-
12079. doi: 10.1523/jneurosci.1713-16.2016
Yin, H. H., Ostlund, S. B., Knowlton, B. J., & Balleine, B. W. (2005). The role of the
dorsomedial striatum in instrumental conditioning. European Journal of Neuroscience,
22(2), 513-523. doi: 10.1111/j.1460-9568.2005.04218.x
![Page 172: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/172.jpg)
157
Chapter 5
Selective Involvement of Striatal Dopamine D2 Receptors in Effort-Based vs
Value-Based Decision Making in Mice
Abstract
Deficits in goal-directed motivation represent a debilitating type of symptom which
impacts many individuals with schizophrenia. Recent studies suggest that patients with
motivation impairments have deficits in both effort and value related information processing
which disrupts effective cost-benefit decision making. Here, we examine a mouse model of
upregulated dopamine D2 receptor activity within the striatum (D2R-OE mice) which results in
impaired motivation. Studies have previously shown that D2R-OE mice display both effort and
value related deficits. Here we specifically explore how effort and value related manipulations
impact cost-benefit decision making in this model. In specific tests of effort based decision
making, we find that the D2R-OE mice are more sensitive to effort manipulations when the cost
is number of responses, but they are unperturbed by effort manipulations of time. When we
examined specific tests of value based decision making, we found that D2R-OE mice are as
sensitive as control mice to reward value changes, and can use these changes in reward value to
guide their choices. We find, however, that the D2R-OE mice may have a blunted increase in
response vigor to reward value, which is likely the result of the same alterations leading to their
aversion to costs requiring the repeated initiation of action.
![Page 173: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/173.jpg)
158
Introduction
Impairments in goal-directed motivation represents a problem experienced by many
individuals with schizophrenia (Faerden et al., 2009; Feil et al., 2003; Kiang et al., 2003; Myin-
Germeys et al., 2000; Roth et al., 2004). Research suggests that the motivation impairments
observed in these patients may be the consequence of alterations in how patients process
information related to cost-benefit decision making. Two distinct types of impairment have been
identified. First, many patients have a difficult time accurately estimating the effort required to
complete tasks in the future (Barch, Deanna M. et al., 2014; Fervaha et al., 2013; Gard et al.,
2009; Gold et al., 2013; Gold et al., 2015; Marin, 1991; Treadway et al., 2015). A second
impairment is difficulty generating accurate estimates of the value of future rewards (Barch, D.
M. et al., 2010; Gold et al., 2012; Gold et al., 2008; Heerey et al., 2008; Kring et al., 2013).
Impairments in processing either effort or value related information could contribute to the
deficits in goal-directed motivation seen in patients. Consequently, there now exists considerable
interest in understanding the neural substrates that underlie effective cost-benefit decision
making in experimental animals.
There have been a large number of studies in rodents which implicate mesolimbic
dopamine signaling in goal-directed behaviors. One critical function of dopamine signaling is
invigorating goal-directed responding. Disruptions to normal mesolimbic dopamine
neurotransmission through cell body lesions or inactivation of NAcc (Bezzina et al., 2008),
neurotoxic lesions of mesoacumbal dopamine terminals (Hamill et al., 1999), or blockade of
dopamine D1 or D2 receptors within the NAcc (Bari et al., 2005) all reduce the vigor of goal-
directed responding. These same manipulations also impact the evaluation of different effort
requirements in tasks which give subjects a choice between a High-Effort/Large Reward and a
![Page 174: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/174.jpg)
159
Low Effort/Small Reward (e.g. 10 presses/4 pellets vs 1 press/1 pellet). Cell body lesions or
inactivation of the NAcc (Ghods-Sharifi et al., 2010; Hauber et al., 2009), neurotoxic lesions of
mesoaccumbal dopamine terminals (Salamone et al., 1999; Salamone et al., 1994), and blockade
of dopamine D1 or D2 receptors within the NAcc (Nowend et al., 2001) all result in a reduction
of High Effort/Large Reward choice.
In line with the results of these pharmacological, lesion, and inactivation manipulations
of mesolimbic dopamine signaling, several different genetic manipulations of dopaminergic
function including targeted dopamine transporter (DAT) and Dopamine D1/D2 receptors
mutations impact behavior in operant tasks requiring the expenditure of effort (Cagniard et al.,
2006). One specific genetic model is the over-expressed Dopamine D2 receptor (D2R-OE)
selectively in the striatum (Kellendonk et al., 2006) created to model the ~12% increase in D2
receptor occupancy observed in a subset of patients with schizophrenia (Breier et al., 1997;
Laruelle et al., 1996; Wong et al., 1986). In this model, it is possible to shut off the over-
expression of the D2R transgene by feeding D2R-OE mice chow that is supplemented with the
tetracycline analogue doxycycline (Dox). This enables one to compare differences between
current over-expression of the D2R, and developmental effects. This D2R-OE mouse model has
been characterized as having a deficit in goal-directed motivation (Drew et al., 2007; Simpson et
al., 2011; Ward et al., 2012). Much like the observations made in patients, D2R-OE mice have
alterations in the assessment or use of information about anticipated effort (Drew et al., 2007;
Simpson et al., 2011; Ward et al., 2012) and value (Ward et al., 2012; Ward et al., 2015).
Here, we first characterize the motivational deficit that results from developmental
overexpression of D2Rs in the striatum by using an effort based choice task. Next we
independently investigate which specific sources of altered cost-benefit decision making
![Page 175: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/175.jpg)
160
contribute to the motivational deficits observed in the D2R-OE mouse. We investigated whether
the deficit in motivated behavior in the D2R-OE mice is due to an altered sensitivity to effort
demands and/or ability to utilize effort information to guide effort-based choice. Next, we
independently assessed whether these mice show a change in sensitivity to reward values and/or
their ability to use reward value information to guide reward-based choice. To independently
examine how Effort and Value manipulations impact cost-benefit decision making in the D2R-
OE mice we use two separate behavioral assays called the Concurrent Effort Choice (CEC) task
and the Concurrent Value Choice (CVC) task. The validation of these novel tasks are described
elsewhere (Chapter 4).
Methods
Subjects
Adult female mice, between 10-12 weeks old at the start of the experiments were F1
hybrids of the C57BL/6J and 129Svev (Tac) background strain. D2R-OE mice overexpress
dopamine D2 receptors selectively in the postsynaptic medium spiny neurons of the striatum, and
littermate mice carrying only one of the transgenes, or neither transgene, were combined and
used as controls (Controls). Mice were bred and maintained as previously described (Drew et al.,
2007; Kellendonk et al., 2006; Simpson et al., 2011). Throughout experiments, all subjects were
maintained at 85% of their ad libitum bodyweight in order to motivate them to work for food
rewards in operant procedures. All experiments and animal care protocols were in accordance
with the Columbia University and NYSPI Institutional Animal Care and Use Committees and
Animal Welfare regulations.
Apparatus
![Page 176: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/176.jpg)
161
Experimental chambers (ENV-307w; Med Associates, St. Albans, VT) equipped with
liquid dippers were used in the experiment. Unless otherwise noted, the apparatus was identical
to that used by Drew and colleagues, (2007). Two retractable levers were mounted on either side
of a feeding trough, and a house light (model 1820; Med Associates) located at the top of the
chamber was used to illuminate the chamber during the sessions. Rewards consisted of
evaporated milk (.01 ml) delivered by raising a dipper located inside the feeder trough.
Behavioral Procedures
Lever Press Training Procedures. All subjects were trained to press levers for milk
rewards using the procedure described by Drew and colleagues (2007). Briefly, mice were first
trained to consume the liquid milk reward from the feeding trough when a dipper containing
milk was presented. To do this, mice were first placed into the operant chamber with the dipper
in the raised position. From the time mice first made a head entry into the feeding trough, the
dipper remained raised for another 10 seconds and then was lowered. Following a variable inter-
trial interval (ITI) which averaged 40 seconds, a new trial began, which was identical to the first
trial. These sessions lasted 30 minutes or until the mice consumed 20 rewards. On the next day,
the dipper began in the lowered position, but was raised simultaneously with a 0.5 sec tone (90
db, 2500 Hz) was played. The dipper then remained in the raised position for 8 seconds and then
was lowered. A variable inter-trial interval (mean= 40 s) was followed by an identical trial
through the remainder of the session. This was done to teach mice to quickly get to the feeding
trough to consume the reward when it was presented. Sessions of this type lasted 30 minutes or
until mice earned 30 rewards. Mice were exposed to sessions of this type until they earned 30
rewards on 2 consecutive days. Once all subjects reached this criterion they were then moved on
to the training phase to learn to make lever presses for milk rewards.
![Page 177: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/177.jpg)
162
In the second phase of basic lever press training mice were taught to press the lever to
earn the rewards. In these continuous reinforcement sessions (CRF), mice would be rewarded
with a 5 second presentation of a dipper following every lever press. At the beginning of the
session the lever extended into the chamber, and remained extended for 2 rewards, then was
retracted. After a variable ITI averaging 40 s, the lever was re-extended and remained so for the
next two rewards. Sessions continued like this for 1 hour or until 40 rewards were earned. Mice
were trained on CRF until they earned 40 rewards on 3 consecutive days, and were then trained
on random ratio (RR) schedules with the ratio value only incremented when they reached this
same criterion. In RR schedule, subjects are required to make some variable number or lever
presses in each trial, with the number required being drawn from a distribution with a given
mean. Subjects were tested in RR05, RR10, and RR20 schedules until they earned 40 rewards on
3 consecutive days. Once this criterion was reached, subjects were then moved on to the lever
hold-down training phase of the experiment.
Lever Hold-Down Training Procedures.
Following initial lever press training identical to the phase 1 procedure described above,
subjects in CEC experiments were trained to hold levers down using a Variable Interval Hold
Down (VIH) schedule. In the VIH schedule, a required hold duration is determined prior to the
start of each trial, and this was the duration of time the subject are required to hold the lever in
the depressed position in order to receive a reward. This hold requirement remained in place until
the subject was reinforced for completing the trial, at which time the next trial’s required hold
duration was randomly determined. Earlier experiments (Chapter 3) determined that when no
hold requirement was in effect lever press durations averaged ~ 0.5 s. During the first VIH
session, the distribution of required hold durations had a mean = 0.5 s; (min = .01 s; max = 2.44
![Page 178: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/178.jpg)
163
s). When a mouse earned 40 rewards on three consecutive days, the required hold durations for
the subsequent session were drawn from an exponential distribution with a higher mean (1 s, 2 s,
3 s, 4 s, 5 s, 8 s, 10 s). Thus, during the final session of VIH training, subjects were required to
hold down the lever for intervals that averaged 10 s, but could be as long as 18.8 s.
Each trial in the VIH schedule followed a similar procedure: At the start of each trial, the
house light was illuminated and a lever was extended. As soon as the mouse depressed the lever,
a timer began counting how long the lever was in the depressed position. This timer stopped and
was reset to 0.0 if the mouse ended the lever press before the required time was reached. If the
lever was depressed as long as the required duration, the trial ended, and the subject received a
reward. A tone (2 s) sounded and the house light was shut off to signal the presentation of the
dipper (5 s). Note that this contingency does not require the subject to time the duration of a
hold. No matter what the criterion duration, the mouse can simply hold the bar down until the
tone and reward are presented without having to track the duration of its hold. Because D2R-OE
show a timing deficit (Drew et al., 2007) it was particularly important in the current experiment
to manipulate effort in a way that did not explicitly depend on timing.
Concurrent Effort Choice Task
All subjects were first trained to press a lever using the basic lever press training
procedure. Subjects were then exposed to increasing Random Ratio schedules on a single lever.
The progression was as follows: 3 consecutive days each on CRF, RR05, RR10, and RR20.
Subjects were next trained to make lever holds on the opposite lever. This was done using the
VIH training program. Subjects were exposed to 3 days of each of the following VIH programs
in increasing order (0.5, 1.0, 2.0, 4.0, 6.0, 8.0, 10.0 sec).
![Page 179: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/179.jpg)
164
Once subjects were trained to make presses on lever A and holds on lever B they were
then moved to the concurrent effort choice schedule. In this schedule the lever subjects were
trained to press on required some fixed number or lever presses to obtain a reward and the lever
subjects were trained to hold required a hold of a fixed duration for a reward. The session
consisted of two separate phases. First, there were 10 forced choice trials in which either the ratio
lever or the hold lever was presented. Subjects then had to complete the requirement on that
lever to obtain the reward. There were 5 of each lever presentations presented in a semi-random
order. Once subjects completed all 10 of the forced choice trials they were then presented with
30 free choice trials. In this phase of the session, subjects were presented with both levers and
they were able to choose which one they worked on to obtain the reward.
In this experiment, we first maintained the hold duration required at a constant value (5
seconds), and varied the press requirement over days. We used the following press requirements
(1, 5, 10, 20, 40, 80, and 160). After running through this progression 2 times, we then made the
hold requirement 10 seconds and ran through the same progression of different press
requirements.
Concurrent Value Choice Task
All subjects were first trained to press a lever on each side of the operant chamber using
the basic lever press training procedure described above and in (Drew, 2007). The lever on one
side of the chamber delivered a liquid sucrose solution, while the opposite lever delivered a
14mg sucrose pellet (Bio-Serv, Frenchtown, NJ). Subjects were then exposed to increasing
Random Ratio schedules on each lever for the different outcomes, receiving one session for a
single outcome per day. The progression was as follows: 3 consecutive days each on CRF,
RR05, RR10, and RR20.
![Page 180: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/180.jpg)
165
Once subjects were trained to make presses on lever A for liquid sucrose and lever
presses on lever B for a sucrose pellet they were then moved to the concurrent value choice
schedule. Each CVC session consisted of two separate phases. First, there were 10 trials which
were single lever forced choice trials in which either the sucrose lever or the pellet lever was
presented. Subjects then had to complete the requirement on that lever to obtain the reward and
gain experience with the cost of each outcome on a given day. There were 5 presentations of
each lever presented in a semi-random order. Once subjects completed all 10 of the forced choice
trials they were then presented with 20 free choice trials in which both levers were extended and
they were able to choose which one they worked on to obtain reward.
Sucrose Concentration Manipulation. In this experiment, the cost of liquid sucrose was
fixed at 5 presses, whereas the cost for a pellet varied over days (10, 20, 40, or 80 presses).
Subjects were first tested in the CVC with 20% liquid sucrose and were exposed to each of the
different pellet costs in ascending order 2 times. Following this, subjects were then tested in the
CVC with a 5% sucrose solution and were exposed to each of the different pellet costs in
ascending order 2 times. The average of the 2 exposures to each of the different pellet costs was
taken for each sucrose concentration condition.
Outcome Devaluation Manipulation. In this experiment, the sucrose concentration was
kept at 5% and the cost of a sucrose reward was fixed at 5 presses. The cost of a pellet varied
over days (10, 20, 40, or 80 presses). Subjects were tested in 3 consecutive weeks in this
experiment. In week 1 (Valued-1), subjects were tested on the CVC schedule as described above.
In week 2 (Devalued), subjects received a 30 minute exposure to pellets prior to the testing
session. In week 3 (valued-2), subjects were tested on the CVC without pre-session exposure to
pellets.
![Page 181: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/181.jpg)
166
Data Analysis
In almost all measures analyzed, there were no significant differences between control
chow and control dox mice, and in such cases these groups were collapsed into a single group to
simplify the presentation of the results. Unless otherwise noted, the data were subjected to an
ANOVA with the factor group comprised of 3 levels: Control (pooling control chow and control
dox mice), D2R-OE Chow, and D2R-OE Dox. Significant effects of group were followed up
with post hoc comparisons to determine if D2R-OE Chow or D2R-OE Dox mice differed from
the control group. Planned ANOVA results are reported in the text, and significant Bonferoni
corrected comparisons are reported in the figures with an asterisk. α was set at 0.05 for all
analyses.
Results
1. Concurrent Lever Pressing/Free Chow Consumption Task
1A. Striatal D2R overexpression reduces willingness to work for preferred rewards when
given a High Effort/Large Reward and Low Effort/Small Reward choice
To investigate cost-benefit decision making in the D2R-OE mice we first used an Effort
Based-Choice Task (EBCT) that was originally described by (Salamone et al., 1991). In this task,
subjects choose between lever pressing for a preferred reward (milk) and consuming a freely
available less preferred reward (home cage chow). We tested mice in this task using increasing
random ratio schedules (RR05, RR10 and RR20) and measured the number of lever presses,
number of rewards earned, and amount of chow consumed as a function of the effort demanded
by the different schedules. Using a repeated measures One-Way ANOVA on control subjects to
test the effect of increasing the work requirement, we observe a significant main effect of the
![Page 182: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/182.jpg)
167
Ratio on responding (F(2,20) = 8.327, p = .0041), as subjects increase their pressing at the higher
ratios, but there is a significant decrease in the number of rewards earned (F(2,20) = 26.58, p <
.0001). To compensate for this, subjects consume more of the freely available chow (F(1.987, 25.83)
= 13.57, p < .0001).
We next examined the effects of both concurrent and developmental D2R over-
expression in the striatum. Because the D2R transgene is activated during embryonic
development (Kellendonk et al., 2006) any phenotypes observed in adult D2R-OE mice
maintained on a regular chow diet (D2R-OE:Chow) could result from the developmental
consequences of striatal D2R overexpression, or concurrent D2R overexpression at the time of
testing. Feeding D2R-OE mice chow that is supplemented with the tetracycline analogue
doxycycline (Dox) shuts of the D2R transgene. Therefore, any difference in phenotype observed
between D2R-OE:Dox mice and controls would be due to only developmental effects of
transgene expression. In this task, D2R-OE mice made fewer lever presses for the preferred
reward (Fig 5.1A), and a 3 × 3 Mixed ANOVA (Factors: Ratio, Group) detected a main effect of
ratio (F(2,58) = 4.253, p = 0.0189), and a main effect of group (F(2,29) = 13.82, p < 0.001). Post-
hoc comparisons found that the main effect of group was due to the D2R-OE chow group (Fig
5.1A) and therefore driven by concurrent not developmental D2R overexpression
We found a similar pattern of results in the number of rewards earned across the different
groups, as the D2R-OE:chow mice earned fewer rewards at all ratios (Fig 5.1B). We again found
a main effect of ratio (F(2,58) = 42.94, p < 0.0001), and a main effect of group (F(2,29) = 15.48, p <
0.001), as well as a significant ratio x group interaction (F(4,58) = 3.996, p = 0.0062), as the
control and D2R-OE:Dox mice showed a much larger drop as a function of the increasing ratio
![Page 183: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/183.jpg)
168
demand. Post hoc comparisons again confirmed that the differences were driven by concurrent
D2R overexpression, the D2R-OE:chow group (Fig 5.1B).
Figure 5.1. D2R-OE mice are less willing to work for preferred rewards. Shows the (A) mean +
(SEM) number of lever presses made, (B) mean + (SEM) number of rewards earned, and (C)
mean + (SEM) chow consumption (g) in the concurrent lever press/free chow consumption task.
While the D2R-OE:chow mice made fewer presses and earned fewer rewards compared
to the other groups, they compensated for this by consuming more of the freely available chow
(Fig 5.1C), as we detected a significant main effect of ratio (F(2,58) = 23.51, p < 0.0001), and a
main effect of group (F(2,29) = 5.469, p = 0.0097), and this difference was again confirmed to be
due to the D2R-OE:chow mice through post hoc comparisons. These data are consistent with an
earlier report of D2R-OE mice having altered cost-benefit decision making (Ward et al., 2012)
under a single effort condition (RR20). When D2R are normalized motivation increases to the
level exhibited by controls. The deficit and subsequent rescue may be mediated by differences in
processing effort and/or differences in processing value. The next experiments examine which of
these processes is affected by D2R overexpression.
2. Concurrent Effort Choice Task
![Page 184: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/184.jpg)
169
2A. Striatal D2R overexpression alters sensitivity to effort associated with the repeated
imitation of action
In order to determine if the D2R-OE and control mice have a differential sensitivity to
effort requirements when engaging in cost-benefit decision making we tested them in the
Concurrent Effort Choice (CEC) task, as this task measures sensitivity to differential effort
without the confound of multiple reward values (Bailey et al., 2017, CEC/CVC paper). Subjects
are trained to perform two different types of work to earn the same reward. First, subjects
learned to earn milk rewards by making lever presses on one side of the chamber, and
subsequently learn to earn milk rewards by making lever holds (maintaining the lever in the
depresses position for a required duration) on the other lever. After subjects have learned these
two distinct types of responding they are tested in the CEC task. In the CEC task, subjects are
exposed to each day’s press and hold requirements on each of the different levers by completing
10 single lever trials (5 press trials and 5 hold trials) at the beginning of each session, followed
by 30 choice trials in which both levers are presented and subjects can choose which lever to
work on to earn rewards.
We first examined sensitivity to the different effort manipulations by analyzing the
proportion of hold choices subjects made during the choice trials, where a hold choice was
defined as a trial in which a subject earned the reward through holding the lever down. Across 3
different hold requirement durations (5s, 10s, and 20s), the D2R-OE mice showed a strong bias
towards holding to earn rewards as compared to controls (Fig 5.2A). The results of a 6 × 3 × 3
mixed ANOVA (Factors: Ratio Requirement, Hold Duration, Group) confirmed that subjects
were sensitive to the number cost, indicated by a main effect of Ratio Requirement (F(5,8) =
153.399, p < .0001), as well as the time cost (F(2,8) = 37.645, p < .0001). We also found a
![Page 185: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/185.jpg)
170
significant main effect of group (F(2,27) = 31.19, p < .0001), and a significant group x ratio x hold
requirement interaction (F(1,8) = 3.27, p < .0001). This 3 way interaction appeared to be due to the
fact that control and D2R-OE:Dox mice shifted more as the hold duration increased, but this
shifting was much less in the D2R-OE:chow mice. We performed post hoc comparisons on the
D2R-OE chow vs control group and the D2R-OE dox vs control group. The D2R-OE chow mice
clearly differed from controls, as we detected a significant main effect of Genotype (F (1, 20) =
94.122, p < 0.001), Hold Duration (F (2, 20) = 88.787, p < 0.001), Press Requirement (F (5, 20) =
257.786, p < 0.001), and a significant Hold x Press x Geno interaction (F (10, 26) = 7.250, p <
0.001). The D2R-OE dox mice also appeared to differ from controls, as we found a significant
main effect of Genotype (F (1, 20) = 5.267, p = 0.033), Hold Duration (F (2, 20) = 137.596, p <
0.001), Press Requirement (F (5, 20) = 294.716, p < 0.001), and significant Press Requirement x
Genotype interaction (F (5, 26) = 5.875, p < 0.001).
We further characterized each subject’s sensitivity to the different effort requirements by
calculating the point of subjective equality (PSE) between pressing and holding, defined as the
number of presses at which each subject would chose holding for a given duration and lever
pressing an equal percentage of the time. The analysis of the PSE also reflected the D2R-OE bias
for holding versus pressing (Fig 5.2B), as a 3 × 3 Mixed ANOVA (Factors: Hold Requirement,
Group) detected a significant main effect of group (F (2,25) = 9.015, p = .0011), along with a
significant main effect of the hold duration (F (2,50) = 15.29, p < .0001), and a significant group x
hold duration interaction (F (4,50) = 5.481, p = .0010), as both control and D2R-OE dox mice
show increases in the PSE as the hold duration increases whereas the D2R-OE chow mice do
not. Post hoc comparisons confirm that this difference is driven by the D2R-OE chow mice (Fig
5.2B), and in this measure there are no differences between the D2R-OE dox and control mice.
![Page 186: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/186.jpg)
171
Figure 5.2: D2R-OE mice have altered effort sensitivity in a CEC task. Shows the (A) mean +
(SEM) proportion of hold choices for control (left) D2R-OE chow (center) and D2R-OE dox
(right) mice in 3 different hold duration conditions in the CEC task. Shows (B) the mean +
(SEM) point of subjective equality (PSE) in the 3 different hold duration conditions.
2C. Striatal D2R overexpression increases the time needed to complete lever press, but not
lever hold requirements
We next wanted to see if we could find any differences in the patterns of behavior when
subjects were responding on these two different types of work which could provide insight into
why the D2R-OE mice showed a strong bias toward holding compared to controls. To examine
behavior at the different effort demands for lever pressing (5, 10, 20, 40, 80, 160 presses) and
lever holding (5, 10, 20s) we analyzed behavior in the single lever trials at the beginning of each
session as this provided us with an equal number of trials for each subject across the different
lever press and lever hold requirements. For each trial type (Press vs Hold) we broke the
![Page 187: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/187.jpg)
172
behavior into two components: latency (duration from a lever being inserted at the start of a trial
and the time of the first response) and work time (duration which elapsed between the first
response in a trial and when the requirement was completed) see (Fig 5.3A, Press Trial) and (Fig
3B, Hold Trial). In the Press Trials, we found that at the highest ratio requirements (80 and 160
presses), the proportion of D2R-OE mice who quit responding for 3 minutes, defined as an “opt
out”, was significantly higher than controls, so we examined the work times at the lower ratios.
We first looked at the latencies to press in the lower ratios (5 – 40 presses) and found that
the D2R-OE chow mice had much longer latencies than the other groups (Fig 5.3C). We
analyzed this difference by first fitting each subjects latency to press as a function of the press
requirements with the simple linear function, y = a + b × n, in order to be able to compare the
slope (b) of how much latency increased as a function of the ratio requirement (n), as well as the
intercept (a), and then using a one way ANOVA (Factor: Group) to compare differences in the
distribution of these parameters between the groups. We found a significant difference in the
slope (Fig 5.3D left; F (2, 27) = 3.797, p = 0.0352), as well as the intercept (Fig 5.3D right; F (2, 27)
= 32.31, p < 0.0001) for latency to press. When we looked at the latency to hold, as a function of
the hold duration, (Fig 5.3G), as there was no difference in the slope between the groups (Fig
5.3H, left), but the D2R-OE chow mice had significantly larger intercepts (Fig 5.3H right; F (2, 27)
= 11.06, p = 0.0003). We next looked at the worktimes for the two different trial types. D2R-OE
chow mice took much longer to complete even these low press requirements of 5 – 40 presses
(Fig 5.3E), as we found they had a significantly steeper slope (Fig 5.3F, left; F (2, 27) = 15.31, p <
0.0001), but no difference in the intercepts (Fig 5.3F, right). Finally, we looked at the hold
worktimes (Fig 5.3I), but there were no differences between the groups in either the slope (Fig
5.3J, left) or the intercepts (Fig 5.3J, right).
![Page 188: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/188.jpg)
173
Figure 5.3: D2R-OE mice have deficits when repeatedly executing actions. (A-B) Shows a raster
plot example of lever pressing (A) and lever holding (B) behavior in the single lever trials in the
CEC task by indicating the time lever presses or holds were initiated and maintained (black
dashes) and the time rewards were earned (red triangle). Both the latency (blue line), and work
time (green line) are indicated. (C) Shows the mean latency to begin press and linear fits + (95%
CI). (D) Shows the mean + (SEM) for the (left) slope parameter, b, and (right) intercept
parameter, a, of the linear fits for latency to press. (E) Shows the mean latency to hold and linear
fits + (95% CI). (F) Shows the mean + (SEM) for the (left) slope parameter, b, and (right)
intercept parameter, a, from the linear fits for latency to hold. (G) Shows the mean work time in
the press trials and linear fits + (95% CI). (H) Shows the mean + (SEM) for the (left) slope
parameter, b, and (right) intercept parameter, a, from the linear fits for work times in the press
trials. (I) Shows the mean work time in the hold trials and linear fits + (95% CI). (J) Shows the
mean + (SEM) for the (left) slope parameter, b, and (right) intercept parameter, a, from the linear
fits for work times in the hold trials.
2D. Striatal D2R overexpression Alters the Temporal Dynamics of Lever Pressing
![Page 189: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/189.jpg)
174
The analysis of work times and latencies in the single lever trials revealed that the D2R-
OE mice are much slower to complete ratios requirements when lever pressing, but are equally
efficient at completing hold requirements. We further explored differences between D2R-OE
mice and controls while they were actually engaging in the action of lever pressing. We first
analyzed the durations of each individual lever press (Fig 5.4A), as well as each inter-response-
times (IRTs) or times between consecutive lever presses within each bout (Fig 5.4D). We found
that the D2R-OE chow mice took more time to execute each single lever press (Fig 5.4B), but
this did not change significantly as a function of the press requirement. We therefor calculated
each individual subjects average response duration to complete a lever press across all press
requirements and found that the D2R-OE chow mice had significantly longer response durations
than either control or D2R-OE dox mice (Fig 5.4C; F (2, 115) = 65.3, p < 0.0001).
Figure 5.4. D2R-OE mice have altered temporal dynamics of lever pressing. (A) Shows a
schematic representation of a press record during an FR10 trial in the single lever press trials of
the CEC task. Press Record indicates the time at which presses where made (black boxes), the
Response durations are the time a single lever press lasted (light red boxes). (B) Shows the mean
+ (SEM) response durations in the press trials at different press requirements for the 3 different
groups of mice. (C) Shows subjects mean + (SEM) response duration averaged across all sessions
and trials. (D) Shows a schematic representation of a press record during an FR10 trial in the
single lever press trials of the CEC task. The IRTs are the time occurring between consecutive
lever presses (light green boxes). (E) Shows the mean + (SEM) IRTs in the press trials at
![Page 190: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/190.jpg)
175
different press requirements for the 3 different groups of mice. (F) Shows subjects mean + (SEM)
IRTs averaged across all sessions and trials.
Additionally, the D2R-OE chow mice also had longer IRTs (Fig 5.4E), but this wasn’t
significantly related to the press requirement so we averaged across press requirements. We
found that D2R-OE chow mice had slower inter-response times compared to control and D2R-
OE dox mice (Fig 5.4F; F (2, 115) = 58.48, p < 0.0001).
To summarize the findings from our analysis of the behavior while working on the two
different work options of lever pressing and lever holding, we found that D2R-OE chow mice are
equally adept at completing lever hold requirements as control mice. Their ability to lever press,
on the other hand, appears to differ from controls in ways which require them to spend increased
amount of time working to complete lever press ratios. Specifically, the D2R-OE chow mice do
not chunk as many responses together into a single bout and consequently, make more pauses
than controls. It is conceivable that the extra effort of making long presses slows the initiation of
each response. Taken together, these results fit with the finding in the CEC choice trials that
D2R-OE mice find repeatedly executing a response to be more demanding than making a single
sustained response relative to controls. The impact on effort and repeatedly initiating responses
appears largely normalized in D2R-OE dox treated mice, suggesting that the current over-
expression of D2R’s at the time of testing is responsible for the effort based deficits.
3. Progressive Ratio and Progressive Hold Down
The results of the CEC task suggest that the D2R-OE mice are differentially sensitive to
the response cost of Number (repeated initiation of actions) as opposed to Time (maintaining a
single action). To directly test this, we exposed the D2R-OE mice to two different behavioral
tests with increasing effort demands. In the Progressive Ratio (PR), an increasing number of
![Page 191: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/191.jpg)
176
responses are required to obtain each subsequent reward. In the Progressive Hold Down (PHD),
an increasing duration a single response is required in order to earn each subsequent reward.
Because all the differences in EBCT were found to be due to concurrent expression of the D2R,
these experiments were limited to D2R-OE: chow and control mice.
Figure 5.5. D2R-OE mice quit working sooner than controls with a response number cost. (A)
Shows a survival function of how long mice continued lever pressing in a PR before quitting for a
period of 5 minutes without a single press. (B) Shows the mean + (SEM) number of lever presses
in a PR x 2 schedule of reinforcement. (C) Shows the mean + (SEM) breakpoint in a PR x 2
schedule of reinforcement.
3A. Striatal D2R overexpression causes subjects to quit working sooner with an increasing
cost of number of responses
We tested subjects in a PR x 2 schedule in which the number of responses required to
obtain each subsequent reward was multiplied by 2. With this increasing effort demand D2R-OE
mice quit working more quickly than controls (Fig 5.5A). A Mantel-Cox Log-rank analysis of
the survival function confirmed that D2R-OE mice quit working earlier than controls (ᵡ2 = 8.606,
p = .0034). Additionally, the D2R-OE mice made fewer lever presses (F(3,108) = 285.8, p < .001;
Fig 5.5B), and had a significantly lower breakpoint than controls (F(3,108) = 285.8, p < .001; Fig
5.5C), replicating previously reported findings (Drew et al., 2007; Simpson et al., 2011).
![Page 192: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/192.jpg)
177
3B. Striatal D2R overexpression doesn’t alter how long subjects continue working when
duration of an action is the increasing cost
We next tested D2R-OE mice on two different schedules of a PHD task (Easy vs Hard),
which requires subjects to make lever holds of increasing durations to obtain each subsequent
reward. In an Easy version of the PHD task, the hold requirement was determined by the
function y = 2 × 1.13^ (n-1), where n was the trial number, and in a Hard version of the task the
hold requirement was determined by the function y = 4 × 1.2^ (n-1). Unlike what was observed in
the PR, there was no difference in how long D2R-OE mice and controls continued working in
the PHD task before giving up (Fig 5.6A), as the survival functions were not different in the
Easy (ᵡ2 = 0.097, p = .7555) or Hard (ᵡ2 = 0.2996, p = .5841) PHD schedules. While the difficulty
of the PHD schedule had a significant main effect on number of hold attempts (F(1,13) = 17.76, p
= 0.0010), there was no difference between D2R-OE mice and controls (Fig 5.6B). In contrast to
what was observed for the breakpoint in the PR test, the D2R-OE mice had a significantly higher
breakpoint in the PHD task on both schedules (Fig 5.6C) than controls, as there was a main
effect of both schedule (F(1,13) = 13.36, p = 0.0029) and genotype on the breakpoint (F(1,13) =
10.67, p = 0.0061).
Figure 5.6. D2R-OE mice and controls maintain equal persistence with a response duration cost.
(A) Shows a survival function of how long mice continued lever holding in a PHD task in an
Easy (left) and Hard (right) schedule before quitting for a period of 5 minutes without a single
press. (B) Shows the mean + (SEM) number of lever holds in 2 different PHD schedules of
![Page 193: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/193.jpg)
178
reinforcement. (C) Shows the mean + (SEM) breakpoint in 2 different PHD schedules of
reinforcement.
4. Concurrent Value Choice Task
4A. Reward Value Manipulation Effects on Value-Based Choice
The results from the CEC task demonstrate that the D2R-OE mice have differential
sensitivity to increases in the number of responses required to earn rewards relative to controls,
which could explain the differences in cost-benefit decision making observed in the EBCT.
Because previous studies have reported findings to suggest that D2R-OE mice have an impaired
ability to use reward value information to guide their behavior (Ward, 2012; Ward, 2015), we
wanted to directly determine whether the D2R-OE mice were sensitive to changes in reward
value. To this end, we tested both concurrent and developmental effects of D2R over-expression
in the Concurrent Value Choice (CVC) task. In this task, subjects first learn to work for two
different outcomes (liquid sucrose vs pellets) by pressing on levers on opposite sides of an
operant chamber. In the CVC task, subjects work for these two different reward outcomes at
different costs. Specifically, mice can always earn a drop of liquid sucrose by making 5 lever
presses, or they can obtain a pellet by pressing a given number of times (5, 10, 20, 40, or 80
time), depending on the day. Subjects first experience 10 single lever trials (5 Sucrose and 5
Pellet) to learn about the costs of each reward on a given day, and are then given 20 choice trials
in which both levers are presented and they can choose which outcome to work for.
4A.1. Striatal D2R overexpression does not fully disrupt sensitivity to sucrose
concentration shifts when making value-based choices
We used two different reward magnitude manipulations to examine the D2R-OE’s
sensitivity to changes in reward value in the CVC task. The first involved manipulating the
![Page 194: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/194.jpg)
179
concentration of the liquid sucrose (5 vs 20%) to see how this influenced subject’s choice
between liquid sucrose and pellets.
Figure 5.7. Reward magnitude manipulation effects on choice in the CVC task. (A) Shows the
main effect of Pellet Cost on the mean + (SEM) for proportion of sucrose choices in the CVC
task. (B) Shows the main effect of Sucrose Concentration on the mean + (SEM) for proportion of
sucrose choices in the CVC task. (C) Shows the pellet cost x sucrose concentration interaction on
the mean + (SEM) for proportion of sucrose choices in the CVC task.
In this first experiment, we examined the proportion of sucrose choices subjects made in
the choice trials across the two sucrose concentration conditions. Analysis with a mixed ANOVA
(Factors: Pellet Cost, Sucrose Concentration, and Group), confirmed that the proportion of
sucrose choices was significantly impacted by the Pellet Cost (Fig 5.7A; F(4, 28) = 149.545, p <
.001), as well as the sucrose concentration (Fig 5.7B; F(1, 28) = 295.294, p < .001). The Sucrose
Concentration shifted the entire choice function equally across all of the levels of the Pellet Cost
as there was no Pellet Cost x Sucrose Choice interaction (Fig 5.7C; F(4, 28) = 9.665, p < .0001).
We next specifically examined the D2R-OE mice to see whether they differed from
controls. While we found no overall main effect of group, (Fig 5.8A; F(2, 24) = 1.317 , p = 0.287),
there was a group × Pellet Costs interaction (F(2, 28) = 2.550, p = .0098). Post hoc comparisons
between the D2R-OE chow mice and controls detected a significant genotype × pellet cost
![Page 195: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/195.jpg)
180
interaction (F(2, 28) = 5.262, p = .0004), whereas this was not the case between D2R-Dox mice and
controls.
Figure 5.8. D2R-OE mice are sensitive to reward magnitude in a CVC task. (A) Shows the mean +
(SEM) proportion of sucrose choices in the CVC task for controls (left), D2R-OE chow (center), and
D2R-OE dox (right) in 2 different sucrose concentration conditions. (B) Shows the mean + (SEM) point
of subjective equality (PSE) in the 2 different sucrose concentration conditions.
To further analyze the shift in choice resulting from the change in sucrose concentration
we determined the PSE for each subject, defined as the Pellet Cost (presses) at which subjects
chose between the sucrose and pellet outcome on an equal number of trials. Both genotypes had
higher PSE values for the 5 vs 20% sucrose concentration (Fig 5.8B), and a 2 × 3 mixed
ANOVA (factors: Sucrose Concentration, Group) detected a significant main effect of sucrose
concentration (F(1, 11) = 17.741, p = .0001), but there was no main effect of group or sucrose
concentration x group interaction. Post hoc comparisons revealed that the only group with a
significantly different PSE between the 5 and 20% sucrose was the vehicle group (Fig 5.8B).
4A.2 Striatal D2R overexpression doesn’t impair sensitivity to reward devaluation when
making value-based choices
While the results in the sucrose concentration CVC experiment showed that D2R-OE
chow mice can shift their behavior in response to a reward value change, the PSE between the 2
sucrose concentrations for these two groups was not significantly different. This could mean the
![Page 196: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/196.jpg)
181
D2R-OE chow mice are less sensitive to reward value changes, but one could also imagine that
the increasing number of presses used as the pellet cost impacted these mice as we have shown
an effort sensitivity difference. Therefore, we wanted another way to test whether D2R-OE mice
were sensitive to reward value.
Figure 5.9. D2R-OE mice are sensitive to reward devaluation in a CVC task. (A) Shows the
mean + (SEM) proportion of sucrose choices for controls (left) and D2R-OE chow mice (right) in
the CVC task in 2 different devaluation conditions. (B) Shows the mean + (SEM) point of
subjective equality (PSE) for subjects in the 2 different devaluation conditions.
In a second experiment to test the D2R-OE’s value based decision making, we
manipulated the reward value of the pellet outcome by devaluing this outcome by giving subjects
30 minutes of free access to pellets prior to the testing session, and compared the choice behavior
between the Valued (no pre-feed) and Devalued (30 minute pre-feed) conditions. This
devaluation procedure had an impact on the proportion of sucrose choices made in the choice
trials (Fig 5.9A). A mixed ANOVA (Factors: Pellet Cost, Value Condition, Genotype) found a
significant main effect of Pellet Cost (F (3, 77) = 15.478, p < 0.0001), and a significant main effect
of Devaluation (F (2, 77) = 44.907, p < 0.0001). In this experiment there was no main effect of
Genotype (F (2, 77) = 3.103, p = 0.106). We determined the PSE for subjects in the valued vs
devalued condition (Fig 5.9B). A mixed ANOVA (Factors: Genotype, Value Condition),
![Page 197: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/197.jpg)
182
detected a significant main effect of Value Condition (F (1, 12) = 23.455, p < 0.001). While there
was a trend towards a Genotype difference (Fig 5.9B) this was not statistically significant (F (1, 12)
= 4.165, p = 0.0639), and there was no Value Condition × Genotype interaction (F (1, 12) = 0.652,
p = 0.435). Post hoc comparisons found that difference in PSE between the value and devalued
condition to be significantly different in both the control (t(12) = 3.996; p = 0.0036) and D2R-OE
mice (t(12) = 2.853; p = 0.0291).
Discussion
The present set of studies examines the impact of striatal D2 receptor over-expression on
effort-based vs value-based decision making. There have been several reports that the motivation
deficits observed in patients with schizophrenia may be due to differences in how both effort and
value related aspects of cost-benefit decision making. Specifically, patients are less willing to
make choices of high effort for large reward and instead make more low effort choices for small
reward in a number of different conditions (Barch et al., 2014; Fervaha et al., 2013; Gold et al.,
2013; Treadway, Michael T. et al., 2015). Moreover, patients have difficulty using estimates of
future reward value to guide their behavior (Cohen & Minor, 2010; Llerena et al., 2012;
Oorschot et al., 2013). Here we examine the impact of D2 receptor overexpression in the
striatum to see if effort or value aspects of cost-benefit decision making are impacted.
Using a commonly employed assay of effort-related choice (Salamone, 1991), we find
that over-expression of the D2R within the striatum alters the willingness to work for a preferred
reward. We observed the same pattern of results of decreased pressing for reward and increased
chow consumption that is produced with D2R blockade with various D2 antagonists (Salamone,
2007) and dopamine depletion following neurotoxic lesions of dopaminergic terminals in the
NAcc (Salamone, 2012). This decreased willingness to work for preferred rewards is due to
![Page 198: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/198.jpg)
183
current levels of the D2R, as this impairment is reversed when the over-expression of the D2R is
restored to normal following treatment with Dox. Because this assay requires subjects to choose
between lever pressing (high effort) and forgoing pressing (low effort) in order to obtain milk
(large value) or home cage chow (low value), it is possible that the deficit we observed in the
D2R-OE mice could result from differences in sensitivity to effort demands, different sensitivity
to reward value, or both. We examined effort and reward related decision making processes
separately to identify which is altered by striatal D2R over-expression.
Striatal D2R overexpression alters the sensitivity to the effort demands of NUMBER vs
TIME
One potential explanation for the decreased willingness to work in the EBCT observed in
mice with striatal D2R over expression is that they find the act of lever pressing to be more
effortful than control mice do. Testing this hypothesis requires the non-trivial task of determining
how difficult a mouse judges 1 vs 5 vs 10 lever presses compared to another subject. We were
able to generate subject’s “perceived judgements of effort”, by giving them an option of either
lever pressing or lever holding to obtain rewards. Using the CEC task, we were able to generate
effort choice functions by manipulating the work requirement of the two work options (number
of lever presses vs duration of hold) to see how subjects scaled the effort of one type of work
against the other. Importantly, in this task the reward outcome was the same for the two work
options, so the only variable manipulated was the effort on each work option.
Our analysis of these effort choice functions revealed that striatal D2R over-expression
resulted in differential sensitivity to the cost of response number (having to repeatedly initiate
actions) as these mice strongly preferred the cost of time (maintaining a single sustained action)
to the repetitive responding involved in lever pressing. This differential sensitivity to effort
![Page 199: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/199.jpg)
184
demands of number was largely normalized in Dox treated D2R-OE mice as their preference
functions differed only from controls at the longest (20s) hold duration.
In an attempt to gain insight into the large difference in preference for holding over
pressing we observed in the D2R-OE mice, we closely examined several aspects of response
patterns when subjects were working in the two different work conditions. Whereas it took D2R-
OE mice longer to complete lever press requirements, they were equally adept at completing the
lever hold requirements. An even closer look at the lever pressing behavior revealed that it took
D2R-OE mice longer to execute each individual response (response duration), and it took them
more time from the termination of a lever press to the initiation of a new press (IRTs).
Interestingly, these temporal dynamics of responding were normalized in D2R-OE Dox treated
mice, which suggests that the current over-expression of the D2R slows responding and this
likely explains why these mice judge repeatedly initiating lever press responses to be more
effortful. Consistent with this inference, D2R-OE mice quit working sooner than controls in a
progressive ratio which requires increasing number of responses for each reward, but worked just
as long as controls, earning even more rewards, when tested in a progressive hold down which
requires subsequent lever holds to maintained for longer durations for each subsequent reward.
Does Striatal D2R signaling modulate the fluidity of executing repeated actions?
The present results demonstrate that striatal D2R over-expression slows the execution of
individual lever presses (response duration) as well as the time it takes to initiate a new lever
press (IRTs), and this results in subjects judging repeatedly executing lever presses to be more
effortful than maintaining a single response. One potential explanation for this result is that
manipulation of D2 receptors signaling alters the ease or readiness with which subjects can
repeatedly initiate new actions. Numerous studies have reported that D2R antagonists and 6-
![Page 200: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/200.jpg)
185
OHDA induced DA depletion produce a reduction in operant responding, and the impairment
observed are most pronounced at higher ratio requirements (Domenger & Schwarting, 2006;
Salamone et al., 1997). Stated differently, the reduction in behavior seen with DA depletion and
D2R antagonists increases as a function of the number of times subjects must repeatedly initiate
responses. This same pattern of larger decrements in responding at higher ratios is highly similar
to what is observed in the D2R-OE mice (Simpson, 2011).
Another line of evidence which supports the notion that D2R signaling influences the
ease of repeatedly initiating responses comes from a recent study which used the Gαi – coupled
designer receptor hM4D in D2R expressing neurons in the striatum to modulate the activity of
D2 expressing medium spiny neurons in D2R-OE mice. This study found that at baseline and in
vehicle treated conditions, D2R-OE mice make fewer lever presses and quit sooner in a
progressive ratio task, but inhibiting the activity of D2R expressing neurons through treatment
with clozapine-N-oxide (CNO) resulted in large increases in both the number of lever presses
subjects made, as well as how long subjects continued working (Carvalho et al., 2016). An
interesting question is whether this hM4D Gαi induced inhibition of D2-expressing cells, which
greatly increases the number of responses subjects can make per unit time, would also cause a
shift in effort choice, which is in the opposite direction that we observed in the D2R-OE mice.
Moreover, considered in the context of the present set of studies, it would be interesting to know
if any of the wide range of pharmacological manipulations which cause large increases in
response initiation (Amphetamine, Cocaine, etc.), or manipulations which greatly reduce
behavioral output (Haloperidol, etc.) would all impact effort choice judgements in the same
manner that was observed with D2R-OE mice.
![Page 201: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/201.jpg)
186
In addition to the finding that D2R-OE mice are less inclined to initiate repeated
responses, we also found that D2R-OE mice are equally willing to work compared to controls
when the response is maintaining a single hold. In fact, these mice are actually more efficient at
this type of response. Various lines of evidence exist which suggest that the D2 receptor plays an
important role in regulating the duration of actions. For example, D2 antagonists increase the
duration of time that a rat’s paw remains in contact with a lever when lever pressing (Fowler &
Liou, 1994), which aligns with our finding that the duration of individual lever press responses
made by the D2R-OE mice last longer across large ranges of ratio requirements. Other studies
have reported D2 receptor antagonists increasing the duration of several different actions such as
increased latencies to retrieve a reward after it has been earned (Fowler & Liou, 1994, 1998;
Horvitz & Eyny, 2000), and increased durations of head pokes into a feeding compartment in a
cue Pavlovian as well as operant conditioned procedure (Choi, 2009). One study performed
careful analysis of all of the different behaviors rats engaged in during an operant lever pressing
task and found D2R antagonists even increased the durations of time subjects spent immobile
and grooming themselves, in addition to longer latencies in reward related operant responses
(Nicola, 2010). Importantly, in all of these procedures, the longer duration of the different types
of responses either neutrally or negatively impacts performance. When maintaining an action
benefits a subject, as in the PHD task, the D2R-OE mice seem to perform more effectively than
controls and do not give up as easily as when repeated responses are required of them.
Striatal D2R Overexpression Does Not Fully Disrupt Value Related Decision Making
We next explored whether D2R-OE mice are impaired in the ability to anticipate and
process information about reward values. Previous evidence suggested that the D2R-OE mice
might have an impairment in processing information about the value of future rewards. One
![Page 202: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/202.jpg)
187
previous study (Ward et al., 2012) tested these mice using two different Variable Interval (VI)
schedules, which reward lever presses at differential rates (i.e. VI 20s vs VI 120s produce reward
after an average of 20 and 120s; respectively). Unlike control mice who will match the
percentage of responses allocated to each schedule to the percentage of rewards earned on that
schedule, the D2R-OE mice show relatively undifferentiated rates of responding on the two
different levers. In another study (Ward et al., 2015) showed that control mice became more
accurate in a vigilance task when a signal indicated a higher likelihood of payoff than when there
was a lower likelihood of reward. Accuracy of D2R-OE mice, on the other hand, was not
affected by signaling payoff probability. These two findings suggest a relative insensitivity of
D2R-OE mice to the value of future reward outcomes.
In contrast to these previous findings, our assays of value-based choice found that D2R-
OE mice were sensitive to two different kinds of reward value shifts, and their choice behavior
was adjusted in a manner that suggested they were able to use these value changes to guide their
decision making. Whether their sensitivity was blunted relative to controls remains worth further
consideration, but given the fact that we relied on increasing costs of the pellet reward to
generate a choice function it seems likely that the effort sensitivity difference likely confounds
this measure of reward value to some extent. An interesting future experiment may entail using
lever holding as the work type in a CVC task as D2R-OE mice are eqaully willing to work when
holding compared to controls. This would directly test the hypothesis that the reward value shift
was confounded by the effort required to obtain the pellet outcome. Considered against previous
studies of reward related behavior (Ward et al., 2012; Ward et al, 2015), it seems that
considering reward value sensitivity to include sensitivity to reward magnitude (sucrose
concentration), current reward value (devaluation), rates of reward payoff (2 different VI
![Page 203: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/203.jpg)
188
schedules) and probability of future rewards (probability cues signaling reward likelihood) seems
to be an unproductive generalization and warrant further investigation to understand why these
later processes are disrupted in the D2R-OE mice.
Conclusions and Implications
Impairments in motivation remains a problematic symptom experienced by many
individuals with schizophrenia. Studies in these patients have found evidence that these
motivation deficits may be due to either impairments in effort related (Barch et al., 2014;
Fervaha et al., 2013; Gold et al., 2013; Treadway, Michael T. et al., 2015) and value related
(Barch, Deanna M. et al., 2010; Gold et al., 2012; Gold et al., 2008; Heerey et al., 2008; Kring et
al., 2013) decision making processes. An interesting question relevant to future strategies for
developing new pharmacological treatments is whether these effort and value related
impairments both occur in the same individual, or whether these deficits are found in distinct
subsets of patients. In our present results, we have demonstrated that in mice, overexpression of
the D2R at the time of testing specifically impacts effort-based decision making, but leaves value
based decision making intact. Future studies employing a similar strategy of specifically
isolating effort and value variables will able to further elucidate the neural mechanisms
underlying effort and value related processes in finer temporal and neurobiological detail, as the
potential for cell type and circuit specific manipulations is now readily available in rodent
models.
![Page 204: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/204.jpg)
189
References
Barch, D. M., & Dowd, E. C. (2010). Goal representations and motivational drive in
schizophrenia: the role of prefrontal-striatal interactions. Schizophr Bull, 36(5), 919-934.
Barch, D. M., Treadway, M. T., & Schoen, N. (2014). Effort, anhedonia, and function in
schizophrenia: Reduced effort allocation predicts amotivation and functional impairment.
Journal of Abnormal Psychology, 123(2), 387-397.
Bari, A. A., & Pierce, R. C. (2005). D1-like and D2 dopamine receptor antagonists administered
into the shell subregion of the rat nucleus accumbens decrease cocaine, but not food,
reinforcement. Neuroscience, 135(3), 959-968.
Bezzina, G., Body, S., Cheung, T. H. C., Hampson, C. L., Bradshaw, C. M., Szabadi, E., . . .
Deakin, J. F. W. (2008). Effect of disconnecting the orbital prefrontal cortex from the
nucleus accumbens core on inter-temporal choice behaviour: A quantitative analysis.
Behavioural Brain Research, 191(2), 272-279.
Breier, A., Su, T. P., Saunders, R., Carson, R. E., Kolachana, B. S., de Bartolomeis, A., . . .
Pickar, D. (1997). Schizophrenia is associated with elevated amphetamine-induced
synaptic dopamine concentrations: Evidence from a novel positron emission tomography
method. Proceedings of the National Academy of Sciences of the United States of
America, 94(6), 2569-2574.
Cagniard, B., Balsam, P. D., Brunner, D., & Zhuang, X. X. (2006). Mice with chronically
elevated dopamine exhibit enhanced motivation, but not learning, for a food reward.
Neuropsychopharmacology, 31(7), 1362-1370.
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2007). Transient Overexpression of Striatal D2 Receptors Impairs
Operant Motivation and Interval Timing. The Journal of Neuroscience, 27(29), 7731-
7739.
Faerden, A., Vaskinn, A., Finset, A., Agartz, I., Ann Barrett, E., Friis, S., . . . Melle, I. (2009).
Apathy is associated with executive functioning in first episode psychosis. BMC
Psychiatry, 9, 1.
Feil, D., Razani, J., Boone, K., & Lesser, I. (2003). Apathy and cognitive performance in older
adults with depression. Int J Geriatr Psychiatry, 18(6), 479-485.
Fervaha, G., Foussias, G., Agid, O., & Remington, G. (2013). Amotivation and functional
outcomes in early schizophrenia. Psychiatry Res, 210(2), 665-668.
Gard, D. E., Fisher, M., Garrett, C., Genevsky, A., & Vinogradov, S. (2009). Motivation and its
Relationship to Neurocognition, Social Cognition, and Functional Outcome in
Schizophrenia. Schizophrenia Research, 115(1), 74-81.
Ghods-Sharifi, S., & Floresco, S. B. (2010). Differential Effects on Effort Discounting Induced
by Inactivations of the Nucleus Accumbens Core or Shell. Behavioral Neuroscience,
124(2), 179-191.
![Page 205: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/205.jpg)
190
Gold, J. M., Strauss, G. P., Waltz, J. A., Robinson, B. M., Brown, J. K., & Frank, M. J. (2013).
Negative Symptoms of Schizophrenia Are Associated with Abnormal Effort-Cost
Computations. Biological Psychiatry, 74(2), 130-136.
Gold, J. M., Waltz, J. A., & Frank, M. J. (2015). Effort cost computation in schizophrenia: a
commentary on the recent literature. Biol Psychiatry, 78(11), 747-753.
Gold, J. M., Waltz, J. A., Matveeva, T. M., & et al. (2012). Negative symptoms and the failure to
represent the expected reward value of actions: Behavioral and computational modeling
evidence. Archives of General Psychiatry, 69(2), 129-138.
Gold, J. M., Waltz, J. A., Prentice, K. J., Morris, S. E., & Heerey, E. A. (2008). Reward
Processing in Schizophrenia: A Deficit in the Representation of Value. Schizophrenia
Bulletin, 34(5), 835-847.
Hamill, S., Trevitt, J. T., Nowend, K. L., Carlson, B. B., & Salamone, J. D. (1999). Nucleus
accumbens dopamine depletions and time-constrained progressive ratio performance:
Effects of different ratio requirements. Pharmacology Biochemistry and Behavior, 64(1),
21-27.
Hauber, W., & Sommer, S. (2009). Prefrontostriatal Circuitry Regulates Effort-Related Decision
Making. Cerebral Cortex, 19(10), 2240-2247.
Heerey, E. A., Bell-Warren, K. R., & Gold, J. M. (2008). Decision-Making Impairments in the
Context of Intact Reward Sensitivity in Schizophrenia. Biological Psychiatry, 64(1), 62-
69.
Kellendonk, C., Simpson, E. H., Polan, H. J., Malleret, G., Vronskaya, S., Winiger, V., . . .
Kandel, E. R. (2006). Transient and selective overexpression of dopamine D2 receptors
in the striatum causes persistent abnormalities in prefrontal cortex functioning. Neuron,
49(4), 603-615.
Kiang, M., Christensen, B. K., Remington, G., & Kapur, S. (2003). Apathy in schizophrenia:
clinical correlates and association with functional outcome. Schizophrenia Research,
63(1–2), 79-88.
Kring, A. M., & Elis, O. (2013). Emotion Deficits in People with Schizophrenia. Annual Review
of Clinical Psychology, Vol 9, 9, 409-433.
Laruelle, M., Abi-Dargham, A., vanDyck, C. H., Gil, R., Dsouza, C. D., Erdos, J.,Innis, R.
B. (1996). Single photon emission computerized tomography imaging of amphetamine-
induced dopamine release in drug-free schizophrenic subjects. Proceedings of the
National Academy of Sciences of the United States of America, 93(17), 9235-9240.
Marin, R. S. (1991). APATHY - A NEUROPSYCHIATRIC SYNDROME. Journal of
Neuropsychiatry and Clinical Neurosciences, 3(3), 243-254.
Myin-Germeys, I., Delespaul, P. A. E. G., & deVries, M. W. (2000). Schizophrenia Patients Are
More Emotionally Active Than Is Assumed Based on Their Behavior. Schizophrenia
Bulletin, 26(4), 847-854.
![Page 206: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/206.jpg)
191
Nowend, K. L., Arizzi, M., Carlson, B. B., & Salamone, J. D. (2001). D1 or D2 antagonism in
nucleus accumbens core or dorsomedial shell suppresses lever pressing for food but leads
to compensatory increases in chow consumption. Pharmacology Biochemistry and
Behavior, 69(3-4), 373-382.
Roth, Laura A. Flashman, Andrew J. Saykin, Thomas W. McAllister, & Robert Vidaver. (2004).
Apathy in Schizophrenia: Reduced Frontal Lobe Volume and Neuropsychological
Deficits. American Journal of Psychiatry, 161(1), 157-159.
Salamone, J. D., Aberman, J. E., Sokolowski, J. D., & Cousins, M. S. (1999). Nucleus
accumbens dopamine and rate of responding: Neurochemical and behavioral studies.
Psychobiology, 27(2), 236-247.
Salamone, J. D., Cousins, M. S., & Bucher, S. (1994). ANHEDONIA OR ANERGIA -
EFFECTS OF HALOPERIDOL AND NUCLEUS-ACCUMBENS DOPAMINE
DEPLETION ON INSTRUMENTAL RESPONSE SELECTION IN A T-MAZE COST-
BENEFIT PROCEDURE. Behavioural Brain Research, 65(2), 221-229.
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S., . . .
Balsam, P. D. (2011). Pharmacologic Rescue of Motivational Deficit in an Animal Model
of the Negative Symptoms of Schizophrenia. Biological Psychiatry, 69(10), 928-935.
Treadway, M. T., Peterman, J. S., Zald, D. H., & Park, S. (2015). Impaired effort allocation in
patients with schizophrenia. Schizophrenia Research, 161(2–3), 382-385.
Ward, R. D., Simpson, E. H., Richards, V. L., Deo, G., Taylor, K., Glendinning, J. I., . . .
Balsam, P. D. (2012). Dissociation of Hedonic Reaction to Reward and Incentive
Motivation in an Animal Model of the Negative Symptoms of Schizophrenia.
Neuropsychopharmacology, 37(7), 1699-1707.
Ward, R. D., Winiger, V., Higa, K. K., Kahn, J. B., Kandel, E. R., Balsam, P. D., & Simpson, E.
H. (2015). The Impact of Motivation on Cognitive Performance in an Animal Model of
the Negative and Cognitive Symptoms of Schizophrenia. Behavioral Neuroscience,
129(3), 292-299
Wong, D. F., Wagner, H. N., Tune, L. E., Dannals, R. F., Pearlson, G. D., Links, J. M., . . .
Gjedde, A. (1986). POSITRON EMISSION TOMOGRAPHY REVEALS ELEVATED
D2 DOPAMINE-RECEPTORS IN DRUG-NAIVE SCHIZOPHRENICS. Science,
234(4783), 1558-1563.
![Page 207: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/207.jpg)
192
Chapter 6
The Effects of Pharmacological Modulation of the Serotonin 2c
Receptor on Goal-Directed Behavior in Mice
Abstract
Rationale Impaired goal-directed motivation represents a debilitating class of symptoms
common to psychological disorders including schizophrenia and some affective disorders.
Despite the known negative impact of impaired motivation, there are currently no effective
pharmacological interventions to treat these symptoms. Objectives Here, we evaluate the
effectiveness of the serotonin 2C (5-HT2C) receptor selective ligand, SB242084, as a potential
pharmacological intervention for enhancing goal-directed motivation in mice. The studies were
designed to identify not only efficacy but also the specific motivational processes that were
affected by the drug treatment.
Methods We tested subjects following treatment with SB242084 (0.75 mg/kg) in several operant
lever pressing assays including the following: a progressive ratio (PR) schedule of
reinforcement, an effort-based choice task, a progressive hold down task (PHD), and various
food intake tests.
Results Acute SB242084 treatment leads to an increase in instrumental behavior. Using a battery
of behavioral tasks, we demonstrate that the major effect of SB242084 is an increase in the
amount of responses and duration of effort that subjects will make for food rewards. This
enhancement of behavior is not the result of non-specific hyperactivity or arousal nor is it due to
changes in food consumption.
![Page 208: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/208.jpg)
193
Conclusions Because of this specificity of action, we suggest that the 5-HT2C receptor warrants
further attention as a novel therapeutic target for treating pathological impairments in goal-
directed motivation.
Introduction
Many patients with schizophrenia (50 %) and affective disorders (20–80 %) experience a
pervasive inability to activate their behavior in pursuit of positive goals, reflecting a deficit in
goal-directed motivation (Kiang et al. 2003; Roth 2004; Faerden 2009; Feil et al. 2003; Marin et
al. 2003). Clinically, a paucity of goal-directed behavior is referred to as apathy (Markou et al.
2013), and the impact of this debilitating symptom has become increasingly recognized in recent
years (Chase 2011; Treadway and Zald 2011, 2013). Impaired goal-directed motivation
decreases a patient’s quality of life and functional outcomes (Kiang et al. 2003), which presents a
challenge, as there are not approved pharmacological treatments with demonstrated effectiveness
for alleviating these deficits (Chase 2011; Levy and Czernecki 2006). Some medications used to
treat schizophrenia and affective disorders have even been shown to exacerbate these symptoms
in preclinical models (Sanders et al. 2007; Simpson et al. 2011; Aberman et al. 1998; Salamone
2009) and in patients (Wongpakaran et al. 2007).
In healthy individuals, intact goal-directed motivation requires a normal hedonic response
to an outcome, as well as the willingness to expend energy in pursuit of a goal. These two
processes are known to be controlled by distinct neurobiological substrates (Berridge and
Robinson 2003; Salamone et al. 2007). Substantial research indicates that hedonic reaction (the
capacity to experience pleasure) is mostly intact in schizophrenia (Cohen and Minor 2010;
Oorschot et al. 2011), whereas the willingness to expend effort is impaired (Treadway and Zald
2013). This has now been observed for some patients with depression (Treadway et al. 2009,
![Page 209: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/209.jpg)
194
2012) and schizophrenia (Barch and Dowd 2010; Strauss et al. 2011) and represents a deficit for
which the development of new pharmacological treatment strategies would be highly beneficial.
A new potential target for modulating goal-directed motivation is the serotonin 2C (5-
HT2C) receptor. This receptor is expressed in several brain regions involved in various aspects
of motivated behaviors, including the following: regions of the cortex (pyriform, cingulate,
prefrontal), limbic areas (nucleus accumbens (NAcc), dorsal striatum, amygdala, hippocampus),
and dopaminergic midbrain nuclei (ventral tegmental area (VTA), substantia nigra (SN))—
observations made through studies of mRNA, radioactive ligand binding, and
immunohistochemical analyses (reviewed in Fletcher and Higgins 2011).
Numerous recent studies have demonstrated that 5HT2C receptor activity can modulate
the neuronal activity of dopamine (DA) neurons and DA release at the terminal sites of the
mesolimbic, mesocortical, and niagro-striatal DA pathways (Alex and Pehek 2007; Di Matteo et
al. 2008a, b). Importantly, DA neurotransmission is well known to be involved in motivated
behavior and the willingness to exert effort toward a goal (Salamone et al. 2007). Specifically,
low doses of DA antagonists and NAcc DA depletions lower subjects’ willingness to work,
impairing the selection of higheffort/ high-reward options while increasing the selections of low-
effort options (Salamone et al. 1994, 2007; Nowend et al. 2001). Generally, the 5-HT2C receptor
exerts an inhibitory influence on DA neurotransmission, as serotonin and 5- HT2C receptor
agonists decrease the firing rates of DA neurons in the VTA and SN, leading to reductions in
NAcc and striatal DA efflux, respectively. In contrast, antagonists and inverse agonists increase
the firing rates of DA neurons and enhance DA efflux in the terminal targets of these neurons
(reviewed in Alex and Pehek 2007; DiMatteo et al. 2008a, b).
![Page 210: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/210.jpg)
195
Pharmacological modulation of the 5-HT2C receptor with the highly selective 5-HT2C
receptor ligand SB242084 was previously reported to increase behavioral output in an effortful
appetitive task in mice (Simpson et al. 2011). Although often referred to as an antagonist,
SB242084 is one of several compounds displaying functional selectivity at the 5-HT2C receptor,
meaning that its effects differ on the multiple downstream signaling pathways associated with
the receptor. Specifically, SB242084 acts as an inverse agonist on phospholipase A2 (PLA2) and
the inhibitory G protein, Gαi, but has agonist effects on phospholipase C (PLC) (De
Deurwaerdère et al. 2004).
Here, we characterize the effects of SB242084 in several assays of motivated behavior
(progressive ratio, an effort-based choice task, and a progressive hold down task) and then rule
out possible alternative explanations for the observed increases in motivation (changes in
nongoal- directed hyperactivity and feeding behavior). We also test the time course of the drug’s
effect on behavior. Our data demonstrate that acute treatment with SB242084 specifically
enhances goal-directed motivation, indicating that the 5-HT2C receptor should be considered as
a target for the development of treatments for pathological deficits in motivation.
Methods
Subjects
Experiments used C57BL/6J mice (The Jackson Laboratory, Bar Harbor,ME, USA)
which were 10 weeks old at the start of the experiments. The sex and the number of mice used
are provided below. All experiments and animal care protocols were in accordance with the
Columbia University and NYSPI Institutional Animal Care and Use Committees and Animal
Welfare regulations.
Drug treatment
![Page 211: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/211.jpg)
196
The selective 5-HT2C receptor ligand SB242084 (Tocris Bioscience, Ellisville, Missouri,
USA) was dissolved in 0.9 % saline and injected intraperitoneally (IP) 20 min before the start of
behavioral testing (doses used for each described below).
Behavioral procedures
Progressive ratio
Twenty-four male C57BL/6J mice were divided into two groups: vehicle control group
(n=12) and SB242084-treated group (n=12). Regular home cage chow (Isopro RMH 3000
complete mouse diet; Prolab, Syracuse, NY) was provided in a restricted manner to maintain
subjects at 85 % of ad lib baseline body weight. Unless otherwise noted, the apparatus used was
identical to that used by Drew et al. (2007).
The protocol for operant lever press training was carried out as described in Drew et al.
(2007), with some differences. Once all subjects lever pressed for evaporated milk reinforcers at
a high rate, subjects were then tested on a PR (3) × 2 schedule of reinforcement. The press
requirement started at 3 and was multiplied by 2 thereafter, following the function 3×2(n−1);
where n is the trial number (i.e., 3, 6, 12, 24, 48, 96, 192, and so on). Sessions ended after 3 min
elapsed without a subject making a single press or after 2 h, whichever came first. Subjects were
tested on the PR (3) × 2 for five consecutive days receiving an IP injection of either saline or
0.75 mg/kg SB242084. The dose of 0.75 mg/kg was chosen based on previous work showing a
behavioral effect (Simpson et al. 2011).
Effort-based choice task
Twenty male C57BL/6J mice were divided into two groups: saline control group (n = 10)
and SB242084-treated group (n = 10). Standard lever press training was carried out as described
in Drew et al. (2007). In the effort-based choice task (EBCT), subjects were exposed to a random
![Page 212: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/212.jpg)
197
ratio (RR) schedule of reinforcement (subjects were reinforced after a variable number of
responses), and there was a small petri dish of freely available chow present in the chamber.
Sessions lasted 1 h, and the testing occurred 5 days per week for 3 weeks. In week 1, both groups
were treated with vehicle and tested in a RR10; in week 2, while earning rewards on the RR10,
controls received saline and the drug group received 0.75 mg/kg SB242084; in week 3, an RR20
was in effect and subjects received the same drug treatments as the previous phase.
Food intake
2-h chow intake
Twenty male and 20 female C57BL/6J mice were randomly assigned to a restricted
feeding (RF) condition in which food was only available during the test or an ad lib feeding
condition (n = 10 per sex per condition) in which food was also freely available in the home
cage. The feeding test consisted of 2-h access to home cage chow and water in a separate clean
cage identical to the home cages of the subjects. Subjects were acclimated to the 2-h daily
feeding session for 5 days, receiving vehicle injections each day. In the drug phase, subjects from
both the RF and ad lib feeding conditions were randomly assigned to one of two treatment orders
each lasting for 4 days: saline followed by 0.75 mg/kg SB242084 or the reverse order (n = 5
subjects per feeding condition, drug treatment, order, and sex).
1-h milk intake
Twenty male C57BL/6J mice were randomly assigned to a saline (n = 10) - or SB242084
(n = 10) - treated group for three consecutive days and given 1-hr free access to the evaporated
milk. Total milk consumed in the hour was determined by weighing the amount of milk
consumed in the session.
Progressive hold down
![Page 213: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/213.jpg)
198
Sixteen male C57BL/6J mice were used for the progressive hold down (PHD)
experiment. Standard lever press training was carried out as described above. Next, mice were
trained to make lever presses of extended durations (i.e., holding the lever in the depressed
position until a required criterion time) as previously described (Bailey et al. 2015). Briefly, mice
were first trained with a variable interval hold (VIH) schedule in which the required hold time at
the start of each trial was randomly determined from an exponential distribution of times of a
given mean. Subjects were successively trained on VIH schedules with means of 0.5, 1, 2, 3, 4,
5, 8, and 10 s. Sessions lasted 1 h or until 40 reinforcers were earned, whichever came first.
When all subjects reached the criterion of 40 reinforcers for three consecutive days on a
schedule, they were advanced to VIH schedules with a higher mean. After all subjects earned 40
reinforcers for four consecutive days on a VIH of 10 s, PHD testing began. Subjects were then
tested on a PHD (2.0 s) × 1.13 schedule - the first hold duration was 2 s and was multiplied by
1.13 for each trial thereafter, following the function 2× 1.13(n−1); (i.e., 3.26 s on the 5th trial,
6.0 s on the 10th trial, 20.39 s on the 20th trial, etc.).
Subjects were then tested on a more difficult PHD (4 s) × 1.18 schedule - the first hold
duration was 4 s and was multiplied by 1.18 thereafter, following the function 4×1.18(n−1); (i.e.,
7.75 s on the fifth trial, 17.7 s on the tenth trial, etc.). Following 4 days of testing on PHD (4 s) ×
1.18 in which all subjects received saline prior to testing, subjects were tested during the drug
phase of the experiment. Two doses (0.75 and 1.5 mg/kg SB242084) were used. Subjects were
randomly divided into two treatment orders and were tested on the PHD schedule for 3 days at
one dose, followed by a 4-day vehicle washout period before being tested for 3 days at the
second dose.
Progressive ratio: pretreatment
![Page 214: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/214.jpg)
199
Twenty male C57BL/6J mice were divided into a saline control group (n = 10) and a 0.75
mg/kg SB242084 group (n = 10). All aspects of the PR experiment were identical to the
previously described PR experiment with one exception. After learning to press at high rates,
Subjects were treated for five consecutive days with either saline or 0.75 mg/kg SB242084
without any behavioral testing. In the next 5 days, subjects were tested on the PR without
receiving any injections.
Progressive ratio: repeated exposure
Twenty male C57BL/6J mice were tested on the PR for four consecutive weeks. In the
first baseline week, all subjects received daily vehicle injections prior to PR testing. Subjects
were then divided into a control group (n = 10) and a treatment group (n = 10). Over the next 3
weeks, the control group received vehicle injections in every week, whereas the treatment group
received 0.75 mg/kg SB242084 in week 1 (treatment 1), vehicle in week 2 (Washout), and 0.75
mg/kg SB242084 in week 3 (treatment 2).
Data analysis
All data were analyzed using two-tailed Student t tests or, where appropriate, repeated
measure analysis of variance (ANOVA). In all experiments, data were averaged across all days
of a specific treatment type (e.g., vehicle or SB242084 treatment) with the number of days
provided in the figure legend. Planned comparisons are reported in the main text, and significant
post hoc analyses are reported in the figure legends.
Results
SB242084 increases responding for food rewards in a progressive ratio schedule of
reinforcement. It was previously reported that SB242084 increased responding in a PR schedule
of reinforcement (Simpson et al. 2011). In a replication of this work, treatment with SB242084
![Page 215: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/215.jpg)
200
led to a significant increase in lever presses (t (22) = 3.971, p = 0.0006; Fig. 1a) and significantly
longer session durations (t (22) = 3.355, p = 0.0029). Figure 1b shows cumulative survival, the
percentage of mice still working for rewards as a function of session time (sessions were
terminated after 3 min without a response or after 2 h has elapsed). A Mantel-Cox log rank test
revealed that SB242084-treated mice worked significantly longer than vehicle-treated subjects.
As a result of this, treatment with SB242084 also led to a significant increase in the number of
reinforcers earned (t (22) = 3.710, p = 0.0012; Fig. 1c).
Fig. 1 SB242084 at 0.75 mg/kg increases responding in a progressive ratio. (A) Mean + (SEM)
number of lever presses made during the PR session. (B) Cumulative survival curves in the PR.
(C) Mean + (SEM) number of rewards earned in PR session. (D) Mean + (SEM) response rate
(presses/minute) as a function of trial number/reward number in the PR session. (E) Group
average response rate from the peak of responding through the end of session fit with the negative
exponential function y=a^(−b×n). (F) Mean + (SEM) peak response rate (a) in PR session. (G)
![Page 216: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/216.jpg)
201
Mean + (SEM) decay rate (b) in the PR session. Average of five consecutive days of testing in PR
(3) × 2 for vehicle (n=12) and SB (n=12)-treated mice. **p<0.01
We further characterized SB242084’s effect on behavior in the PR by examining the rate
of lever pressing during the session and observed that the rate increases to a peak rate by about
the third trial and then slowly declines over the session (Fig. 1d). To evaluate the peak press rate
and the rate of decline in responding over trials, we fit the response rate data starting on the third
trial for each individual subject with negative exponential functions of the form y = (a × exp (−b
× n)), where a is the y intercept of the function (reflecting the maximum response rate reached),
b is the rate of decay (how fast the decline in the rate of responding occurred), and n is the trial
number (Fig. 1e). The drug did not affect the maximum response rate, as the a parameter was not
significantly different (t (22) = 1.03, p = 0.310; Fig. 1f). It did significantly change the decay rate,
as the decline in response rate (b parameter) was slower in SB242084-treated mice (t (22) = 2.835,
p = 0.009; Fig. 1g). Finally, both groups showed equal interest in consuming reinforcers, as the
number of unconsumed rewards was the same in each group (vehicle: mean = 1.08, SEM =
0.676; SB: mean = 1.125, SEM=0.688; t(22) = 0.064, p = 0.949).
SB242084 increases responding for a preferred reward in an effort-based choice task
To determine if the effects of SB242084 generalized across different assays of
motivation, we tested mice in an EBCT to evaluate willingness to choose to work for a preferred
reward. Figure 2a shows that subjects treated with 0.75 mg/kg SB242084 made more presses as
there was a significant main effect of schedule (F (1,56) = 108.7, p < 0.001), a significant main
effect of the drug (F (1,56) = 7.28, p < 0.01), and a significant drug by schedule interaction (F (1,56)
= 6.02, p = 0.017). Post hoc comparisons revealed that the groups differed significantly only for
the RR20 schedule. Subjects consumed more freely available chow as the effort requirement
increased (F (1,56) = 8.654, p = 0.004) and SB242084 reduced chow consumption (F (1,56) = 4.345,
![Page 217: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/217.jpg)
202
p = 0.041). There was no drug by schedule interaction (F (1,56 )= 2.749, p = 0.102). Again, there
was no difference in the number of missed rewards between groups.
Fig. 2 SB242084 at 0.75 mg/kg increases responding for a preferred reward in an effort-based
choice task. (A) Mean + (SEM) number of lever presses made during 1 h of an effort-based
choice task under different ratio schedules. (B) Mean + (SEM) total intake (g) of freely available
chow during the 1 h effort-based choice task for the different ratio schedules. Average of 5 days
of testing in RR 10 (Veh/Veh), 5 days of testing in RR 10 (Veh/SB), and 5 days of testing in RR
20 (Veh/SB) for vehicle (n=12) and SB (n=12)-treated mice. *p<0.05, with Bonferoni correction
for multiple comparisons
SB242084 enhances goal-directed action in a progressive hold down task
To assess if SB242084 would increase responding across different types of work, we
tested subjects in the PHD task, in which the increasing work requirement is the time that
subjects must hold a lever in the depressed position. Subjects were first tested on a low-difficulty
PHD (3 s) × 1.13 schedule, but because this was too easy, the median session duration was the
maximum 120 min as most subjects were at a ceiling level of performance. We then tested all
subjects on a more demanding PHD (4 s) × 1.18 schedule in order to be able to measure potential
![Page 218: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/218.jpg)
203
increases in performance when subjects were given SB242084. In the harder PHD schedule,
SB242084 led to a dose dependent increase in the number of reinforcers earned (F (2, 38) = 3.329,
p = 0.041; Fig. 3a), as well the session durations (F (2,38) = 7.846, p = 0.0008; Fig. 3b), and the
total time subjects spent successfully holding the lever down (F (2,38) = 11.17, p < 0.0001; Fig.
3c), showing that the drug increases responding for rewards in a manner similar to that which we
observed in both the PR and the effort-based choice task.
Fig. 3. SB242084 increases goal-directed action in a progressive hold down task. (A) Mean +
(SEM) number of rewards earned in the PHD session. (B) Mean + (SEM) session duration (min)
in the PHD task. (C) Mean + (SEM) time spent successfully working in the PHD task. Average of
3 days of testing in each condition. *p<0.05, with Bonferoni correction for multiple comparisons
SB242084 effects on increased operant responding are not due to increases in non-goal-
directed hyperactivity/arousal
We also assessed whether SB242084 enhanced non-goalspecific hyperactivity, as this
could possibly occur independently of the drug’s effect of increasing goal-directed action.
SB242084 treatment significantly increased the total number of lever presses in the PHD session
(F (2,38) = 14.63, p < 0.0001; Fig. 4a). A contribution of hyperactivity would be expected to
manifest itself in unsuccessful presses throughout a PHD session (see Bailey et al. 2015), but the
within session response profiles of subjects suggested that the increased responses were
occurring mostly at the end of the session. Figure 4b shows the mean number of failed or
![Page 219: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/219.jpg)
204
unsuccessful presses per trial, where trial number 0 reflects the last trial attempted for every
subject (i.e., subjects did not complete the requirement). Unsuccessful presses increase as the
hold requirement becomes more difficult and the drug appeared to increase unsuccessful
attempts in the final trial. We analyzed the first and last five trials of subjects to see if there was a
group difference in the number of failed presses at the beginning or end of the session. Across
the first five trials (Fig. 4c), there was no significant effect of trial (F (4,210) = 1.852; p = 0.120),
no significant effect of dose (F (2,210) = 1.115; p = 0.330), and no dose × trial interaction (F (8,210)
= 0.772; p = 0.628). In the last five trials (Fig. 4d), there was a significant main effect of trial
number (F (4,210) = 5.237; p = 0.0004), but no significant main effect of dose (F (2,210) = 0.650; p =
0.522), and despite the mean differences, there was no dose × trial interaction (F (8,210) = 1.039; p
= 0.407), likely due to the high variability in the last trial.
Fig. 4 SB242084 does not enhance non-goal-directed hyperactivity. (A) Mean + (SEM) number
of lever presses in the PHD session. (B) Mean + (SEM) number of failed (unsuccessful) press
attempts in the PHD session as a function of trial number from each subject’s last attempted trial
![Page 220: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/220.jpg)
205
(i.e., for each subject, 0 corresponds to their last trial and −1 corresponds to their penultimate
trial, and so forth). (C) Mean + (SEM) number of unsuccessful presses in each subject’s first five
trials. (D) Mean + (SEM) number of unsuccessful presses in each subject’s last five trials.
Average of 3 days of testing in each condition. **p<0.01, with Bonferoni correction for multiple
comparisons
SB242084 does not alter feeding behavior in either a hungry or sated state
We examined SB242084’s effects on feeding behavior to see if this was altered by the
drug. In a 2-h food intake test (the same timescale used in the operant experiments), SB242084
had no effect on food intake in hungry or sated mice, in either males or females (Fig. S1a-b in
Supplement). In a 1-h feeding intake test using evaporated milk (the same reward used in the
operant experiments), SB242084 had no effect on the amount of milk consumed (Fig. S1c in
Supplement).
Prior treatment with SB242084 has no effect on progressive ratio performance
Systemic treatment with SB242084 for five consecutive days has been recently shown to
induce molecular changes in the brain and has antidepressant-like behavioral effects (Opal et al.
2013), so we tested this treatment regimen and examined behavior in the PR. When tested drug-
free, pretreatment with SB242084 for 5 days had no effect on any dependent variables in the PR:
number of presses, session duration, or reinforcers earned (Fig. S2a–c in Supplement).
The acute effect of SB242084 on goal-directed motivation can be reinstated repeatedly
We next tested whether there are post-administration carryover effects of the drug and
whether or not the effectiveness of the drug is changed with repeated administration. All subjects
were given vehicle baseline week of PR testing and assigned to two groups so that no baseline
differences existed in any parameters. Control subjects then received 3 weeks of vehicle
injections, and drug-treated subjects received treatment with SB242084 (Treatment 1), followed
![Page 221: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/221.jpg)
206
by a week of vehicle (Washout), followed by a week of SB242084 (Treatment 2) while being
tested in the PR (Fig. 5a).
Pooled across the 2 weeks of drug treatments, the SB242084 group earned more rewards
(t (36) = 2.71, p = 0.010), pressed more (t (36) = 2.437, p = 0.020), and continued responding longer
(t (36) = 2.883, p = 0.0068) than vehicle controls (Fig. 3a–c in Supplement), replicating our earlier
findings (Fig. 1) and those of Simpson et al. (2011).
Fig. 5. The acute effects of SB242084 (0.75 mg/kg) can be reinstated repeatedly. (A) Schematic
of the experimental design used for the repeated administration of SB242084 experiment. (B)
Mean + (SEM) number of rewards earned in PR session for each treatment condition. (C) Mean +
(SEM) number of lever presses made in PR session for each treatment condition. (D) Mean +
(SEM) session duration (min) for each treatment condition. Average of 5 days of testing *p<0.05;
**p<0.01, with Bonferoni correction for multiple comparisons
![Page 222: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/222.jpg)
207
Planned comparisons were made to assess carryover effects and whether drug efficacy
changes with repeated administration. Figure 5b–d shows data for each treatment week in the
experiment. In week 3 (when both groups received vehicle), there were no differences in any
aspect of PR performance [number of rewards earned (t (16) = 1.352; p = 0.195; Fig. 5a), number
of lever presses (t (16) = 0.463; p = 0.649; Fig. 5b), and session duration (t (16) = 0.551; p = 0.588;
Fig. 5c), again showing that the drug acts acutely but that it does not have carryover effects.
Finally, in the drug group, Treatment 1 and Treatment 2 were compared and there were no
significant differences between the two exposures in any of the measures [number of rewards
earned (t (9) = 0.001; p = 1.000), the number of lever presses (t (9) = 0.588; p = 0.570), and session
duration (t (9) = 1.801; p = 0.105)], showing that enhanced goaldirected responding can be
reinstated repeatedly in the same subjects.
Discussion
Prolonged impairments in goal-directed motivation are a problematic class of symptoms
for which no approved pharmacological treatments currently exist (Chase 2011; Levy and
Czernecki 2006). Systemic treatment with the 5-HT2C receptor ligand SB242084 increases the
firing rate of VTA DA neurons and enhances DA efflux at the NAcc in a manner that would be
predictive of the drug-enhancing motivated behavior. It was previously shown that SB242084
increased lever press responding in a PR schedule of reinforcement (Simpson et al. 2011). Here,
we investigate the effects of this compound on goal-directed motivation using a comprehensive
series of assays and determine the conditions under which the drug produces behavioral effects.
SB242084 enhances behavioral output across several measures of motivated behavior
In the PR, SB242084 treatment increased lever pressing and also how long subjects
pressed before quitting, which resulted in more rewards being earned—consistent with increased
![Page 223: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/223.jpg)
208
motivation (Hodos 1961; Aberman et al. 1998).We replicated this in an additional experiment
with alternating weeks of SB242084 treatment. SB242084 increased all dependent measures in
both exposures to the drug, replicating our first experiment and the observations in Simpson et
al. (2011).
We next tested the effects of SB242084 in an EBCT, which gave subjects a choice
between a high-effort option for a preferred reward (lever pressing for milk) and a low-effort
option for a less preferred reward (consuming freely available home cage chow). SB242084
increased lever pressing for the preferred reward and decreased chow consumption, a pattern of
results frequently interpreted as an increased willingness to work for a preferred reward
(Salamone et al. 2007).
The results of both the PR and EBCT experiments demonstrate that SB242084 increases
motivation for food rewards, but both of these tasks are response rate dependent (i.e., higher
response rates benefit the subject and lead to more rewards). Because SB242084 has been shown
to increase overall locomotor activity in an open field test (Fletcher et al. 2009), we wanted to
test whether an increase in hyperactivity or general arousal could have contributed to the PR
result. To assess this, we used the PHD task which requires subjects to make sustained responses
of increasing durations, making increased willingness to work for a goal and increased response
rates incompatible with one another. SB242084- treated subjects earned more rewards by
continuing to hold the lever down for longer durations, clearly showing that the drug increases
behavioral output in pursuit of a goal across different modalities of work requirements (i.e.,
making multiple lever presses or holding the lever down for longer durations).
In the PHD task, SB242084 also increased the number of total lever presses made, but the
extra presses mainly occurred in the last trial when subjects were not able to meet the next hold
![Page 224: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/224.jpg)
209
requirement. This pattern may reflect continued persistence in light of repeated failure. The
observation that the number of short duration unsuccessful presses did not differ throughout the
entire session also suggests that the drug does not induce non-goal-specific hyperactivity as seen
with methamphetamine (Bailey et al. 2015). Taken together, the results of the PR, EBCT, and
PHD demonstrate that SB242084 increases subject’s persistence in responding across different
operant tasks, independent of the type of work requirement demanded by the task.
Because an increase in hunger or feeding behavior could enhance operant responding for
food, we conducted several experiments to examine this alternative explanation, which is
plausible as several 5-HT2C receptor agonists have been shown to reduce food intake and
bodyweight (Clifton et al. 2000; Dalton et al. 2006; reviewed in Fletcher et al. 2010). The
experiments reported here found no effect of SB242084 on food intake in either males or females
under various conditions, which is consistent with several previous reports (Vickers et al. 2000;
Hewitt et al. 2002; Dalton et al. 2006; Fletcher et al. 2009).
The 5-HT2C receptor and reward-related behaviors
After observing that SB242084 can enhance the firing rates of DA VTA neurons and DA
efflux in the NAcc, many studies have since examined 5-HT2C receptor’s involvement in
reward-related behaviors. Several studies have looked at the effect of 5-HT2C ligands on a
number of psychostimulantinduced behaviors, including the following: drug-induced
locomotion, self-administration, and reinstatement. A generalized finding has been that 5-HT2C
receptor agonists attenuate psychostimulant-induced behaviors, whereas antagonists and inverse
agonists tend to enhance such behaviors (reviewed in Fletcher and Higgins 2011).
More specifically related to the present study, the effects of 5-HT2C ligands have been
tested on operant responding for food. 5-HT2C receptor agonists have consistently been shown
![Page 225: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/225.jpg)
210
to reduce motivated responding for food rewards (Grottick et al. 2000; Ward et al. 2008;
Cunningham et al. 2011; Higgins et al. 2013).
There have been two previous reports on the effects of SB242084 on motivation for food
by Fletcher et al. 2010 and Bezzina et al. 2015 which may appear contradictory to the present
findings because they do not report a significant increase in incentive motivation. There are a
number of differences between these studies and the ones reported here, but in both studies, the
overall level of effort/output required of subjects was significantly lower compared to most of
the procedures used here. Like these studies, we found that when the effort requirement was low
(RR10) in the EBCT, drug and control groups did not differ, but when the effort requirement
increased (RR20), a significant drug effect emerged. If a task is too easy (or too hard), it may be
difficult to study motivation-enhancing compounds.
Treatment conditions in which SB242084 can enhance goal-directed behavior
Given the apparent specificity of SB242084 on goal-directed motivation, determining the
effective treatment conditions will be important for the development of future pharmacological
treatments for impaired motivation. One recent study has shown that SB242084 given to mice
for five consecutive days induces fast-acting antidepressant-like effects at both the behavioral
and molecular levels (Opal et al. 2013). Using this same treatment regimen, we found that the
drug was only effective when present during testing: 5 days of prior treatment with SB242084
had no effect on subsequent performance in the PR task in the absence of the drug. We also
found that the acute effects of the drug can be reinstated repeatedly.
Potential mechanisms of action of SB242084 in enhancing motivation
One possible mechanism by which SB242084 may act to increase motivation in the
present studies is through phasic modulation of mesolimbic DA neurotransmission. This is a
![Page 226: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/226.jpg)
211
plausible mechanism for the present results, as systemically administered 5-HT2C receptor
ligands have been shown to alter the firing rate of DA VTA neurons (Di Giovanni et al. 2000;
DiMatteo et al. 1999, 2000), altering DA release to the NAcc (Di Giovanni et al. 2001). Many
studies have demonstrated DA’s involvement in motivation and the willingness to exert effort
(Salamone et al. 2007), such that drugs which reduce and enhance DA neurotransmission lead to
decreased and increased willingness to work, respectively (Salamone et al. 1994, 1991, 2007;
Nowend et al. 2001). Future studies will be needed to determine the locations and mechanism
through which SB242084 acts to increase goal-directed behavior.
5-HT2C receptor as a target for novel therapeutic interventions of impaired motivation
There have been previous suggestions that 5-HT2C receptor ligands may be useful in
treating depression and schizophrenia (Simpson et al. 2011; Millan 2005). The present work
provides support for the possibility that such drugs can help to acutely ameliorate the impaired
motivation commonly seen in these two disorders.
References
Aberman JE, Ward SJ, Salamone JD (1998) Effects of dopamine antagonists and accumbens
dopamine depletions on time-constrained progressive- ratio performance. Pharmacol
Biochem Behav 61:341– 348. doi:10.1016/S0091-3057(98)00112-9
Alex KD, Pehek EA (2007) Pharmacologic mechanisms of serotonergic regulation of dopamine
neurotransmission. Pharmacol Ther 113(2): 296–320.
Bailey MR, Jensen G, Taylor K, Mezias C, Williamson C, Silver R, Simpson EH, Balsam PD
(2015) A novel strategy for dissecting goal-directed action and arousal components of
motivated behavior with a progressive hold-down task. Behav Neurosci 129:269–280.
doi:10.1037/bne0000060
Barch DM, Dowd EC (2010) Goal representations andmotivational drive in schizophrenia: the
role of prefrontal–striatal interactions. Schizophr Bull 36:919–934.
doi:10.1093/schbul/sbq068 Berridge KC, Robinson TE (2003) Parsing reward. Trends
Neurosci 26: 507–513. doi:10.1016/S0166-2236(03)00233-9
![Page 227: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/227.jpg)
212
Bezzina G, Body S, Cheung THC, Hampson CL, Bradshaw CM, Glennon JC, Szabadi E (2015)
Evidence for a role of 5-HT2C receptors in the motor aspects of performance, but not the
efficacy of food reinforcers , in a progress ive r a t io schedu l e . Psychopharmacology
232:699–711. doi:10.1007/s00213-014- 3700-5
Chase T (2011) Apathy in neuropsychiatric disease: diagnosis, pathophysiology, and treatment.
Neurotox Res 19:266–278. doi:10. 1007/s12640-010-9196-9
Clifton PG, Lee MD, Dourish CT (2000) Similarities in the action of Ro 60-0175, a 5-HT2C
receptor agonist, and d-fenfluramine on feeding patterns in the rat. Psychopharmacology
152:256–267. doi:10.1007/ s002130000504
Cohen AS, Minor KS (2010) Emotional experience in patients with schizophrenia revisited:
meta-analysis of laboratory studies. Schizophr Bull 36:143–150.
Cunningham KA, Fox RG, Anastasio NC, Bubar MJ, Stutz SJ, Moeller FG, Gilbertson SR,
Rosenzweig-Lipson S (2011) Selective serotonin 5-HT2C receptor activation suppresses
the reinforcing efficacy of cocaine and sucrose but differentially affects the
incentivesalience value of cocaine- vs. sucrose-associated cues. Neuropharmacology
61:513–523. doi:10.1016/j.neuropharm.2011. 04.034
Dalton GL, Lee MD, Kennett GA, Dourish CT, Clifton PG (2006) Serotonin 1B and 2C receptor
interactions in the modulation of feeding behaviour in the mouse. Psychopharmacology
185:45–57. doi:10.1007/s00213-005-0212-3
De Deurwaerdère P, Navailles S, Berg KA, Clarke WP, Spampinato U (2004) Constitutive
activity of the Serotonin2C receptor inhibits in vivo dopamine release in the rat striatum
and nucleus accumbens. J Neurosci 24:3235–3241.
Di Giovanni G, Di Matteo V, Di Mascio M, Esposito E (2000) Preferential modulation of
mesolimbic vs. nigrostriatal dopaminergic function by serotonin2C/2B receptor agonists:
a combined in vivo electrophysiological and microdialysis study. Synapse 35: 53–61.
doi:10.1002/(sici)1098-2396(200001)35:1<53::aid-syn7>3. 0.co;2-2
Di Giovanni G, Di Matteo V, La Grutta V, Esposito E (2001) m- Chlorophenylpiperazine excites
non-dopaminergic neurons in the rat substantia nigra and ventral tegmental area by
activating serotonin-2C receptors. Neuroscience 103:111–116.
Di Matteo V, Di Giovanni G, Di Mascio M, Esposito E (1999) SB 242 084, a selective
serotonin(2C) receptor antagonist, increases dopaminergic c t r a nsmission i n t h e
mesolimbic system. Neuropharmacology 38:1195–1205.
Di Matteo M, Di Giovanni G, Di Mascio M, Esposito E (2000) Biochemical and
electrophysiological evidence that RO 60-0175 inhibits mesolimbic dopaminergic
function through serotonin(2C) receptors. Brain Res 865:85–90
![Page 228: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/228.jpg)
213
Di Matteo V, Di Giovanni G, Pierucci M, Esposito E (2008) Serotonin control of central
dopaminergic function: focus on in vivo microdialysis studies. In: DiGiovanni G,
DiMatteo V, Esposito E (eds) Serotonin-dopamine interaction: experimental evidence
and therapeutic relevance, Vol. 172, p 7–44
Drew MR, Simpson EH, Kellendonk C, Herzberg WG, Lipatova O, Fairhurst S, Kandel ER,
Malapani C, Balsam PD (2007) Transient overexpression of striatal D-2 receptors
impairs operant motivation and interval timing. J Neurosci 27:7731–7739. doi:10.1523/
jneurosci.1736-07.2007
Faerden A (2009) Apathy is associated with executive functioning in first episode psychosis.
BMC Psychiatr 9:1. doi:10.1186/1471-244x-9-1
Feil D, Razani J, Boone K, Lesser I (2003) Apathy and cognitive performance in older adults
with depression. Int J Geriatric Psychiatr 18: 479–485. doi:10.1002/gps.869
Fletcher P, Higgins G (2011) Serotonin and reward-related behavior: focus on 5-HT2C receptors.
In: Di Giovanni G, Esposito E, Di Matteo V (eds) 5-HT2C receptors in the
pathophysiology of CNS disease. Humana Press, Vol. 22, p 293–324
Fletcher PJ, Tampakeras M, Sinyard J, Slassi A, Isaac M, Higgins GA (2009) Characterizing the
effects of 5-HT2C receptor ligands on motor activity and feeding behaviour in 5-HT2C
receptor knockout mice. Neuropharmacology 57:259–267.
Fletcher PJ, Sinyard J, Higgins GA (2010) Genetic and pharmacological evidence that 5-HT2C
receptor activation, but not inhibition, affects motivation to feed under a progressive ratio
schedule of reinforcement. Pharmacol Biochem Behav 97:170–178. doi:10.1016/j.pbb.
2010.07.002
Grottick AJ, Fletcher PJ, Higgins GA (2000) Studies to investigate the role of 5-HT2C receptors
on cocaine- and food-maintained behavior. J Pharmacol Exp Ther 295:1183–1191
Hewitt KN, Lee MD, Dourish CT, Clifton PG (2002) Serotonin 2C receptor agonists and the
behavioural satiety sequence in mice. Pharmacol Biochem Behav 71:691–700.
doi:10.1016/S0091- 3057(01)00709-2
Higgins GA, Silenieks LB, LauW, de Lannoy IAM, Lee DKH, Izhakova J, Coen K, Le AD,
Fletcher PJ (2013) Evaluation of chemically diverse 5-HT2C receptor agonists on
behaviours motivated by food and nicotine and on side effect profiles.
Psychopharmacology 226: 475–490. doi:10.1007/s00213-012-2919-2
Hodos W (1961) Progressive ratio as a measure of reward strength. Science 134:943–944.
doi:10.1126/science.134.3483.943
![Page 229: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/229.jpg)
214
Kiang M, Christensen BK, Remington G, Kapur S (2003) Apathy in schizophrenia: clinical
correlates and association with functional outcome. Schizophr Res 63:79–88.
doi:10.1016/S0920-9964(02) 00433-4
Levy R, Czernecki V (2006) Apathy and the basal ganglia. J Neurol 253: vii54–vii61.
doi:10.1007/s00415-006-7012-5
Marin RS, Butters MA, Mulsant BH, Pollock BG, Reynolds CF (2003) Apathy and executive
function in depressed elderly. J Geriatr Psychiatry Neurol 16 : 112 – 116 . doi : 10 .1177
/0891988703016002009
Markou A, Salamone JD, Bussey TJ, Mar AC, Brunner D, Gilmour G, Balsam P (2013)
Measuring reinforcement learning and motivation constructs in experimental animals:
relevance to the negative symptoms of schizophrenia. Neurosci Biobehav Rev 37:2149–
2165. doi: 10.1016/j.neubiorev.2013.08.007
Millan MJ (2005) Serotonin 5-HT2C receptors as a target for the treatment of depressive and
anxious states: focus on novel therapeutic strategies. Therapie 60:441–460.
doi:10.2515/therapie:2005065
Nowend KL, Arizzi M, Carlson BB, Salamone JD (2001) D1 or D2 antagonism in nucleus
accumbens core or dorsomedial shell suppresses lever pressing for food but leads to
compensatory increases in chow consumption. Pharmacol Biochem Behav 69:373–382.
doi: 10.1016/S0091-3057(01)00524-X
OorschotM, Lataster T, Thewissen V, Lardinois M,Wichers M, van Os J, Delespaul P, Myin-
Germeys I (2011) Emotional experience in negative symptoms of schizophrenia—no
evidence for a generalized hedonic deficit. Schizophr Bull. doi:10.1093/schbul/sbr137
Opal MD, Klenotich SC, Morais M, Bessa J, Winkle J, Doukas D, Kay LJ, Sousa N, Dulawa SM
(2013) Serotonin 2C receptor antagonists induce fast-onset antidepressant effects. Mol
Psychiatry. doi:10. 1038/mp.2013.144
Roth RM (2004) Apathy in schizophrenia: reduced frontal lobe volume and neuropsychological
deficits. Am J Psychiatry 161:157–159. doi:10.1176/appi.ajp.161.1.157
Salamone JDJD (2009) Dopamine, behavioral economics, and effort. Front Behav Neurosci
3:13. doi:10.3389/neuro.08.013.2009
Salamone JD, Steinpreis RE,McCullough LD, Smith P, Grebel D,Mahan K (1991) Haloperidol
and nucleus accumbens dopamine depletion suppress lever pressing for food but increase
free food consumption in a novel food choice procedure. Psychopharmacology 104:515–
521. doi:10.1007/bf02245659
![Page 230: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/230.jpg)
215
Salamone JD, Cousins MS, Bucher S (1994) Anhedonia or anergia? Effects of haloperidol and
nucleus accumbens dopamine depletion on instrumental response selection in a T-maze
cost/benefit procedure. Behav Brain Res 65:221–229.
Salamone JD, Correa M, Farrar A, Mingote SM (2007) Effort-related functions of nucleus
accumbens dopamine and associated forebrain circuits. Psychopharmacology 191:461–
482. doi:10.1007/s00213- 006-0668-9
Sanders AC, Hussain AJ, Hen R, Zhuang X (2007) Chronic blockade or constitutive deletion of
the serotonin transporter reduces operant responding for food reward.
Neuropsychopharmacology 32:2321– 2329. doi:10.1038/sj.npp.1301368
Simpson EH, Kellendonk C,Ward RD, Richards V, Lipatova O, Fairhurst S, Kandel ER, Balsam
PD (2011) Pharmacologic rescue of motivational deficit in an animal model of the
negative symptoms of schizophrenia. Biol Psychiatry 69:928–935.
Strauss GP, Wilbur RC, Warren KR, August SM, Gold JM (2011) Anticipatory vs.
consummatory pleasure: what is the nature of hedonic deficits in schizophrenia?
Psychiatry Res 187:36–41. doi:10. 1016/j.psychres.2011.01.012
Treadway MT, Zald DH (2011) Reconsidering anhedonia in depression: lessons from
translational neuroscience. Neurosci Biobehav Rev 35: 537–555.
doi:10.1016/j.neubiorev.2010.06.006
Treadway MT, Zald DH (2013) Parsing anhedonia: translational models of reward-processing
deficits in psychopathology. Curr Dir Psychol Sci 22:244–249.
Treadway MT, Buckholtz JW, Schwartzman AN, LambertWE, Zald DH (2009) Worth the
‘EEfRT’? The effort expenditure for rewards task as an objective measure of motivation
and anhedonia. PLoS ONE 4, e6598. doi:10.1371/journal.pone.0006598
Treadway MT, Buckholtz JW, Cowan RL, Woodward ND, Li R, Ansari MS, Baldwin RM,
Schwartzman AN, Kessler RM, Zald DH (2012) Dopaminergic mechanisms of individual
differences in human effort-based decision-making. J Neurosci 32:6170–6176. doi:10.
1523/jneurosci.6459-11.2012
Vickers SP, Benwell KR, Porter RH, Bickerdike MJ, Kennett GA, Dourish CT (2000)
Comparative effects of continuous infusion of mCPP, Ro 60-0175 and d-fenfluramine on
food intake, water intake, body weight and locomotor activity in rats. Br J Pharmacol
130: 1305–1314. doi:10.1038/sj.bjp.0703443
Ward SJ, Lefever TW, Jackson C, Tallarida RJ,Walker EA (2008) Effects of a Cannabinoid1
receptor antagonist and Serotonin2C receptor agonist alone and in combination on
motivation for palatable food: a dose-addition analysis study in mice. J Pharmacol Exp
Ther 325: 567–576. doi:10.1124/jpet.107.131771
![Page 231: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/231.jpg)
216
Wongpakaran N, van Reekum R, Wongpakaran T, Clarke D (2007) Selective serotonin reuptake
inhibitor use associates with apathy among depressed elderly: a case–control study.
Annals Gen Psychiatr 6:1–6. doi:10.1186/1744-859x-6-7
![Page 232: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/232.jpg)
217
Chapter 7
A functional interaction between serotonin receptor signaling and striatal
dopamine release enhances goal-directed vigor and persistence in mice
Abstract
The neural mechanisms of motivation depend on midbrain dopamine for encoding both
effort and reward value. The functionally selective serotonin 5-HT2C receptor ligand SB242084
increases motivated behavior. SB242084 modulates striatal dopamine release in vitro and in
anesthetized preparations, suggesting that the drug increases motivation by altering either the
encoding or influence of reward value and/or response cost. We show that SB242084 increases
persistence and response vigor, particularly when both work is demanding and it is possible to
earn reward, but it does not alter value-based choice. This selective influence on the dynamics of
behavioral output, rather than value based decision making, is reflected in in vivo measures of
striatal dopamine during behavior. SB242084 has no effect on either dopamine tone or reward
related phasic dopamine release in the ventral striatum. In contrast, SB242084 increases
dopamine tone in the DMS, but only during goal directed activity. Therefore the 5-HT2C
receptor maybe an exceptionally good target for specifically treating deficits in motivation.
Introduction
Motivation, defined as the energizing of behavior in the pursuit of a goal, results from a
complex analysis of the costs and benefits associated with that behavior (Rangel et al., 2010;
Simpson et al., 2016). The dopamine system has been implicated in the encoding of costs,
benefits and also cost-benefit computation (Salamone et al., 2012; Salamone et al., 2016).
Understanding the neurobiology of motivation is important because deficits in motivation occur
in a number of psychiatric and neurological disorders. Patients with schizophrenia, depression,
![Page 233: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/233.jpg)
218
PTSD, anxiety disorders and Parkinson’s disease experience deficits in motivation that
significantly impact their functioning and quality of life (Barch et al., 2014; Green, 2016).
Currently there are no available treatments for motivational deficit.
We have previously shown that the functionally selective serotonin 5-HT2C receptor
ligand SB242084 robustly enhances motivated behavior for appetitive reinforcement in mice
across several different tasks (Avlar et al., 2015; Bailey et al., 2015; Simpson et al., 2011). The
same compound has also been shown to induce fast-onset antidepressant effects in mice (Opal et
al., 2014). SB242084 is one of several compounds known to display functional selectivity at the
5-HT2C receptor. It acts as an inverse agonist on Phospholipase A2 (PLA2) and the inhibitory
G-protein, Gαi, and it additionally produces agonist effects on Phospholipase C (PLC) (Berg et
al., 2008; De Deurwaerdère et al., 2004). Functionally selective compounds such as SB242084
offer the exciting possibility of modulating selected signaling pathways, thereby reducing the
chances or severity of unwanted side effects which may result from complete receptor blockade
or activation (Gesty-Palmer et al., 2011).
Studies in anesthetized rats have demonstrated that systemic treatment with 5-HT2C
receptor selective ligands can modulate the dopamine system (Alex et al., 2007; Di Matteo,
Vincenzo et al., 2008). SB242084 has been shown to increase the activity of midbrain dopamine
neurons and also to augment stimulated dopamine release (De Deurwaerdere et al., 1999; Di
Giovanni et al., 1999; Di Matteo, V. et al., 1999). We therefore wondered whether 5-HT2C
receptor modulation enhances incentive motivation by potentiating the normal dopamine release
that affects different aspects of goal directed action.
Information about benefits, including rewards, is encoded by dopamine neurons in the
ventral tegmental area (VTA) of the midbrain that project to the nucleus accumbens (NAc)
![Page 234: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/234.jpg)
219
within the ventral striatum (Goldstein et al., 2012; Schultz, 1998). Support for the involvement of
this mesoaccumbal dopamine pathway in reward encoding comes from studies correlating VTA
dopamine cell activity and NAc dopamine release with several aspects of reward value, including
magnitude (Gan et al., 2010) and prediction error (Shultz, 1998). Information about costs,
including work requirements and delays to rewards, are reflected in neuronal activity and phasic
dopamine release in the NAc (Ostlund et al., 2011). The NAc is also the site where cost- benefit
computations guide decision making (Bissonette et al., 2016; Gan et al., 2010; Walton et al.,
2006; Roesch, 2016). Consequently, if SB242084 alters dopamine release in the mesolimbic
pathway, enhanced motivation may result from the impact on reward encoding and/or the
decision about whether the effort associated with a particular action is worth the payoff.
Information about reward, including reward prediction error is also encoded by neurons
in the dorsal striatum (DS) (Oyama et al., 2010). Neurons in the dorsal striatum also encode the
value of actions (Lau et al., 2007; Samejima et al., 2005), and dopamine input is involved in this
encoding (Balleine et al., 2007; Stalnaker et al., 2012). In addition to the role the DS may play in
reward processing, there are subpopulations processing information that is not used for deciding
whether to make a particular response, but instead guides the dynamics of locomotor response
output (Dudman et al., 2016; Panigrahi et al., 2015). These dynamics reflect the energizing
aspect of motivation. For example, greater motivation often manifests in shorter latencies to start
responding, faster responding, more forceful responding; responding for longer durations; and
persisting in an action in the face of disruptive circumstances. Therefore, it is also possible that
SB242084 enhances motivated behavior by modulating dopamine release in the DS, thereby
altering some or all these aspects of response dynamics and consequently changing the total
effort output.
![Page 235: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/235.jpg)
220
To distinguish between these possible mechanisms, we examined the effects of
SB242084 on both motivated behavior and dopamine signaling. By employing several
behavioral assays, we determined that the motivational effect of SB242084 is related to the
dynamics of reward- related response output, rather than the calculation of response cost or
benefit. This behavioral dissection is also reflected in our dopamine findings. We determined
that dopamine release in the NAc in response to rewards or the cues that predict them is not
altered by SB242084 treatment. Instead, we found that during an effort based choice task,
SB242084 increased extracellular dopamine levels selectively in the dorsal striatum. We
hypothesize that this drug modulates the normal release of dopamine as the increase in striatal
dopamine was only observed when a goal was obtainable and the animal engaged in goal
directed action.
Methods
Subjects
Experiments used male C57BL/6J mice (The Jackson Laboratory, Bar Harbor, ME, USA)
which were 10 - 12 weeks old at the start of the experiments. All subjects were maintained at
85% of their ad libitum bodyweight in order to motivate them to work for food rewards in
operant procedures. All experiments and animal care protocols were in accordance with the
Columbia University and NYSPI Institutional Animal Care and Use Committees and Animal
Welfare regulations. The number of subjects used is indicated in the description of each
experiment below.
Apparatus
Experimental chambers (ENV-307w; Med Associates, St. Albans, VT) equipped with
liquid dippers and pellet dispensers were used in the experiment. Unless otherwise noted, the
![Page 236: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/236.jpg)
221
apparatus was identical to that used by Drew and colleagues, (2007). Two retractable levers were
mounted on either side of a feeding trough, and a house light (model 1820; Med Associates)
located at the top of the chamber was used to illuminate the chamber during the sessions. The
specific reward outcomes used is detailed for each experiment.
Behavioral Procedures
Basic Lever Press Training
Lever press training was carried out in the identical manner as described in Chapter 4.
Concurrent Value Choice
All subjects were first trained to press a lever on each side of the operant chamber using
the basic lever press training procedure described in Chapter 4. The lever on one side of the
chamber delivered a liquid sucrose solution, while the opposite lever delivered a 14mg sucrose
pellet. Subjects were then exposed to increasing RR schedules on each lever for the different
outcomes, receiving one session for a single outcome per day. The progression was as follows: 3
days on CRF, 3 days on RR05, 3days on RR10, and 3 days on RR20.
Once subjects were trained to make presses on lever A for liquid sucrose and lever
presses on lever B for a sucrose pellet they were then moved to the CVC task. Each CVC session
consisted of two separate phases. First, there were 10 single lever forced choice trials in which
either the sucrose lever or the pellet lever was presented. Subjects then had to complete the press
requirement on that lever to obtain the reward and gain experience with the cost of that particular
outcome on a given day. Each lever was presented 5 times in a semi-random order. Once
subjects completed all 10 of the forced choice trials they were then presented with 20 free choice
trials in which subjects were presented with both levers and they were able to choose which one
they worked on to obtain the reward.
![Page 237: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/237.jpg)
222
Drug treatment during CVC testing. After 2 weeks of baseline training in the 20%
sucrose solution vs pellet condition, subjects were then tested over the course of 4 consecutive
weeks, receiving intraperitoneal (i.p). injections 20 minutes prior to the start of testing. Subjects
first received vehicle injections, followed by 0.75mg/kg SB242084, followed by another week of
vehicle, followed by another week of 0.75mg/kg SB242084. Subjects were then exposed to 2
baseline weeks of 5% sucrose vs pellets in the CVC and then underwent 4 consecutive weeks of
testing with the same drug treatment order as described for the 20% sucrose vs pellets condition.
Data was averaged across the 2 weeks of each treatment type.
Pavlovian Conditioning
Mice were trained for 13 consecutive days in a Pavlovian conditioning paradigm which
consisted of 12 CS+ trials and 12 CS- trials occurring in a pseudorandom order. In all sessions,
there was a 100s variable ITI, drawn from an exponential distribution of times, which occurred
between trials. In CS+ trials, a 10 second tone was presented followed by a 5 second presentation
of a dipper containing a liquid milk reward. In CS- trials, a different tone was played for 10
seconds followed by an ITI.
Concurrent Effort Choice
All subjects were first trained to press a lever using the basic lever press training
procedure for a rewards of evaporated milk (.01 ml) delivered by raising a dipper located inside
the feeder trough for 5 seconds. Subjects were then exposed to increasing RR schedules on a
single lever. The progression was as follows: 3 days on CRF, 3 days on RR05, 3days on RR10,
and 3 days on RR20. Subjects were next trained to make lever holds on the opposite lever. This
was done using the VIH training program. Subjects were exposed to 3 days of each of the
following VIH programs in increasing order (0.5, 1.0, 2.0, 4.0, 6.0, 8.0, 10.0 sec).
![Page 238: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/238.jpg)
223
Once subjects were trained to make presses on lever A and holds on lever B they were
then moved to the CEC task. In this task, subject could lever press on one lever, or lever hold on
the opposite lever to obtain a reward. The session consisted of two separate phases. First, there
were 10 forced choice trials in which either the press lever or the hold lever was presented (each
lever was presented 5 times). Once subjects completed all 10 of the single lever forced choice
trials, they were then presented with 40 free choice trials. In this phase of the session subjects
were presented with both levers and they were able to choose which one they worked on to
obtain the reward.
In this experiment, we maintained the hold duration required at a constant value (10
seconds), and varied the press requirement over days. We used the following press requirements
(10, 20, 40, and 80).
Drug treatment during CEC testing. After 2 weeks of baseline training in the CEC
task, subjects were then tested over the course of 4 consecutive weeks, receiving i.p. injections
20 minutes prior to the start of testing. Subjects first received vehicle injections, followed by
0.75mg/kg SB242084, followed by another week of vehicle, followed by another week of
0.75mg/kg SB242084. Data was averaged across the 2 weeks of each treatment type.
Progressive Ratio
Prior to drug testing subjects were trained for the progressive ratio testing in an identical
manner as described in Chapter 6.
Drug treatment during PR testing. Subjects were tested in a PR 3 x 2 over 2
consecutive weeks for 5 days per week, receiving i.p. injections 20 minutes prior to testing. Half
of the subjects received 5 days of vehicle followed by 5 days of 0.75mg/kg SB242084, and half
![Page 239: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/239.jpg)
224
received 5 days of 0.75mg/kg SB242084 followed by 5 days of vehicle. Data is averaged over
the 5 days of testing in each condition.
Extinction
Subjects were first trained with basic lever pressing procedures as described in chapter 6.
Subjects were then run on increasing RR schedules over days (RR5, RR10, and RR20) being
tested on each schedule for 3 consecutive days.
Drug Testing During Extinction. Following the third day on a RR20 schedule, subjects
were assigned into either a vehicle or drug group, matching the 2 groups based on press rates in
RR20 so there were no baseline differences between the groups. Subjects were then tested in a
60 minute extinction session which was identical in all ways to the RR training except reward
was never delivered. Subjects received i.p. injections of either vehicle or 0.75mg/kg SB242084
20 minutes prior to behavioral testing.
Effort Based Choice
Basic lever press training was carried out in a manner identical to that described for this
procedure in Chapter 6.
Fast Scan Cyclic Voltammetry
Adult mice 10-12 weeks old at the start of the experiment were implanted during
stereotaxic surgery with a custom made carbon fiber electrode and reference electrode
(Ag/AgCl). During the surgery, a bipolar electrode is temporarily introduce to stimulate the
medium forebrain bundle while dopamine is simultaneously registered in the NAc. The
recording electrode is lowered to the maximum release spot identified (coordinates relative to
bregma: anteroposterior, +1.2 mm; mediolateral, +1.5 mm; dorsoventral, -3.6 mm). After the
surgery the animal is allowed to recover for 2 weeks before starting food restriction.
![Page 240: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/240.jpg)
225
During experimental recording sessions, the chronically implanted carbon-fiber
microelectrodes were connected to a head-mounted voltammetric amplifier for DA detection by
fast-scan cyclic voltammetry. The potential applied to the carbon fiber was ramped from 0.4 V
(vs Ag/AgCl) to 1.3 V and back at a rate of 400 V/s during a voltammetric scan and held at 0.4 V
between scans. During the first 15 min, mice were cycled at 60Hz,then scans were repeated at a
frequency of 10 Hz throughout the session (DA detection: approximately 0.7 and 0.3 V peak
oxidation and reduction potentials, respectively, which can be measured as changes in current).
To ensure that electrodes were capable of detecting dopamine, unexpected food pellets
were delivered before and after a recording session to elicit dopamine release. The average
dopamine release for this portion was used for normalization. Chemical verification of dopamine
was achieved by obtaining high correlation of the cyclic voltammogram (electrochemical
signature) to that of a dopamine standard (correlation coefficient, r2 0.75 by linear regression).
Voltammetric data analysis were performed using software written in LabVIEW
(Tarheel) and low-pass filtered at 2KHz. Dopamine was isolated from the voltammetric signal
using chemometric analysis and a standard training set of stimulated dopamine release detected
by chronically implanted electrodes. Background was set 0.5s before the event.
Microdialysis
Adult mice were implanted with a cannula during stereotaxic surgery which was custom
designed for a microdialysis probe. The coordinates for the NAc probe were: relative to bregma:
anteroposterior, +1.2 mm; mediolateral, +1.5 mm; dorsoventral, -3.6 mm, and the coordinates for
the DS probe were relative to bregma: anteroposterior, +1.2 mm; mediolateral, +1.5 mm;
dorsoventral, -1.5 mm. After the surgery the animals were allowed to recover for 2 weeks before
starting food restriction.
![Page 241: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/241.jpg)
226
On dialysis sampling days samples were collected at 20 minute intervals. 3 samples were
taken as a baseline, prior to behavioral testing. Ten minutes after the 3rd baseline sample was
collected, subjects received an i.p. injection of either vehicle or 0.75mg/kg SB242084, and
another sample was taken after 20 minutes elapsed. At this point, subjects began working in the
effort based choice task and were sampled every 20 minutes for an hour throughout behavior.
Finally, 1 post-behavior sample was collected 20 minutes after the end of the behavioral session.
Results
Systemic SB242084 Treatment Does Not Alter Animal’s Choice for Different Types of
Reward
Here we examined if SB242084 mediates its effect on motivation by changing the
animal’s behavioral response to reward. We implemented a novel concurrent choice paradigm, in
which the mice could choose between lever pressing for food pellets or lever pressing for a
sucrose solution (Figure 1A, and see Chapter 4 for details). By varying the concentration of
sucrose across sessions and keeping the quantity of pellets delivered constant, we could
determine the point of subjective equality (PSE) for pellets and sucrose concentration for each
individual subject. Figure 1B shows that SB242084 treatment did not affect reward choice in this
task. While we found a significant main effect of the pellet cost (F (3, 15) = 249.203, p < 0.001),
and sucrose concentration (F (1, 15) = 319.293, p < 0.001), which resulted in a pellet cost x sucrose
concentration interaction (F (3, 15) = 18.101, p < 0.001), there was no effect of treatment with
SB242084 (F (1, 15) = 0.130, p = 0.719) or any drug interactions in this task. Moreover, as Figure
1C shows, the PSE for each subject was unaltered by SB242084 treatment (F (1, 15) = 0.131, p =
0.719), as the sucrose was the only factor which significantly impacted the PSE (F (1, 15) =
178.463, p < 0.001). This result shows that the enhancement in goal directed activity driven by
![Page 242: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/242.jpg)
227
SB242084 is not due to a change in the computation of the relative reward value of different
options.
Figure 7.1. SB242084 treatment does not alter reward value sensitivity. (A). Schematic diagram
of the CVC task. (B). Choice functions of proportion of sucrose choices made in the CVC task for
5% sucrose (Left) and 20% sucrose (Right) following either vehicle (blue) or SB (red) treatment.
(C). Individual PSE’s following either vehicle (blue) or SB treatment (red) plot for 5 or 20%
sucrose vs pellet.
SB242084 Enhanced Goal Directed Activity is Not Due to Altered Encoding of Rewards or
Reward Cues
Although SB242084 enhanced responding in reinforced instrumental tasks does not
appear to be due to altered reward preference, the motivational effect of SB242084 could result
from a general increase in sensitivity to all rewards. Because reinforcement value is encoded by
mesolimbic dopamine, we used Fast Scan Cyclic Voltammetry (FSCV) to determine if
SB242084 alters dopamine released in the NAc in response to primary reinforcers or cues
![Page 243: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/243.jpg)
228
predicting reinforcers. Mice chronically implanted with carbon fiber electrodes were trained in a
Pavlovian schedule in which two different auditory stimuli were pseudo-randomly presented.
One stimulus (CS+) was paired to delivery of a food pellet (US) while the other stimulus (CS-)
was not. Details of the procedure are provided in figure 2A. All mice successfully learned the
Pavlovian association. Figure 2B shows the number of head entries during the 10s CS - the heads
entries during the previous 10 s of ITI for each CS type. On days 15 and 16, mice received an i.p.
injection of either SB242084 or vehicle 20 minutes before the behavioral session (order was
counterbalanced across the group), during which dopamine release was recorded.
Figure 2C depicts dopamine release in response to CS+ and CS- cues, normalized to
unexpected pellet evoked release recorded for each subject (see methods for details). Repeated
Measures ANOVA determined that cue-evoked maximum release was affected by trial type
(CS+ or CS-), (F (1, 58) = 38.38, p < 0.0001), but not by drug treatment (F (1, 58) = 0.246, p =
0.6218) and no trial type x drug interaction was observed (F (1, 58) = 0.2998, p = 0.5861), Fig 2D).
Similarly, no trial type x drug interaction was observed for dopamine release accumulation
within 10s of tone onset (F (1, 58) = 0.1303, p = 0.7195, Fig 2E). Figure 2F depicts dopamine
release in response to US delivery, or in the case of CS- trials, dopamine release after cue offset,
as no reward was delivered in those trials. Repeated Measures ANOVA determined that US-
evoked maximum release was affected by trial type (US+ or US-), (F (1, 55) = 30.35, p < 0.0001),
but not by drug treatment (F (1, 55) = 2.448, p = 0.1234) and no trial type × drug interaction was
observed (F (1, 55) = 1.853, p = 0.1790), Fig 2G). Again, no trial type × drug interaction was
observed for dopamine release accumulation within 5s of reward delivery/tone offset
(Interaction: F (1, 55) = 0.2474, p = 0.6209, Fig 2H). These results show that SB242084 has no
![Page 244: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/244.jpg)
229
impact on the NAc dopamine encoding of rewards, or cues that are have already been established
to predict reward.
Figure 7.2. SB242084 Does Not Alter the Encoding of Rewards or Reward Cues. (A) Schematic diagram
of the Pavlovian task. (B) CS- ITI head entries demonstrates Pavlovian learning. (C) Normalized CS+ and
CS- dopamine response traces. The solid vertical line indicates CS onset, the dashed vertical line depicts
2s prior to CS onset, used as background value. (D) The maximal CS+ and CS- evoked dopamine release.
(E) CS+ and CS- accumulated dopamine release (Area under the curve). (F) US+ and US - dopamine
response traces. The solid vertical line indicates CS offset (and reward delivery in CS+ trials), the dashed
vertical line depicts 2s prior to CS offset, used as background value. (G) The maximal CS+ and CS-
evoked dopamine release. 2h US+ and US- accumulated dopamine release (Area under the curve). In
Figures (D) and (G), data point represent individual trials, line represents mean and errors represent SEM.
In figures (E) and (H), boxes represent the time during which the dopamine levels are analyzed during the
task.
![Page 245: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/245.jpg)
230
Systemic SB242084 Treatment Does Not Alter Animal’s Choice for Different Forms of
Work
The choice to execute goal directed behaviors result from computation of relative costs
and benefits of different options. Our FSCV studies showed that SB242084 does not alter benefit
(reward) encoding; therefore, we focused the remainder of our studies on the cost component.
We previously showed that SB242084 enhances performance in tasks that involve different types
of work requirements. SB242084 improved performance on a progressive schedule of
reinforcement where an increasing number of lever presses are required for each successive
reward (Simpson et al., 2011, Bailey et al., 2016) and when single lever presses of increasing
duration are required for each successive reward (Bailey et al., 2016). To determine if acute
SB242084 treatment would alter animal’s preference for these two different types of work, we
implemented a concurrent choice task in which mice can earn reinforcers by holding one lever
for a fixed period of time, or by pressing a second lever for a fixed number of presses (Figure
3A, for details see Chapter 4). By varying the lever press requirement across sessions while
keeping the time requirement of the holding lever constant, effort choice functions and the point
of subjective equality (PSE) for holding and pressing were obtained for each subject. Figure 3B
shows that effort choice changed as a function of the lever press requirement (F (3, 14) = 75.848, p
< 0.001), but treatment with SB242084 did not affect preference for holding versus pressing in
this task (F (1, 14) = 2.077, p = 0.153). The PSE for each subject is shown in Figure 3C, and this
measure was also not altered by SB242084 treatment (t (29.97) = 0.5275, p = 0.602). Additional
evidence that SB242084 treatment had no impact on this measure was the extremely high
correlation, r (15) = .955, between a subjects vehicle PSE and SB242084 PSE, Figure 3D. These
![Page 246: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/246.jpg)
231
results imply that the SB242084 enhancement of goal directed action is not due to a change in
the computation of the relative effort of different response options.
Figure 7.3. SB242084 Treatment Does Not Alter Effort Sensitivity. (A) Schematic diagram of
CEC task. (B) Average function of proportion of hold choices made in the CEC task following
vehicle (blue) or SB (red) treatment. (C) The PSE of individual subjects between lever pressing
and holding for 10 seconds following vehicle (blue) or SB (red) treatment. (D) Vehicle PSE x SB
PSE scatter plot.
The Functionally Selective 5-HT2C Ligand SB242084 Enhances Motivation in Goal
Directed Activity Across Different Work Types by Increasing Vigor and Persistence
We previously found that SB242084 enhanced responding in tasks of lever pressing
(Simpson et al., 2011, Bailey et al., 2016) and lever holding (Bailey et al., 2016), but the reason
for this enhanced willingness to continue responding is not the result of the drug altering or
![Page 247: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/247.jpg)
232
biasing subjects sensitivity to either of these two distinct types of work. To specifically identify
how treatment with SB242084 enhances responding in progressive schedules we carried out a
detailed analysis of the specific changes induced by the drug. In a progressive pressing task
(Figure 4A), treatment with SB242084 increased how long subjects continued to work (Figure
4B), confirmed with a Mantle-Cox log rank test of survival curves (ᵡ2 = 9.94; p = 0.0016). Figure
5c shows the increase in response rates following treatment with SB242084 (Veh: M = 53.48 +
7.03; SB: M = 69.83 + 9.49; t (20) = 4.652, p = 0.002).
Figure 7.4. SB242084 Enhances Goal-Directed Persistence in a PR. (A) Schematic of
progressive pressing task. (B) Survival curse of how long subjects continued pressing in a PR
schedule before quitting. (C) Left, Average press rates in PR schedule with and without
SB242084, Right, Changes in press rates for each subject.
Examination of the microstructure of lever pressing at a finer temporal resolution
revealed that SB242084 treatment altered the characteristics of bouts of lever pressing (defined
as consecutive responses made with less than 2 seconds elapsing between responses, Figure 5A).
Figure 5B shows that the average bout length increased with SB242084 treatment at high press
![Page 248: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/248.jpg)
233
requirements, and the maximum bout lengths subjects make in the session (Figure 5C) increased
as well (Veh: M = 15.01 + 2.137; SB: M = 21.58 + 3.0; t (20) = 4.876, p < 0.001). Finally, we
observed that SB treatment subjects executed responses within a bout more rapidly (Fig 5D), as
the Inter-Response-Times (defined as the time between consecutive lever presses) were
significantly shorter after drug treatment (Veh: M = 0.518 + 0.05; SB: M = 0.44 + 0.47; t (20) =
5.286, p < 0.001).
Figure 7.5. SB242084 Enhances Response Vigor in a PR. (A) Schematic of bouts of responding
(defined as consecutive responses with less than 2 seconds elapsing without a response). (B)
Average bout length as a function of press requirement in a PR schedule. (C) Maximum bout
lengths completed in the session. (D) Average inter-response times (IRTs) while subjects are
responding in a PR schedule with and without SB242084.
SB242084 Enhanced Goal Directed Activity is Not Due to an Increased Resistance to
Extinction
Persistence in responding could also be due to changes in extinction processes, especially
under progressive schedules in which greater numbers of non-reinforced responses are required
![Page 249: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/249.jpg)
234
as the schedule progresses. Therefore we tested the effect of systemic SB242084 on behavioral
extinction. Mice were trained on random ratio schedules of reinforcement, with the ratio
increasing progressively across sessions, up to RR20. After stable performance on this schedule,
mice were treated with either drug or vehicle 20 minutes before a testing session in which no
reinforcers were delivered. Figure 6A shows no differences in press rates between the two
treatment assignment groups during the last RR20 session prior to extinction testing (t (26) = 0.17,
p = 0.86).
Figure 7.6. SB242084 does not alter sensitivity to extinction. (A) Lever press rates during the last
RR20 session prior to the extinction test, in groups assigned to either saline or SB treatment. (B)
Rate of lever pressing in 5 minute bins during the extinction session, lines represent non-liner fit
of the group mean data to the exponential decay equation Y=(a × exp(-b × x). function. (C) Peak
press rate (a) during the extinction session. (D) Rate of decay (b) during extinction. (E) Goodness
of fit (R2) during extinction. In graphs (A, C-E), Symbols represent individual data points, lines
represent group mean, error bars represent SEM. n=14 per group.
![Page 250: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/250.jpg)
235
Figure 6B depicts the rate of decay in pressing during the extinction session for each
group fit to the exponential decay equation: y = (a × exp(-b × x). The lever press data for each
individual subject was fit to this equation to determine if extinction was affected by treatment.
Figure 6C shows that SB had no effect on the a parameter, which predicts the peak press rate, (t
(26) = 0.4125, p = 0.68). Figure 6D shows no effect of drug on rate of decay (t (26) = 0.47, p =
0.64). Figure 6E shows no effect of drug on goodness of fit (R2) (t (26) = 0.49, p = 0.62).
SB242084-Mediated Increase in Goal Directed Activity is Observed Only at Higher Work
Requirements.
Given that SB242084 makes responding more vigorous and persistent in progressive ratio
tasks without affecting extinction, we investigated how SB242084 impacted responding on trials
with fixed work requirements, across a range of demands. We tested the effect of SB242084 on
different Fixed Ratios (FRs (10, 20, 40, and 80) and found that the drug’s effects on performance
were noticeable only at higher work requirements. Figure 7A depicts the rate of lever pressing as
a function of time across trials for the 4 different work requirements. Differences in the
distribution of press rates across the trial increases with the number of lever presses required.
Therefore, we looked at the time taken to complete each trial as a function of lever press
requirement (Fig 7B). An ANOVA detected a significant main effect of ratio (F(3,42) = 63.69; p <.
0001), a significant main effect of drug treatment (F(1,14) = 17.4; p =. 0009), and a significant
ratio x drug treatment interaction (F(3,42) = 8.89; p = .0001). Bonferroni's multiple comparisons
test showed that time to complete the FR was significantly decreased by drug only in FR80 trials
(p <0.0001). We also looked at the IRTs (Figure 7C), and found that SB lead to a significant
decrease in average IRTs (F(3,42) = 8.453; p = .015), but there was no interaction with the ratio
![Page 251: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/251.jpg)
236
used. Along with our previous data from progressive ratio experiments, these results show that
SB242084 increases the vigor capacity.
Figure 7.7. SB242084 increases vigor in FR schedules. (A) Lever press rates per second as a
function of time within the trial are plotted for each FR schedule, FR10, 20, 40 and 80. Data
represent the mean and SEM for press rates from 15 subjects and 20 trials per ratio. (B) Time to
complete FR trials are presented for each trial type. (C) Average IRTs for the different FR
requirements. Data points represent the mean for each subject, error bars represent the SEM. n=
15.
Systemic SB242084 Potentiates Effort Dependent Dopamine Release in the Dorsomedial
Striatum and Not the NAc.
![Page 252: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/252.jpg)
237
Because response vigor is modulated by dopamine in the dorsal striatum (Pex et al.,
2016), we used in vivo microdialysis to examine the effect of SB242048 on dopamine level in
the dorsal striatum as well as the ventral striatum (NAc) during goal-directed behavior. Mice
were tested in an effort-based choice paradigm, in which they could lever press for a preferred
reinforcer (evaporated milk), or consume home cage chow that was freely available on the floor
of the test chamber. Because rate of reinforcement influences extracellular dopamine levels,
(Hamid et al., 2016), we set the work requirement such that SB242084 treatment would not
cause a significant increase in the number of reinforcers earned (RR15). Drug treatment
significantly increased extracellular dopamine in the dorsomedial striatum, selectively when
mice were engaged in behavior (Figure 8A). A repeated measures 2-way ANOVA revealed a
significant effect of treatment (F(2,14) = 10.34, p = 0.0017 and a significant treatment × time
interaction (F (14,98) = 4.33, p < 0.0001). The maximum increase in dopamine during the last 3
collection time points (Behavior 2 through Post-behavior) was significantly different across
conditions (Figure 8B, Work + Vehicle = 1.072 +/- 0.04, Work + SB242084 =1.376 +/- 0.63, No
Work + SB242084 = 1.1+/-0.061, RM one-way ANOVA: (F(1.756,12.29) = 7.112, p = 0.01). No
increase in dopamine in the NAc was observed (figures 8C and 8D). Placements of the
microdialysis probes are represented in Figure 8E.
We analyzed the behavior to see if the increased extracellular dopamine in the dorsal
striatum was related to any changes in behavior. As shown in Figure 8F, we found that subjects
treated with SB made more lever presses for the preferred reward throughout the entire session
compared to vehicle (t(7) = 3.093, p = 0.0175). This increased rate of lever pressing was the result
of subjects executing successive presses more rapidly, as the IRTs we significantly lower
following SB treatment (Figure 8G; t(7) = 2.516, p = 0.0401), reflecting enhanced vigor.
![Page 253: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/253.jpg)
238
Figure 7.8. SB242084 potentiates dopamine release in the DMS. (A). DMS dopamine level
across time in each condition tested, Work+ Vehicle, Work + SB, No Work + SB. / SB behavior/
vehicle behavior. (B). Maximum increase in DMS DA release during behaviour. (C). NAc
dopamine level across time in each condition tested, Work+ Vehicle, Work+ SB, No Work + SB.
/ SB behavior/ vehicle behavior. (D). Maximum increase in NAc DA release during behaviour.
(E). Placement of the microdialysis probes. Light grey represents NAc probes, dark grey
represents DMS probes. Positions of the sections relative to bregma are provided. (F) Total lever
presses during the 60 minute behavioral session increased with SB242084. (G) Average inter-
response times (IRTs) decreased with SB242084.
Discussion
We previously showed that the functionally selective ligand of a serotonin receptor,
SB242084, increases motivation in mice (Simpson et al., 2011, Avlar et al., 2015; Bailey et al.,
2015). Because motivated behaviors are driven by a computation of the relative the costs and
![Page 254: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/254.jpg)
239
benefits of particular actions, there are multiple possible mechanisms by which a drug such as
SB242084 might enhance motivation. At the stage of processing incoming information, both the
reward value and effort demands must be encoded and cost-benefit computations must be made
in order to guide the selection of actions. In addition to action selection, this information is also
used to modulate the vigor (speed, amplitude and frequency) and duration that actions are
sustained to overcome the effort demanded by a given situation. Our behavioral analyses
dissected these different processes and identified the mechanism by which SB242084 enhances
motivated behavior in mice.
Reaction to reward is not altered by SB242084 treatment
We previously showed the SB242084 enhances goal directed behavior in mice (Simpson
et al., 2011, Avlar et al., 2015, Bailey et al., 2016) was not accompanied by an increase in daily
food intake, but home cage feeding may not reflect possible changes in reactivity to appetitive
reinforcers. Here we directly tested the effect of SB242084 on reward choice and found that the
drug did not alter preference or sensitivity for different types of rewards. We also evaluated the
effect of SB242084 on reward processing at the neurochemical level. Using FSCV we found that
SB242084 does not alter dopamine release in the NAc in response to reward or cues that predict
reward. Because VTA dopamine neuron activity and dopamine release in the NAc are both
associated with reward expectation (Schultz, 1998), our FSCV data also suggest that the
enhancement in motivation following treatment with SB242084 is not the result of altered
reactivity to rewards.
SB242084 treatment does not alter sensitivity to effort demands
Because treatment with SB242084 did not alter the encoding of value or reward
reactivity, we next examined whether the drug altered sensitivity to effort demands. To test if
![Page 255: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/255.jpg)
240
drug-treated subjects kept working longer because altered encoding of effort, we applied our
recently developed Effort Based choice task (Chapter 4). We found that SB242084 did not alter
preference for different types of work, as subjects made the decision to switch work options at
the same lever press requirement on and off of drug, suggesting that SB242084 is not altering
how effort requirements are encoded or subsequently acted upon.
Behavioral effects of SB242084 treatment are only observed when subjects are engaging in
goal-directed behavior
Earlier, we showed that SB242084 increases instrumental responding (Simpson et al.,
2011). However, some increases in instrumental responding can be due to a generalized increase
in locomotor activity, rather than an increase in goal-directed behavior, as is the case with some
stimulants (Bailey et al., 2015). We previously applied a battery of behavioral tasks and
determined that SB242084 results in an increase in both the number of responses and the
duration of time a subject maintains a single response, indicating increased effort expenditure for
food rewards without non-specific hyperactivity or arousal (Bailey et al., 2015). Another
possible reason for the increased responding could also result from SB242084 causing changes
in extinction processes, especially since progressive schedules expose subjects to greater
numbers of non-reinforced responses as the schedule progresses. SB242084 treatment had no
impact on the decline in response rate over the whole course of an extinction session. This
strongly suggests that the drug’s effect on goal-directed responding only occurs when the goal is
obtainable.
SB242084 treatment increases total goal-directed output by increasing the vigor and
persistence of responding
![Page 256: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/256.jpg)
241
We ruled out altered reward processing, sensitivity to effort demands, and resistance to
extinction as possible mechanisms by which SB242084 acts to increase motivation. We next
performed a more detailed analysis of the temporal dynamics of responding in PR. In addition to
SB242084 increasing the overall number of responses and duration of time subjects continue
working, we also found that subjects executed larger numbers of responses in each bout of
responding as the effort requirements increased. Moreover, the responses within these bouts were
executed more rapidly, as subjects reached higher peak response rates when tested with
increasing FR schedules. These findings in PR and FR tasks show that the drug increases
response vigor and goal directed persistence.
SB242084 treatment increases effort related dopamine release in the dorsal striatum and
not the NAc.
We measured extracellular dopamine in the NAc and DMS during an effort based choice
task and found that SB242084 treatment potentiated dopamine release in the DMS concomitant
with an increase in response vigor. This dopamine increase appeared to be the result of the
subject working to earn rewards, as treatment with SB242084 did not increase dopamine levels
when subjects were not engaging in an effortful goal-directed task. Additionally, there was a
delay in the increase in dopamine levels, which could be partially due to a lag in the sensitivity
of the dialysis sampling method, or to the necessity for subjects to be actively responding and
earning rewards to see a drug effect on both dopamine release and behavior.
Several lines of evidence suggest that an increase in dopamine in the DMS can support
the observed increase in response vigor. In rats, lesion of the DMS impairs the modulation of
response vigor in response to changes in the rate of reinforcement (Wang et al., 2013). Neuronal
recordings in the DMS in both rodents and primates have identified neurons that encode aspects
![Page 257: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/257.jpg)
242
of response vigor including velocity and amplitude of movements (Opris et al., 2011; Panigrahi
et al., 2015). Dopaminergic input into DMS is essential for adaptive changes in movement vigor,
as evidenced by the loss of vigor modulation in a mouse model of Parkinson’s disease in which
dopaminergic innervation of the DMS progressively declines (Panigrahi et al., 2015). When
nigrostriatal dopamine levels decline, mice are selectively unable to complete the most
demanding trials, while still being able to complete trials that require less vigor. Therefore,
dopamine tone is particularly important when high effort is demanded. Further evidence that
DMS dopamine modulates vigor comes from optogenetic experiments. Response vigor, as
measured through the velocity with which mice can move a joystick, can be increased by
stimulating striatal output neurons in the direct pathway and decreased by stimulating neurons in
the indirect pathway. This bidirectional modulation of movement velocity is dopamine
dependent as dopamine antagonists block the optogenetically-driven changes in response vigor
(Yttri et al., 2016).
5-HT2c receptor modulation represents an in road for the development of treatment for
apathy
Deficits in motivation are a debilitating symptom of several psychiatric and neurological
conditions for which there are currently no treatments available. SB242084 treatment does not
result in a generalized hyper-locomotor or hyper-arousal state; rather, it enhances response vigor
during the ongoing pursuit of goals. This suggests a complex interaction in the serotonin and
dopamine systems that regulates the dynamics of motivated behavior. Because SB242084 only
potentiated dopamine when animals were working, this indirect modulation of dopamine may
represent a significantly lower abuse potential than drugs that directly increase dopamine release.
![Page 258: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/258.jpg)
243
A more detailed understanding of how SB242084 increases effort-related dopamine release may
permit the development of treatments for patients with debilitating motivational deficits.
References
Alex, K. D., & Pehek, E. A. (2007). Pharmacologic mechanisms of serotonergic regulation of
dopamine neurotransmission. Pharmacology & Therapeutics, 113(2), 296-320.
Avlar, B., Kahn, J. B., Jensen, G., Kandel, E. R., Simpson, E. H., & Balsam, P. D. (2015).
Improving Temporal Cognition by Enhancing Motivation. Behavioral Neuroscience,
129(5), 576-588.
Bailey, M., Williamson, C., Mezias, C., Winiger, V., Silver, R., Balsam, P., & Simpson, E.
(2015). The effects of pharmacological modulation of the serotonin 2C receptor on goal-
directed behavior in mice. Psychopharmacology, 1-10.
Bailey, M. R., Jensen, G., Taylor, K., Mezias, C., Williamson, C., Silver, R., Balsam, P. D.
(2015). A novel strategy for dissecting goal-directed action and arousal components of
motivated behavior with a progressive hold-down task. Behavioral Neuroscience, 129(3),
269-280.
Balleine, B. W., Delgado, M. R., & Hikosaka, O. (2007). The role of the dorsal striatum in
reward and decision-making. Journal of Neuroscience, 27(31), 8161-8165.
Barch, D. M., Treadway, M. T., & Schoen, N. (2014). Effort, anhedonia, and function in
schizophrenia: Reduced effort allocation predicts amotivation and functional impairment.
Journal of Abnormal Psychology, 123(2), 387-397.
Berg, K. A., Dunlop, J., Sanchez, T., Silva, M., & Clarke, W. P. (2008). A Conservative, Single-
Amino Acid Substitution in the Second Cytoplasmic Domain of the Human
Serotonin 2C Receptor Alters Both Ligand-Dependent and -Independent Receptor
Signaling. Journal of Pharmacology and Experimental Therapeutics, 324(3), 1084-1092.
Bissonette, G. B., & Roesch, M. R. (2016). Neurophysiology of Reward-Guided Behavior:
Correlates Related to Predictions, Value, Motivation, Errors, Attention, and Action. In E.
H. Simpson & P. D. Balsam (Eds.), Behavioral Neuroscience of Motivation (pp. 199-
230). Cham: Springer International Publishing.
De Deurwaerdère, P., Navailles, S., Berg, K. A., Clarke, W. P., & Spampinato, U. (2004).
Constitutive Activity of the Serotonin2C Receptor Inhibits -In Vivo- Dopamine Release
in the Rat Striatum and Nucleus Accumbens. The Journal of Neuroscience, 24(13), 3235-
3241.
De Deurwaerdere, P., & Spampinato, U. (1999). Role of serotonin(2A) and serotonin(2B/2C)
receptor subtypes in the control of accumbal and striatal dopamine release elicited in vivo
![Page 259: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/259.jpg)
244
by dorsal raphe nucleus electrical stimulation. Journal of Neurochemistry, 73(3), 1033-
1042.
Di Giovanni, G., De Deurwaerdére, P., Di Mascio, M., Di Matteo, V., Esposito, E., &
Spampinato, U. (1999). Selective blockade of serotonin-2C/2B receptors enhances
mesolimbic and mesostriatal dopaminergic function: a combined in vivo
electrophysiological and microdialysis study. Neuroscience, 91(2), 587-597.
Di Matteo, V., Di Giovanni, G., Di Mascio, M., & Esposito, E. (1999). SB 242 084, a selective
serotonin(2C) receptor antagonist, increases dopaminergic transmission in the
mesolimbic system. Neuropharmacology, 38(8), 1195-1205.
Di Matteo, V., Di Giovanni, G., Pierucci, M., & Esposito, E. (2008). Serotonin control of central
dopaminergic function: focus on in vivo microdialysis studies. In V. D. M. Giuseppe Di
Giovann & E. Ennio (Eds.), Progress in Brain Research (Vol. Volume 172, pp. 7-44):
Elsevier.
Dudman, J. T., & Krakauer, J. W. (2016). The basal ganglia: from motor commands to the
control of vigor. Current Opinion in Neurobiology, 37, 158-166.
Gan, J. O., Walton, M. E., & Phillips, P. E. M. (2010). Dissociable cost and benefit encoding of
future rewards by mesolimbic dopamine. Nat Neurosci, 13(1), 25-27.
Gesty-Palmer, D., & Luttrell, L. M. (2011). Refining Efficacy: Exploiting Functional Selectivity
for Drug Discovery. In R. R. Neubig (Ed.), Pharmacology of G Protein Coupled
Receptors (Vol. 62, pp. 79-107). Amsterdam: Elsevier Science Bv.
Goldstein, B. L., Barnett, B. R., Vasquez, G., Tobia, S. C., Kashtelyan, V., Burton, A. C.,
Roesch, M. R. (2012). Ventral Striatum Encodes Past and Predicted Value Independent
of Motor Contingencies. The Journal of Neuroscience, 32(6), 2027-2036.
Green, M. F. (2016). Impact of cognitive and social cognitive impairment on functional
outcomes in patients with schizophrenia. The Journal of clinical psychiatry, 77 Suppl 2,
8-11.
Hamid, A. A., Pettibone, J. R., Mabrouk, O. S., Hetrick, V. L., Schmidt, R., Vander Weele, C.
M., Berke, J. D. (2016). Mesolimbic dopamine signals the value of work. Nat Neurosci,
19(1), 117-126.
Lau, B., & Glimcher, P. W. (2007). Action and Outcome Encoding in the Primate Caudate
Nucleus. The Journal of Neuroscience, 27(52), 14502-14514.
Opal, M. D., Klenotich, S. C., Morais, M., Bessa, J., Winkle, J., Doukas, D., Dulawa, S. M.
(2014). Serotonin 2C receptor antagonists induce fast-onset antidepressant effects. Mol
Psychiatry, 19(10), 1106-1114.
Opris, I., Lebedev, M., & Nelson, R. J. (2011). Motor Planning under Unpredictable Reward:
Modulations of Movement Vigor and Primate Striatum Activity. Frontiers in
Neuroscience, 5, 61.
![Page 260: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/260.jpg)
245
Ostlund, S. B., Wassum, K. M., Murphy, N. P., Balleine, B. W., & Maidment, N. T. (2011).
Extracellular Dopamine Levels in Striatal Subregions Track Shifts in Motivation and
Response Cost during Instrumental Conditioning. Journal of Neuroscience, 31(1), 200-
207.
Oyama, K., Hernádi, I., Iijima, T., & Tsutsui, K.-I. (2010). Reward Prediction Error Coding in
Dorsal Striatal Neurons. The Journal of Neuroscience, 30(34), 11447-11457.
Panigrahi, B., Martin, K. A., Li, Y., Graves, A. R., Vollmer, A., Olson, L.,Dudman, J. T. (2015).
Dopamine Is Required for the Neural Representation and Control of Movement Vigor.
Cell, 162(6), 1418-1430.
Rangel, A., & Hare, T. (2010). Neural computations associated with goal-directed choice.
Current Opinion in Neurobiology, 20(2), 262-270.
Salamone, J. D., & Correa, M. (2012). The mysterious motivational functions of mesolimbic
dopamine. Neuron, 76(3), 470-485.
Salamone, J. D., Pardo, M., Yohn, S. E., López-Cruz, L., SanMiguel, N., & Correa, M. (2016).
Mesolimbic Dopamine and the Regulation of Motivated Behavior. In E. H. Simpson & P.
D. Balsam (Eds.), Behavioral Neuroscience of Motivation (pp. 231-257). Cham: Springer
International Publishing.
Samejima, K., Ueda, Y., Doya, K., & Kimura, M. (2005). Representation of Action-Specific
Reward Values in the Striatum. Science, 310(5752), 1337-1340.
Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology,
80(1), 1-27.
Simpson, E. H., & Balsam, P. D. (2016). The Behavioral Neuroscience of Motivation: An
Overview of Concepts, Measures, and Translational Applications (pp. 1-12). Berlin,
Heidelberg: Springer Berlin Heidelberg.
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2011). Pharmacologic Rescue of Motivational Deficit in an Animal Model
of the Negative Symptoms of Schizophrenia. Biological Psychiatry, 69(10), 928-935.
Stalnaker, T. A., Calhoon, G. G., Ogawa, M., Roesch, M. R., & Schoenbaum, G. (2012). Reward
Prediction Error Signaling in Posterior Dorsomedial Striatum Is Action Specific. The
Journal of Neuroscience, 32(30), 10296-10305.
Wang, A. Y., Miura, K., & Uchida, N. (2013). The dorsomedial striatum encodes net expected
return, critical for energizing performance vigor. Nature Neuroscience, 16(5), 639.
Yttri, E. A., & Dudman, J. T. (2016). Opponent and bidirectional control of movement velocity
in the basal ganglia. Nature, 533(7603), 402-406.
![Page 261: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/261.jpg)
246
Chapter 8
Conclusion
Motivation consists of multiple interacting sub-processes which energize and direct
behavior in pursuit of a goal in a manner that is sensitive to changing costs and benefits in the
environment. The studies contained within this dissertation reflect my attempt to better
understand several reasons one might observe a change in motivated behavior. I first focus on
how we measure motivational sub-processes, and then employ a multi-process perspective to
investigate two neurotransmitter receptors involved in different aspects of motivated behavior,
the dopamine D2 receptor and the serotonin 2c receptor. In this final chapter, I highlight some of
the key insights and discoveries made by examining sub-processes involved in motivation.
8.1 Using response duration as the unit of “work” in the Progressive Hold
Down task helps to show that not all changes in behavior occur for the same
reason
For many years, investigators relied on a single behavioral measure as an assay of
motivated behavior. Following the original use of the progressive ratio schedule as an assay of
motivation (Hodos, 1963), numerous studies used the PR alone as the measure of a subject’s
motivational state (Brown et al., 1998; McCue et al., 2017; Pandit et al., 2014). We now
understand enough about motivation and the many sub-processes it involves to recognize that
this single-assay approach is over simplistic: for example, we know that motivation consists of
both an activational and direction component (Duffy, 1957; Hebb, 1955; Salamone, John D.,
1988), as it both invigorates behavior to overcome effort requirements and directs behavior
toward relevant stimuli.
![Page 262: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/262.jpg)
247
After observing that most known manipulations that increase or decrease responding in a
PR (Cagniard et al., 2006; Mayorga et al., 2000; Randall et al., 2012) also increase or decrease
general locomotor activity in an open field, respectively (Antoniou et al., 2005; Cagniard et al.,
2006; Hall et al., 2008), (Aberman et al., 1998; Sanders et al., 2007; Simpson et al., 2011)
(Kellendonk et al., 2006; Sanders et al., 2007; Simón et al., 2000),I became particularly
interested in whether responding in a PR could be influenced by general motor activity increases
which are not necessarily reflective of increased goal-directedness.
To test this question, I considered the underlying assumption behind why subjects quit
working in a PR task – namely, when the work requirement becomes so great that the animal
decides the reward is not worth the effort required. When the work requirement is measured by
the number of lever presses made, animals aided by hyperactivity and have a greater ability to
rapidly execute the work required. To test a subject’s willingness to work in a progressive
schedule in a manner such that they would not be benefitted by hyperactive responding, I
changed the unit of work from number of responses to the time a single response must be
maintained. In creating the Progressive Hold Down task (PHD), which requires subjects to make
increasingly longer holds for each subsequent reward, I was able to test willingness to continue
working in a manner incompatible with hyperactivity.
In studies carried out to characterize behavior in the PHD task, I found that progressively
increasing the time required for a single action produces similar behavioral results as observed in
different PR conditions. Hungry animals will make holds of longer durations and continue
working longer before quitting than sated animals, demonstrating that motivation is related to
performance in the PHD task. Additionally, mice will make longer holds and will continue
working for longer durations when the reward magnitude is increased. In order to test what
![Page 263: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/263.jpg)
248
additional information the PHD task can provide, I then treated mice with methamphetamine
(METH) at a dose known to enhance locomotor output and tested them in both a PR task and a
PHD task. Mice treated with METH made more lever presses and continued working longer in
the PR task, a result which had previously been interpreted as an increase in motivation. When
tested in the PHD following treatment with METH, however, we observed a large increase in the
number of lever hold attempts, but the duration of these responses was extremely short and did
not result in earning more rewards. A more careful characterization of behavior in the PHD
following treatment with METH demonstrated that the increase of extremely short, rapidly
executed responses was characteristic of hyperactive responding.
Potential implications of these findings
It is my hope that the development of the PHD task will cause other researchers studying
motivation to consider why they are observing a change in motivated behavior following a
manipulation. The results from the METH PR-PHD experiments show that not all changes in
behavior are attributable to the same underlying cause. One example of this already occurred in a
recent set of studies which found that decreasing the function of the striatopallidal “no-go”
pathway, using the Gαi-coupled designer receptor hM4D, results in increased activational
behavioral effects. Decreasing striatopallidal function in mice resulted in a large increases in
lever pressing in a PR, and large numbers of short duration rapid responses in a PHD task
(Carvalho Poyraz et al., 2016), similar to that seen with METH. By employing both a PR and
PHD this study was able to develop a more nuanced understanding of how decreasing
striatopallidal function alters motivated responding, and it is my hope that future studies may be
able to benefit from this approach as well.
![Page 264: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/264.jpg)
249
8.2 Dopamine D2 receptor over-expression in the striatum specifically alters
effort related aspects of cost-benefit decision making
A large number of patients with schizophrenia suffer from impaired goal-directed
motivation (Faerden et al., 2009; Kiang et al., 2003; Roth et al., 2004), for which there are no
effective pharmacological treatments. Preclinical research in rodents represents an important
avenue to understand how motivational processes work, and offers the opportunity to screen new
potential drug candidates to alleviate this symptom. I was specifically interested in examining
motivational processes in a mouse model which over-expresses the dopamine D2R within the
striatum (D2R-OE) (Kellendonk et al., 2006; Ward et al., 2011), as this model has been found to
have impaired goal-directed motivation (Drew, M. R. et al., 2007; Simpson et al., 2011). Because
many different processes underlie motivated behavior, I wanted to examine this model to
understand which processes are altered in motivation impairments.
Previous studies identified that D2R-OE mice have motivation deficits in a PR task
(Drew, M. R. et al., 2007; Simpson et al., 2011). I first examined cost-benefit decision making in
these mice using an assay of effort-related choice, showing that D2R-OE mice have a reduced
willingness to work for a preferred reward (Salamone, 1991). I wanted to determine whether this
reduced willingness to work was due to a deficit in value- or effort-related decision making, as
evidence exists which suggests both could be disrupted (Drew, Michael R. et al., 2007; Simpson
et al., 2011; Ward et al., 2011; Ward et al., 2015).
D2R-OE mice are able to use changes in reward value to guide decision making
I tested the D2R-OE mice in a value based decision making task which I developed and
characterized in chapter 4. While previous work suggested D2R-OE mice might have a deficit
processing reward value information because these mice do not match their response rates to
![Page 265: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/265.jpg)
250
differential rates of reward (Ward et al., 2012), and they fail to use information about the
probability of reward payoff to guide their behavior (Ward et al., 2015).I found that the D2R-OE
mice are sensitive to both reward magnitude (sucrose concentration shifts) and reward
devaluation. The D2R-OE mice were able to use information about reward value to guide their
behavior. These results show that D2R-OE mice can use reward value specific information to
guide their behavior.
D2R-OE mice have a differential sensitivity to the effort to number (repeated action
initiation) vs time (maintaining a single action)
Because the D2R-OE mice were able to use reward value information to guide their
behavior, I next assessed their ability to use effort-related information to guide decision making.
I tested the D2R-OE mice in an effort based decision making task which I developed and
characterized in chapter 4. This task gives the mice an option between lever pressing and lever
holding to earn rewards, and we determined effort sensitivity functions by looking at the choice
behavior as we systematically vary the effort requirements. We discovered that D2R-OE mice
have an altered sensitivity to effort requirements of response number (having to repeatedly
initiate actions) as they strongly preferred the cost of time (maintaining a single sustained
action). By examining the dynamics of responding in these two types of work conditions we
found that the D2R-OE mice are equally adept at executing the action of lever holding, but are
much slower to execute the lever press requirements. Even more specifically, we determined that
it takes D2R-OE mice longer to execute the actions involved in the lever press requirements
because both the duration of single presses (response durations) and the time between successive
presses (inter-response times) are longer.
![Page 266: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/266.jpg)
251
Potential implications of these findings
There are a few possible future implications of the results from the experiments in the
D2R-OE mice. First, these studies may serve as a model by which to examine the underlying
processes involved in motivated behavior in other genetic mouse models. By isolating reward
value and effort sensitivity using this approach, understanding how the neurobiology of cost-
benefit decision making works is possible. As discussed in chapter 2, existing paradigms have
implicated several brain regions in cost-benefit decision making (Fig 3 in chapter 2). While a
decreased willingness to work for preferred rewards in the effort-related choice task in my first
experiment (Salamone, J. D. et al., 1991) could result from changes in either effort-or value-
related processes, these novel tasks demonstrate that effort processes are specifically impacted in
the D2R-OE model.
Finally, I discovered that D2R-OE mice have altered sensitivity to effort changes of
number of responses relative to time of a response. Additionally, these mice have altered ability
to repeatedly execute actions, as individual lever presses last longer and inter-response-times are
longer. Future work may develop deeper insights into motivation and effort allocation by
exploring how information about physical action is encoded and evaluated when making a cost-
benefit decision.
8.3 The selective 5HT2CR ligand SB242084 increases the vigor of motivated
responding is associated with dorsomedial dopamine release
New medication options which could alleviate impairments in goal-directed motivation
observed in disorders such as schizophrenia and depression is needed. The 5-HT2C receptor (5-
HT2CR) has become a potential candidate for developing new treatments toward this end due to
![Page 267: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/267.jpg)
252
its ability to modulate the mesolimbic and niagrostriatal dopamine systems (Alex et al., 2007; Di
Matteo et al., 2008). Indeed, modulation of 5-HT2CR activity with the functionally selective
ligand, SB242084, has been found to increase goal-directed motivation in mice (Simpson et al.,
2011). Because there are many underlying processes involved in motivation, I performed various
studies with several collaborators to investigate the effect of SB242084 on motivational
processes.
I first replicated the findings from Simpson et al, (2011) using wildtype C57/BL6J mice,
and found that SB242084 treatment resulted in increased lever pressing and how long subjects
continued pressing before quitting in a PR. As I demonstrated in chapter 3, increased responding
in the PR could result from either increased goal-directedness or increased general locomotor
activity, as is the case with stimulants such as methamphetamine. Using the PHD task, I found
that SB242084 treatment enabled subjects to earn more rewards by holding the lever down for
longer durations, suggesting the drug enhanced the goal-directedness of behavior.
Behavioral effects of SB242084 treatment are only observed when subjects are engaging in
goal-directed behavior
One possible way that SB242084 could increase the persistence of responding observed
in the PR and the PHD task is by making subjects less sensitive to unrewarded responses made
before subjects quit working. Such a change would reflect an alteration of sensitivity to
extinction, which is relevant to progressive schedules, as subjects experience increasing numbers
of non-reinforced responses at the end of the session. We found that treatment with SB242084
did not alter response rates in an extinction test. Moreover, SB242084 didn’t change response
rates at any point in our hour long extinction test. This suggests that the drugs effect is specific to
times when subjects are engaging in a goal-directed activity and earning rewards.
![Page 268: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/268.jpg)
253
SB242084 does not alter reward value sensitivity or effort sensitivity
We next examined whether SB242084 altered reward value-related processes. Treatment
with SB242084 did not alter reward value sensitivity in a reward value choice experiment, and it
also had no impact on NAc dopamine release in response to reward delivery or cues predicting
reward. Together, these results show that the drug does not alter processes related to reward.
We then examined whether SB242084 impacts sensitivity to increasing effort demands in
an effort choice task, as this too could explain the increased responding in a PR or PHD
schedule. Treatment with SB242084 did not alter sensitivity to increasing effort demands. Taken
together with the results of the reward value studies, it appears that SB242084 does not impact
the decision making processes involved in motivated behavior.
SB242084 increases goal-directed output by enhancing the vigor of responding and
increased dopamine release in the dorsomedial striatum
The extinction experiment suggested that SB242084 only impacts behavior when subjects
are working and earning rewards. The results from the value- and effort- related choice
experiments suggest that SB242084 doesn’t impact decision making. We therefore examined the
behavioral changes of the drug at the level of motor responding when subjects were working for
rewards in PR schedules.
An examination of the response dynamics in the PR revealed that SB242084 increased
not only the total number of response subjects made, but also increased the number of responses
subjects executed per bout while decreasing the time between successive responses. Taken
together, these results suggest an enhancement in response vigor and persistence. Experiments
![Page 269: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/269.jpg)
254
using in vivo microdialysis procedures performed with colleagues showed that enhanced
dopamine release in the DMS while subjects were working for rewards was associated with
increased response vigor following SB242084 treatment. The specificity of this effect on
dopamine and behavior was supported by two observations. First, dopamine release within the
NAc was not altered by treatment with SB242084. Second, the drug did not change dopamine
levels in either brain region when subjects were treated with drug, but were not working for
rewards. This DMS-specific impact on response vigor fits with recent studies implicating activity
in this region with the modulation of response vigor (Panigrahi et al., cell 2015; Opris et al.,
Frontiers 2011).
Potential implications of these findings
This set of studies was able to identify a highly specific effect of SB242084, as we
demonstrated that treatment enhanced the vigor of goal-directed behavior only when subjects
were earning rewards. These studies could serve as a model for investigating the effects of new
drugs believed to impact motivation. At present, there is a need to develop new drugs which
alleviate motivation impairments, and deeply understanding which specific motivational sub-
processes a drug impacts will likely be of importance for deciding whether a drug is a good new
pharmacological candidate.
A second implication of this work is the finding of DMS dopamine specifically
enhancing response vigor, while leaving effort and value related decision making unaltered.
Whereas drugs which globally increase or decrease dopamine levels are likely to impact
numerous different functions throughout the brain, identifying drugs which selectively alter
specific dopamine pathways and functions may be important to future efforts to find drugs which
have more specific effects while posing less risk through undesired side effects.
![Page 270: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/270.jpg)
255
Finally, the finding that the vigor of behavior can change independently of cost-benefit
decision making has implications for how we conceptualize motivation and study it in the future.
Whereas the experiments in D2R-OE mice found both reductions in vigor (slowed lever press
responding) and altered effort based decision making, here SB242084 acted only on the motor
output by modulating vigor while leaving effort sensitivity unaltered. Understanding how and
where the brain makes the distinction between changes in the execution of motor responses and
changes in the evaluating how effortful those motor responses are perceived will help shed light
on the study of motivated behavior. I anticipate that experiments performed to determine whether
animals are aware of increases or decreases in their response capacity will serve as an important
next step in making further advances towards understanding the larger and broader question of
how motoric output is produced, modulated, and evaluated by the nervous system.
References
Aberman, J. E., Ward, S. J., & Salamone, J. D. (1998). Effects of dopamine antagonists and
accumbens dopamine depletions on time-constrained progressive-ratio performance.
Pharmacology Biochemistry and Behavior, 61(4), 341-348. doi: 10.1016/s0091-
3057(98)00112-9
Alex, K. D., & Pehek, E. A. (2007). Pharmacologic mechanisms of serotonergic regulation of
dopamine neurotransmission. Pharmacology & Therapeutics, 113(2), 296-320. doi:
10.1016/j.pharmthera.2006.08.004
Antoniou, K., Papadopoulou-Daifoti, Z., Hyphantis, T., Papathanasiou, G., Bekris, E., Marselos,
M., Ferré, S. (2005). A detailed behavioral analysis of the acute motor effects of caffeine
in the rat: involvement of adenosine A1 and A2A receptors. Psychopharmacology,
183(2), 154-162. doi: 10.1007/s00213-005-0173-6
Brown, C. M., Fletcher, P. J., & Coscina, D. V. (1998). Neuropeptide Y-induced operant
responding for sucrose is not mediated by dopamine. Peptides, 19(10), 1667-1673. doi:
10.1016/s0196-9781(98)00117-x
Cagniard, B., Balsam, P. D., Brunner, D., & Zhuang, X. X. (2006). Mice with chronically
elevated dopamine exhibit enhanced motivation, but not learning, for a food reward.
Neuropsychopharmacology, 31(7), 1362-1370. doi: 10.1038/sj.npp.1300966
![Page 271: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/271.jpg)
256
Carvalho Poyraz, F., Holzner, E., Bailey, M. R., Meszaros, J., Kenney, L., Kheirbek, M. A., . . .
Kellendonk, C. (2016). Decreasing Striatopallidal Pathway Function Enhances
Motivation by Energizing the Initiation of Goal-Directed Action. The Journal of
Neuroscience, 36(22), 5988-6001. doi: 10.1523/jneurosci.0444-16.2016
Di Matteo, V., Di Giovanni, G., Pierucci, M., & Esposito, E. (2008). Serotonin control of central
dopaminergic function: focus on in vivo microdialysis studies. Serotonin-Dopamine
Interaction: Experimental Evidence and Therapeutic Relevance, 172, 7-44. doi:
10.1016/s0079-6123(08)00902-3
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2007). Transient Overexpression of Striatal D2 Receptors Impairs Operant
Motivation and Interval Timing. The Journal of Neuroscience, 27(29), 7731-7739. doi:
10.1523/jneurosci.1736-07.2007
Drew, M. R., Simpson, E. H., Kellendonk, C., Herzberg, W. G., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2007). Transient overexpression of striatal D-2 receptors impairs operant
motivation and interval timing. Journal of Neuroscience, 27(29), 7731-7739. doi:
10.1523/jneurosci.1736-07.2007
Duffy, E. (1957). The psychological significance of the concept of "arousal" or "activation.".
Psychological Review, 64(5), 265-275. doi: 10.1037/h0048837
Faerden, A., Vaskinn, A., Finset, A., Agartz, I., Ann Barrett, E., Friis, S., Melle, I. (2009).
Apathy is associated with executive functioning in first episode psychosis. BMC
Psychiatry, 9, 1. doi: 10.1186/1471-244X-9-1
Hall, D. A., Stanis, J. J., Marquez Avila, H., & Gulley, J. M. (2008). A comparison of
amphetamine- and methamphetamine-induced locomotor activity in rats: evidence for
qualitative differences in behavior. Psychopharmacology, 195(4), 469-478. doi:
10.1007/s00213-007-0923-8
Hebb, D. O. (1955). Drives and the C. N. S. (conceptual nervous system). Psychological Review,
62(4), 243-254. doi: 10.1037/h0041823
Kellendonk, C., Simpson, E. H., Polan, H. J., Malleret, G., Vronskaya, S., Winiger, V., Kandel,
E. R. (2006). Transient and selective overexpression of dopamine D2 receptors in the
striatum causes persistent abnormalities in prefrontal cortex functioning. Neuron, 49(4),
603-615. doi: 10.1016/j.neuron.2006.01.023
Kiang, M., Christensen, B. K., Remington, G., & Kapur, S. (2003). Apathy in schizophrenia:
clinical correlates and association with functional outcome. Schizophrenia Research,
63(1–2), 79-88. doi: http://dx.doi.org/10.1016/S0920-9964(02)00433-4
Mayorga, A. J., Popke, E. J., Fogle, C. M., & Paule, M. G. (2000). Similar effects of
amphetamine and methylphenidate on the performance of complex operant tasks in rats.
Behavioural Brain Research, 109(1), 59-68. doi: 10.1016/s0166-4328(99)00165-5
![Page 272: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/272.jpg)
257
McCue, D. L., Kasper, J. M., & Hommel, J. D. (2017). Regulation of motivation for food by
neuromedin U in the paraventricular nucleus and the dorsal raphe nucleus. Int J Obes
(Lond), 41(1), 120-128. doi: 10.1038/ijo.2016.178
Pandit, R., Luijendijk, M. C., Vanderschuren, L. J., la Fleur, S. E., & Adan, R. A. (2014). Limbic
substrates of the effects of neuropeptide Y on intake of and motivation for palatable food.
Obesity (Silver Spring), 22(5), 1216-1219. doi: 10.1002/oby.20718
Randall, P. A., Pardo, M., Nunes, E. J., Cruz, L. L., Vemuri, V. K., Makriyannis, A.,
Salamone, J. D. (2012). Dopaminergic Modulation of Effort-Related Choice Behavior as
Assessed by a Progressive Ratio Chow Feeding Choice Task: Pharmacological Studies
and the Role of Individual Differences. Plos One, 7(10). doi:
10.1371/journal.pone.0047934
Roth, Laura A. Flashman, Andrew J. Saykin, Thomas W. McAllister, & Robert Vidaver. (2004).
Apathy in Schizophrenia: Reduced Frontal Lobe Volume and Neuropsychological
Deficits. American Journal of Psychiatry, 161(1), 157-159. doi:
10.1176/appi.ajp.161.1.157
Salamone, J. D. (1988). Dopaminergic involvement in activational aspects of motivation: Effects
of haloperidol on schedule-induced activity, feeding, and foraging in rats. Psychobiology,
16(3), 196-206. doi: 10.3758/bf03327307
Salamone, J. D., Steinpreis, R. E., McCullough, L. D., Smith, P., Grebel, D., & Mahan, K.
(1991). HALOPERIDOL AND NUCLEUS-ACCUMBENS DOPAMINE DEPLETION
SUPPRESS LEVER PRESSING FOR FOOD BUT INCREASE FREE FOOD-
CONSUMPTION IN A NOVEL FOOD CHOICE PROCEDURE. Psychopharmacology,
104(4), 515-521. doi: 10.1007/bf02245659
Sanders, A. C., Hussain, A. J., Hen, R., & Zhuang, X. X. (2007). Chronic blockade or
constitutive deletion of the serotonin transporter reduces operant responding for food
reward. Neuropsychopharmacology, 32(11), 2321-2329. doi: 10.1038/sj.npp.1301368
Simón, V. M., Parra, A., Miñarro, J., Arenas, M. C., Vinader-Caerols, C., & Aguilar, M. A.
(2000). Predicting how equipotent doses of chlorpromazine, haloperidol, sulpiride,
raclopride and clozapine reduce locomotor activity in mice. European
Neuropsychopharmacology, 10(3), 159-164. doi: http://doi.org/10.1016/S0924-
977X(00)00070-5
Simpson, E. H., Kellendonk, C., Ward, R. D., Richards, V., Lipatova, O., Fairhurst, S.,
Balsam, P. D. (2011). Pharmacologic Rescue of Motivational Deficit in an Animal Model
of the Negative Symptoms of Schizophrenia. Biological Psychiatry, 69(10), 928-935. doi:
http://dx.doi.org/10.1016/j.biopsych.2011.01.012
Ward, R. D., Simpson, E. H., Kandel, E. R., & Balsam, P. D. (2011). Modeling motivational
deficits in mouse models of schizophrenia: Behavior analysis as a guide for neuroscience.
Behavioural Processes, 87(1), 149-156. doi: 10.1016/j.beproc.2011.02.004
![Page 273: An Examination of Goal-Directed ... - Academic Commons](https://reader034.vdocuments.us/reader034/viewer/2022051319/627b69ec8fdb4060fd70d885/html5/thumbnails/273.jpg)
258
Ward, R. D., Simpson, E. H., Richards, V. L., Deo, G., Taylor, K., Glendinning, J. I.,
Balsam, P. D. (2012). Dissociation of Hedonic Reaction to Reward and Incentive
Motivation in an Animal Model of the Negative Symptoms of Schizophrenia.
Neuropsychopharmacology, 37(7), 1699-1707. doi: 10.1038/npp.2012.15
Ward, R. D., Winiger, V., Higa, K. K., Kahn, J. B., Kandel, E. R., Balsam, P. D., & Simpson, E.
H. (2015). The Impact of Motivation on Cognitive Performance in an Animal Model of
the Negative and Cognitive Symptoms of Schizophrenia. Behavioral Neuroscience,
129(3), 292-299. doi: 10.1037/bne0000051