operant conditioning
DESCRIPTION
Operant Conditioning. The Learner is NOT passive. Learning based on consequence!!!. The Law of Effect. Edward Thorndike Locked cats in a cage Behavior changes because of its consequences. Rewards strengthen behavior. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/1.jpg)
Operant Conditioning
The Learner is NOT passive.Learning based on consequence!!!
![Page 2: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/2.jpg)
The Law of Effect• Edward Thorndike• Locked cats in a cage• Behavior changes because
of its consequences.• Rewards strengthen
behavior.• If consequences are
unpleasant, the Stimulus-Reward connection will weaken.
• Called the whole process instrumental learning.
Click picture to see a better explanation of the Law of Effect.
![Page 3: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/3.jpg)
B.F. Skinner• Influence of
Nurture• Used a Skinner Box
(Operant Conditioning Chamber) to prove his concepts.
![Page 4: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/4.jpg)
Skinner Box
![Page 5: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/5.jpg)
Reinforces• A reinforcer is anything
the INCREASES a behavior.Positive Reinforcement:• The addition of something
pleasant.Negative Reinforcement:• The removal of something
unpleasant.• Two types of NR• Escape Learning• Avoidance Learning(Getting kicked out of class
versus cutting class)
![Page 6: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/6.jpg)
Positive or Negative?
Putting your seatbelt on. Studying for a test.Having a headache and taking an aspirin.
Faking sick to avoid AP Psych class.
Breaking out of jail.
Getting a kiss for doing the dishes.
![Page 7: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/7.jpg)
PunishmentMeant to decrease a
behavior.Positive Punishment• Addition of something
unpleasant.Negative Punishment
(Omission Training)• Removal of something
pleasant.Punishment works best
when it is immediately done after behavior and if it is harsh!
![Page 8: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/8.jpg)
How do we actually use Operant Conditioning?
Do we wait for the subject to deliver the desired behavior?
Sometimes, we use a process called shaping.
Shaping is reinforcing small steps on the way to the desired behavior.
To train a dog to get your slippers, you would have to reinforce him in small steps. First, to find the slippers. Then to put them in his mouth. Then to bring them to you and so on…this is shaping behavior.
To get Barry to become a better student, you need to do more than give him a massage when he gets good grades. You have to give him massages when he studies for ten minutes, or for when he completes his homework. Small steps to get to the desired behavior.
![Page 10: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/10.jpg)
Chaining Behaviors• Subjects are taught
a number of responses successively in order to get a reward.
Click picture to see a rat chaining behaviors.
Click to see a cool example of chaining behaviors.
![Page 11: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/11.jpg)
Same Terminology as Classical Conditioning
• Acquisition• Extinction• Spontaneous
Recovery• Generalization• Discrimination
If I wanted to reinforce my son’s dancing by giving him lollipops when he dances. Identify the following….
![Page 12: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/12.jpg)
Primary v. Secondary ReinforcersPrimary Reinforcer
• Things that are in themselves rewarding.
Secondary Reinforcer
• Things we have learned to value.
• Money is a special secondary reinforcer called a generalized reinforcer (because it can be traded for just about anything)
![Page 13: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/13.jpg)
Token Economy
• Every time a desired behavior is performed, a token is given.
• They can trade tokens in for a variety of prizes (reinforcers)
• Used in homes, prisons, mental institutions and schools.
![Page 14: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/14.jpg)
Premack Principle
• You have to take into consideration the reinforcers used.
• Is the reinforcer wanted….or at least is it more preferable than the targeted behavior.
Hamburgers might be a great positive reinforcer for some, but it would not work well on a vegetarian.
![Page 15: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/15.jpg)
Reinforcement SchedulesHow often to you
give the reinforcer?• Every time or just
sometimes you see the behavior.
![Page 16: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/16.jpg)
Continuous v. Partial ReinforcementContinuous
• Reinforce the behavior EVERYTIME the behavior is exhibited.
• Usually done when the subject is first learning to make the association.
• Acquisition comes really fast.
• But so does extinction.
Partial• Reinforce the behavior
only SOME of the times it is exhibited.
• Acquisition comes more slowly.
• But is more resistant to extinction.
• FOUR types of Partial Reinforcement schedules.
![Page 17: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/17.jpg)
Ratio SchedulesFixed Ratio
• Provides a reinforcement after a SET number of responses.
Variable Ratio• Provides a
reinforcement after a RANDOM number of responses.
• Very hard to get acquisition but also very resistant to extinction.
Fixed Ration- She gets a manicure for every 5 pounds she loses.
![Page 18: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/18.jpg)
Interval SchedulesFixed Interval
• Requires a SET amount of time to elapse before giving the reinforcement.
Variable Interval• Requires a RANDOM
amount of time to elapse before giving the reinforcement.
• Very hard to get acquisition but also very resistant to extinction.
Fixed Interval: She gets a manicure for every 7 days she stays on her diet.
![Page 19: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/19.jpg)
Observational Learning• Albert Bandura and his
BoBo Doll• We learn through
modeling behavior from others.
• Observational learning + Operant Conditioning = Social Learning Theory
Click pic to see some observational learning.
![Page 20: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/20.jpg)
Latent Leaning• Edward Tolman• Three rat experiment.• Latent means hidden.• Sometimes learning is
not immediately evident.
• Rats needed a reason to display what they have learned.
• Cognitive Maps
![Page 21: Operant Conditioning](https://reader036.vdocuments.us/reader036/viewer/2022062501/5681674a550346895ddbf930/html5/thumbnails/21.jpg)
Insight Learning• Wolfgang Kohler
and his Chimpanzees.
• Some animals learn through the “ah ha” experience. Click pic to see insight learning.