Bernstein - 06 - Learning Posted
Learning
Adaptation to a constant
stimulus is simple learning
Responses to unchanging stimuli decrease
over time.
– Habituation is the simplest form of learning.
Habituation is an example of
Non-Associative Learning
• Learning results from the impact of
one particular stimulus.
– Not the result of learning to associate
one stimulus with another
– We learn to ignore repeated or
constant stimuli
Another example of non-
associative learning
• Why do people engage in risky behavior?
• Solomon’s (1980) Opponent-Process
Theory explains this: it is based on the disruption
and restoration of equilibrium (like the opponent
processes in color vision).
• Explains drug addiction, bungee jumping,
and maybe even self-destructive
gang-related behavior.
Solomon’s Opponent-Process
Theory
– New stimuli that cause an extreme positive or negative
feeling trigger an opposite (opponent) feeling that
restores equilibrium.
– If the stimulus is repeated, the opponent feeling
happens faster and stronger, eventually
suppressing the original feeling.
– E.g., drug addiction: over time addicts need more
drug to get the same effect (habituation), and
withdrawal gets worse over time too.
– E.g., why do people skydive or ride rollercoasters?
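The opponent-process idea can be sketched as a toy simulation. This is only an illustration, not Solomon's own model: the function name, the starting values, and the growth rate of the opponent feeling are all assumed numbers chosen to show the pattern.

```python
def opponent_process(n_exposures, primary=1.0, opponent_start=0.2,
                     opponent_growth=0.15, opponent_max=1.2):
    """Return (exposure, feeling_during, feeling_after) for each exposure.

    All parameter values are illustrative assumptions, not data.
    """
    opponent = opponent_start
    results = []
    for exposure in range(1, n_exposures + 1):
        # Net feeling while the stimulus is present: primary minus opponent.
        feeling_during = primary - opponent
        # Once the stimulus ends, only the opponent feeling remains.
        feeling_after = -opponent
        results.append((exposure, feeling_during, feeling_after))
        # Assumption: the opponent feeling strengthens with each repetition.
        opponent = min(opponent_max, opponent + opponent_growth)
    return results


trace = opponent_process(8)
for exposure, during, after in trace:
    print(f"exposure {exposure}: during={during:+.2f}, after={after:+.2f}")
```

Under these assumptions the feeling during the stimulus shrinks with repetition (like tolerance) while the after-feeling grows more negative (like withdrawal). For a frightening stimulus such as skydiving, the signs would flip.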
Associative Learning
Most learning theories are based on
associations of one stimulus with another,
or associations between behavior and its
consequences.
Classical Conditioning
- Ivan Pavlov, John B. Watson
Pavlov’s Classical Conditioning
• Start with Unconditioned Stimulus UCS
– Causes an instinctive Unconditioned Response UCR
– For example, food causes salivation (drooling)
• Then PAIR the UCS with a Neutral Stimulus
– Presenting the UCS with Neutral Stimulus causes an
association to form. The more you do it, the stronger
the association.
– Ex., ring a bell when you present food…
– Eventually the bell ALONE will cause salivation
• The bell was neutral, but is now a Conditioned
Stimulus CS which causes salivation, the
Conditioned Response CR
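The statement that more pairings produce a stronger association can be illustrated with a short simulation. This sketch borrows a Rescorla-Wagner-style update rule, which the slides do not mention; the rule, the function name, the learning rate, and the maximum strength are all assumptions made for illustration.

```python
def pair_trials(n_trials, learning_rate=0.3, max_strength=1.0):
    """Return the bell -> salivation association strength after each pairing.

    Assumed rule (Rescorla-Wagner style, not from the slides): each pairing
    closes a fixed fraction of the gap between the current strength and
    the maximum.
    """
    strength = 0.0  # the bell starts out as a neutral stimulus
    history = []
    for _ in range(n_trials):
        strength += learning_rate * (max_strength - strength)
        history.append(strength)
    return history


history = pair_trials(10)
for trial, strength in enumerate(history, start=1):
    print(f"pairing {trial}: association strength = {strength:.2f}")
```

With these assumed values the strength rises quickly at first and then levels off, which matches the idea that early pairings matter most.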
Example - Pavlovian (Classical)
Conditioning
Classical Conditioning –
UCS, UCR, CS, CR
Apparatus for Measuring
Conditioned Responses
http://nobelprize.org/educational_games/medicine/pavlov/index.html
Changes Over Time in the Strength
of a Conditioned Response:
Extinction and Spontaneous
Recovery
Stimulus Generalization - Pavlov
Stimulus Discrimination
• Pavlov’s dogs didn’t respond to every
sound… just those similar to the one on which
they were conditioned.
http://nobelprize.org/educational/medicine/pavlov/pavlov.html
Factors Affecting the Learning
of a Conditioned Response
• Timing
– Forward conditioning = CS then UCS
• Most effective (bell then food)
– Backward conditioning = UCS then CS
• Less effective (food then bell)
– Simultaneous conditioning = Same time
• Least effective (food and bell at the same time)
• Predictability (response criterion)
• Signal Strength (absolute threshold, JND)
• Attention to Stimulus (attention)
Classical Conditioning
• Example
• Need a volunteer.
• UCS
• UCR
• NS
• CS
• CR
John B. Watson
• Showed that classical conditioning could be
used to condition emotions.
• Watson believed in “Nurture” not “Nature”
– People are shaped by their environment
• Famous experiment with Little Albert
– Neutral Stim. (NS) = white rat, other furry objects
– Uncond. Stim. (UCS) = Loud BANG!
– Uncond. Resp. (UCR) = Crying
– After conditioning: CS = white rat, CR = crying
John Watson showed that
emotions can be conditioned
More Factors Affecting the Learning
of a Conditioned Response
• Biopreparedness (BIOlogically PREPARED)
– Animals (including humans) are predisposed to
certain conditioning situations… perhaps genetic.
It is easy for us to learn certain associations.
– E.g. taste aversions, snakes vs. cars.
More Factors Affecting the Learning
of a Conditioned Response
• Second-Order Conditioning
– When a conditioned stimulus (CS) begins to
act as an unconditioned stimulus (UCS) for a
NEW CS
– Ex: A doctor’s waiting room (CS) is paired with a shot (UCS).
The waiting room could begin to act as a UCS…
eventually the magazines there become a CS.
– While adaptive, can cause problems.
Some Applications of Classical
Conditioning
• Can play role in the development of phobias.
(extreme fears that are not based on real danger
or fear reactions that aren’t appropriate to real
danger)
– Systematic desensitization as a treatment – to cause
extinction.
• Predator Control – taste aversion
• Predicting Alzheimer’s Disease – ability of
patients to be conditioned to blink (air puff = UCS,
light = CS) deteriorates.
• Other applications?
Instrumental and Operant
Conditioning:
B. F. Skinner - Operant Cond.
• Extended and formalized many of
Thorndike’s ideas.
• Organisms learn responses by operating on
the environment.
– “Operant conditioning”
• Primary aim = analyze how behavior is
changed by its consequences.
• Does target behavior increase or decrease?
Basic Parts of Operant Conditioning
• Operant – a response that has an effect on the
world.
• Reinforcer – a stimulus that increases the
probability that the behavior which preceded it
will occur again.
– Positive reinforcer – a pleasant thing follows
behavior, e.g., Mom gives kid candy for good
behavior in store (kid is being conditioned)
– Negative reinforcer – an unpleasant thing
STOPS following behavior, e.g., kid stops whining
when mom gives them candy in checkout line (mom
is being conditioned)
Figure 6.6: Positive and
Negative Reinforcement
Escape and Avoidance: Two types
of negative reinforcement
Escape Conditioning
Avoidance Conditioning
Adapted from: The Psychology of Memory and Learning by Hintzman. © 1978 by W.H. Freeman and Company. Used with permission.
IMPORTANT!!
• Negative reinforcement is NOT punishment.
• Shaping: positive reinforcement of
behavior approaching the target
behavior.
BF Skinner 1904-1990
Operant Conditioning
Forming and Strengthening
Operant Behavior
• Shaping – Process of reinforcing responses that
get closer and closer to the desired response.
• Primary Reinforcer – meets basic needs, e.g., food,
water. Give the dog a treat.
• Secondary Reinforcement
– Say “good dog” when you give the dog a treat…
eventually you won’t need so many treats. (money vs.
food/shelter…)
– Secondary reinforcers (or “conditioned reinforcers”)
– Greatly expands the power of operant conditioning.
– Depends on what people like… rock concert, opera?
Delay and Size of
Reinforcement
• Timing of Reinforcer – Usually the shorter
the delay between the target behavior and
reinforcement, the more effective.
• Size of Reinforcer – Usually the larger the
reinforcer, the more effective (10 bucks for
each right answer in class?)
Operant Schedules of
Reinforcement
• Continuous reinforcement schedule: give
reinforcer for every instance of target behavior.
– works well, but not practical in many
situations.
Effectiveness of Different
Schedules of Reinforcement
Adapted from "Teaching Machines" by B.F. Skinner, Copyright © 1961 by Scientific American, Inc.
All rights reserved.
Schedules and Extinction
• Failure to reinforce a response
extinguishes that response.
Self Stimulation – James Olds
• Pleasure Center in brain – Medial
Forebrain Bundle
Punishment
• Reduces the frequency of an operant
behavior by presenting an unpleasant
stimulus or removing a pleasant one.
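Because negative reinforcement and punishment are easy to confuse, the definitions in these slides can be laid out as a small lookup. This is a sketch only: the function name and its two boolean parameters are hypothetical, and the labels follow the slide definitions of positive reinforcer, negative reinforcer, and punishment.

```python
def classify_consequence(stimulus_is_pleasant, stimulus_is_presented):
    """Return (label, effect on the behavior) for a consequence.

    stimulus_is_pleasant:  True for a pleasant stimulus, False for unpleasant.
    stimulus_is_presented: True if the stimulus is added after the behavior,
                           False if it is removed or stopped.
    """
    if stimulus_is_presented and stimulus_is_pleasant:
        return ("positive reinforcement", "behavior increases")
    if not stimulus_is_presented and not stimulus_is_pleasant:
        return ("negative reinforcement", "behavior increases")
    # Presenting something unpleasant or removing something pleasant.
    return ("punishment", "behavior decreases")


# Kid gets candy for good behavior in the store.
print(classify_consequence(stimulus_is_pleasant=True, stimulus_is_presented=True))
# Whining stops after mom hands over candy.
print(classify_consequence(stimulus_is_pleasant=False, stimulus_is_presented=False))
# A pleasant privilege is taken away.
print(classify_consequence(stimulus_is_pleasant=True, stimulus_is_presented=False))
```

Both kinds of reinforcement make the behavior more likely; only punishment makes it less likely.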
Potential Drawbacks of
Punishment
• Does not “erase” an undesirable habit;
merely suppresses it. (don’t get caught)
Observational
Learning
Vicarious Conditioning - Observing another person being
reinforced for a behavior changes the observer’s behavior
Main Learning Theories Summary
• Classical Conditioning
– Pairing stimuli leads to conditioned responses
– Pavlov, Watson
• Operant Conditioning
– Behavior is shaped by its consequences
– Schedules of reinforcement
– Thorndike, Skinner
• Observational Learning
– People learn by watching others and observing the
consequences others receive
– Bandura