Operant Conditioning
Operant Conditioning
Operant Conditioning
KEY CONCEPTS
What is the difference between operant conditioning and classical conditioning?
In operant conditioning, a voluntary response is then followed by a reinforcing stimulus. In this way,
the voluntary response (e.g. studying for an exam) is more likely to be done by the individual.
In contrast, classical conditioning is when a stimulus automatically triggers an involuntary response.
OPERANT CONDITIONING
Operant conditioning can be described as a process that attempts to modify behavior through the use of
positive and negative reinforcement.
Through operant conditioning, an individual makes an association between a particular behavior and a
consequence.
The term “operant conditioning” originated by the behaviorist B. F. Skinner, who believed that one
should focus on the external, observable causes of behavior (rather than try to unpack the internal
thoughts and motivations)
REINFORCEMENT
Reinforcement is anything that strengthen the desired response.
PUNISHMENT
Punishment is when the increase of something undesirable attempts to cause a decrease in the behavior
that follows.
Reinforcement schedule
Continuous reinforcement schedule
The desired response is reinforced every time it occurs. The problem is that because the organism is
used to receiving the reinforcement after every behavior, the responder may give up quickly when it doesn’t
appear.
Partial reinforcement schedule
The responses are sometimes reinforced, and sometimes not. In comparison to continuous
reinforcement, partial reinforcement schedules lead to slower initial learning, but they also lead to greater
resistance to extinction.
Fixed-interval schedule
Behavior is reinforced for the first response after a specific amount of time has passed.
Variable-interval schedule
Behavior is reinforced for the first response after an average, but unpredictable, amount of time has
passed.
Fixed-ratio schedule
Behavior is reinforced after a specific number of responses.
Variable-ratio schedule
Behavior is reinforced after an average, but unpredictable, number of responses