0% found this document useful (0 votes)
203 views5 pages

Operant Conditioning 1

B.F. Skinner developed the theory of operant conditioning which examines how behavior is influenced by consequences. Using positive and negative reinforcement in a Skinner box, rats learn that pressing a lever will either provide a food reward or painful stimulus. Operant conditioning involves reinforcing behaviors through consequences to increase or decrease the likelihood of that behavior occurring again in the future. It is an important theory in understanding how behaviors are acquired through reward and punishment during trial and error learning.

Uploaded by

clumsy16
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
203 views5 pages

Operant Conditioning 1

B.F. Skinner developed the theory of operant conditioning which examines how behavior is influenced by consequences. Using positive and negative reinforcement in a Skinner box, rats learn that pressing a lever will either provide a food reward or painful stimulus. Operant conditioning involves reinforcing behaviors through consequences to increase or decrease the likelihood of that behavior occurring again in the future. It is an important theory in understanding how behaviors are acquired through reward and punishment during trial and error learning.

Uploaded by

clumsy16
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

Operant Conditioning: Trial and Error

Image: B.F. Skinner at Harvard. By silly rabbit, License: CC BY-SA 3.0

The US-American psychologist B.F. Skinner is inevitably associated with the


term operant conditioning (Skinner-box see below). Operant conditioning
describes the acquisition of stimulus-reaction-patterns:

How do we adjust our originally spontaneous behavior through


reward and punishment?

The following terms are important with operant conditioning:

 Positive reinforcement: The probability of certain behavior to occur


increases through positive reinforcement.
 Negative reinforcement: The discontinuation of negative impulses also
leads to an increase in incidents.
 Reinforcement: Positive or negative behavior consequences

Warning:

 Reinforcement = increase of behavior, regardless of whether positive


or negative reinforcement
 positive and negative are not judgmental
 Positive: adding consequences
 Negative: removing consequences

Skinner box: Equipment for animal testing with a fixed lever. If this is pushed,
the rat will be rewarded with a food pill for its behavior. The behavior of the rat
(pushing down the lever) is reinforced and so the action is executed more
often. If the animals receive a painful stimulus after pushing the lever, they will
stop this behavior after a little while.

Image by Lecturio

There are 4 important intermittent reinforcement schedules:

Fixed-ratio Variable-ratio schedule Fixed- Variable-


schedule interval interval
schedule schedule

Reinforce after a Reinforce an unpredictable Reinforce a Reinforce an


set number of number of instances of the set period inconsistent
instances of the behavior (i.e. gambling) that is period
behavior

Differentiation of positive and negative reinforcement vs. punishment

The behavior consequence will be…


… added … removed

Positive reinforcement Positive strengthening Punishment(behavior


(behavior increases) decreases)

Negative reinforcemen Punishment (behavior Negative strengthening


t decreases) (behavior increases)

Image by Lecturio

Escape and Avoidance Learning

Escape Avoidance

Learn how to avoid an aversive stimulus by Perform a behavior to ensure the aversive
engaging in a particular behavior stimuli is not encountered

More important terms and examples of operant conditioning

Term Definition Example

Primary reinforcement The satisfaction of primary Food, sleep, rest


needs

Secondary reinforcemen Linking to primary Praise, admiration,


t reinforcement (social, material, money
etc.)

Emitted behavior Spontaneously occurring Dog lifts its paw, gets


behavior that can be rewarded, and repeats
reinforced. the behavior

Prompting Cue through initiating behavior Acquisition of language


externally

Fading Gradual fading out of the Teacher gives tips to get


prompts throughout the the right answer and
conditioning process reduces the number of
tips in the process.

Shaping Stepwise acquisition of Toddler learns to tie


complex behavior by shoes.
rewarding consequences

Chaining Learning complex chains of Brushing teeth:


behavior, usually, the last toothpaste on the brush,
element is reinforced 1st clean various areas,
flush, use dental floss,
etc.

Premack principle Linking of a less favorable “First you have to eat


activity with a popular one the salad, then you’ll
get the dessert!”

The reinforcement plans

Reinforcement plans are consistent relations between behavior and


consequence = contingency.

The high contingency is given, when almost every behavior results in a


consequence, whereas we speak of low contingency when consequences
follow only occasionally. Continuous reinforcement means that every single
desired behavior is strengthened. Intermittent reinforcement means that just
a certain number of all desired behavior is strengthened. The distinction is
drawn between ratio plans and interval plans:

Fixed ratio plans Consequence after fixed-rate, e.g., every 3rd time

Variable Consequence after variable rate, e.g., after the 2nd, then the 5th,
ratio plans then the 10th time

Fixed Consequence after a fixed time interval, e.g., every 5 minutes


interval plans

Variable Consequence after a variable time interval, e.g., after 5, then after
interval plans 10, and then after 15 minutes

Important: While continuous reinforcement leads to fast learning, skills are


more resistant to erasing if reinforcement occurs intermittently.

You might also like