RL Lecture 19
RL Lecture 19
RL Lecture 19
04/04/23
Lecturer: Prof. Subrahmanya Swamy Peruru Scribe: Rohan Baijal, Harsh Garg
1 Dopamine
Dopamine is the brain’s ”pleasure chemical”. Dopamine release can be triggered by various
stimulants such as food, psychomotor stimulants, opiates, etc. Apart from that, Beautiful
faces, images of lovers, monetary rewards can also trigger the dopamine reward system.
The experiment was performed on monkeys where they were presented with small quan-
tity of fruit juice as a reward and then their dopamine neuron response was studied. In the
first Figure, we can see that before learning about the reward, the dopamine neuron is acti-
vated after the unpredicted occurrence of the reward. Figure 2 suggests that after learning,
monkey’s conditioned stimulus predicts reward and there is a spike in dopamine even before
the actual reward is presented. In Figure 3, the conditioned stimulus predicts a reward, but
the reward fails to occur because of a mistake in the behavioural response of the monkey.
Due to the absence of reward, we can observe a depression in dopamine levels which occurs
more than 1 s after the conditioned stimulus which reveals an internal representation of the
time of the predicted reward in monkeys.
1
1.1 Dopamine and Food
Studies related to the connection between food and dopamine have also shown interesting
results. According to a study by Nestler (2001), presentation of palatable food induced
dopamine release.
Appetite is a very good example of delayed rewards in the real world. We eat now
(action) and our nutrients rise hours later (reward).
where effort_factor denotes a suitable factor that converts cost of effort into equivalent
2
quantity of food pellets. For this experiment the Fixed Ratio Schedule is the number of
times the rat has to press lever for 1 unit of food. As we can see in the above image, with an
increase in Fixed Ratio Requirement, the number of times the lever is pulled first increases,
but then eventually starts decreasing. Hence with dopamine depletion, the effort_factor
increases. Which means that the animal becomes more sensitive to cost(number of lever
presses in this case).
References
1. Lectures notes by “Prof. Pragathi”