Q-Learning Algorithm
Contents:
Reinforcement Learning Technique: Q-learning
Some important terms in Q-learning
Factors and algorithm of Q-learning
Steps with examples
Advantages and disadvantages
Applications
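Q(s, a) = R(s, a) + γ · max[Q(s', a')]
Where:
s is the current state and a is the action taken in it
R(s, a) is the immediate reward for taking action a in state s
γ is the discount factor, a value between 0 and 1
s' is the next state, and the maximum is taken over all actions a' available in s'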
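The update rule above can be sketched in a few lines of Python. This is only an illustration: the function name q_update, the list-of-lists tables, the value gamma = 0.8, and the two-state numbers are all assumptions made for the example, not values from the slides.

```python
def q_update(q, r, state, action, next_state, gamma=0.8):
    """Apply Q(s,a) = R(s,a) + gamma * max[Q(s',a')] once and return the new value.

    q and r are tables indexed as table[state][action] (an assumed layout).
    """
    best_next = max(q[next_state])  # best Q-value reachable from the next state
    q[state][action] = r[state][action] + gamma * best_next
    return q[state][action]


# Hypothetical two-state, two-action tables, chosen only to show the arithmetic.
q = [[0.0, 0.0], [0.0, 50.0]]
r = [[0.0, 10.0], [0.0, 0.0]]

new_value = q_update(q, r, state=0, action=1, next_state=1, gamma=0.8)
print(new_value)  # 10 + 0.8 * 50 = 50.0
```

With a discount factor closer to 0 the agent weighs the immediate reward more heavily; closer to 1 it weighs future rewards more heavily.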
Suppose that we have 5 rooms in a building. We will number the rooms from 0 to 4, and the
outside of the building can be thought of as one big room (5).
We can represent each room as a node (state) and each door as a link (action).
We have to get to room 5, so our goal state is room 5.
Doors that do not lead directly to room 5 have a reward of 0.
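The room example can be turned into a small runnable sketch. The text does not say which rooms are connected or what reward the goal doors carry, so the DOORS layout, the goal reward of 100, the discount factor of 0.8, and the choice to end an episode on reaching room 5 are all assumptions made for illustration.

```python
import random

GOAL = 5
# Hypothetical door layout (not given in the text): room -> rooms one door away.
DOORS = {0: [4], 1: [3, 5], 2: [3], 3: [1, 2, 4], 4: [0, 3, 5], 5: [1, 4, 5]}
GOAL_REWARD = 100  # assumed reward for a door that leads directly to room 5
GAMMA = 0.8        # assumed discount factor


def reward(action):
    """Reward for walking through a door into room `action`."""
    return GOAL_REWARD if action == GOAL else 0


def train(episodes=500, seed=0):
    """Learn Q-values by random exploration; each episode ends at the goal."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in DOORS[s]} for s in DOORS}
    for _ in range(episodes):
        state = rng.choice(list(DOORS))
        while state != GOAL:
            action = rng.choice(DOORS[state])  # the action is the room entered
            q[state][action] = reward(action) + GAMMA * max(q[action].values())
            state = action
    return q


def greedy_path(q, start):
    """Follow the highest Q-value from `start`, with a guard against loops."""
    path = [start]
    while path[-1] != GOAL and len(path) <= len(DOORS):
        options = q[path[-1]]
        path.append(max(options, key=options.get))
    return path


q_table = train()
print(greedy_path(q_table, 2))
```

Once the table has settled, following the largest Q-value from any room leads to room 5 in the fewest doors under this assumed layout.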