Lectures 35-36 - Exploration vs. Exploitation
• Exploitation vs Exploration
Exploration vs. Exploitation Dilemma
Examples
Principles
The Multi-Armed Bandit
Regret
Counting Regret
Linear or Sublinear Regret
Greedy Algorithm
Optimistic Initialization
Epsilon-Greedy Algorithm
Decaying Epsilon-Greedy Algorithm
Lower Bound
Optimism in the Face of Uncertainty
Upper Confidence Bounds