

Homework - 0

• MD AAMIR SOHAIL
• EE16BTECH11021
• AI5001: Introduction to Modern AI

CODE: Q.5 (ε-greedy method)


# MD AAMIR SOHAIL
# EE16BTECH11021
# AI5001: Assignment 1, Q.5: Plot (Average reward, Steps)
# N-armed bandit problem using the epsilon-greedy approach

import numpy as np
import random
import matplotlib.pyplot as plt

n = 10                          # number of arms
tasks = 2000                    # number of bandit tasks
epsilon = [0.10, 0.01, 0.00]    # epsilons for the epsilon-greedy method
steps = 1000                    # number of times to select an arm

mu, sigma = 0, 1
q = np.random.normal(mu, sigma, [n, tasks])   # (True) action values q*(a), one column per task

# Q[a][t]: action-value estimate of arm a for task t
# Q = 1 [ t1 t2 ..... t(#tasks) ]
#     2 [ t1 t2 ..... t(#tasks) ]
#     .
#     n [ t1 t2 ..... t(#tasks) ]
Q = np.zeros([n, tasks])

# N_t[a][t]: number of times arm a has been selected in task t
N_t = np.zeros([n, tasks])

for itr in range(len(epsilon)):

    print('For epsilon:', epsilon[itr])

    epsilon_avg = []
    epsilon_avg.append(0)

    Q[:] = 0
    N_t[:] = 1

    for i in range(1, steps + 1):       # steps

        R = []

        for task in range(tasks):

            # At each time step and for each task:
            #   with prob. epsilon:     choose an action uniformly at random,
            #                           independent of the action-value estimates
            #   with prob. 1 - epsilon: choose the action with the max action-value estimate

            if random.uniform(0, 1) < epsilon[itr]:
                index = np.random.randint(n)
            else:
                if i != 1:
                    index = np.argmax(Q[:, task])
                else:
                    # Randomly select an action at the initial step for the greedy approach
                    index = np.random.randint(n)

            # Reward of action a at time step t: R ~ N(q*(a), 1)
            reward = np.random.normal(mu, sigma) + q[index][task]
            R.append(reward)

            # Update the count of the selected action
            N_t[index][task] = N_t[index][task] + 1
            # Update the action-value estimate of the selected action (incremental sample average)
            Q[index][task] = Q[index][task] + (reward - Q[index][task]) / N_t[index][task]

        # Average over the 2000 tasks at this step
        avg_R = np.mean(R)
        epsilon_avg.append(avg_R)       # 'steps' number of elements (plus the initial 0)

    print('Done epsilon')
    plt.plot(epsilon_avg)

plt.rc('text', usetex=True)

plt.xlabel('Steps')
plt.ylabel('Average Reward')
plt.legend([r"$\epsilon=$" + str(epsilon[0]), r"$\epsilon=$" + str(epsilon[1]),
            r"$\epsilon=$" + str(epsilon[2])], loc='lower right', prop={'size': 16})
plt.title(r"$\epsilon$-greedy algorithm: 10-armed bandit testbed (Average over 2000 tasks)")
plt.show()
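For reference, the action-selection and update rules implemented in the inner loop above can be written in equation form (this is only a restatement of the code, not an addition to it):

\[
A_t =
\begin{cases}
\text{a uniformly random arm} & \text{with probability } \epsilon,\\[2pt]
\arg\max_a Q_t(a) & \text{with probability } 1-\epsilon,
\end{cases}
\qquad
R_t \sim \mathcal{N}\!\left(q(A_t),\, 1\right)
\]

\[
N(A_t) \leftarrow N(A_t) + 1,
\qquad
Q(A_t) \leftarrow Q(A_t) + \frac{R_t - Q(A_t)}{N(A_t)}
\]

Each point of a plotted curve is the reward at that step averaged over the 2000 parallel bandit tasks.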

Figure 1: Average Reward vs Steps (steps = 1000)

Figure 2: Average Reward vs Steps (steps = 3000)

OBSERVATIONS:
• The ε-greedy approach eventually performs better than the purely greedy approach.

• For the first ∼100 steps, the greedy method improved faster, but it got stuck with a sub-optimal action.

• The ε = 0.01 greedy approach improves more slowly, but after some experience it performs better than the ε = 0.10 approach (see Figure 2); a rough numerical check is sketched below.
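A minimal sketch of such a check, assuming the per-ε average-reward curves produced by the script are collected in a dict named curves (a hypothetical name; the script above plots each epsilon_avg list directly instead of storing it):

import numpy as np

# Hypothetical container: curves[eps] holds the epsilon_avg list
# produced for that epsilon by the script above.
def tail_mean(curve, frac=0.2):
    # Mean of the average reward over the last `frac` fraction of the run.
    k = max(1, int(len(curve) * frac))
    return float(np.mean(curve[-k:]))

# For a 3000-step run (Figure 2) one would expect:
#   tail_mean(curves[0.01]) > tail_mean(curves[0.10]) > tail_mean(curves[0.00])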
