0% found this document useful (0 votes)

21 views13 pages

Lab 01 QRoutingv5

Research notes

Uploaded by

Apoorv Sahni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views13 pages

Lab 01 QRoutingv5

Research notes

Uploaded by

Apoorv Sahni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 13

Reinforcement Learning QRouting Laboratory

By John Cosmas

1 Objective

In this lab you will familiarise yourself with Reinforcement Learning techniques that
can be modelled with Python and then use it to write Python software calculate the
most efficient route through the network.

2 Introduction

Python is an Object-Oriented programming language with many packages developed

by a Python community, which includes an incredibly diverse and welcoming group
of people. This community develop packages that help people use Python for many
purposes: to make games, build web applications, solve business problems, and
develop internal tools at all kinds of interesting companies. Python is also used
heavily in scientific fields such as artificial intelligence for academic research and
applied work.

2.1 Reinforcement Learning QRouting

An agent in an unknown environment obtains some rewards by interacting with the

environment.
The agent ought to take actions so as to maximize cumulative rewards.
The goal of Reinforcement Learning (RL) is to learn a good strategy for the agent
from experimental trials and relative simple feedback received.
With the optimal strategy, the agent is capable to actively adapt to the environment to
maximize future rewards.
The agent is acting in an environment. How the environment reacts to certain actions
is defined by a model which we may or may not know.
The agent can stay in one of many states (s∈S) of the environment, and choose to
take one of many actions (a∈A) to switch from one state to another. Which state the
agent will arrive in, is decided by transition probabilities between states (P). Once an
action is taken, the environment delivers a reward (r∈R) as feedback.
The model defines the reward function and transition probabilities. We may or may
not know how the model works and this differentiate two circumstances:
Know the model: planning with perfect information; do model-based RL. When we
fully know the environment, we can find the optimal solution by Dynamic
Programming (i.e. optimisation algorithms).
Does not know the model: learning with incomplete information; do model-free RL or
try to learn the model explicitly as part of the algorithm.

3 Laboratory
The laboratory tutorial is subdivided into six sections:
1. Creating and Drawing a Graph and adding Edges
2. Generating Available Actions and Quality Matrices
3. Working with Functions
4. Iterating through Learning loop
5. Charting most efficient route from initial_state to
goal
6. Plotting the Reward Gained against iteration scores

3.1 Creating a New Anaconda Environment

Create and activate a virtual environment by using command line. Open Anaconda
Prompt and run this command:

conda create -n py36 python=3.6.13

activate it by using this command:

conda activate py36

download the required packages for today’s lab such as numpy, networkx and
matplotlib:

conda install numpy

conda install networkx
conda install matplotlib==2.2.3

Today, we run qrouting.py file in two methods, command line and PyCharm
editor.

1- Command line: On the same window run the following command:

python ‘path’\qrouting.py

2- PyCharm editor: Open Anaconda Navigator, set Applications on to py36

and launch PyCharm. Create a new project “RL1” on PyCharm. In order to
use the environment you created previously (py36), follow the next steps: On
New Project window, choose Previously configured interpreter and then
click on Add Interpreter and Add Local Interpreter. On the new window
(Add Python Interpreter), choose Existing and browse to the location of
py36 “C:\Users\’yourlogin’\.conda\envs\py36\python.exe”.
Open qrouting.py with PyCharm and run it.

3.2 Creating and Drawing a Graph and adding Edges using networkx package
3.2.1 Creating a List of Tuples for defining edges of a Graph
Lists are used to store multiple items in a single variable.
Lists are one of 4 built-in data types in Python used to store collections of data, the
other 3 are Tuple, Set, and Dictionary, all with different qualities and usage.
Lists are created using square brackets:

Example
Create a List:
>>> thislist = ["apple", "banana", "cherry"]
>>> print(thislist)

Tuples are used to store multiple items in a single variable.

Tuple is one of 4 built-in data types in Python used to store collections of data, the
other 3 are List, Set, and Dictionary, all with different qualities and usage.
A tuple is a collection which is ordered and unchangeable.
Tuples are written with round brackets.
Example
Create a Tuple:
>>> thistuple = ("apple", "banana", "cherry")
>>>print(thistuple)

Task: Complete this List of Tuples to define the edges of the above Graph.

>>> edges = [(0, 1), (1, 5)]

3.2.2 Creating an empty Graph

Creating a graph
Create an empty graph with no nodes and no edges.

Example
>>> import networkx as nx
>>> G=nx.Graph()
By definition, a Graph is a collection of nodes (vertices) along with identified pairs of
nodes (called edges, links, etc). In NetworkX, nodes can be any hashable object e.g. a
text string, an image, an XML object, another Graph, a customized node object, etc.
Note: To import the networkx package, type in the terminal window prompt: conda
install networkx
Type Cntl Shift P, then type python and then Select Interpreter and select the ‘base’
conda interpreter.

3.2.3 Adding Edges to Graph

networkx.Graph.add_edges_from
Graph.add_edges_from(ebunch_to_add, **attr)
Add all the edges in ebunch_to_add.

Examples
G.add_edges_from([(0, 1), (1, 5)]) # using a list of edge tuples
G.add_edges_from(edges)

3.2.4 Laying out a Graph

networkx.drawing.layout.spring_layout
spring_layout(G, k=None, pos=None, fixed=None, iterations=50, threshold=0.0001,
weight='weight', scale=1, center=None, dim=2, seed=None)[source]
Position nodes using Fruchterman-Reingold force-directed algorithm.

The algorithm simulates a force-directed representation of the network treating edges

as springs holding nodes close, while treating nodes as repelling objects, sometimes
called an anti-gravity force. Simulation continues until the positions are close to an
equilibrium.
There are some hard-coded values: minimal distance between nodes (0.01) and
“temperature” of 0.1 to ensure nodes don’t fly away. During the simulation, k helps
determine the distance between nodes, though scale and center determine the size and
place after rescaling occurs at the end of the simulation.

Examples
>>> pos = nx.spring_layout(G)

3.2.5 Drawing Nodes of a Graph

pos : dictionary
A dictionary with nodes as keys and positions as values. If not specified a
spring layout positioning will be computed. See networkx.layout for functions
that compute node positions.

ax : Matplotlib Axes object, optional

Draw the graph in the specified Matplotlib axes.

nodelist : list, optional

Draw only specified nodes (default G.nodes())

node_size : scalar or array

Size of nodes (default=300). If an array is specified it must be the same length
as nodelist.

node_color : color string, or array of floats

Node color. Can be a single color format string (default=’r’), or a sequence of
colors with the same length as nodelist. If numeric values are specified they
will be mapped to colors using the cmap and vmin,vmax parameters. See
matplotlib.scatter for more details.

node_shape : string
The shape of the node. Specification is as matplotlib.scatter marker, one of
‘so^>v<dph8’ (default=’o’).
alpha : float
The node transparency (default=1.0)

cmap : Matplotlib colormap

Colormap for mapping intensities of nodes (default=None)

vmin,vmax : floats
Minimum and maximum for node colormap scaling (default=None)

linewidths : [None | scalar | sequence]

Line width of symbol border (default =1.0)

label : [None| string]

Label for legend

Example
>>> G=nx.dodecahedral_graph()
>>> nodes=nx.draw_networkx_nodes(G,pos)

3.2.6 Drawing Edges of a Graph

draw_networkx_nodes
draw_networkx_nodes(G, pos, nodelist=None, node_size=300, node_color='r',
node_shape='o', alpha=1.0, cmap=None, vmin=None, vmax=None, ax=None,
linewidths=None, label=None, **kwds)[source]
Draw the nodes of the graph G.
draw_networkx_edges(G, pos, edgelist=None, width=1.0, edge_color='k',
style='solid', alpha=None, edge_cmap=None, edge_vmin=None, edge_vmax=None,
ax=None, arrows=True, label=None, **kwds)[source]
Draw the edges of the graph G.
This draws only the edges of the graph G.
Parameters :
G : graph
A networkx graph

pos : dictionary
A dictionary with nodes as keys and positions as values. If not specified a
spring layout positioning will be computed. See networkx.layout for functions
that compute node positions.

edgelist : collection of edge tuples

Draw only specified edges(default=G.edges())

width : float
Line width of edges (default =1.0)

edge_color : color string, or array of floats

Edge color. Can be a single color format string (default=’r’), or a sequence of
colors with the same length as edgelist. If numeric values are specified they
will be mapped to colors using the edge_cmap and edge_vmin,edge_vmax
parameters.

style : string
Edge line style (default=’solid’) (solid|dashed|dotted,dashdot)

alpha : float
The edge transparency (default=1.0)

edge_ cmap : Matplotlib colormap

Colormap for mapping intensities of edges (default=None)

edge_vmin,edge_vmax : floats
Minimum and maximum for edge colormap scaling (default=None)

ax : Matplotlib Axes object, optional

Draw the graph in the specified Matplotlib axes.

arrows : bool, optional (default=True)

For directed graphs, if True draw arrowheads.

label : [None| string]

Label for legend

For directed graphs, “arrows” (actually just thicker stubs) are drawn at the head end.
Arrows can be turned off with keyword arrows=False. Yes, it is ugly but drawing
proper arrows with Matplotlib this way is tricky.

Examples
>>> edges=nx.draw_networkx_edges(G,pos)

3.2.7 Drawing Labels of a Graph

draw_networkx_labels(G, pos, labels=None, font_size=12, font_color='k',
font_family='sans-serif', font_weight='normal', alpha=1.0, ax=None, **kwds)[source]
Draw node labels on the graph G.

Parameters :
G : graph
A networkx graph

pos : dictionary, optional

A dictionary with nodes as keys and positions as values. If not specified a
spring layout positioning will be computed. See networkx.layout for functions
that compute node positions.

font_size : int
Font size for text labels (default=12)

font_color : string
Font color string (default=’k’ black)

font_family : string
Font family (default=’sans-serif’)

font_weight : string
Font weight (default=’normal’)

alpha : float
The text transparency (default=1.0)

ax : Matplotlib Axes object, optional

Draw the graph in the specified Matplotlib axes.

Examples
>>> labels=nx.draw_networkx_labels(G,pos)

3.2.8 Plotting Graph with pylab

>>> pylab.show(x, y)

3.3 Generating Available Actions and Quality Matrices

3.3.1 Generating Available Actions Matrix
Create the available actions matrix M
If initial_state = 0 and target = 10
For all the edges in the graph
(0, 1)
(1, 5)
(5, 6)
(5, 4)
(1, 2)
(1, 3)
(9, 10)
(2, 4)
(0, 6)
(6, 7)
(8, 9)
(7, 8)
(1, 7)
(3, 9)
Set -1 if there is no edge between two nodes,
0 if there is an edge between two nodes
100 if the start or the end of an edge is the goal
M= [[ -1. 0. -1. -1. -1. -1. 0. -1. -1. -1. -1.]
[ 0. -1. 0. 0. -1. 0. -1. 0. -1. -1. -1.]
[ -1. 0. -1. -1. 0. -1. -1. -1. -1. -1. -1.]
[ -1. 0. -1. -1. -1. -1. -1. -1. -1. 0. -1.]
[ -1. -1. 0. -1. -1. 0. -1. -1. -1. -1. -1.]
[ -1. 0. -1. -1. 0. -1. 0. -1. -1. -1. -1.]
[ 0. -1. -1. -1. -1. 0. -1. 0. -1. -1. -1.]
[ -1. 0. -1. -1. -1. -1. 0. -1. 0. -1. -1.]
[ -1. -1. -1. -1. -1. -1. -1. 0. -1. 0. -1.]
[ -1. -1. -1. 0. -1. -1. -1. -1. 0. -1. 100.]
[ -1. -1. -1. -1. -1. -1. -1. -1. -1. 0. 100.]]

3.3.2 Generating Quality Matrix

Q= [[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0.]]

3.4 Working with Functions

3.4.1 Determine the available actions for a given state
Create a function that takes the given current state provides a list of available states
def available_actions(state):
return available_action

using
numpy.where ()
This function accepts a numpy-like array (ex. a NumPy array of
integers/booleans). It returns a new numpy array, after filtering based on a
condition, which is a numpy-like array of boolean values. For example,
condition can take the value of array ([ [True, True, True]]), which is a
numpy-like boolean array.
3.4.2 Choosing one of the actions
Create a function that chooses one of the available actions at random
def sample_next_action(available_actions_range):
return next_action

using
random.choice(a, size=None, replace=True, p=None)
Generates a random sample from a given 1-D array

Parameters
A 1-D array-like or int
If an ndarray, a random sample is generated from its elements. If an int, the
random sample is generated as if it were np.arange(a)

sizeint or tuple of ints, optional

Output shape. If the given shape is, e.g., (m, n, k), then m * n * k samples are
drawn. Default is None, in which case a single value is returned.

replaceboolean, optional
Whether the sample is with or without replacement. Default is True, meaning
that a value of a can be selected multiple times.

p1-D array-like, optional

The probabilities associated with each entry in a. If not given, the sample
assumes a uniform distribution over all entries in a.

Returns
samplessingle item or ndarray
The generated random samples

Raises
ValueError
If a is an int and less than zero, if a or p are not 1-dimensional, if a is an array-
like of size 0, if p is not a vector of probabilities, if a and p have different
lengths, or if replace=False and the sample size is greater than the population
size

Examples
Generate a uniform random sample from np.arange(5) of size 1:
np.random.choice(5, 1)
array([4]) # random

3.4.3 Update Q matrix according to path chosen

Create a function that updates the Q matrix according to the path chosen

def update(current_state, action, gamma):

find index of largest value in a vector defined by action of Q matrix
if there is more than one choice of largest values
pick one from the choices available
else:
pick the max index from the Q matrix
get the max index value from the Q matrix
set Q[current_state, action] to M[current_state, action] + gamma * max_value
if max value of Q matrix is greater than 0:
return np.sum(Q / np.max(Q)*100
else:
return 0

Example
# find index of largest value in a vector of an array
max_index = np.where(Q[action, ] == np.max(Q[action, ]))[1]

numpy.shape(a)
Return the shape of an array.
Parameters
array_like
Input array.

Returns
shapetuple of ints
The elements of the shape tuple give the lengths of the corresponding array
dimensions.

Examples
np.shape(np.eye(3))
(3, 3)
np.shape([[1, 2]])
(1, 2)
np.shape([0])
(1,)
np.shape(0)
()

if max_index.shape[0] > 1: # there is more than one choice from Q matrix

max_index = int(np.random.choice(max_index, size = 1)) # pick one from the
choices available
else:
max_index = int(max_index) # pick the max index from the Q matrix

numpy.max() function.
Syntax
The syntax of max() function as given below.
max_value = numpy.max(arr)
Pass the numpy array as argument to numpy.max(), and this function shall return the
maximum value.

Example: Get Maximum Value of Numpy Array

In this example, we will take a numpy array with random numbers and then find the
maximum of the array using numpy.max() function.

Python Program

import numpy as np
arr = np.random.randint(10, size=(4,5))
print(arr)
#find maximum value
max_value = np.max(arr)
print('Maximum value of the array is',max_value)

Output
[[3 2 2 2 2]
[5 7 0 4 5]
[8 1 4 8 4]
[2 0 7 2 1]]
Maximum value of the array is 8

numpy.sum(arr, axis, dtype, out) : This function returns the sum of array elements
over the specified axis.

Parameters :
arr :
input array.
axis :
axis along which we want to calculate the sum value. Otherwise, it will
consider arr to be flattened(works on all the axis). axis = 0 means along the
column and axis = 1 means working along the row.
out :
Different array in which we want to place the result. The array must have
same dimensions as expected output. Default is None.
initial :
[scalar, optional] Starting value of the sum.

Return :
Sum of the array elements (a scalar value if axis is none) or array with sum values
along the specified axis.

3.5 Iterating through Learning loop

gamma = 0.75
# learning parameter
initial_state = 0

available_action = available_actions(initial_state)
action = sample_next_action(available_action)
update(initial_state, action, gamma)

scores = []
for i in range(1000):
current_state = np.random.randint(0, int(Q.shape[0]))
available_action = available_actions(current_state)
action = sample_next_action(available_action)
score = update(current_state, action, gamma)
scores.append(score)
print(Q)

3.6 Charting most efficient route from initial_state to goal

print("Trained Q matrix:")
print(Q / np.max(Q)*100)
# You can uncomment the above two lines to view the trained Q matrix

# Testing
current_state = 0
steps = [current_state]

while current_state != goal:

next_step_index = np.where(Q[current_state, ] == np.max(Q[current_state, ]))[1]

if next_step_index.shape[0] > 1:
next_step_index = int(np.random.choice(next_step_index, size = 1))
else:
next_step_index = int(next_step_index)
steps.append(next_step_index)
current_state = next_step_index

3.7 Plotting the Reward Gained against iteration scores

pl.plot(scores)
pl.xlabel('No of iterations')
pl.ylabel('Reward gained')
pl.show()

Practical File Artificial Intelligence Class 10 For 2023-24
78% (9)
Practical File Artificial Intelligence Class 10 For 2023-24
26 pages
Dynetx
No ratings yet
Dynetx
72 pages
Building Probabilistic Graphical Models With Python
No ratings yet
Building Probabilistic Graphical Models With Python
24 pages
Palas Blackbook Original
100% (3)
Palas Blackbook Original
43 pages
Tutorial
No ratings yet
Tutorial
46 pages
Networkx: Network Analysis With Python
No ratings yet
Networkx: Network Analysis With Python
47 pages
P3 - Graph Theory - 19-10-2022
No ratings yet
P3 - Graph Theory - 19-10-2022
23 pages
Networkx: Network Analysis With Python: Salvatore Scellato
No ratings yet
Networkx: Network Analysis With Python: Salvatore Scellato
49 pages
PDF Networkx
No ratings yet
PDF Networkx
38 pages
10.2 Construct A Simple Network
No ratings yet
10.2 Construct A Simple Network
9 pages
Social Network Analysis Con Python PDF
No ratings yet
Social Network Analysis Con Python PDF
80 pages
05 Solving Shortest Path Problems With Networkx - Completed PDF
No ratings yet
05 Solving Shortest Path Problems With Networkx - Completed PDF
7 pages
NX Tutorial PDF
No ratings yet
NX Tutorial PDF
29 pages
Networkx - Directed - Ipynb - Colaboratory
No ratings yet
Networkx - Directed - Ipynb - Colaboratory
2 pages
Networkx Tutorial
No ratings yet
Networkx Tutorial
8 pages
Networkx Tutorial
100% (1)
Networkx Tutorial
8 pages
Tutorial
No ratings yet
Tutorial
3 pages
Stna Lecture11 PDF
No ratings yet
Stna Lecture11 PDF
51 pages
Lab 2
No ratings yet
Lab 2
14 pages
Murenei Network Analysis With Python and Networkx
No ratings yet
Murenei Network Analysis With Python and Networkx
3 pages
Data Structures and Algorithms Assignment 3: Minimum Spanning Trees and Shortest Path Algorithms
No ratings yet
Data Structures and Algorithms Assignment 3: Minimum Spanning Trees and Shortest Path Algorithms
5 pages
Getting Started With Graphviz and Python
No ratings yet
Getting Started With Graphviz and Python
9 pages
121a1114 D2 Sma Exp4
No ratings yet
121a1114 D2 Sma Exp4
5 pages
Artificial Intelligence Lab Manual
No ratings yet
Artificial Intelligence Lab Manual
35 pages
Social Network Analysis
No ratings yet
Social Network Analysis
20 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
76 pages
Program
No ratings yet
Program
25 pages
Week2 Lab and Assessment
No ratings yet
Week2 Lab and Assessment
7 pages
Documentation
No ratings yet
Documentation
75 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
38 pages
File 3
No ratings yet
File 3
5 pages
PyTorch & PyTorch Geometric
No ratings yet
PyTorch & PyTorch Geometric
21 pages
Final AI & ML Lab Manual
No ratings yet
Final AI & ML Lab Manual
59 pages
A Star
No ratings yet
A Star
5 pages
CS102 Lab14 Spring2025
No ratings yet
CS102 Lab14 Spring2025
11 pages
23MCB0003 Sna 04
No ratings yet
23MCB0003 Sna 04
15 pages
Aiml Sem Practicals
No ratings yet
Aiml Sem Practicals
8 pages
Thura2022-05-30 - Geometric Deep Learning
No ratings yet
Thura2022-05-30 - Geometric Deep Learning
33 pages
AI Lab5
No ratings yet
AI Lab5
6 pages
AI Dynamic
No ratings yet
AI Dynamic
9 pages
ML Lab File Vijay Kumar
No ratings yet
ML Lab File Vijay Kumar
27 pages
AI LAB Contents
No ratings yet
AI LAB Contents
19 pages
Ai Lab FINAL-28 Merged
No ratings yet
Ai Lab FINAL-28 Merged
81 pages
Graph Theroy Practical
No ratings yet
Graph Theroy Practical
102 pages
AIML With Outputs
No ratings yet
AIML With Outputs
34 pages
Lab 3
No ratings yet
Lab 3
7 pages
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Directed Graphs: Digraph Digraph - Out - Edges Digraph - in - Degree Digraph - Predecessors Digraph - Successors
No ratings yet
Directed Graphs: Digraph Digraph - Out - Edges Digraph - in - Degree Digraph - Predecessors Digraph - Successors
3 pages
3.3 Distributed Execution: Relu Dadd Drelu
No ratings yet
3.3 Distributed Execution: Relu Dadd Drelu
5 pages
Lecturer2 - Basic of Python
No ratings yet
Lecturer2 - Basic of Python
45 pages
Lecture 2: Graphs And: Networkx
No ratings yet
Lecture 2: Graphs And: Networkx
20 pages
Machine Learning
No ratings yet
Machine Learning
18 pages
Ai Lab Document-1
No ratings yet
Ai Lab Document-1
18 pages
Ai Manuval
No ratings yet
Ai Manuval
49 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
41 pages
Gd Script
From Everand
Gd Script
Marijo Trkulja
No ratings yet
Phase1 Report
No ratings yet
Phase1 Report
7 pages
Aiml-Lab-Manual 24-25
No ratings yet
Aiml-Lab-Manual 24-25
39 pages
Dy Ai Rec
No ratings yet
Dy Ai Rec
24 pages
Ai-Lab Task-07
No ratings yet
Ai-Lab Task-07
8 pages
Package Deal': R Topics Documented
No ratings yet
Package Deal': R Topics Documented
33 pages
Ian Talks Python A-Z
From Everand
Ian Talks Python A-Z
Ian Eress
No ratings yet
Python Civil Structural Engineers
100% (1)
Python Civil Structural Engineers
299 pages
Mathematical Foundations of Soft Robotics Engineering
No ratings yet
Mathematical Foundations of Soft Robotics Engineering
369 pages
Fake New Detection Assignment
No ratings yet
Fake New Detection Assignment
13 pages
Docker
No ratings yet
Docker
24 pages
FDS Lab Manual (1-3) PDF
No ratings yet
FDS Lab Manual (1-3) PDF
17 pages
Python Interview Questions
No ratings yet
Python Interview Questions
8 pages
CSA Lab 2
No ratings yet
CSA Lab 2
5 pages
Advanced NumPy Broadcasting and Strides Guide
No ratings yet
Advanced NumPy Broadcasting and Strides Guide
21 pages
Using Machine Learning To Locate Support and Resistance Lines For Stocks - by Suhail Saqan - The Startup - Medium
No ratings yet
Using Machine Learning To Locate Support and Resistance Lines For Stocks - by Suhail Saqan - The Startup - Medium
16 pages
BIA Master of Science in Computer Science 3
No ratings yet
BIA Master of Science in Computer Science 3
22 pages
Audio and Digital Signal Processing
No ratings yet
Audio and Digital Signal Processing
18 pages
Final Report Mini Project Final
No ratings yet
Final Report Mini Project Final
12 pages
Mkce Python Lab Manual
No ratings yet
Mkce Python Lab Manual
46 pages
AI With Python Â - Speech Recognition
No ratings yet
AI With Python Â - Speech Recognition
10 pages
Analogy of Water Quality Prediction Using SVM and Xgboost Algorithms
No ratings yet
Analogy of Water Quality Prediction Using SVM and Xgboost Algorithms
104 pages
Ds Lab-1
No ratings yet
Ds Lab-1
40 pages
Mastering Exploratory Data Analysis With Python - A Comprehensive Guide To Unveiling Hidden Insights
No ratings yet
Mastering Exploratory Data Analysis With Python - A Comprehensive Guide To Unveiling Hidden Insights
73 pages
Final Prac
No ratings yet
Final Prac
118 pages
Depression Detection Using EEG
No ratings yet
Depression Detection Using EEG
57 pages
Numpy Cheat Sheet
No ratings yet
Numpy Cheat Sheet
13 pages
Dedalus Project Readthedocs Io en Latest
No ratings yet
Dedalus Project Readthedocs Io en Latest
186 pages
DSEB - Syllabus - Python Programming
No ratings yet
DSEB - Syllabus - Python Programming
16 pages
Introduction To Python Programming: Fission Reactor Physics 1
No ratings yet
Introduction To Python Programming: Fission Reactor Physics 1
56 pages
Diya Basera
No ratings yet
Diya Basera
15 pages
Python For Data Analysis: Dr. Kishore Kunal
100% (1)
Python For Data Analysis: Dr. Kishore Kunal
43 pages
Three Dimensional Points and Lines in Python
No ratings yet
Three Dimensional Points and Lines in Python
3 pages
Emotion Recognition Using Facial Expressions PDF
No ratings yet
Emotion Recognition Using Facial Expressions PDF
26 pages
Python Programming Lab Manual
No ratings yet
Python Programming Lab Manual
61 pages

Lab 01 QRoutingv5

Uploaded by

Lab 01 QRoutingv5

Uploaded by

Reinforcement Learning QRouting Laboratory

Python is an Object-Oriented programming language with many packages developed

2.1 Reinforcement Learning QRouting

An agent in an unknown environment obtains some rewards by interacting with the

3.1 Creating a New Anaconda Environment

conda create -n py36 python=3.6.13

activate it by using this command:

conda activate py36

conda install numpy

1- Command line: On the same window run the following command:

2- PyCharm editor: Open Anaconda Navigator, set Applications on to py36

Tuples are used to store multiple items in a single variable.

>>> edges = [(0, 1), (1, 5)]

3.2.2 Creating an empty Graph

3.2.3 Adding Edges to Graph

3.2.4 Laying out a Graph

The algorithm simulates a force-directed representation of the network treating edges

3.2.5 Drawing Nodes of a Graph

ax : Matplotlib Axes object, optional

nodelist : list, optional

node_size : scalar or array

node_color : color string, or array of floats

cmap : Matplotlib colormap

linewidths : [None | scalar | sequence]

label : [None| string]

3.2.6 Drawing Edges of a Graph

edgelist : collection of edge tuples

edge_color : color string, or array of floats

edge_ cmap : Matplotlib colormap

ax : Matplotlib Axes object, optional

arrows : bool, optional (default=True)

label : [None| string]

3.2.7 Drawing Labels of a Graph

pos : dictionary, optional

ax : Matplotlib Axes object, optional

3.2.8 Plotting Graph with pylab

3.3 Generating Available Actions and Quality Matrices

3.3.2 Generating Quality Matrix

3.4 Working with Functions

sizeint or tuple of ints, optional

p1-D array-like, optional

3.4.3 Update Q matrix according to path chosen

def update(current_state, action, gamma):

if max_index.shape[0] > 1: # there is more than one choice from Q matrix

Example: Get Maximum Value of Numpy Array

3.5 Iterating through Learning loop

3.6 Charting most efficient route from initial_state to goal

while current_state != goal:

next_step_index = np.where(Q[current_state, ] == np.max(Q[current_state, ]))[1]

3.7 Plotting the Reward Gained against iteration scores

You might also like