Technology
```python
import numpy as np
import random
```
```python
class TicTacToeGame():
    def __init__(self):
        self.state = ' ' * 9  # a string of length 9 that encodes the state of the 3*3 board
        self.player = 'X'
        self.winner = None
```
- Initializes the game state, the current player (`'X'`), and the winner (initially `None`).
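  As a quick illustration (not part of the original listing), a freshly constructed game looks like this:

```python
game = TicTacToeGame()
print(repr(game.state))   # '         '  (nine blank cells)
print(game.player)        # 'X' -- X always moves first
print(game.winner)        # None -- no winner yet
```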
```python
    def allowed_moves(self):
        states = []  # store all possible next states
        for i in range(len(self.state)):
            if self.state[i] == ' ':
                states.append(self.state[:i] + self.player + self.state[i+1:])
        return states
```
- Returns a list of all possible next states by filling empty spaces with the current player's
symbol.
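  For instance, on a board where X holds cell 0 and O holds cell 1, `allowed_moves` produces one candidate state per remaining empty cell (an illustrative snippet, not from the original write-up):

```python
game = TicTacToeGame()
game.state = 'XO       '   # X in cell 0, O in cell 1, seven empty cells
game.player = 'X'
for s in game.allowed_moves():
    print(repr(s))
# 'XOX      ', 'XO X     ', 'XO  X    ', ... seven candidate states in total
```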
```python
    def make_move(self, next_state):
        if self.winner:
            raise(Exception("Game already completed, cannot make another move!"))
        if not self.__valid_move(next_state):
            raise(Exception("Cannot make move {} to {} for player {}".format(
                self.state, next_state, self.player)))
        self.state = next_state
        self.winner = self.predict_winner(self.state)
        if self.winner:
            self.player = None
        elif self.player == 'X':
            self.player = 'O'
        else:
            self.player = 'X'
```
```python
    def playable(self):
        return ( (not self.winner) and any(self.allowed_moves()) )
```
- Returns `True` if the game is still ongoing and there are moves available.
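  A small illustration of the two ways a game stops being playable (again, not from the original listing):

```python
game = TicTacToeGame()
print(game.playable())    # True: no winner and empty cells remain
game.state = 'XXOOXXXOO'  # a completely filled, drawn board
print(game.playable())    # False: no empty cells left to move into
```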
```python
    def predict_winner(self, state):
        lines = [(0,1,2), (3,4,5), (6,7,8), (0,3,6), (1,4,7), (2,5,8), (0,4,8), (2,4,6)]
        winner = None
        for line in lines:
            line_state = state[line[0]] + state[line[1]] + state[line[2]]
            if line_state == 'XXX':
                winner = 'X'
            elif line_state == 'OOO':
                winner = 'O'
        return winner
```
- Checks all possible winning combinations and returns the winner (`'X'` or `'O'`).
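  For example (illustrative calls, not part of the original write-up):

```python
game = TicTacToeGame()
print(game.predict_winner('XXXOO    '))   # 'X'  -- top row (0, 1, 2)
print(game.predict_winner('XO XO X  '))   # 'X'  -- left column (0, 3, 6)
print(game.predict_winner('XO       '))   # None -- no completed line yet
```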
```python
    def __valid_move(self, next_state):
        allowed_moves = self.allowed_moves()
        if any(state == next_state for state in allowed_moves):
            return True
        return False
```
```python
    def print_board(self):
        s = self.state
        print(' {} | {} | {} '.format(s[0],s[1],s[2]))
        print(' -----------')
        print(' {} | {} | {} '.format(s[3],s[4],s[5]))
        print(' -----------')
        print(' {} | {} | {} '.format(s[6],s[7],s[8]))
```
The `Agent` class below represents an AI player that learns to play Tic-Tac-Toe by reinforcement
learning: it keeps a value estimate for each board state and refines those estimates through self-play.
```python
class Agent():
    def __init__(self, game_class, epsilon=0.1, alpha=0.5, value_player='X'):
        self.V = dict()  # dictionary to store state values
        self.NewGame = game_class
        self.epsilon = epsilon
        self.alpha = alpha
        self.value_player = value_player
```
- Initializes the agent with a value dictionary `V`, game class, exploration rate `epsilon`,
learning rate `alpha`, and the player the agent values (`'X'` or `'O'`).
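  Constructing an agent is then a one-liner; the defaults below simply mirror the signature above (the example itself is only illustrative):

```python
# epsilon: fraction of learning moves that explore at random
# alpha:   step size of each value update
agent = Agent(TicTacToeGame, epsilon=0.1, alpha=0.5, value_player='X')
```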
```python
    def state_value(self, game_state):
        return self.V.get(game_state, 0.0)
```
```python
    def learn_game(self, num_episodes=1000):
        for episode in range(num_episodes):
            self.learn_from_episode()
```
```python
    def learn_from_episode(self):
        game = self.NewGame()
        _, move = self.learn_select_move(game)
        while move:
            move = self.learn_from_move(game, move)
```
```python
    def learn_from_move(self, game, move):
        game.make_move(move)
        r = self.__reward(game)
        td_target = r
        next_state_value = 0.0
        selected_next_move = None
        if game.playable():
            best_next_move, selected_next_move = self.learn_select_move(game)
            next_state_value = self.state_value(best_next_move)
        current_state_value = self.state_value(move)
        td_target = r + next_state_value
        self.V[move] = current_state_value + self.alpha * (td_target - current_state_value)
        return selected_next_move
```
- Updates the value of the current state based on the reward and the value of the next state.
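  This is a TD(0) update, V(s) ← V(s) + α·(r + V(s') − V(s)), where s is the state reached by `move`, r is the reward, and V(s') is the value of the best next state. A small numeric illustration with made-up values:

```python
# Hypothetical numbers, purely to show the arithmetic of the update:
alpha = 0.5
current_state_value = 0.2   # V(s): value of the state just reached
next_state_value = 0.6      # V(s'): value of the best reachable next state
r = 0.0                     # no reward yet, the game is still in progress
td_target = r + next_state_value
print(current_state_value + alpha * (td_target - current_state_value))  # 0.4
```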
```python
    def learn_select_move(self, game):
        allowed_state_values = self.__state_values(game.allowed_moves())
        if game.player == self.value_player:
            best_move = self.__argmax_V(allowed_state_values)
        else:
            best_move = self.__argmin_V(allowed_state_values)
        selected_move = best_move
        if random.random() < self.epsilon:
            selected_move = self.__random_V(allowed_state_values)
        return (best_move, selected_move)
```
- Picks the greedy (best-valued) move when it is the agent's turn and the worst-valued move for the opponent, but with probability `epsilon` substitutes a random exploratory move; both the greedy and the selected move are returned, so the value update can bootstrap from the greedy move while play continues with the selected one.
```python
    def play_select_move(self, game):
        allowed_state_values = self.__state_values(game.allowed_moves())
        if game.player == self.value_player:
            return self.__argmax_V(allowed_state_values)
        else:
            return self.__random_V(allowed_state_values)
```
```python
    def demo_game(self, verbose=False):
        game = self.NewGame()
        t = 0
        while game.playable():
            if verbose:
                print(" \nTurn {}\n".format(t))
                game.print_board()
            move = self.play_select_move(game)
            game.make_move(move)
            t += 1
        if verbose:
            print(" \nTurn {}\n".format(t))
            game.print_board()
        if game.winner:
            if verbose:
                print("\n{} is the winner!".format(game.winner))
            return game.winner
        else:
            if verbose:
                print("\nIt's a draw!")
            return '-'
```
- Demonstrates a game played by the agent and optionally prints the game progress.
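  Putting it together, typical usage might look like the snippet below (episode counts are illustrative, and it assumes the private helpers not shown in this section, such as `__state_values`, `__argmax_V`, `__argmin_V`, `__random_V`, and `__reward`, are defined as in the full listing):

```python
agent = Agent(TicTacToeGame, epsilon=0.1, alpha=0.5, value_player='X')
agent.learn_game(num_episodes=10000)   # train by self-play

# Play 100 demonstration games and tally the outcomes
results = [agent.demo_game() for _ in range(100)]
print("X wins:", results.count('X'),
      "O wins:", results.count('O'),
      "draws:", results.count('-'))
```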
```python
    def interactive_game(self, agent_player='X'):
        game = self.NewGame()
        t = 0
        while game.playable():
            print(" \nTurn {}\n".format(t))
            game.print_board()
            if game.player == agent_player:
                move = self.play_select_move(game)
                game.make_move(move)
            else:
                move = self.__request_human_move(game)
                game.make_move(move)
            t += 1
        if game.winner:
            print("\n{} is the winner!".format(game.winner))
            return game.winner
        print("\nIt's a draw!")
        return '-'
```
```python
    def round_V(self):
        for k in self.V.keys():
            self.V[k] = round(self.V[k], 1)
```
def __random_V