Tic-Tac-Toe AI bot

A bot for Tic-Tac-Toe that uses deep reinforcement learning trained using keras-rl DQN agent.

All code in main.ipynb

Trained weights are present in dqn_weights.h5f

Additional

A PPO agent built with stable-baselines.

All code in MlpPolicy.ipynb

A trained model with PPO1 of stable baselines is in PPO1_model.
A trained model with PPO2 of stable baselines is in PPO2_model.

A heuristic using bot. A n-step-lookahead bot which uses minimax algorithm to find the optimal move.

import heuristicBot
...
move = heuristicBot.nslAgent(no_of_step_to_lookahead, game_grid, available_moves_on_grid, player_number)
...
# returns a valid index to where the move should be made
# PARAMETERS =>
# no_of_step_to_lookahead = (max 9, min 1)
# game_grid = a 9x9 2D numpy array with value 0:empty, 1:X, 2:O
# available_moves_on_grid = list of all empty places on the grid
# player_number = 1 or 2

Requirements

numpy==1.19.4
gym==0.18.3
stable-baselines==2.10.2
tensorflow==1.15.0
tensorflow-gpu==1.15.0
python==3.7
keras-rl

by Debashish Gogoi

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
training_logs_MlpPolicy		training_logs_MlpPolicy
training_logs_customPolicy		training_logs_customPolicy
.gitignore		.gitignore
MlpPolicy.ipynb		MlpPolicy.ipynb
MlpPolicy_test.py		MlpPolicy_test.py
PPO1_model.zip		PPO1_model.zip
PPO2_model.zip		PPO2_model.zip
README.md		README.md
TicTacToe.py		TicTacToe.py
checkpoint		checkpoint
dqn_weights.h5f.data-00000-of-00001		dqn_weights.h5f.data-00000-of-00001
dqn_weights.h5f.index		dqn_weights.h5f.index
heuristicBot-tests.py		heuristicBot-tests.py
heuristicBot.py		heuristicBot.py
main.ipynb		main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tic-Tac-Toe AI bot

Additional

Requirements

About

Releases

Packages

Languages

Devzard/TicTacToe-AI-bot

Folders and files

Latest commit

History

Repository files navigation

Tic-Tac-Toe AI bot

Additional

Requirements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages