A Tic-Tac-Toe AI implemented in Python. It uses Q-learning with either an epsilon-greedy (e-greedy) or a softmax policy for action selection.
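For readers new to these terms, here is a minimal sketch of the two action-selection policies and a single tabular Q-learning update. It is not taken from this repository: the function names, the dict-based Q-table, and the default values for alpha, gamma, and temperature are all assumptions chosen for illustration. It also treats the opponent as part of the environment, which may differ from how the actual code handles turns.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, otherwise a greedy one.

    q_values maps each legal action (e.g. a board cell index) to its Q-value.
    """
    actions = list(q_values)
    if rng.random() < epsilon:
        return rng.choice(actions)
    best = max(q_values.values())
    # Break ties between equally good actions at random.
    return rng.choice([a for a in actions if q_values[a] == best])


def softmax_probs(q_values, temperature=1.0):
    """Turn Q-values into selection probabilities (Boltzmann distribution)."""
    actions = list(q_values)
    highest = max(q_values.values())
    # Subtracting the maximum keeps exp() from overflowing.
    weights = [math.exp((q_values[a] - highest) / temperature) for a in actions]
    total = sum(weights)
    return {a: w / total for a, w in zip(actions, weights)}


def softmax_policy(q_values, temperature=1.0, rng=random):
    """Sample an action in proportion to its softmax probability."""
    probs = softmax_probs(q_values, temperature)
    actions = list(probs)
    return rng.choices(actions, weights=[probs[a] for a in actions], k=1)[0]


def q_update(q, state, action, reward, next_q_values, alpha=0.1, gamma=0.9):
    """Apply one Q-learning step to the table q and return the new value.

    Q(s, a) <- Q(s, a) + alpha * (reward + gamma * max_a' Q(s', a') - Q(s, a))
    An empty next_q_values means the next state is terminal.
    """
    best_next = max(next_q_values.values(), default=0.0)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]
```

With epsilon set to 0 the first policy always takes the highest-valued move, while a lower softmax temperature pushes the second policy toward the same greedy behaviour; higher values of either parameter mean more exploration.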
Use pipenv to install the dependencies:
pipenv install
To play against a trained AI:
python run.py
To test the e-greedy policy against the softmax policy:
python test.py