An Introduction to Artificial Intelligence
What is Tic Tac Toe?
Tic Tac Toe is simply an awesome game. (Although I do have to say that I really think its original name - noughts and crosses - is a whole lot cooler). It is simple and easy to play wherever you are. The success of this simple but yet intriguing game is quite staggering and therefore even some mathematicians had a peek at it. Who knew that those 4 lines that people draw on a piece of paper would gather such an attention?
There is some debate on where this game originated; theories are ranging from ancient Egypt to the Roman Empire. Luckily the guy decided not to patent it.
The Stats
One could assume that there are 9!
different possible game states for Tic Tac Toe. After there would be a winner, players would stop playing so we have to discard those states. Instead of the full 9! = 362,880
states we therefore only have 255,168 possible game states.
- 131,184 (1st player)
- 77,904 (2nd player)
- 46,080 (tie)
This might seem like a lot but for a Computer this is a joke. If we assume symmetries (discarding states already present by rotations or reflections) this changes to just about 26,830 possibilitiess.
There are 138 terminal positions after assuming symmetries.
- 91 (player 1)
- 44 (player 2)
- 3 (tie)
A little bit of Game Theory
Mathematics has a really interesting branch called Game Theory. Game Theory is defined as the study of “strategic decision making”. It even plays a very important role in economics, since that too is often about making different decisions. Some of you might have seen the movie A Beautiful Mind about the Life of the schizophrenic John Nash who ended up winning a Noble Prize regarding his work in the field of Game Theory.
So how is this important for Tic Tac Toe?
Tic Tac Toe is something called a zero-sum game. A game where the outcome always sums up to zero in the end. Furthermore this is a perfect information game where the game state is completely open to you. You know everything about it, no hidden cards or anything like that. Therefore you can play perfectly since no luck or unknown variables are involved. (Playing perfectly doesn’t mean you will win). For this purpose this means there will always be a tie.
Perfect Play
To develop an AI for Tic Tac Toe it has to be able to decide what to do next. It has to know how to play perfectly. We don’t want the computer to just play randomly after all do we?
The famous Tic Tac Toe xkcd. (#832)
Artificial Intelligence 101
What would a human do?
When every single one of us plays Tic Tac Toe it is easy to say what goes on in our minds:
We look at the board and try to predict the future. We try to find the move that will be best for us and worst for our opponent.
Essentially the computer does the exact same thing. It has one important advantage though it can see further into the future (given enough computing power). We could make the computer look far enough so it could see every possible move. This is how the computer can play perfectly and this is how many AI’s related to abstract games (chess, connect four, …) are implemented. If the games are too complex the computer might not be able to compute everything, but we don’t have to worry about that yet.
The Setup
First of all we need to represent the current game state. I created a simple python class for this purpose. It basically follows the following function outline:
show()
: shows the boardavailable_moves()
: all empty spacesavailbale_combos()
: all the spaces not taken by the opponentcomplete()
: is the game overwinner()
: is there a winnerget_squares(player)
: all squares of a playermake_move(position, player)
: place a mark on the boardget_enemy(player)
: get the opponent player marker
The Algorithm
So here comes what this whole article is all about. Creating an algorithm that is perfect and will never loose. This algorithm will walk through all the different game states and look into the future. We have to carefully craft it so it always selects the best move.
Minimax
The Minimax algorithm follows a simple rule:
Minimizing the possible loss in a worst case scenario.
Or in everyday language: making the best next move. This algorithm is especially helpful in zero-sum games like Tic Tac Toe.
Minimax basically recursively steps through all the steps. Each of the states that is being traversed is going to receive a value based on how good or bad this state is for the player. In case of Tic Tac Toe there aren’t many possibilities so we can just iterate through all the possibility and see whether a chain of steps lead to a win, a tie or a loss of the player. All the different states are going to be the nodes in our game tree.
The idea is pretty simple. The algorithm evaluates the current position and processes it step by step:
- Check if the node is completed. If yes, return the winner.
- Recursively call minimax with all possible moves.
- If it is the computer’s turn return the highest result, otherwise the lowest.
And that is all there is to it. The Minimax algorithm is one of the most straightforward algorithm and sometimes it is implemented in two different functions. One of these would do the minimizing and the other one would do the maximizing.
Alpha-Beta Pruning
Alpha-Beta Pruning is a simple but significant expansion of the minimax algorithm that can sometimes drastically improve performance. This might not be a big problem with Tic Tac Toe cause of the small search-space but I will dive into it anyway since this is a easy example.
The basic idea behind both algorithms is the same. The game states are being recursively iterated and the best state is found. The Advantage is that branches of the search tree can be eliminated and valuable computing power could be saved. The algorithm will focus on the better parts of the tree. If one move is worse than a previous one the algorithm is going to simply break. Therefore there need to be two variables that keep constant track of the upper and lower cutoff limits: Alpha and Beta in this case.
The Wrapper
Both of the algorithms mentioned above will return a score associated with that position. What we need now is a wrapper function that calls our algorithm for different position and chooses the best one accordingly.
This function looks almost identical to the algorithm itself. It’s purpose is similar. It iterates through the different possible moves and keeps track of the best move. In the end it will return that move since it is an optimal move for the computer to play.
The Website
After finishing all of the game logic and the artificial intelligence I decided to create a little website for the game. It runs on Flask and simply makes an Ajax requests to the server to determine the next move. I made some minor additions to save moves that were already calculated so that it doesn’t crash when too many people play at the same time. I embedded it at the top of this post if you wanna give it a shot, otherwise check out the site: tic.cwoebker.com
The Result
Open Source
The complete code to this project will be hosted up on Github soon.
For now this is the complete python file I ended up with:
Post Scriptum
Feel free to <iframe>
this wherever you want. Make sure to give credit! Everyone should play some Tic Tac Toe from day to day.
Update
The current live version of Tic Tac Toe uses a different code base than the one described here.