Factorization Ranking Model for Fast Move Prediction in the Game of Go
Date
Author
Institution
Degree Level
Degree
Department
Supervisor / Co-Supervisor and Their Department(s)
Examining Committee Member(s) and Their Department(s)
Citation for Previous Publication
Link to Related Item
Abstract
In this thesis, we investigate the move prediction problem in the game of Go by proposing a new ranking model named Factorization Bradley Terry (FBT) model. This new model considers the move prediction problem as group competitions while also taking the interaction between features into account. A FBT model is able to provide a probability distribution that expresses a preference over moves. Therefore it can be easily compiled into an evaluation function and applied in a modern Go program. We propose a Stochastic Gradient Decent (SGD) algorithm to train a FBT model using expert game records, and provide two methods for fast computation of the gradient in order to speed up the training process. We also investigate the problem of integrating feature knowledge learned by the FBT model in Monte Carlo Tree Search (MCTS). Experimental results show that our FBT model outperforms the state-of-the-art fast move prediction system of Latent Factor Ranking, and it is useful in improving the performance of MCTS.
