I would be interested if you or anyone knows of a better approach, especially if it scales better (currently setting GAMES to 500,000 seems to make it play worse!)so about your question :) note that it does not actually play random moves in any way. the 'select' method is used to play lots of playouts each turn (determined by GAMES, default 10,000), during which it equally balances 'exploration' (rando