Near perfect poker playing software will clean you out

Academics at the University of Alberta have effectively solved a variation of poker (Photo: John Ulan)

Every poker player has, at some point, wished for just a little more insight into a hand. A new piece of software created by the Computer Poker Research Group at the University of Alberta, however, has no such crises of confidence. Cepheus has "solved" heads-up limit Texas hold 'em.

"We define a game to be essentially solved if a lifetime of play is unable to statistically differentiate it from being solved at 95 percent confidence," explains lead author of the research Michael Bowling. "Imagine someone playing 200 hands of poker an hour for 12 hours a day without missing a day for 70 years. Furthermore, imagine them employing the worst-case, maximally exploitive opponent strategy, and never making a mistake. They still cannot be certain they are actually winning."
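To put Bowling's "lifetime of play" in concrete terms, the figures in the quote can simply be multiplied out. The short sketch below does that arithmetic; the function and constant names are illustrative, and it assumes a 365-day year with leap days ignored.

```python
# Figures taken from Bowling's quote; the names are illustrative only.
HANDS_PER_HOUR = 200
HOURS_PER_DAY = 12
DAYS_PER_YEAR = 365  # assumption: leap days are ignored
YEARS = 70


def lifetime_hands(hands_per_hour=HANDS_PER_HOUR,
                   hours_per_day=HOURS_PER_DAY,
                   days_per_year=DAYS_PER_YEAR,
                   years=YEARS):
    """Total hands played over the hypothetical lifetime in the quote."""
    return hands_per_hour * hours_per_day * days_per_year * years


if __name__ == "__main__":
    print(f"{lifetime_hands():,} hands")  # 61,320,000 hands
```

That works out to a little over 61 million hands, which is the sample size behind the 95 percent confidence claim.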

Heads-up limit Texas hold 'em is played with just two players, fixed bet sizes and a limited number of raises allowed. According to the University of Alberta, this version of poker has fewer possible situations than checkers, a game the institution says it solved in 2007.

In poker, however, not all of the game information is laid bare to the players as it is in checkers. It is not possible for players to have full knowledge of past events or to see their opponents' hands, for example. This "imperfect-information" nature of heads-up limit Texas hold 'em, says the University of Alberta, makes playing or solving it a much more challenging problem for computers.

Cepheus was programmed with the rules of the game and was then trained against itself. According to Bowling, the software was run for two months using more than 4,000 CPUs, each considering over six billion hands every second. More than a billion billion hands were played, with the software learning and improving with each hand.
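For a rough feel of what "trained against itself" can mean, here is a minimal sketch of self-play using regret matching on rock-paper-scissors. This is not Cepheus's code: the published research describes a far more elaborate counterfactual regret minimization variant (CFR+) applied to poker, whereas this toy swaps in plain regret matching on a three-action game. The function names, the payoff table and the lopsided starting regrets are all assumptions made for illustration.

```python
# Toy self-play with regret matching on rock-paper-scissors.
# NOT Cepheus's algorithm; an illustrative stand-in on a tiny game.

# Payoff to the player choosing the row action against the column action
# (rock, paper, scissors). The game is symmetric, so both players can
# read their payoffs from the same table.
PAYOFF = [[0, -1, 1],
          [1, 0, -1],
          [-1, 1, 0]]
ACTIONS = range(3)


def strategy_from_regrets(regrets):
    """Play each action in proportion to its positive accumulated regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total == 0:
        return [1.0 / len(regrets)] * len(regrets)  # no regret yet: play uniformly
    return [p / total for p in positive]


def self_play(iterations, initial_regrets=(1.0, 0.0, 0.0)):
    """Return each player's average strategy after the given number of rounds."""
    # Assumption: player 0 starts out biased toward rock so there is
    # something to learn; player 1 starts with no regrets.
    regrets = [list(initial_regrets), [0.0, 0.0, 0.0]]
    strategy_sums = [[0.0] * 3, [0.0] * 3]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        for player in (0, 1):
            opponent = strategies[1 - player]
            # Expected payoff of each pure action against the opponent's mix.
            values = [sum(PAYOFF[a][b] * opponent[b] for b in ACTIONS)
                      for a in ACTIONS]
            expected = sum(strategies[player][a] * values[a] for a in ACTIONS)
            for a in ACTIONS:
                # Regret: how much better action a would have done.
                regrets[player][a] += values[a] - expected
                strategy_sums[player][a] += strategies[player][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, average in enumerate(self_play(20000)):
        print(player, [round(p, 3) for p in average])
```

Each round, both players look back at how much better every action would have done and shift toward the ones they regret not playing. Over many rounds the average strategies drift toward the even one-third mix that cannot be exploited in this game, which is a miniature version of the "essentially solved" idea.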

"The breakthroughs behind this result are general algorithmic advances that make game-theoretic reasoning in large-scale models of any sort more tractable," explains Bowling. "With real-life decision-making settings almost always involving uncertainty and missing information, algorithmic advances – such as those needed to solve Poker – are needed to drive future applications."

The university says that this is the first time a nontrivial imperfect-information game played competitively by humans has been solved. Although the research focused on a poker variation, real-world imperfect-information situations to which it might be applied include decision making at airport checkpoints and coast guard patrolling.

Source: University of Alberta

1 comment
Carbon Cycle
Cepheus is not unbeatable, as I have already beaten it twice out of five 100-hand matches. It may beat most on-line challengers because the web interface is horrible and that beats down a human's emotions. Worse, the queue to play is 50 users long, but only two can play at once. It used to be 4 players, but I noticed the server kept crashing and you would get summarily tossed out of your match. So, you end up waiting as long as two hours to get into a match. There is no indication of where you are in the queue, so you can easily get distracted and miss your 1-minute window to start playing, hours after you have finally gotten into the queue. It might help to pass the time, if you could see how the current players were doing. But no, this is all about Cepheus. Still, if you can hang in there it is not a perfect poker player, and you can learn how to beat it fairly rapidly. Cepheus still has to get cards to win, and that remains a 50-50 proposition. Don't be intimidated by its betting strategy. Often, it doesn't have good cards, but keeps betting into them. You can bluff it and slow play it effectively, lose small and win big, just like playing a real person.