Computer poker player

This is an old revision of this page, as edited by Zipster969 (talk | contribs) at 01:24, 19 February 2023 (Player bots: Clarified wording.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

A computer poker player is a computer program designed to play the game of poker against human opponents or other computer opponents. It is commonly referred to as pokerbot or just simply bot. As of 2019, computers can beat any human player in poker[1][2][3].

On the Internet

These bots or computer programs are used often in online poker situations as either legitimate opponents for humans players or a form of cheating. As of 2020, all use of Real Time Assistance or automated bots is considered cheating by all online poker sites, although the level of enforcement from site operators varies considerably. [citation needed]

Player bots

Use of player bots or computer assistance while playing online poker is prohibited by most, if not all, online sites. Actions taken for breaches are a permanent ban and confiscation of winnings. [citation needed] One kind of bot can interface with the poker client (in other words, play by itself as an auto player) without the help of its human operator. Real-Time Assistance (RTA) is another method of using computer programs. RTA is when a human player uses program called an “solver” such as PioSOLVER or PokerSnowie[4], running on a different computer[5], to make their decisions.

The issue of unfair advantage is twofold. For one, bots can play for many hours at a time without human weaknesses such as fatigue and can endure the natural variances of the game without being influenced by human emotion (or "tilt"). Older bots used to have some significant disadvantages - for example, it used to be difficult for a bot to accurately read a bluff or adjust to the strategy of opponents the way humans can. Since 2019, however, the computer program Pluribus (poker bot) is successful enough at reading bluffs and adjusting to strategy that it consistently beats professional poker players at 6-player no-limit Hold’em [6][7][8].

House enforcement

While the terms and conditions of poker sites generally forbid the use of bots, the level of enforcement depends on the site operator. Some will seek out and ban bot users through the utilization of a variety of software tools. The poker client can be programmed to detect bots although this is controversial in its own right as it might be seen as tantamount to embedding spyware in the client software.[citation needed] Another method is to use CAPTCHAs at random intervals during play to catch automated bots, although isn’t effective against Real-Time Assistance.

House bots

“House bots” can pose a conflict of interest. By the strictest definition, a house bot is an automated player operated by the online poker room itself. These type of bots would be the equivalent of brick and mortar shills.

In a brick and mortar casino, a house player does not subvert the fairness of the game being offered as long as the house is dealing honestly. In an online setting the same is also true. By definition, an honest online poker room that chooses to operate house bots would guarantee that the house bots did not have access to any information not also available to any other player in the hand (the same would apply to any human shill as well). The problem is that in an online setting the house has no way to prove their bots are not receiving sensitive information from the card server. This is further exacerbated by the ease with which clandestine information sharing can be accomplished in a digital environment. It is essentially impossible even for the house to prove that they do not control some players.

Artificial Intelligence

Like in the games of chess, Go (game), Jeopardy!, and many other games, artificial intelligence systems beat even the best humans at poker[9]. Poker is a game of imperfect information (because some cards in play are concealed) thus making it harder for anyone (including a computer) to deduce the final outcome of the hand. Because of this lack of information, the computer's programmers used to have to implement systems based on the Bayes theorem, Nash equilibrium, Monte Carlo simulation or neural networks, all of which are imperfect techniques. Pluribus, however, perfected poker by only looking ahead a few moves to determine what action to take, rather than attempting to evaluate all moves until the end of the game.

Older AIs like PokerSnowie and Claudico were created by allowing the computer to determine the best possible strategy by letting it play itself an enormous number of times. For years, this was the approach to poker AI, as opposed to attempting to make a computer that plays like a human. This resulted in odd bet sizing and a much different strategy than humans are used to seeing.

Methods were first developed to approximate perfect poker strategy from the game theory perspective in the heads-up (two player) game, and then for the multi-player game. Perfect strategy has multiple meanings in this context. From a game-theoretic optimal point of view, a perfect strategy is one that cannot expect to lose to any other player's strategy; however, optimal strategy can vary in the presence of sub-optimal players who have weaknesses that can be exploited. In this case, a perfect strategy is one that correctly or closely models those weaknesses and takes advantage of them to make a profit, such as those explained above.

AI broke through to superhuman performance in poker during the 2010s, with the following timeline. In 2015, computers solved heads-up limit hold'em via Cepheus. Around 2018, Libratus demonstrated superhuman ability in heads-up no-limit hold'em. In 2019, Pluribus (a newer version of Libratus)[10] demonstrated superhuman ability at six-player no-limit hold'em, the most commonly played single variety of poker in the world.[11]. In 2021, Microsoft released the older poker-playing program, Libratus, commercially, which then beat four professional poker players in a 20-day long poker competition at Rivers Casino[12].

Research groups

Neo Poker Laboratory

Neo Poker Lab was an established science team focused on the research of poker artificial intelligence. For several years it developed and applied state-of-the-art algorithms and procedures like regret minimization and gradient search equilibrium approximation, decision trees, recursive search methods as well as expert algorithms to solve a variety of problems related to the game of poker. Neo Poker Lab’s website, https://www.neopokerlab.com is no longer running.

The University of Auckland Game AI Group

Until 2017, a team from the University of Auckland consisted of a small number of scientists who employ case-based reasoning to create and enhance Texas Hold’em poker agents. The group applied different AI techniques to a number of games including participation in the commercial projects Small Worlds and Civilization (video game).

Computer Poker Research Group (University of Alberta, Canada)

Until 2019, a large amount of the research into computer poker players was being performed at the University of Alberta by the Computer Poker Research Group, led by Dr. Michael Bowling. The group developed the agents Poki, PsOpti, Hyperborean and Polaris. Poki has been licensed for the entertainment game STACKED featuring Canadian poker player Daniel Negreanu. PsOpti was available under the name "SparBot" in the poker training program "Poker Academy". The series of Hyperborean programs have competed in the Annual Computer Poker Competition, most recently taking three gold medals out of six events in the 2012 competition. The same line of research also produced Polaris, which played against human professionals in 2007 and 2008, and became the first computer poker program to win a meaningful poker competition.

In January 2015, an article in Science[13] by Michael Bowling, Neil Burch, Michael Johanson, and Oskari Tammelin claimed that their poker bot Cepheus had "essentially weakly solved" the game of heads-up limit Texas hold 'em.[14][15][16]

School of Computer Science from Carnegie Mellon University

T. Sandholm and A. Gilpin from Carnegie Mellon University started poker AI research in 2004 beginning with unbeatable agent for 3-card game called Rhode-Island Hold 'em. Next step was GS1 which outperformed the best commercially available poker bots. Since 2006 poker agents from this group have participated in annual computer competitions. "At some point we will have a program better than the best human players" – claimed Sandholm. His bot, Claudico, faced off against four human opponents in 2015. In 2017 the program's latest software, Libratus, faced off against four professional poker players. By the end of the experiment the four human players had lost a combined $1.8 million of simulated money to Libratus.[17]

Libratus was later replaced by the final version called Pluribus (poker bot).

Historic contests

ICCM 2004 PokerBot competition

One of the earliest no-limit poker bot competitions was organized in 2004 by International Conference on Cognitive Modelling.[18] The tournament hosted five bots from various universities from around the world. The winner was Ace Gruber, from University of Toronto.[19]

ACM competitions

The Association for Computing Machinery (ACM) used to host competitions where the competitors submit a piece of software capable of playing poker on their specific platform. The event hosts conducted the contests by operating the software and reporting the results.[citation needed]

The 2005 World Series of Poker Robots

In the summer 2005, the online poker room Golden Palace hosted a promotional tournament in Las Vegas, at the old Binions, with a $100k giveaway prize. It was billed as the 2005 World Series of Poker Robots. The tournament was bots only with no entry fee. The bot developers were computer scientists from six nationalities who traveled at their own expense. The host platform was Poker Academy. The event also featured a demonstration headsup event with Phil Laak.

University of Alberta's Man V Machine experiments

In the summer 2007, the University of Alberta hosted a highly specialized headsup tournament between humans and their Polaris bot, at the AAAI conference in Vancouver, BC, Canada. The host platform was written by the University of Alberta. There was a $50k maximum giveaway purse with special rules to motivate the humans to play well. The humans paid no entry fee. The unique tournament featured four duplicate style sessions of 500 hands each. The humans won by a narrow margin.

In the summer of 2008, the University of Alberta and the poker coaching website Stoxpoker ran a second tournament during the World Series of Poker in Las Vegas. The tournament had six duplicate sessions of 500 hands each, and the human players were Heads-Up Limit specialists. Polaris won the tournament with 3 wins, 2 losses and a draw. The results of the tournament, including the hand histories from the matches, are available on the competition website.

The 2015 Brains vs AI competition by Rivers Casino, CMU and Microsoft

From April–May 2015, Carnegie Mellon University Sandholm's bot, Claudico, faced off against four human opponents, in a series of no-limit Texas Hold'em matches.[20][21] Finally, after playing 80,000 hands, humans were up by a combined total of $732,713. But even though humans technically won, scientists considered the win as statistically insignificant (rather, a statistical tie) when that $732,713 is compared to the total betting amount of $170,000,000 ($170 million). However, some have determined this claim to be disingenuous.[22] Statistically insignificant here means that the programmers of Claudico can not say with 95% confidence (a 95% confidence interval) that humans are better than the computer program. However, it is a statistically significant win on a 90% confidence interval. This means that the human players are somewhere between a 10 to 1 and 20 to 1 favorite.[23]

The way the tournament was structured was in two sets of two players each. In each of the two sets, the players got the opposite cards. Meaning if the computer has As9c (Ace of Spades & Nine of Clubs) and the human has Jh8d on one computer, the other of the two players in the set will have As9c up against the computer's Jh8d. However, even with the human players winning more than the computer—not all of the players were positive in their head-to-head match ups.

The totals for each of the players winnings were as follows:

  • Douglas Polk: +$213,671
  • Dong Kim: +$70,491
  • Bjorn Li: +$529,033
  • Jason Les: -$80,482[24]

The Annual Computer Poker Competition

From 2006 to 2018, the Annual Computer Poker Competition ran a series of competitions for poker programs. Since 2010, three types of poker were played: Heads-Up Limit Texas Hold'em, Heads-Up No-Limit Texas Hold'em, and 3-player Limit Texas Hold'em. Within each event, two winners are named: the agent that wins the most matches (Bankroll Instant Run-off), and the agent that wins the most money (Total Bankroll). These winners are often not the same agent, as Bankroll Instant Run-off rewards robust players, and Total Bankroll rewards players that are good at exploiting the other agents' mistakes. The competition was motivated by scientific research, and there was an emphasis on ensuring that all of the results are statistically significant by running millions of hands of poker. The 2012 competition had the same formats with more than 70 million hands played to eliminate luck factor.

Some researchers developed web application where people could play and assess quality of the AI. So as of December 2012 the following top groups and individual researchers’ agents could be found:

  • Hyperborean (9 gold, 5 silver and 3 bronze)
  • Bluffbot (1 gold, 3 silver and 2 bronze medals)
  • Sartre (1 gold, 5 silver and 3 bronze medals)
  • Neo Poker Bot (1 gold, 5 bronze medals)

Results

2010 [25]
Heads-up Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. PULPO (Marv Andersen, UK)
2. Hyperborean-TBR (University of Alberta, Canada)
3. Sartre (University of Auckland, New Zealand)
1. Rockhopper (David Lin, USA)
2. GGValuta (Mihai Ciucu, Romania)
3. Hyperborean-IRO (University of Alberta, Canada)
Heads Up No Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Tartanian4-TBR (Carnegie Mellon University, USA)
2. PokerBotSLO (Universities of Maribor & Ljubljana, Slovenia)
3. HyperboreanNL-TBR (University of Alberta, Canada)
1. HyperboreanNL-IRO (University of Alberta, Canada)
2. SartreNL (University of Auckland, New Zealand)
3. Tartanian4-IRO (Carnegie Mellon University, USA)
3-max Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Hyperborean3P-TBR (University of Alberta, Canada)
2. LittleRock (Rod Byrnes, Australia)
3. Bender (Technical University Darmstadt, German)
1. Hyperborean3P-IRO (University of Alberta, Canada)
2. dcu3pl-IRO (Dublin City University, Ireland)
3. LittleRock (Rod Byrnes, Australia)
2011 [26]

[27]

Heads-up Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Calamari (Marv Andersen, UK)
2. Sartre (University of Auckland, New Zealand)
3. Hyperborean-2011-2p-limit-tbr (University of Alberta, Canada)
1. Hyperborean-2011-2p-limit-iro (University of Alberta, Canada)
2. Slumbot (Eric Jackson, USA)
3. Calamari (Marv Andersen, UK)
Heads Up No Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Lucky7 (Mikrospin d.o.o., Slovenia)
2. SartreNL (University of Auckland, New Zealand)
3. Hyperborean-2011-2p-nolimit-tbr (University of Alberta, Canada)
1. Hyperborean-2011-2p-nolimit-iro (University of Alberta, Canada)
2. SartreNL (University of Auckland, New Zealand)
3. Hugh (USA & Canada)
3-max Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Sartre3p (University of Auckland, New Zealand)
2. Hyperborean-2011-3p-limit-tbr (University of Alberta, Canada)
3. AAIMontybot (Charles University in Prague, Czech Republic)
3. LittleRock (Rod Byrnes, Australia)
1. Hyperborean-2011-3p-limit-iro (University of Alberta, Canada)
2. Sartre3p (University of Auckland, New Zealand)
3. LittleRock (Rod Byrnes, Australia)
2012
Heads-up Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Slumbot (Eric Jackson, USA)
2. Little Rock (Rod Byrnes, Australia)
2. Zbot (Ilkka Rajala, Finland)
1. Slumbot (Eric Jackson, USA)
2. Hyperborean (University of Alberta, Canada)
3. Zbot (Ilkka Rajala, Finland)
Heads Up No Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Little Rock (Rod Byrnes, Australia)
2. Hyperborean (University of Alberta, Canada)
3. Tartanian5 (Carnegie Mellon University, USA)
1. Hyperborean (University of Alberta, Canada)
2. Tartanian5 (Carnegie Mellon University, USA)
3. Neo Poker Bot (Alexander Lee, Spain)
3-max Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada)
2. Little Rock (Rod Byrnes, Australia)
3. Neo Poker Bot (Alexander Lee, Spain)
3. Sartre (University of Auckland, New Zealand)
1. Hyperborean (University of Alberta, Canada)
2. Little Rock (Rod Byrnes, Australia)
3. Neo Poker Bot (Alexander Lee, Spain)
3. Sartre (University of Auckland, New Zealand)
2013
Heads-up Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Marv (Marv Anderson, UK)
2. Feste (François Pays, France)
2. Hyperborean (University of Alberta, Canada)
1. Neo Poker Bot (Alexander Lee, Spain)
2. Hyperborean (University of Alberta, Canada)
3. Zbot (Ilkka Rajala, Finland)
3. Marv (Marv Anderson, UK)
Heads Up No Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Slumbot NL (Eric Jackson, USA)
2. Hyperborean (University of Alberta, Canada)
3. Tartanian6 (Carnegie Mellon University, USA)
1. Hyperborean (University of Alberta, Canada)
2. Slumbot NL (Eric Jackson, USA)
3. Tartanian6 (Carnegie Mellon University, USA)
3. Nyx (Charles University, Czech Republic)
3-max Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada)
2. Little Rock (Rod Byrnes, Australia)
3. Neo Poker Bot (Alexander Lee, Spain)
1. Hyperborean (University of Alberta, Canada)
2. Little Rock (Rod Byrnes, Australia)
3. Neo Poker Bot (Alexander Lee, Spain)
2014
Heads-up Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Escabeche (Marv Andersen, UK)
2. SmooCT (University College London, UK)
3. Hyperborean (University of Alberta, Canada)
3. Feste (Francois Pays, France)


Heads Up No Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Tartanian7 (Carnegie Mellon University, USA)
2. Nyx (Charles University, Czech Republic)
2. Prelude (Unfold Poker, USA)
2. Slumbot (Eric Jackson, USA)
1. Tartanian7 (Carnegie Mellon University, USA)
2. Prelude (Unfold Poker, USA)
2. Hyperborean (University of Alberta, Canada)
2. Slumbot (Eric Jackson, USA)
3-max Limit Texas Hold'em
Total Bankroll Bankroll Instant Run-off
1. Hyperborean (University of Alberta, Canada)
2. SmooCT (University College London, UK)
3. KEmpfer (Technische Universität Darmstadt, Germany)
1. Hyperborean (University of Alberta, Canada)
2. SmooCT (University College London, UK)
3. KEmpfer (Technische Universität Darmstadt, Germany)

Pluribus

The final poker contest was not public. When the Pluribus (poker bot) program consistently beat professionals at 6-hand no-limit Hold’em, the result was quietly announced on a Facebook post[28].

See also

References

  1. ^ Nature. “DeepMind AI topples experts at complex game Stratego”. Anil Ananthaswamy. NEWS 01 December 2022, Clarification 05 December 2022.
  2. ^ Nature. “ No limit: AI poker bot is first to beat professionals at multiplayer game.” Douglas Heaven. 11 July 2019. https://www.nature.com/articles/d41586-019-02156-9
  3. ^ https://www.scientificamerican.com/article/ai-learns-what-an-infant-knows-about-the-physical-world/?amp=true Scientific American. “AI Learns What an Infant Knows About the Physical World.” Dana G. Smith, July 11, 2022.
  4. ^ Seven Games: A Human History. Chapter 5. Oliver Roeder, 2022. Publisher: W.W. Norton and Co.
  5. ^ https://docs.house.gov/meetings/IF/IF17/20131210/101570/HHRG-113-IF17-Wstate-EggertK-20131210.pdf Testimony of Kurt Eggert Professor of Law Chapman University Dale E. Fowler School of Law Before the House Energy and Commerce Committee Subcommittee on Commerce, Manufacturing, and Trade At a Hearing Entitled: “The State of Online Gaming” Rayburn House Office Building, Washington, DC December 10, 2013
  6. ^ Brown, Noam; Sandholm, Tuomas (30 August 2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. Bibcode:2019Sci...365..885B. doi:10.1126/science.aay2400. PMID 31296650. S2CID 195892791.
  7. ^ https://www.cmu.edu/news/stories/archives/2019/july/cmu-facebook-ai-beats-poker-pros.html Carnegie Mellon University. “Carnegie Mellon and Facebook AI Beats Professionals in Six-Player Poker: ‘Superhuman’ card shark achieves new AI milestone.” July 11, 2019.
  8. ^ https://www.forbes.com/sites/bernardmarr/2019/09/13/artificial-intelligence-masters-the-game-of-poker--what-does-that-mean-for-humans/amp/ Forbes. “Artificial Intelligence Masters The Game of Poker – What Does That Mean For Humans?” Sep 13, 2019.
  9. ^ https://www.scientificamerican.com/podcast/episode/this-artificial-intelligence-learns-like-a-widdle-baby/ Scientific American. “This Artificial Intelligence Learns like a Baby.” August 6, 2022. Christopher Intagliata
  10. ^ https://www.smithsonianmag.com/smart-news/poker-playing-ai-knows-when-hold-em-when-fold-em-180972643/ Smithsonian Magazine. “This Poker-Playing A.I. Knows When to Hold ‘Em and When to Fold ‘Em”. Meilan Solly, July 15, 2019.
  11. ^ https://www.npr.org/2019/07/11/740661470/bet-on-the-bot-ai-beats-the-professionals-at-6-player-texas-hold-em NPR. “Bet On The Bot: AI Beats The Professionals At 6-Player Texas Hold 'Em.” Merritt Kennedy. July 11, 2019.
  12. ^ “AI’s Disruption Of The Strategy Gaming Space Proves That Machines Are Getting Smarter.” Forbes. Annie Brown. Nov 10, 2021,04:42pm EST.
  13. ^ Bowling, Michael; Burch, Neil; Johanson, Michael; Tammelin, Oskari (Jan 2015). "Heads-up limit hold'em poker is solved". Science. 347 (6218): 145–9. Bibcode:2015Sci...347..145B. CiteSeerX 10.1.1.697.72. doi:10.1126/science.1259433. PMID 25574016. S2CID 3796371.
  14. ^ Philip Ball (2015-01-08). "Game Theorists Crack Poker". Nature. doi:10.1038/nature.2015.16683. S2CID 155710390. Retrieved 2015-01-13. {{cite journal}}: Unknown parameter |agency= ignored (help)
  15. ^ Robert Lee Hotz (2015-01-08). "Computer Conquers Texas Hold 'Em, Researchers Say". Wall Street Journal.
  16. ^ Bob McDonald (2015-01-10). "Poker Computer Takes the Pot [audio interview]". Quirks & Quarks (Podcast).
  17. ^ Joshua Brustein (31 January 2017). "Daftar Situs Poker Online Pkv Games". Bloomberg.com. Bloomberg. Retrieved 2 February 2017.
  18. ^ "Iccm 2004".
  19. ^ https://www.era.lib.ed.ac.uk/bitstream/1842/2392/2/Carter%20RG%20thesis%2007.pdf [bare URL PDF]
  20. ^ Marilyn Malara (April 25, 2015). "Brains vs. AI: Computer faces poker pros in no-limit Texas Hold'em". UPI. Retrieved April 26, 2015.
  21. ^ "Rivers Casino's Brains vs AI". Archived from the original on 2016-02-03. Retrieved 2015-10-23.
  22. ^ "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.
  23. ^ "Brains Vs. AI | Carnegie Mellon School of Computer Science". www.cs.cmu.edu. Retrieved 2016-02-10.
  24. ^ "Brains vs Artificial Intelligence". www.riverscasino.com. Archived from the original on 2016-02-03. Retrieved 2016-02-10.
  25. ^ "CPRG".
  26. ^ "CPRG".
  27. ^ "Poker Bots Evolution – from History to the Future". 8 July 2020.
  28. ^ Solly, Meilan. "This Poker-Playing A.I. Knows When to Hold 'Em and When to Fold 'Em". Smithsonian.