Evaluate performance of NLTH AIs under full multi-player engagement

Determine the performance of no-limit Texas hold'em AI agents, including DeepStack, Libratus, and Pluribus, in multi-player games where all players participate in every betting round rather than folding early, thereby assessing their effectiveness in settings with sustained multi-player engagement.

Background

Recent AI systems such as DeepStack, Libratus, and Pluribus have achieved strong results in no-limit Texas hold'em (NLTH), including multi-player settings. However, typical NLTH dynamics often see most players folding after the opening round, effectively reducing many hands to two-player scenarios. This raises concerns about how well these AIs perform when the game maintains robust multi-player engagement throughout all betting rounds.

The authors emphasize that Liar's Poker intrinsically enforces continuous multi-player participation across bidding rounds, motivating its use as a testbed. Nonetheless, the specific question of how NLTH AIs fare when every player remains engaged across rounds is explicitly identified as unresolved in the introduction.

References

It is, therefore, unclear how well these AI programs perform in a multi-player setting where all players participate in each betting round.

— Outbidding and Outbluffing Elite Humans: Mastering Liar's Poker via Self-Play and Reinforcement Learning (2511.03724 - Dewey et al., 5 Nov 2025) in Section 1: Introduction (following footnote on folding rates), page 2

Evaluate performance of NLTH AIs under full multi-player engagement

Background

References

Related Problems