Balancing the AI Strength of Roles in Self-Play Training with Regret Matching+
Abstract: When training artificial intelligence for games with multiple roles, developing a single generalized model capable of controlling any character in the game is a viable option. This strategy conserves computational resources and time during training and also reduces resource requirements during deployment. However, training such a generalized model often runs into uneven capability across the roles it controls. A simple method based on Regret Matching+ is introduced, which helps the model achieve more balanced strength when controlling different roles.
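The abstract does not spell out how Regret Matching+ is applied, so the following is only a minimal sketch of the generic Regret Matching+ update, under the assumption that each "action" is a role to train next and that a role's utility is its current loss rate (so weaker roles attract more training). The class name `RegretMatchingPlus`, the `loss_rates` values, and the choice of utility are all hypothetical illustrations, not the paper's stated procedure.

```python
class RegretMatchingPlus:
    """Generic Regret Matching+ over a fixed set of actions (here: roles)."""

    def __init__(self, n_actions):
        self.n = n_actions
        # Cumulative regrets; RM+ keeps them clipped at zero.
        self.cum_regret = [0.0] * n_actions

    def strategy(self):
        """Play each action in proportion to its (non-negative) cumulative regret."""
        total = sum(self.cum_regret)
        if total <= 0:
            # No positive regret yet: fall back to the uniform distribution.
            return [1.0 / self.n] * self.n
        return [r / total for r in self.cum_regret]

    def update(self, utilities):
        """Add instantaneous regrets, then clip at zero (the '+' in RM+)."""
        strat = self.strategy()
        expected = sum(p * u for p, u in zip(strat, utilities))
        self.cum_regret = [
            max(0.0, r + (u - expected))
            for r, u in zip(self.cum_regret, utilities)
        ]


# Hypothetical usage: utility of picking a role = its measured loss rate
# (assumption for illustration), so the weakest role gains the most weight.
loss_rates = [0.7, 0.5, 0.3]
selector = RegretMatchingPlus(n_actions=3)
selector.update(loss_rates)
role_probs = selector.strategy()
```

In this sketch the resulting `role_probs` could be used to sample which role the shared model controls in the next batch of self-play games. Clipping cumulative regret at zero lets the distribution react quickly once a previously strong role starts to fall behind.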
- “A simple adaptive procedure leading to correlated equilibrium” In Econometrica 68.5 Wiley Online Library, 2000, pp. 1127–1150
- “Mastering the game of Go with deep neural networks and tree search” In Nature 529.7587 Nature Publishing Group, 2016, pp. 484–489
- Oskari Tammelin “Solving large imperfect information games using CFR+” In arXiv preprint arXiv:1407.5042, 2014
- Gerald Tesauro “Temporal difference learning and TD-Gammon” In Communications of the ACM 38.3, 1995, pp. 58–68