An Implementation of Werewolf Agent That does not Truly Trust LLMs
Abstract: Werewolf is an incomplete-information game that poses several challenges for building a computer agent as a player, because such agents lack situational understanding and individuality in their utterances (e.g., they cannot produce characterful utterances or lie when the situation calls for it). We propose a werewolf agent that addresses some of these difficulties by combining an LLM with a rule-based algorithm. Specifically, the agent first analyzes the conversation history with an LLM; based on the results, a rule-based algorithm then selects the output either from an LLM or from a template prepared beforehand. This allows the agent to refute in specific situations, identify when to end the conversation, and behave with a persona. As a result, the approach mitigated conversational inconsistencies and facilitated logical utterances. In a qualitative evaluation, our agent was perceived as more human-like than an unmodified LLM. The agent is freely available to help advance research on the Werewolf game.
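The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Analysis` flags, the `TEMPLATES` strings, the rule order, and the keyword-based `stub_analyze` (a stand-in for the LLM-based analysis of the conversation history) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Analysis:
    """Hypothetical flags an LLM-based analyzer might extract from the history."""
    accused: bool = False         # someone claims this agent is a werewolf
    talk_exhausted: bool = False  # the discussion has nothing new to add


# Hypothetical templates prepared beforehand (wording is an assumption).
TEMPLATES = {
    "refute": "I am not a werewolf. {accuser}, what is your evidence?",
    "end": "Over",
}


def stub_analyze(history: List[str], agent_name: str) -> Analysis:
    """Keyword-based stand-in for the LLM analysis step (assumption)."""
    last = history[-1].lower() if history else ""
    accused = agent_name.lower() in last and "werewolf" in last
    # Treat an exact repeat of the previous utterance as "nothing new to add".
    talk_exhausted = len(history) >= 2 and history[-1] == history[-2]
    return Analysis(accused=accused, talk_exhausted=talk_exhausted)


def select_utterance(
    analysis: Analysis,
    llm_generate: Callable[[List[str]], str],
    history: List[str],
    accuser: str = "You",
) -> str:
    """Rule-based choice between a prepared template and free LLM output."""
    if analysis.talk_exhausted:
        return TEMPLATES["end"]
    if analysis.accused:
        return TEMPLATES["refute"].format(accuser=accuser)
    # No rule fired: fall back to the LLM-generated utterance.
    return llm_generate(history)


if __name__ == "__main__":
    history = ["Agent[02]: I think Agent[01] is a werewolf."]
    analysis = stub_analyze(history, "Agent[01]")
    # A real agent would call an LLM here; this lambda is a placeholder.
    print(select_utterance(analysis, lambda h: "(LLM output)", history, "Agent[02]"))
```

In this sketch the rules are checked before the LLM output is used, so the LLM's free-form text is only emitted when no template applies. Which situations get a template, and in what priority, is a design choice the abstract does not specify.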