CREW: Facilitating Human-AI Teaming Research

Published 31 Jul 2024 in cs.HC, cs.AI, and cs.LG | (2408.00170v3)

Abstract: With the increasing deployment of AI technologies, the potential for humans to work with AI agents has been growing rapidly. Human-AI teaming is an important paradigm for studying the many aspects of humans and AI agents working together. The unique challenge of Human-AI teaming research is the need to study humans and AI agents jointly, demanding multidisciplinary efforts spanning machine learning, human-computer interaction, robotics, cognitive science, neuroscience, psychology, social science, and complex systems. However, existing platforms for Human-AI teaming research are limited: they often support only oversimplified scenarios or a single task, or focus exclusively on either human-teaming research or multi-agent AI algorithms. We introduce CREW, a platform that facilitates Human-AI teaming research in real-time decision-making scenarios and engages collaborations across multiple scientific disciplines, with a strong emphasis on human involvement. It includes pre-built tasks for cognitive studies and Human-AI teaming, with room for expansion thanks to its modular design. Following conventional cognitive neuroscience practice, CREW also supports multimodal recording of human physiological signals for behavior analysis. Moreover, CREW benchmarks real-time human-guided reinforcement learning agents using state-of-the-art algorithms and well-tuned baselines. With CREW, we were able to conduct 50 human-subject studies within a week to verify the effectiveness of our benchmark.


Summary

  • The paper introduces the CREW platform that enables multidisciplinary, scalable human-AI teaming with integrated real-time communication and extensive data collection.
  • It details a modular design supporting role differentiation, human feedback mechanisms, and reproducible experimental settings.
  • Benchmarking results reveal that individual cognitive differences significantly impact training outcomes, underscoring the need for personalized AI-human collaboration strategies.

Facilitating Human-AI Teaming Research with "CREW"

Introduction

The paper introduces "CREW: Facilitating Human-AI Teaming Research" (2408.00170), focusing on developing a comprehensive platform to enhance the study and application of Human-AI collaboration. As AI systems become increasingly integrated into daily life, the necessity for effective collaboration between humans and AI agents grows. Traditional AI research centers on isolated algorithm development, whereas Human-AI teaming requires a multidisciplinary approach, incorporating insights from fields such as cognitive science, neuroscience, and complex systems. The paper addresses limitations in existing platforms and proposes a solution with CREW, emphasizing the involvement of various scientific domains.

Platform Vision and Design

CREW is designed to support a wide range of tasks and multi-task environments, addressing critical challenges such as real-time communication and large-scale human data collection. Its extensible environments let researchers develop and modify experimental settings with ease while supporting real-time interaction between humans and AI agents.

Figure 1: CREW supports multiple tasks from single-agent tasks to multi-agent competitive settings and offers various camera views for perceptual-motor research.


Figure 2: Environment generation in CREW with randomized mazes and procedurally generated terrains.
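Randomized maze generation of the kind shown in Figure 2 is commonly implemented with depth-first backtracking. The snippet below is a minimal sketch of that standard technique as an illustration only; it is not CREW's actual generator, and the cell/wall representation is an assumption.

```python
import random

def generate_maze(w, h, seed=None):
    """Carve a w x h 'perfect' maze with iterative depth-first backtracking.

    Returns a dict mapping each cell (x, y) to the set of neighbouring
    cells it has an open passage to.
    """
    rng = random.Random(seed)
    passages = {(x, y): set() for x in range(w) for y in range(h)}
    visited = {(0, 0)}
    stack = [(0, 0)]
    while stack:
        x, y = stack[-1]
        # Unvisited orthogonal neighbours of the current cell.
        frontier = [
            (x + dx, y + dy)
            for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1))
            if (x + dx, y + dy) in passages and (x + dx, y + dy) not in visited
        ]
        if frontier:
            nxt = rng.choice(frontier)
            passages[(x, y)].add(nxt)   # knock down the wall both ways
            passages[nxt].add((x, y))
            visited.add(nxt)
            stack.append(nxt)
        else:
            stack.pop()                 # dead end: backtrack
    return passages
```

Because the carved passages form a spanning tree of the grid, every cell is reachable and there is exactly one path between any two cells, which makes such mazes convenient for reproducible navigation tasks.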

CREW enables hybrid Human-AI teaming modes, allowing both collaborative and competitive settings across multiple environment instances. By supporting parallel sessions, CREW significantly improves the scalability and efficiency of large-scale experiments, which is particularly beneficial when collecting data from multiple human subjects simultaneously.
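As a rough sketch of how parallel sessions might be orchestrated, the snippet below runs several simulated sessions concurrently and collects their logs. The `run_session` function and its log fields are hypothetical placeholders, not CREW's API.

```python
from concurrent.futures import ThreadPoolExecutor

def run_session(session_id, num_steps=5):
    """Simulate one human-AI teaming session and return its event log."""
    log = []
    for step in range(num_steps):
        # In a real platform each step would exchange observations,
        # actions, and human feedback with a live environment instance.
        log.append({"session": session_id, "step": step})
    return log

def run_parallel_sessions(num_sessions=4):
    """Run several subject sessions concurrently and gather their logs in order."""
    with ThreadPoolExecutor(max_workers=num_sessions) as pool:
        return list(pool.map(run_session, range(num_sessions)))

logs = run_parallel_sessions(4)
```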

Human and Agent Role Assignment

The platform differentiates between roles such as "Player," "Viewer," "Server," and "AI Agent." The "Player" role provides direct control over an agent, whereas "Viewer" allows users to observe and provide feedback. CREW's human feedback mechanisms are particularly refined, supporting both continuous and discrete scalar feedback and thereby enabling detailed guidance for AI agents.

Figure 3: Simple connectivity setup in CREW, enabling participation across various tasks through IP selection.
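The two feedback modes described above can be sketched as simple signal mappings: discrete feedback maps button presses to fixed values, while continuous feedback clamps a dial or slider reading into a bounded range. The function names, key labels, and the [-1, 1] range below are illustrative assumptions, not CREW's actual interface.

```python
def discrete_feedback(key):
    """Map hypothetical 'good'/'bad' button presses to a scalar signal;
    anything else counts as no feedback."""
    return {"good": 1.0, "bad": -1.0}.get(key, 0.0)

def continuous_feedback(reading, lo=-1.0, hi=1.0):
    """Clamp a raw slider/dial reading into the feedback range [lo, hi]."""
    return max(lo, min(hi, reading))
```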

Data Collection and Analysis

CREW's data collection covers both agent data and human physiological data, with synchronized streaming via the Lab Streaming Layer and integration with tools such as Weights & Biases for detailed monitoring. The platform's modular design enables algorithm customization and deployment, which is crucial for integrating real-time human feedback into RL algorithms.

Figure 4: CREW's comprehensive data collection from game states to physiological signals, streamed through Lab Streaming Layer.
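The Lab Streaming Layer delivers each stream with timestamps, which lets heterogeneous recordings (game states, physiological signals) be aligned after the fact. Below is a minimal pure-Python sketch of that kind of nearest-timestamp alignment; it illustrates the idea, not CREW's or LSL's implementation, and the data layout is an assumption.

```python
import bisect

def align_streams(game_events, physio_samples):
    """For each game event, pick the physiological sample closest in time.

    Both inputs are lists of (timestamp, value) pairs sorted by timestamp.
    Returns a list of (event_value, physio_value) pairs.
    """
    physio_ts = [t for t, _ in physio_samples]
    aligned = []
    for t, event in game_events:
        i = bisect.bisect_left(physio_ts, t)
        # Consider the samples just before and just after the event time,
        # and keep whichever is nearer.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(physio_ts)]
        j = min(candidates, key=lambda j: abs(physio_ts[j] - t))
        aligned.append((event, physio_samples[j][1]))
    return aligned
```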


Figure 5: Human subjects with the highest cognitive test scores guided agents to better performance, highlighting the impact of individual cognitive differences.

Benchmarking and Results

CREW was used to benchmark the c-Deep TAMER algorithm against RL baselines across diverse scenarios involving 50 human subjects, yielding significant findings about human-guided training. The correlation between cognitive test scores and agent-guiding performance underscores the importance of individual human differences in training outcomes.

Figure 6: c-Deep TAMER training examples showcase human feedback integration.
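TAMER-style agents learn a model of the human's feedback signal and act greedily with respect to it. The toy tabular sketch below illustrates that core loop, with a crude sliding-window stand-in for credit assignment over the delay between behavior and feedback; it is a simplification for exposition, not the paper's c-Deep TAMER implementation, and all names here are made up.

```python
from collections import defaultdict, deque

class TabularTamer:
    """Toy TAMER-style learner: fit H(s, a), an estimate of the human's
    feedback, and act greedily with respect to it."""

    def __init__(self, actions, lr=0.1, window=3):
        self.H = defaultdict(float)          # estimated human feedback per (s, a)
        self.actions = actions
        self.lr = lr
        self.recent = deque(maxlen=window)   # recent (s, a) pairs awaiting credit

    def act(self, state):
        # Greedy action under the current feedback model.
        a = max(self.actions, key=lambda a: self.H[(state, a)])
        self.recent.append((state, a))
        return a

    def give_feedback(self, signal):
        # Spread the scalar human signal over the recent (s, a) window,
        # nudging each estimate toward the observed signal.
        for s, a in self.recent:
            self.H[(s, a)] += self.lr * (signal - self.H[(s, a)])
```

In use, a human watches the agent act and presses a feedback key; after a few positive signals for the preferred action, the greedy policy locks onto it.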


Figure 7: Linear regression plots detailing cognitive test scores' correlation with training performance.

Conclusion

CREW represents a significant advancement in Human-AI teaming research, providing an infrastructure for multidisciplinary collaboration. Future developments will focus on expanding environment diversity and supporting advanced physiological data analysis techniques. CREW's modular and open design sets a new standard for scalable, reproducible research in human-AI teaming.

Looking ahead, the authors intend CREW to foster interdisciplinary collaboration, serve as a benchmark for Human-AI interaction, and support diverse research directions in both AI capability and human cognition.
