
Deconstructing Human-AI Collaboration: Agency, Interaction, and Adaptation

Published 18 Apr 2024 in cs.HC and cs.AI (arXiv:2404.12056v1)

Abstract: As full AI-based automation remains out of reach in most real-world applications, the focus has instead shifted to leveraging the strengths of both human and AI agents, creating effective collaborative systems. The rapid advances in this area have yielded increasingly complex systems and frameworks, while the nuance of their characterization has become increasingly vague. Similarly, existing conceptual models no longer capture the elaborate processes of these systems nor describe the entire scope of their collaboration paradigms. In this paper, we propose a new unified set of dimensions through which to analyze and describe human-AI systems. Our conceptual model is centered around three high-level aspects - agency, interaction, and adaptation - and is developed through a multi-step process. Firstly, an initial design space is proposed by surveying the literature and consolidating existing definitions and conceptual frameworks. Secondly, this model is iteratively refined and validated through semi-structured interviews with nine researchers in the field. Lastly, to illustrate the applicability of our design space, we use it to provide structured descriptions of selected human-AI systems.

References (49)
  1. Effects of communication directionality and AI agent differences in human-AI interaction. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2021), ACM, pp. 1–15.
  2. Guidelines for human-AI interaction. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2019), ACM, pp. 1–13.
  3. Comparing visual-interactive labeling with active learning: An experimental study. IEEE Trans. on Visualization and Computer Graphics 24, 1 (2018), 298–308.
  4. Brehmer M., Munzner T.: A multi-level typology of abstract visualization tasks. IEEE Trans. on Visualization and Computer Graphics 19, 12 (2013), 2376–2385.
  5. A provenance task abstraction framework. IEEE Computer Graphics and Applications 39, 6 (2019), 46–60.
  6. Characterizing guidance in visual analytics. IEEE Trans. on Visualization and Computer Graphics 23, 1 (2017), 111–120.
  7. A review of guidance approaches in visual data analysis: A multifocal perspective. Computer Graphics Forum 38, 3 (2019), 861–879.
  8. What did my AI learn? How data scientists make sense of model behavior. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2023), vol. 30, ACM, pp. 1–27.
  9. Intellingo: An intelligible translation environment. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2018), ACM, pp. 1–13.
  10. “Hello AI”: Uncovering the onboarding needs of medical practitioners for human-AI collaborative decision-making. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2019), vol. 3, ACM, pp. 1–24.
  11. Hybrid intelligence. Business & Information Systems Engineering 61, 5 (2019), 637–643.
  12. Dudley J. J., Kristensson P. O.: A review of user interface design for interactive machine learning. ACM Trans. on Interactive Intelligent Systems 8, 2 (2018), 1–37.
  13. El-Assady M., Moruzzi C.: Which biases and reasoning pitfalls do explanations trigger? Decomposing communication processes in human-AI interaction. IEEE Computer Graphics and Applications 42, 6 (2022), 11–23.
  14. A grammar-based approach for applying visualization taxonomies to interaction logs. Computer Graphics Forum 41, 3 (2022), 489–500.
  15. Gotz D., Zhou M. X.: Characterizing users’ visual analytic activity for insight provenance. Proc. of IEEE Conf. on Visual Analytics Science and Technology (2008), 123–130.
  16. VATLD: A visual analytics system to assess, understand and improve traffic light detection. IEEE Trans. on Visualization and Computer Graphics 27, 2 (2021), 261–271.
  17. Gamut: A design probe to understand how data scientists understand machine learning models. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2019), ACM, pp. 1–13.
  18. Horvitz E.: Principles of mixed-initiative user interfaces. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (1999), ACM, pp. 159–166.
  19. Artificial intelligence and machine learning. Electronic Markets 32, 4 (2022), 2235–2244.
  20. Towards a science of human-AI decision making: An overview of design space in empirical human-subject studies. Proc. of the ACM Conf. on Fairness, Accountability, and Transparency (2023), 1369–1385.
  21. The state-of-the-art in predictive visual analytics. Computer Graphics Forum 36, 3 (2017), 539–562.
  22. A human-AI collaborative approach for clinical decision making on rehabilitation assessment. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2021), ACM, pp. 1–14.
  23. Bridging from goals to tasks with design study analysis reports. IEEE Trans. on Visualization and Computer Graphics 24 (2018), 435–445.
  24. Where are we so far? Understanding data storytelling tools from the perspective of human-AI collaboration. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2024), ACM.
  25. A review on human–AI interaction in machine learning and insights for medical applications. Int. Journal of Environmental Research and Public Health 18, 4 (2021), 2121.
  26. Human-computer collaboration for visual analytics: An agent-based framework. Computer Graphics Forum (2023).
  27. RLHF-blender: A configurable interactive interface for learning from diverse human feedback. Interactive Learning with Implicit Human Feedback Workshop at ICML (2023).
  28. RuleMatrix: Visualizing and understanding classifiers with rules. IEEE Trans. on Visualization and Computer Graphics 25, 1 (2019), 342–352.
  29. Mou Y., Xu K.: The media inequality: Comparing the initial human-human and human-AI social interactions. Computers in Human Behavior 72 (2017), 432–440.
  30. ProtoSteer: Steering deep sequence model with prototypes. IEEE Trans. on Visualization and Computer Graphics 26, 1 (2020), 238–248.
  31. Believe it or not: Designing a human-AI partnership for mixed-initiative fact-checking. In Proc. of the Annual ACM Symposium on User Interface Software and Technology (2018), ACM, pp. 189–199.
  32. A typology of guidance tasks in mixed-initiative visual analytics environments. Computer Graphics Forum 41, 3 (2022), 465–476.
  33. A methodology for task-driven guidance design. EuroVis Workshop on Visual Analytics (2023), 6 pages.
  34. Experimental evidence of effective human–AI collaboration in medical decision-making. Scientific Reports 12, 1 (2022), 14952.
  35. Towards effective human-AI decision-making: The role of human learning in appropriate reliance on AI advice. Int. Conf. on Information Systems (2023).
  36. Lotse: A practical framework for guidance in visual analytics. IEEE Trans. on Visualization and Computer Graphics (2022), 1124–1134.
  37. A survey of human-centered evaluations in human-centered machine learning. Computer Graphics Forum 40, 3 (2021), 543–568.
  38. Learning and teaching in co-adaptive guidance for mixed-initiative visual analytics. In EuroVis Workshop on Visual Analytics (Goslar, 2020), Turkay C., Vrotsou K., (Eds.), Eurographics Association, pp. 61–65.
  39. Co-adaptive visual data analysis and guidance processes. Computers & Graphics 100 (2021), 93–105.
  40. Cobots in knowledge work. Journal of Business Research 125 (2021), 135–142.
  41. Learning contextualized user preferences for co-adaptive guidance in mixed-initiative topic model refinement. Computer Graphics Forum 40, 3 (2021), 215–226.
  42. Knowledge generation model for visual analytics. IEEE Trans. on Visualization and Computer Graphics 20, 12 (2014), 1604–1613.
  43. Sundar S. S.: Rise of machine agency: A framework for studying the psychology of human-AI interaction (HAII). Journal of Computer-Mediated Communication 25, 1 (2020), 74–88.
  44. From human-human collaboration to human-AI collaboration: Designing AI systems that can work together with people. In Extended Abstracts in the Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2020), ACM, pp. 1–6.
  45. A survey on ML4VIS: Applying machine learning advances to data visualization. IEEE Trans. on Visualization and Computer Graphics 28, 12 (2022), 5134–5153.
  46. Podium: Ranking data using mixed-initiative visual analytics. IEEE Trans. on Visualization and Computer Graphics 24, 1 (2018), 288–297.
  47. The What-If tool: Interactive probing of machine learning models. IEEE Trans. on Visualization and Computer Graphics (2019), 1–1.
  48. Xu W., Gao Z.: Applying human-centered AI in developing effective human-AI teaming: A perspective of human-AI joint cognitive systems. Interactions 31, 1 (2024).
  49. Re-examining whether, why, and how human-AI interaction is uniquely difficult to design. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems (2020), ACM, pp. 1–13.

Summary

  • The paper introduces a unified framework that deconstructs human-AI collaboration into agency, interaction, and adaptation dimensions.
  • It employs a systematic literature review and expert interviews to refine model dimensions and inform design space analysis.
  • Case studies demonstrate the framework's utility in comparative analysis, design reasoning, and gap identification for future research.

Introduction

The paper "Deconstructing Human-AI Collaboration: Agency, Interaction, and Adaptation" (2404.12056) addresses the increasing complexity and ambiguity in the characterization of human-AI collaborative systems. As full automation by AI remains unattainable in most real-world scenarios, the focus has shifted to hybrid systems that leverage the complementary strengths of humans and AI agents. The authors identify a lack of systematic frameworks for comparing and classifying such systems, noting that existing models and terminology are fragmented and insufficiently expressive. To address this, the paper introduces a unified conceptual model structured around three high-level aspects: agency, interaction, and adaptation. This model is developed through a systematic literature review and iterative refinement via expert interviews, culminating in a design space that enables comprehensive analysis and comparison of human-AI collaborative systems.

Methodology

The conceptual model is constructed through a two-phase process:

  1. Systematic Literature Review: The authors survey recent literature using targeted keywords and manual curation from major conferences (CHI, IUI, EuroVis, VIS), assembling a corpus of works relevant to human-AI collaboration. From this corpus, they extract and aggregate system properties, collapsing redundancies and focusing on dimensions that describe system characteristics rather than user motivations.
  2. Interview Study: Semi-structured interviews with nine domain experts (PhDs and postdocs in visual analytics, explainable AI, NLP, RL, GNNs, ML) are conducted to refine the initial set of dimensions. The interview process is staged to elicit unbiased ideation, encourage thinking in terms of dimensions, and compare participant suggestions with the proposed model. Iterative feedback leads to convergence on a compact set of dimensions grouped into three categories.

The methodology emphasizes balancing expressiveness and generalizability, ensuring the model is sufficiently granular to distinguish between systems while remaining broadly applicable.
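The consolidation step of the literature review can be illustrated with a minimal sketch. The paper identifiers and property labels below are invented for illustration, not the authors' actual coding scheme:

```python
from collections import Counter

# Hypothetical property annotations extracted from a surveyed corpus;
# papers and labels are illustrative placeholders.
corpus_annotations = {
    "paper_a": ["mixed agency", "explicit feedback", "directing guidance"],
    "paper_b": ["human agency", "explicit feedback"],
    "paper_c": ["mixed agency", "implicit feedback", "directing guidance"],
}

# Aggregate properties across papers and collapse rare ones, mirroring
# the redundancy-collapsing consolidation described above.
counts = Counter(p for props in corpus_annotations.values() for p in props)
consolidated = [prop for prop, n in counts.most_common() if n >= 2]

print(consolidated)  # properties supported by at least two papers
```

The frequency threshold stands in for the authors' manual judgment of which extracted properties generalize across systems.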

Conceptual Model: Design Space

The proposed design space is organized into three high-level categories, each decomposed into specific dimensions:

Agency

  • Agency Distribution: Specifies which agents (human, AI, mixed) possess decision-making control.
  • Agency Allocation: Describes whether agency is pre-determined (static assignment) or negotiated (dynamic, opportunistic assignment).

The model recognizes the trade-off between control and automation, highlighting the need for further research into unbiased negotiation mechanisms and temporal dynamics of agency.
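The difference between pre-determined and negotiated allocation can be sketched in a few lines; the task names, agents, and confidence threshold are illustrative assumptions, not mechanisms from the paper:

```python
from dataclasses import dataclass

@dataclass
class Proposal:
    agent: str         # "human" or "ai"
    confidence: float  # self-reported confidence in [0, 1]

def allocate_static(task: str) -> str:
    """Pre-determined allocation: a fixed assignment table decides."""
    assignments = {"label_data": "human", "rank_candidates": "ai"}
    return assignments.get(task, "human")  # default to human control

def allocate_negotiated(proposals: list, threshold: float = 0.7) -> str:
    """Negotiated allocation: agency goes to the most confident agent,
    falling back to the human below a trust threshold."""
    best = max(proposals, key=lambda p: p.confidence)
    return best.agent if best.confidence >= threshold else "human"

print(allocate_static("rank_candidates"))  # ai
print(allocate_negotiated([Proposal("ai", 0.9), Proposal("human", 0.5)]))  # ai
print(allocate_negotiated([Proposal("ai", 0.6), Proposal("human", 0.5)]))  # human
```

The fallback-to-human default in the negotiated variant hints at the unbiased-negotiation question the model leaves open.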

Interaction

  • Interaction Intent: Captures agent motives (receiving guidance, requesting information, exploration, providing feedback).
  • Guidance Degree: Quantifies the extent of guidance (orienting, directing, prescribing).
  • Guidance Focus: Specifies the elements targeted by guidance (system, interface, interaction, data, task-specific).
  • Feedback Type: Differentiates between explicit, implicit, or both forms of feedback.

The interaction category is modeled symmetrically for human and AI agents, avoiding anthropocentric bias and enabling per-agent analysis. The dimensions are informed by prior work in visual analytics, communication models, and guidance frameworks.

Adaptation

  • Adapting Agents: Identifies which agents (human, AI, both) are capable of learning during collaboration.
  • Adaptation Method: Distinguishes between adaptation for task performance improvement and communication enhancement.
  • Information Learned: Details the types of information acquired (domain, data, task, agent goals, agent preferences).

The adaptation category acknowledges the challenge of modeling human learning and the intertwined nature of information acquisition in collaborative settings.
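Taken together, the three categories can be read as a compact data model. The sketch below encodes a system profile along a subset of the dimensions; the enum values follow the text above, but the encoding itself (class names, field choices) is an illustrative assumption, not the paper's notation:

```python
from dataclasses import dataclass
from enum import Enum

class AgencyDistribution(Enum):
    HUMAN = "human"
    AI = "ai"
    MIXED = "mixed"

class AgencyAllocation(Enum):
    PRE_DETERMINED = "pre-determined"
    NEGOTIATED = "negotiated"

class GuidanceDegree(Enum):
    ORIENTING = "orienting"
    DIRECTING = "directing"
    PRESCRIBING = "prescribing"

class FeedbackType(Enum):
    EXPLICIT = "explicit"
    IMPLICIT = "implicit"
    BOTH = "both"

@dataclass(frozen=True)
class SystemProfile:
    name: str
    distribution: AgencyDistribution
    allocation: AgencyAllocation
    guidance: GuidanceDegree
    feedback: FeedbackType
    adapting_agents: frozenset  # subset of {"human", "ai"}

# The co-adaptive topic model refinement system from the case studies,
# encoded along these dimensions.
topic_refinement = SystemProfile(
    name="Co-Adaptive Topic Model Refinement",
    distribution=AgencyDistribution.MIXED,
    allocation=AgencyAllocation.NEGOTIATED,
    guidance=GuidanceDegree.DIRECTING,
    feedback=FeedbackType.BOTH,
    adapting_agents=frozenset({"human", "ai"}),
)
print(topic_refinement.allocation.value)  # negotiated
```

Making the profile a frozen dataclass reflects the paper's static assignment of properties; the temporal dynamics the authors flag as future work would require these fields to vary over the course of a collaboration.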

Case Studies

Three representative systems are analyzed using the proposed design space:

  1. Co-Adaptive Guidance for Topic Model Refinement: A mixed-initiative system where both human and multiple AI agents iteratively refine topic models. Agency is negotiated, interaction involves directing guidance and explicit/implicit feedback, and both agents adapt by learning preferences and improving communication.
  2. Podium: A visual analytics tool for ranking multivariate data. Agency is mixed but pre-determined, interaction includes directing/prescribing guidance and explicit feedback, and adaptation occurs for both agents, primarily through learning data properties and user preferences.
  3. ProtoSteer: An interface for steering deep sequence models via prototype modification. Agency is human-centered, interaction involves directing guidance and explicit feedback, and adaptation is present for both agents, with the AI learning domain/task information and the human improving communication.

These case studies demonstrate the model's capacity to encode diverse collaborative systems and highlight the multidimensional nature of real-world human-AI collaboration.
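The comparative use of the design space can be sketched by tabulating the three case studies and splitting dimensions into shared and distinguishing ones; the values are condensed from the descriptions above, while the dimension keys and helper function are this sketch's own naming, not the paper's:

```python
# Case-study encodings condensed from the descriptions above.
systems = {
    "Topic Model Refinement": {"agency": "negotiated", "guidance": "directing",
                               "feedback": "explicit+implicit", "adapting": "both"},
    "Podium":                 {"agency": "pre-determined", "guidance": "directing/prescribing",
                               "feedback": "explicit", "adapting": "both"},
    "ProtoSteer":             {"agency": "human-centered", "guidance": "directing",
                               "feedback": "explicit", "adapting": "both"},
}

def shared_and_distinct(profiles: dict):
    """Split dimensions into those all systems share and those that differ,
    supporting structured comparison and gap identification."""
    dims = next(iter(profiles.values())).keys()
    shared, distinct = {}, []
    for d in dims:
        values = {p[d] for p in profiles.values()}
        if len(values) == 1:
            shared[d] = values.pop()
        else:
            distinct.append(d)
    return shared, distinct

shared, distinct = shared_and_distinct(systems)
print(shared)    # all three systems adapt both agents
print(distinct)  # dimensions along which the systems differ
```

Running this shows that adaptation of both agents is common to all three systems, while agency, guidance, and feedback vary, which is exactly the kind of cross-system pattern the design space is meant to surface.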

Implications and Future Directions

Practical Implications

The design space provides a systematic framework for:

  • Comparative Analysis: Enables structured comparison of collaborative systems across domains.
  • Design Reasoning: Assists practitioners in making informed design choices by clarifying trade-offs in agency, interaction, and adaptation.
  • Gap Identification: Reveals latent dimensions and underexplored areas, guiding future research and system development.

Theoretical Implications

  • Taxonomy of Agency: The model exposes the need for a more nuanced taxonomy of agency, including mechanisms for negotiation and dynamic allocation.
  • Temporal Dynamics: Current static assignment of properties limits analysis of systems with evolving collaboration modes; modeling temporal aspects is a critical research opportunity.
  • Expressivity vs. Generalizability: The categorical simplification of continuous dimensions facilitates systematic analysis but may obscure fine-grained distinctions; future work could explore more expressive frameworks with rigorous definitions.

Limitations

  • Human Learning Modeling: The symmetric treatment of human and AI learning is an acknowledged oversimplification; cognitive modeling of human adaptation remains an open challenge.
  • Granularity: The choice of categorical values for dimensions may limit expressivity; more granular options (e.g., human-in-the-loop, AI-in-the-loop) could enhance descriptive power.
  • Corpus Coverage: The current model is validated on three systems; comprehensive encoding of a larger corpus is necessary for broader generalization.

Speculation on Future Developments

  • Complex System Engineering: Platforms that abstract underlying complexity could facilitate experimentation with variations in design space dimensions.
  • Dynamic Collaboration Models: Integration of temporal and context-dependent properties will enable richer modeling of adaptive, evolving human-AI teams.
  • Interdisciplinary Research: Advances in cognitive science, HCI, and machine learning will inform more sophisticated models of human learning and agent interaction.

Conclusion

The paper presents a unified conceptual model for human-AI collaboration, structured around agency, interaction, and adaptation. Developed through literature synthesis and expert feedback, the design space enables systematic analysis and comparison of collaborative systems. Application to representative case studies demonstrates its utility and highlights areas for future research, including agency taxonomy, temporal modeling, and cognitive adaptation. The framework advances the capacity of the research community to reason about, design, and analyze human-AI collaborative systems in a rigorous and comprehensive manner.
