
In-IDE Human-AI Experience in the Era of Large Language Models: A Literature Review

Published 19 Jan 2024 in cs.SE and cs.HC (arXiv:2401.10739v2)

Abstract: Integrated Development Environments (IDEs) have become central to modern software development, especially with the integration of AI to enhance programming efficiency and decision-making. The study of in-IDE Human-AI Experience is critical in understanding how these AI tools are transforming the software development process, impacting programmer productivity, and influencing code quality. We conducted a literature review to study the current state of in-IDE Human-AI Experience research, bridging a gap in understanding the nuanced interactions between programmers and AI assistants within IDEs. By analyzing 36 selected papers, our study illustrates three primary research branches: Design, Impact, and Quality of Interaction. The trends, challenges, and opportunities identified in this paper emphasize the evolving landscape of software development and inform future directions for research and development in this dynamic field. Specifically, we invite the community to investigate three aspects of these interactions: designing task-specific user interfaces, building trust, and improving readability.


Summary

  • The paper presents a systematic literature review of 36 studies from 2020 to 2024, identifying three research branches of in-IDE Human-AI interaction: the design of AI-enabled tools, their impact on programmers' workflow, and the quality of AI assistants.
  • It outlines design principles for AI-enabled tools, emphasizing user control, adaptability, and clarity in code suggestions.
  • It reveals the impact of AI assistance on productivity and code security, while addressing challenges related to model errors and trust.

In-IDE Human-AI Experience in the Era of LLMs: A Literature Review

This paper presents a literature review of Human-AI eXperience (HAX) within Integrated Development Environments (IDEs) in light of recent advancements in LLMs. The review analyzes 36 papers published between 2020 and 2024, identifying three primary research areas: Design, Impact, and Quality of Interaction. The study synthesizes current research trends, challenges, and opportunities, offering insights for future research and development in this evolving field.

Methodological Approach to Literature Review

The study employed a systematic literature review methodology, selecting relevant papers from ACM Digital Library, DBLP, IEEE Digital Library, and ArXiv. A targeted search string was used to identify papers focusing on the intersection of IDEs and AI assistance. Inclusion and exclusion criteria were applied to ensure the relevance and recency of the selected studies, resulting in a final set of 36 papers. The extracted information included publication year, authorship, study goals, research questions, methodology, and key findings.

Key Research Areas in In-IDE HAX

Design of AI-Enabled Tools

This research area focuses on user-interface design considerations when integrating AI technologies into programming environments. The review identifies design principles for AI assistance, emphasizing clear communication, user control, adaptability, and user-friendly interaction:

  • Generative AI tools should communicate their probabilistic nature, facilitate user annotation, accommodate imperfection through feedback, and implement user-driven controls.
  • Code assistants should act as adaptable ghostwriters, offering context control, balancing politeness and promotion, integrating search and documentation, and incorporating means of verification.
  • Autocompletion features should provide glanceable suggestions, juxtaposition for clarity, simplicity through familiarity, sufficient visibility for validation, and snoozability to prevent interruptions.

Several papers explore the potential of AI as a pair programmer, highlighting both its acceptance and the challenges of creating user-friendly interfaces for novice programmers.
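Two of the autocompletion principles above, glanceable timing and snoozability, can be sketched as a small gate that an editor plugin might consult before rendering an inline suggestion. The class name, the debounce interval, and the snooze duration are illustrative assumptions, not values taken from the reviewed papers.

```python
import time
from dataclasses import dataclass, field


@dataclass
class SuggestionGate:
    """Decides whether an inline AI suggestion should be shown right now.

    Illustrates two design principles from the review: debouncing (wait
    for a typing pause so suggestions stay glanceable rather than
    flickering under the cursor) and snoozability (let the user mute
    suggestions for a while). Thresholds are illustrative defaults.
    """
    debounce_s: float = 0.75                 # required typing pause
    snoozed_until: float = 0.0               # monotonic time until muted
    last_keystroke: float = field(default_factory=time.monotonic)

    def on_keystroke(self) -> None:
        """Called by the editor on every keystroke."""
        self.last_keystroke = time.monotonic()

    def snooze(self, minutes: float = 15.0) -> None:
        """User-driven control: mute suggestions for a while."""
        self.snoozed_until = time.monotonic() + minutes * 60

    def should_show(self) -> bool:
        now = time.monotonic()
        if now < self.snoozed_until:         # user asked for quiet
            return False
        return now - self.last_keystroke >= self.debounce_s
```

An editor integration would call `on_keystroke()` from its change handler and check `should_show()` before requesting or rendering a completion, so the model is also queried less often while the user is actively typing.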

Impact of HAX on Programmers' Workflow

This area investigates how AI assistance reshapes the programming workflow, focusing on usability issues, effects on productivity, and user trust. The research indicates that in-IDE Human-AI Interaction significantly alters the traditional programming workflow, introducing dedicated time for interacting with AI and processing its outputs. While AI tools can increase productivity, they may also lead to trade-offs in code quality, as developers sometimes struggle to align AI-generated outputs with their requirements and expectations. The context in which AI tools are used, the quality of suggestions, and compatibility issues play crucial roles in shaping the overall effectiveness and user perception. Studies on novices suggest that AI tools can positively influence programming education but require attention to challenges such as over-reliance.

Quality of AI Assistants

This research area examines the performance of AI assistants, focusing on correctness, understandability, security, and the ability to solve algorithmic problems. The review reveals that the effectiveness of an AI assistant depends not only on its user interface but also on the quality of the model's output. Studies show that while AI assistants can provide relevant solutions and suggestions, they might also be erroneous and require user correction. Regarding code comprehensibility and complexity, AI assistants generally produce understandable code that may be less complex than human-written code. However, security assessments reveal potential vulnerabilities, highlighting the importance of fine-tuning foundational models to enhance overall interaction quality.
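As a toy illustration of surface-level readability assessment, the sketch below extracts a few simple metrics from a code snippet. Published readability models, such as the comprehensive model the review draws on, combine many structural and textual features with learned weights; the feature set and function name here are illustrative only.

```python
def readability_features(code: str) -> dict:
    """Extract a few toy surface-level readability features.

    Short lines and a healthy comment ratio are among the textual
    signals that learned readability models weigh; this sketch only
    computes raw features and makes no claim about their weights.
    """
    lines = [ln for ln in code.splitlines() if ln.strip()]
    if not lines:
        return {"avg_line_len": 0.0, "max_line_len": 0, "comment_ratio": 0.0}
    comment_lines = sum(1 for ln in lines if ln.lstrip().startswith("#"))
    return {
        "avg_line_len": sum(len(ln) for ln in lines) / len(lines),
        "max_line_len": max(len(ln) for ln in lines),
        "comment_ratio": comment_lines / len(lines),
    }
```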

Future Research Directions

The authors suggest focusing on three aspects of in-IDE HAX: task-specific user interfaces, trust, and readability. They argue that chat-based interaction may not always be the most effective approach and that different tasks may call for alternative interaction methods. They also suggest enhancing model reactivity by transforming predictable actions into automatic suggestions. Addressing developers' attitudes toward AI may further shape how they interact with the technology. Highlighting the tokens that most affect the output, approximating the model's uncertainty, and providing clear, transparent context could foster trust between the user and the AI. In terms of code quality, the authors propose readability as a promising target for aligning code models.
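The idea of approximating model uncertainty to inform the interface can be illustrated with a small sketch. Assuming the completion backend exposes per-token log-probabilities, as many LLM APIs do, a geometric-mean confidence score can decide whether a suggestion deserves a visual uncertainty cue. The function names and the 0.5 threshold are illustrative assumptions, not values from the paper.

```python
import math


def sequence_confidence(token_logprobs: list[float]) -> float:
    """Geometric-mean probability of a generated sequence.

    Averages per-token log-probabilities and exponentiates. Values near
    1.0 mean the model was confident at every step; low values flag
    suggestions that might warrant an uncertainty cue in the editor.
    """
    if not token_logprobs:
        return 0.0
    return math.exp(sum(token_logprobs) / len(token_logprobs))


def should_highlight_uncertain(token_logprobs: list[float],
                               threshold: float = 0.5) -> bool:
    """True if the suggestion should carry a visual uncertainty marker.

    The threshold is an illustrative choice, not one from the review.
    """
    return sequence_confidence(token_logprobs) < threshold
```

A richer treatment might surface per-token scores directly, dimming or underlining only the low-confidence spans so the user's attention goes to the tokens most likely to need correction.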

Threats to Validity

The authors acknowledge potential threats to the validity of their findings, including sampling bias, temporal bias, source reliability, and interpretation bias. They address these threats by providing a detailed search protocol, acknowledging the limitations of including non-peer-reviewed papers, and emphasizing transparency in the analysis process.

Conclusion

The literature review provides a comprehensive overview of in-IDE Human-AI Experience, highlighting key research areas, design principles, and future research directions. The study emphasizes the need for task-specific user interfaces, building trust in AI assistants, and improving code readability to enhance the overall developer experience. The identified research areas, curated dataset, and proposed directions contribute to the collective understanding of the evolving dynamics between humans and AI within IDEs.
