- The paper presents six Human-Centered Design guidelines for enhancing Large Language Model user experience and evaluates an experimental method using a single document knowledge base for software learning.
- An experiment in ChatGPT-4 using a single-document knowledge base tested two templates for software-learning modes, finding Template 2 superior for seamless mode switching and overall performance.
- The study highlights the challenge of balancing LLM reinterpretation with preserving original content, finding the 'Instructions' area provides the highest control weight in GPT configurations.
The paper provides a comprehensive exploration of integrating Human-Centered Design (HCD) principles into Large Language Model (LLM) interactions within Human-Computer Interaction (HCI), specifically targeting improved user experience in a software-learning context. The work is structured in two main parts: a theoretical study on the transformative impact and inherent challenges of LLMs in enhancing user experience, and an experimental investigation employing a single-document knowledge base to configure and control user interactions. The summary below delineates key aspects of the study:
Theoretical Framework and HCD Guidelines
- Impact of LLMs on HCI and UX
The paper discusses how LLMs have revolutionized personalized, context-aware interactions by leveraging advanced Natural Language Processing (NLP) techniques. It critically examines challenges—such as ethical concerns, specification gaming, and hallucinated outputs—that complicate the direct application of LLMs in user-centered designs.
- Derivation of HCD Guidelines
From this theoretical analysis, the authors derive six guidelines:
- High efficiency (fast response times),
- Feedback consideration (systematic collection and periodic update of user feedback),
- Support for diverse user needs (balancing generality with personalization),
- Emotional consideration (humanizing machine responses),
- High simplicity and reliability (ease of input/output interactions, including multimodality such as text, images, voice),
- Authenticity and accuracy (preserving the integrity of source information).
Experimental Methodology
- Document as a Knowledge Base
The paper reports on a preliminary experiment that uses a single document (in .docx format) as the principal knowledge resource, uploaded into the GPTs Editor of ChatGPT-4. The document is partitioned into a control part (Part 0) and a detailed knowledge part (Part 1). The design supports multiple interaction modes to meet software learning needs:
- Mode 1: Step-by-step learning with original content.
- Mode 2: Step-by-step learning with NLP-based reinterpretation.
- Mode 3: Issue-driven solutions using original content.
- Mode 4: Issue-driven solutions with NLP reinterpretation.
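The four modes form a two-axis design: navigation style (step-by-step vs. issue-driven) crossed with content fidelity (original text vs. NLP reinterpretation). A minimal sketch of that routing logic, with all names hypothetical and not taken from the paper, might look like:

```python
# Hypothetical sketch: the four interaction modes as a 2x2 design,
# navigation style crossed with content fidelity.

MODES = {
    1: ("step_by_step", "original"),       # Mode 1
    2: ("step_by_step", "reinterpreted"),  # Mode 2
    3: ("issue_driven", "original"),       # Mode 3
    4: ("issue_driven", "reinterpreted"),  # Mode 4
}

def select_mode(navigation: str, fidelity: str) -> int:
    """Map a (navigation, fidelity) pair back to its mode number."""
    for mode, pair in MODES.items():
        if pair == (navigation, fidelity):
            return mode
    raise ValueError(f"unknown combination: {navigation!r}, {fidelity!r}")
```

This framing makes explicit that switching fidelity (e.g., Mode 1 to Mode 2) changes only how content is rendered, not how the learner navigates it.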
- Virtual Experimental Environment and Variable Controls
The experiment is conducted in a controlled virtual environment with web browsing disabled, so that responses draw only on the uploaded knowledge base. The authors deploy several trials to assess:
- Text Printing Methods: Testing different forms of the “print()” function to determine how well original textual content can be maintained versus reinterpreted.
- Image Display Techniques: Comparing methods for code-driven image extraction from an uploaded zip archive, aiming to achieve correct sequential display of multimodal outputs.
- Jump Action and Interaction Modes: Evaluating how interactive requests and conditional statements, distributed across segment titles, interactive prompts, and a centralized “Instructions” area, affect the system’s ability to navigate through the document.
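The image-display trial relies on code-driven extraction of images from an uploaded zip archive in sequential order. A self-contained sketch of that extraction step, using only Python's standard library (the archive layout and function name are assumptions for illustration), could be:

```python
import zipfile
from pathlib import Path

def extract_images_in_order(archive_path: str, out_dir: str) -> list[Path]:
    """Extract image files from a zip archive and return their paths
    sorted by name, so they can be displayed in document order."""
    image_suffixes = {".png", ".jpg", ".jpeg", ".gif"}
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(archive_path) as zf:
        # Sorting member names ensures e.g. step01.png precedes step02.png.
        for name in sorted(zf.namelist()):
            if Path(name).suffix.lower() in image_suffixes:
                zf.extract(name, out)
                extracted.append(out / name)
    return extracted
```

Sorting by member name is what guarantees the "correct sequential display" the trial targets; an unordered `namelist()` traversal would surface images in archive insertion order instead.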
Results and Comparative Analysis
- Template Optimization
Two templates are developed to integrate various approaches:
- Template 1: Relies on a Control Center for handling user queries and mode navigation, with a design that mandates additional user input to switch interaction modes. While offering high accuracy in certain modes (rated as “perfect” in some trials), it demonstrates limitations such as forced changes that disrupt the original content presentation.
- Template 2: Consolidates all conditional statements into the “Instructions” area and enables seamless switching between modes without requiring extra input. Template 2 generally achieves a “good” to “excellent” performance across multiple interaction modes and exhibits fewer disruptions, despite occasional issues such as missed images on the first entry. Tables in the paper quantify these results, showing categorization into “perfect,” “excellent,” “good,” “fair,” and “bad” across five repeated trials.
Discussion and Concluding Remarks
- Integration Challenges
The study highlights that merging diverse HCD requirements in a single knowledge base is a nontrivial task. The inherent NLP-driven nature of LLMs, which tends to modify or reinterpret given text, presents a challenge to preserving original instructional content. The authors note that enforcing strict adherence to original text often requires higher-level control mechanisms within the GPT configuration.
- Control Hierarchies and Workflow Management
A key insight is that the “Instructions” area exerts the highest control weight over the interaction. When multiple guiding elements (section titles, interactive prompts, conditional statements) are present, the system prioritizes explicit directives from this area, thereby streamlining or complicating the user’s progression depending on the design consistency.
- Path Forward and Future Work
The authors identify several open questions for future research:
- The potential interference of pre-existing LLM knowledge with newly input knowledge.
- The influence of context length on mode determination.
- Alternative document structuring methods that could further enhance UX.
- Optimizing for speed alongside accuracy.
In summary, the work offers detailed guidelines and empirical data on configuring GPTs for a user-centric software learning environment. It provides not only a methodological template for integrating HCD into LLM configurations but also highlights the technical challenges and trade-offs involved in balancing creative response generation with controlled, content-preserving interactions.