Expert-in-the-Loop Models
- Expert-in-the-loop models are AI systems that incorporate human and artificial experts to handle uncertainty in classification tasks.
- They utilize dynamic routing with OOD detection (e.g., ODIN, Mahalanobis) to segregate in-distribution data from ambiguous cases.
- Empirical evaluations show high accuracy and reduced human intervention, though challenges remain in scalability and integrating non-ideal expert feedback.
Expert-in-the-loop models are a class of AI and ML systems that systematically integrate domain experts into the model development and decision-making pipeline. These models generalize traditional human-in-the-loop (HITL) frameworks by allowing AI systems, often equipped with out-of-distribution (OOD) detection and dynamic routing, to collaborate with both human experts and artificial experts (learned specialist models) to balance efficiency and accuracy. While human expertise remains critical for resolving instances that cannot be reliably classified or processed by automated systems, expert-in-the-loop approaches aim to transfer the handling of certain unknowns to trainable AI components, thereby reducing repetitive human workload and enhancing overall system performance (Jakubik et al., 2023).
1. System Architectures: Hybridization of Human and Artificial Experts
Contemporary expert-in-the-loop systems, such as the AI-in-the-Loop (AIITL) paradigm, extend classic HITL designs with a modular bank of artificial experts. Each incoming instance is processed as follows:
- General Model: A base classifier f trained on the known classes, equipped with an OOD detector g.
- Expert Consultancy Decision: If g indicates that an input x is in-distribution, the prediction f(x) is accepted. Otherwise, x is routed to Expert Selection.
- Expert Selection: A suite of artificial experts e_1, …, e_K, each managing a disjoint class set C_k with its own OOD detector g_k.
- If exactly one artificial expert claims x as in-distribution, its output is used.
- If no artificial expert claims x, or multiple claimants exist, the instance is escalated to the human expert.
- Human Expert: Assigns the correct label or, for novel classes, instantiates a new artificial expert trained on these previously unseen categories.
This architecture ensures the gradual absorption of unknown data by specialist models, thus systematically offloading routine classification from humans as the knowledge base of the system evolves (Jakubik et al., 2023).
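The routing steps above can be sketched in a few lines of Python. The `Expert` type, the names, and the in-distribution tests below are illustrative placeholders, not the paper's implementation:

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Illustrative sketch of the AIITL routing loop; all names are assumptions.
@dataclass
class Expert:
    name: str
    predict: Callable[[object], str]              # class prediction
    is_in_distribution: Callable[[object], bool]  # the expert's OOD test

def route(x, general: Expert, artificial: List[Expert]) -> Tuple[str, Optional[str]]:
    """Return (handler, label); label is None when escalating to the human."""
    # 1. The general model handles inputs its OOD detector accepts.
    if general.is_in_distribution(x):
        return general.name, general.predict(x)
    # 2. Otherwise poll the artificial-expert bank for claimants.
    claimants = [e for e in artificial if e.is_in_distribution(x)]
    # 3. Exactly one claimant: use its prediction.
    if len(claimants) == 1:
        return claimants[0].name, claimants[0].predict(x)
    # 4. Zero or multiple claimants: escalate to the human expert.
    return "human", None
```

Disjointness of the experts' class sets is assumed by construction here; overlapping OOD claims still fall through to the human, matching the escalation rule above.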
2. Algorithms and Expert Engagement Strategies
Expert-in-the-loop models rely on a set of algorithmic primitives for effective partitioning of the decision space between machine components and human specialists:
- OOD Detection and Deferral: An out-of-distribution score s(x) quantifies model self-confidence. Popular algorithms include:
- ODIN: Scores inputs by the maximum temperature-scaled softmax probability, computed after adding a small adversarial perturbation to the input.
- Mahalanobis-based: Computes the minimal Mahalanobis distance between the feature embedding of x and the per-class embedding means.
- Thresholds and a mixture-of-experts gating network orchestrate assignment and escalation decisions.
- Artificial Expert Training: Each artificial expert e_k is incrementally trained on its class-specific dataset D_k via cross-entropy minimization,
  L(θ_k) = −Σ_{(x,y)∈D_k} log p_{θ_k}(y | x),
  with parameters updated by stochastic optimization and an activation threshold (e.g., 95% validation accuracy) gating expert participation.
- Gating and Allocation: The deferral mechanism can be formalized as a routing rule: accept the general model's prediction when its OOD detector flags the input as in-distribution; accept an artificial expert's output when exactly one expert's detector claims the input; otherwise defer to the human expert.
This stratification of expertise and training enables low-latency, high-confidence decisions for in-distribution data, iterative expansion of machine-specialized domains, and dynamic expert engagement for ambiguous or novel instances (Jakubik et al., 2023).
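As an illustration of the Mahalanobis-based deferral path, here is a simplified sketch using a tied (shared) covariance estimate over feature embeddings; the function names, the ridge term, and the threshold convention are assumptions, not the paper's code:

```python
import numpy as np

def fit_class_stats(feats: np.ndarray, labels: np.ndarray):
    """Per-class embedding means and the inverse of a shared covariance."""
    classes = np.unique(labels)
    means = {c: feats[labels == c].mean(axis=0) for c in classes}
    # Pool class-centered features for a tied covariance estimate;
    # a small ridge keeps the matrix invertible.
    centered = np.vstack([feats[labels == c] - means[c] for c in classes])
    cov = np.cov(centered, rowvar=False) + 1e-6 * np.eye(feats.shape[1])
    return means, np.linalg.inv(cov)

def mahalanobis_score(x: np.ndarray, means, cov_inv) -> float:
    """Minimal squared Mahalanobis distance to any class mean.

    Lower values indicate in-distribution inputs.
    """
    return min(float((x - m) @ cov_inv @ (x - m)) for m in means.values())

def defer(x: np.ndarray, means, cov_inv, threshold: float) -> bool:
    """True -> treat x as OOD and route it onward; False -> accept."""
    return mahalanobis_score(x, means, cov_inv) > threshold
```

In a full AIITL system the same scoring would back both the general model's detector g and each expert's detector g_k, each with its own threshold.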
3. Performance Metrics and Experimental Protocols
The assessment of expert-in-the-loop models relies on multiple quantitative measures:
- Accuracy A: Fraction of correct predictions on the test set.
- Human Effort H: Proportion of instances routed to human review.
- Combined Utility U: Accuracy discounted by human effort, e.g., U = A − β·H, where β is the human cost parameter.
Empirical evaluations use benchmark setups such as CIFAR-10 (known domain), with SVHN, MNIST, Fashion-MNIST as incrementally revealed unknowns. The general model (e.g., Wide-ResNet-28-10) is trained on known classes and extended as new artificial experts are activated upon reaching accuracy thresholds. Iterative testing tracks both accuracy and reduction in human interventions (Jakubik et al., 2023).
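These metrics can be sketched as follows, assuming an oracle human expert (human-routed instances are counted correct) and the utility form U = A − β·H; the exact weighting used in the paper may differ:

```python
def evaluate(predictions, truths, routed_to_human, beta=1.0):
    """Compute accuracy A, human effort H, and utility U = A - beta * H."""
    n = len(truths)
    # Oracle-human assumption: any instance routed to the human is correct.
    correct = sum(
        1 for p, t, h in zip(predictions, truths, routed_to_human)
        if h or p == t
    )
    accuracy = correct / n
    effort = sum(routed_to_human) / n
    return {
        "accuracy": accuracy,
        "human_effort": effort,
        "utility": accuracy - beta * effort,
    }
```

Sweeping `beta` reproduces the sensitivity analysis described above: as human review becomes costlier, configurations that escalate less gain utility.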
4. Empirical Findings and Comparative Analysis
Notable findings from experimental studies include:
- AIITL with Mixture-of-Experts Gating: Achieves high accuracy with minimal residual human effort and the highest combined utility, retiring the human expert once sufficient artificial expert coverage is achieved.
- Mahalanobis and ODIN OOD Detectors: Both yield significant reductions in human effort and increased utility over traditional HITL.
- Baseline and Upper Bound: Even a perfect HITL setting with optimal human allocation is outperformed by AIITL variants in combined utility.
Statistical significance is confirmed via paired t-tests across repeated runs. The modular artificial expert bank also circumvents catastrophic forgetting, a key issue in monolithic model updates, by decomposing the problem into per-expert tasks (Jakubik et al., 2023).
5. Theoretical Insights, Practical Benefits, and Limitations
Key insights and practical consequences for expert-in-the-loop systems:
- Effort–Accuracy Tradeoff: AIITL dominates HITL for any nontrivial human review cost, except in regimes where human effort is nearly free.
- Learning Dynamics: Artificial experts require enough labeled support to meet activation thresholds; ODIN and Mahalanobis scoring can be deployed earlier, while learned gating ultimately offers higher utility at the price of increased initial annotation.
- Scalability Challenges: The expert bank grows linearly with the number of novel classes, which raises scalability concerns in domains with hundreds of emerging classes, a limitation yet to be resolved.
- Domain and Feedback Limitations: Evaluations are currently restricted to the vision domain with idealized oracle humans. Real-world studies involving noisy or late expert feedback, or application to text and structured data, remain open areas for development.
- Activation Heuristic Flexibility: Reliance on fixed accuracy thresholds could be replaced or complemented by uncertainty-driven or PAC-style criteria to further optimize engagement (Jakubik et al., 2023).
6. Broader Context and Future Directions
The expert-in-the-loop paradigm generalizes to a variety of settings beyond image classification, as a formalization of mixed-initiative, modular, and incrementally adaptive machine intelligence architectures. Core design patterns—dynamic routing via OOD estimation, modular expert instantiation, incremental specialization, and human escalation—are applicable to structured data, continual learning, and lifelong adaptation. Open questions remain in managing expert bank complexity, efficiently handling non-ideal expert behaviors, and formalizing optimal engagement schedules under cost constraints. The architecture outlined sets the foundation for next-generation hybrid AI systems with provable benefits in utility and expert resource efficiency (Jakubik et al., 2023).