Optimal combination of multiple uncertainty scores for better discriminability
Determine an optimal strategy for combining multiple uncertainty estimation scores for large language models—such as pairwise concatenations of scores like CoE-C and verbalized confidence within the inter-score Truth Anchoring (TAC) framework—to maximize discriminability between correct and incorrect responses as measured by AUC.
References
However, it remains to be seen how we can optimally combine scores to achieve even better discriminability.
— Towards Reliable Truth-Aligned Uncertainty Estimation in Large Language Models
(2604.00445 - Srey et al., 1 Apr 2026) in Section: Inter-score Anchoring