Training for Token-Utility Maximization in Late Interaction Retrieval
Determine whether explicitly training ColBERT-style late-interaction (MaxSim) retrieval models with objectives that maximize the utility of each document token in their multi-vector representations improves retrieval performance across multimodal datasets.
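To make the terms concrete, the following is a minimal sketch of MaxSim scoring together with one possible per-document utilization proxy. MaxSim itself is the standard late-interaction score (for each query token, take the maximum similarity over document tokens, then sum). The `token_utilization` function is an illustrative assumption, not the definition used in the cited paper: it simply counts the fraction of document tokens that win at least one query-token maximum. The function names and the toy vectors are hypothetical.

```python
import numpy as np

def maxsim_score(query_vecs, doc_vecs):
    """Return the MaxSim score and, per query token, the index of the winning document token."""
    sims = query_vecs @ doc_vecs.T           # shape: (num_query_tokens, num_doc_tokens)
    winners = sims.argmax(axis=1)            # document token selected by each query token
    score = float(sims.max(axis=1).sum())    # sum of per-query-token maxima
    return score, winners

def token_utilization(winners, num_doc_tokens):
    """Assumed proxy: fraction of document tokens that win at least one max."""
    return len(set(winners.tolist())) / num_doc_tokens

# Toy example: 2 query tokens, 4 document tokens, 2-dimensional embeddings.
query = np.array([[1.0, 0.0], [0.0, 1.0]])
doc = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [-1.0, 0.0]])

score, winners = maxsim_score(query, doc)
utilization = token_utilization(winners, len(doc))
```

In this toy case only the first two document tokens are ever selected, so half of the stored vectors contribute nothing to the score. A training objective of the kind the problem statement describes would aim to reduce that unused fraction; how best to formulate such an objective is the open question.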
References
These results suggest that training late interaction methods to maximize the utility of each token in its document representations will lead to strong performance, which we leave for future work to explore.
— Multi-Vector Index Compression in Any Modality
(2602.21202 - Qin et al., 24 Feb 2026) in Section 6 (Experiments), Subsection "Index Utilization" — paragraph "Predicting Performance with Utilization"