Test generalization of SmartSearch to other domains and languages
Determine whether the SmartSearch deterministic retrieval and ranking pipeline for conversational memory retrieval generalizes beyond English conversational memory tasks to other domains such as document search and code retrieval, as well as to other languages, and rigorously evaluate performance and required adaptations in these settings.
References
Generalization to other domains (e.g., document search, code retrieval) or languages remains untested.
— SmartSearch: How Ranking Beats Structure for Conversational Memory Retrieval
(2603.15599 - Derehag et al., 16 Mar 2026) in Threats to Validity – Benchmark Limitations