Effectiveness of LLM-based deanonymization when true matches are extremely rare
Establish whether large-language-model-based deanonymization retains non-trivial recall at high precision even when the prior probability that a query has any matching candidate in the pool is extremely small (e.g., π = 0.0001), demonstrating that, for the few identifiable users, the pipeline can still reliably find correct matches under this low-matchability regime.
References
We hence conjecture that, even in settings where almost no users can be deanonymized, LLM-based attacks are reasonably likely to find a correct match for the few users that are identifiable.
— Large-scale online deanonymization with LLMs
(2602.16800 - Lermen et al., 18 Feb 2026) in Section 6.2 (Comparing difficulty parameters of our attack model)