Ascertain whether ARC-AGI overfitting is accidental or intentional
Determine whether the observed knowledge-dependent benchmark overfitting affecting ARC-AGI-1 and ARC-AGI-2 has arisen accidentally or intentionally, in order to clarify the source of apparent contamination and its implications for benchmark validity.
References
We assert that this phenomenon is now occurring with ARC-AGI-1 and ARC-AGI-2 – accidentally or intentionally, although we cannot determine which.
— ARC Prize 2025: Technical Report
(2601.10904 - Chollet et al., 15 Jan 2026) in Section: Knowledge Overfitting