Autonomous AI capability on research-level mathematics
Determine how well current AI systems can solve research-level mathematics questions autonomously, without an expert human in the loop, and thereby establish where such systems stand in independent research problem-solving.
References
While commercial AI systems are undoubtedly already at a level where they are useful tools for mathematicians, it is not yet clear where AI systems stand at solving research-level math questions on their own, without an expert in the loop.
— First Proof
(2602.05192 - Abouzaid et al., 5 Feb 2026) in Section 1 (Introduction), paragraph 3, page 2