Usefulness of LLM-generated hypotheses and ideas for scientific discovery
Determine whether hypotheses and research ideas generated by large language models are useful in practice and lead to new scientific discoveries, given their typically theoretical nature and the costly validation required to justify them.
References
Additionally, given that hypotheses and ideas are typically theoretical and cannot be validated without costly justification, it is unclear whether generated hypotheses and ideas are truly useful and lead to new scientific discoveries.
— Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation
(2502.05151 - Eger et al., 7 Feb 2025) in Subsection "Limitations and future directions", Designing and conducting experiments; AI-based discovery (Section \ref{sec:experiments})