Measuring and evaluating design AI systems
Develop principled methodologies for measuring and evaluating graphic design AI systems, addressing open questions about how design capabilities should be quantified and compared across tasks and models.
References
Despite its breadth, GDB surfaces several limitations that point to open problems in how design AI should be measured and evaluated.
— Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks
(2604.04192 - Deganutti et al., 5 Apr 2026) in Discussion, Section 7 (Evaluation Gaps)