Effective evaluation of unified multimodal generation
Develop effective methodologies to evaluate unified multimodal generation systems that jointly produce images and text in response to a single prompt, ensuring that the evaluation captures both modalities and their interaction.
References
How to effectively evaluate unified multimodal generation remains an open problem.
— UEval: A Benchmark for Unified Multimodal Generation
(2601.22155 - Li et al., 29 Jan 2026) in Section: Rubric Generation and Evaluation