Necessity and content of images in SWE-bench task instances
Determine, for the subset of SWE-bench task instances that include an image, what the image depicts and whether the image is necessary for solving the corresponding task instance, in order to clarify the role of visual information within SWE-bench problem statements.
References
For the 5.6% of SWE-bench task instances with an image, it is unclear what these images portray and whether they are necessary to solving the task.
— SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
(2410.03859 - Yang et al., 2024) in Section 2.1 (Preliminaries), Limitations