AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks
Abstract: Chart summarization is a crucial task for blind and visually impaired individuals, as it is their primary means of accessing and interpreting graphical data. Crafting high-quality descriptions is challenging because the essential details of a chart must be communicated precisely to readers who cannot perceive it visually. Many chart analysis methods, however, produce brief, unstructured responses that may contain significant hallucinations, which undermines their reliability for blind people. To address these challenges, this work presents three key contributions: (1) We introduce the AltChart dataset, comprising 10,000 real chart images, each paired with a comprehensive summary featuring long-context, semantically rich annotations. (2) We propose a new method for pretraining Vision-LLMs (VLMs) to learn fine-grained chart representations by training with multiple pretext tasks, yielding a performance gain of ${\sim}2.5\%$. (3) We conduct extensive evaluations of four leading chart summarization models, analyzing how accessible their descriptions are. Our dataset and code are publicly available on our project page: https://github.com/moured/AltChart.