Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Published 8 May 2025 in cs.CL | (2505.05464v1)

Abstract: Vision-Language Models (VLMs) combine visual perception with the general capabilities, such as reasoning, of Large Language Models (LLMs). However, the mechanisms by which these two abilities are combined and how each contributes remain poorly understood. In this work, we explore composing perception and reasoning through model merging, which connects the parameters of different models. Unlike previous works, which often focus on merging models of the same kind, we propose merging models across modalities, enabling the reasoning capabilities of LLMs to be incorporated into VLMs. Through extensive experiments, we demonstrate that model merging offers a successful pathway for transferring reasoning abilities from LLMs to VLMs in a training-free manner. Moreover, we use the merged models to understand the internal mechanisms of perception and reasoning and how merging affects them. We find that perception capabilities are predominantly encoded in the early layers of the model, whereas reasoning is largely facilitated by the middle-to-late layers. After merging, we observe that all layers begin to contribute to reasoning, whereas the distribution of perception abilities across layers remains largely unchanged. These observations shed light on the potential of model merging as a tool for multimodal integration and interpretation.
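
To make "merging that connects parameters of different models" concrete, here is a minimal sketch of one common training-free scheme: linear interpolation of the weights two models share. The abstract does not say which merging method the paper uses, so this is an illustrative assumption. The function name `merge_linear`, the mixing weight `alpha`, and the toy parameter names are all hypothetical.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of weights.
Params = Dict[str, List[float]]


def merge_linear(vlm: Params, llm: Params, alpha: float = 0.5) -> Params:
    """Interpolate parameters present in both models; copy VLM-only ones.

    alpha is the weight given to the VLM (hypothetical knob, not from the paper).
    """
    merged: Params = {}
    for name, v_weights in vlm.items():
        l_weights = llm.get(name)
        if l_weights is None or len(l_weights) != len(v_weights):
            # No matching LLM parameter (e.g. vision-side weights): keep as-is.
            merged[name] = list(v_weights)
        else:
            merged[name] = [
                alpha * v + (1.0 - alpha) * l
                for v, l in zip(v_weights, l_weights)
            ]
    return merged


# Hypothetical toy checkpoints: one vision-only tensor, one shared tensor.
vlm = {"vision.proj": [1.0, 2.0], "layers.0.mlp": [0.0, 4.0]}
llm = {"layers.0.mlp": [2.0, 0.0]}
merged = merge_linear(vlm, llm, alpha=0.5)
```

In this sketch, only the language-model weights that both checkpoints share are mixed. Vision-specific parameters pass through unchanged, and no gradient step is taken, which is what "training-free" means here.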
