Addressing Failures in Robotics using Vision-Based Language Models (VLMs) and Behavior Trees (BT)

Published 3 Nov 2024 in cs.RO | (2411.01568v1)

Abstract: In this paper, we propose an approach that combines Vision LLMs (VLMs) and Behavior Trees (BTs) to address failures in robotics. Current robotic systems can handle known failures with pre-existing recovery strategies, but they are often ill-equipped to manage unknown failures or anomalies. We introduce VLMs as a monitoring tool to detect and identify failures during task execution. Additionally, VLMs generate missing conditions or skill templates that are then incorporated into the BT, ensuring the system can autonomously address similar failures in future tasks. We validate our approach through simulations in several failure scenarios.