Sequential and compositional generalization in model-free imitation learning from pixels
Determine whether model-free imitation learning from raw image observations can achieve sequential and compositional generalization to novel tasks beyond the training distribution, particularly in settings with long-horizon tasks and small demonstration datasets.
References
However, sequential and compositional generalization to novel tasks beyond the training distribution remain open challenges, especially when task horizons are long and demonstration datasets are small.
— From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models
(2501.00296 - Athalye et al., 2024) in Section 1 (Introduction)