Planning directly in latent action space
Develop methods to perform planning directly within the latent action space learned by latent action world models trained on in-the-wild videos, rather than mapping from known action spaces. Specifically, construct sampling and optimization procedures that operate over the continuous latent action vectors inferred by the inverse dynamics model and account for the geometry of sparsity- or noise-regularized latent actions, enabling goal-directed sequence generation in latent space.
References
Performing planning directly in latent action space is, to the best of our knowledge, an open problem that can be made worse depending on the geometry of the latent action space.
— Learning Latent Action World Models In The Wild
(2601.05230 - Garrido et al., 8 Jan 2026) in Appendix, Section “Sampling latent actions”