Papers
Topics
Authors
Recent
Search
2000 character limit reached

FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning

Published 21 Feb 2024 in cs.LG, cs.AI, cs.RO, cs.SY, eess.SP, and eess.SY | (2402.13820v1)

Abstract: Motion trajectories offer reliable references for physics-based motion learning but suffer from sparsity, particularly in regions that lack sufficient data coverage. To address this challenge, we introduce a self-supervised, structured representation and generation method that extracts spatial-temporal relationships in periodic or quasi-periodic motions. The motion dynamics in a continuously parameterized latent space enable our method to enhance the interpolation and generalization capabilities of motion learning algorithms. The motion learning controller, informed by the motion parameterization, operates online tracking of a wide range of motions, including targets unseen during training. With a fallback mechanism, the controller dynamically adapts its tracking strategy and automatically resorts to safe action execution when a potentially risky target is proposed. By leveraging the identified spatial-temporal structure, our work opens new possibilities for future advancements in general motion representation and learning algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (47)
  1. The nonstochastic multiarmed bandit problem. SIAM journal on computing, 32(1):48–77, 2002.
  2. Adapting wavelet compression to human motion capture clips. In Proceedings of Graphics Interface 2007, pp.  313–318, 2007.
  3. Drecon: data-driven responsive control of physics-based characters. ACM Transactions On Graphics (TOG), 38(6):1–11, 2019.
  4. Towards learning to imitate from a single video demonstration. arXiv preprint arXiv:1901.07186, 2019.
  5. Motion signal processing. In Proceedings of the 22nd annual conference on Computer graphics and interactive techniques, pp.  97–104, 1995.
  6. The mit humanoid robot: Design, motion planning, and control for acrobatic behaviors. In 2020 IEEE-RAS 20th International Conference on Humanoid Robots (Humanoids), pp.  1–8. IEEE, 2021.
  7. Maximum likelihood from incomplete data via the em algorithm. Journal of the royal statistical society: series B (methodological), 39(1):1–22, 1977.
  8. Synchronization in complex networks of phase oscillators: A survey. Automatica, 50(6):1539–1564, 2014.
  9. Deep spatial autoencoders for visuomotor learning. In 2016 IEEE International Conference on Robotics and Automation (ICRA), pp.  512–519. IEEE, 2016.
  10. Learning robot gait stability using neural networks as sensory feedback function for central pattern generators. In 2013 IEEE/RSJ international conference on intelligent robots and systems, pp.  194–201. Ieee, 2013.
  11. World models. arXiv preprint arXiv:1803.10122, 2018.
  12. Dream to control: Learning behaviors by latent imagination. arXiv preprint arXiv:1912.01603, 2019.
  13. Phase-functioned neural networks for character control. ACM Transactions on Graphics (TOG), 36(4):1–13, 2017.
  14. How to train your robot with deep reinforcement learning: lessons we have learned. The International Journal of Robotics Research, 40(4-5):698–721, 2021.
  15. Auke Jan Ijspeert. Central pattern generators for locomotion control in animals and robots: a review. Neural networks, 21(4):642–653, 2008.
  16. Policies modulating trajectory generators. In Conference on Robot Learning, pp.  916–926. PMLR, 2018.
  17. Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3):535–547, 2019.
  18. Learning to correspond dynamical systems. In Learning for Dynamics and Control, pp.  105–117. PMLR, 2020.
  19. Learning quadrupedal locomotion over challenging terrain. Science robotics, 5(47):eabc5986, 2020.
  20. Learning a family of motor skills from a single motion clip. ACM Transactions on Graphics (TOG), 40(4):1–13, 2021.
  21. Motion fields for interactive character locomotion. In ACM SIGGRAPH Asia 2010 papers, pp.  1–8. 2010.
  22. Continuous character control with low-dimensional embeddings. ACM Transactions on Graphics (TOG), 31(4):1–10, 2012.
  23. Versatile skill control via self-supervised adversarial imitation of unlabeled mixed motions. In 2023 IEEE International Conference on Robotics and Automation (ICRA), pp.  2944–2950. IEEE, 2023a.
  24. Learning agile skills via adversarial imitation of rough partial demonstrations. In Conference on Robot Learning, pp.  342–352. PMLR, 2023b.
  25. Hierarchical spacetime control. In Proceedings of the 21st annual conference on Computer graphics and interactive techniques, pp.  35–42, 1994.
  26. The strategic student approach for life-long exploration and learning. In 2012 IEEE international conference on development and learning and epigenetic robotics (ICDL), pp.  1–8. IEEE, 2012.
  27. Learning robust perceptive locomotion for quadrupedal robots in the wild. Science Robotics, 7(62):eabk2822, 2022.
  28. Motion graphs++ a compact generative model for semantic motion analysis and synthesis. ACM Transactions on Graphics (TOG), 31(6):1–12, 2012.
  29. Deepmimic: Example-guided deep reinforcement learning of physics-based character skills. ACM Transactions On Graphics (TOG), 37(4):1–14, 2018.
  30. Learning agile robotic locomotion skills by imitating animals. arXiv preprint arXiv:2004.00784, 2020.
  31. Amp: Adversarial motion priors for stylized physics-based character control. ACM Transactions on Graphics (ToG), 40(4):1–20, 2021.
  32. Ase: Large-scale reusable adversarial skill embeddings for physically simulated characters. ACM Transactions On Graphics (TOG), 41(4):1–17, 2022.
  33. Teacher algorithms for curriculum learning of deep rl in continuously parameterized environments. In Conference on Robot Learning, pp.  835–853. PMLR, 2020.
  34. Carl Rasmussen. The infinite gaussian mixture model. Advances in neural information processing systems, 12, 1999.
  35. Verbs and adverbs: Multidimensional motion interpolation. IEEE Computer Graphics and Applications, 18(5):32–40, 1998.
  36. Learning to walk in minutes using massively parallel deep reinforcement learning. In Conference on Robot Learning, pp.  91–100. PMLR, 2022.
  37. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
  38. Deeptransition: Viability leads to the emergence of gait transitions in learning anticipatory quadrupedal locomotion skills. arXiv preprint arXiv:2306.07419, 2023.
  39. Motion in-betweening with phase manifolds. Proceedings of the ACM on Computer Graphics and Interactive Techniques, 6(3):1–17, 2023.
  40. Local motion phases for learning multi-contact character movements. ACM Transactions on Graphics (TOG), 39(4):54–1, 2020.
  41. Deepphase: Periodic autoencoders for learning motion phase manifolds. ACM Transactions on Graphics (TOG), 41(4):1–13, 2022.
  42. Sim-to-real: Learning agile locomotion for quadruped robots. arXiv preprint arXiv:1804.10332, 2018.
  43. Fourier principles for emotion-based human figure animation. In Proceedings of the 22nd annual conference on Computer graphics and interactive techniques, pp.  91–96, 1995.
  44. Embed to control: A locally linear latent dynamics model for control from raw images. Advances in neural information processing systems, 28, 2015.
  45. Interpolation synthesis of articulated figure motion. IEEE Computer Graphics and Applications, 17(6):39–45, 1997.
  46. Simper: Simple self-supervised learning of periodic targets. arXiv preprint arXiv:2210.03115, 2022.
  47. Spectral style transfer for human motion between independent actions. ACM Transactions on Graphics (TOG), 35(4):1–8, 2016.
Citations (2)

Summary

  • The paper introduces FLD as a novel framework that leverages Fourier-transformed latent dynamics to significantly reduce reconstruction and prediction errors over extended horizons.
  • It enhances traditional periodic autoencoders by incorporating a continuously parameterized latent space that captures critical trajectory parameters like frequency, amplitude, and offset.
  • FLD demonstrates robust performance in robotics and animation by effectively generalizing to unseen motions and yielding structured latent manifolds with regular temporal consistency.

A Self-Supervised Structured Representation for Motion Learning via Fourier Latent Dynamics

The paper presents a novel framework termed Fourier Latent Dynamics (FLD) to address the challenges of motion interpolation and generalization in physics-based motion learning using sparse motion trajectories. The core aim is to develop a method that effectively captures the spatial-temporal relationships inherent in periodic or quasi-periodic motions for robust motion learning applications.

Overview of the Methodology

At the heart of the proposed approach is the enhancement of traditional autoencoders with a structured learning paradigm that emphasizes a continuously parameterized latent space. By employing a periodic autoencoder (PAE) as the baseline, FLD extends its capabilities by introducing predictive knowledge within the Fourier-transformed latent space. The learning formulation utilizes latent dynamics to maintain and predict trajectory parameters—namely frequency, amplitude, and offset—across an extended horizon, relying on a quasi-constant parameterization assumption over periodic-like motions. This assumption permits the approximation of latent trajectories with bounded error over time, significantly reducing the required dimensions for accurate and generalizable representation.

Numerical Evaluation and Results

The experimental evaluation demonstrates the advantages of FLD in both reconstruction and prediction tasks, exhibiting reduced mean squared errors over longer prediction horizons compared to traditional models such as feed-forward networks and base PAEs. The model's robustness is particularly noted in its ability to generalize to unseen motions, validated through the diagonal run motion example where FLD consistently predicts future states with smaller errors relative to alternative modeling approaches.

Moreover, the paper highlights FLD's proficiency in manifesting structured latent manifolds, which assert intrinsic motion similarities by clustering latent features in a meaningful way. Compared to a Variational Autoencoder (VAE) and raw input states, FLD's enforcement of latent structure yields significantly enhanced temporal regularity and consistency in the learned manifold.

Practical Implications and Future Directions

The potential implications of this work span both theoretical advancements and practical applications. Practically, the ability to interpolate and generate realistic motion sequences with fewer data points makes FLD particularly valuable for robotic systems and virtual character animations, where data scarcity often limits performance. The proposed fallback mechanism serves as a safety net for such systems, dynamically shifting strategies to avoid dangerous states and ensuring reliable motion execution even in unanticipated conditions.

Theoretically, FLD invites further exploration into enhancing self-supervised learning architectures for complex temporal tasks, potentially expanding the framework to accommodate broader non-periodic or transient actions through adaptive structure modifications. Future works could explore the integration of this methodology with reinforcement learning to dynamically adapt agents' strategies in complex environments. Moreover, addressing limitations such as quasi-constant parameter assumptions for mixed motion phases could lead to further refinements and broader applicability of the framework in diverse settings.

In conclusion, FLD offers an innovative pathway to reconcile motion learning with the need for compact and effective representation, fostering enhanced interpolation and generalization capabilities that hold promise across a range of computational and robotic applications.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 5 tweets with 61 likes about this paper.