Real-time Streaming Joint Audio-Visual Generation
Develop real-time streaming methods for joint audio-visual generation that can produce audio and video concurrently within a single framework.
References
Achieving real-time streaming for joint audio-visual generation remains an open, highly compelling, and unresolved problem.
— OmniForcing: Unleashing Real-time Joint Audio-Visual Generation
(2603.11647 - Su et al., 12 Mar 2026) in Section 2, Related Work