Develop a Continuous Unified Visual Tokenizer for Understanding and Generation
Develop a simple yet effective continuous visual tokenizer that naturally supports both visual understanding and image generation.
References
As a result, developing a simple yet effective continuous visual tokenizer that naturally supports both visual understanding and generation remains an open and practically important challenge.
— OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
(2601.15369 - Zhang et al., 21 Jan 2026) in Section 1, Introduction