Papers
Topics
Authors
Recent
Search
2000 character limit reached

Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras

Published 12 Dec 2023 in cs.CV | (2312.07423v2)

Abstract: We present the first approach to render highly realistic free-viewpoint videos of a human actor in general apparel, from sparse multi-view recording to display, in real-time at an unprecedented 4K resolution. At inference, our method only requires four camera views of the moving actor and the respective 3D skeletal pose. It handles actors in wide clothing, and reproduces even fine-scale dynamic detail, e.g. clothing wrinkles, face expressions, and hand gestures. At training time, our learning-based approach expects dense multi-view video and a rigged static surface scan of the actor. Our method comprises three main stages. Stage 1 is a skeleton-driven neural approach for high-quality capture of the detailed dynamic mesh geometry. Stage 2 is a novel solution to create a view-dependent texture using four test-time camera views as input. Finally, stage 3 comprises a new image-based refinement network rendering the final 4K image given the output from the previous stages. Our approach establishes a new benchmark for real-time rendering resolution and quality using sparse input camera views, unlocking possibilities for immersive telepresence.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org.
  2. 4D Video Textures for Interactive Character Appearance. Computer Graphics Forum (Proceedings of EUROGRAPHICS), 33(2):371–380, 2014.
  3. Everybody dance now. IEEE International Conference on Computer Vision (ICCV), 1:0–0, 2019.
  4. Uv volumes for real-time rendering of editable free-view human performance. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16621–16631, 2023.
  5. Fusion4d: Real-time performance capture of challenging scenes. ACM Transactions on Graphics (TOG), 35(4):114, 2016.
  6. Floating textures. Computer Graphics Forum (Proc. of Eurographics EG), 27(2):409–418, 2008. Received the Best Student Paper Award at Eurographics 2008.
  7. Deepcap: Monocular human performance capture using weak supervision. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 1:1, 2020.
  8. Real-time deep dynamic characters. ACM Transactions on Graphics (TOG), 40(4):1–16, 2021.
  9. Hdhumans: A hybrid approach for high-fidelity digital humans, 2022.
  10. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Proceedings of the 31st International Conference on Neural Information Processing Systems, page 6629–6640, Red Hook, NY, USA, 2017. Curran Associates Inc.
  11. HumanRF: High-fidelity neural radiance fields for humans in motion. ACM Transactions on Graphics, 42(4):1–12, 2023.
  12. Neuman: Neural human radiance field from a single video. In European Conference on Computer Vision (ECCV), 2022.
  13. High-fidelity neural human motion transfer from monocular video. In Computer Vision and Pattern Recognition (CVPR), pages 1541–1550, 2021.
  14. Skinning with dual quaternions. In Proceedings of the 2007 Symposium on Interactive 3D Graphics and Games, page 39–46, New York, NY, USA, 2007a. Association for Computing Machinery.
  15. Skinning with dual quaternions. In Proceedings of the 2007 symposium on Interactive 3D graphics and games, pages 39–46. ACM, 2007b.
  16. Adam: A method for stochastic optimization, 2017.
  17. Neural image-based avatars: Generalizable radiance fields for human avatar modeling, 2023.
  18. Tava: Template-free animatable volumetric actors. In European Conference on Computer Vision (ECCV), 2022a.
  19. Neural 3d video synthesis from multi-view video. In Computer Vision and Pattern Recognition (CVPR), 2022b.
  20. Enhanced deep residual networks for single image super-resolution, 2017.
  21. Efficient neural radiance fields with learned depth-guided sampling. In SIGGRAPH Asia Conference Proceedings, 2022.
  22. Real-time high-resolution background matting. In Conference on Computer Vision and Pattern Recognition (CVPR), pages 8762–8771, 2021.
  23. Neural actor: Neural free-view synthesis of human actors with pose control. ACM Trans. Graph., 40(6), 2021.
  24. Robust dynamic radiance fields. In Computer Vision and Pattern Recognition (CVPR), 2023.
  25. Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Transactions on Graphics (TOG), 38(4):1–14, 2019.
  26. Neural articulated radiance field. In International Conference on Computer Vision, 2021.
  27. Expressive body capture: 3d hands, face, and body from a single image, 2019.
  28. Animatable neural radiance fields for human body modeling. ICCV, 2021a.
  29. Neural body: Implicit neural representations with structured latent codes for novel view synthesis of dynamic humans. CVPR, 1(1):9054–9063, 2021b.
  30. Anr: Articulated neural rendering for virtual avatars. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
  31. Drivable volumetric avatars using texel-aligned features. In ACM SIGGRAPH 2022 Conference Proceedings, pages 1–9, 2022.
  32. U-net: Convolutional networks for biomedical image segmentation, 2015.
  33. Textured neural avatars. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  34. First order motion model for image animation. In Conference on Neural Information Processing Systems (NeurIPS), 2019.
  35. As-rigid-as-possible surface modeling. In Proceedings of the Fifth Eurographics Symposium on Geometry Processing. Eurographics Association, 2007.
  36. TheCaptury. The Captury. http://www.thecaptury.com/, 2020.
  37. Non-rigid neural radiance fields: Reconstruction and novel view synthesis of a dynamic scene from monocular video. In IEEE International Conference on Computer Vision (ICCV). IEEE, 2021.
  38. Complete multi-view reconstruction of dynamic scenes from probabilistic fusion of narrow and wide baseline stereo. In International Conference on Computer Vision (ICCV), pages 1709–1716. IEEE, 2009.
  39. Dynamic shape capture using multi-view photometric stereo. ACM Transactions on Graphics (TOG), 28(5):174, 2009.
  40. Flow supervision for deformable nerf. In Computer Vision and Pattern Recognition (CVPR), 2023a.
  41. Ibrnet: Learning multi-view image-based rendering. In CVPR, 2021.
  42. Arah: Animatable volume rendering of articulated human sdfs. In European Conference on Computer Vision, 2022a.
  43. Neus2: Fast learning of neural implicit surfaces for multi-view reconstruction. In International Conference on Computer Vision (ICCV), pages 3295–3306, 2023b.
  44. Hvh: Learning a hybrid neural volumetric representation for dynamic hair performance capture. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022b.
  45. Scalable 3d video of dynamic scenes. The Visual Computer, 21:629–638, 2005.
  46. Humannerf: Free-viewpoint rendering of moving people from monocular video. In Computer Vision and Pattern Recognition (CVPR), pages 16210–16220, 2022.
  47. Dressing avatars: Deep photorealistic appearance for physically simulated clothing. ACM Trans. Graph., 41(6), 2022.
  48. Video-based characters: Creating new human performances from a multi-view video database. In ACM SIGGRAPH 2011 Papers, pages 32:1–32:10, New York, NY, USA, 2011. ACM.
  49. H-nerf: Neural radiance fields for rendering and temporal reconstruction of humans in motion, 2021.
  50. pixelNeRF: Neural radiance fields from one or few images. In CVPR, 2021.
  51. Cem Yuksel. Sample elimination for generating poisson disk sample sets. Computer Graphics Forum, 34, 2015.
  52. Editable free-viewpoint video using a layered neural representation. ACM Transactions on Graphics (TOG), 40(4):1–18, 2021.
  53. The unreasonable effectiveness of deep features as a perceptual metric. In Conference on Computer Vision and Pattern Recognition (CVPR), pages 586–595, Los Alamitos, CA, USA, 2018. IEEE Computer Society.
  54. Human performance modeling and rendering via neural animated mesh, 2022.
Citations (5)

Summary

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 66 likes about this paper.