Papers
Topics
Authors
Recent
Search
2000 character limit reached

Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations

Published 18 Apr 2024 in cs.AR and cs.GR | (2404.11852v1)

Abstract: Neural Radiance Field (NeRF) is widely seen as an alternative to traditional physically-based rendering. However, NeRF has not yet seen its adoption in resource-limited mobile systems such as Virtual and Augmented Reality (VR/AR), because it is simply extremely slow. On a mobile Volta GPU, even the state-of-the-art NeRF models generally execute only at 0.8 FPS. We show that the main performance bottlenecks are both algorithmic and architectural. We introduce, CICERO, to tame both forms of inefficiencies. We first introduce two algorithms, one fundamentally reduces the amount of work any NeRF model has to execute, and the other eliminates irregular DRAM accesses. We then describe an on-chip data layout strategy that eliminates SRAM bank conflicts. A pure software implementation of CICERO offers an 8.0x speed-up and 7.9x energy saving over a mobile Volta GPU. When compared to a baseline with a dedicated DNN accelerator, our speed-up and energy reduction increase to 28.2x and 37.8x, respectively - all with minimal quality loss (less than 1.0 dB peak signal-to-noise ratio reduction).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (7)
  1. ``Agisoft metashape,'' https://www.agisoft.com/.
  2. ``Apple A15 Die Shot and Annotation - IP Block Area Analysis.'' [Online]. Available: https://www.semianalysis.com/p/apple-a15-die-shot-and-annotation
  3. ``Micron 178-Ball, Single-Channel Mobile LPDDR3 SDRAM Features.'' [Online]. Available: https://www.micron.com/-/media/client/global/documents/products/data-sheet/dram/mobile-dram/low-power-dram/lpddr3/178b_8-16gb_2c0f_mobile_lpddr3.pdf
  4. ``Micron System Power Calculators.'' [Online]. Available: https://www.micron.com/support/tools-and-utilities/power-calc
  5. ``Nvidia reveals xavier soc details.'' [Online]. Available: https://www.forbes.com/sites/moorinsights/2018/08/24/nvidia-reveals-xavier-soc-details/amp/
  6. ``NVIDIA’s Xavier System-on-Chip, HotChips 30.'' [Online]. Available: https://fuse.wikichip.org/news/1618/hot-chips-30-nvidia-xavier-soc/
  7. PCL, ``Spatial partitioning and search operations with octrees,'' https://pcl.readthedocs.io/projects/tutorials/en/latest/octree.html.
Citations (2)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We found no open problems mentioned in this paper.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 13 likes about this paper.