CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering
–Neural Information Processing Systems
This paper proposes a neural rendering approach that represents a scene as "compressed light-field tokens (CLiFTs)", retaining rich appearance and geometric information of a scene. CLiFT enables compute-efficient rendering by compressed tokens, while being capable of changing the number of tokens to represent a scene or render a novel view with one trained network. Concretely, given a set of images, multi-view encoder tokenizes the images with the camera poses. Latent-space K-means selects a reduced set of rays as cluster centroids using the tokens. The multi-view "condenser" compresses the information of all the tokens into the centroid tokens to construct CLiFTs. At test time, given a target view and a compute budget (i.e., the number of CLiFTs), the system collects the specified number of nearby tokens and synthesizes a novel view using a compute-adaptive renderer.
Neural Information Processing Systems
Jun-14-2026, 16:26:14 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology (0.93)
- Technology:
- Information Technology
- Communications (0.93)
- Artificial Intelligence
- Vision (1.00)
- Machine Learning > Neural Networks (1.00)
- Information Technology