AITopics | pvcnn

5737034557ef5b8c02c0e46513b98f90-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 06:03:19 GMT

convolution, pvcnn, voxel resolution, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Point-Voxel CNN for Efficient 3D Deep Learning

Neural Information Processing SystemsDec-25-2025, 10:10:57 GMT

Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is wasted on dealing with the sparse data which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN that represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to reduce the irregular, sparse data access and improve the locality. Our PVCNN model is both memory and computation efficient. Evaluated on semantic and part segmentation datasets, it achieves much higher accuracy than the voxel-based baseline with 10 GPU memory reduction; it also outperforms the state-of-the-art point-based models with 7 measured speedup on average. Remarkably, the narrower version of PVCNN achieves 2 speedup over PointNet (an extremely efficient model) on part and scene segmentation benchmarks with much higher accuracy. We validate the general effectiveness of PVCNN on 3D object detection: by replacing the primitives in Frustrum PointNet with PVConv, it outperforms Frustrum PointNet++ by 2.4% mAP on average with 1.5 measured speedup and GPU memory reduction.

deep learning, name change, point-voxel cnn, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.42)

Add feedback

Point-Voxel CNN for Efficient 3D Deep Learning

Zhijian Liu, Haotian Tang, Yujun Lin, Song Han

Neural Information Processing SystemsOct-2-2025, 18:43:13 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, pvcnn, (17 more...)

Neural Information Processing Systems

Country: North America (0.28)

Industry: Information Technology (0.68)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

Add feedback

5737034557ef5b8c02c0e46513b98f90-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 18:42:57 GMT

artificial intelligence, machine learning, pvcnn, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Point-Voxel CNN for Efficient 3D Deep Learning

Neural Information Processing SystemsOct-10-2024, 02:00:20 GMT

Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is wasted on dealing with the sparse data which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN that represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to reduce the irregular, sparse data access and improve the locality.

deep learning, gpu memory reduction, point-voxel cnn, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

Add feedback

Distance Estimation and Animal Tracking for Wildlife Camera Trapping

Johanns, Peter, Haucke, Timm, Steinhage, Volker

arXiv.org Artificial IntelligenceFeb-9-2022

The ongoing biodiversity crysis calls for accurate estimation of animal density and abundance to identify, for example, sources of biodiversity decline and effectiveness of conservation interventions. Camera traps together with abundance estimation methods are often employed for this purpose. The necessary distances between camera and observed animal are traditionally derived in a laborious, fully manual or semi-automatic process. Both approaches require reference image material, which is both difficult to acquire and not available for existing datasets. In this study, we propose a fully automatic approach to estimate camera-to-animal distances, based on monocular depth estimation (MDE), and without the need of reference image material. We leverage state-of-the-art relative MDE and a novel alignment procedure to estimate metric distances. We evaluate the approach on a zoo scenario dataset unseen during training. We achieve a mean absolute distance estimation error of only 0.9864 meters at a precision of 90.3% and recall of 63.8%, while completely eliminating the previously required manual effort for biodiversity researchers. The code will be made available.

dataset, detection, estimation, (16 more...)

arXiv.org Artificial Intelligence

2202.04613

Country:

Europe > United Kingdom > England > Hampshire (0.04)
Europe > Spain > Valencian Community > Alicante Province > Alicante (0.04)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Cologne (0.04)

Genre: Research Report (0.84)

Industry:

Information Technology (0.46)
Leisure & Entertainment > Games > Computer Games (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.67)
(2 more...)

Add feedback

Point-Voxel CNN for Efficient 3D Deep Learning

Liu, Zhijian, Tang, Haotian, Lin, Yujun, Han, Song

Neural Information Processing SystemsMar-18-2020, 20:46:12 GMT

Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is wasted on dealing with the sparse data which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN that represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to reduce the irregular, sparse data access and improve the locality.

deep learning, gpu memory reduction, point-voxel cnn, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

Add feedback