AITopics | sparse convolution

A Training Objectives Our model is trained from scratch with the semantic loss L

Neural Information Processing Systems | Feb-15-2026, 12:19:54 GMT

The computational overhead of CluB is 1.2 / 1.3 times that of the BEV-only A detailed comparison is shown in the following table. GPUs and the batch size per GPU is set as 2. Table 2: Ablation study on the effect of the two kinds of object queries for the transformer decoder. Red boxes and green boxes are the predictions and ground-truth, respectively. Transfusion: Robust lidar-camera fusion for 3d object detection with transformers. Fully sparse 3d object detection.

artificial intelligence, detection, machine learning, (14 more...)

Genre: Instructional Material (0.41)

Technology:

Information Technology > Artificial Intelligence > Vision (0.96)
Information Technology > Artificial Intelligence > Machine Learning (0.67)

Accelerating Sparse Convolutions in Voxel-Based Point Cloud Networks

Adamopoulos, Dionysios, Poulopoulou, Anastasia, Goumas, Georgios, Giannoula, Christina

arXiv.org Artificial Intelligence | Nov-27-2025

Sparse Convolution (SpC) powers 3D point cloud networks widely used in autonomous driving and AR/VR. SpC builds a kernel map that stores mappings between input voxel coordinates, output coordinates, and weight offsets, then uses this map to compute feature vectors for output coordinates. Our work identifies three key properties of voxel coordinates: they are integer-valued, bounded within a limited spatial range, and geometrically continuous, meaning that neighboring voxels on the same object surface are highly likely to exist at small spatial offsets from each other. Prior SpC engines do not fully exploit these properties and suffer from high pre-processing and post-processing overheads during kernel map construction. To address this, we design Spira, the first voxel-property-aware SpC engine for GPUs. Spira proposes: (i) a high-performance one-shot search algorithm that builds the kernel map with no preprocessing and high memory locality, (ii) an effective packed-native processing scheme that accesses packed voxel coordinates at low cost, (iii) a flexible dual-dataflow execution mechanism that efficiently computes output feature vectors by adapting to layer characteristics, and (iv) a network-wide parallelization strategy that builds kernel maps for all SpC layers concurrently at network start. Our evaluation shows that Spira outperforms prior SpC engines by 1.71x on average and up to 2.31x for end-to-end inference, and by 2.13x on average and up to 3.32x for layer-wise execution across diverse layer configurations.
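To make the kernel map described in the abstract concrete, here is a minimal CPU sketch in plain Python. It is not Spira's one-shot search or any engine's actual implementation: it uses a naive hash-table lookup, assumes a stride-1 convolution with scalar features, and assumes coordinates fit in 10 bits per axis. The names `BITS`, `pack`, `build_kernel_map`, and `sparse_conv` are hypothetical and chosen only for illustration.

```python
from itertools import product

BITS = 10  # assumption: each coordinate lies in [0, 2**BITS)


def pack(x, y, z):
    """Pack three bounded, non-negative integer coordinates into one integer key."""
    return (x << (2 * BITS)) | (y << BITS) | z


def build_kernel_map(in_coords, out_coords, kernel_size=3):
    """Return {offset_index: [(in_idx, out_idx), ...]} for a stride-1 convolution."""
    # Hash table from packed input coordinate to its row index.
    lookup = {pack(*c): i for i, c in enumerate(in_coords)}
    r = kernel_size // 2
    offsets = list(product(range(-r, r + 1), repeat=3))
    limit = 1 << BITS
    kernel_map = {}
    for k, (dx, dy, dz) in enumerate(offsets):
        pairs = []
        for j, (x, y, z) in enumerate(out_coords):
            nx, ny, nz = x + dx, y + dy, z + dz
            # Skip neighbors outside the bounded range so packing stays valid.
            if not (0 <= nx < limit and 0 <= ny < limit and 0 <= nz < limit):
                continue
            i = lookup.get(pack(nx, ny, nz))
            if i is not None:
                pairs.append((i, j))
        if pairs:
            kernel_map[k] = pairs
    return kernel_map


def sparse_conv(in_feats, weights, kernel_map, num_out):
    """Scalar-feature convolution: out[j] += weights[k] * in_feats[i]."""
    out = [0.0] * num_out
    for k, pairs in kernel_map.items():
        for i, j in pairs:
            out[j] += weights[k] * in_feats[i]
    return out


# Outputs share the input coordinates here (an assumption for the demo).
coords = [(0, 0, 0), (1, 0, 0), (5, 5, 5)]
kmap = build_kernel_map(coords, coords)
result = sparse_conv([1.0, 2.0, 3.0], [1.0] * 27, kmap, len(coords))
```

Because the coordinates are integer-valued and bounded, each voxel can be reduced to a single integer key, which is what makes the lookup cheap. The two adjacent voxels contribute to each other's outputs, while the isolated voxel at (5, 5, 5) only sees itself.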

artificial intelligence, dataflow, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2511.20834

Genre: Research Report (0.40)

Industry: Information Technology > Robotics & Automation (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.66)

One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection

Neural Information Processing Systems | Nov-19-2025, 09:37:09 GMT

To address this issue, multi-domain joint training (i.e., multi-dataset joint training) should be introduced into point cloud based 3D object detection, to allow 3D detectors to learn from point clouds of

artificial intelligence, machine learning, natural language, (18 more...)

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Services (0.61)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.67)

SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction

Wu, Junfeng, Benmeziane, Hadjer, Maghraoui, Kaoutar El, Liu, Liu, Wang, Yinan

arXiv.org Artificial Intelligence | Nov-19-2025

Spatiotemporal data mining (STDM) has a wide range of applications in various complex physical systems (CPS), e.g., transportation, manufacturing, and healthcare. Among all the proposed methods, the Convolutional Long Short-Term Memory (ConvLSTM) has proved to be generalizable and extendable across applications, and it has multiple variants that achieve state-of-the-art performance in various STDM applications. However, ConvLSTM and its variants are computationally expensive, which makes them impractical on edge devices with limited computational resources. With the emerging need for edge computing in CPS, efficient AI is essential to reduce the computational cost while preserving model performance. Common efficient-AI methods are developed to reduce redundancy in model capacity (e.g., model pruning and compression). However, spatiotemporal data mining naturally requires extensive model capacity, as the embedded dependencies in spatiotemporal data are complex and hard to capture, which limits the model redundancy. Instead, there is a fairly high level of data and feature redundancy that introduces an unnecessary computational burden, which has been largely overlooked in existing research. Therefore, we developed a novel framework, SparseST, that pioneers the exploitation of data sparsity to develop an efficient spatiotemporal model. In addition, we explore and approximate the Pareto front between model performance and computational efficiency by designing a multi-objective composite loss function, which provides a practical guide for practitioners to adjust the model according to computational resource constraints and the performance requirements of downstream tasks.

data mining, machine learning, sparse convolution, (22 more...)

arXiv.org Artificial Intelligence

2511.14753

Country:

North America > United States > New York > Rensselaer County > Troy (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France (0.04)

Genre:

Research Report (0.64)
Workflow (0.46)
Overview (0.46)

Industry: Transportation (0.48)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(2 more...)

c383e44d9a878d1982d9abb838bd5d8a-Paper-Conference.pdf

Neural Information Processing Systems | Nov-16-2025, 01:11:44 GMT

artificial intelligence, machine learning, sparsity, (17 more...)

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

c1aaf7c3f306fe94f77236dc0756d771-Supplemental-Conference.pdf

Neural Information Processing Systems | Nov-16-2025, 00:34:23 GMT

We first slightly enlarge the proposals by 0.3 m, then randomly take out 128 voxels

artificial intelligence, machine learning, proposal, (17 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds

Neural Information Processing Systems | Nov-16-2025, 00:34:19 GMT

We present a novel two-stage fully sparse convolutional 3D object detection framework, named CAGroup3D.

artificial intelligence, detection, machine learning, (16 more...)

Country: Asia > China > Beijing > Beijing (0.04)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

Neural Information Processing Systems | Nov-13-2025, 19:50:41 GMT

Code and models are available at this link.

artificial intelligence, convolution, machine learning, (14 more...)

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > Canada > Quebec > Montreal (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.70)

SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation

Hao, Yeh Keng, Wei, Hsu Tzu, Min, Sun

arXiv.org Artificial Intelligence | Oct-31-2025

With the increasing ubiquity of AR/VR devices, the deployment of deep learning models on edge devices has become a critical challenge. These devices require real-time inference, low power consumption, and minimal latency. Many framework designers face the conundrum of balancing efficiency and performance. We design a lightweight framework that adopts an encoder-decoder architecture and introduces several key contributions aimed at improving both efficiency and accuracy. We apply sparse convolution on a ResNet-18 backbone to exploit the inherent sparsity in hand pose images, achieving a 42% end-to-end efficiency improvement. Moreover, we propose our SPLite decoder. This new architecture boosts the decoding process's frame rate by 3.1x on the Raspberry Pi 5 while maintaining comparable accuracy. To further optimize performance, we apply quantization-aware training, reducing memory usage while preserving accuracy (PA-MPJPE increases only marginally from 9.0 mm to 9.1 mm on FreiHAND). Overall, our system achieves a 2.98x speed-up on a Raspberry Pi 5 CPU (BCM2712 quad-core Arm A76 processor). Our method is also evaluated on compound benchmark datasets, demonstrating comparable accuracy to state-of-the-art approaches while significantly enhancing computational efficiency.

artificial intelligence, convolution, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2510.16396

Country: