pointcloud





STITCH 2.0: Extending Augmented Suturing with EKF Needle Estimation and Thread Management

Hari, Kush, Chen, Ziyang, Kim, Hansoul, Goldberg, Ken

arXiv.org Artificial Intelligence

Abstract--Surgical suturing is a high-precision task that impacts patient healing and scarring. Suturing skill varies widely between surgeons, highlighting the need for robot assistance. Previous robot suturing works, such as STITCH 1.0 [1], struggle to fully close wounds due to inaccurate needle tracking and poor thread management. To address these challenges, we present STITCH 2.0, an elevated augmented dexterity pipeline with seven improvements including: improved EKF needle pose estimation, new thread untangling methods, and an automated 3D suture alignment algorithm. Experimental results over 15 trials find that STITCH 2.0 on average achieves 74.4% wound closure with 4.87 sutures per trial, representing 66% more sutures in 38% less time compared to the previous baseline. When two human interventions are allowed, STITCH 2.0 averages six sutures with a 100% wound closure rate.

Surgical robots have revolutionized minimally invasive surgery, with Intuitive Surgical's da Vinci system performing over 2.6 million procedures in 2024 [2]. While these procedures require complete human control, recent advances in artificial intelligence (AI) present opportunities for surgical robot autonomy. However, the high-risk nature of surgery raises safety concerns for fully autonomous AI systems.
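The EKF needle pose estimation the abstract highlights is easiest to picture in code. Below is a minimal sketch of a generic extended Kalman filter tracking a needle tip in 3D from monocular pixel detections; the pinhole intrinsics, noise covariances, and random-walk motion model are illustrative assumptions for this sketch, not the actual STITCH 2.0 filter.

```python
# Minimal EKF sketch for 3D needle-tip tracking from pixel detections.
# Intrinsics and noise levels are assumed values, not from STITCH 2.0.
import numpy as np

FX, FY, CX, CY = 800.0, 800.0, 320.0, 240.0  # assumed pinhole intrinsics

def h(x):
    """Project a 3D tip position (camera frame) to pixel coordinates."""
    X, Y, Z = x
    return np.array([FX * X / Z + CX, FY * Y / Z + CY])

def H_jac(x):
    """Jacobian of the pinhole projection with respect to the state."""
    X, Y, Z = x
    return np.array([[FX / Z, 0.0, -FX * X / Z**2],
                     [0.0, FY / Z, -FY * Y / Z**2]])

def ekf_step(x, P, z, Q=np.eye(3) * 1e-4, R=np.eye(2) * 4.0):
    # Predict: random-walk motion model (F = I), process noise Q.
    P = P + Q
    # Update: linearize the camera measurement around the prediction.
    y = z - h(x)                       # innovation (pixels)
    H = H_jac(x)
    S = H @ P @ H.T + R                # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)     # Kalman gain
    x = x + K @ y
    P = (np.eye(3) - K @ H) @ P
    return x, P

# Usage: start from a rough depth guess and refine with each detection.
x, P = np.array([0.0, 0.0, 0.3]), np.eye(3) * 1e-2
x, P = ekf_step(x, P, z=np.array([335.0, 248.0]))
```

The nonlinear projection is what makes this an EKF rather than a plain Kalman filter: the measurement Jacobian is re-linearized around each predicted pose.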


Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles

Veres-Vitályos, Álmos, Gómez-Raya, Genís Castillo, Lemic, Filip, Bugelnig, Daniel Johannes, Rinner, Bernhard, Abadal, Sergi, Costa-Pérez, Xavier

arXiv.org Artificial Intelligence

Small Unmanned Aerial Vehicles (UAVs) exhibit immense potential for navigating indoor and hard-to-reach areas, yet their significant constraints in payload and autonomy have largely prevented their use for complex tasks like high-quality 3-Dimensional (3D) reconstruction. To overcome this challenge, we introduce a novel system architecture that enables fully autonomous, high-fidelity 3D scanning of static objects using UAVs weighing under 100 grams. Our core innovation lies in a dual-reconstruction pipeline that creates a real-time feedback loop between data capture and flight control. A near-real-time (near-RT) process uses Structure from Motion (SfM) to generate an instantaneous pointcloud of the object. The system analyzes the model quality on the fly and dynamically adapts the UAV's trajectory to intelligently capture new images of poorly covered areas. This ensures comprehensive data acquisition. For the final, detailed output, a non-real-time (non-RT) pipeline employs a Neural Radiance Fields (NeRF)-based Neural 3D Reconstruction (N3DR) approach, fusing SfM-derived camera poses with precise Ultra Wide-Band (UWB) location data to achieve superior accuracy. We implemented and validated this architecture using Crazyflie 2.1 UAVs. Our experiments, conducted in both single- and multi-UAV configurations, conclusively show that dynamic trajectory adaptation consistently improves reconstruction quality over static flight paths. This work demonstrates a scalable and autonomous solution that unlocks the potential of miniaturized UAVs for fine-grained 3D reconstruction in constrained environments, a capability previously limited to much larger platforms.
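The near-RT feedback loop described above reduces to a simple idea: measure how well each viewing direction is covered by the current SfM pointcloud and send the UAV toward the weakest one. The sketch below illustrates that idea by binning points by azimuth around the object; the bin count, orbit radius, and altitude are assumed values, not parameters from the paper.

```python
# Illustrative sketch of coverage-driven trajectory adaptation: bin the
# current SfM pointcloud by azimuth around the object and steer the UAV
# toward the sparsest sector. All parameters here are assumptions.
import numpy as np

def next_waypoint(points, n_bins=12, radius=0.8, altitude=0.5):
    """points: (N, 3) SfM pointcloud of the object in the world frame."""
    center = points.mean(axis=0)
    d = points - center
    azimuth = np.arctan2(d[:, 1], d[:, 0])           # angle of each point
    bins = ((azimuth + np.pi) / (2 * np.pi) * n_bins).astype(int) % n_bins
    counts = np.bincount(bins, minlength=n_bins)     # points per sector
    worst = int(np.argmin(counts))                   # least-covered sector
    theta = -np.pi + (worst + 0.5) * 2 * np.pi / n_bins
    # Hover on a circle around the object, facing the sparse sector.
    return center + np.array([radius * np.cos(theta),
                              radius * np.sin(theta),
                              altitude])

cloud = np.random.randn(500, 3) * 0.1   # stand-in for a live SfM cloud
print(next_waypoint(cloud))
```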


R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation

Xu, Xiuwei, Ma, Angyuan, Li, Hankun, Yu, Bingyao, Zhu, Zheng, Zhou, Jie, Lu, Jiwen

arXiv.org Artificial Intelligence

Towards the aim of generalized robotic manipulation, spatial generalization is the most fundamental capability: the policy must work robustly under different spatial distributions of objects, the environment, and the agent itself. To achieve this, substantial human demonstrations need to be collected to cover different spatial configurations for training a generalized visuomotor policy via imitation learning. Prior works explore a promising direction that leverages data generation to acquire abundant spatially diverse data from minimal source demonstrations. However, most approaches face a significant sim-to-real gap and are often limited to constrained settings, such as fixed-base scenarios and predefined camera viewpoints. In this paper, we propose a real-to-real 3D data generation framework (R2RGen) that directly augments pointcloud observation-action pairs to generate real-world data. R2RGen is simulator- and rendering-free, and thus efficient and plug-and-play. Specifically, given a single source demonstration, we introduce an annotation mechanism for fine-grained parsing of the scene and trajectory. A group-wise augmentation strategy is proposed to handle complex multi-object compositions and diverse task constraints. We further present camera-aware processing to align the distribution of generated data with that of a real-world 3D sensor. Empirically, R2RGen substantially enhances data efficiency in extensive experiments and demonstrates strong potential for scaling and application to mobile manipulation.
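The core of any real-to-real pointcloud augmentation is that observation and action must be transformed together. The sketch below shows that consistency constraint in its simplest form: one rigid transform applied jointly to an object's segmented points and to the gripper poses that interact with it. The planar-yaw parameterization and the translation/rotation ranges are illustrative assumptions, not R2RGen's group-wise strategy.

```python
# Sketch of a real-to-real augmentation step: apply one rigid transform
# jointly to an object's points and its associated gripper waypoints,
# so the observation-action pair stays physically consistent.
# Ranges below are assumed for illustration.
import numpy as np

def random_planar_se3(max_trans=0.10, max_yaw=np.pi / 6):
    yaw = np.random.uniform(-max_yaw, max_yaw)
    c, s = np.cos(yaw), np.sin(yaw)
    T = np.eye(4)
    T[:3, :3] = np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])
    T[:2, 3] = np.random.uniform(-max_trans, max_trans, size=2)
    return T

def augment(points, ee_poses, T):
    """points: (N, 3); ee_poses: (K, 4, 4) gripper waypoints for this object."""
    pts_h = np.hstack([points, np.ones((len(points), 1))])
    new_points = (T @ pts_h.T).T[:, :3]
    new_poses = np.einsum('ij,kjl->kil', T, ee_poses)   # T @ each pose
    return new_points, new_poses

T = random_planar_se3()
new_pts, new_poses = augment(np.random.rand(100, 3),
                             np.tile(np.eye(4), (5, 1, 1)), T)
```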


First Plan Then Evaluate: Use a Vectorized Motion Planner for Grasping

Matak, Martin, Shanthi, Mohanraj Devendran, Van Wyk, Karl, Hermans, Tucker

arXiv.org Artificial Intelligence

Abstract-- Autonomous multi-finger grasping is a fundamental capability in robotic manipulation. Optimization-based approaches show strong performance but tend to be sensitive to initialization and are potentially time-consuming. As an alternative, the generator-evaluator-planner framework has been proposed: a generator generates grasp candidates, an evaluator ranks the proposed grasps, and a motion planner plans a trajectory to the highest-ranked grasp. If the planner does not find a trajectory, a new trajectory optimization is started with the next-best grasp as the target, and so on. However, executing lower-ranked grasps means a lower chance of grasp success, and multiple trajectory optimizations are time-consuming. Alternatively, relaxing the threshold for motion planning accuracy allows for easier computation of a successful trajectory but implies lower accuracy in estimating grasp success likelihood. It is a lose-lose proposition: either spend more time finding a successful trajectory or have a worse estimate of grasp success. We propose a framework that plans trajectories to a set of generated grasp targets in parallel; the evaluator estimates the grasp success likelihood of the resulting trajectories, and the robot executes the trajectory most likely to succeed. To plan trajectories to different targets efficiently, we propose the use of a vectorized motion planner. Our experiments show our approach improves over the traditional generator-evaluator-planner framework across different objects, generators, and motion planners, and successfully generalizes to novel environments in the real world, including different shelves and table heights.
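The key inversion in this framework is that the evaluator scores trajectories rather than grasps, after one batched planning call. The sketch below captures that control flow; the linear-interpolation "planner" and random evaluator are stand-ins for a real vectorized motion planner and a learned grasp-success model, and all shapes are assumed.

```python
# Sketch of the plan-then-evaluate loop: plan to all K grasp candidates
# in one batched call, score the resulting trajectories, execute the best.
import numpy as np

def vectorized_plan(q_start, q_goals, n_steps=50):
    """q_start: (D,); q_goals: (K, D) -> trajectories of shape (K, n_steps, D).
    Placeholder planner: straight-line interpolation in joint space."""
    alphas = np.linspace(0.0, 1.0, n_steps)[None, :, None]   # (1, T, 1)
    return q_start[None, None, :] * (1 - alphas) + q_goals[:, None, :] * alphas

def evaluator(trajectories):
    """Stand-in for a learned model scoring grasp success per trajectory."""
    return np.random.rand(len(trajectories))

q_start = np.zeros(7)                      # 7-DoF arm configuration
q_goals = np.random.rand(32, 7)            # 32 generated grasp candidates
trajs = vectorized_plan(q_start, q_goals)  # one batched planning call
scores = evaluator(trajs)                  # rank trajectories, not grasps
best = trajs[np.argmax(scores)]            # execute the most promising one
```

Because every candidate already has a feasible trajectory when it is scored, the sequential fall-through to lower-ranked grasps disappears.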


M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments

Yudin, Dmitry

arXiv.org Artificial Intelligence

3D mapping in dynamic environments poses a challenge for modern researchers in robotics and autonomous transportation. There are no universal representations for dynamic 3D scenes that incorporate multimodal data such as images, point clouds, and text. This article takes a step toward solving this problem. It proposes a taxonomy of methods for constructing multimodal 3D maps, classifying contemporary approaches based on scene types and representations, learning methods, and practical applications. Using this taxonomy, a brief structured analysis of recent methods is provided. The article also describes an original modular method called M3DMap, designed for object-aware construction of multimodal 3D maps for both static and dynamic scenes. It consists of several interconnected components: a neural multimodal object segmentation and tracking module; an odometry estimation module, including trainable algorithms; a module for 3D map construction and updating with various implementations depending on the desired scene representation; and a multimodal data retrieval module. The article highlights original implementations of these modules and their advantages in solving various practical tasks, from 3D object grounding to mobile manipulation. Additionally, it presents theoretical propositions demonstrating the positive effect of using multimodal data and modern foundational models in 3D mapping methods. Details of the taxonomy and method implementation are available at https://yuddim.github.io/M3DMap.
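To make the modular structure concrete, here is a rough sketch of how the four M3DMap components could compose in code. The interface names and signatures are guesses for illustration only, not the released API (see the project page for the actual implementation).

```python
# Hypothetical interfaces for M3DMap's four modules and one mapping step.
# Names and signatures are invented for illustration.
from typing import List, Protocol
import numpy as np

class SegmenterTracker(Protocol):
    def __call__(self, image: np.ndarray, cloud: np.ndarray) -> List[dict]:
        """Return multimodal object observations with persistent track IDs."""

class Odometry(Protocol):
    def step(self, image: np.ndarray, cloud: np.ndarray) -> np.ndarray:
        """Return the current 4x4 sensor pose."""

class MapBuilder(Protocol):
    def update(self, pose: np.ndarray, objects: List[dict]) -> None:
        """Fuse tracked objects into the chosen scene representation."""

class Retriever(Protocol):
    def query(self, text: str) -> List[dict]:
        """Ground a text query against the object-aware map."""

def mapping_step(seg: SegmenterTracker, odom: Odometry,
                 builder: MapBuilder, image, cloud) -> None:
    # One tick of the pipeline: localize, segment/track, fuse.
    pose = odom.step(image, cloud)
    builder.update(pose, seg(image, cloud))
```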


Pixels-to-Graph: Real-time Integration of Building Information Models and Scene Graphs for Semantic-Geometric Human-Robot Understanding

Longo, Antonello, Chung, Chanyoung, Palieri, Matteo, Kim, Sung-Kyun, Agha, Ali, Guaragnella, Cataldo, Khattak, Shehryar

arXiv.org Artificial Intelligence

Abstract-- Autonomous robots are increasingly playing key roles as support platforms for human operators in high-risk, dangerous applications. To accomplish challenging tasks, efficient human-robot cooperation and understanding are required. While robotic planning typically leverages 3D geometric information, human operators are accustomed to a high-level compact representation of the environment, such as top-down 2D maps representing the Building Information Model (BIM). In this work, we introduce Pixels-to-Graph (Pix2G), a novel lightweight method to generate structured scene graphs from image pixels and LiDAR maps in real-time for the autonomous exploration of unknown environments on resource-constrained robot platforms. To satisfy onboard compute constraints, the framework is designed to perform all operations on CPU only. The method outputs a de-noised 2D top-down environment map and a structure-segmented 3D pointcloud, which are seamlessly connected using a multi-layer graph abstracting information from the object level up to the building level. The proposed method is quantitatively and qualitatively evaluated during real-world experiments performed using the NASA JPL NeBula-Spot legged robot to autonomously explore and map cluttered garage and urban office-like environments in real-time.

I. INTRODUCTION
Autonomous mobile robots are increasingly utilized for augmenting human actions in everyday operations. Given their maturing abilities to robustly carry out complex tasks in dynamic and challenging environments, they are especially being deployed in dirty and dangerous applications where the risk to human lives is high. Nevertheless, in applications like infrastructure inspection and disaster response, robotic autonomy still needs human operator support for carrying out the complex decision-making process. The decision-making process is typically guided by the situational awareness provided by the robot and transmitted to human operators: detailed and time-critical situational awareness provision leads to more accurate and efficient mission strategies.
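The multi-layer graph the abstract describes is a standard hierarchical scene-graph structure. The sketch below shows the object-to-room-to-building abstraction using networkx; the node names, layers, and attributes are invented for illustration and are not Pix2G's actual data structures.

```python
# Sketch of a multi-layer scene graph: one graph whose "layer" attribute
# spans object, room, and building levels. Node names are invented.
import networkx as nx

G = nx.Graph()
G.add_node("building", layer="building")
for room in ["garage", "office_1"]:
    G.add_node(room, layer="room")
    G.add_edge("building", room)          # building -> room abstraction
objects = {"garage": ["vehicle", "toolbox"], "office_1": ["desk", "chair"]}
for room, objs in objects.items():
    for obj in objs:
        G.add_node(obj, layer="object", centroid=None)  # filled from the pointcloud
        G.add_edge(room, obj)             # room -> object containment

# Query: which room contains the toolbox?
room = next(n for n in G.neighbors("toolbox")
            if G.nodes[n]["layer"] == "room")
```

A structure like this is what lets an operator reason at the BIM-like room level while the robot plans against the underlying 3D geometry.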


Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications

Gómez-Raya, Genís Castillo, Veres-Vitályos, Álmos, Lemic, Filip, Royo, Pablo, Montagud, Mario, Fernández, Sergi, Abadal, Sergi, Costa-Pérez, Xavier

arXiv.org Artificial Intelligence

The increasing miniaturization of Unmanned Aerial Vehicles (UAVs) has expanded their deployment potential to indoor and hard-to-reach areas. However, this trend introduces distinct challenges, particularly in terms of flight dynamics and power consumption, which limit the UAVs' autonomy and mission capabilities. This paper presents a novel approach to overcoming these limitations by integrating Neural 3D Reconstruction (N3DR) with small UAV systems for fine-grained 3-Dimensional (3D) digital reconstruction of small static objects. Specifically, we design, implement, and evaluate an N3DR-based pipeline that leverages advanced models, i.e., Instant-ngp, Nerfacto, and Splatfacto, to improve the quality of 3D reconstructions using images of the object captured by a fleet of small UAVs. We assess the performance of the considered models using various imagery and pointcloud metrics, comparing them against the baseline Structure from Motion (SfM) algorithm. The experimental results demonstrate that the N3DR-enhanced pipeline significantly improves reconstruction quality, making it feasible for small UAVs to support high-precision 3D mapping and anomaly detection in constrained environments. In more general terms, our results highlight the potential of N3DR in advancing the capabilities of miniaturized UAV systems.
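For readers unfamiliar with the metric families mentioned above, the sketch below shows two representatives of the kind the paper uses: symmetric Chamfer distance for pointcloud quality and PSNR for rendered imagery. This is a brute-force illustration, not the paper's evaluation code, and the data below is random stand-in input.

```python
# Two common reconstruction metrics: symmetric Chamfer distance for
# pointclouds and PSNR for images. Brute-force nearest neighbors for brevity.
import numpy as np

def chamfer(a, b):
    """a: (N, 3), b: (M, 3) pointclouds -> symmetric Chamfer distance."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)  # (N, M)
    return d.min(axis=1).mean() + d.min(axis=0).mean()

def psnr(img, ref, max_val=1.0):
    """Peak signal-to-noise ratio between a rendered and reference image."""
    mse = np.mean((img - ref) ** 2)
    return 10 * np.log10(max_val ** 2 / mse)

recon = np.random.rand(1000, 3)                  # stand-in reconstructed cloud
gt = recon + np.random.randn(1000, 3) * 0.01     # stand-in reference cloud
print(chamfer(recon, gt))
print(psnr(np.random.rand(64, 64, 3), np.random.rand(64, 64, 3)))
```

Lower Chamfer distance and higher PSNR both indicate a reconstruction closer to the reference, which is how an N3DR pipeline can be compared against an SfM baseline on the same captures.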