AITopics | Xue, Bohuan


Xue, Bohuan


Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization

Cheng, Jintao, Xue, Bohuan, Chen, Shiyang, Xiang, Qiuchi, Tang, Xiaoyu

arXiv.org Artificial Intelligence | Mar-29-2025

Currently, visual odometry and LIDAR odometry perform well in pose estimation in some typical environments, but they still cannot recover the localization state at high speed or reduce accumulated drift. To solve these problems, we propose a novel LIDAR-based localization framework that achieves high accuracy and provides robust localization in 3D pointcloud maps using information from multiple sensors. To improve robustness and enable fast resumption of localization, this paper uses offline pointcloud maps as prior knowledge and presents a novel registration method to speed up convergence. The algorithm is tested on various maps from different data sets and has higher robustness and accuracy than other localization algorithms. Accurate localization is a crucial component of autonomous driving [1], [2]. Besides integrated navigation-based solutions, the main approaches include LIDAR-based localization [8]-[10] and vision-based localization [11]-[13].

artificial intelligence, localization, odometry, (15 more...)

arXiv.org Artificial Intelligence

2503.23199

Genre: Research Report (0.50)

Industry: Information Technology > Robotics & Automation (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Vision (0.66)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.34)


MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation

Luo, Jiehao, Cheng, Jintao, Tang, Xiaoyu, Zhang, Qingwen, Xue, Bohuan, Fan, Rui

arXiv.org Artificial Intelligence | Feb-24-2025

Scene flow estimation aims to predict 3D motion from consecutive point cloud frames, which is of great interest in the autonomous driving field. Existing methods face challenges such as insufficient spatio-temporal modeling and an inherent loss of fine-grained features during voxelization. However, the success of Mamba, a representative state space model (SSM) that enables global modeling with linear complexity, provides a promising solution. In this paper, we propose MambaFlow, a novel scene flow estimation network with a Mamba-based decoder. It enables deep interaction and coupling of spatio-temporal features using a well-designed backbone. We steer the global attention modeling of voxel-based features with point offset information using an efficient Mamba-based decoder, learning voxel-to-point patterns that are used to devoxelize shared voxel representations into point-wise features. To further enhance the model's generalization capabilities across diverse scenarios, we propose a novel scene-adaptive loss function that automatically adapts to different motion patterns. Extensive experiments on the Argoverse 2 benchmark demonstrate that MambaFlow achieves state-of-the-art performance with real-time inference speed among existing works, enabling accurate flow estimation in real-world urban scenarios. The code is available at https://github.com/SCNU-RISLAB/MambaFlow.

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2502.16907

Country:

Asia > China (0.96)
North America > United States > California (0.14)

Genre: Research Report (1.00)

Industry: Information Technology (0.49)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.35)


Three-Filters-to-Normal+: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation

Yang, Jingwei, Xue, Bohuan, Feng, Yi, Wang, Deming, Fan, Rui, Chen, Qijun

arXiv.org Artificial Intelligence | Dec-13-2023

This article introduces three-filters-to-normal+ (3F2N+), an extension of our previous work three-filters-to-normal (3F2N), with a specific focus on incorporating discontinuity discrimination capability into surface normal estimators (SNEs). 3F2N+ achieves this capability by utilizing a novel discontinuity discrimination module (DDM), which combines depth curvature minimization and correlation coefficient maximization through conditional random fields (CRFs). To evaluate the robustness of SNEs on noisy data, we create a large-scale synthetic surface normal (SSN) dataset containing 20 scenarios (ten indoor scenarios and ten outdoor scenarios with and without random Gaussian noise added to depth images). Extensive experiments demonstrate that 3F2N+ achieves greater performance than all other geometry-based surface normal estimators, with average angular errors of 7.85$^\circ$, 8.95$^\circ$, 9.25$^\circ$, and 11.98$^\circ$ on the clean-indoor, clean-outdoor, noisy-indoor, and noisy-outdoor datasets, respectively. We conduct three additional experiments to demonstrate the effectiveness of incorporating our proposed 3F2N+ into downstream robot perception tasks, including freespace detection, 6D object pose estimation, and point cloud completion. Our source code and datasets are publicly available at https://mias.group/3F2Nplus.
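The average angular errors quoted above are the usual way of scoring a surface normal estimator. As an illustration only, the sketch below computes the mean angle in degrees between estimated and reference normals. The function name `mean_angular_error_deg` is hypothetical, and the abstract does not say how the evaluation treats invalid pixels or sign ambiguity, so neither is handled here.

```python
import math


def mean_angular_error_deg(estimated, reference):
    """Mean angle in degrees between paired 3D normal vectors.

    Both arguments are equal-length sequences of (x, y, z) tuples.
    Vectors need not be unit length; zero vectors are not supported.
    """
    if len(estimated) != len(reference) or not estimated:
        raise ValueError("inputs must be non-empty and of equal length")

    total = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(estimated, reference):
        dot = ax * bx + ay * by + az * bz
        norm_a = math.sqrt(ax * ax + ay * ay + az * az)
        norm_b = math.sqrt(bx * bx + by * by + bz * bz)
        # Clamp to [-1, 1] so rounding error cannot break acos.
        cosine = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
        total += math.degrees(math.acos(cosine))
    return total / len(estimated)
```

A lower value means the estimated normals are, on average, closer in direction to the reference normals.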

artificial intelligence, machine learning, surface normal, (18 more...)

arXiv.org Artificial Intelligence

2312.07964

Country: Asia > China (0.29)

Genre:

Research Report (1.00)
Overview (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.88)


D2NT: A High-Performing Depth-to-Normal Translator

Feng, Yi, Xue, Bohuan, Liu, Ming, Chen, Qijun, Fan, Rui

arXiv.org Artificial Intelligence | Apr-24-2023

Surface normals hold significant importance in visual environmental perception, serving as a source of rich geometric information. However, state-of-the-art (SoTA) surface normal estimators (SNEs) generally suffer from an unsatisfactory trade-off between efficiency and accuracy. To resolve this dilemma, this paper first presents a superfast depth-to-normal translator (D2NT), which can directly translate depth images into surface normal maps without calculating 3D coordinates. We then propose a discontinuity-aware gradient (DAG) filter, which adaptively generates gradient convolution kernels to improve depth gradient estimation. Finally, we propose a surface normal refinement module that can easily be integrated into any depth-to-normal SNE, substantially improving the surface normal estimation accuracy. Our proposed algorithm demonstrates the best accuracy among existing real-time SNEs and achieves the SoTA trade-off between efficiency and accuracy.
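To make the idea of translating depth directly into normals concrete, here is a minimal sketch, not the authors' implementation. It assumes a pinhole camera with focal lengths `fx`, `fy` and principal point `(u0, v0)`, and uses the relation n ∝ (-fx·Z_u, -fy·Z_v, Z + (u - u0)·Z_u + (v - v0)·Z_v), which follows from differentiating the pinhole back-projection. Plain central differences stand in for the DAG filter, the refinement module is omitted, and the sign convention of the normal is an arbitrary choice. The function name `depth_to_normals` is hypothetical.

```python
import math


def depth_to_normals(depth, fx, fy, u0, v0):
    """Unit surface normals from a depth image, without forming 3D points.

    depth is a list of rows of positive depth values. Border pixels are
    left as None because central differences are undefined there.
    """
    height, width = len(depth), len(depth[0])
    normals = [[None] * width for _ in range(height)]
    for v in range(1, height - 1):
        for u in range(1, width - 1):
            z = depth[v][u]
            # Central-difference depth gradients (stand-in for a DAG filter).
            z_u = (depth[v][u + 1] - depth[v][u - 1]) / 2.0
            z_v = (depth[v + 1][u] - depth[v - 1][u]) / 2.0
            nx = -fx * z_u
            ny = -fy * z_v
            nz = z + (u - u0) * z_u + (v - v0) * z_v
            length = math.sqrt(nx * nx + ny * ny + nz * nz)
            normals[v][u] = (nx / length, ny / length, nz / length)
    return normals
```

Because each normal depends only on the depth value and its two image-space gradients, the per-pixel cost is a handful of multiplications, which is consistent with the efficiency argument in the abstract.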

accuracy, artificial intelligence, surface normal, (15 more...)

arXiv.org Artificial Intelligence

2304.12031

Country:

Asia > China (0.48)
North America > United States > California (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Vision (0.69)


Monocular Camera Mapping with Pose-Guided Optimization: Enhancing Marking-Level HD Map Accuracy

Liu, Hongji, Zheng, Linwei, Yan, Xiaoyang, Xu, Zhenhua, Xue, Bohuan, Yu, Yang, Liu, Ming

arXiv.org Artificial Intelligence | Mar-7-2023

Marking-level high-definition maps (HD maps) are of great significance for autonomous vehicles (AVs), especially in large-scale, appearance-changing scenarios where AVs rely on markings for localization and lanes for safe driving. In this paper, we propose a pose-guided optimization framework for automatically building a marking-level HD map with accurate marking positions using a simple sensor setup (one or more monocular cameras). We optimize the positions of the marking corners to fit the result of marking segmentation and simultaneously optimize the inverse perspective mapping (IPM) matrix of the corresponding camera to obtain an accurate transformation from the front-view image to the bird's-eye view (BEV). In the quantitative evaluation, the built HD map almost attains centimeter-level accuracy. The accuracy of the optimized IPM matrix is similar to that of manual calibration. The method can also be generalized to build HD maps in a broader sense by increasing the types of recognizable markings. The supplementary materials and videos are available at http://liuhongji.site/V2HDM-Mono/.
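As a small illustration of what an IPM matrix does, the sketch below treats it as a 3x3 planar homography that maps front-view pixel coordinates to BEV coordinates. This is an assumption that holds for a flat ground plane; the abstract does not give the paper's exact parameterization. The function name `apply_homography` and the matrix values in the usage example are hypothetical, and the optimization of the matrix itself is not shown.

```python
def apply_homography(H, points):
    """Map (u, v) pixel coordinates through a 3x3 homography.

    H is a row-major nested list; points is an iterable of (u, v) pairs.
    Returns a list of (x, y) pairs after the perspective divide.
    """
    mapped = []
    for u, v in points:
        x = H[0][0] * u + H[0][1] * v + H[0][2]
        y = H[1][0] * u + H[1][1] * v + H[1][2]
        w = H[2][0] * u + H[2][1] * v + H[2][2]
        if abs(w) < 1e-12:
            # Points on the horizon line have no finite ground position.
            raise ValueError("point maps to infinity")
        mapped.append((x / w, y / w))
    return mapped


# Usage with a made-up matrix: marking corners in the image are
# projected to BEV, where they can be compared with map positions.
example_H = [[2.0, 0.0, 1.0], [0.0, 2.0, 3.0], [0.0, 0.0, 2.0]]
bev_corners = apply_homography(example_H, [(4.0, 6.0)])
```

In a setting like the one described, the entries of the matrix would be the quantities adjusted during optimization so that projected marking corners agree with the segmentation result.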

artificial intelligence, machine learning, matrix, (14 more...)

arXiv.org Artificial Intelligence

2209.07737

Country: Asia > China > Guangdong Province (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.89)
