Li, Chengyang
Autonomous Driving in Unstructured Environments: How Far Have We Come?
Min, Chen, Si, Shubin, Wang, Xu, Xue, Hanzhang, Jiang, Weizhong, Liu, Yang, Wang, Juan, Zhu, Qingtian, Zhu, Qi, Luo, Lun, Kong, Fanjie, Miao, Jinyu, Cai, Xudong, An, Shuai, Li, Wei, Mei, Jilin, Sun, Tong, Zhai, Heng, Liu, Qifeng, Zhao, Fangzhou, Chen, Liang, Wang, Shuai, Shang, Erke, Shang, Linzhi, Zhao, Kunlong, Li, Fuyang, Fu, Hao, Jin, Lei, Zhao, Jian, Mao, Fangyuan, Xiao, Zhipeng, Li, Chengyang, Dai, Bin, Zhao, Dawei, Xiao, Liang, Nie, Yiming, Hu, Yu, Li, Xuelong
Research on autonomous driving in unstructured outdoor environments is less advanced than in structured urban settings due to challenges such as environmental diversity and scene complexity. These environments, such as rural areas and rugged terrain, pose unique obstacles that are uncommon in structured urban areas. Despite these difficulties, autonomous driving in unstructured outdoor environments is crucial for applications in agriculture, mining, and military operations. Our survey reviews over 250 papers on autonomous driving in unstructured outdoor environments, covering offline mapping, pose estimation, environmental perception, path planning, end-to-end autonomous driving, datasets, and relevant challenges. We also discuss emerging trends and future research directions. This review aims to consolidate knowledge and encourage further research on autonomous driving in unstructured environments. To support ongoing work, we maintain an active repository with up-to-date literature and open-source projects at: https://github.com/chaytonmin/Survey-Autonomous-Driving-in-Unstructured-Environments.
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Zhuang, Yixiang, Cheng, Baoping, Cheng, Yao, Jin, Yuntao, Liu, Renshuai, Li, Chengyang, Cheng, Xuan, Liao, Jing, Lin, Juncong
Speech-driven facial animation methods generally fall into two main classes, 3D and 2D talking face, both of which have attracted considerable research attention in recent years. However, to the best of our knowledge, research on 3D talking faces has not gone as deep as that on 2D talking faces in terms of lip-synchronization (lip-sync) and speech perception. To bridge the gap between the two sub-fields, we propose a learning framework named Learn2Talk, which constructs a better 3D talking face network by exploiting two points of expertise from the field of 2D talking face. First, inspired by the audio-video sync network, a 3D lip-sync expert model is devised to pursue lip-sync between audio and 3D facial motion. Second, a teacher model selected from 2D talking face methods is used to guide the training of the audio-to-3D-motion regression network to yield higher 3D vertex accuracy. Extensive experiments show the advantages of the proposed framework in terms of lip-sync, vertex accuracy and speech perception, compared with state-of-the-art methods. Finally, we show two applications of the proposed framework: audio-visual speech recognition and speech-driven 3D Gaussian Splatting based avatar animation.
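A minimal sketch of how the two ideas named in the abstract (a lip-sync expert and 2D-teacher guidance) could enter a training loss. This is not the authors' code; the interfaces of Audio2Motion-style modules, sync_expert, and teacher_vertices are hypothetical, and the loss weights are placeholders.

```python
# Illustrative training loss: vertex regression + lip-sync expert term +
# distillation toward a 2D teacher's (lifted) vertex predictions.
import torch
import torch.nn.functional as F

def training_loss(model, sync_expert, audio, gt_vertices, teacher_vertices,
                  w_sync=0.1, w_distill=0.5):
    pred_vertices = model(audio)                       # (T, V, 3) predicted 3D motion
    loss_rec = F.mse_loss(pred_vertices, gt_vertices)  # vertex regression term
    # SyncNet-style expert: embed audio and motion, reward agreement.
    audio_emb, motion_emb = sync_expert(audio, pred_vertices)
    loss_sync = 1.0 - F.cosine_similarity(audio_emb, motion_emb, dim=-1).mean()
    # Distillation: pull predictions toward the 2D teacher's outputs.
    loss_distill = F.mse_loss(pred_vertices, teacher_vertices)
    return loss_rec + w_sync * loss_sync + w_distill * loss_distill
```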
OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds
Liu, Bonan, Zhao, Guoyang, Jiao, Jianhao, Cai, Guang, Li, Chengyang, Yin, Handi, Wang, Yuyang, Liu, Ming, Hui, Pan
A colored point cloud, as a simple and efficient 3D representation, has many advantages in various fields, including robotic navigation and scene reconstruction. This representation is now commonly used in 3D reconstruction tasks relying on cameras and LiDARs. However, many existing frameworks fuse data from these two types of sensors poorly, leading to unsatisfactory mapping results, mainly due to inaccurate camera poses. This paper presents OmniColor, a novel and efficient algorithm to colorize point clouds using an independent 360-degree camera. Given a LiDAR-based point cloud and a sequence of panorama images with initial coarse camera poses, our objective is to jointly optimize the poses of all frames for mapping images onto geometric reconstructions. Our pipeline works off the shelf and does not require any feature extraction or matching process. Instead, we find optimal poses by directly maximizing the photometric consistency of LiDAR maps. In experiments, we show that our method can overcome the severe visual distortion of omnidirectional images and greatly benefits from the wide field of view (FOV) of 360-degree cameras to reconstruct various scenarios with accuracy and stability. The code will be released at https://github.com/liubonan123/OmniColor/.
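A toy sketch of the core objective named in the abstract: choosing poses that maximize photometric consistency is equivalent to minimizing, over all frames, the color variance each LiDAR point receives. The equirectangular projection and grayscale images are simplifying assumptions; the actual pipeline is more elaborate.

```python
# Photometric-consistency cost over a set of candidate panorama poses.
import numpy as np

def project_equirect(p_cam, width, height):
    """Project a 3D point (camera frame) onto an equirectangular panorama."""
    x, y, z = p_cam
    lon = np.arctan2(x, z)                                  # azimuth
    lat = np.arcsin(y / (np.linalg.norm(p_cam) + 1e-9))     # elevation
    u = (lon / (2 * np.pi) + 0.5) * width
    v = (lat / np.pi + 0.5) * height
    return int(u) % width, min(max(int(v), 0), height - 1)

def photometric_cost(poses, points, images):
    """poses: list of (R, t); points: (N, 3) LiDAR map; images: grayscale panoramas."""
    cost = 0.0
    for p in points:
        samples = []
        for (R, t), img in zip(poses, images):
            p_cam = R @ p + t
            u, v = project_equirect(p_cam, img.shape[1], img.shape[0])
            samples.append(img[v, u])
        cost += np.var(samples)   # consistent colors across frames -> low variance
    return cost
```

An optimizer would perturb the pose parameters to drive this cost down, with no feature extraction or matching required.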
NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning
Han, Ruihua, Wang, Shuai, Wang, Shuaijun, Zhang, Zeqing, Chen, Jianjun, Lin, Shijie, Li, Chengyang, Xu, Chengzhong, Eldar, Yonina C., Hao, Qi, Pan, Jia
Navigating a nonholonomic robot in a cluttered environment requires extremely accurate perception and locomotion for collision avoidance. This paper presents NeuPAN: a real-time, highly accurate, map-free, robot-agnostic, and environment-invariant robot navigation solution. Leveraging a tightly coupled perception-locomotion framework, NeuPAN has two key innovations compared to existing approaches: 1) it directly maps raw points to a learned multi-frame distance space, avoiding error propagation from perception to control; 2) it is interpretable from an end-to-end model-based learning perspective, enabling provable convergence. The crux of NeuPAN is to solve a high-dimensional end-to-end mathematical model with various point-level constraints using the plug-and-play (PnP) proximal alternating-minimization network (PAN) with neurons in the loop. This allows NeuPAN to generate real-time, end-to-end, physically interpretable motions directly from point clouds, seamlessly integrating data and knowledge engines, with its network parameters adjusted via backpropagation. We evaluate NeuPAN on a car-like robot, a wheel-legged robot, and an autonomous passenger vehicle, in both simulated and real-world environments. Experiments demonstrate that NeuPAN outperforms various benchmarks in terms of accuracy, efficiency, robustness, and generalization capability across environments including a cluttered sandbox, an office, a corridor, and a parking lot. We show that NeuPAN works well in unstructured environments with undetectable objects of arbitrary shape, making impassable ways passable.
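A conceptual sketch of the alternating structure described above: a learned proximal step handles the point-level variables, and a model-based step updates the controls. The function interfaces (learned_prox, control_step) are hypothetical stand-ins, not NeuPAN's actual API.

```python
# Plug-and-play proximal alternating minimization with a neural step in the loop.
def pan_solve(points, u_init, learned_prox, control_step, n_iters=10):
    """points: (N, 2) raw LiDAR points; u_init: initial control sequence."""
    u = u_init
    for _ in range(n_iters):
        # Neural sub-problem: map raw points (given current controls) into the
        # learned multi-frame distance space (acts as a proximal operator).
        z = learned_prox(points, u)
        # Model-based sub-problem: minimize tracking + collision cost given z.
        u = control_step(u, z)
    return u
```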
Decentralized Planning for Car-Like Robotic Swarm in Cluttered Environments
Ma, Changjia, Han, Zhichao, Zhang, Tingrui, Wang, Jingping, Xu, Long, Li, Chengyang, Xu, Chao, Gao, Fei
Robot swarms are a hot topic in the robotics research community. In this paper, we propose a decentralized framework for a car-like robotic swarm that is capable of real-time planning in cluttered environments. In this system, path finding is guided by environmental topology information to avoid frequent topological changes, and search-based speed planning is leveraged to escape the local minima of infeasible initial values. Spatial-temporal optimization is then employed to generate a safe, smooth and dynamically feasible trajectory. During optimization, the trajectory is discretized by fixed time steps. A penalty is imposed on the signed distance between agents to realize collision avoidance, while differential flatness combined with a limit on the front steering angle satisfies the non-holonomic constraints. With trajectories broadcast over the wireless network, agents are able to check for and prevent potential collisions. We validate the robustness of our system in simulation and real-world experiments. Code will be released as open-source packages.
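A toy sketch of the inter-agent penalty idea mentioned in the abstract: trajectories are discretized at shared time steps, and any pairwise signed distance that drops below a safety margin incurs a smooth penalty. Agents are approximated as discs here purely for brevity; the actual method handles full car bodies.

```python
# Pairwise signed-distance penalty over discretized swarm trajectories.
import numpy as np

def swarm_collision_penalty(trajs, radius=0.5, margin=0.2, weight=100.0):
    """trajs: (A, T, 2) array of A agents' positions at T shared time steps."""
    penalty = 0.0
    A, T, _ = trajs.shape
    for t in range(T):
        for i in range(A):
            for j in range(i + 1, A):
                # Signed distance between two discs: center gap minus both radii.
                sd = np.linalg.norm(trajs[i, t] - trajs[j, t]) - 2 * radius
                violation = max(0.0, margin - sd)
                penalty += weight * violation ** 3   # smooth cubic penalty
    return penalty
```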
EMEF: Ensemble Multi-Exposure Image Fusion
Liu, Renshuai, Li, Chengyang, Cao, Haitao, Zheng, Yinglin, Zeng, Ming, Cheng, Xuan
Although remarkable progress has been made in recent years, current multi-exposure image fusion (MEF) research is still limited by the lack of real ground truth, an objective evaluation function, and a robust fusion strategy. In this paper, we study the MEF problem from a new perspective. We do not utilize any synthesized ground truth, design any loss function, or develop any fusion strategy. Our proposed method, EMEF, takes advantage of the wisdom of multiple imperfect MEF contributors, including both conventional and deep learning-based methods. Specifically, EMEF consists of two main stages: pre-training an imitator network and tuning the imitator at runtime. In the first stage, we make a unified network imitate different MEF targets via style modulation. In the second stage, we tune the imitator network by optimizing the style code, in order to find an optimal fusion result for each input pair. In the experiments, we construct EMEF from four state-of-the-art MEF methods and then compare it with the individual methods and several other competitive methods on the latest released MEF benchmark dataset. The promising experimental results demonstrate that our ensemble framework can "get the best of all worlds". The code is available at https://github.com/medalwill/EMEF.
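A minimal sketch of the second stage, under the assumption that the frozen imitator takes an exposure pair plus a style code, and that a differentiable no-reference quality metric (e.g., MEF-SSIM) is available. All names and dimensions here are illustrative, not EMEF's actual interface.

```python
# Runtime tuning: freeze the imitator, optimize only the style code.
import torch

def tune_style_code(imitator, under_exposed, over_exposed, quality_metric,
                    code_dim=4, steps=100, lr=0.05):
    imitator.eval()
    style_code = torch.zeros(1, code_dim, requires_grad=True)
    optimizer = torch.optim.Adam([style_code], lr=lr)
    for _ in range(steps):
        fused = imitator(under_exposed, over_exposed, style_code)
        loss = -quality_metric(fused, under_exposed, over_exposed)  # maximize quality
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    return imitator(under_exposed, over_exposed, style_code.detach())
```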
An Efficient Spatial-Temporal Trajectory Planner for Autonomous Vehicles in Unstructured Environments
Han, Zhichao, Wu, Yuwei, Li, Tong, Zhang, Lu, Pei, Liuao, Xu, Long, Li, Chengyang, Ma, Changjia, Xu, Chao, Shen, Shaojie, Gao, Fei
As a core part of autonomous driving systems, motion planning has received extensive attention from academia and industry. However, real-time trajectory planning capable of spatial-temporal joint optimization is challenged by nonholonomic dynamics, particularly in the presence of unstructured environments and dynamic obstacles. To bridge the gap, we propose a real-time trajectory optimization method that can generate a high-quality whole-body trajectory under arbitrary environmental constraints. By leveraging the differential flatness property of car-like robots, we simplify the trajectory representation and analytically formulate the planning problem while maintaining the feasibility of the nonholonomic dynamics. Moreover, we achieve efficient obstacle avoidance with a safe driving corridor for unmodeled obstacles and signed distance approximations for dynamic moving objects. We present comprehensive benchmarks against state-of-the-art methods, demonstrating the significance of the proposed method in terms of efficiency and trajectory quality. Real-world experiments verify the practicality of our algorithm. We will release our code to the research community.
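A worked sketch of the differential flatness property exploited above: for a bicycle-model car, the position (x, y) is a flat output, so the full state and inputs follow in closed form from its derivatives. These are the standard textbook formulas; the wheelbase value is a placeholder.

```python
# Recover heading, speed, and front steering angle from flat-output derivatives.
import numpy as np

def flat_to_state(dx, dy, ddx, ddy, wheelbase=2.5):
    """First/second time derivatives of (x, y) -> (theta, v, steer)."""
    v = np.hypot(dx, dy)                       # forward speed
    theta = np.arctan2(dy, dx)                 # heading from velocity direction
    curvature = (dx * ddy - dy * ddx) / (v**3 + 1e-9)
    steer = np.arctan(wheelbase * curvature)   # bicycle-model steering angle
    return theta, v, steer
```

This is what lets the planner optimize a smooth 2D curve directly while the nonholonomic constraints reduce to bounds on quantities like the recovered steering angle.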
Federated Deep Learning Meets Autonomous Vehicle Perception: Design and Verification
Wang, Shuai, Li, Chengyang, Ng, Derrick Wing Kwan, Eldar, Yonina C., Poor, H. Vincent, Hao, Qi, Xu, Chengzhong
Realizing human-like perception is a challenge in open driving scenarios due to corner cases and visual occlusions. To gather knowledge of rare and occluded instances, federated learning-assisted connected autonomous vehicles (FLCAV) have been proposed, leveraging vehicular networks to establish federated deep neural networks (DNNs) from distributed data captured by vehicles and road sensors. Without the need for data aggregation, FLCAV preserves privacy while reducing communication costs compared with conventional centralized learning. However, it is challenging to determine the network resources and road sensor placements for multi-stage training with multi-modal datasets in multi-variant scenarios. This article presents networking and training frameworks for FLCAV perception. Multi-layer graph resource allocation and vehicle-road contrastive sensor placement are proposed to address the network management and sensor deployment problems, respectively. We also develop CarlaFLCAV, a software platform that implements the above system and methods. Experimental results confirm the superiority of the proposed techniques compared with various benchmarks.
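A minimal FedAvg-style sketch of the federated training loop underlying the FLCAV setting: each vehicle updates a local copy of the perception DNN on its own data, and only model weights (never raw sensor data) are aggregated. This illustrates generic federated averaging, not the article's specific multi-stage framework, and assumes a model with floating-point parameters.

```python
# One round of federated averaging across vehicle-local data loaders.
import copy
import torch
import torch.nn.functional as F

def federated_round(global_model, vehicle_loaders, local_epochs=1, lr=1e-3):
    local_states, sizes = [], []
    for loader in vehicle_loaders:
        model = copy.deepcopy(global_model)
        opt = torch.optim.SGD(model.parameters(), lr=lr)
        for _ in range(local_epochs):
            for x, y in loader:                       # data never leaves the vehicle
                loss = F.cross_entropy(model(x), y)
                opt.zero_grad()
                loss.backward()
                opt.step()
        local_states.append(model.state_dict())
        sizes.append(len(loader.dataset))
    # Weighted average of parameters by local dataset size (FedAvg).
    total = sum(sizes)
    avg = {k: sum(s[k].float() * (n / total) for s, n in zip(local_states, sizes))
           for k in local_states[0]}
    global_model.load_state_dict(avg)
    return global_model
```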
Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes
Wang, Shuaijun, Gao, Rui, Han, Ruihua, Chen, Shengduo, Li, Chengyang, Hao, Qi
The major challenges of collision avoidance for robot navigation in crowded scenes lie in accurate environment modeling, fast perception, and trustworthy motion planning policies. This paper presents a novel adaptive environment-model-based collision avoidance reinforcement learning (AEMCARL) framework for an unmanned robot to achieve collision-free motion in challenging navigation scenarios. The novelty of this work is threefold: (1) a hierarchical gated recurrent unit (GRU) network for environment modeling; (2) an adaptive perception mechanism with an attention module; (3) an adaptive reward function for the reinforcement learning (RL) framework to jointly train the environment model, perception function and motion planning policy. The proposed method is tested with the Gym-Gazebo simulator and a group of robots (Husky and TurtleBot) under various crowded scenes. Both simulation and experimental results demonstrate the superior performance of the proposed method over baseline methods.
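A structural sketch of the first two ingredients named above: a GRU encodes each neighbor's observation history, and an attention module adaptively weights neighbors when forming the environment embedding fed to the policy. Dimensions and module layout are illustrative assumptions, not the paper's exact architecture.

```python
# GRU-encoded neighbor histories fused by a learned attention over neighbors.
import torch
import torch.nn as nn

class AttentiveEnvModel(nn.Module):
    def __init__(self, obs_dim=7, hidden_dim=64):
        super().__init__()
        self.gru = nn.GRU(obs_dim, hidden_dim, batch_first=True)
        self.attn_score = nn.Linear(hidden_dim, 1)

    def forward(self, neighbor_seqs):
        """neighbor_seqs: (num_neighbors, seq_len, obs_dim) observation histories."""
        _, h = self.gru(neighbor_seqs)           # h: (1, num_neighbors, hidden_dim)
        h = h.squeeze(0)                         # (num_neighbors, hidden_dim)
        weights = torch.softmax(self.attn_score(h), dim=0)  # attention over neighbors
        return (weights * h).sum(dim=0)          # fused environment embedding
```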
BBA-Net: A Bi-Branch Attention Network for Crowd Counting
Hou, Yi, Li, Chengyang, Yang, Fan, Ma, Cong, Zhu, Liping, Li, Yuan, Jia, Huizhu, Xie, Xiaodong
In the field of crowd counting, current mainstream CNN-based regression methods simply extract the density information of pedestrians without finding the position of each person. As a result, the network output often contains incorrect responses, which may lead to erroneous estimates of the total count and hinder the interpretability of the algorithm. To this end, we propose a Bi-Branch Attention Network (BBA-Net) for crowd counting with three innovations: i) a two-branch architecture estimates the density information and location information separately; ii) an attention mechanism facilitates feature extraction and reduces false responses; iii) a new density map generation method combining geometric adaptation and Voronoi splitting is introduced, which integrates each pedestrian's head and body information to enhance the expressive power of the density map. Extensive experiments on two public datasets show that our method achieves a lower crowd counting error compared to other state-of-the-art methods.
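For context, a sketch of the classic geometry-adaptive density map that such generation methods build on (the paper's Voronoi-based variant extends this idea): each annotated head becomes a Gaussian whose spread adapts to the spacing of its nearest neighbors, so the map integrates to the ground-truth count. Parameter values are the conventional defaults, used here as assumptions.

```python
# Geometry-adaptive Gaussian density map from head annotations.
import numpy as np
from scipy.ndimage import gaussian_filter
from scipy.spatial import cKDTree

def geometry_adaptive_density(points, shape, k=3, beta=0.3):
    """points: (N, 2) head coordinates (row, col); shape: (H, W) of the image."""
    density = np.zeros(shape, dtype=np.float32)
    if len(points) == 0:
        return density
    tree = cKDTree(points)
    dists, _ = tree.query(points, k=min(k + 1, len(points)))  # includes self
    for (r, c), d in zip(points.astype(int), dists):
        # Sigma scales with mean distance to the k nearest neighbors (dense
        # crowds -> small heads -> narrow Gaussians); fixed fallback for N == 1.
        sigma = beta * d[1:].mean() if len(points) > 1 else 4.0
        impulse = np.zeros(shape, dtype=np.float32)
        impulse[min(r, shape[0] - 1), min(c, shape[1] - 1)] = 1.0
        density += gaussian_filter(impulse, sigma)
    return density  # density.sum() approximates the annotated head count
```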