AITopics | Cai, Yinghao

Collaborating Authors

Cai, Yinghao

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

NeuGrasp: Generalizable Neural Surface Reconstruction with Background Priors for Material-Agnostic Object Grasp Detection

Fan, Qingyu, Cai, Yinghao, Li, Chao, He, Wenzhe, Zheng, Xudong, Lu, Tao, Liang, Bin, Wang, Shuo

arXiv.org Artificial IntelligenceMar-5-2025

Robotic grasping in scenes with transparent and specular objects presents great challenges for methods relying on accurate depth information. In this paper, we introduce NeuGrasp, a neural surface reconstruction method that leverages background priors for material-agnostic grasp detection. NeuGrasp integrates transformers and global prior volumes to aggregate multi-view features with spatial encoding, enabling robust surface reconstruction in narrow and sparse viewing conditions. By focusing on foreground objects through residual feature enhancement and refining spatial perception with an occupancy-prior volume, NeuGrasp excels in handling objects with transparent and specular surfaces. Extensive experiments in both simulated and real-world scenarios show that NeuGrasp outperforms state-of-the-art methods in grasping while maintaining comparable reconstruction quality. More details are available at https://neugrasp.github.io/.

artificial intelligence, machine learning, surface reconstruction, (14 more...)

arXiv.org Artificial Intelligence

2503.03511

Country:

Europe > Germany (0.14)
Asia (0.14)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.98)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)

Add feedback

Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Liu, Xuyuan, Cai, Yinghao, Yang, Qihui, Yan, Yujun

arXiv.org Artificial IntelligenceDec-11-2024

Graph Neural Networks (GNNs) have emerged as a dominant approach in graph representation learning, yet they often struggle to capture consistent similarity relationships among graphs. While graph kernel methods such as the Weisfeiler-Lehman subtree (WL-subtree) and Weisfeiler-Lehman optimal assignment (WLOA) kernels are effective in capturing similarity relationships, they rely heavily on predefined kernels and lack sufficient non-linearity for more complex data patterns. Our work aims to bridge the gap between neural network methods and kernel approaches by enabling GNNs to consistently capture relational structures in their learned representations. Given the analogy between the message-passing process of GNNs and WL algorithms, we thoroughly compare and analyze the properties of WL-subtree and WLOA kernels. We find that the similarities captured by WLOA at different iterations are asymptotically consistent, ensuring that similar graphs remain similar in subsequent iterations, thereby leading to superior performance over the WL-subtree kernel. Inspired by these findings, we conjecture that the consistency in the similarities of graph representations across GNN layers is crucial in capturing relational structures and enhancing graph classification performance. Thus, we propose a loss to enforce the similarity of graph representations to be consistent across different layers. Our empirical analysis verifies our conjecture and shows that our proposed consistency loss can significantly enhance graph classification performance across several GNN backbones on various datasets.

artificial intelligence, kernel, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2410.23748

Country:

Europe (1.00)
North America > United States > California > Los Angeles County > Long Beach (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Real-Time Marker Localization Learning for GelStereo Tactile Sensing

Liu, Shujuan, Cui, Shaowei, Zhang, Chaofan, Cai, Yinghao, Wang, Shuo

arXiv.org Artificial IntelligenceNov-24-2022

Visuotactile sensing technology is becoming more popular in tactile sensing, but the effectiveness of the existing marker detection localization methods remains to be further explored. Instead of contour-based blob detection, this paper presents a learning-based marker localization network for GelStereo visuotactile sensing called Marknet. Specifically, the Marknet presents a grid regression architecture to incorporate the distribution of the GelStereo markers. Furthermore, a marker rationality evaluator (MRE) is modelled to screen suitable prediction results. The experimental results show that the Marknet combined with MRE achieves 93.90% precision for irregular markers in contact areas, which outperforms the traditional contour-based blob detection method by a large margin of 42.32%. Meanwhile, the proposed learning-based marker localization method can achieve better real-time performance beyond the blob detection interface provided by the OpenCV library through GPU acceleration, which we believe will lead to considerable perceptual sensitivity gains in various robotic manipulation tasks.

machine learning, marknet, real time system, (16 more...)

arXiv.org Artificial Intelligence

2211.13428

Country: Asia > China (0.32)

Genre: Research Report (0.71)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.71)

Add feedback

ACDER: Augmented Curiosity-Driven Experience Replay

Li, Boyao, Lu, Tao, Li, Jiayi, Lu, Ning, Cai, Yinghao, Wang, Shuo

arXiv.org Artificial IntelligenceNov-16-2020

Exploration in environments with sparse feedback remains a challenging research problem in reinforcement learning (RL). When the RL agent explores the environment randomly, it results in low exploration efficiency, especially in robotic manipulation tasks with high dimensional continuous state and action space. In this paper, we propose a novel method, called Augmented Curiosity-Driven Experience Replay (ACDER), which leverages (i) a new goal-oriented curiosity-driven exploration to encourage the agent to pursue novel and task-relevant states more purposefully and (ii) the dynamic initial states selection as an automatic exploratory curriculum to further improve the sample-efficiency. Our approach complements Hindsight Experience Replay (HER) by introducing a new way to pursue valuable states. Experiments conducted on four challenging robotic manipulation tasks with binary rewards, including Reach, Push, Pick&Place and Multi-step Push. The empirical results show that our proposed method significantly outperforms existing methods in the first three basic tasks and also achieves satisfactory performance in multi-step robotic task learning.

agent, artificial intelligence, computer game, (16 more...)

arXiv.org Artificial Intelligence

2011.08027

Country: Asia > China (0.14)

Genre: Research Report > New Finding (0.48)

Industry: Leisure & Entertainment > Games > Computer Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

FameSVD: Fast and Memory-efficient Singular Value Decomposition

Li, Xiaocan, Wang, Shuo, Cai, Yinghao

arXiv.org Machine LearningJun-28-2019

Dimensionality reduction has always been a trendy topic in machine learning. Linear subspace method for reduction, e.g., Principal Component Analysis and its variation have been widely studied[21, 16, 23], and some pieces of literature introduce probability and randomness to realize PCA[18, 12, 11, 14, 20]. However, linear subspace is not applicable when the data lies in a nonlinear manifold[2, 15]. Due to the direct connection with PCA, Singular Value Decomposition (SVD) is one of the most well-known algorithms for low-rank approximation[19, 9], and it has been widely used throughout the machine learning and statistics community. Some implementations of SVD are solving least squares[10, 22], latent semantic analysis[7, 13], genetic analysis, matrix completion[17, 3, 5, 4], data mining[6, 1] etc. However, when it comes to a large scale matrix, the runtime of traditional SVD is intolerable and the memory usage could be enormously consuming. Notations: we have a matrix A with size m n, usually m n. Our goal is to find the Principal Components (PCs) given the cumulative explained variance threshold t.

artificial intelligence, complexity, machine learning, (16 more...)

arXiv.org Machine Learning

1906.12085

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.56)

Add feedback

Hindsight Generative Adversarial Imitation Learning

Liu, Naijun, Lu, Tao, Cai, Yinghao, Li, Boyao, Wang, Shuo

arXiv.org Machine LearningMar-19-2019

Compared to reinforcement learning, imitation learning (IL) is a powerful paradigm for training agents to learn control policies efficiently from expert demonstrations. However, in most cases, obtaining demonstration data is costly and laborious, which poses a significant challenge in some scenarios. A promising alternative is to train agent learning skills via imitation learning without expert demonstrations, which, to some extent, would extremely expand imitation learning areas. To achieve such expectation, in this paper, we propose Hindsight Generative Adversarial Imitation Learning (HGAIL) algorithm, with the aim of achieving imitation learning satisfying no need of demonstrations. Combining hindsight idea with the generative adversarial imitation learning (GAIL) framework, we realize implementing imitation learning successfully in cases of expert demonstration data are not available. Experiments show that the proposed method can train policies showing comparable performance to current imitation learning methods. Further more, HGAIL essentially endows curriculum learning mechanism which is critical for learning policies.

artificial intelligence, imitation learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

1903.07854

Country: Europe > Sweden (0.14)

Genre: Research Report (1.00)

Industry: Education > Focused Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback