Collaborating Authors: Sun, Jianyong


Learning from Few Demonstrations with Frame-Weighted Motion Generation

arXiv.org Artificial Intelligence

Learning from Demonstration (LfD) enables robots to acquire versatile skills by learning motion policies from human demonstrations. It gives users an intuitive interface for transferring new skills to robots without time-consuming robot programming or inefficient solution exploration. During task execution, the robot's motion is usually influenced by constraints imposed by the environment. In light of this, task-parameterized LfD (TP-LfD) encodes relevant contextual information into reference frames, enabling better skill generalization to new situations. However, most TP-LfD algorithms require multiple demonstrations across various environmental conditions to ensure sufficient statistics for a meaningful model, and it is not trivial for robot users to create all of these situations and demonstrate under each of them. This paper therefore presents a novel algorithm that learns skills from few demonstrations. By leveraging reference frame weights that capture the importance or relevance of each frame during task execution, our method demonstrates excellent skill acquisition performance, which is validated in real robotic environments.
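
A minimal sketch of the frame-weighting idea, not the paper's algorithm: each reference frame contributes a Gaussian prediction of the robot pose in the global frame, and scalar frame weights scale the precision of each contribution before a product-of-Gaussians fusion. The function fuse_frame_predictions and the toy start/goal frames below are invented for illustration.

import numpy as np

def fuse_frame_predictions(frame_means, frame_covs, frame_weights):
    """Fuse per-frame Gaussian pose predictions into one global-frame Gaussian
    via a weight-scaled product of Gaussians.

    frame_means:   list of (d,) means already mapped into the global frame
    frame_covs:    list of (d, d) covariances in the global frame
    frame_weights: scalars in [0, 1]; larger values mark more relevant frames
    """
    d = frame_means[0].shape[0]
    precision = np.zeros((d, d))
    info = np.zeros(d)
    for mu, sigma, w in zip(frame_means, frame_covs, frame_weights):
        lam = w * np.linalg.inv(sigma)   # down-weight less important frames
        precision += lam
        info += lam @ mu
    cov = np.linalg.inv(precision)
    return cov @ info, cov

# Toy example: a start frame and a goal frame disagree about the target pose;
# the frame weights decide which one dominates the fused prediction.
mu_start, cov_start = np.array([0.0, 0.0]), 0.05 * np.eye(2)
mu_goal, cov_goal = np.array([1.0, 0.5]), 0.05 * np.eye(2)
mean, cov = fuse_frame_predictions(
    [mu_start, mu_goal], [cov_start, cov_goal], frame_weights=[0.2, 0.8])
print(mean)  # pulled toward the goal frame, which carries the larger weight

In the few-demonstration setting the paper targets, estimating such frame weights reliably is exactly where the statistics become scarce.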


Amortized Variational Deep Q Network

arXiv.org Artificial Intelligence

Efficient exploration is one of the most important issues in deep reinforcement learning. To address it, recent methods treat the value-function parameters as random variables and resort to variational inference to approximate their posterior. In this paper, we propose an amortized variational inference framework to approximate the posterior distribution of the action-value function in a Deep Q Network. We establish the equivalence between the loss of the new model and the amortized variational inference loss. We balance exploration and exploitation by assuming a Cauchy and a Gaussian posterior, respectively, in a two-stage training process. We show that the amortized framework results in significantly fewer learnable parameters than the existing state-of-the-art method. Experimental results on classical control tasks in OpenAI Gym and on chain Markov Decision Process tasks show that the proposed method performs significantly better than state-of-the-art methods while requiring much less training time.
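
As a rough illustration of the two-stage posterior assumption, the sketch below performs Thompson-style action selection over tabular action values: a heavy-tailed Cauchy posterior in stage one encourages exploration, while a Gaussian posterior in stage two favors exploitation. The sample_q helper, the toy means, and the per-action scales are assumptions for this example; the paper's amortized network that outputs the posterior parameters is not reproduced here.

import numpy as np

rng = np.random.default_rng(0)

def sample_q(mean_q, scale_q, stage):
    """Sample action values from an assumed posterior before taking the argmax.

    Stage 1: Cauchy posterior, heavy tails -> aggressive exploration.
    Stage 2: Gaussian posterior, light tails -> exploitation with mild noise.
    """
    if stage == 1:
        noise = rng.standard_cauchy(size=mean_q.shape)
    else:
        noise = rng.standard_normal(size=mean_q.shape)
    return mean_q + scale_q * noise

# One toy state with 3 actions; in the amortized setting the mean and scale
# would both come from a single shared network head instead of fixed arrays.
mean_q = np.array([1.0, 1.2, 0.9])
scale_q = np.array([0.05, 0.40, 0.05])   # the middle action is most uncertain

picks1 = [int(np.argmax(sample_q(mean_q, scale_q, stage=1))) for _ in range(1000)]
picks2 = [int(np.argmax(sample_q(mean_q, scale_q, stage=2))) for _ in range(1000)]
print("stage 1 action counts:", np.bincount(picks1, minlength=3))
print("stage 2 action counts:", np.bincount(picks2, minlength=3))

With these numbers, the heavy-tailed stage keeps sampling the apparently worst action from time to time, while the Gaussian stage almost never selects it.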


On Hyper-parameter Tuning for Stochastic Optimization Algorithms

arXiv.org Machine Learning

This paper proposes the first algorithmic framework for tuning the hyper-parameters of stochastic optimization algorithms based on reinforcement learning. Hyper-parameters strongly influence the performance of stochastic optimization algorithms such as evolutionary algorithms (EAs) and meta-heuristics, yet determining optimal hyper-parameters is very time-consuming due to the stochastic nature of these algorithms. We propose to model the tuning procedure as a Markov decision process and resort to a policy gradient algorithm to tune the hyper-parameters. Experiments on tuning stochastic algorithms with different kinds of hyper-parameters (continuous and discrete) for different optimization problems (continuous and discrete) show that the proposed hyper-parameter tuning algorithms do not require much less running time of the stochastic algorithms than the Bayesian optimization method. The proposed framework can be used as a standard tool for hyper-parameter tuning in stochastic algorithms.
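
A minimal sketch of the idea, under the simplifying assumption of a single degenerate state, so the policy-gradient tuner reduces to REINFORCE over one continuous hyper-parameter. The toy optimizer run_search, the sphere objective, and the learning rates are placeholders for illustration, not the paper's setup.

import numpy as np

rng = np.random.default_rng(1)

def run_search(sigma, iters=50, dim=5):
    """One run of a toy stochastic optimizer: (1+1)-style random search on the
    sphere function, whose only hyper-parameter is the mutation scale sigma."""
    x = rng.normal(size=dim)
    best = np.sum(x**2)
    for _ in range(iters):
        cand = x + sigma * rng.normal(size=dim)
        f = np.sum(cand**2)
        if f < best:
            x, best = cand, f
    return best

# REINFORCE over a Gaussian policy on log(sigma). Reward = negative final loss,
# so the policy gradient pushes theta toward mutation scales that optimize well.
theta, policy_std, lr, baseline = 0.0, 0.3, 0.05, None
for episode in range(200):
    log_sigma = rng.normal(theta, policy_std)
    reward = -run_search(np.exp(log_sigma))
    baseline = reward if baseline is None else 0.9 * baseline + 0.1 * reward
    grad = (log_sigma - theta) / policy_std**2       # d/d theta of log-policy
    theta += lr * (reward - baseline) * grad
print("tuned mutation scale:", np.exp(theta))

This sketch only shows how the policy-gradient tuning loop closes; the paper's framework models the full tuning procedure as a Markov decision process rather than a one-shot bandit.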


Homotopic Convex Transformation: A New Method to Smooth the Landscape of the Traveling Salesman Problem

arXiv.org Artificial Intelligence

This paper proposes a novel landscape smoothing method for the symmetric Traveling Salesman Problem (TSP). We first define the homotopic convex (HC) transformation of a TSP as a convex combination of a well-constructed simple TSP and the original TSP. We observe that, controlled by the coefficient of the convex combination, (i) the landscape of the HC-transformed TSP is smoothed in the sense that it has fewer local optima than the original TSP; and (ii) the fitness distance correlation of the HC-transformed TSP is increased. We then propose an iterative algorithmic framework in which the proposed HC transformation is combined with a heuristic TSP solver; it serves as a scheme for escaping local optima and improves the global search ability of the combined heuristic. A case study with 3-Opt local search as the heuristic solver shows that the resulting algorithm significantly outperforms iterated local search and two other smoothing-based TSP heuristic solvers on most of the commonly used test instances.
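
The convex combination can be written down directly. The sketch below builds a simple instance whose global optimum is a chosen reference tour (tour edges cost 0, every other edge costs 1; the paper's construction of the well-constructed simple TSP may differ), blends its distance matrix with the original one, and brute-forces a tiny instance to show how the optimum drifts toward the reference tour as the coefficient grows. All function names here are illustrative.

import numpy as np
from itertools import permutations

rng = np.random.default_rng(2)

def tour_length(tour, dist):
    return sum(dist[tour[i], tour[(i + 1) % len(tour)]] for i in range(len(tour)))

def simple_tsp_from_tour(tour, n):
    """A trivially easy TSP whose global optimum is the given reference tour:
    edges on that tour cost 0, every other edge costs 1."""
    d = np.ones((n, n)) - np.eye(n)
    for i in range(n):
        a, b = tour[i], tour[(i + 1) % n]
        d[a, b] = d[b, a] = 0.0
    return d

def hc_distance(dist, dist_simple, lam):
    """Homotopic convex transformation: a convex combination of the simple
    instance and the original instance, controlled by lam in [0, 1]."""
    return lam * dist_simple + (1.0 - lam) * dist

# Tiny random instance so every tour can be enumerated.
n = 7
pts = rng.random((n, 2))
dist = np.linalg.norm(pts[:, None] - pts[None, :], axis=-1)
ref_tour = list(range(n))                # reference tour defining the simple TSP
dist_s = simple_tsp_from_tour(ref_tour, n)

for lam in (0.0, 0.5, 0.9):
    d_lam = hc_distance(dist, dist_s, lam)
    best = min(permutations(range(1, n)),
               key=lambda p: tour_length((0,) + p, d_lam))
    print(f"lam={lam}: best tour {(0,) + best}")

The coefficient controls how strongly the landscape is pulled toward the simple instance, which is what gives the combined heuristic its mechanism for escaping local optima.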


From Adaptive Kernel Density Estimation to Sparse Mixture Models

arXiv.org Machine Learning

We introduce a balloon estimator in a generalized expectation-maximization method for estimating all parameters of a Gaussian mixture model given one data sample per mixture component. Instead of limiting the model size explicitly, this regularization strategy yields low-complexity sparse models in which the number of effective mixture components decreases as a smoothing probability parameter $P > 0$ increases. The semi-parametric method bridges from non-parametric adaptive kernel density estimation (KDE) to parametric ordinary least squares when $P = 1$. Experiments show that the simpler sparse mixture models retain the level of detail present in the adaptive KDE solution.
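
For reference, the balloon estimator that this work starts from can be stated in a few lines: a kernel density estimate whose bandwidth depends on the query point, here taken as the distance to the k-th nearest sample. This is only the textbook adaptive-KDE ingredient; the paper's generalized EM step and smoothing parameter $P$ are not reproduced, and the balloon_kde helper with its k parameter is an illustrative choice.

import numpy as np

rng = np.random.default_rng(4)

def balloon_kde(query, samples, k=10):
    """Balloon (query-adaptive) KDE in 1D: the Gaussian bandwidth at each query
    point is its distance to the k-th nearest sample, so sparse regions get
    wider kernels than dense ones."""
    query = np.atleast_1d(query).astype(float)
    dens = np.empty_like(query)
    for i, x in enumerate(query):
        d = np.abs(samples - x)
        h = np.sort(d)[k - 1] + 1e-12            # query-dependent bandwidth
        dens[i] = np.mean(np.exp(-0.5 * (d / h) ** 2) / (h * np.sqrt(2 * np.pi)))
    return dens

samples = np.concatenate([rng.normal(-2.0, 0.3, 150), rng.normal(1.5, 0.8, 50)])
grid = np.linspace(-4, 4, 9)
for x, p in zip(grid, balloon_kde(grid, samples)):
    print(f"x = {x:+.1f}   density ~ {p:.3f}")

Starting from one component per sample, the paper's generalized EM then reduces this KDE-like model to a sparse mixture as the smoothing parameter grows.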