AITopics | Zou, Xinyun

Collaborating Authors

Zou, Xinyun

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Xing, Jinwei, Nagata, Takashi, Chen, Kexin, Zou, Xinyun, Neftci, Emre, Krichmar, Jeffrey L.

arXiv.org Artificial Intelligence, Feb-10-2021

Despite the recent success of deep reinforcement learning (RL), domain adaptation remains an open problem. Although the generalization ability of RL agents is critical for the real-world applicability of deep RL, zero-shot policy transfer is still challenging, since even minor visual changes can make the trained agent fail completely in the new task. To address this issue, we propose a two-stage RL agent that first learns a latent unified state representation (LUSR) that is consistent across multiple domains, and then performs RL training in one source domain based on LUSR. The cross-domain consistency of LUSR allows the policy acquired in the source domain to generalize to other target domains without extra training. We first demonstrate our approach in variants of CarRacing games with customized manipulations, and then verify it in CARLA, an autonomous driving simulator with more complex and realistic visual observations. Our results show that this approach achieves state-of-the-art domain adaptation performance in related RL tasks and outperforms prior approaches based on latent-representation-based RL and image-to-image translation.

artificial intelligence, reinforcement learning, target domain

2102.05714

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Transportation (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neuromodulated Patience for Robot and Self-Driving Vehicle Navigation

Xing, Jinwei, Zou, Xinyun, Krichmar, Jeffrey L.

arXiv.org Artificial Intelligence, Sep-14-2019

Robots and self-driving vehicles face a number of challenges when navigating through real environments. Successful navigation in dynamic environments requires prioritizing subtasks and monitoring resources. Animals are under similar constraints. It has been shown that the neuromodulator serotonin regulates impulsiveness and patience in animals. In the present paper, we take inspiration from the serotonergic system and apply it to the task of robot navigation. In a set of outdoor experiments, we show how changing the level of patience can affect the amount of time the robot will spend searching for a desired location. To navigate GPS-compromised environments, we introduce a deep reinforcement learning paradigm in which the robot learns to follow sidewalks. This may further regulate a tradeoff between a smooth long route and a rough shorter route. Using patience as a parameter may be beneficial for autonomous systems under time pressure.

ground transportation, health & medicine, waypoint

1909.06533

Country: North America > United States > California > Orange County > Irvine (0.16)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.97)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Attention-Based Structural-Plasticity

Kolouri, Soheil, Ketz, Nicholas, Zou, Xinyun, Krichmar, Jeffrey, Pilly, Praveen

arXiv.org Machine Learning, Mar-2-2019

Catastrophic forgetting/interference is a critical problem for lifelong learning machines: it prevents agents from maintaining previously learned knowledge while learning new tasks. Neural networks, in particular, suffer severely from catastrophic forgetting. Recently there have been several efforts toward overcoming catastrophic forgetting in neural networks. Here, we propose a biologically inspired method toward overcoming catastrophic forgetting. Specifically, we define an attention-based selective plasticity of synapses based on the cholinergic neuromodulatory system in the brain. We define synaptic importance parameters in addition to synaptic weights and then use Hebbian learning in parallel with the backpropagation algorithm to learn synaptic importances in an online and seamless manner. We test our proposed method on benchmark tasks, including the Permuted MNIST and Split MNIST problems, and show competitive performance compared to state-of-the-art methods.
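
The idea of per-synapse importance parameters can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the plain co-activation Hebbian rule, the constants, and the quadratic penalty on weight change are all assumptions chosen for illustration (the abstract does not say how the importances enter the loss). The sketch only shows how an importance value learned from pre- and post-synaptic activity could make changes to some weights more costly than changes to others.

```python
# Toy sketch (hypothetical names and constants, not the paper's method):
# importances grow with pre/post co-activation and then scale a penalty
# on moving each weight away from its previously learned value.

def hebbian_importance(importance, pre, post, rate=0.1):
    """Return importances increased by rate * |post[i] * pre[j]|."""
    return [
        [importance[i][j] + rate * abs(post[i] * pre[j])
         for j in range(len(pre))]
        for i in range(len(post))
    ]


def consolidation_penalty(weights, old_weights, importance, strength=1.0):
    """Assumed penalty: strength * sum(importance * (w - w_old) ** 2)."""
    return strength * sum(
        importance[i][j] * (weights[i][j] - old_weights[i][j]) ** 2
        for i in range(len(weights))
        for j in range(len(weights[i]))
    )


# One output unit, two inputs; only the first input is active.
importance = hebbian_importance([[0.0, 0.0]], pre=[1.0, 0.0], post=[2.0])

old_weights = [[1.0, 1.0]]
new_weights = [[1.5, 3.0]]  # the second weight moved much further
penalty = consolidation_penalty(new_weights, old_weights, importance)
print(importance, penalty)
```

In this toy setup the second synapse never saw co-activation, so its importance stays at zero and its large weight change is not penalized, while the smaller change to the active synapse is.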

health & medicine, importance parameter, neural network

1903.0607

Country: North America > United States > California > Orange County > Irvine (0.14)

Genre: Research Report (0.84)

Industry:

Health & Medicine (0.46)
Education (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
