Huang, Xiucai
Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation
Zhang, Hao, Wang, Hao, Huang, Xiucai, Chen, Wenrui, Kan, Zhen
Reinforcement Learning (RL) based methods have been increasingly explored for robot learning. However, RL based methods often suffer from low sampling efficiency in the exploration phase, especially for long-horizon manipulation tasks, and generally neglect the semantic information from the task level, resulted in a delayed convergence or even tasks failure. To tackle these challenges, we propose a Temporal-Logic-guided Hybrid policy framework (HyTL) which leverages three-level decision layers to improve the agent's performance. Specifically, the task specifications are encoded via linear temporal logic (LTL) to improve performance and offer interpretability. And a waypoints planning module is designed with the feedback from the LTL-encoded task level as a high-level policy to improve the exploration efficiency. The middle-level policy selects which behavior primitives to execute, and the low-level policy specifies the corresponding parameters to interact with the environment. We evaluate HyTL on four challenging manipulation tasks, which demonstrate its effectiveness and interpretability. Our project is available at: https://sites.google.com/view/hytl-0257/.
Neuroadaptive Distributed Event-triggered Control of Networked Uncertain Pure-feedback Systems with Polluted Feedback
Sun, Libei, Zhang, Zhirong, Huang, Xinjian, Huang, Xiucai
This paper investigates the distributed event-triggered control problem for a class of uncertain pure-feedback nonlinear multi-agent systems (MASs) with polluted feedback. Under the setting of event-triggered control, substantial challenges exist in both control design and stability analysis for systems in more general non-affine pure-feedback forms wherein all state variables are not directly and continuously available or even polluted due to sensor failures, and thus far very limited results are available in literature. In this work, a nominal control strategy under regular state feedback is firstly developed by combining neural network (NN) approximating with dynamic filtering technique, and then a NN-based distributed event-triggered control strategy is proposed by resorting to a novel replacement policy, making the non-differentiability issue arising from event-triggering setting completely circumvented. Besides, the sensor ineffectiveness is accommodated automatically without using fault detection and diagnosis unit or controller reconfiguration. It is shown that all the internal signals are semi-globally uniformly ultimately bounded (SGUUB) with the aid of several vital lemmas, while the outputs of all the subsystems reaching a consensus without infinitely fast execution. Finally, the efficiency of the developed algorithm are verified via numerical simulation.
Asymptotic Tracking Control of Uncertain MIMO Nonlinear Systems with Less Conservative Controllability Conditions
Zhou, Bing, Huang, Xiucai, Song, Yongduan
For uncertain multiple inputs multi-outputs (MIMO) nonlinear systems, it is nontrivial to achieve asymptotic tracking, and most existing methods normally demand certain controllability conditions that are rather restrictive or even impractical if unexpected actuator faults are involved. In this note, we present a method capable of achieving zero-error steady-state tracking with less conservative (more practical) controllability condition. By incorporating a novel Nussbaum gain technique and some positive integrable function into the control design, we develop a robust adaptive asymptotic tracking control scheme for the system with time-varying control gain being unknown its magnitude and direction. By resorting to the existence of some feasible auxiliary matrix, the current state-of-art controllability condition is further relaxed, which enlarges the class of systems that can be considered in the proposed control scheme. All the closed-loop signals are ensured to be globally ultimately uniformly bounded. Moreover, such control methodology is further extended to the case involving intermittent actuator faults, with application to robotic systems. Finally, simulation studies are carried out to demonstrate the effectiveness and flexibility of this method.