Efficient Symbolic Policy Learning with Differentiable Symbolic Expression

Jan-19-2025, 06:24:21 GMT–Neural Information Processing Systems

Deep reinforcement learning (DRL) has led to a wide range of advances in sequential decision-making tasks. However, the complexity of neural network policies makes it difficult to understand and deploy with limited computational resources. Currently, employing compact symbolic expressions as symbolic policies is a promising strategy to obtain simple and interpretable policies. Previous symbolic policy methods usually involve complex training processes and pre-trained neural network policies, which are inefficient and limit the application of symbolic policies. In this paper, we propose an efficient gradient-based learning method named Efficient Symbolic Policy Learning (ESPL) that learns the symbolic policy from scratch in an end-to-end way.

differentiable symbolic expression, efficient symbolic policy learning, symbolic policy, (6 more...)

Neural Information Processing Systems

Jan-19-2025, 06:24:21 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)