Deep Reinforcement Learning with Stage Incentive Mechanism for Robotic Trajectory Planning

Sep-25-2020, arXiv.org Artificial Intelligence

ABSTRACT This work aims to improve the efficiency of deep reinforcement learning (DRL) based methods for robot manipulator trajectory planning in random working environments. Firstly, a posture reward function is proposed to accelerate the learning process and yield a more reasonable trajectory by modeling distance and direction constraints, which reduces the blindness of exploration. Secondly, to improve stability, a stride reward function is proposed by modeling constraints on the distance and on the movement distance of the joints, which makes the learning process more stable. To further improve learning efficiency, we draw on the cognitive process of human behavior and propose a stage incentive mechanism, comprising a hard stage incentive reward function and a soft stage incentive reward function. Extensive experiments show that the proposed soft stage incentive reward function improves the convergence rate by up to 46.9% with state-of-the-art DRL methods. The convergence mean reward increases by 4.4% to 15.5%, and the standard deviation decreases by 21.9% to 63.2%. In the evaluation, the success rate of trajectory planning for the robot manipulator is up to 99.6%.
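The abstract names the reward terms but gives no formulas, so the following is only a minimal sketch of how such terms might be combined. Every function name, weight, threshold, and functional form here is an assumption made for illustration and is not taken from the paper: the posture-style term penalises distance and rewards motion aligned with the target direction, the stride-style term penalises distance and joint movement, and the "hard" and "soft" stage weights switch or blend between them as the end effector approaches the target.

```python
import numpy as np


def posture_reward(prev_pos, pos, target, w_dist=1.0, w_dir=0.5):
    """Illustrative posture-style reward (assumed form, not the paper's).

    Penalises the remaining distance to the target and rewards steps whose
    direction is aligned with the direction to the target.
    """
    dist = np.linalg.norm(target - pos)
    step = pos - prev_pos
    to_target = target - prev_pos
    denom = np.linalg.norm(step) * np.linalg.norm(to_target)
    # Cosine of the angle between the step and the ideal direction;
    # defined as 0 when either vector has (near-)zero length.
    cos_angle = float(step @ to_target / denom) if denom > 1e-12 else 0.0
    return float(-w_dist * dist + w_dir * cos_angle)


def stride_reward(pos, target, joint_delta, w_dist=1.0, w_stride=0.1):
    """Illustrative stride-style reward (assumed form).

    Penalises the remaining distance and the size of the joint movement,
    which discourages large, jerky steps.
    """
    dist = np.linalg.norm(target - pos)
    return float(-w_dist * dist - w_stride * np.linalg.norm(joint_delta))


def hard_stage_weight(dist, threshold=0.1):
    """Hard switch: 0 while far from the target, 1 once within threshold."""
    return 1.0 if dist <= threshold else 0.0


def soft_stage_weight(dist, threshold=0.1, sharpness=50.0):
    """Soft switch: a logistic blend centred on the same threshold."""
    return float(1.0 / (1.0 + np.exp(sharpness * (dist - threshold))))


def stage_incentive_reward(prev_pos, pos, target, joint_delta, mode="soft"):
    """Blend the two terms: posture-style when far, stride-style when near."""
    dist = float(np.linalg.norm(target - pos))
    w = soft_stage_weight(dist) if mode == "soft" else hard_stage_weight(dist)
    r_posture = posture_reward(prev_pos, pos, target)
    r_stride = stride_reward(pos, target, joint_delta)
    return (1.0 - w) * r_posture + w * r_stride
```

Under these assumptions, the hard variant changes the reward abruptly at the threshold, while the soft variant changes it gradually. The abstract's reported stability gains for the soft variant would be consistent with that kind of smooth transition, though the paper's actual formulation may differ.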

artificial intelligence, machine learning, reinforcement learning

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning > Reinforcement Learning (1.00)
