Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning
Sledge, Isaac J., Bryner, Darshan W., Principe, Jose C.
arXiv.org Artificial Intelligence
Reinforcement learning in large-scale environments is challenging due to the many possible actions that can be taken in specific situations. We have previously developed a means of constraining, and hence speeding up, the search process through the use of motion primitives; motion primitives are sequences of pre-specified actions taken across a state series. As a byproduct of this work, we have found that if the motion primitives' motions and actions are labeled, then the search can be sped up further. Since motion primitives may initially lack such details, we propose a theoretically viewpoint-insensitive and speed-insensitive means of automatically annotating the underlying motions and actions. We do this through a differential-geometric, spatio-temporal kinematics descriptor, which analyzes how the poses of entities in two motion sequences change over time. We use this descriptor in conjunction with a weighted-nearest-neighbor classifier to label the primitives using a limited set of training examples. In our experiments, we achieve high motion and action annotation rates for human-action-derived primitives with as few as one training sample. We also demonstrate that reinforcement learning using accurately labeled trajectories leads to high-performing policies more quickly than standard reinforcement learning techniques. This is partly because motion primitives encode prior domain knowledge and preempt the need to re-discover that knowledge during training. It is also because agents can leverage the labels to systematically ignore action classes that do not facilitate task objectives, thereby reducing the action space.
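The labeling and pruning steps described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each primitive has already been reduced to a fixed-length feature vector and uses plain Euclidean distance as a stand-in for the paper's differential-geometric kinematics descriptor. The names `weighted_nn_label` and `prune_primitives`, the parameter `k`, and the inverse-distance weighting are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict


def weighted_nn_label(query, examples, k=3, eps=1e-9):
    """Label a query vector by an inverse-distance-weighted vote of its k nearest examples.

    `examples` is a list of (feature_vector, label) pairs. Euclidean distance
    is an assumption here; the paper compares motion sequences with its own
    kinematics descriptor instead.
    """
    # Sort training examples by distance to the query.
    ranked = sorted((math.dist(query, feat), label) for feat, label in examples)
    votes = defaultdict(float)
    for dist, label in ranked[:k]:
        # Closer examples contribute larger weights; eps avoids division by zero.
        votes[label] += 1.0 / (dist + eps)
    return max(votes, key=votes.get)


def prune_primitives(primitives, labels, allowed):
    """Keep only the primitives whose label is in the allowed set of action classes."""
    return [p for p, lab in zip(primitives, labels) if lab in allowed]
```

With labels in hand, an agent could call `prune_primitives` once per task to restrict its search to the action classes that matter, which is the action-space reduction the abstract describes. The classifier also works with a single training example per class, since `k` simply caps the number of neighbors considered.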
Feb-23-2021
- Country:
  - Asia
    - China
    - Middle East
      - Israel > Haifa District
        - Haifa (0.04)
      - Jordan (0.04)
    - Taiwan > Taiwan Province
      - Taipei (0.04)
  - Europe
    - Austria > Styria
      - Graz (0.04)
    - Finland > Uusimaa
      - Helsinki (0.04)
    - France > Provence-Alpes-Côte d'Azur
      - Alpes-Maritimes > Nice (0.04)
      - Bouches-du-Rhône > Marseille (0.04)
    - Netherlands > South Holland
      - Dordrecht (0.04)
    - Portugal (0.04)
  - North America
    - Canada > Quebec (0.04)
    - United States
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - Massachusetts
        - Middlesex County > Cambridge (0.04)
        - Suffolk County > Boston (0.04)
      - Alaska > Anchorage Municipality
        - Anchorage (0.04)
      - Washington > King County
        - Seattle (0.04)
      - Illinois > Cook County
        - Chicago (0.04)
      - Utah > Salt Lake County
        - Salt Lake City (0.04)
      - Rhode Island > Providence County
        - Providence (0.04)
      - New York
        - Bronx County > New York City (0.04)
        - Kings County > New York City (0.04)
        - New York County > New York City (0.14)
        - Queens County > New York City (0.04)
        - Richmond County > New York City (0.04)
      - Minnesota > Ramsey County
        - Saint Paul (0.04)
      - California > San Francisco County
        - San Francisco (0.14)
      - Florida
        - Alachua County > Gainesville (0.14)
        - Bay County > Panama City (0.04)
        - Miami-Dade County > Miami (0.04)
  - Oceania > Australia
    - New South Wales > Sydney (0.04)
  - South America > Brazil
    - Rio de Janeiro > Rio de Janeiro (0.04)
- Genre:
  - Research Report > New Finding (0.66)
- Industry:
  - Leisure & Entertainment (0.46)
- Technology: