safe region
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > Germany (0.04)
TimeDiscretization-Invariant SafeActionRepetitionforPolicyGradientMethods
In reinforcement learning, continuous time is often discretized by a time scale δ, to which the resulting performance is known to be highly sensitive. In this work, we seek tofind aδ-invariantalgorithm for policygradient (PG) methods, which performs well regardless of the value ofδ. We first identify the underlying reasons that cause PG methods to fail asδ 0, proving that the variance of the PG estimator can diverge to infinity in stochastic environments under a certain assumption of stochasticity. While durative actions or action repetition can be employed to haveδ-invariance, previous action repetition methods cannot immediately react to unexpected situations in stochastic environments. We thus propose a novelδ-invariant method namedSafe Action Repetition (SAR) applicable to any existing PG algorithm. SAR can handle the stochasticity of environments byadaptivelyreacting tochanges instates during action repetition.
- Asia > Middle East > Jordan (0.04)
- Europe > France (0.04)
- Asia > Vietnam > Long An Province (0.04)
- Europe > Sweden > Östergötland County > Linköping (0.04)
- Africa > Senegal > Dakar Region > Dakar (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > Germany > Hesse > Darmstadt Region > Wiesbaden (0.04)
Safe Active Navigation and Exploration for Planetary Environments Using Proprioceptive Measurements
Jiang, Matthew, Liu, Shipeng, Qian, Feifei
Abstract--Legged robots can sense terrain through force interactions during locomotion, offering more reliable traversability estimates than remote sensing and serving as scouts for guiding wheeled rovers in challenging environments. However, even legged scouts face challenges when traversing highly deformable or unstable terrain. We present Safe Active Exploration for Granular T errain (SAEGT), a navigation framework that enables legged robots to safely explore unknown granular environments using proprioceptive sensing, particularly where visual input fails to capture terrain deformability. SAEGT estimates the safe region and frontier region online from leg-terrain interactions using Gaussian Process regression for traversability assessment, with a reactive controller for real-time safe exploration and navigation. SAEGT demonstrated its ability to safely explore and navigate toward a specified goal using only proprioceptively estimated traversability in simulation.
- North America > United States > California > Los Angeles County > Los Angeles (0.28)
- North America > United States > Texas > Montgomery County > The Woodlands (0.04)
- North America > United States > California > Los Angeles County > Pasadena (0.04)
- (2 more...)
- Asia > South Korea > Seoul > Seoul (0.05)
- Asia > Middle East > Jordan (0.04)
- Europe > France (0.04)
- Asia > Vietnam > Long An Province (0.04)