Sagi, Aditya
Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach
Paigwar, Kartik, Krishna, Lokesh, Tirumala, Sashank, Khetan, Naman, Sagi, Aditya, Joglekar, Ashish, Bhatnagar, Shalabh, Ghosal, Ashitava, Amrutur, Bharadwaj, Kolathaya, Shishir
In this paper, with a view toward fast deployment of locomotion gaits in low-cost hardware, we use a linear policy for realizing end-foot trajectories in the quadruped robot, Stoch $2$. In particular, the parameters of the end-foot trajectories are shaped via a linear feedback policy that takes the torso orientation and the terrain slope as inputs. The corresponding desired joint angles are obtained via an inverse kinematics solver and tracked via a PID control law. Augmented Random Search, a model-free and a gradient-free learning algorithm is used to train this linear policy. Simulation results show that the resulting walking is robust to terrain slope variations and external pushes. This methodology is not only computationally light-weight but also uses minimal sensing and actuation capabilities in the robot, thereby justifying the approach.
Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations
Tirumala, Sashank, Gubbi, Sagar, Paigwar, Kartik, Sagi, Aditya, Joglekar, Ashish, Bhatnagar, Shalabh, Ghosal, Ashitava, Amrutur, Bharadwaj, Kolathaya, Shishir
With the research into development of quadruped robots picking up pace, learning based techniques are being explored for developing locomotion controllers for such robots. A key problem is to generate leg trajectories for continuously varying target linear and angular velocities, in a stable manner. In this paper, we propose a two pronged approach to address this problem. First, multiple simpler policies are trained to generate trajectories for a discrete set of target velocities and turning radius. These policies are then augmented using a higher level neural network for handling the transition between the learned trajectories. Specifically, we develop a neural network-based filter that takes in target velocity, radius and transforms them into new commands that enable smooth transitions to the new trajectory. This transformation is achieved by learning from expert demonstrations. An application of this is the transformation of a novice user's input into an expert user's input, thereby ensuring stable manoeuvres regardless of the user's experience. Training our proposed architecture requires much less expert demonstrations compared to standard neural network architectures. Finally, we demonstrate experimentally these results in the in-house quadruped Stoch 2.