Not enough data to create a plot.
Try a different view from the menu above.
Singh, Arambam James
CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous Driving
Gao, Xinwei, Singh, Arambam James, Royyuru, Gangadhar, Yuhas, Michael, Easwaran, Arvind
Lane keeping in autonomous driving systems requires scenario-specific weight tuning for different objectives. We formulate lane-keeping as a constrained reinforcement learning problem, where weight coefficients are automatically learned along with the policy, eliminating the need for scenario-specific tuning. Empirically, our approach outperforms traditional RL in efficiency and reliability. Additionally, real-world demonstrations validate its practical value for real-world autonomous driving.
Robust Decision Making for Stochastic Network Design
Kumar, Akshat (Singapore Management University) | Singh, Arambam James (Singapore Management University) | Varakantham, Pradeep (Singapore Management University) | Sheldon, Daniel (University of Massachusetts Amherst)
We address the problem of robust decision making for stochastic network design. Our work is motivated by spatial conservation planning where the goal is to take management decisions within a fixed budget to maximize the expected spread of a population of species over a network of land parcels. Most previous work for this problem assumes that accurate estimates of different network parameters (edge activation probabilities, habitat suitability scores) are available, which is an unrealistic assumption. To address this shortcoming, we assume that network parameters are only partially known, specified via interval bounds. We then develop a decision making approach that computes the solution with minimax regret. We provide new theoretical results regarding the structure of the minmax regret solution which help develop a computationally efficient approach. Empirically, we show that previous approaches that work on point estimates of network parameters result in high regret on several standard benchmarks, while our approach provides significantly more robust solutions.