IsBang-BangControlAllYouNeed? SolvingContinuousControlwithBernoulliPolicies
–Neural Information Processing Systems
Real-world robotics tasks commonly manifest ascontrol problems overcontinuous action spaces. When learning to act in such settings, control policies are typically represented as continuous probability distributions that cover all feasible control inputs - often Gaussians. The underlying assumption is that this enables more refined decisions compared to crude policy choices such as discretized controllers, which limit the search space but induce abrupt changes. While switching controls canbeundesirable inpractice astheymaychallenge stability andaccelerate system weardown, they are theoretically feasible and even arise as optimal strategies in some settings.
Neural Information Processing Systems
Feb-11-2026, 15:19:01 GMT
- Country:
- Asia
- Middle East > Republic of Türkiye
- Karaman Province > Karaman (0.04)
- Russia (0.04)
- Middle East > Republic of Türkiye
- Europe > Russia
- Central Federal District > Moscow Oblast > Moscow (0.04)
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Asia
- Technology: