Approximation of Convex Envelope Using Reinforcement Learning
Borkar, Vivek S., Akarsh, Adit
arXiv.org Artificial Intelligence
Oberman gave a stochastic control formulation of the problem of estimating the convex envelope of a non-convex function. Based on this, we develop a reinforcement learning scheme to approximate the convex envelope, using a variant of Q-learning for controlled optimal stopping. The scheme shows very promising results on a standard library of test problems.
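To make the optimal-stopping view concrete, here is a minimal sketch on a uniform 1D grid. It is not the Q-learning scheme of the paper: it is a plain deterministic fixed-point iteration, chosen for illustration, in which each interior grid point either stops and pays g or continues with a symmetric random-walk step. The function name `convex_envelope_1d`, the tolerance, the sweep limit, and the double-well test function are all assumptions made for this example.

```python
def convex_envelope_1d(g, tol=1e-12, max_sweeps=100_000):
    """Approximate the convex envelope of samples g on a uniform 1D grid.

    Iterates u[i] <- min(g[i], (u[i-1] + u[i+1]) / 2) until the largest
    update in a sweep falls below tol. "Stop" pays g[i]; "continue" takes
    the average of the two neighbours. Endpoints are kept equal to g.
    """
    u = list(g)  # start from g itself, which lies above the envelope
    for _ in range(max_sweeps):
        change = 0.0
        for i in range(1, len(u) - 1):
            new = min(g[i], 0.5 * (u[i - 1] + u[i + 1]))
            change = max(change, abs(u[i] - new))
            u[i] = new  # in-place update reuses the freshest neighbour values
        if change < tol:
            break
    return u


# Illustrative non-convex input (an assumption, not from the paper):
# the double well g(x) = (x^2 - 1)^2 sampled on [-2, 2].
xs = [-2.0 + 4.0 * i / 40 for i in range(41)]
g = [(x * x - 1.0) ** 2 for x in xs]
u = convex_envelope_1d(g)
```

On this input the iterate stays at g outside [-1, 1], where g is already convex, and flattens toward zero between the two wells. A learning-based scheme such as the one described above would presumably replace the exact neighbour average with sampled transitions.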
Nov-24-2023
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > India
- Maharashtra > Mumbai (0.05)
- Genre:
- Research Report (0.64)
- Technology: