Fitted Q-iteration in continuous action-space MDPs
