Linear Fitted-Q Iteration with Multiple Reward Functions
