SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions

Open in new window