Composing Ensembles of Policies with Deep Reinforcement Learning