EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes