Learning Continuous Action Models in a Real-Time Strategy Environment