TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents