DYNAMIX: RL-based Adaptive Batch Size Optimization in Distributed Machine Learning Systems