High-Throughput LLM inference on Heterogeneous Clusters