Robust identification of thermal models for in-production High-Performance-Computing clusters with machine learning-based data selection