From Variability to Stability: Advancing RecSys Benchmarking Practices