Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting