Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection