Diagnosing failures of fairness transfer across distribution shift in real-world medical settings