Deep Hypothesis Tests Detect Clinically Relevant Subgroup Shifts in Medical Images