An Affective-Taxis Hypothesis for Alignment and Interpretability