Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling