Interpretable and Accurate Fine-grained Recognition via Region Grouping