Revisiting Distance Metric Learning for Few-Shot Natural Language Classification