Memorizing Long-tail Data Can Help Generalization Through Composition

Open in new window