Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence

Open in new window