Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction
Kim, Taeuk, Choi, Jihun, Edmiston, Daniel, Lee, Sang-goo
–arXiv.org Artificial Intelligence
With the recent success and popularity of pre-trained language models (LMs) in natural language processing, there has been a rise in efforts to understand their inner workings. In line with such interest, we propose a novel method that assists us in investigating the extent to which pre-trained LMs capture the syntactic notion of constituency. Our method provides an effective way of extracting constituency trees from the pre-trained LMs without training. In addition, we report intriguing findings in the induced trees, including the fact that some pre-trained LMs outperform other approaches in correctly demarcating adverb phrases in sentences.
arXiv.org Artificial Intelligence
Jan-30-2020
- Country:
- Oceania > Australia
- North America
- Canada (0.04)
- United States
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.15)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California > San Diego County
- San Diego (0.04)
- Pennsylvania > Philadelphia County
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Tuscany
- Florence (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Spain > Catalonia
- Asia
- China > Hong Kong (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Technology: