Diversity-Aware Batch Active Learning for Dependency Parsing
Shi, Tianze, Benton, Adrian, Malioutov, Igor, İrsoy, Ozan
–arXiv.org Artificial Intelligence
While the predictive performance of modern statistical dependency parsers relies heavily on the availability of expensive expert-annotated treebank data, not all annotations contribute equally to the training of the parsers. In this paper, we attempt to reduce the number of labeled examples needed to train a strong dependency parser using batch active learning (AL). In particular, we investigate whether enforcing diversity in the sampled batches, using determinantal point processes (DPPs), can improve over their diversity-agnostic counterparts. Simulation experiments on an English newswire corpus show that selecting diverse batches with DPPs is superior to strong selection strategies that do not enforce batch diversity, especially during the initial stages of the learning process. Additionally, our diversityaware strategy is robust under a corpus duplication setting, where diversity-agnostic sampling strategies exhibit significant degradation.
arXiv.org Artificial Intelligence
Apr-28-2021
- Country:
- Asia
- China > Hong Kong (0.04)
- Japan (0.04)
- Middle East > Jordan (0.04)
- Europe
- Spain > Basque Country
- Biscay Province > Bilbao (0.04)
- Czechia > Prague (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- France (0.04)
- United Kingdom > England (0.04)
- Sweden > Uppsala County
- Uppsala (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany (0.04)
- Italy > Tuscany
- Florence (0.04)
- Slovenia > Upper Carniola
- Municipality of Bled > Bled (0.04)
- Spain > Basque Country
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Puerto Rico (0.04)
- United States
- California
- Orange County > Santa Ana (0.04)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.04)
- Mississippi (0.04)
- Wyoming (0.04)
- District of Columbia > Washington (0.04)
- Virginia (0.04)
- Idaho (0.04)
- Nebraska (0.04)
- Vermont (0.04)
- Louisiana (0.04)
- Kentucky > Jefferson County
- Louisville (0.04)
- Ohio (0.04)
- Oklahoma (0.04)
- New Mexico (0.04)
- Iowa (0.04)
- North Dakota (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- New Hampshire (0.04)
- Michigan (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Kansas (0.04)
- Maryland (0.04)
- Oregon (0.04)
- South Dakota (0.04)
- New York (0.05)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Indiana (0.04)
- Utah (0.04)
- Minnesota (0.04)
- Arkansas (0.04)
- Maine (0.04)
- Alaska (0.04)
- South Carolina (0.04)
- California
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- South America > Uruguay (0.04)
- Asia
- Genre:
- Research Report (0.64)
- Industry:
- Banking & Finance (0.93)
- Government > Regional Government (0.46)
- Technology: