F-coref: Fast, Accurate and Easy to Use Coreference Resolution
Shon Otmazgin, Arie Cattan, Yoav Goldberg
arXiv.org Artificial Intelligence
We introduce fastcoref, a Python package for fast, accurate, and easy-to-use English coreference resolution. The package is pip-installable and offers two modes: an accurate mode based on the LingMess architecture, which provides state-of-the-art coreference accuracy, and a substantially faster model, F-coref, which is the focus of this work. F-coref processes 2.8K OntoNotes documents in 25 seconds on a V100 GPU (compared with 6 minutes for the LingMess model and 12 minutes for the popular AllenNLP coreference model) with only a modest drop in accuracy. The speed comes from a combination of distilling a compact model from the LingMess model and an efficient batching implementation that uses a technique we call leftover batching. Our code is available at https://github.com/shon-otmazgin/fastcoref
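The abstract names leftover batching without describing it. The sketch below shows one plausible reading, offered as an assumption rather than as the fastcoref implementation: each tokenized document is cut into segments of a fixed maximum length, the full-length segments form one batch that needs no padding, and the shorter final segments (the "leftovers") form a second batch padded only to the longest leftover. The names `split_segments`, `leftover_batches`, `padding_naive`, `padding_leftover`, and the `PAD` id are hypothetical and chosen for illustration.

```python
from typing import List, Tuple

PAD = 0  # hypothetical padding token id (assumption, not taken from fastcoref)


def split_segments(doc: List[int], max_len: int) -> Tuple[List[List[int]], List[int]]:
    """Split one tokenized document into full segments and a shorter leftover."""
    cut = len(doc) - len(doc) % max_len
    full = [doc[i:i + max_len] for i in range(0, cut, max_len)]
    return full, doc[cut:]


def leftover_batches(docs: List[List[int]], max_len: int) -> Tuple[List[List[int]], List[List[int]]]:
    """Build an unpadded batch of full segments and a lightly padded leftover batch."""
    full_batch: List[List[int]] = []
    leftovers: List[List[int]] = []
    for doc in docs:
        full, rest = split_segments(doc, max_len)
        full_batch.extend(full)
        if rest:
            leftovers.append(rest)
    # Pad leftovers only to the longest leftover, not to max_len.
    pad_to = max((len(seg) for seg in leftovers), default=0)
    leftover_batch = [seg + [PAD] * (pad_to - len(seg)) for seg in leftovers]
    return full_batch, leftover_batch


def padding_naive(docs: List[List[int]], max_len: int) -> int:
    """Padding tokens needed when every segment is padded to max_len."""
    return sum((-len(doc)) % max_len for doc in docs)


def padding_leftover(docs: List[List[int]], max_len: int) -> int:
    """Padding tokens needed under the leftover-batching sketch above."""
    _, leftover_batch = leftover_batches(docs, max_len)
    real_tokens = sum(len(doc) % max_len for doc in docs)
    return sum(len(seg) for seg in leftover_batch) - real_tokens


if __name__ == "__main__":
    # Three toy "documents" of 10, 7, and 3 token ids.
    docs = [list(range(1, 11)), list(range(1, 8)), list(range(1, 4))]
    full, left = leftover_batches(docs, max_len=4)
    print(len(full), "full segments;", len(left), "leftover segments")
    print("padding (naive):", padding_naive(docs, 4))
    print("padding (leftover batching):", padding_leftover(docs, 4))
```

Under this reading, the saving comes from padding fewer positions: the full-segment batch carries no padding at all, and the leftover batch is only as wide as its longest member.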
Oct-25-2022