An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text
Kementchedjhieva, Yova, Chalkidis, Ilias
–arXiv.org Artificial Intelligence
Standard methods for multi-label text classification largely rely on encoder-only pre-trained language models, whereas encoder-decoder models have proven more effective in other classification tasks. In this study, we compare four methods for multi-label classification, two based on an encoder only, and two based on an encoder-decoder. We carry out experiments on four datasets -- two in the legal domain and two in the biomedical domain, each with two levels of label granularity -- and always depart from the same pre-trained model, T5. Our results show that encoder-decoder methods outperform encoder-only methods, with a growing advantage on more complex datasets and labeling schemes of finer granularity. Using encoder-decoder models in a non-autoregressive fashion, in particular, yields the best performance overall, so we further study this approach through ablations to better understand its strengths.
arXiv.org Artificial Intelligence
May-9-2023
- Country:
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > Los Angeles County
- Long Beach (0.04)
- Minnesota > Hennepin County
- Europe
- United Kingdom (0.28)
- Slovenia > Upper Carniola
- Municipality of Bled > Bled (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Law > International Law (0.67)
- Health & Medicine > Therapeutic Area
- Infections and Infectious Diseases (1.00)
- Immunology (1.00)
- Government > Regional Government
- Europe Government (0.68)
- Technology: