LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
Chalkidis, Ilias, Jana, Abhik, Hartung, Dirk, Bommarito, Michael, Androutsopoulos, Ion, Katz, Daniel Martin, Aletras, Nikolaos
–arXiv.org Artificial Intelligence
Laws and their interpretations, legal arguments and agreements\ are typically expressed in writing, leading to the production of vast corpora of legal text. Their analysis, which is at the center of legal practice, becomes increasingly elaborate as these collections grow in size. Natural language understanding (NLU) technologies can be a valuable tool to support legal practitioners in these endeavors. Their usefulness, however, largely depends on whether current state-of-the-art models can generalize across various tasks in the legal domain. To answer this currently open question, we introduce the Legal General Language Understanding Evaluation (LexGLUE) benchmark, a collection of datasets for evaluating model performance across a diverse set of legal NLU tasks in a standardized way. We also provide an evaluation and analysis of several generic and legal-oriented models demonstrating that the latter consistently offer performance improvements across multiple tasks.
arXiv.org Artificial Intelligence
Nov-8-2022
- Country:
- South America > Chile
- Oceania > Australia
- North America
- United States
- District of Columbia > Washington (0.04)
- Texas > Travis County
- Austin (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Oklahoma > Garfield County
- Enid (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- New York > New York County
- New York City (0.04)
- Canada > British Columbia
- United States
- Europe
- Germany > Hamburg (0.04)
- Netherlands (0.04)
- Switzerland (0.04)
- Greece (0.04)
- Ireland (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Spain
- Basque Country (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Luxembourg > Luxembourg Canton
- Luxembourg City (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Middle East > Republic of Türkiye (0.04)
- Thailand (0.04)
- Philippines (0.04)
- China > Hong Kong (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Technology: