LLMs for Law: Evaluating Legal-Specific LLMs on Contract Understanding
Singh, Amrita, Karaca, H. Suhan, Joshi, Aditya, Paik, Hye-young, Jiang, Jiaojiao
arXiv.org Artificial Intelligence
Despite advances in legal NLP, no comprehensive evaluation covering multiple legal-specific LLMs currently exists for contract classification tasks in contract understanding. To address this gap, we present an evaluation of 10 legal-specific LLMs on three English-language contract understanding tasks and compare them with 7 general-purpose LLMs. The results show that legal-specific LLMs consistently outperform general-purpose models, especially on tasks requiring nuanced legal understanding. Legal-BERT and Contracts-BERT establish new SOTAs on two of the three tasks, despite having 69% fewer parameters than the best-performing general-purpose LLM. We also identify CaseLaw-BERT and LexLM as strong additional baselines for contract understanding. Our results provide a holistic evaluation of legal-specific LLMs and will facilitate the development of more accurate contract understanding systems.
Aug-12-2025
- Country:
  - Asia > India (0.14)
  - Europe
    - Germany (0.04)
    - Switzerland (0.04)
  - North America
    - Canada > Ontario
      - Toronto (0.04)
    - United States
      - Florida > Miami-Dade County
        - Miami (0.04)
      - New York (0.04)
  - Oceania > Australia
    - New South Wales (0.04)
  - South America > Chile
- Genre:
  - Research Report > New Finding (0.68)
- Industry:
  - Government > Regional Government
  - Law > Statutes (0.97)
- Technology: