
 multi-lexsum



Neural Information Processing Systems

For what purpose was the dataset created? Was there a specific task in mind? Was there a specific gap that needed to be filled? Please provide a description.

The Multi-LexSum dataset was curated to facilitate the development of automatic summarization methods for civil rights lawsuits. Recent advances in document summarization have led to impressive results in generating a short description for passages that are typically hundreds of words long. However, the source inputs for summarizing civil rights lawsuits are considerably longer: they can contain around 70k words on average.



Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities

Neural Information Processing Systems

With the advent of large language models, methods for abstractive summarization have made great strides, creating potential for use in applications to aid knowledge workers processing unwieldy document collections. One such setting is the Civil Rights Litigation Clearinghouse (CRLC, https://clearinghouse.net), which posts information about large-scale civil rights lawsuits, serving lawyers, scholars, and the general public. Today, summarization in the CRLC requires extensive training of lawyers and law students, who spend hours per case understanding multiple relevant documents in order to produce high-quality summaries of key events and outcomes. Motivated by this ongoing real-world summarization effort, we introduce Multi-LexSum, a collection of 9,280 expert-authored summaries drawn from ongoing CRLC writing. Multi-LexSum presents a challenging multi-document summarization task given the length of the source documents, often exceeding two hundred pages per case.


Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities

Zejiang Shen, Kyle Lo, Lauren Yu, Nathan Dahlberg, Margo Schlanger, Doug Downey

arXiv.org Artificial Intelligence

With the advent of large language models, methods for abstractive summarization have made great strides, creating potential for use in applications to aid knowledge workers processing unwieldy document collections. One such setting is the Civil Rights Litigation Clearinghouse (CRLC, https://clearinghouse.net), which posts information about large-scale civil rights lawsuits, serving lawyers, scholars, and the general public. Today, summarization in the CRLC requires extensive training of lawyers and law students who spend hours per case understanding multiple relevant documents in order to produce high-quality summaries of key events and outcomes. Motivated by this ongoing real-world summarization effort, we introduce Multi-LexSum, a collection of 9,280 expert-authored summaries drawn from ongoing CRLC writing. Multi-LexSum presents a challenging multi-document summarization task given the length of the source documents, often exceeding two hundred pages per case. Furthermore, Multi-LexSum is distinct from other datasets in its multiple target summaries, each at a different granularity (ranging from one-sentence "extreme" summaries to multi-paragraph narrations of over five hundred words). We present extensive analysis demonstrating that despite the high-quality summaries in the training data (adhering to strict content and style guidelines), state-of-the-art summarization models perform poorly on this task. We release Multi-LexSum for further research in summarization methods as well as to facilitate development of applications to assist in the CRLC's mission at https://multilexsum.github.io.
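The multiple target granularities described above (one-sentence "tiny" summaries up to multi-paragraph "long" narrations) suggest a simple consumer-side pattern: pick the richest summary that fits a display or token budget. The sketch below illustrates this on a fabricated example record; the field names ("sources", "summary/long", "summary/short", "summary/tiny") are assumptions modeled on the project's published schema, not verified against the actual release.

```python
# Minimal sketch: choose the longest available Multi-LexSum-style summary
# that fits a word budget. Field names are assumptions (see lead-in).

def pick_summary(record: dict, max_words: int):
    """Return the richest summary within the word budget, or None."""
    # Ordered from most to least detailed granularity.
    for key in ("summary/long", "summary/short", "summary/tiny"):
        text = record.get(key)
        if text is not None and len(text.split()) <= max_words:
            return text
    return None

# Hypothetical record mimicking the multi-granularity targets;
# the content here is fabricated for illustration only.
case = {
    "sources": ["<full text of docket filings, often 200+ pages>"],
    "summary/long": " ".join(["word"] * 600),   # multi-paragraph narration
    "summary/short": " ".join(["word"] * 120),  # paragraph-length summary
    "summary/tiny": "Prisoners sued the state over conditions of confinement.",
}

print(pick_summary(case, max_words=25))  # only the tiny summary fits
```

Note the design choice: falling back through granularities (rather than truncating the long summary) preserves the expert-authored phrasing at each level, which is the point of having multiple reference targets in the first place.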