Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery
Lahiri, Sounak, Pai, Sumit, Weninger, Tim, Bhattacharya, Sanmitra
–arXiv.org Artificial Intelligence
Electronic Discovery (eDiscovery) involves identifying relevant documents from a vast collection based on legal production requests. The integration of artificial intelligence (AI) and natural language processing (NLP) has transformed this process, helping document review and enhance efficiency and cost-effectiveness. Although traditional approaches like BM25 or fine-tuned pre-trained models are common in eDiscovery, they face performance, computational, and interpretability challenges. In contrast, Large Language Model (LLM)-based methods prioritize interpretability but sacrifice performance and throughput. This paper introduces DISCOvery Graph (DISCOG), a hybrid approach that combines the strengths of two worlds: a heterogeneous graph-based method for accurate document relevance prediction and subsequent LLM-driven approach for reasoning. Graph representational learning generates embeddings and predicts links, ranking the corpus for a given request, and the LLMs provide reasoning for document relevance. Our approach handles datasets with balanced and imbalanced distributions, outperforming baselines in F1-score, precision, and recall by an average of 12%, 3%, and 16%, respectively. In an enterprise context, our approach drastically reduces document review costs by 99.9% compared to manual processes and by 95% compared to LLM-based classification methods
arXiv.org Artificial Intelligence
May-29-2024
- Country:
- Asia
- India > Karnataka
- Bengaluru (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Singapore (0.04)
- India > Karnataka
- Europe > France (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Indiana > St. Joseph County
- Notre Dame (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.04)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Indiana > St. Joseph County
- Asia
- Genre:
- Research Report (0.64)
- Industry:
- Law > Litigation (1.00)
- Technology: