AuditCopilot: Leveraging LLMs for Fraud Detection in Double-Entry Bookkeeping
Kadir, Md Abdul, Vasu, Sai Suresh Macharla, Nair, Sidharth S., Sonntag, Daniel
–arXiv.org Artificial Intelligence
Auditors rely on Journal Entry Tests (JETs) to detect anomalies in tax-related ledger records, but rule-based methods generate overwhelming false positives and struggle with subtle irregularities. We investigate whether large language models (LLMs) can serve as anomaly detectors in double-entry bookkeeping. Benchmarking SoTA LLMs such as LLaMA and Gemma on both synthetic and real-world anonymized ledgers, we compare them against JETs and machine learning baselines. Our results show that LLMs consistently outperform traditional rule-based JETs and classical ML baselines, while also providing natural-language explanations that enhance interpretability. These results highlight the potential of \textbf{AI-augmented auditing}, where human auditors collaborate with foundation models to strengthen financial integrity.
arXiv.org Artificial Intelligence
Dec-3-2025
- Country:
- Europe
- France (0.04)
- Germany
- Lower Saxony > Oldenburg (0.04)
- North Rhine-Westphalia > Düsseldorf Region
- Düsseldorf (0.04)
- Saarland > Saarbrücken (0.04)
- Europe
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance (0.47)
- Law (0.47)
- Law Enforcement & Public Safety > Fraud (0.41)
- Technology: