AITopics | invoice

Collaborating Authors

invoice

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives

Amit Dhurandhar, Pin-Yu Chen, Ronny Luss, Chun-Chen Tu, Paishun Ting, Karthikeyan Shanmugam, Payel Das

Neural Information Processing SystemsFeb-14-2026, 12:42:39 GMT

In this paper we propose a novel method that provides contrastive explanations justifying the classification of an input by a black box classifier such as a deep neuralnetwork.

artificial intelligence, explanation, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.49)

Industry: Health & Medicine > Therapeutic Area > Neurology > Autism (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Give Your Problems (and Passwords) to Moltbot, Then Watch It Go

WIREDJan-28-2026, 19:01:04 GMT

A viral new virtual assistant formerly known as Clawdbot is complex and brings security risks--but some early adopters say it feels like the future. Dan Peguine, a tech entrepreneur and marketing consultant based in Lisbon, lets a precocious, lobster-themed AI assistant called Moltbot run much of his life. Peguine, a self-professed early adopter and trendspotter, discovered Moltbot several weeks ago--back then it was Clawdbot--after discussing a vibe-coding side project with friends on WhatsApp. He installed it on his computer, connected it to numerous apps and online accounts, including Google Apps, and was astonished by how capable it was. "I tried it, got interested, then got really obsessed," Peguine says.

machine learning, moltbot, natural language, (21 more...)

WIRED

Country:

Europe > Portugal > Lisbon > Lisbon (0.24)
North America > United States > California (0.14)

Industry:

Information Technology (0.50)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.49)
Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Social Media (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Add feedback

Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives

Amit Dhurandhar, Pin-Yu Chen, Ronny Luss, Chun-Chen Tu, Paishun Ting, Karthikeyan Shanmugam, Payel Das

Neural Information Processing SystemsNov-20-2025, 19:56:28 GMT

We argue that such explanations are natural for humans and are used commonly in domains such as health care and criminology.

artificial intelligence, explanation, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe (0.04)

Genre: Research Report (0.93)

Industry:

Information Technology > Security & Privacy (0.93)
Law Enforcement & Public Safety > Fraud (0.68)
Health & Medicine > Therapeutic Area > Neurology > Autism (0.49)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Security & Privacy (0.93)

Add feedback

Financial Management System for SMEs: Real-World Deployment of Accounts Receivable and Cash Flow Prediction

Małkus, Bartłomiej, Bobek, Szymon, Nalepa, Grzegorz J.

arXiv.org Artificial IntelligenceNov-6-2025

Small and Medium Enterprises (SMEs), particularly freelancers and early-stage businesses, face unique financial management challenges due to limited resources, small customer bases, and constrained data availability. This paper presents the development and deployment of an integrated financial prediction system that combines accounts receivable prediction and cash flow forecasting specifically designed for SME operational constraints. Our system addresses the gap between enterprise-focused financial tools and the practical needs of freelancers and small businesses. The solution integrates two key components: a binary classification model for predicting invoice payment delays, and a multi-module cash flow forecasting model that handles incomplete and limited historical data. A prototype system has been implemented and deployed as a web application with integration into Cluee's platform, a startup providing financial management tools for freelancers, demonstrating practical feasibility for real-world SME financial management.

data mining, machine learning, prediction, (17 more...)

arXiv.org Artificial Intelligence

2511.03631

Country:

North America (0.15)
Europe > Poland (0.14)

Genre: Financial News (0.91)

Industry: Banking & Finance (0.90)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Invoice Information Extraction: Methods and Performance Evaluation

Yashwant, Sai, Dubey, Anurag, Paikray, Praneeth, Thulsiram, Gantala

arXiv.org Artificial IntelligenceOct-23-2025

This paper presents methods for extracting structured information from invoice documents and proposes a set of evaluation metrics (EM) to assess the accuracy of the extracted data against annotated ground truth. The approach involves pre-processing scanned or digital invoices, applying Docling and LlamaCloud Services to identify and extract key fields such as invoice number, date, total amount, and vendor details. To ensure the reliability of the extraction process, we establish a robust evaluation framework comprising field-level precision, consistency check failures, and exact match accuracy. The proposed metrics provide a standardized way to compare different extraction methods and highlight strengths and weaknesses in field-specific performance.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2510.15727

Genre: Research Report (0.82)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

Dynamic Lagging for Time-Series Forecasting in E-Commerce Finance: Mitigating Information Loss with A Hybrid ML Architecture

Sharma, Abhishek, Parush, Anat, Wadhwa, Sumit, Savir, Amihai, Guinard, Anne, Srivastava, Prateek

arXiv.org Artificial IntelligenceSep-25-2025

Accurate forecasting in the e-commerce finance domain is particularly challenging due to irregular invoice schedules, payment deferrals, and user-specific behavioral variability. These factors, combined with sparse datasets and short historical windows, limit the effectiveness of conventional time-series methods. While deep learning and Transformer-based models have shown promise in other domains, their performance deteriorates under partial observability and limited historical data. To address these challenges, we propose a hybrid forecasting framework that integrates dynamic lagged feature engineering and adaptive rolling-window representations with classical statistical models and ensemble learners. Our approach explicitly incorporates invoice-level behavioral modeling, structured lag of support data, and custom stability-aware loss functions, enabling robust forecasts in sparse and irregular financial settings. Empirical results demonstrate an approximate 5% reduction in MAPE compared to baseline models, translating into substantial financial savings. Furthermore, the framework enhances forecast stability over quarterly horizons and strengthens feature target correlation by capturing both short- and long-term patterns, leveraging user profile attributes, and simulating upcoming invoice behaviors. These findings underscore the value of combining structured lagging, invoice-level closure modeling, and behavioral insights to advance predictive accuracy in sparse financial time-series forecasting.

data mining, machine learning, support data, (18 more...)

arXiv.org Artificial Intelligence

2509.20244

Genre: Research Report > New Finding (0.86)

Industry: Information Technology > Services > e-Commerce Services (0.63)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices

Patel, Parshva Dhilankumar

arXiv.org Artificial IntelligenceJul-10-2025

This paper presents a robust system for automated invoice data extraction using a hybrid pipeline that combines OpenCV-based pre-processing with OCR and advanced table extraction techniques. Our approach addresses real-world challenges including skewed perspectives, variable lighting, noise from signatures, barcodes, staplers, and broken table structures. We segment invoices into detail and product sections, apply hybrid table detection using both Img2Table and manual fallback methods, and finally generate structured JSON outputs using row-wise OCR. This method proves particularly effective for physical invoices with multiple products and complex layouts, significantly reducing the need for manual data entry.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.07029

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.70)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)

Add feedback

An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents

Amjad, Ayesha, Sthapit, Saurav, Syed, Tahir Qasim

arXiv.org Artificial IntelligenceMay-21-2025

Extracting alphanumeric data from form-like documents such as invoices, purchase orders, bills, and financial documents is often performed via vision (OCR) and learning algorithms or monolithic pipelines with limited potential for systemic improvements. We propose an agen-tic AI system that leverages Large Language Model (LLM) agents and a reinforcement learning (RL) driver agent to automate consistent, self-improving extraction under LLM inference uncertainty. Our work highlights the limitations of monolithic LLM-based extraction and introduces a modular, multi-agent framework with task-specific prompts and an RL policy of rewards and penalties to guide a meta-prompting agent to learn from past errors and improve prompt-based actor agents. This self-corrective adaptive system handles diverse documents, file formats, layouts, and LLMs, aiming to automate accurate information extraction without the need for human intervention. Results as reported on two benchmark datasets of SOIRE, and CORD, are promising for the agen-tic AI framework.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.13504

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Memory-Augmented Agent Training for Business Document Understanding

Liu, Jiale, Zeng, Yifan, Højmark-Bertelsen, Malte, Gadeberg, Marie Normann, Wang, Huazheng, Wu, Qingyun

arXiv.org Artificial IntelligenceDec-17-2024

Traditional enterprises face significant challenges in processing business documents, where tasks like extracting transport references from invoices remain largely manual despite their crucial role in logistics operations. While Large Language Models offer potential automation, their direct application to specialized business domains often yields unsatisfactory results. We introduce Matrix (Memory-Augmented agent Training through Reasoning and Iterative eXploration), a novel paradigm that enables LLM agents to progressively build domain expertise through experience-driven memory refinement and iterative learning. To validate this approach, we collaborate with one of the world's largest logistics companies to create a dataset of Universal Business Language format invoice documents, focusing on the task of transport reference extraction. Experiments demonstrate that Matrix outperforms prompting a single LLM by 30.3%, vanilla LLM agent by 35.2%. We further analyze the metrics of the optimized systems and observe that the agent system requires less API calls, fewer costs and can analyze longer documents on average. Our methods establish a new approach to transform general-purpose LLMs into specialized business tools through systematic memory enhancement in document processing tasks.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2412.15274

Country: North America > United States > Minnesota (0.28)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.68)
Transportation > Freight & Logistics Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation

Bhattacharyya, Aniket, Tripathi, Anurag

arXiv.org Artificial IntelligenceNov-25-2024

Invoices and receipts submitted by employees are visually rich documents (VRDs) with textual, visual and layout information. To protect against the risk of fraud and abuse, it is crucial for organizations to efficiently extract desired information from submitted receipts. This helps in the assessment of key factors such as appropriateness of the expense claim, adherence to spending and transaction policies, the validity of the receipt, as well as downstream anomaly detection at various levels. These documents are heterogeneous, with multiple formats and languages, uploaded with different image qualities, and often do not contain ground truth labels for the efficient training of models. In this paper we propose Task Aware Instruction-based Labelling (TAIL), a method for synthetic label generation in VRD corpuses without labels, and fine-tune a multimodal Visually Rich Document Understanding Model (VRDU) on TAIL labels using response-based knowledge distillation without using the teacher model's weights or training dataset to conditionally generate annotations in the appropriate format. Using a benchmark external dataset where ground truth labels are available, we demonstrate conditions under which our approach performs at par with Claude 3 Sonnet through empirical studies. We then show that the resulting model performs at par or better on the internal expense documents of a large multinational organization than state-of-the-art LMM (large multimodal model) Claude 3 Sonnet while being 85% less costly and ~5X faster, and outperforms layout-aware baselines by more than 10% in Average Normalized Levenshtein Similarity (ANLS) scores due to its ability to reason and extract information from rare formats. Finally, we illustrate the usage of our approach in overpayment prevention.

dataset, llava-net, receipt, (17 more...)

arXiv.org Artificial Intelligence

2411.14957

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Idaho > Canyon County > Nampa (0.04)
Asia > South Korea (0.04)

Genre: Research Report (0.84)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)

Add feedback