FinEmbedDiff: A Cost-Effective Approach of Classifying Financial Documents with Vector Sampling using Multi-modal Embedding Models

May-28-2024–arXiv.org Artificial Intelligence

Accurate classification of multi-modal financial documents, which contain text, tables, charts, and images, is crucial but challenging. Traditional text-based approaches often fail to capture the multi-modal nature of these documents. We propose FinEmbedDiff, a cost-effective vector sampling method that leverages pre-trained multi-modal embedding models to classify financial documents. Our approach generates multi-modal embedding vectors for documents and compares each new document with pre-computed class embeddings using vector similarity measures. Evaluated on a large dataset, FinEmbedDiff achieves classification accuracy competitive with state-of-the-art baselines while significantly reducing computational costs. The method also generalizes well, making it a practical and scalable solution for real-world financial applications.
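The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how class embeddings are computed or which similarity measure is used, so the sketch assumes each class embedding is the mean of a few sample document embeddings and that similarity is cosine similarity. The function names, labels, and three-dimensional toy vectors are all hypothetical; in practice the vectors would come from a pre-trained multi-modal embedding model.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def mean_vector(vectors: List[Vector]) -> List[float]:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def build_class_embeddings(samples: Dict[str, List[Vector]]) -> Dict[str, List[float]]:
    """Pre-compute one embedding per class (assumption: mean of sampled documents)."""
    return {label: mean_vector(vectors) for label, vectors in samples.items()}


def classify(doc_embedding: Vector, class_embeddings: Dict[str, List[float]]) -> str:
    """Return the label whose class embedding is most similar to the document."""
    return max(
        class_embeddings,
        key=lambda label: cosine_similarity(doc_embedding, class_embeddings[label]),
    )


# Hypothetical toy embeddings standing in for real model output.
SAMPLES = {
    "invoice": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "annual_report": [[0.1, 0.9, 0.2], [0.0, 0.8, 0.3]],
}
CLASS_EMBEDDINGS = build_class_embeddings(SAMPLES)

if __name__ == "__main__":
    new_document = [0.85, 0.15, 0.05]
    print(classify(new_document, CLASS_EMBEDDINGS))
```

Under these assumptions, classifying a new document needs one embedding call plus one similarity computation per class, with no classifier training, which is one plausible reading of the cost savings the abstract claims.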
