WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain

Shah, Raj Sanjay, Chawla, Kunal, Eidnani, Dheeraj, Shah, Agam, Du, Wendi, Chava, Sudheer, Raman, Natraj, Smiley, Charese, Chen, Jiaao, Yang, Diyi

Oct-31-2022–arXiv.org Artificial Intelligence

Pre-trained language models have shown impressive performance on a variety of tasks and domains. Previous research on financial language models usually employs a generic training scheme to train standard model architectures, without fully leveraging the richness of the financial data. We propose a novel domain-specific Financial LANGuage model (FLANG) which uses financial keywords and phrases for better masking, together with a span boundary objective and an in-filling objective. Additionally, the evaluation benchmarks in the field have been limited. To this end, we contribute the Financial Language Understanding Evaluation (FLUE), an open-source comprehensive suite of benchmarks for the financial domain. These include new benchmarks across 5 NLP tasks in the financial domain as well as common benchmarks used in previous research. Experiments on these benchmarks suggest that our model outperforms those in prior literature on a variety of NLP tasks. Our models, code and benchmark data are publicly available on GitHub and Hugging Face.
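The abstract's idea of using financial keywords for better masking can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `keyword_aware_mask`, the whitespace tokenization, and the probabilities `p_keyword` and `p_other` are all assumptions chosen for illustration, and the span boundary and in-filling objectives are not covered.

```python
import random

MASK = "[MASK]"  # placeholder mask symbol (assumed, not taken from the paper)


def keyword_aware_mask(tokens, keywords, p_keyword=0.3, p_other=0.1, seed=None):
    """Mask tokens for masked-language-model training, preferring keywords.

    Tokens found in `keywords` are masked with probability `p_keyword`,
    all other tokens with probability `p_other`. Both values are
    illustrative assumptions, not the settings used for FLANG.

    Returns (masked_tokens, labels), where labels[i] holds the original
    token at masked positions and None elsewhere.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        p = p_keyword if tok.lower() in keywords else p_other
        if rng.random() < p:
            masked.append(MASK)
            labels.append(tok)   # the model would be trained to recover this token
        else:
            masked.append(tok)
            labels.append(None)  # position is not a prediction target
    return masked, labels
```

For example, calling `keyword_aware_mask("The central bank raised the interest rate".split(), {"bank", "interest", "rate"})` masks the financial terms more often than the surrounding words, so more of the training signal falls on domain vocabulary.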

artificial intelligence, machine learning, natural language

Country:
- North America > United States
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - California > Santa Clara County
    - Palo Alto (0.04)
- Asia > British Indian Ocean Territory
  - Diego Garcia (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Banking & Finance > Trading (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
