sentiment analysis
Learned in Translation: Contextualized Word Vectors
Computer vision has benefited from initializing multiple deep layers with weights pretrained on large supervised training sets like ImageNet. Natural language processing (NLP) typically sees initialization of only the lowest layer of deep models with pretrained word vectors. In this paper, we use a deep LSTM encoder from an attentional sequence-to-sequence model trained for machine translation (MT) to contextualize word vectors. We show that adding these context vectors (CoVe) improves performance over using only unsupervised word and character vectors on a wide variety of common NLP tasks: sentiment analysis (SST, IMDb), question classification (TREC), entailment (SNLI), and question answering (SQuAD). For fine-grained sentiment analysis and entailment, CoVe improves performance of our baseline models to the state of the art.
- North America > United States > Tennessee > Shelby County > Memphis (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- Asia > China > Zhejiang Province (0.04)
- Africa > South Sudan > Greater Upper Nile > Greater Pibor Administrative Area > Boma (0.04)
- Education (0.93)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.68)
- Information Technology (0.67)
- North America > United States (0.14)
- Europe > Italy > Tuscany > Florence (0.04)
- Asia > China > Tianjin Province > Tianjin (0.04)
- Health & Medicine > Therapeutic Area (0.46)
- Information Technology > Services (0.46)
A Additional Results
The acronym dataset is a QA task that requires models to decode financial acronyms. The FinMA7B-full model achieved the highest ROUGE-1 score of 0.12 and the B.1 Why was the datasheet created? B.2 Has the dataset been used already? If so, where are the results so others can compare (e.g., links to published papers)? Y es, the dataset has already been used. It was employed in the FinLLM Share Task during the FinNLP-AgentScen Workshop at IJCAI 2024, known as the FinLLM Challenge.
- Asia > China > Hubei Province > Wuhan (0.04)
- North America > United States > New York > Suffolk County > Stony Brook (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- (5 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Banking & Finance > Trading (1.00)
- Government (0.93)
- North America > United States > Illinois > Champaign County > Urbana (0.04)
- Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
- Asia > China > Beijing > Beijing (0.04)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Middle East > Qatar (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- (19 more...)
- Education (0.46)
- Information Technology (0.46)
- North America > United States > California > Santa Clara County > Stanford (0.05)
- North America > United States > California > Santa Clara County > Palo Alto (0.05)
- North America > United States > California > Santa Clara County > Mountain View (0.04)
- (2 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Banking & Finance > Trading (1.00)
- Government (0.68)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Security & Privacy (0.93)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.71)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
- Europe > Switzerland > Zürich > Zürich (0.05)
- North America > Canada > Newfoundland and Labrador > Labrador (0.04)
- Media > Photography (0.69)
- Information Technology > Services (0.46)
- Information Technology > Sensing and Signal Processing > Image Processing (1.00)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)