Time Will Change Things: An Empirical Study on Dynamic Language Understanding in Social Media Classification
–arXiv.org Artificial Intelligence
Language features are ever-evolving in the real-world social media environment. Many trained models in natural language understanding (NLU), ineffective in semantic inference for unseen features, might consequently struggle with the deteriorating performance in dynamicity. To address this challenge, we empirically study social media NLU in a dynamic setup, where models are trained on the past data and test on the future. It better reflects the realistic practice compared to the commonly-adopted static setup of random data split. To further analyze model adaption to the dynamicity, we explore the usefulness of leveraging some unlabeled data created after a model is trained. The performance of unsupervised domain adaption baselines based on auto-encoding and pseudo-labeling and a joint framework coupling them both are examined in the experiments. Substantial results on four social media tasks imply the universally negative effects of evolving environments over classification accuracy, while auto-encoding and pseudo-labeling collaboratively show the best robustness in dynamicity.
arXiv.org Artificial Intelligence
Oct-6-2022
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- New Mexico (0.04)
- New York
- New York County > New York City (0.14)
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Canada
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- United States
- Europe
- Austria (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Basque Country > Biscay Province
- Bilbao (0.04)
- Catalonia > Barcelona Province
- Italy > Tuscany
- Florence (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Oceania > Australia
- Genre:
- Research Report (0.64)
- Industry:
- Technology: