Searching for Structure in Unfalsifiable Claims
Christensen, Peter Ebert, Warburg, Frederik, Jia, Menglin, Belongie, Serge
–arXiv.org Artificial Intelligence
Social media platforms give rise to an abundance of posts and comments on every topic imaginable. Many of these posts express opinions on various aspects of society, but their unfalsifiable nature makes them ill-suited to fact-checking pipelines. In this work, we aim to distill such posts into a small set of narratives that capture the essential claims related to a given topic. Understanding and visualizing these narratives can facilitate more informed debates on social media. As a first step towards systematically identifying the underlying narratives on social media, we introduce PAPYER, a fine-grained dataset of online comments related to hygiene in public restrooms, which contains a multitude of unfalsifiable claims. We present a human-in-the-loop pipeline that uses a combination of machine and human kernels to discover the prevailing narratives and show that this pipeline outperforms recent large transformer models and state-of-the-art unsupervised topic models.
arXiv.org Artificial Intelligence
Aug-19-2022
- Country:
- North America
- United States
- Ohio (0.04)
- California (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- Europe
- Germany > Berlin (0.04)
- Eastern Europe (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- North America
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Media (1.00)
- Energy (1.00)
- Consumer Products & Services (0.92)
- Materials > Paper & Forest Products (0.74)
- Water & Waste Management > Water Management
- Constituents > Bacteria (0.45)
- Health & Medicine
- Consumer Health (1.00)
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area
- Infections and Infectious Diseases (1.00)
- Immunology (0.92)
- Government > Regional Government
- Education > Educational Setting
- K-12 Education (0.67)
- Technology: