Detecting Statements in Text: A Domain-Agnostic Few-Shot Solution

May-9-2024–arXiv.org Artificial Intelligence

Many tasks related to Computational Social Science and Web Content Analysis involve classifying pieces of text based on the claims they contain. State-of-the-art approaches usually involve fine-tuning models on large annotated datasets, which are costly to produce. In light of this, we propose and release a qualitative and versatile few-shot learning methodology as a common paradigm for any claim-based textual classification task. This methodology involves defining the classes as arbitrarily sophisticated taxonomies of claims, and using Natural Language Inference models to obtain the textual entailment between these and a corpus of interest. The performance of these models is then boosted by annotating a minimal sample of data points, dynamically sampled using the well-established statistical heuristic of Probabilistic Bisection. We illustrate this methodology in the context of three tasks: climate change contrarianism detection, topic/stance classification, and depression-related symptom detection.
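To make the sampling step more concrete, the following is a minimal sketch of Probabilistic Bisection used to locate a decision threshold on entailment scores from a handful of annotations. It is an illustration under stated assumptions, not the authors' released implementation: the abstract does not specify how the heuristic is applied, so the idea of tuning a single score threshold, the function names (`probabilistic_bisection`, `annotate`), the assumed annotator reliability `p_correct`, and the synthetic scores are all hypothetical. Real entailment scores would come from an NLI model rather than a random generator.

```python
import random
from bisect import bisect_left
from itertools import accumulate


def median_index(posterior):
    """Index of the first grid point where the cumulative posterior reaches 0.5."""
    cdf = list(accumulate(posterior))
    return min(bisect_left(cdf, 0.5), len(posterior) - 1)


def probabilistic_bisection(oracle, n_queries=30, p_correct=0.8, grid_size=1001):
    """Estimate a threshold in [0, 1] from noisy answers to 'is it above x?'.

    oracle(x) returns True when the answer suggests the threshold lies above x.
    p_correct is the ASSUMED probability that any single answer is right.
    """
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    posterior = [1.0 / grid_size] * grid_size  # uniform prior over thresholds
    for _ in range(n_queries):
        x = grid[median_index(posterior)]  # always query the posterior median
        says_above = oracle(x)
        # Bayes update: up-weight the side the answer points to.
        posterior = [
            p * (p_correct if (g > x) == says_above else 1.0 - p_correct)
            for g, p in zip(grid, posterior)
        ]
        total = sum(posterior)
        posterior = [p / total for p in posterior]
    return grid[median_index(posterior)]


# Synthetic stand-in for NLI entailment probabilities (hypothetical data).
random.seed(0)
scores = [random.random() for _ in range(2000)]
HIDDEN_THRESHOLD = 0.62  # toy ground truth, unknown to the procedure


def annotate(x):
    """Simulate annotating the document whose score is closest to x.

    In this toy setup the annotator says the claim is absent exactly when the
    score is below HIDDEN_THRESHOLD, which means the threshold lies above x.
    """
    nearest = min(scores, key=lambda s: abs(s - x))
    return nearest < HIDDEN_THRESHOLD


estimate = probabilistic_bisection(annotate)
print(f"estimated threshold: {estimate:.3f}")
```

Each query corresponds to one manual annotation, so `n_queries` is the annotation budget. Setting `p_correct` below 1 keeps the posterior from collapsing on a single contradictory label, which is the main reason to prefer this over plain bisection when labels are noisy.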

annotation, taxonomy, threshold

Country:
- Antarctica (0.05)
- Europe > United Kingdom (0.04)
- North America
  - Greenland (0.04)
  - United States > California (0.04)
- Asia > China
  - Hong Kong (0.04)

Genre:
- Research Report (1.00)

Industry:
- Media > News (0.67)
- Health & Medicine > Therapeutic Area
  - Psychiatry/Psychology (0.93)
- Government > Regional Government
  - North America Government > United States Government (0.68)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)
