Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance

Sep-13-2024, arXiv.org Artificial Intelligence

Ensuring content compliance with community guidelines is crucial for maintaining healthy online social environments. However, traditional human-based compliance checking struggles with scaling due to the increasing volume of user-generated content and a limited number of moderators. Recent advancements in Natural Language Understanding demonstrated by Large Language Models unlock new opportunities for automated content compliance verification. This work evaluates six AI-agents built on Open-LLMs for automated rule compliance checking in Decentralized Social Networks, a challenging environment due to heterogeneous community scopes and rules. Analyzing over 50,000 posts from hundreds of Mastodon servers, we find that AI-agents effectively detect non-compliant content, grasp linguistic subtleties, and adapt to diverse community contexts. Most agents also show high inter-rater reliability and consistency in score justification and suggestions for compliance. Human-based evaluation with domain experts confirmed the agents' reliability and usefulness, rendering them promising tools for semi-automated or human-in-the-loop content moderation systems.

compliance score, llm-moderator, moderation

Country:
- North America
  - United States
    - New Jersey (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - Pennsylvania > Philadelphia County
      - Philadelphia (0.04)
    - New York
      - New York County > New York City (0.04)
      - Erie County > Buffalo (0.04)
    - New Mexico > Santa Fe County
      - Santa Fe (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - California > Santa Clara County
      - Stanford (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Italy
    - Calabria (0.04)
    - Lazio > Rome (0.04)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Media > News (0.47)
- Law > Civil Rights & Constitutional Law (0.46)
- Information Technology
  - Services (0.49)
  - Security & Privacy (0.46)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.94)
