The State of Profanity Obfuscation in Natural Language Processing

Oct-14-2022–arXiv.org Artificial Intelligence

Work on hate speech has made the consideration of rude and harmful examples in scientific publications inevitable. This raises various problems, such as whether or not to obscure profanities. While science must accurately disclose what it does, the unwarranted spread of hate speech is harmful to readers, and increases its internet frequency. While maintaining publications' professional appearance, obfuscating profanities makes it challenging to evaluate the content, especially for non-native speakers. Surveying 150 ACL papers, we discovered that obfuscation is usually employed for English but not other languages, and even so quite uneven. We discuss the problems with obfuscation and suggest a multilingual community resource called PrOf that has a Python module to standardize profanity obfuscation processes. We believe PrOf can help scientific publication policies to make hate speech work accessible and comparable, irrespective of language.

artificial intelligence, computational linguistic, natural language, (15 more...)

arXiv.org Artificial Intelligence

Oct-14-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.05)
  - United States
    - Hawaii (0.04)
    - New Mexico > Santa Fe County
      - Santa Fe (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - California > Los Angeles County
      - Los Angeles (0.04)
- Europe
  - Italy
    - Tuscany > Florence (0.04)
    - Lombardy > Milan (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)

Genre:
- Research Report (1.00)

Industry:
- Law (0.68)
- Information Technology > Security & Privacy (0.47)
- Health & Medicine (0.46)

Technology:
- Information Technology > Artificial Intelligence > Natural Language (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found