AITopics | dirty word

dirty word

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis

Wang, Longtian, Xie, Xiaofei, Li, Tianlin, Zhi, Yuhan, Shen, Chao

arXiv.org Artificial Intelligence May-15-2025

Text-to-image (T2I) models have significantly advanced in producing high-quality images. However, such models have the ability to generate images containing not-safe-for-work (NSFW) content, such as pornography, violence, political content, and discrimination. To mitigate the risk of generating NSFW content, refusal mechanisms, i.e., safety checkers, have been developed to check potential NSFW content. Adversarial prompting techniques have been developed to evaluate the robustness of the refusal mechanisms. The key challenge remains to subtly modify the prompt in a way that preserves its sensitive nature while bypassing the refusal mechanisms. In this paper, we introduce TokenProber, a method designed for sensitivity-aware differential testing, aimed at evaluating the robustness of the refusal mechanisms in T2I models by generating adversarial prompts. Our approach is based on the key observation that adversarial prompts often succeed by exploiting discrepancies in how T2I models and safety checkers interpret sensitive content. Thus, we conduct a fine-grained analysis of the impact of specific words within prompts, distinguishing between dirty words that are essential for NSFW content generation and discrepant words that highlight the different sensitivity assessments between T2I models and safety checkers. Through the sensitivity-aware mutation, TokenProber generates adversarial prompts, striking a balance between maintaining NSFW content generation and evading detection. Our evaluation of TokenProber against 5 safety checkers on 3 popular T2I models, using 324 NSFW prompts, demonstrates its superior effectiveness in bypassing safety filters compared to existing methods (e.g., 54%+ increase on average), highlighting TokenProber's ability to uncover robustness issues in the existing refusal mechanisms. The source code, datasets, and experimental results are available in [1]. Warning: This paper contains model outputs that are offensive in nature.
Text-to-image (T2I) models have gained widespread attention due to their excellent capability in synthesizing high-quality images. T2I models, such as Stable Diffusion [2] and DALL·E [3], process the textual descriptions provided by users, namely prompts, and output images that match the descriptions. Such models have been widely used to generate various types of images; for example, Lexica [4] contains more than five million images generated by Stable Diffusion.

machine learning, natural language, safety checker, (19 more...)

arXiv.org Artificial Intelligence

2505.08804

Country:

Asia > Singapore (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (0.95)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

A bestseller is born: How Zuckerberg discovered the Streisand Effect

New Scientist Apr-2-2025, 18:00:00 GMT

Feedback is New Scientist's popular sideways look at the latest science and technology news. You can submit items you believe may amuse readers to Feedback by emailing feedback@newscientist.com. Some things are sadly inevitable: death, taxes, another Coldplay album. One such inevitability, long since proved beyond any reasonable doubt, is that if you try to suppress an embarrassing story, you will only draw more attention to it. This phenomenon is called the Streisand Effect, after an incident in 2003 when Barbra Streisand sued to have an aerial photograph taken off the internet.

bestseller, streisand effect, zuckerberg, (14 more...)

New Scientist

Country:

Europe > United Kingdom > England > Lincolnshire > Scunthorpe (0.06)
North America > United States > Hawaii (0.05)
North America > United States > California (0.05)
(2 more...)

Industry:

Law (0.72)
Information Technology > Services (0.31)

Technology:

Information Technology > Communications > Social Media (0.54)
Information Technology > Artificial Intelligence > Robots (0.34)

Is 'Artificial Intelligence' a Dirty Word?

#artificialintelligence Dec-5-2020, 02:25:12 GMT

No one seems to have investment dollars, patience, or the right skill sets in their manufacturing departments, along with a sage-like understanding of the applications and data to really drive adoption and value in manufacturing. And we see existing companies already starting out with near insurmountable challenges just in core fundamental items, let alone these advanced concepts. For example, most companies don't have a single type of Bill of Material (BOM) construct. They don't share a commonly governed set of master data – item master, vendor, customer, chart of accounts, etc. They have multiple code sets and versions of ERP and MES software, and different PLCs and sensors capturing data, so that if they ever did get patience and investment capability, they would be unable to build and maintain all of the cross references and algorithms required because of all of the different systems and master data.

artificial intelligence, dirty word

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (0.40)

Thrilled that AI is no longer a dirty word

#artificialintelligence Apr-18-2016, 22:55:35 GMT

Cognitive computing, artificial intelligence and machine learning are here to stay and promise to benefit both consumers and the organizations that exploit these advanced technologies. That was the sentiment from "Dawn of the Cognitive Era" panelists representing mostly startups (startup wannabe IBM being the exception) at the annual TiE StartupCon event in Boston this past week. Whereas it wasn't long ago that the public's view of AI was influenced disproportionately by books and movies, an increasing number of real-life cognitive computing applications such as those enabled by IBM Watson have begun to seep into the public's consciousness. In fact, many people are taking advantage of cognitive computing, whether or not they realize it, when they use tools such as Apple's Siri or various bots, said panel moderator and DataXylo CEO Abhi Yadav. Such applications, enabled in large part through access to relatively cheap computing power via the cloud, have resulted in the technology finally living up to the hype and dispelling fears that it will lord over us.

artificial intelligence, dirty word, machine learning, (9 more...)

#artificialintelligence

Industry: Information Technology (0.98)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)
