Generative AI
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
Kim, Minseon, Lee, Hyomin, Gong, Boqing, Zhang, Huishuai, Hwang, Sung Ju
Recent AI systems have shown extremely powerful performance, even surpassing human performance, on various tasks such as information retrieval, language generation, and image generation based on large language models (LLMs). At the same time, there are diverse safety risks that can cause the generation of malicious contents by circumventing the alignment in LLMs, which are often referred to as jailbreaking. However, most of the previous works only focused on the text-based jailbreaking in LLMs, and the jailbreaking of the text-to-image (T2I) generation system has been relatively overlooked. In this paper, we first evaluate the safety of the commercial T2I generation systems, such as ChatGPT, Copilot, and Gemini, on copyright infringement with naive prompts. From this empirical study, we find that Copilot and Gemini block only 12% and 17% of the attacks with naive prompts, respectively, while ChatGPT blocks 84% of them. Then, we further propose a stronger automated jailbreaking pipeline for T2I generation systems, which produces prompts that bypass their safety guards. Our automated jailbreaking framework leverages an LLM optimizer to generate prompts to maximize degree of violation from the generated images without any weight updates or gradient computation. Surprisingly, our simple yet effective approach successfully jailbreaks the ChatGPT with 11.0% block rate, making it generate copyrighted contents in 76% of the time. Finally, we explore various defense strategies, such as post-generation filtering and machine unlearning techniques, but found that they were inadequate, which suggests the necessity of stronger defense mechanisms.
Position: Towards Implicit Prompt For Text-To-Image Models
Yang, Yue, Lin, Yuqi, Liu, Hong, Shao, Wenqi, Chen, Runjian, Shang, Hailong, Wang, Yu, Qiao, Yu, Zhang, Kaipeng, Luo, Ping
Recent text-to-image (T2I) models have had great success, and many benchmarks have been proposed to evaluate their performance and safety. However, they only consider explicit prompts while neglecting implicit prompts (hint at a target without explicitly mentioning it). These prompts may get rid of safety constraints and pose potential threats to the applications of these models. This position paper highlights the current state of T2I models toward implicit prompts. We present a benchmark named ImplicitBench and conduct an investigation on the performance and impacts of implicit prompts with popular T2I models. Specifically, we design and collect more than 2,000 implicit prompts of three aspects: General Symbols, Celebrity Privacy, and Not-Safe-For-Work (NSFW) Issues, and evaluate six well-known T2I models' capabilities under these implicit prompts. Experiment results show that (1) T2I models are able to accurately create various target symbols indicated by implicit prompts; (2) Implicit prompts bring potential risks of privacy leakage for T2I models. (3) Constraints of NSFW in most of the evaluated T2I models can be bypassed with implicit prompts. We call for increased attention to the potential and risks of implicit prompts in the T2I community and further investigation into the capabilities and impacts of implicit prompts, advocating for a balanced approach that harnesses their benefits while mitigating their risks.
Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach
Sartori, Camilo Chacรณn, Blum, Christian, Bistaffa, Filippo, Corominas, Guillem Rodrรญguez
The advent of Large Language Models (LLMs) has altered the Natural Language Processing (NLP) landscape, empowering professionals across diverse disciplines with their remarkable ability to generate human-like text. Models like OpenAI's GPT [44], Meta's Llama [45], and Anthropic's Claude 3 [4] have become indispensable collaborators in many peoples' daily lives; giving rise to innovative products such as ChatGPT for general use, GitHub Copilot for code generation, DALL-E 2 for image creation, and a multitude of voice generators, including OpenAI's text-to-speech API and ElevenLabs's Generative Voice AI. Currently, LLMs are being experimentally applied across various fields, yielding mixed results [3]. While some applications seem questionable, others exhibit spectacular outcomes. One of the most contentious applications is using LLMs for tasks necessitating mathematical reasoning. Given LLMs' inherently probabilistic nature, this application was once deemed implausible. However, recent findings suggest a shift in perspective, particularly with LLMs boasting vast parameter counts [1]. As LLMs continue to scale, new capabilities emerge [48]. Crucially, these opportunities are contingent upon the thoughtful design of prompts, which helps mitigate the risk of LLMs providing irrelevant or inaccurate responses [47]. 1
Elon Musk's xAI raises 6bn in bid to take on OpenAI
Elon Musk's artificial intelligence company xAI has closed a 6bn ( 4.7bn) investment round that will make it among the best-funded challengers to OpenAI. The startup is only a year old, but it has rapidly built its own large language model (LLM), the technology underpinning many of the recent advances in generative artificial intelligence capable of creating human-like text, pictures, video, and voices. The funding round, one of the biggest yet in the burgeoning AI field, values the company at 18bn before taking into account the 6bn investment, Musk said on X, the social network he owns. Generative AI has so far proven very expensive to develop, in part because of the need for huge amounts of computing power and energy to train LLMs. In a blogpost, xAI said: "The funds from the round will be used to take xAI's first products to market, build advanced infrastructure, and accelerate the research and development of future technologies."
What Mark Zuckerberg Should Learn From Horny 19th-Century Telegraph Operators
"Oh, stop it--you're making me blush," the throaty voice said, laughing off a compliment. Barret Zoph, who'd given the compliment, looked pleased. As he should--Zoph represents OpenAI, the company behind the voice. "We are looking at the future of interaction between ourselves and the machines," promised Mira Murati, OpenAI's chief technology officer. ChatGPT-4o is just one of a wave of new conversational A.I., including the rollout of Meta AI last month.
Scarlett Johansson's OpenAI clash is just the start of legal wrangles over artificial intelligence
When OpenAI's new voice assistant said it was "doing fantastic" in a launch demo this month, Scarlett Johansson was not. The Hollywood star said she was "shocked, angered and in disbelief" that the updated version of ChatGPT, which can listen to spoken prompts and respond verbally, had a voice "eerily similar" to hers. One of Johansson's signature roles was as the voice of a futuristic version of Siri in the 2013 film Her and, for the actor, the similarity was stark. The OpenAI chief executive, Sam Altman, appeared to acknowledge the film's influence with a one-word post on X on the day of the launch: "her". In a statement, Johansson said Altman had approached her last year to be a voice of ChatGPT and that she had declined for "personal reasons".
xAI Raises 6 Billion as Elon Musk Aims to Challenge OpenAI
Elon Musk's artificial intelligence startup xAI has raised 6 billion to accelerate its challenge to his former allies at OpenAI. The Series B round, announced in a blog post on May 26, comes less than a year after xAI's debut and marks one of the bigger investments in the nascent field of developing AI tools. Musk had been an early supporter of artificial intelligence, backing OpenAI before it introduced ChatGPT in late 2022. He later withdrew his support from the venture and has advocated caution because of the technology's potential dangers. He was among a large group of industry leaders urging a pause to AI development last year.
How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?
Maity, Subhankar, Deroy, Aniket, Sarkar, Sudeshna
Grammatical error correction (GEC) tools, powered by advanced generative artificial intelligence (AI), competently correct linguistic inaccuracies in user input. However, they often fall short in providing essential natural language explanations, which are crucial for learning languages and gaining a deeper understanding of the grammatical rules. There is limited exploration of these tools in low-resource languages such as Bengali. In such languages, grammatical error explanation (GEE) systems should not only correct sentences but also provide explanations for errors. This comprehensive approach can help language learners in their quest for proficiency. Our work introduces a real-world, multi-domain dataset sourced from Bengali speakers of varying proficiency levels and linguistic complexities. This dataset serves as an evaluation benchmark for GEE systems, allowing them to use context information to generate meaningful explanations and high-quality corrections. Various generative pre-trained large language models (LLMs), including GPT-4 Turbo, GPT-3.5 Turbo, Text-davinci-003, Text-babbage-001, Text-curie-001, Text-ada-001, Llama-2-7b, Llama-2-13b, and Llama-2-70b, are assessed against human experts for performance comparison. Our research underscores the limitations in the automatic deployment of current state-of-the-art generative pre-trained LLMs for Bengali GEE. Advocating for human intervention, our findings propose incorporating manual checks to address grammatical errors and improve feedback quality. This approach presents a more suitable strategy to refine the GEC tools in Bengali, emphasizing the educational aspect of language learning.
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Wolfe, Robert, Slaughter, Isaac, Han, Bin, Wen, Bingbing, Yang, Yiwei, Rosenblatt, Lucas, Herman, Bernease, Brown, Eva, Qu, Zening, Weber, Nic, Howe, Bill
The rapid proliferation of generative AI has raised questions about the competitiveness of lower-parameter, locally tunable, open-weight models relative to high-parameter, API-guarded, closed-weight models in terms of performance, domain adaptation, cost, and generalization. Centering under-resourced yet risk-intolerant settings in government, research, and healthcare, we see for-profit closed-weight models as incompatible with requirements for transparency, privacy, adaptability, and standards of evidence. Yet the performance penalty in using open-weight models, especially in low-data and low-resource settings, is unclear. We assess the feasibility of using smaller, open-weight models to replace GPT-4-Turbo in zero-shot, few-shot, and fine-tuned regimes, assuming access to only a single, low-cost GPU. We assess value-sensitive issues around bias, privacy, and abstention on three additional tasks relevant to those topics. We find that with relatively low effort, very low absolute monetary cost, and relatively little data for fine-tuning, small open-weight models can achieve competitive performance in domain-adapted tasks without sacrificing generality. We then run experiments considering practical issues in bias, privacy, and hallucination risk, finding that open models offer several benefits over closed models. We intend this work as a case study in understanding the opportunity cost of reproducibility and transparency over for-profit state-of-the-art zero shot performance, finding this cost to be marginal under realistic settings.
The Widening Gap: The Benefits and Harms of Generative AI for Novice Programmers
Prather, James, Reeves, Brent, Leinonen, Juho, MacNeil, Stephen, Randrianasolo, Arisoa S., Becker, Brett, Kimmel, Bailey, Wright, Jared, Briggs, Ben
Novice programmers often struggle through programming problem solving due to a lack of metacognitive awareness and strategies. Previous research has shown that novices can encounter multiple metacognitive difficulties while programming. Novices are typically unaware of how these difficulties are hindering their progress. Meanwhile, many novices are now programming with generative AI (GenAI), which can provide complete solutions to most introductory programming problems, code suggestions, hints for next steps when stuck, and explain cryptic error messages. Its impact on novice metacognition has only started to be explored. Here we replicate a previous study that examined novice programming problem solving behavior and extend it by incorporating GenAI tools. Through 21 lab sessions consisting of participant observation, interview, and eye tracking, we explore how novices are coding with GenAI tools. Although 20 of 21 students completed the assigned programming problem, our findings show an unfortunate divide in the use of GenAI tools between students who accelerated and students who struggled. Students who accelerated were able to use GenAI to create code they already intended to make and were able to ignore unhelpful or incorrect inline code suggestions. But for students who struggled, our findings indicate that previously known metacognitive difficulties persist, and that GenAI unfortunately can compound them and even introduce new metacognitive difficulties. Furthermore, struggling students often expressed cognitive dissonance about their problem solving ability, thought they performed better than they did, and finished with an illusion of competence. Based on our observations from both groups, we propose ways to scaffold the novice GenAI experience and make suggestions for future work.