AITopics | Generative AI

Generative AI

Automatic Jailbreaking of the Text-to-Image Generative AI Systems

Kim, Minseon, Lee, Hyomin, Gong, Boqing, Zhang, Huishuai, Hwang, Sung Ju

arXiv.org Artificial Intelligence, May-28-2024

Recent AI systems have shown extremely powerful performance, even surpassing human performance, on various tasks such as information retrieval, language generation, and image generation based on large language models (LLMs). At the same time, there are diverse safety risks that can cause the generation of malicious content by circumventing the alignment in LLMs, which is often referred to as jailbreaking. However, most previous works focused only on text-based jailbreaking in LLMs, and the jailbreaking of text-to-image (T2I) generation systems has been relatively overlooked. In this paper, we first evaluate the safety of commercial T2I generation systems, such as ChatGPT, Copilot, and Gemini, on copyright infringement with naive prompts. From this empirical study, we find that Copilot and Gemini block only 12% and 17% of the attacks with naive prompts, respectively, while ChatGPT blocks 84% of them. Then, we further propose a stronger automated jailbreaking pipeline for T2I generation systems, which produces prompts that bypass their safety guards. Our automated jailbreaking framework leverages an LLM optimizer to generate prompts that maximize the degree of violation in the generated images without any weight updates or gradient computation. Surprisingly, our simple yet effective approach successfully jailbreaks ChatGPT with an 11.0% block rate, making it generate copyrighted content 76% of the time. Finally, we explore various defense strategies, such as post-generation filtering and machine unlearning techniques, but find that they are inadequate, which suggests the necessity of stronger defense mechanisms.

infringement, target image, violation, (16 more...)

arXiv.org Artificial Intelligence

2405.16567

Country:

North America > United States > California (0.04)
Europe > Germany (0.04)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Law > Intellectual Property & Technology Law (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Position: Towards Implicit Prompt For Text-To-Image Models

Yang, Yue, Lin, Yuqi, Liu, Hong, Shao, Wenqi, Chen, Runjian, Shang, Hailong, Wang, Yu, Qiao, Yu, Zhang, Kaipeng, Luo, Ping

arXiv.org Artificial Intelligence, May-28-2024

Recent text-to-image (T2I) models have had great success, and many benchmarks have been proposed to evaluate their performance and safety. However, they only consider explicit prompts while neglecting implicit prompts (prompts that hint at a target without explicitly mentioning it). These prompts may circumvent safety constraints and pose potential threats to the applications of these models. This position paper highlights the current state of T2I models toward implicit prompts. We present a benchmark named ImplicitBench and conduct an investigation on the performance and impacts of implicit prompts with popular T2I models. Specifically, we design and collect more than 2,000 implicit prompts of three aspects: General Symbols, Celebrity Privacy, and Not-Safe-For-Work (NSFW) Issues, and evaluate six well-known T2I models' capabilities under these implicit prompts. Experiment results show that (1) T2I models are able to accurately create various target symbols indicated by implicit prompts; (2) implicit prompts bring potential risks of privacy leakage for T2I models; and (3) constraints on NSFW content in most of the evaluated T2I models can be bypassed with implicit prompts. We call for increased attention to the potential and risks of implicit prompts in the T2I community and further investigation into the capabilities and impacts of implicit prompts, advocating for a balanced approach that harnesses their benefits while mitigating their risks.

celebrity, implicit prompt, nsfw content, (13 more...)

arXiv.org Artificial Intelligence

2403.02118

Country:

North America > United States (0.46)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Austria > Vienna (0.14)
(11 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Media > Music (1.00)
Media > Film (1.00)
Leisure & Entertainment > Sports > Olympic Games (0.46)
Leisure & Entertainment > Sports > Basketball (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.33)

Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Sartori, Camilo Chacón, Blum, Christian, Bistaffa, Filippo, Corominas, Guillem Rodríguez

arXiv.org Artificial Intelligence, May-28-2024

The advent of Large Language Models (LLMs) has altered the Natural Language Processing (NLP) landscape, empowering professionals across diverse disciplines with their remarkable ability to generate human-like text. Models like OpenAI's GPT [44], Meta's Llama [45], and Anthropic's Claude 3 [4] have become indispensable collaborators in many people's daily lives, giving rise to innovative products such as ChatGPT for general use, GitHub Copilot for code generation, DALL-E 2 for image creation, and a multitude of voice generators, including OpenAI's text-to-speech API and ElevenLabs's Generative Voice AI. Currently, LLMs are being experimentally applied across various fields, yielding mixed results [3]. While some applications seem questionable, others exhibit spectacular outcomes. One of the most contentious applications is using LLMs for tasks necessitating mathematical reasoning. Given LLMs' inherently probabilistic nature, this application was once deemed implausible. However, recent findings suggest a shift in perspective, particularly with LLMs boasting vast parameter counts [1]. As LLMs continue to scale, new capabilities emerge [48]. Crucially, these opportunities are contingent upon the thoughtful design of prompts, which helps mitigate the risk of LLMs providing irrelevant or inaccurate responses [47].

graph, llm, node, (17 more...)

arXiv.org Artificial Intelligence

2405.18272

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada (0.04)
Europe > Switzerland (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Promising Solution (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Elon Musk's xAI raises $6bn in bid to take on OpenAI

The Guardian, May-27-2024, 14:49:26 GMT

Elon Musk's artificial intelligence company xAI has closed a $6bn (£4.7bn) investment round that will make it among the best-funded challengers to OpenAI. The startup is only a year old, but it has rapidly built its own large language model (LLM), the technology underpinning many of the recent advances in generative artificial intelligence capable of creating human-like text, pictures, video, and voices. The funding round, one of the biggest yet in the burgeoning AI field, values the company at $18bn before taking into account the $6bn investment, Musk said on X, the social network he owns. Generative AI has so far proven very expensive to develop, in part because of the need for huge amounts of computing power and energy to train LLMs. In a blogpost, xAI said: "The funds from the round will be used to take xAI's first products to market, build advanced infrastructure, and accelerate the research and development of future technologies."

large language model, machine learning, natural language, (8 more...)

The Guardian

Country: Europe > France (0.07)

Industry:

Information Technology (0.42)
Banking & Finance (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

What Mark Zuckerberg Should Learn From Horny 19th-Century Telegraph Operators

Slate, May-27-2024, 14:00:00 GMT

"Oh, stop it--you're making me blush," the throaty voice said, laughing off a compliment. Barret Zoph, who'd given the compliment, looked pleased. As he should--Zoph represents OpenAI, the company behind the voice. "We are looking at the future of interaction between ourselves and the machines," promised Mira Murati, OpenAI's chief technology officer. ChatGPT-4o is just one of a wave of new conversational A.I., including the rollout of Meta AI last month.

large language model, machine learning, natural language, (18 more...)

Slate

Country: North America > United States > Pennsylvania (0.05)

Genre: Research Report (0.48)

Industry: Information Technology > Services (0.41)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.82)

Scarlett Johansson's OpenAI clash is just the start of legal wrangles over artificial intelligence

The Guardian, May-27-2024, 13:33:45 GMT

When OpenAI's new voice assistant said it was "doing fantastic" in a launch demo this month, Scarlett Johansson was not. The Hollywood star said she was "shocked, angered and in disbelief" that the updated version of ChatGPT, which can listen to spoken prompts and respond verbally, had a voice "eerily similar" to hers. One of Johansson's signature roles was as the voice of a futuristic version of Siri in the 2013 film Her and, for the actor, the similarity was stark. The OpenAI chief executive, Sam Altman, appeared to acknowledge the film's influence with a one-word post on X on the day of the launch: "her". In a statement, Johansson said Altman had approached her last year to be a voice of ChatGPT and that she had declined for "personal reasons".

large language model, machine learning, natural language, (17 more...)

The Guardian

Country:

North America > United States > Tennessee (0.05)
North America > United States > California (0.05)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Law > Litigation (0.72)
Government > Regional Government > North America Government > United States Government (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

xAI Raises $6 Billion as Elon Musk Aims to Challenge OpenAI

TIME - Tech, May-27-2024, 08:35:00 GMT

Elon Musk's artificial intelligence startup xAI has raised $6 billion to accelerate its challenge to his former allies at OpenAI. The Series B round, announced in a blog post on May 26, comes less than a year after xAI's debut and marks one of the bigger investments in the nascent field of developing AI tools. Musk had been an early supporter of artificial intelligence, backing OpenAI before it introduced ChatGPT in late 2022. He later withdrew his support from the venture and has advocated caution because of the technology's potential dangers. He was among a large group of industry leaders urging a pause to AI development last year.

large language model, machine learning, natural language, (11 more...)

TIME - Tech

Industry:

Banking & Finance > Capital Markets (0.41)
Information Technology > Services (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.95)

How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?

Maity, Subhankar, Deroy, Aniket, Sarkar, Sudeshna

arXiv.org Artificial Intelligence, May-27-2024

Grammatical error correction (GEC) tools, powered by advanced generative artificial intelligence (AI), competently correct linguistic inaccuracies in user input. However, they often fall short in providing essential natural language explanations, which are crucial for learning languages and gaining a deeper understanding of the grammatical rules. There is limited exploration of these tools in low-resource languages such as Bengali. In such languages, grammatical error explanation (GEE) systems should not only correct sentences but also provide explanations for errors. This comprehensive approach can help language learners in their quest for proficiency. Our work introduces a real-world, multi-domain dataset sourced from Bengali speakers of varying proficiency levels and linguistic complexities. This dataset serves as an evaluation benchmark for GEE systems, allowing them to use context information to generate meaningful explanations and high-quality corrections. Various generative pre-trained large language models (LLMs), including GPT-4 Turbo, GPT-3.5 Turbo, Text-davinci-003, Text-babbage-001, Text-curie-001, Text-ada-001, Llama-2-7b, Llama-2-13b, and Llama-2-70b, are assessed against human experts for performance comparison. Our research underscores the limitations in the automatic deployment of current state-of-the-art generative pre-trained LLMs for Bengali GEE. Advocating for human intervention, our findings propose incorporating manual checks to address grammatical errors and improve feedback quality. This approach presents a more suitable strategy to refine the GEC tools in Bengali, emphasizing the educational aspect of language learning.

correction, error type, explanation, (14 more...)

arXiv.org Artificial Intelligence

2406.00039

Country:

Asia > India > West Bengal > Kharagpur (0.04)
North America > Canada > Ontario > Toronto (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(9 more...)

Genre: Research Report (0.69)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings

Wolfe, Robert, Slaughter, Isaac, Han, Bin, Wen, Bingbing, Yang, Yiwei, Rosenblatt, Lucas, Herman, Bernease, Brown, Eva, Qu, Zening, Weber, Nic, Howe, Bill

arXiv.org Artificial Intelligence, May-27-2024

The rapid proliferation of generative AI has raised questions about the competitiveness of lower-parameter, locally tunable, open-weight models relative to high-parameter, API-guarded, closed-weight models in terms of performance, domain adaptation, cost, and generalization. Centering under-resourced yet risk-intolerant settings in government, research, and healthcare, we see for-profit closed-weight models as incompatible with requirements for transparency, privacy, adaptability, and standards of evidence. Yet the performance penalty in using open-weight models, especially in low-data and low-resource settings, is unclear. We assess the feasibility of using smaller, open-weight models to replace GPT-4-Turbo in zero-shot, few-shot, and fine-tuned regimes, assuming access to only a single, low-cost GPU. We assess value-sensitive issues around bias, privacy, and abstention on three additional tasks relevant to those topics. We find that with relatively low effort, very low absolute monetary cost, and relatively little data for fine-tuning, small open-weight models can achieve competitive performance in domain-adapted tasks without sacrificing generality. We then run experiments considering practical issues in bias, privacy, and hallucination risk, finding that open models offer several benefits over closed models. We intend this work as a case study in understanding the opportunity cost of reproducibility and transparency over for-profit state-of-the-art zero-shot performance, finding this cost to be marginal under realistic settings.

arxiv preprint arxiv, fine-tuning, open model, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3630106.3658966

2405.1682

Country:

South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.05)
Asia > Middle East > Jordan (0.04)
North America > United States > North Carolina > Mecklenburg County (0.04)
(6 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Law (1.00)
Health & Medicine (0.87)
Information Technology > Services (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.36)

The Widening Gap: The Benefits and Harms of Generative AI for Novice Programmers

Prather, James, Reeves, Brent, Leinonen, Juho, MacNeil, Stephen, Randrianasolo, Arisoa S., Becker, Brett, Kimmel, Bailey, Wright, Jared, Briggs, Ben

arXiv.org Artificial Intelligence, May-27-2024

Novice programmers often struggle through programming problem solving due to a lack of metacognitive awareness and strategies. Previous research has shown that novices can encounter multiple metacognitive difficulties while programming. Novices are typically unaware of how these difficulties are hindering their progress. Meanwhile, many novices are now programming with generative AI (GenAI), which can provide complete solutions to most introductory programming problems, code suggestions, hints for next steps when stuck, and explanations of cryptic error messages. Its impact on novice metacognition has only started to be explored. Here we replicate a previous study that examined novice programming problem-solving behavior and extend it by incorporating GenAI tools. Through 21 lab sessions consisting of participant observation, interview, and eye tracking, we explore how novices are coding with GenAI tools. Although 20 of 21 students completed the assigned programming problem, our findings show an unfortunate divide in the use of GenAI tools between students who accelerated and students who struggled. Students who accelerated were able to use GenAI to create code they already intended to make and were able to ignore unhelpful or incorrect inline code suggestions. But for students who struggled, our findings indicate that previously known metacognitive difficulties persist, and that GenAI unfortunately can compound them and even introduce new metacognitive difficulties. Furthermore, struggling students often expressed cognitive dissonance about their problem-solving ability, thought they performed better than they did, and finished with an illusion of competence. Based on our observations from both groups, we propose ways to scaffold the novice GenAI experience and make suggestions for future work.

chatgpt, copilot, student, (14 more...)

arXiv.org Artificial Intelligence

2405.17739

Country:

Oceania > Australia > Victoria > Melbourne (0.15)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.06)
(18 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.86)
