Generative AI
Art and the science of generative AI: A deeper dive
Epstein, Ziv, Hertzmann, Aaron, Herman, Laura, Mahari, Robert, Frank, Morgan R., Groh, Matthew, Schroeder, Hope, Smith, Amy, Akten, Memo, Fjeld, Jessica, Farid, Hany, Leach, Neil, Pentland, Alex, Russakovsky, Olga
A new class of tools, colloquially called generative AI, can produce high-quality artistic media for visual arts, concept art, music, fiction, literature, video, and animation. The generative capabilities of these tools are likely to fundamentally alter the creative processes by which creators formulate ideas and put them into production. As creativity is reimagined, so too may be many sectors of society. Understanding the impact of generative AI - and making policy decisions around it - requires new interdisciplinary scientific inquiry into culture, economics, law, algorithms, and the interaction of technology and creativity. We argue that generative AI is not the harbinger of art's demise, but rather is a new medium with its own distinct affordances. In this vein, we consider the impacts of this new medium on creators across four themes: aesthetics and culture, legal questions of ownership and credit, the future of creative work, and impacts on the contemporary media ecosystem. Across these themes, we highlight key research questions and directions to inform policy and beneficial uses of the technology.
ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models
Jentzsch, Sophie, Kersting, Kristian
Humor is a central aspect of human communication that has not been solved for artificial agents so far. Large language models (LLMs) are increasingly able to capture implicit and contextual information. Especially, OpenAI's ChatGPT recently gained immense public attention. The GPT3-based model almost seems to communicate on a human level and can even tell jokes. Humor is an essential component of human communication. But is ChatGPT really funny? We put ChatGPT's sense of humor to the test. In a series of exploratory experiments around jokes, i.e., generation, explanation, and detection, we seek to understand ChatGPT's capability to grasp and reproduce human humor. Since the model itself is not accessible, we applied prompt-based experiments. Our empirical evidence indicates that jokes are not hard-coded but mostly also not newly generated by the model. Over 90% of 1008 generated jokes were the same 25 Jokes. The system accurately explains valid jokes but also comes up with fictional explanations for invalid jokes. Joke-typical characteristics can mislead ChatGPT in the classification of jokes. ChatGPT has not solved computational humor yet but it can be a big leap toward "funny" machines.
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale
Bianchi, Federico, Kalluri, Pratyusha, Durmus, Esin, Ladhak, Faisal, Cheng, Myra, Nozza, Debora, Hashimoto, Tatsunori, Jurafsky, Dan, Zou, James, Caliskan, Aylin
Machine learning models that convert user-written text descriptions into images are now widely available online and used by millions of users to generate millions of images a day. We investigate the potential for these models to amplify dangerous and complex stereotypes. We find a broad range of ordinary prompts produce stereotypes, including prompts simply mentioning traits, descriptors, occupations, or objects. For example, we find cases of prompting for basic traits or social roles resulting in images reinforcing whiteness as ideal, prompting for occupations resulting in amplification of racial and gender disparities, and prompting for objects resulting in reification of American norms. Stereotypes are present regardless of whether prompts explicitly mention identity and demographic language or avoid such language. Moreover, stereotypes persist despite mitigation strategies; neither user attempts to counter stereotypes by requesting images with specific counter-stereotypes nor institutional attempts to add system ``guardrails'' have prevented the perpetuation of stereotypes. Our analysis justifies concerns regarding the impacts of today's models, presenting striking exemplars, and connecting these findings with deep insights into harms drawn from social scientific and humanist disciplines. This work contributes to the effort to shed light on the uniquely complex biases in language-vision models and demonstrates the ways that the mass deployment of text-to-image generation models results in mass dissemination of stereotypes and resulting harms.
Governments worldwide rush to place regulations on artificial intelligence, a rapidly growing technology
Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. Rapid advances in artificial intelligence (AI) such as Microsoft-backed OpenAI's ChatGPT are complicating governments' efforts to agree laws governing the use of the technology. The government is consulting Australia's main science advisory body and considering next steps, a spokesperson for the industry and science minister said in April. The Financial Conduct Authority, one of several state regulators that has been tasked with drawing up new guidelines covering AI, is consulting with the Alan Turing Institute and other legal and academic institutions to improve its understanding of the technology, a spokesperson told Reuters.
Apple Ghosts the Generative AI Revolution
After years of anticipation and contributions from thousands of people, Apple Vision Pro made its debut yesterday, promising immersion in apps, games, movies, and the workplace. With more than 20 cameras, sensors, and microphones, two processing chips, and even an external battery you carry in your pocket, it's packed full of world-class tech, but missing an element that seems to be everywhere else right now: Generative AI. Since the launch of OpenAI's ChatGPT last fall, generative AI that creates text and imagery from simple prompts triggered calls for regulation and fear of an existential threat to humanity, and continues to play a role in ongoing Hollywood writers union strikes. It's also led Big Tech companies to speed up AI deployments, but not at Apple. Yesterday Apple announced new features powered by its neural engine hardware--like call screening that transcribes the first few words of a voicemail live so you can decide whether to pick up a call--but there was no mention of generative AI during the two-hour Worldwide Developer Conference keynote address.
House Democrat bill would force labeling of AI use
Harvey Castro talks about how AI cold be used in cold cases and the symbiotic relationship between AI and a detective. A new bill introduced in the House of Representatives on Monday is aimed at making sure American consumers know the difference between fantasy and reality online by cracking down on generative artificial intelligence technology. Rep. Ritchie Torres, D-N.Y., is leading the effort on the AI Disclosure Act of 2023, which would force AI-generated content to include the disclaimer, "Disclaimer: this output has been generated by artificial intelligence." In a statement announcing the bill, Torres predicted that "regulatory framework for managing the existential risks of AI will be one of the central challenges confronting Congress in the years and decades to come." He noted risks in going too far with policing AI as well as not regulating it enough.
City of Yokosuka adopts ChatGPT after favorable trial results
The city's authority was the nation's first local government to start trial use of the generative AI, which is driven by a machine learning model that works much like the human brain. During the trial, its officials used the AI tool to make bulletins, summarize records of meetings and edit documents for typographical errors, among other purposes. This could be due to a conflict with your ad-blocking or security software. Please add japantimes.co.jp and piano.io to your list of allowed sites. If this does not resolve the issue or you are unable to add the domains to your allowlist, please see this FAQ.
Phoenix: A Federated Generative Diffusion Model
Jothiraj, Fiona Victoria Stanley, Mashhadi, Afra
Generative AI has made impressive strides in enabling users to create diverse and realistic visual content such as images, videos, and audio. However, training generative models on large centralized datasets can pose challenges in terms of data privacy, security, and accessibility. Federated learning (FL) is an approach that uses decentralized techniques to collaboratively train a shared deep learning model while retaining the training data on individual edge devices to preserve data privacy. This paper proposes a novel method for training a Denoising Diffusion Probabilistic Model (DDPM) across multiple data sources using FL techniques. Diffusion models, a newly emerging generative model, show promising results in achieving superior quality images than Generative Adversarial Networks (GANs). Our proposed method Phoenix is an unconditional diffusion model that leverages strategies to improve the data diversity of generated samples even when trained on data with statistical heterogeneity or Non-IID (Non-Independent and Identically Distributed) data. We demonstrate how our approach outperforms the default diffusion model in an FL setting. These results indicate that high-quality samples can be generated by maintaining data diversity, preserving privacy, and reducing communication between data sources, offering exciting new possibilities in the field of generative AI.
Understanding Place Identity with Generative AI
Jang, Kee Moon, Chen, Junda, Kang, Yuhao, Kim, Junghwan, Lee, Jinhyung, Duarte, Fรกbio
Researchers are constantly leveraging new forms of data with the goal of understanding how people perceive the built environment and build the collective place identity of cities. Latest advancements in generative artificial intelligence (AI) models have enabled the production of realistic representations learned from vast amounts of data. In this study, we aim to test the potential of generative AI as the source of textual and visual information in capturing the place identity of cities assessed by filtered descriptions and images. We asked questions on the place identity of a set of 31 global cities to two generative AI models, ChatGPT and DALL-E2. Since generative AI has raised ethical concerns regarding its trustworthiness, we performed cross-validation to examine whether the results show similar patterns to real urban settings. In particular, we compared the outputs with Wikipedia data for text and images searched from Google for image. Our results indicate that generative AI models have the potential to capture the collective image of cities that can make them distinguishable. This study is among the first attempts to explore the capabilities of generative AI in understanding human perceptions of the built environment. It contributes to urban design literature by discussing future research opportunities and potential limitations.
The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff
Mukherjee, Anirban, Chang, Hannah
In this paper, drawing inspiration from the human creativity literature, we explore the optimal balance between novelty and usefulness in generative Artificial Intelligence (AI) systems. We posit that overemphasizing either aspect can lead to limitations such as hallucinations and memorization. Hallucinations, characterized by AI responses containing random inaccuracies or falsehoods, emerge when models prioritize novelty over usefulness. Memorization, where AI models reproduce content from their training data, results from an excessive focus on usefulness, potentially limiting creativity. To address these challenges, we propose a framework that includes domain-specific analysis, data and transfer learning, user preferences and customization, custom evaluation metrics, and collaboration mechanisms. Our approach aims to generate content that is both novel and useful within specific domains, while considering the unique requirements of various contexts. Its manifestations, such as divergent thinking that generates novel ideas and convergent thinking that refines these ideas to meet specific goals, have fueled numerous theories about its essence and underlying processes.