Generative AI
The next DALL-E will be able to generate results within ChatGPT
OpenAI is gearing up to roll out the third version of DALL-E, its text-to-image AI system, which reportedly improves its predecessor's capabilities and can generate results within the ChatGPT app. The company demonstrated how the new iteration integrates with ChatGPT to The Verge, and it showed the publication how users can ask the chatbot to write a lengthy and detailed prompt the image AI can use. OpenAI told Axios that DALL-E 3 is "significantly better" at being able to grasp a user's intention, especially if the prompt is long and detailed. If a user can't articulate what they want in a way that can maximize the image generator's abilities, then ChatGPT can help them write a comprehensive prompt for it. In the demo to The Verge, DALL-E produced four results for a prompt asking for a ramen restaurant logo in the mountains within ChatGPT.
Pixyz: a Python library for developing deep generative models
Suzuki, Masahiro, Kaneko, Takaaki, Matsuo, Yutaka
With the recent rapid progress in the study of deep generative models (DGMs), there is a need for a framework that can implement them in a simple and generic way. In this research, we focus on two features of DGMs: (1) deep neural networks are encapsulated by probability distributions, and (2) models are designed and learned based on an objective function. Taking these features into account, we propose a new Python library to implement DGMs called Pixyz. This library adopts a step-by-step implementation method with three APIs, which allows us to implement various DGMs more concisely and intuitively. In addition, the library introduces memoization to reduce the cost of duplicate computations in DGMs to speed up the computation. We demonstrate experimentally that this library is faster than existing probabilistic programming languages in training DGMs.
'Game of Thrones' author and others accuse ChatGPT maker of 'theft' in lawsuit
The lawsuit is the latest salvo in the ongoing debate over how AI tools should be trained and whether the companies behind them owe anything to the original creators of the training data. Large language models are generally trained on billions of sentences of text pulled from the internet, including news stories, Wikipedia and comments on social media sites. OpenAI and other AI companies such as Google and Microsoft do not say specifically what data they use, but AI critics have long suspected that it includes well-known collections of pirated books that have circulated online for years.
John Grisham, George R.R. Martin, other prominent authors sue OpenAI for copyright infringement
Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. The suit was organized by the Authors Guild and also includes David Baldacci, Sylvia Day, Jonathan Franzen and Elin Hilderbrand among others. "It is imperative that we stop this theft in its tracks or we will destroy our incredible literary culture, which feeds many other creative industries in the U.S.," Authors Guild CEO Mary Rasenberger said in a statement. "Great books are generally written by those who spend their careers and, indeed, their lives, learning and perfecting their crafts. To preserve our literature, authors must have the ability to control if and how their works are used by generative AI."
Everything Amazon announced at its 2023 Devices and Services event
Amazon's fall hardware event was chock full of updates. Perhaps unsurprisingly, given the generative AI boom from the last year, the company began transforming Alexa into a much more versatile and conversational personal chatbot. But it also had plenty of new hardware to introduce, with new models of the Echo Show, security cameras, Echo Frames, a 10-gigabit router and more. Here's everything Amazon unveiled on Wednesday. As generative AI has exploded in popularity during the last year, task-focused personal assistants like Siri, Google Assistant and Alexa now seem even more dated than they did before.
OpenAI unveils better image generator, DALL-E 3, as AI arms race deepens
As part of a voluntary White House pledge in June, OpenAI agreed to develop and deploy mechanisms to identify when visual or audio content is AI-generated, using methods like watermarking an image or encoding provenance data to indicate the service or model that created the content. DALL-E 3 is experimenting with a classifier that looks at where an image came from or the content's "provenance," said Ramesh, a method mentioned in the White House commitments.
OpenAI's Dall-E 3 Is an Art Generator Powered by ChatGPT
OpenAI has announced Dall-E 3, its latest AI art tool. It uses OpenAI's smash-hit chatbot, ChatGPT, to help create more complex and carefully composed works of art by automatically expanding on a prompt in a way that gives the generator more detailed and coherent instruction. What's new with Dall-E 3 is how it removes some of the complexity required with refining the text that is fed to the program--what's known as "prompt engineering"--and how it allows users to make refinements through ChatGPT's conversational interface. The new tool could help lower the bar for generating sophisticated AI artwork, and it could help OpenAI stay ahead of the competition thanks to the superior abilities of its chatbot. Take this image of the potato king, for example. This kind of quirky AI-generated art has become commonplace on social media thanks to a number of tools that turn a text prompt into a visual composition.
Amazon unveils a 'smarter' Alexa, bringing the AI arms race inside homes
Here's how it worked during a demo at Amazon's launch event: Say, "Alexa, let's chat," and an Echo smart speaker enters a special conversational mode. Amazon showed people asking it for advice about travel and to write stories and emails, with people able to interrupt and redirect the AI mid-sentence.
Generative Agent-Based Modeling: Unveiling Social System Dynamics through Coupling Mechanistic Models with Generative Artificial Intelligence
Ghaffarzadegan, Navid, Majumdar, Aritra, Williams, Ross, Hosseinichimeh, Niyousha
We discuss the emerging new opportunity for building feedback-rich computational models of social systems using generative artificial intelligence. Referred to as Generative Agent-Based Models (GABMs), such individual-level models utilize large language models such as ChatGPT to represent human decision-making in social settings. We provide a GABM case in which human behavior can be incorporated in simulation models by coupling a mechanistic model of human interactions with a pre-trained large language model. This is achieved by introducing a simple GABM of social norm diffusion in an organization. For educational purposes, the model is intentionally kept simple. We examine a wide range of scenarios and the sensitivity of the results to several changes in the prompt. We hope the article and the model serve as a guide for building useful diffusion models that include realistic human reasoning and decision-making.