Generative AI
Writing With Artificial Intelligence With Andrew Mayne
What is GPT-3 and how can writers use it responsibly as part of their creative process? How can we approach AI tools with curiosity, rather than fear? In the intro, I mention the discussion about whether Google's language model, LaMDA, could be sentient [The Verge]; and the Alliance of Independent Authors Ethical Usage of AI tools. If you'd like to know more about using AI for writing, images, marketing, voice, translation, and more, check out my course, The AI-Assisted Author. Andrew Mayne is the multi-award-nominated and internationally best-selling author of thrillers. He invented an underwater stealth suit for shark diving, and he works with OpenAI as a science communicator. He also has books for authors, including, 'How to Write a Novella in 24 hours,' and a co-hosts the podcast'Weird Things.' You can find Andrew at www.AndrewMayne.com You can find GPT-3 on OpenAI.com. There are many tools built on top of GPT-3. I use and recommend Sudowrite for fiction, in particular. Joanna: Andrew Mayne is the multi-award-nominated and internationally best-selling author of thrillers. He invented an underwater stealth suit for shark diving, and he works with OpenAI as a science communicator. He also has books for authors, including, 'How to Write a Novella in 24 hours,' and a co-hosts the podcast'Weird Things.' Andrew: Hey, thank you for having me. Joanna: Oh, you do so many things. But we are actually going to talk about AI today. Andrew: Well, ever since I was a little boy, I was really interested in science, and entertainment, and everything in between. And I loved robots when I was a kid. And I'd build robots from science fairs and stuff, and I would use coffee cans, and little motors and things I pulled from toys to do that.
Transformers, transformers everywhere: An Overview of Transformers
Many artificial intelligence enthusiasts and professionals might agree that the capacities of recent deep learning models have made leaps and bounds. The recent unveiling of models such as OpenAI's successor for DALL-E, DALL-E 2 [1] and Google's Imagen [2] have shocked the world with completely artificial images based on text prompts. Though for the greater public, the graphical aspect of these models catches the eye easier, the high levels of Natural Language Understanding (NLU) shown in these models is enough to impress anyone. You can also generate the coolest dog you have ever seen, which is obviously the real breakthrough here. These models address many tasks, including but not limited to: language understanding, question-and-answer for customer service chatbots, text classification for spam and fraud detection, text completion and generation for assisted content creation, image captioning and image segment captioning for automatic labeling and automatic image description, sentiment analysis for hate speech detection, style transfer…These tasks are highly relevant as information flow across web platforms keeps growing larger and larger and state-of-the-art models are needed to keep up.
Initial thoughts on using DALL E 2
Like finding a box of watercolors, the excitement is similar. The learning curve is easy to make images more aligned with your taste. Writing good prompts is a skill. Don't be discouraged by initial results; making art with text is an iterative process. Every time you put your eye on the viewfinder, you decide what is beautiful, what is acceptable, and what tells the story you want.
AI asked to show an image of the last selfie ever taken on Earth
An artificial intelligence (AI) has been asked to produce an image that displays the last selfie a human ever took on Earth. The artificial intelligence system is called DALL-E and is designed to produce completely original images from text descriptions that anyone can enter. The AI system is a 12-billion parameter version of GPT-3, which is an autoregressive language model that uses deep learning to produce human-like conversations. Engineers took that GPT-3 model, developed by OpenAI, to create DALL-E, and instead of the AI producing human-like conversations, it produces a batch of images based on the text entered. The AI system was asked to produce what it "thinks" would be the last selfie a human ever took on Earth, and using the data provided to it by Google's servers, the AI produced a batch of completely original images that showcase a grim depiction of the world ending. Now, these images hold no validity when it comes to predicting Earth's future.
DALL-E 2: When AI transforms words into images
Last year, researchers from the OpenAI consortium (an Elon Musk's and Sam Altman's non-profit organization) developed an advanced artificial intelligence model to generate colorful and artistic images based on specific keywords and texts. The AI-powered DALL-E bot was a digital version of a sketch artist that could generate accurate images from descriptive text sentences. Now, the world is buzzing about Dall-E 2: an enhanced AI capable of producing realistic images: Simply write whatever you want, and it creates for you. Long story short… Dall-E 2 receives as an input a textual description, for example: "An astronaut riding a horse ."The AI processes that information, understands it, interprets it, and outputs an image like the following. Not only that but Dall-E 2 can work from an image to generate variations on the theme; it can also modify existing images starting from a textual indication.
This AI newsletter is all you need #5
The big news: DALL-E 2 is now in beta! OpenAI just announced the release of DALL-E 2 to 1 million people, ten times more than the pre-beta model. You can no longer spam generations to have funny memes for free -- it is now nearly $300 for the same amount of free generations you had pre-beta. We had some terrific publications this past week like NUWA, BigColor, and Mega Portraits, all advancing the image generation field with fantastic approaches and results -- as well as the ICML 2022 event that released its outstanding papers that are worth the read. Last but not least, listen to this podcast hosted by one of our community members in this iteration!
Genshin Impact! Fine-tuning CLIP for anime search
Today let's build a search-anime system. We will use text as our query and get images as result. For this we would usually need to manually annotate the image with some tags, often referred to as TBIR (Text/Tag-based Image Retrieval). And for this example, we will use OpenAI CLIP. CLIP is a powerful embedding model that outputs the similarity between text and images.
Sourceless presents the first Cognitive Web
Formwelt, OpenAI Codex, Github Co-Pilot and other Artificial Intelligence projects will make the SourceLess Platform usable by absolutely anyone, being able to create anything just by using words (written or spoken). For example, by using the Formwelt language, anyone, regardless of nationality, can communicate in a direct and semantically correct way with OpenAI Codex and create anything in the digital world; you can create a complete and complex website in less than an hour. All these AI systems will be implemented inside the SourceLess Platform, thus everyone can have access to all the facilities of the new Web through a single domain (eg: str.domain). Education, Technology & Innovation -- these three pillars of the future are the foundations of the SourceLess Platform. The purpose of education in the Sourceless project is to transmit knowledge or foster skills and character traits. These aims may include the development of understanding, rationality, kindness, and honesty.
La veille de la cybersécurité
Artificial intelligence (AI) is not only affecting industries like business and healthcare. It is also playing an increasing role in the creative industries by ushering in a new era of AI-generated art. AI technologies and tools are often widely accessible to anyone, which is helping to create an entirely new generation of artists. We often hear that AI is going to automate away or take over all human tasks, including those in art, film, and other creative industries. But this is far from the case.