Goto

Collaborating Authors

 Large Language Model



Hype grows over "autonomous" AI agents that loop GPT-4 outputs

#artificialintelligence

Since the launch of OpenAI's GPT-4 API last month to beta testers, a loose group of developers has been experimenting with making agent-like ("agentic") implementations of the AI model that attempt to carry out multistep tasks with as little human intervention as possible. These homebrew scripts can loop, iterate, and spin-off new instances of an AI model as needed. Two experimental open source projects, in particular, have captured much attention on social media, especially among those who hype AI projects relentlessly: Auto-GPT, created by Toran Bruce Richards, and BabyAGI, created by Yohei Nakajima. They need a lot of human input and hand-holding along the way, so they're not yet as autonomous as promised. But they represent early steps toward more complex chaining AI models that could potentially be more capable than a single AI model working alone.


Google 'panic': Samsung said to consider switching to Bing

#artificialintelligence

Google went into a "panic" after Samsung, the world's second-largest manufacturer of smartphones and a user of the company's Android operating system, said it was considering offering Bing as the default search engine for users, according to a report published by the New York Times on Sunday. The tech giant is scrambling to catch up to rivals Microsoft in the adoption of A.I. services. The threat to Google's $162 billion business is reportedly pushing the company to revamp its search products, including creating an entirely new A.I.-powered search engine. Google's drive to compete contrasts with comments from Alphabet CEO Sundar Pichai in an interview, also aired on Sunday, in which he suggested that he wanted to avoid a rush to release new A.I. products for fear of how they might affect society. In the interview with CBS's 60 Minutes, Alphabet CEO Pichai called artificial intelligence "the most profound technology humanity is working on," adding that it was "more profound than fire or electricity or anything that we've done in the past."


The Drum

#artificialintelligence

The arrival of ChatGPT, however, feels different. Most new technologies have transformed the products of creative agencies, but ChatGPT is a product that could transform the strategic and creative process itself. Agencies and consultancies should therefore be seriously considering the potential value that this tool can bring to their businesses. One of the helpful ways to go about this is to frame ChatGPT as a thinking partner and to explore the two basic thinking styles that underpin the practice of innovation and strategy formation: divergent and convergent thinking. This approach can help us to understand the benefits of ChatGPT, and how we can start capitalizing on them.


Demystifying Segment Anything: A Comprehensive Guide to Next-Gen Image Segmentation

#artificialintelligence

Foundation models, which are large-scale pre-trained models, have significantly impacted the field of natural language processing (NLP) with their zero-shot and few-shot generalization capabilities. Recently, these models have been applied to computer vision tasks, such as image-text alignment, using contrastive learning. However, there's a need to expand foundation models for a wider range of computer vision tasks, such as image segmentation. In this research paper, the authors propose a foundation model for image segmentation, which they call "Segment Anything." The researchers propose a promotable segmentation task, inspired by the prompting techniques used in NLP foundation models.


What is Google LaMDA? Here's what you need to know - Android Authority

#artificialintelligence

Knowledge and accuracy: LaMDA can access the internet for the latest information, while both GPT-3 and even GPT-4 have knowledge cut-off dates of September 2021. If asked about more up-to-date events, these models could generate fictional responses. Training data: LaMDA's training dataset comprised primarily of dialog, while GPT-3 used everything from Wikipedia entries to traditional books. That makes GPT-3 more general-purpose and adaptable for applications like ChatGPT. Human training: In the previous section, we talked about how Google hired human workers to fine-tune its model for safety and quality.


OpenAI CEO confirms ChatGPT is not trying to train GPT-5

#artificialintelligence

OpenAI CEO confirms ChatGPT not training GPT-5, addresses safety concerns raised by open letter from tech community, Italy establishes task force on ChatGPT. OpenAI, a leading AI research organisation, has developed the ChatGPT programme, which has become immensely popular for its ability to provide quick and comprehensive answers to a wide range of queries. However, its fast expansion has sparked worries regarding its potential effects on employment, privacy, and safety. Recently, an open letter signed by tech giants Elon Musk, Steve Wozniak, and others urged companies to pause the development of AI systems that are more powerful than GPT-4, which was released by OpenAI. The open letter signed by Elon Musk and other researchers called for a halt in the development of AI systems that are more powerful than GPT-4, citing concerns about potential impacts on safety, privacy, and employment.


Sparks of Artificial General Intelligence: Early experiments with GPT-4 - Microsoft Research

#artificialintelligence

Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google's PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models.


Hello Dolly: Democratizing the magic of ChatGPT with open models

#artificialintelligence

Update Apr 12, 2023: We have released Dolly 2.0, licensed for both research and commercial use. See the new blog post here. We show that anyone can take a dated off-the-shelf open source large language model (LLM) and give it magical ChatGPT-like instruction following ability by training it in 30 minutes on one machine, using high-quality training data. Surprisingly, instruction-following does not seem to require the latest or largest models: our model is only 6 billion parameters, compared to 175 billion for GPT-3. We open source the code for our model (Dolly) and show how it can be re-created on Databricks.


Do you actually need a vector database?

#artificialintelligence

Spoiler alert: the answer is maybe! Vector databases are having their day right now. Three different vector DB companies have raised money on valuations up to $700 million (paywall link). Surprisingly, their rise in popularity is not for their "original" purpose in recommendation systems, but rather as an auxillary tool for Large Language Models (LLMs). Many online examples of combining embeddings with LLMs will show you how they store the embeddings in a vector database.