AITopics | Large Language Model


OpenAI debuts ChatGPT and GPT-3.5 series as GPT-4 rumors fly

#artificialintelligence | Nov-30-2022, 23:45:06 GMT

As GPT-4 rumors fly around NeurIPS 2022 this week in New Orleans (including whispers that details about GPT-4 will be revealed there), OpenAI has managed to make plenty of news in the meantime. On Monday, the company announced a new model in the GPT-3 family of AI-powered large language models, text-davinci-003, part of what it calls the "GPT-3.5 series," that reportedly improves on its predecessors by handling more complex instructions and producing higher-quality, longer-form content. Unlike davinci-002, which uses supervised fine-tuning on human-written demonstrations and highly scored model samples to improve generation quality, davinci-003 is a true reinforcement learning with human feedback (RLHF) model. Meanwhile, today OpenAI launched an early demo of ChatGPT, another part of the GPT-3.5 series that is an interactive, conversational model whose dialogue format "makes it possible for ChatGPT to answer followup questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests."

gpt-4 rumor fly, human feedback, openai debut chatgpt and gpt-3, (3 more...)

#artificialintelligence

Country: North America > United States > Louisiana > Orleans Parish > New Orleans (0.28)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.88)


While everyone waits for GPT-4, OpenAI is still fixing its predecessor

MIT Technology Review | Nov-30-2022, 18:02:38 GMT

ChatGPT appears to address some of these problems, but it is far from a full fix--as I found when I got to try it out. This suggests that GPT-4 won't be either. In particular, ChatGPT--like Galactica, Meta's large language model for science, which the company took offline earlier this month after just three days--still makes stuff up. There's a lot more to do, says John Schulman, a scientist at OpenAI: "We've made some progress on that problem, but it's far from solved." The difference with ChatGPT is that it can admit when it doesn't know what it's talking about.

chatgpt, christopher columbus, openai, (6 more...)

MIT Technology Review

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)


Best NLP Papers -- October 2022

#artificialintelligence | Nov-30-2022, 16:41:06 GMT

This roundup highlights some interesting NLP papers from October 2022 on language model capabilities. This article's title and TL;DR were generated with Cohere. NLP is evolving at a rapid pace, and every month we discover new capabilities. Large language models, like those built by Cohere, are being applied to use cases that we couldn't have imagined even a few months ago.

language description, language model, offensive content, (14 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.30)

Industry: Education > Educational Technology > Educational Software (0.49)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.39)


AI Is Terrible at Detecting Misinformation. It Doesn't Have to Be. - Nautilus

#artificialintelligence | Nov-30-2022, 16:40:41 GMT

Elon Musk has said he wants to make Twitter "the most accurate source of information in the world." I am not convinced that he means it, but whether he does or not, he's going to have to work on the problem; a lot of advertisers have already made that pretty clear. If he does nothing, they are out. And Musk has continued to tweet in ways that seem to indicate that he is generally on board with some kind of content moderation. The tech journalist Kara Swisher has speculated that Musk wants AI to help; on Twitter she wrote, rather plausibly, that Musk "is hoping to build an AI system that replaces [fired moderators] that will not work well now but will presumably get better."

galactica, language model, misinformation, (12 more...)

#artificialintelligence

Country: North America > United States > New York (0.05)

Industry:

Media > News (0.79)
Information Technology > Services (0.70)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.32)


Researchers Win Gordon Bell Special Prize for Models that Track COVID Variants

#artificialintelligence | Nov-30-2022, 16:40:32 GMT

Members of the GenSLMs team received the Gordon Bell Special Prize for HPC-Based COVID-19 Research at the SC22 conference. Scientists from Argonne National Laboratory and a team of collaborators have won the 2022 ACM Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research for their method of quickly identifying how a virus evolves. Their work in training large language models (LLMs) to discover variants of SARS-CoV-2 has implications for biology beyond COVID-19. The researchers leveraged Argonne's supercomputing and AI resources to develop and apply LLMs to track how a virus can mutate into more dangerous or more transmissible variants, known as variants of concern (VOCs). Existing methods to track VOCs can be slow.

gordon bell special prize, university, win gordon bell special prize, (7 more...)

#artificialintelligence

Country:

North America > United States > Illinois > Cook County > Chicago (0.14)
North America > United States > New York (0.09)
North America > United States > California (0.09)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.09)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)


Google's Code-as-Policies Lets Robots Write Their Own Code

#artificialintelligence | Nov-30-2022, 12:52:29 GMT

Researchers from Google's Robotics team have open-sourced Code-as-Policies (CaP), a robot control method that uses a large language model (LLM) to generate robot-control code that achieves a user-specified goal. CaP uses a hierarchical prompting technique for code generation that outperforms previous methods on the HumanEval code-generation benchmark. The technique and experiments were described in a paper published on arXiv. CaP differs from previous attempts to use LLMs to control robots; instead of generating a sequence of high-level steps or policies to be invoked by the robot, CaP directly generates Python code for those policies. The Google team developed a set of prompting techniques that improved code-generation, including a new hierarchical prompting method.

google, llm, robot, (12 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)


Effective Altruism Is Pushing a Dangerous Brand of 'AI Safety'

WIRED | Nov-30-2022, 12:00:00 GMT

Throughout my two decades in Silicon Valley, I have seen effective altruism (EA)--a movement consisting of an overwhelmingly white male group based largely out of Oxford University and Silicon Valley--gain alarming levels of influence. EA is currently being scrutinized due to its association with Sam Bankman-Fried's crypto scandal, but less has been written about how the ideology is now driving the research agenda in the field of artificial intelligence (AI), creating a race to proliferate harmful systems, ironically in the name of "AI safety." EA is defined by the Center for Effective Altruism as "an intellectual project, using evidence and reason to figure out how to benefit others as much as possible." And "evidence and reason" have led many EAs to conclude that the most pressing problem in the world is preventing an apocalypse where an artificially generally intelligent being (AGI) created by humans exterminates us. To prevent this apocalypse, EA's career advice center, 80,000 Hours, lists "AI safety technical research" and "shaping future governance of AI" as the top two recommended careers for EAs to go into, and the billionaire EA class funds initiatives attempting to stop an AGI apocalypse.

apocalypse, dangerous brand, effective altruism, (11 more...)

WIRED

Country:

North America > United States > California (0.49)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.26)
Europe > Estonia > Harju County > Tallinn (0.06)

Industry: Banking & Finance > Trading (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)


OpenAI Turns to Davinci to Make GPT-3 Better

#artificialintelligence | Nov-30-2022, 09:43:36 GMT

The OpenAI API adds 'text-davinci-003' to its list of main GPT-3 models. It can perform every task the other models can while delivering higher quality, longer output, and better instruction-following. Davinci is the most capable model and can perform all the tasks the other models can, often with fewer instructions. It works particularly well on tasks that require in-depth knowledge of the subject matter, such as summarising texts for a specific audience and creative content development. For example, it is good at deducing solutions to various logical problems and outlining character motivations. However, Davinci's new capabilities also require more computing resources, leading to higher costs per API call and lower speed than other models.

davinci, gpt-3 model, openai turn, (3 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.65)


NeurIPS 2022 -- 10 Topics and 50 Papers You Shouldn't Miss

#artificialintelligence | Nov-30-2022, 06:15:50 GMT

Language Models, Brain-Inspired Research, Diffusion Models, Graph Neural Networks… NeurIPS comes packed with world-class AI research insights, and this guide will help you find where to direct your attention.

learning openreview virtual poster, openreview virtual poster, virtual poster, (15 more...)

#artificialintelligence

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.47)
Health & Medicine > Health Care Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)


AI Writing Assistant Tool "Catchy" Rolls Out an Update That Improves Its Text Generation Capabilities - Digital Shift Times: Courage and Hope for That Transformation

#artificialintelligence | Nov-30-2022, 06:14:36 GMT

Digital Recipe Inc. announced that it has significantly upgraded the performance of "Catchy," the AI writing assistant it provides.

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.81)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.81)
