AITopics

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

#artificialintelligenceFeb-21-2023, 06:10:55 GMT

AI Product Counsel at OpenAI - San Francisco, California, United States

OpenAI's Legal team plays an indispensable role in advancing OpenAI's mission by navigating futuristic, foundational legal issues in AI. This is the team for you if you care deeply about doing meaningful and novel work as a technology lawyer. The team comprises various backgrounds, including technology, venture capital, M&A, employment, and tax law. As an AI Product Counsel, you will support and lead product, privacy, and regulatory legal initiatives for our cutting-edge models and technologies. This is a unique opportunity to be directly involved in the forefront of the legal and technology field.

ai product counsel, openai, san francisco, (5 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.91)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.96)

#artificialintelligenceFeb-21-2023, 00:56:51 GMT

GitHub Co-Pilot (Game Changer) for AI/ML Coders

Github Co-Pilot is a cutting-edge AI-poGithubred tool developed by GitHub and OpenAI that offers an innovative way for developers to write code. Using advanced machine learning algorithms, Co-Pilot provides intelligent suggestions and auto-completion for code snippets, making it easier and faster for programmers to write complex programs. One of the critical advantages of Co-Pilot is its ability to learn and adapt to the coding patterns and styles of individual developers. As developers work with Co-Pilot, it becomes more familiar with their coding habits and can offer more personalized suggestions and assistance. Moreover, Co-Pilot is built on OpenAI's state-of-the-art GPT language model, which is trained on a massive corpus of data and can understand natural language queries and generate human-like responses.

co-pilot, github copilot, suggestion, (12 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.77)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

#artificialintelligenceFeb-21-2023, 00:55:16 GMT

OpenAI's ChatGPT Switches To Freemium Model - AI Summary

ChatGPT, the AI content tool, is switching to a freemium model, with a new paid tier, priced at $20 per month, which will give users better access to the tool. A version of the ChatGPT interface will remain free to use, but you may be subject to delays at peak usage times and/or outages depending on circumstance.

ai summary, chatgpt, freemium model, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT

Shi, Jiawen, Liu, Yixin, Zhou, Pan, Sun, Lichao

Recently, ChatGPT has gained significant attention in research due to its ability to interact with humans effectively. The core idea behind this model is reinforcement learning (RL) fine-tuning, a new paradigm that allows language models to align with human preferences, i.e., InstructGPT. In this study, we propose BadGPT, the first backdoor attack against RL fine-tuning in language models. By injecting a backdoor into the reward model, the language model can be compromised during the fine-tuning stage. Our initial experiments on movie reviews, i.e., IMDB, demonstrate that an attacker can manipulate the generated text through BadGPT.

large language model, machine learning, natural language, (16 more...)

2304.12298

Country:

North America > United States > Pennsylvania > Northampton County > Bethlehem (0.05)
Asia > China > Hubei Province > Wuhan (0.05)

Genre: Research Report > New Finding (0.49)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT

White, Jules, Fu, Quchen, Hays, Sam, Sandborn, Michael, Olea, Carlos, Gilbert, Henry, Elnashar, Ashraf, Spencer-Smith, Jesse, Schmidt, Douglas C.

Prompt engineering is an increasingly important skill set needed to converse effectively with large language models (LLMs), such as ChatGPT. Prompts are instructions given to an LLM to enforce rules, automate processes, and ensure specific qualities (and quantities) of generated output. Prompts are also a form of programming that can customize the outputs and interactions with an LLM. This paper describes a catalog of prompt engineering techniques presented in pattern form that have been applied to solve common problems when conversing with LLMs. Prompt patterns are a knowledge transfer method analogous to software patterns since they provide reusable solutions to common problems faced in a particular context, i.e., output generation and interaction when working with LLMs. This paper provides the following contributions to research on prompt engineering that apply LLMs to automate software development tasks. First, it provides a framework for documenting patterns for structuring prompts to solve a range of problems so that they can be adapted to different domains. Second, it presents a catalog of patterns that have been applied successfully to improve the outputs of LLM conversations. Third, it explains how prompts can be built from multiple patterns and illustrates prompt patterns that benefit from combination with other prompt patterns.

large language model, machine learning, natural language, (18 more...)

2302.11382

Country: North America > United States > Tennessee > Davidson County > Nashville (0.14)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Kumar, Sachin, Balachandran, Vidhisha, Njoo, Lucille, Anastasopoulos, Antonios, Tsvetkov, Yulia

Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey

Recent advances in the capacity of large language models to generate human-like text have resulted in their increased adoption in user-facing settings. In parallel, these improvements have prompted a heated discourse around the risks of societal harms they introduce, whether inadvertent or malicious. Several studies have explored these harms and called for their mitigation via development of safer, fairer models. Going beyond enumerating the risks of harms, this work provides a survey of practical methods for addressing potential threats and societal harms from language generation models. We draw on several prior works' taxonomies of language model risks to present a structured overview of strategies for detecting and ameliorating different kinds of risks/harms of language generators. Bridging diverse strands of research, this survey aims Figure 1: Overview of Intervention Strategies. A typical to serve as a practical guide for both LM researchers ML/NLP model development process involves data and practitioners, with explanations collection/curation, model training and design, inference, of different mitigation strategies' motivations, and finally application deployment.

computational linguistic, large language model, machine learning, (19 more...)

2210.077

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(24 more...)

Genre: Overview (1.00)

Industry:

Media (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

$k$NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models

Huang, Yangsibo, Liu, Daogao, Zhong, Zexuan, Shi, Weijia, Lee, Yin Tat

Fine-tuning a language model on a new domain is standard practice for domain adaptation. However, it can be infeasible when it comes to modern large-scale language models such as GPT-3, which can only be accessed through APIs, making it difficult to access the internal parameters of the model. In this paper, we propose $k$NN-Adapter, a method to effectively adapt these black-box large language models (LLMs) to a new domain. The $k$NN-Adapter builds on top of the retrieval-augmented language model, and adaptively learns to interpolate the output of the language model with retrieval results from a datastore consisting of the target domain data. Our experiments on four different domains demonstrate that $k$NN-Adapter significantly improves perplexity, and works particularly well in settings with limited access to LLMs. Additionally, we show that $k$NN-Adapter is more effective than fine-tuning when the amount of training data is limited. We also release a dataset to encourage further study.

large language model, machine learning, natural language, (17 more...)

2302.10879

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.82)

Industry: Transportation > Air (0.63)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Wu, Shangda, Sun, Maosong

Efficient and Training-Free Control of Language Generation

In recent years, there has been a growing interest in the development of language models capable of generating text with controllable attributes. While several approaches have been proposed, many of these methods require condition-specific data or significant computational resources. In this study, we propose a novel method called Gamma Sampling, which enables controllable language generation without the need for any training data and maintains a fast generation speed. Gamma Sampling incorporates attribute-related information into the sampling process, effectively guiding the language model to produce text with desired attributes. Our experimental results demonstrate that Gamma Sampling, when applied to GPT2, outperforms representative baselines in terms of diversity, attribute relevance, and overall quality of the generated samples.

ampling, large language model, machine learning, (19 more...)

2205.06036

Country:

Europe > United Kingdom > Wales (0.04)
Oceania > New Zealand (0.04)
Asia > China (0.04)
(19 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Law > Government & the Courts (1.00)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Shibata, Hisaichi, Miki, Soichiro, Nakamura, Yuta

Playing the Werewolf game with artificial intelligence for language understanding

The Werewolf game is a social deduction game based on free natural language communication, in which players try to deceive others in order to survive. An important feature of this game is that a large portion of the conversations are false information, and the behavior of artificial intelligence (AI) in such a situation has not been widely investigated. The purpose of this study is to develop an AI agent that can play Werewolf through natural language conversations. First, we collected game logs from 15 human players. Next, we fine-tuned a Transformer-based pretrained language model to construct a value network that can predict a posterior probability of winning a game at any given phase of the game and given a candidate for the next action. We then developed an AI agent that can interact with humans and choose the best voting target on the basis of its probability from the value network. Lastly, we evaluated the performance of the agent by having it actually play the game with human players. We found that our AI agent, Deep Wolf, could play Werewolf as competitively as average human players in a villager or a betrayer role, whereas Deep Wolf was inferior to human players in a werewolf or a seer role. These results suggest that current language models have the capability to suspect what others are saying, tell a lie, or detect lies in conversations.

large language model, machine learning, natural language, (17 more...)

2302.10646

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)

Genre: Research Report > New Finding (0.35)

Industry:

Leisure & Entertainment > Games (1.00)
Health & Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)