AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

ChatGPT API: Building Custom Chatbot Experiences with Natural Language Processing

#artificialintelligenceDec-15-2022, 00:25:12 GMT

The ChatGPT API is a powerful tool for developers looking to build custom chatbot experiences using natural language processing (NLP). Built on the powerful GPT-3 language model, ChatGPT is capable of generating human-like responses to a wide range of questions and statements. This allows developers to create engaging and natural conversational experiences for their users. In this write-up, we will explore the key features and capabilities of the ChatGPT API, as well as provide code examples to help developers get started. One of the key features of ChatGPT is its ability to generate human-like responses to a wide range of questions and statements.

building custom chatbot experience, chatgpt api, developer, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Retrieval-based Disentanglement with Distant Supervision

Zhou, Jiawei, Li, Xiaoguang, Shang, Lifeng, Jiang, Xin, Liu, Qun, Chen, Lei

arXiv.org Artificial IntelligenceDec-15-2022

Disentangled representation learning remains challenging as ground truth factors of variation do not naturally exist. To address this, we present Vocabulary Disentanglement Retrieval~(VDR), a simple yet effective retrieval-based disentanglement framework that leverages nature language as distant supervision. Our approach is built upon the widely-used bi-encoder architecture with disentanglement heads and is trained on data-text pairs that are readily available on the web or in existing datasets. This makes our approach task- and modality-agnostic with potential for a wide range of downstream applications. We conduct experiments on 16 datasets in both text-to-text and cross-modal scenarios and evaluate VDR in a zero-shot setting. With the incorporation of disentanglement heads and a minor increase in parameters, VDR achieves significant improvements over the base retriever it is built upon, with a 9% higher on NDCG@10 scores in zero-shot text-to-text retrieval and an average of 13% higher recall in cross-modal retrieval. In comparison to other baselines, VDR outperforms them in most tasks, while also improving explainability and efficiency.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2212.07699

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.55)

Add feedback

Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning

Castricato, Louis, Havrilla, Alexander, Matiana, Shahbuland, Pieler, Michael, Ye, Anbang, Yang, Ian, Frazier, Spencer, Riedl, Mark

arXiv.org Artificial IntelligenceDec-15-2022

Controlled automated story generation seeks to generate natural language stories satisfying constraints from natural language critiques or preferences. Existing methods to control for story preference utilize prompt engineering which is labor intensive and often inconsistent. They may also use logit-manipulation methods which require annotated datasets to exist for the desired attributes. To address these issues, we first train a contrastive bi-encoder model to align stories with corresponding human critiques, named CARP, building a general purpose preference model. This is subsequently used as a reward function to fine-tune a generative language model via reinforcement learning. However, simply fine-tuning a generative language model with a contrastive reward Figure 1: Illustration of our technique for generating model does not always reliably result in story content controlled by preferences. A language a story generation system capable of generating model generates candidates, which are ranked stories that meet user preferences. To increase by the CARP model to produce scores. The scores are story generation robustness we further used to fine-tune the language model to produce higher fine-tune the contrastive reward model using a scoring--and thus more aligned with preferences-- prompt-learning technique.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2210.07792

Country:

Asia > Middle East > Iraq (0.14)
North America > United States > Texas (0.04)
North America > United States > Florida > Orange County (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Detecting Label Errors by using Pre-Trained Language Models

Chong, Derek, Hong, Jenny, Manning, Christopher D.

arXiv.org Artificial IntelligenceDec-15-2022

We show that large pre-trained language models are inherently highly capable of identifying label errors in natural language datasets: simply examining out-of-sample data points in descending order of fine-tuned task loss significantly outperforms more complex error-detection mechanisms proposed in previous work. To this end, we contribute a novel method for introducing realistic, human-originated label noise into existing crowdsourced datasets such as SNLI and TweetNLP. We show that this noise has similar properties to real, hand-verified label errors, and is harder to detect than existing synthetic noise, creating challenges for model robustness. We argue that human-originated noise is a better standard for evaluation than synthetic noise. Finally, we use crowdsourced verification to evaluate the detection of real errors on IMDB, Amazon Reviews, and Recon, and confirm that pre-trained models perform at a 9-36% higher absolute Area Under the Precision-Recall Curve than existing models.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2205.12702

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Dominican Republic (0.04)
Oceania > New Zealand (0.04)
(15 more...)

Genre: Research Report > Promising Solution (0.48)

Industry:

Media > Film (0.46)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Communications > Social Media > Crowdsourcing (0.69)
(2 more...)

Add feedback

ChatGPT Wrote This Article About Large Language Models – Gizmodo

#artificialintelligenceDec-14-2022, 22:41:22 GMT

ChatGPT, the machine–learning fueled chatbot, has been widely acclaimed for its ability to interact with users and generate "human-like" text.

chatgpt, gizmodo

#artificialintelligence

Industry: Media > News (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

He Used AI to Publish a Children's Book in a Weekend. Artists Are Not Happy About It

TIME - TechDec-14-2022, 21:58:11 GMT

Ammaar Reshi was playing around with ChatGPT, an AI-powered chatbot from OpenAI when he started thinking about the ways artificial intelligence could be used to make a simple children's book to give to his friends. Just a couple of days later, he published a 12-page picture book, printed it, and started selling it on Amazon without ever picking up a pen and paper. The feat, which Reshi publicized in a viral Twitter thread, is a testament to the incredible advances in AI-powered tools like ChatGPT--which took the internet by storm two weeks ago with its uncanny ability to mimic human thought and writing. But the book, Alice and Sparkle, also renewed a fierce debate about the ethics of AI-generated art. Many argued that the technology preys on artists and other creatives--using their hard work as source material, while raising the specter of replacing them.

large language model, machine learning, natural language, (18 more...)

TIME - Tech

Country:

North America > United States > California > San Francisco County > San Francisco (0.06)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.05)

Industry: Media > Publishing (0.76)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)

Add feedback

A Journey into the Fabulous Applications of Transformers -- Part 2 – Towards AI

#artificialintelligenceDec-14-2022, 21:00:10 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Transformer architecture is widely used in Natural Language Processing and it highly contributed to the need-of-the-hour Large Language Models (LLM).

artificial intelligence, large language model, natural language, (18 more...)

#artificialintelligence

Country: Europe > Finland > Uusimaa > Helsinki (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

The Digital Insider

#artificialintelligenceDec-14-2022, 19:20:15 GMT

GitHub has unveiled business usage terms for its GitHub Copilot AI-based coding assistant, making the service available to businesses for $19 per month per user. The company also vowed to keep users' own code safe from retention, storage, or sharing by GitHub. GitHub Copilot for Business gives organizations license management, organization-wide policy controls, and privacy features along with licenses for organizations, teams, and individual users. GitHub Copilot, introduced in 2021 as a Visual Studio Code editor extension, offers coding suggestions and functions directly from the user's programming editor or IDE. The AI model behind Copilot is trained on open source code in public repositories.

artificial intelligence, large language model, natural language, (5 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

How Does ChatGPT Actually Work?

#artificialintelligenceDec-14-2022, 15:50:49 GMT

Comcast is one of the email service providers and you can log-in by using credentials such as username and password. However, while using it, many users face Comcast email, not working problems. Here we discussed How To Resolve Xfinity Comcast Email Not Working Problems easily by going through the steps. This article helps you to get rid of several issues associated with Xfinity Comcast Email Not Working Issues. Many of them might be first-time users of Comcast.

large language model, machine learning, natural language, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

A Conversation With ChatGPT About The Metaverse - Blockzeit

#artificialintelligenceDec-14-2022, 14:05:42 GMT

ChatGPT is a prototype artificial intelligence chatbot developed by OpenAI which specializes in dialogue. The chatbot is a large language model fine-tuned with both supervised and reinforcement learning techniques. It is based on OpenAI's GPT-3.5 model, an improved version of GPT-3. ChatGPT was launched on November 30, 2022 and has garnered attention for its detailed responses and articulate answers. I wanted to see what chatGPT has to say about the metaverse.

chatgpt, metaverse, platform, (12 more...)

#artificialintelligence

Genre: Personal > Interview (0.30)

Industry:

Leisure & Entertainment > Games > Computer Games (0.56)
Information Technology (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.47)

Add feedback