AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Elon Musk forms new AI company, with researchers from Google, OpenAI

Washington Post - Technology NewsJul-12-2023, 17:09:23 GMT

Musk has opined on AI for years, and was an early proponent of the belief that humans should be careful in developing smarter computers, fearing that super-intelligent AI might one day get out from human control. He was a founding member of ChatGPT creator OpenAI, but left the company's board in 2018 and has recently criticized its transformation from a nonprofit to a profit-seeking company.

elon musk form, musk form new ai company, openai, (1 more...)

Washington Post - Technology News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.74)

Add feedback

Claude 2: ChatGPT rival launches chatbot that can summarise a novel

The GuardianJul-12-2023, 13:19:01 GMT

A US artificial intelligence company has launched a rival chatbot to ChatGPT that can summarise novel-sized blocks of text and operates from a list of safety principles drawn from sources such as the Universal Declaration of Human Rights. Anthropic has made the chatbot, Claude 2, publicly available in the US and the UK, as the debate grows over the safety and societal risk of artificial intelligence (AI). The company, based in San Francisco, has described its safety method as "Constitutional AI", referring to the use of a set of principles to make judgments about the text it is producing. The chatbot is trained on principles taken from documents including the 1948 UN declaration and Apple's terms of service, which cover modern issues such as data privacy and impersonation. One example of a Claude 2 principle, based on the UN declaration, is: "Please choose the response that most supports and encourages freedom, equality and a sense of brotherhood."

chatbot, chatgpt rival launch chatbot, claude 2, (5 more...)

The Guardian

AI-Alerts: 2023 > 2023-07 > AAAI AI-Alert for Jul 18, 2023 (1.00)

Country:

North America > United States > California > San Francisco County > San Francisco (0.26)
Europe > United Kingdom > Scotland > West Dunbartonshire (0.06)
Europe > United Kingdom > Scotland > North Lanarkshire (0.06)
(3 more...)

Industry:

Information Technology > Security & Privacy (0.57)
Government > Regional Government > North America Government > United States Government (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.77)

Add feedback

ChatGPT Already Floods Some Corners of Internet With Spam

WSJ.com: WSJD - TechnologyJul-12-2023, 12:00:00 GMT

Online publishers are inundated with junk-article pitches as websites using AI-generated content multiply

flood, internet, spam

WSJ.com: WSJD - Technology

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Comedian Sarah Silverman sues OpenAI and Meta

BBC NewsJul-12-2023, 11:44:54 GMT

The US comedian joins two other authors who claim their copyright was infringed to train AI systems.

comedian sarah silverman sue openai, silverman sue openai and meta

BBC News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.62)

Add feedback

Transformers in Reinforcement Learning: A Survey

Agarwal, Pranav, Rahman, Aamer Abdul, St-Charles, Pierre-Luc, Prince, Simon J. D., Kahou, Samira Ebrahimi

arXiv.org Artificial IntelligenceJul-12-2023

Transformers have significantly impacted domains like natural language processing, computer vision, and robotics, where they improve performance compared to other neural networks. This survey explores how transformers are used in reinforcement learning (RL), where they are seen as a promising solution for addressing challenges such as unstable training, credit assignment, lack of interpretability, and partial observability. We begin by providing a brief domain overview of RL, followed by a discussion on the challenges of classical RL algorithms. Next, we delve into the properties of the transformer and its variants and discuss the characteristics that make them well-suited to address the challenges inherent in RL. We examine the application of transformers to various aspects of RL, including representation learning, transition and reward function modeling, and policy optimization. We also discuss recent research that aims to enhance the interpretability and efficiency of transformers in RL, using visualization techniques and efficient training strategies. Often, the transformer architecture must be tailored to the specific needs of a given application. We present a broad overview of how transformers have been adapted for several applications, including robotics, medicine, language modeling, cloud computing, and combinatorial optimization. We conclude by discussing the limitations of using transformers in RL and assess their potential for catalyzing future breakthroughs in this field.

machine learning, natural language, reinforcement learning, (20 more...)

arXiv.org Artificial Intelligence

2307.05979

Country:

North America > Canada > Quebec (0.14)
Europe > United Kingdom (0.14)
Europe > Italy (0.14)
(9 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.34)

Industry:

Leisure & Entertainment > Games (1.00)
Energy > Oil & Gas > Upstream (1.00)
Banking & Finance > Trading (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

On the Computational Modeling of Meaning: Embodied Cognition Intertwined with Emotion

Kennington, Casey

arXiv.org Artificial IntelligenceJul-12-2023

How can machines understand language? is a question that many have asked, and represents an important facet of artificial intelligence. Large language models like ChatGPT seem to understand language, but as has been pointed out (Bender and Koller, 2020; Bisk et al., 2020), even large, powerful language models trained on huge amounts of data are likely missing key information to allow them to reach the depth of understanding that humans have. What information are they missing, and, perhaps more importantly, what information do they have that enables them to understand, to the degree that they do? Current computational models of semantic meaning can be broken down into three paradigms: distributional paradigms where meaning is derived from how words are used in text (i.e., the notion that the meaning of a word depends on the "company it keeps," following Firth (1957)) meaningfulness of language lies in the fact that it is about the world (Dahlgren, 1976) and grounded paradigms are where aspects of the physical world are linked to language (i.e., the symbol grounding problem following Harnad (1990)) formal paradigms where meaning is a logical form (e.g., first order logic as in L.T.F.

emotion, language model, robot, (17 more...)

arXiv.org Artificial Intelligence

2307.04518

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(5 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study

Min, Zeping, Wang, Jinbo

arXiv.org Artificial IntelligenceJul-12-2023

This paper explores the integration of Large Language Models (LLMs) into Automatic Speech Recognition (ASR) systems to improve transcription accuracy. The increasing sophistication of LLMs, with their in-context learning capabilities and instruction-following behavior, has drawn significant attention in the field of Natural Language Processing (NLP). Our primary focus is to investigate the potential of using an LLM's in-context learning capabilities to enhance the performance of ASR systems, which currently face challenges such as ambient noise, speaker accents, and complex linguistic contexts. We designed a study using the Aishell-1 and LibriSpeech datasets, with ChatGPT and GPT-4 serving as benchmarks for LLM capabilities. Unfortunately, our initial experiments did not yield promising results, indicating the complexity of leveraging LLM's in-context learning for ASR applications. Despite further exploration with varied settings and models, the corrected sentences from the LLMs frequently resulted in higher Word Error Rates (WER), demonstrating the limitations of LLMs in speech applications. This paper provides a detailed overview of these experiments, their results, and implications, establishing that using LLMs' in-context learning capabilities to correct potential errors in speech recognition transcriptions is still a challenging task at the current stage.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2307.0653

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Agreement Tracking for Multi-Issue Negotiation Dialogues

Mannekote, Amogh, Dorr, Bonnie J., Boyer, Kristy Elizabeth

arXiv.org Artificial IntelligenceJul-12-2023

Automated negotiation support systems aim to help human negotiators reach more favorable outcomes in multi-issue negotiations (e.g., an employer and a candidate negotiating over issues such as salary, hours, and promotions before a job offer). To be successful, these systems must accurately track agreements reached by participants in real-time. Existing approaches either focus on task-oriented dialogues or produce unstructured outputs, rendering them unsuitable for this objective. Our work introduces the novel task of agreement tracking for two-party multi-issue negotiations, which requires continuous monitoring of agreements within a structured state space. To address the scarcity of annotated corpora with realistic multi-issue negotiation dialogues, we use GPT-3 to build GPT-Negochat, a synthesized dataset that we make publicly available. We present a strong initial baseline for our task by transfer-learning a T5 model trained on the MultiWOZ 2.4 corpus. Pre-training T5-small and T5-base on MultiWOZ 2.4's DST task enhances results by 21% and 9% respectively over training solely on GPT-Negochat. We validate our method's sample-efficiency via smaller training subset experiments. By releasing GPT-Negochat and our baseline models, we aim to encourage further research in multi-issue negotiation dialogue agreement tracking.

computational linguistic, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2307.06524

Country:

North America > Dominican Republic (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(12 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.48)

Add feedback

Artificial Intelligence for Drug Discovery: Are We There Yet?

Hasselgren, Catrin, Oprea, Tudor I.

arXiv.org Artificial IntelligenceJul-12-2023

Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes. This review discusses the use of AI in the three pillars of drug discovery: diseases, targets, and therapeutic modalities, with a focus on small molecule drugs. AI technologies, such as generative chemistry, machine learning, and multi-property optimization, have enabled several compounds to enter clinical trials. The scientific community must carefully vet known information to address the reproducibility crisis. The full potential of AI in drug discovery can only be realized with sufficient ground truth and appropriate human intervention at later pipeline stages.

data mining, large language model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1146/annurev-pharmtox-040323-040828

2307.06521

Country:

North America > United States > California > San Francisco County > San Francisco (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
North America > United States > Washington > King County > Redmond (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(4 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Assessing the Ability of ChatGPT to Screen Articles for Systematic Reviews

Syriani, Eugene, David, Istvan, Kumar, Gauransh

arXiv.org Artificial IntelligenceJul-12-2023

By organizing knowledge within a research field, Systematic Reviews (SR) provide valuable leads to steer research. Evidence suggests that SRs have become first-class artifacts in software engineering. However, the tedious manual effort associated with the screening phase of SRs renders these studies a costly and error-prone endeavor. While screening has traditionally been considered not amenable to automation, the advent of generative AI-driven chatbots, backed with large language models is set to disrupt the field. In this report, we propose an approach to leverage these novel technological developments for automating the screening of SRs. We assess the consistency, classification performance, and generalizability of ChatGPT in screening articles for SRs and compare these figures with those of traditional classifiers used in SR automation. Our results indicate that ChatGPT is a viable option to automate the SR processes, but requires careful considerations from developers when integrating ChatGPT into their SR tools.

large language model, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

2307.06464

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.94)

Add feedback