AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

ChatGPT sets its sights on university! AI bot can now reason as well as the average college student, study claims

Daily Mail - Science & techJul-31-2023, 15:47:02 GMT

Artificial intelligence can now reason as well as the average college student. Dr Geoffrey Hinton, who is seen as one of the godfathers of AI, warned recently that the technology'may soon be' more intelligent than people. Now it appears AI has mastered a type of intelligence called'analogical reasoning' which was previously believed to be uniquely human. Analogical reasoning means working out a solution to a completely new problem by using experience from previous similar problems. Given one type of test requiring this reasoning, the AI language programme GPT-3 beat the average score among 40 university students.

average college student, reasoning, student, (12 more...)

Daily Mail - Science & tech

Country: North America > United States > California > Los Angeles County > Los Angeles (0.16)

Genre: Research Report > Experimental Study (0.66)

Industry: Education > Educational Setting > Higher Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Meta Has A.I. Google Has A.I. Microsoft Has A.I. Amazon Has a Plan.

SlateJul-31-2023, 13:00:00 GMT

This article is from Big Technology, a newsletter by Alex Kantrowitz. Amazon's absence from this year's generative–A.I. bonanza has been a bit puzzling. The company invented Alexa, intuiting people's interest in speaking with computers, yet when OpenAI released ChatGPT it seemed to cede the territory. But rather than sitting out the game, Amazon is waiting to play on its terms. Instead of building one A.I. product, it wants a piece of all of them.

amazon, developer, microsoft, (9 more...)

Slate

Industry: Information Technology (0.52)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.60)

Add feedback

I'm a Hack, by ChatGPT

The New YorkerJul-31-2023, 10:00:00 GMT

One of the Writers Guild of America strike issues is me. Writers are better than me! If I was good, I would have an Emmy. That's because I have no idea how to write anything interesting or that sounds like it was written by a real human being. That is why I am writing this op-ed.

anyhoo, chatgpt, real writer, (4 more...)

The New Yorker

Country: North America > United States (0.06)

Industry:

Leisure & Entertainment (0.53)
Media > Film (0.52)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

An Effective Data Creation Pipeline to Generate High-quality Financial Instruction Data for Large Language Model

Wang, Ziao, Wang, Jianning, Wu, Junda, Zhang, Xiaofeng

arXiv.org Artificial IntelligenceJul-31-2023

At the beginning era of large language model, it is quite critical to generate a high-quality financial dataset to fine-tune a large language model for financial related tasks. Thus, this paper presents a carefully designed data creation pipeline for this purpose. Particularly, we initiate a dialogue between an AI investor and financial expert using ChatGPT and incorporate the feedback of human financial experts, leading to the refinement of the dataset. This pipeline yielded a robust instruction tuning dataset comprised of 103k multi-turn chats. Extensive experiments have been conducted on this dataset to evaluate the model's performance by adopting an external GPT-4 as the judge. The promising experimental results verify that our approach led to significant advancements in generating accurate, relevant, and financial-style responses from AI models, and thus providing a powerful tool for applications within the financial sector.

dataset, instruction, language model, (11 more...)

arXiv.org Artificial Intelligence

2308.01415

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Banking & Finance (1.00)
Materials > Metals & Mining > Lithium (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Add feedback

Ontology engineering with Large Language Models

Mateiu, Patricia, Groza, Adrian

arXiv.org Artificial IntelligenceJul-31-2023

We tackle the task of enriching ontologies by automatically translating natural language sentences into Description Logic. Since Large Language Models (LLMs) are the best tools for translations, we fine-tuned a GPT-3 model to convert Natural Language sentences into OWL Functional Syntax. We employ objective and concise examples to fine-tune the model regarding: instances, class subsumption, domain and range of relations, object properties relationships, disjoint classes, complements, cardinality restrictions. The resulted axioms are used to enrich an ontology, in a human supervised manner. The developed tool is publicly provided as a Protge plugin.

declaration, namedindividual, ontology, (14 more...)

arXiv.org Artificial Intelligence

2307.16699

Country:

Europe > Romania > Nord-Vest Development Region > Cluj County > Cluj-Napoca (0.05)
North America > United States > Georgia > Clarke County > Athens (0.04)
Europe > France > Occitanie > Hérault > Montpellier (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Unsupervised Improvement of Audio-Text Cross-Modal Representations

Wang, Zhepei, Subakan, Cem, Subramani, Krishna, Wu, Junkai, Tavares, Tiago, Ayres, Fabio, Smaragdis, Paris

arXiv.org Artificial IntelligenceJul-31-2023

Recent advances in using language models to obtain cross-modal audio-text representations have overcome the limitations of conventional training approaches that use predefined labels. This has allowed the community to make progress in tasks like zero-shot classification, which would otherwise not be possible. However, learning such representations requires a large amount of human-annotated audio-text pairs. In this paper, we study unsupervised approaches to improve the learning framework of such representations with unpaired text and audio. We explore domain-unspecific and domain-specific curation methods to create audio-text pairs that we use to further improve the model. We also show that when domain-specific curation is used in conjunction with a soft-labeled contrastive loss, we are able to obtain significant improvement in terms of zero-shot classification performance on downstream sound event classification or acoustic scene classification tasks.

dataset, improvement-set, representation, (17 more...)

arXiv.org Artificial Intelligence

2305.01864

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Education (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)

Add feedback

NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers?

Eger, Steffen, Leiter, Christoph, Belouadi, Jonas, Zhang, Ran, Kostikova, Aida, Larionov, Daniil, Chen, Yanran, Fresen, Vivian

arXiv.org Artificial IntelligenceJul-31-2023

The rapid growth of information in the field of Generative Artificial Intelligence (AI), particularly in the subfields of Natural Language Processing (NLP) and Machine Learning (ML), presents a significant challenge for researchers and practitioners to keep pace with the latest developments. To address the problem of information overload, this report by the Natural Language Learning Group at Bielefeld University focuses on identifying the most popular papers on arXiv, with a specific emphasis on NLP and ML. The objective is to offer a quick guide to the most relevant and widely discussed research, aiding both newcomers and established researchers in staying abreast of current trends. In particular, we compile a list of the 40 most popular papers based on normalized citation counts from the first half of 2023. We observe the dominance of papers related to Large Language Models (LLMs) and specifically ChatGPT during the first half of 2023, with the latter showing signs of declining popularity more recently, however. Further, NLP related papers are the most influential (around 60\% of top papers) even though there are twice as many ML related papers in our data. Core issues investigated in the most heavily cited papers are: LLM efficiency, evaluation techniques, ethical considerations, embodied agents, and problem-solving with LLMs. Additionally, we examine the characteristics of top papers in comparison to others outside the top-40 list (noticing the top paper's focus on LLM related issues and higher number of co-authors) and analyze the citation distributions in our dataset, among others.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2308.04889

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report (1.00)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.88)

Add feedback

Getting from Generative AI to Trustworthy AI: What LLMs might learn from Cyc

Lenat, Doug, Marcus, Gary

arXiv.org Artificial IntelligenceJul-31-2023

Generative AI, the most popular current approach to AI, consists of large language models (LLMs) that are trained to produce outputs that are plausible, but not necessarily correct. Although their abilities are often uncanny, they are lacking in aspects of reasoning, leading LLMs to be less than completely trustworthy. Furthermore, their results tend to be both unpredictable and uninterpretable. We lay out 16 desiderata for future AI, and discuss an alternative approach to AI which could theoretically address many of the limitations associated with current approaches: AI educated with curated pieces of explicit knowledge and rules of thumb, enabling an inference engine to automatically deduce the logical entailments of all that knowledge. Even long arguments produced this way can be both trustworthy and interpretable, since the full step-by-step line of reasoning is always available, and for each step the provenance of the knowledge used can be documented and audited. There is however a catch: if the logical language is expressive enough to fully represent the meaning of anything we can say in English, then the inference engine runs much too slowly. That's why symbolic AI systems typically settle for some fast but much less expressive logic, such as knowledge graphs. We describe how one AI system, Cyc, has developed ways to overcome that tradeoff and is able to reason in higher order logic in real time. We suggest that any trustworthy general AI will need to hybridize the approaches, the LLM approach and more formal approach, and lay out a path to realizing that dream.

large language model, logic & formal reasoning, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2308.04445

Country:

Asia > Russia (0.14)
Europe > Ukraine (0.04)
North America > United States > New York (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry:

Government > Regional Government (0.68)
Leisure & Entertainment (0.67)
Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
(4 more...)

Add feedback

FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis

Wang, Ziao, Li, Yuhang, Wu, Junda, Soon, Jaehyeon, Zhang, Xiaofeng

arXiv.org Artificial IntelligenceJul-31-2023

In this paper, we propose FinVis-GPT, a novel multimodal large language model (LLM) specifically designed for financial chart analysis. By leveraging the power of LLMs and incorporating instruction tuning and multimodal capabilities, FinVis-GPT is capable of interpreting financial charts and providing valuable analysis. To train FinVis-GPT, a financial task oriented dataset was generated for pre-training alignment and instruction tuning, comprising various types of financial charts and their corresponding descriptions. We evaluate the model performance via several case studies due to the time limit, and the promising results demonstrated that FinVis-GPT is superior in various financial chart related tasks, including generating descriptions, answering questions and predicting future market trends, surpassing existing state-of-the-art multimodal LLMs. The proposed FinVis-GPT serves as a pioneering effort in utilizing multimodal LLMs in the finance domain and our generated dataset will be release for public use in the near future to speedup related research.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2308.0143

Country:

Asia > China > Heilongjiang Province > Harbin (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.42)

Industry: Banking & Finance > Trading (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

HouYi: An open-source large language model specially designed for renewable energy and carbon neutrality field

Bai, Mingliang, Zhou, Zhihao, Wang, Ruidong, Yang, Yusheng, Qin, Zizhen, Chen, Yunxiao, Mu, Chunjin, Liu, Jinfu, Yu, Daren

arXiv.org Artificial IntelligenceJul-31-2023

Renewable energy is important for achieving carbon neutrality goal. With the great success of Large Language Models (LLMs) like ChatGPT in automatic content generation, LLMs are playing an increasingly important role. However, there has not been a specially designed LLM for renewable energy. Meanwhile, there has not been any dataset of renewable energy for training LLMs. Therefore, this paper published the first open-source Renewable Energy Academic Paper (REAP) dataset for non-commercial LLM research of renewable energy. REAP dataset is collected through searching the title and abstract of 1,168,970 academic literatures from Web of Science. Based on REAP dataset, HouYi model, the first LLM for renewable energy, is developed through finetuning general LLMs. HouYi demonstrated powerful academic paper paragraph generation ability in renewable energy field. Experiments show that its ability to generate academic papers on renewable energy is comparable to ChatGPT, slightly outperforms Claude, ERNIE Bot and SparkDesk, and significantly outperforms open-source LLaMA-13B model.

large language model, machine learning, wind energy conversion, (17 more...)

arXiv.org Artificial Intelligence

2308.01414

Country:

North America > United States (0.14)
Asia > China > Heilongjiang Province > Harbin (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Renewable > Wind (0.79)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback