AITopics | élément

Collaborating Authors

élément

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

CodeMirage: Hallucinations in Code Generated by Large Language Models

Agarwal, Vibhor, Pei, Yulong, Alamir, Salwa, Liu, Xiaomo

arXiv.org Artificial IntelligenceAug-14-2024

Large Language Models (LLMs) have shown promising potentials in program generation and no-code automation. However, LLMs are prone to generate hallucinations, i.e., they generate text which sounds plausible but is incorrect. Although there has been a recent surge in research on LLM hallucinations for text generation, similar hallucination phenomenon can happen in code generation. Sometimes the generated code can have syntactical or logical errors as well as more advanced issues like security vulnerabilities, memory leaks, etc. Given the wide adaptation of LLMs to enhance efficiency in code generation and development in general, it becomes imperative to investigate hallucinations in code generation. To the best of our knowledge, this is the first attempt at studying hallucinations in the code generated by LLMs. We start by introducing the code hallucination definition and a comprehensive taxonomy of code hallucination types. We propose the first benchmark CodeMirage dataset for code hallucinations. The benchmark contains 1,137 GPT-3.5 generated hallucinated code snippets for Python programming problems from two base datasets - HumanEval and MBPP. We then propose the methodology for code hallucination detection and experiment with open source LLMs such as CodeLLaMA as well as OpenAI's GPT-3.5 and GPT-4 models using one-shot prompt. We find that GPT-4 performs the best on HumanEval dataset and gives comparable results to the fine-tuned CodeBERT baseline on MBPP dataset. Towards the end, we discuss various mitigation strategies for code hallucinations and conclude our work.

code snippet, dataset, hallucination, (13 more...)

arXiv.org Artificial Intelligence

2408.08333

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (0.90)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations

Dedhia, Bhishma, Jha, Niraj K.

arXiv.org Artificial IntelligenceFeb-2-2024

Object-centric methods have seen significant progress in unsupervised decomposition of raw perception into rich object-like abstractions. However, limited ability to ground object semantics of the real world into the learned abstractions has hindered their adoption in downstream understanding applications. We present the Neural Slot Interpreter (NSI) that learns to ground and generate object semantics via slot representations. At the core of NSI is an XML-like programming language that uses simple syntax rules to organize the object semantics of a scene into object-centric program primitives. Then, an alignment model learns to ground program primitives into slots through a bi-level contrastive learning objective over a shared embedding space. Finally, we formulate the NSI program generator model to use the dense associations inferred from the alignment model to generate object-centric programs from slots. Experiments on bi-modal retrieval tasks demonstrate the efficacy of the learned alignments, surpassing set-matching-based predictors by a significant margin. Moreover, learning the program generator from grounded associations enhances the predictive power of slots. NSI generated programs demonstrate improved performance of object-centric learners on property prediction and object detection, and scale with real-world scene complexity.

dataset, representation, élément, (14 more...)

arXiv.org Artificial Intelligence

2403.07887

Country: North America > United States (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Improving Natural Language Capability of Code Large Language Model

Li, Wei, Zan, Daoguang, Guan, Bei, Yu, Ailun, Chen, Xiaolin, Wang, Yongji

arXiv.org Artificial IntelligenceJan-25-2024

Code large language models (Code LLMs) have demonstrated remarkable performance in code generation. Nonetheless, most existing works focus on boosting code LLMs from the perspective of programming capabilities, while their natural language capabilities receive less attention. To fill this gap, we thus propose a novel framework, comprising two modules: AttentionExtractor, which is responsible for extracting key phrases from the user's natural language requirements, and AttentionCoder, which leverages these extracted phrases to generate target code to solve the requirement. This framework pioneers an innovative idea by seamlessly integrating code LLMs with traditional natural language processing tools. To validate the effectiveness of the framework, we craft a new code generation benchmark, called MultiNL-H, covering five natural languages. Extensive experimental results demonstrate the effectiveness of our proposed framework.

code generation, code llm, natural language, (16 more...)

arXiv.org Artificial Intelligence

2401.14242

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Monaco (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Heilongjiang Province > Daqing (0.04)

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

SciLit: A Platform for Joint Scientific Literature Discovery, Summarization and Citation Generation

Gu, Nianlong, Hahnloser, Richard H. R.

arXiv.org Artificial IntelligenceNov-6-2023

Scientific writing involves retrieving, summarizing, and citing relevant papers, which can be time-consuming processes in large and rapidly evolving fields. By making these processes inter-operable, natural language processing (NLP) provides opportunities for creating end-to-end assistive writing tools. We propose SciLit, a pipeline that automatically recommends relevant papers, extracts highlights, and suggests a reference sentence as a citation of a paper, taking into consideration the user-provided context and keywords. SciLit efficiently recommends papers from large databases of hundreds of millions of papers using a two-stage pre-fetching and re-ranking literature search system that flexibly deals with addition and removal of a paper database. We provide a convenient user interface that displays the recommended papers as extractive summaries and that offers abstractively-generated citing sentences which are aligned with the provided context and which mention the chosen keyword(s). Our assistive tool for literature discovery and scientific writing is available at https://scilit.vercel.app

citation sentence, computational linguistic, keyword, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.acl-demo.22

2306.03535

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Switzerland > Zürich > Zürich (0.05)
Asia > China > Hong Kong (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)

Add feedback

Towards Realistic Single-Task Continuous Learning Research for NER

Payan, Justin, Merhav, Yuval, Xie, He, Krishna, Satyapriya, Ramakrishna, Anil, Sridhar, Mukund, Gupta, Rahul

arXiv.org Artificial IntelligenceOct-27-2021

There is an increasing interest in continuous learning (CL), as data privacy is becoming a priority for real-world machine learning applications. Meanwhile, there is still a lack of academic NLP benchmarks that are applicable for realistic CL settings, which is a major challenge for the advancement of the field. In this paper we discuss some of the unrealistic data characteristics of public datasets, study the challenges of realistic single-task continuous learning as well as the effectiveness of data rehearsal as a way to mitigate accuracy loss. We construct a CL NER dataset from an existing publicly available dataset and release it along with the code to the research community.

dataset, entity type, userinterfaceelem, (13 more...)

arXiv.org Artificial Intelligence

2110.14694

Country: North America > United States > Massachusetts > Hampshire County > Amherst (0.14)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology > Security & Privacy (1.00)
Education > Educational Setting > Continuing Education (0.81)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)

Add feedback

HPP-77-39

AI ClassicsJan-25-2015, 20:48:52 GMT

In the early days of computing, these goals were central to the new discipline called cybernetics [126], [2]. Over the past two decades, progress toward these goals has come from a variety of fields - notably computer science, psychology, adaptive control theory, pattern recognition, and philosophy. Substantial progress has been made in developing techniques for machine learning in highly restricted environments.

elsevier, machine learning, relx group plc, (30 more...)

AI Classics

Country: North America > United States > California (0.68)

Genre:

Overview (0.48)
Research Report (0.40)

Industry:

Leisure & Entertainment > Games (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.95)
(3 more...)

Add feedback

Report 77 14 A Model for Learning Systems . Stanford Reid G. Smith Tom M. Mitchell Richard A. Bruce G. Buchanan

AI ClassicsJan-25-2015, 20:42:11 GMT

C. Richard Johnson, Jr. provided very helpful comments on adaptive control systems. We received many valuable suggestions from members of the Heuristic Programming Project at Stanford. 2 Supported by the Research and Development Branch of the Department of National Defence of Canada.

elsevier, relx group plc, united states department of defense, (35 more...)

AI Classics

Country: North America > United States > California (0.68)

Genre: Research Report (0.40)

Industry:

Leisure & Entertainment > Games (1.00)
Government > Regional Government > > > > > > > North America Government (0.93)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

élément

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

CodeMirage: Hallucinations in Code Generated by Large Language Models

Neural Slot Interpreters: Grounding Object Semantics in Emergent Slot Representations

Improving Natural Language Capability of Code Large Language Model

SciLit: A Platform for Joint Scientific Literature Discovery, Summarization and Citation Generation

GitHub - jindongwang/MachineLearning: 一些关于机器学习的学习资料与研究介绍

Towards Realistic Single-Task Continuous Learning Research for NER

HPP-77-39

Report 77 14 A Model for Learning Systems . Stanford Reid G. Smith Tom M. Mitchell Richard A. Bruce G. Buchanan