AITopics

Country: North America > Canada > British Columbia (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing Systems, Feb-18-2026, 00:22:22 GMT

f6b22ac37beb5da61efd4882082c9ecd-Paper-Conference.pdf

experience memory, large language model, machine learning, (19 more...)

Country:

Asia > China > Shanghai > Shanghai (0.04)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > New York > Richmond County > New York City (0.04)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.78)

Neural Information Processing Systems, Feb-16-2026, 12:09:06 GMT

LIMA: Less Is More for Alignment

Moreover, the model tends to generalize well to unseen tasks that did not appear in the training data.

large language model, machine learning, natural language, (21 more...)

Country:

North America > United States > California (0.14)
Asia > Bhutan (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
(3 more...)

Genre: Research Report (0.67)

Industry:

Banking & Finance > Economy (1.00)
Energy (0.93)
Government > Regional Government > North America Government > United States Government (0.47)
Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Communications > Social Media (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)

Neural Information Processing Systems, Oct-9-2025, 12:03:57 GMT

Large Language Models Are Semi-Parametric Reinforcement Learning Agents

As declared by Seifert et al. [1997], the episodic memory of experiences from past episodes plays a crucial role in the complex decision-making processes of humans [Suddendorf and Corballis, 2007]. By recollecting the experiences from past episodes, humans can learn from success to repeat it and learn from failure to avoid it.

experience memory, large language model, machine learning, (20 more...)

Country:

Asia > China > Shanghai > Shanghai (0.04)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Maryland > Baltimore (0.04)
Asia > China > Hong Kong (0.04)

Industry: Health & Medicine (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Neural Information Processing Systems, Oct-9-2025, 04:32:43 GMT

LIMA: Less Is More for Alignment

Moreover, the model tends to generalize well to unseen tasks that did not appear in the training data.

large language model, machine learning, natural language, (21 more...)

Country:

North America > United States > California (0.14)
Asia > Bhutan (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
(3 more...)

Genre: Research Report (0.67)

Industry:

Banking & Finance > Economy (1.00)
Energy (0.93)
Government > Regional Government > North America Government > United States Government (0.47)
Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Communications > Social Media (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)

arXiv.org Artificial Intelligence, Jul-2-2024

Learning Action Conditions from Instructional Manuals for Instruction Understanding

Wu, Te-Lin, Zhang, Caiqi, Hu, Qingyuan, Spangher, Alex, Peng, Nanyun

The ability to infer pre- and postconditions of an action is vital for comprehending complex instructions, and is essential for applications such as autonomous instruction-guided agents and assistive AI that supports humans in performing physical tasks. In this work, we propose a task dubbed action condition inference and collect a high-quality, human-annotated dataset of preconditions and postconditions of actions in instructional manuals. We propose a weakly supervised approach to automatically construct large-scale training instances from online instructional manuals, and curate a densely human-annotated and validated dataset to study how well current NLP models can infer action-condition dependencies in instruction texts. We design two types of models that differ in whether contextualized and global information is leveraged, as well as various combinations of heuristics to construct the weak supervision. Our experimental results show a >20% F1-score improvement from considering the entire instruction context and a >6% F1-score benefit from the proposed heuristics.

instructable, postcondition, text segment, (15 more...)

2205.1242

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Pennsylvania (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Instructional Material > Training Manual (0.81)
Research Report (0.69)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Wu, Te-Lin, Spangher, Alex, Alipoormolabashi, Pegah, Freedman, Marjorie, Weischedel, Ralph, Peng, Nanyun

Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals

arXiv.org Artificial Intelligence, Feb-20-2024

The ability to sequence unordered events is an essential skill for comprehending and reasoning about real-world task procedures. It often requires a thorough understanding of temporal common sense and multimodal information, as these procedures are often communicated through a combination of texts and images. Such a capability is essential for applications such as sequential task planning and multi-source instruction summarization. While humans are capable of reasoning about and sequencing unordered multimodal procedural instructions, whether current machine learning models have this capability is still an open question. In this work, we benchmark models' capability of reasoning over and sequencing unordered multimodal instructions by curating datasets from popular online instructional manuals and collecting comprehensive human annotations. We find that models not only perform significantly worse than humans but also seem incapable of efficiently utilizing the multimodal information. To improve machines' performance on multimodal event sequencing, we propose sequentiality-aware pretraining techniques that exploit the sequential alignment properties of both texts and images, resulting in significant improvements of more than 5%.

category, dataset, wikihow, (16 more...)

2110.08486

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China (0.04)
North America > United States > New York (0.04)
Asia > Macao (0.04)

Genre:

Research Report (0.82)
Instructional Material > Training Manual (0.60)

Industry:

Education > Educational Setting > Online (0.88)
Education > Educational Technology > Educational Software > Computer Based Training (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Artificial Intelligence, Dec-9-2023

GEMINI: Controlling the Sentence-level Writing Style for Abstractive Text Summarization

Bao, Guangsheng, Ou, Zebin, Zhang, Yue

Human experts write summaries using different techniques, including extracting a sentence from the document and rewriting it, or fusing various pieces of information from the document to abstract it. These techniques are flexible and thus difficult for any single method to imitate. To address this issue, we propose an adaptive model, GEMINI, that integrates a rewriter and a generator to mimic the sentence rewriting and abstracting techniques, respectively. GEMINI adaptively chooses to rewrite a specific document sentence or generate a summary sentence from scratch. Experiments demonstrate that our adaptive approach outperforms the pure abstractive and rewriting baselines on three benchmark datasets, achieving the best results on WikiHow. Interestingly, empirical results show that the human summary styles of summary sentences are consistently predictable given their context. We release our code and model at https://github.com/baoguangsheng/gemini.

generator, summarization, summary sentence, (16 more...)

2304.03548

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
Asia > Philippines (0.04)
Asia > Myanmar > Mandalay Region > Mandalay (0.04)
(7 more...)

Genre: Research Report > New Finding (0.88)

Industry: Leisure & Entertainment > Sports (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

arXiv.org Artificial Intelligence, Oct-29-2023

Large Language Models Are Semi-Parametric Reinforcement Learning Agents

Zhang, Danyang, Chen, Lu, Zhang, Situo, Xu, Hongshen, Zhao, Zihan, Yu, Kai

Inspired by insights from cognitive science into human memory and reasoning mechanisms, a novel evolvable LLM-based (Large Language Model) agent framework is proposed as REMEMBERER. By equipping the LLM with a long-term experience memory, REMEMBERER is capable of exploiting experiences from past episodes even for different task goals, which gives it an advantage over an LLM-based agent with fixed exemplars or one equipped with a transient working memory. We further introduce Reinforcement Learning with Experience Memory (RLEM) to update the memory. Thus, the whole system can learn from the experiences of both success and failure, and evolve its capability without fine-tuning the parameters of the LLM. In this way, the proposed REMEMBERER constitutes a semi-parametric RL agent. Extensive experiments are conducted on two RL task sets to evaluate the proposed framework. The average results with different initialization and training sets exceed the prior SOTA by 4% and 2% for the success rate on the two task sets and demonstrate the superiority and robustness of REMEMBERER.

agent, arxiv, experience memory, (16 more...)

2306.07929

Country:

Asia > China > Shanghai > Shanghai (0.04)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > New York > Richmond County > New York City (0.04)
(7 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)