AITopics | memento

Collaborating Authors

memento

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Number of Times Ranked Best T Memory-Enhanced Neural Solvers for Routing Problems

Neural Information Processing SystemsJun-16-2026, 01:08:15 GMT

Attention, learn to solve routing problems!

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Energy (0.92)
Transportation (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Memory-Enhanced Neural Solvers for Routing Problems

Neural Information Processing SystemsJun-11-2026, 11:03:21 GMT

Routing Problems are central to many real-world applications, yet remain challenging due to their (NP-)hard nature. Amongst existing approaches, heuristics often offer the best trade-off between quality and scalability, making them suitable for industrial use. While Reinforcement Learning (RL) offers a flexible framework for designing heuristics, its adoption over handcrafted heuristics remains incomplete. Existing learned methods still lack the ability to adapt to specific instances and fully leverage the available computational budget. Current best methods either rely on a collection of pre-trained policies, or on RL fine-tuning; hence failing to fully utilize newly available information within the constraints of the budget. In response, we present MEMENTO, an approach that leverages memory to improve the search of neural solvers at inference. MEMENTO updates the action distribution dynamically based on the outcome of previous decisions. We validate its effectiveness on Traveling Salesman and Capacitated Vehicle Routing problems, demonstrating its superiority over tree-search and policy-gradient fine-tuning; and showing that it can be zero-shot combined with diversity-based solvers. We successfully train all RL auto-regressive solvers on large instances, and verify MEMENTO's scalability and data-efficiency: pushing the state-of-the-art on 11 out of 12 evaluated tasks.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Industry: Transportation (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

Learning Correlated Reward Models: Statistical Barriers and Opportunities

Cherapanamjeri, Yeshwanth, Daskalakis, Constantinos, Farina, Gabriele, Mohammadpour, Sobhan

arXiv.org Machine LearningOct-20-2025

Random Utility Models (RUMs) are a classical framework for modeling user preferences and play a key role in reward modeling for Reinforcement Learning from Human Feedback (RLHF). However, a crucial shortcoming of many of these techniques is the Independence of Irrelevant Alternatives (IIA) assumption, which collapses \emph{all} human preferences to a universal underlying utility function, yielding a coarse approximation of the range of human preferences. On the other hand, statistical and computational guarantees for models avoiding this assumption are scarce. In this paper, we investigate the statistical and computational challenges of learning a \emph{correlated} probit model, a fundamental RUM that avoids the IIA assumption. First, we establish that the classical data collection paradigm of pairwise preference data is \emph{fundamentally insufficient} to learn correlational information, explaining the lack of statistical and computational guarantees in this setting. Next, we demonstrate that \emph{best-of-three} preference data provably overcomes these shortcomings, and devise a statistically and computationally efficient estimator with near-optimal performance. These results highlight the benefits of higher-order preference data in learning correlated utilities, allowing for more fine-grained modeling of human preferences. Finally, we validate these theoretical guarantees on several real-world datasets, demonstrating improved personalization of human preferences.

artificial intelligence, machine learning, royal tenenbaum, (18 more...)

arXiv.org Machine Learning

2510.15839

Country:

North America > United States (0.93)
Europe (0.92)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Leisure & Entertainment (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Zhou, Huichi, Chen, Yihang, Guo, Siyuan, Yan, Xue, Lee, Kin Hei, Wang, Zihan, Lee, Ka Yiu, Zhang, Guchun, Shao, Kun, Yang, Linyi, Wang, Jun

arXiv.org Artificial IntelligenceAug-26-2025

In this paper, we introduce a novel learning paradigm for Adaptive Large Language Model (LLM) agents that eliminates the need for fine-tuning the underlying LLMs. Existing approaches are often either rigid, relying on static, handcrafted reflection workflows, or computationally intensive, requiring gradient updates of LLM model parameters. In contrast, our method enables low-cost continual adaptation via memory-based online reinforcement learning. We formalise this as a Memory-augmented Markov Decision Process (M-MDP), equipped with a neural case-selection policy to guide action decisions. Past experiences are stored in an episodic memory, either differentiable or non-parametric. The policy is continually updated based on environmental feedback through a memory rewriting mechanism, whereas policy improvement is achieved through efficient memory reading (retrieval). We instantiate our agent model in the deep research setting, namely \emph{Memento}, which attains top-1 on GAIA validation ($87.88\%$ Pass@$3$) and $79.40\%$ on the test set. It reaches $66.6\%$ F1 and $80.4\%$ PM on the DeepResearcher dataset, outperforming the state-of-the-art training-based method, while case-based memory adds $4.7\%$ to $9.6\%$ absolute points on out-of-distribution tasks. Our approach offers a scalable and efficient pathway for developing generalist LLM agents capable of continuous, real-time learning without gradient updates, advancing machine learning towards open-ended skill acquisition and deep research scenarios. The code is available at https://github.com/Agent-on-the-Fly/Memento.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2508.16153

Genre:

Research Report (0.40)
Instructional Material (0.34)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Memento: Note-Taking for Your Future Self

Wan, Chao, Gong, Albert, Mishra, Mihir, Henneking, Carl-Leander, Beger, Claas, Weinberger, Kilian Q.

arXiv.org Artificial IntelligenceJun-26-2025

Large language models (LLMs) excel at reasoning-only tasks, but struggle when reasoning must be tightly coupled with retrieval, as in multi-hop question answering. To overcome these limitations, we introduce a prompting strategy that first decomposes a complex question into smaller steps, then dynamically constructs a database of facts using LLMs, and finally pieces these facts together to solve the question. We show how this three-stage strategy, which we call Memento, can boost the performance of existing prompting strategies across diverse settings. On the 9-step PhantomWiki benchmark, Memento doubles the performance of chain-of-thought (CoT) when all information is provided in context. On the open-domain version of 2WikiMultiHopQA, CoT-RAG with Memento improves over vanilla CoT-RAG by more than 20 F1 percentage points and over the multi-hop RAG baseline, IRCoT, by more than 13 F1 percentage points. On the challenging MuSiQue dataset, Memento improves ReAct by more than 3 F1 percentage points, demonstrating its utility in agentic settings.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2506.20642

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Memory-Enhanced Neural Solvers for Efficient Adaptation in Combinatorial Optimization

Chalumeau, Felix, Shabe, Refiloe, de Nicola, Noah, Pretorius, Arnu, Barrett, Thomas D., Grinsztajn, Nathan

arXiv.org Artificial IntelligenceJun-24-2024

Combinatorial Optimization is crucial to numerous real-world applications, yet still presents challenges due to its (NP-)hard nature. Amongst existing approaches, heuristics often offer the best trade-off between quality and scalability, making them suitable for industrial use. While Reinforcement Learning (RL) offers a flexible framework for designing heuristics, its adoption over handcrafted heuristics remains incomplete within industrial solvers. Existing learned methods still lack the ability to adapt to specific instances and fully leverage the available computational budget. The current best methods either rely on a collection of pre-trained policies, or on data-inefficient fine-tuning; hence failing to fully utilize newly available information within the constraints of the budget. In response, we present MEMENTO, an RL approach that leverages memory to improve the adaptation of neural solvers at inference time. MEMENTO enables updating the action distribution dynamically based on the outcome of previous decisions. We validate its effectiveness on benchmark problems, in particular Traveling Salesman and Capacitated Vehicle Routing, demonstrating it can successfully be combined with standard methods to boost their performance under a given budget, both in and out-of-distribution, improving their performance on all 12 evaluated tasks.

compass, memento, neural information processing system, (13 more...)

arXiv.org Artificial Intelligence

2406.16424

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Africa > South Africa > Western Cape > Cape Town (0.04)

Genre: Research Report (0.82)

Industry:

Energy (0.46)
Transportation (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Wang, Xiyao, Zhou, Yuhang, Liu, Xiaoyu, Lu, Hongjin, Xu, Yuancheng, He, Feihong, Yoon, Jaehong, Lu, Taixi, Bertasius, Gedas, Bansal, Mohit, Yao, Huaxiu, Huang, Furong

arXiv.org Artificial IntelligenceJan-24-2024

Multimodal Large Language Models (MLLMs) have demonstrated proficiency in handling a variety of visual-language tasks. However, current MLLM benchmarks are predominantly designed to evaluate reasoning based on static information about a single image, and the ability of modern MLLMs to extrapolate from image sequences, which is essential for understanding our ever-changing world, has been less investigated. To address this challenge, this paper introduces Mementos, a new benchmark designed to assess MLLMs' sequential image reasoning abilities. Mementos features 4,761 diverse image sequences with varying lengths. We also employ a GPT-4 assisted method to evaluate MLLM reasoning performance. Through a careful evaluation of nine recent MLLMs on Mementos, including GPT-4V and Gemini, we find that they struggle to accurately describe dynamic information about given image sequences, often leading to hallucinations/misrepresentations of objects and their corresponding behaviors. Our quantitative analysis and case studies identify three key factors impacting MLLMs' sequential image reasoning: the correlation between object and behavioral hallucinations, the influence of cooccurring behaviors, and the compounding impact of behavioral hallucinations. Our dataset is available at https://github.com/umd-huang-lab/Mementos.

hallucination, image sequence, mllm, (11 more...)

arXiv.org Artificial Intelligence

2401.10529

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > Maryland > Prince George's County > College Park (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.46)
Government > Military (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Memento: Facilitating Effortless, Efficient, and Reliable ML Experiments

Pullar-Strecker, Zac, Chang, Xinglong, Brydon, Liam, Ziogas, Ioannis, Dost, Katharina, Wicker, Jörg

arXiv.org Artificial IntelligenceApr-17-2023

Running complex sets of machine learning experiments is challenging and time-consuming due to the lack of a unified framework. This leaves researchers forced to spend time implementing necessary features such as parallelization, caching, and checkpointing themselves instead of focussing on their project. To simplify the process, in this paper, we introduce Memento, a Python package that is designed to aid researchers and data scientists in the efficient management and execution of computationally intensive experiments. Memento has the capacity to streamline any experimental pipeline by providing a straightforward configuration matrix and the ability to concurrently run experiments across multiple threads.

configuration matrix, experiment, memento, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-031-43430-3_21

2304.09175

Country:

North America > United States > Mississippi > Lafayette County > Oxford (0.15)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.06)
North America > United States > New York > New York County > New York City (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Artificial intelligence makes some progress, but robots still can't match humans

AITopics Original LinksJan-18-2017, 12:07:49 GMT

When you call your bank, the robot on the other end doesn't want you to communicate using your touch-tone keypad anymore. No, it insists that you just speak to it, sometimes even adding, "You can use a wide variety of words." Your car is trying to emasculate you by taking over the parallel parking duties. And computers have long since drained all the fun out of chess. Fortunately, most robots aren't the complicated emotional beings that star in movies, and we're still pretty good at identifying android impostors.

artificial intelligence, computer, match human, (7 more...)

AITopics Original Links

Country: North America > United States > Michigan (0.06)

Technology: Information Technology > Artificial Intelligence > Robots (0.96)

Add feedback

On the Ranch with the Creators of "Westworld"

The New YorkerDec-16-2016, 14:35:03 GMT

My day job, in lieu of teaching creative writing like a normal person, is writing scripts for blockbuster video games. Last summer, while I watched a play-through of the then-unreleased Gears of War 4, for which I was the lead writer, something odd happened. The game's story called for a massive plane crash, out of which a single robot, operatically aflame, was intended to stride toward the player. Within the game's fiction, robots have hitherto opposed the player, but we wanted this particular burning robot to pose no immediate threat. The game programmers had thus switched off the hostility driven by the robot's artificial intelligence, allowing the player to walk past the hapless robot or shoot it. Most of us on the development team, I think, hoped our game's future players wouldn't shoot. Just ahead of the encounter we placed what is referred to, in game design, as a frontgate--a kind of contrived environmental blockage intended to prevent players from rushing too far ahead, which can mess up loading times.

artificial intelligence, nolan, westworld, (18 more...)

The New Yorker

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.04)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)

Industry:

Transportation (1.00)
Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback