AITopics

Technology: Information Technology > Artificial Intelligence (0.51)

Neural Information Processing SystemsFeb-9-2026, 06:44:00 GMT

Mars: SituatedInductiveReasoning inanOpen-WorldEnvironment

Yet, most of them rely on pre-stored knowledge. Inducing new general knowledge from a specific environment and performing reasoning with the acquired knowledge--situated inductive reasoning, is crucial and challenging for machine intelligence. In this paper, we design Mars, an interactive environment devised for situated inductive reasoning.

large language model, machine learning, natural language, (21 more...)

Country: Asia > China (0.04)

Genre: Research Report (0.93)

Industry:

Leisure & Entertainment (0.49)
Materials > Metals & Mining (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Dylan Hadfield-Menell, Smitha Milli, Pieter Abbeel, Stuart J. Russell, Anca Dragan

Inverse Reward Design

Neural Information Processing SystemsNov-21-2025, 06:51:15 GMT

Autonomous agents optimize the reward function we give them.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)
(2 more...)

Berden, Senne, Mahmutoğulları, Ali İrfan, Tsouros, Dimos, Guns, Tias

Solver-Free Decision-Focused Learning for Linear Optimization Problems

arXiv.org Artificial IntelligenceNov-13-2025

Mathematical optimization is a fundamental tool for decision-making in a wide range of applications. However, in many real-world scenarios, the parameters of the optimization problem are not known a priori and must be predicted from contextual features. This gives rise to predict-then-optimize problems, where a machine learning model predicts problem parameters that are then used to make decisions via optimization. A growing body of work on decision-focused learning (DFL) addresses this setting by training models specifically to produce predictions that maximize downstream decision quality, rather than accuracy. While effective, DFL is computationally expensive, because it requires solving the optimization problem with the predicted parameters at each loss evaluation. In this work, we address this computational bottleneck for linear optimization problems, a common class of problems in both DFL literature and real-world applications. We propose a solver-free training method that exploits the geometric structure of linear optimization to enable efficient training with minimal degradation in solution quality. Our method is based on the insight that a solution is optimal if and only if it achieves an objective value that is at least as good as that of its adjacent vertices on the feasible polytope. Building on this, our method compares the estimated quality of the ground-truth optimal solution with that of its precomputed adjacent vertices, and uses this as loss function. Experiments demonstrate that our method significantly reduces computational cost while maintaining high decision quality.

adjacent vertex, artificial intelligence, machine learning, (14 more...)

2505.22224

Country: Europe (0.28)

Genre:

Overview (0.93)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsOct-9-2025, 20:38:13 GMT

Mars: Situated Inductive Reasoning in an Open-World Environment Xiaojuan Tang

diamond, func, pickaxe, (15 more...)

Country:

Asia > China (0.04)
North America > United States > Massachusetts (0.04)
North America > Montserrat (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Workflow (0.93)
Research Report > New Finding (0.92)

Industry:

Materials > Metals & Mining > Diamonds (0.46)
Materials > Metals & Mining > Coal (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsOct-2-2025, 14:13:11 GMT

2f10c1578a0706e06b6d7db6f0b4a6af-AuthorFeedback.pdf

We thank the reviewers for their time and thoughtful feedback. This is what we were hoping for! 's main concern, and we take the opportunity's main critique is that there isn't a new method falling out of the formalism. We want to clarify that this is what is happening in Fig.1. This was our mistake, we will clarify!

artificial intelligence, experiment, formalism, (11 more...)

Technology: Information Technology > Artificial Intelligence (0.51)

Chen, Yiqun T., McCormick, Tyler H., Liu, Li, Datta, Abhirup

LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination

arXiv.org Artificial IntelligenceSep-12-2025

Verbal autopsy (VA) is a critical tool for estimating causes of death in resource-limited settings where medical certification is unavailable. This study presents LA-VA, a proof-of-concept pipeline that combines Large Language Models (LLMs) with traditional algorithmic approaches and embedding-based classification for improved cause-of-death prediction. Using the Population Health Metrics Research Consortium (PHMRC) dataset across three age categories (Adult: 7,580; Child: 1,960; Neonate: 2,438), we evaluate multiple approaches: GPT-5 predictions, LCVA baseline, text embed-dings, and meta-learner ensembles. Our results demonstrate that GPT-5 achieves the highest individual performance with average test site accuracies of 48.6% (Adult), 50.5% (Child), and 53.5% (Neonate), outperforming traditional statistical machine learning baselines by 5-10%. Our findings suggest that simple off-the-shelf LLM-assisted approaches could substantially improve verbal autopsy accuracy, with important implications for global health surveillance in low-resource settings.

language model assisted verbal autopsy, large language model, machine learning, (14 more...)

2509.09602

Country:

Africa (0.47)
North America > Mexico (0.29)
Asia > India (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Pediatrics/Neonatology (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Neural Information Processing SystemsAug-15-2025, 04:24:41 GMT

Supplementary Material Outline

The supplementary material is outlined as follows. Section D details how we apply POLCO in the robotics tasks. Concretely, this means that safety constraints in our environment are specified via language. The environment is procedurally generated. We collected natural language constraints in a two-step process.

constraint, policy network, supplementary material, (14 more...)

Genre: Workflow (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

arXiv.org Artificial IntelligenceDec-11-2024

From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning

Dong, Pusen, Zhu, Tianchen, Qiu, Yue, Zhou, Haoyi, Li, Jianxin

Safe reinforcement learning (RL) requires the agent to finish a given task while obeying specific constraints. Giving constraints in natural language form has great potential for practical scenarios due to its flexible transfer capability and accessibility. Previous safe RL methods with natural language constraints typically need to design cost functions manually for each constraint, which requires domain expertise and lacks flexibility. In this paper, we harness the dual role of text in this task, using it not only to provide constraint but also as a training signal. We introduce the Trajectory-level Textual Constraints Translator (TTCT) to replace the manually designed cost function. Our empirical results demonstrate that TTCT effectively comprehends textual constraint and trajectory, and the policies trained by TTCT can achieve a lower violation rate than the standard cost function. Extra studies are conducted to demonstrate that the TTCT has zero-shot transfer capability to adapt to constraint-shift environments.

constraint, machine learning, reinforcement learning, (16 more...)

2412.0892

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Leisure & Entertainment (0.67)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

arXiv.org Artificial IntelligenceOct-31-2024

Mars: Situated Inductive Reasoning in an Open-World Environment

Tang, Xiaojuan, Li, Jiaqi, Liang, Yitao, Zhu, Song-chun, Zhang, Muhan, Zheng, Zilong

Large Language Models (LLMs) trained on massive corpora have shown remarkable success in knowledge-intensive tasks. Yet, most of them rely on pre-stored knowledge. Inducing new general knowledge from a specific environment and performing reasoning with the acquired knowledge -- \textit{situated inductive reasoning}, is crucial and challenging for machine intelligence. In this paper, we design Mars, an interactive environment devised for situated inductive reasoning. It introduces counter-commonsense game mechanisms by modifying terrain, survival setting and task dependency while adhering to certain principles. In Mars, agents need to actively interact with their surroundings, derive useful rules and perform decision-making tasks in specific contexts. We conduct experiments on various RL-based and LLM-based methods, finding that they all struggle on this challenging situated inductive reasoning benchmark. Furthermore, we explore \textit{Induction from Reflection}, where we instruct agents to perform inductive reasoning from history trajectory. The superior performance underscores the importance of inductive reasoning in Mars. Through Mars, we aim to galvanize advancements in situated inductive reasoning and set the stage for developing the next generation of AI systems that can reason in an adaptive and context-sensitive way.

diamond, func, pickaxe, (15 more...)