AITopics

2410.12172

Country:

Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)

Genre:

Research Report (1.00)
Overview (0.88)

Industry: Energy (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
(4 more...)

Sridhara, Shashank N., Pavez, Eduardo, Ortega, Antonio

Towards joint graph learning and sampling set selection from data

arXiv.org Artificial IntelligenceDec-16-2024

We explore the problem of sampling graph signals in scenarios where the graph structure is not predefined and must be inferred from data. In this scenario, existing approaches rely on a two-step process, where a graph is learned first, followed by sampling. More generally, graph learning and graph signal sampling have been studied as two independent problems in the literature. This work provides a foundational step towards jointly optimizing the graph structure and sampling set. Our main contribution, Vertex Importance Sampling (VIS), is to show that the sampling set can be effectively determined from the vertex importance (node weights) obtained from graph learning. We further propose Vertex Importance Sampling with Repulsion (VISR), a greedy algorithm where spatially -separated "important" nodes are selected to ensure better reconstruction. Empirical results on simulated data show that sampling using VIS and VISR leads to competitive reconstruction performance and lower complexity than the conventional two-step approach of graph learning followed by graph sampling.

algorithm, artificial intelligence, machine learning, (15 more...)

2412.09753

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)

arXiv.org Artificial IntelligenceDec-16-2024

Survey on safe robot control via learning

Mabsout, Bassel El

Modern society heavily relies on robotic systems, their use affects the aerospace, automotive, energy, disaster response, health care, manufacturing, and traffic management industries among countless others. From making robots walk Westervelt et al. [2007] to getting molecular swarms to kill cancer cells Wijewardhane et al. [2022], whole fields of research dedicate themselves to the problem of control. Intelligently selecting control strategies so that we can manage, direct, or command the trajectories a system can take distills the essence of problems faced in control. When a system can be controlled in the aforementioned manner using control loops, the system in question is termed a control system. Tackling the problem of control, the research community has produced many alternative solutions with varying trade-offs concerning what is achievable and how much we can represent these systems and our goals.

controller, machine learning, reinforcement learning, (18 more...)

2501.01432

Country:

North America > United States > Massachusetts (0.28)
Europe > United Kingdom > England (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (1.00)
Leisure & Entertainment > Games (0.93)
Aerospace & Defense (0.88)
Transportation > Air (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)
(2 more...)

Riviere, Benjamin, Lathrop, John, Chung, Soon-Jo

Monte Carlo Tree Search with Spectral Expansion for Planning with Dynamical Systems

The ability of a robot to plan complex behaviors with real-time computation, rather than adhering to predesigned or offline-learned routines, alleviates the need for specialized algorithms or training for each problem instance. Monte Carlo Tree Search is a powerful planning algorithm that strategically explores simulated future possibilities, but it requires a discrete problem representation that is irreconcilable with the continuous dynamics of the physical world. We present Spectral Expansion Tree Search (SETS), a real-time, tree-based planner that uses the spectrum of the locally linearized system to construct a low-complexity and approximately equivalent discrete representation of the continuous world. We prove SETS converges to a bound of the globally optimal solution for continuous, deterministic and differentiable Markov Decision Processes, a broad class of problems that includes underactuated nonlinear dynamics, non-convex reward functions, and unstructured environments. We experimentally validate SETS on drone, spacecraft, and ground vehicle robots and one numerical experiment, each of which is not directly solvable with existing methods. We successfully show SETS automatically discovers a diverse set of optimal behaviors and motion trajectories in real time.

artificial intelligence, machine learning, trajectory, (18 more...)

doi: 10.1126/scirobotics.ado1010

2412.1127

Country:

Europe (1.00)
North America > United States > California (0.45)

Genre: Research Report (1.00)

Industry:

Energy > Oil & Gas > Upstream (0.93)
Transportation > Air (0.68)
Government > Regional Government > North America Government > United States Government (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Antoniades, Antonis, Örwall, Albert, Zhang, Kexun, Xie, Yuxi, Goyal, Anirudh, Wang, William

Software engineers operating in complex and dynamic environments must continuously adapt to evolving requirements, learn iteratively from experience, and reconsider their approaches based on new insights. However, current large language model (LLM)-based software agents often rely on rigid processes and tend to repeat ineffective actions without the capacity to evaluate their performance or adapt their strategies over time. To address these challenges, we propose SWE-Search, a multi-agent framework that integrates Monte Carlo Tree Search (MCTS) with a self-improvement mechanism to enhance software agents' performance on repository-level software tasks. SWE-Search extends traditional MCTS by incorporating a hybrid value function that leverages LLMs for both numerical value estimation and qualitative evaluation. This enables self-feedback loops where agents iteratively refine their strategies based on both quantitative numerical evaluations and qualitative natural language assessments of pursued trajectories. The framework includes a SWE-Agent for adaptive exploration, a Value Agent for iterative feedback, and a Discriminator Agent that facilitates multi-agent debate for collaborative decision-making. Applied to the SWE-bench benchmark, our approach demonstrates a 23% relative improvement in performance across five models compared to standard open-source agents without MCTS. Our analysis reveals how performance scales with increased search depth and identifies key factors that facilitate effective self-evaluation in software agents. This work highlights the potential of self-evaluation driven search techniques to enhance agent reasoning and planning in complex, dynamic software engineering environments.

large language model, machine learning, natural language, (18 more...)

2410.20285

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

Theoretical Analysis of Quality Diversity Algorithms for a Classical Path Planning Problem

Dang, Duc-Cuong, Neumann, Aneta, Neumann, Frank, Opris, Andre, Sudholt, Dirk

In recent years, computing diverse sets of high quality solutions for combinatorial optimisation problems has gained significant attention in the area of artificial intelligence from both theoretical (Baste et al., 2022, 2019; Fomin et al., 2024; Hanaka et al., 2023) and experimental (Vonásek and Saska, 2018; Ingmar et al., 2020) perspectives. Prominent examples where diverse sets of high quality solutions are sought come from the area of path planning (Hanaka et al., 2021; Gao et al., 2022). Particularly, quality diversity (QD) algorithms have shown to produce excellent results for challenging problems in the areas such as robotics (Miao et al., 2022; Shen et al., 2020), games (Cully and Demiris, 2018) and combinatorial optimisation (Nikfarjam et al., 2024a). This work contributes to the theoretical understanding of QD algorithms. Such algorithms compute several solutions that occupy different areas of a so-called behavioural space. Approaches that use a multidimensional archive of phenotypic elites, called Map-Elites (Mouret and Clune, 2015), are among the most commonly used QD algorithms.

artificial intelligence, evolutionary algorithm, machine learning, (13 more...)

2412.11446

Country:

Europe > Germany (0.04)
Oceania > Australia > South Australia > Adelaide (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)

Rasul, Hezha O., Ghafour, Dlzar D., Aziz, Bakhtyar K., Hassan, Bryar A., Rashid, Tarik A., Kivrak, Arif

Decoding Drug Discovery: Exploring A-to-Z In silico Methods for Beginners

The drug development process is a critical challenge in the pharmaceutical industry due to its time-consuming nature and the need to discover new drug potentials to address various ailments. The initial step in drug development, drug target identification, often consumes considerable time. While valid, traditional methods such as in vivo and in vitro approaches are limited in their ability to analyze vast amounts of data efficiently, leading to wasteful outcomes. To expedite and streamline drug development, an increasing reliance on computer-aided drug design (CADD) approaches has merged. These sophisticated in silico methods offer a promising avenue for efficiently identifying viable drug candidates, thus providing pharmaceutical firms with significant opportunities to uncover new prospective drug targets. The main goal of this work is to review in silico methods used in the drug development process with a focus on identifying therapeutic targets linked to specific diseases at the genetic or protein level. This article thoroughly discusses A-to-Z in silico techniques, which are essential for identifying the targets of bioactive compounds and their potential therapeutic effects. This review intends to improve drug discovery processes by illuminating the state of these cutting-edge approaches, thereby maximizing the effectiveness and duration of clinical trials for novel drug target investigation.

bioinformatics, data mining, machine learning, (19 more...)

doi: 10.1007/s12010-024-05110-2

2412.11137

Country:

North America > United States (0.93)
Asia > Middle East > Iraq > Kurdistan Region > Sulaymaniyah Governorate > Sulaymaniyah (0.04)
Europe > Portugal > Braga > Braga (0.04)
(8 more...)

Genre:

Workflow (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)
Research Report > New Finding (0.87)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
(4 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(4 more...)

ViSymRe: Vision-guided Multimodal Symbolic Regression

Li, Da, Yin, Junping, Xu, Jin, Li, Xinxin, Zhang, Juan

Symbolic regression automatically searches for mathematical equations to reveal underlying mechanisms within datasets, offering enhanced interpretability compared to black box models. Traditionally, symbolic regression has been considered to be purely numeric-driven, with insufficient attention given to the potential contributions of visual information in augmenting this process. When dealing with high-dimensional and complex datasets, existing symbolic regression models are often inefficient and tend to generate overly complex equations, making subsequent mechanism analysis complicated. In this paper, we propose the vision-guided multimodal symbolic regression model, called ViSymRe, that systematically explores how visual information can improve various metrics of symbolic regression. Compared to traditional models, our proposed model has the following innovations: (1) It integrates three modalities: vision, symbol and numeric to enhance symbolic regression, enabling the model to benefit from the strengths of each modality; (2) It establishes a meta-learning framework that can learn from historical experiences to efficiently solve new symbolic regression problems; (3) It emphasizes the simplicity and structural rationality of the equations rather than merely numerical fitting. Extensive experiments show that our proposed model exhibits strong generalization capability and noise resistance. The equations it generates outperform state-of-the-art numeric-only baselines in terms of fitting effect, simplicity and structural accuracy, thus being able to facilitate accurate mechanism analysis and the development of theoretical models.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

2412.11139

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > China > Shanghai > Shanghai (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)

Unveiling Language Skills via Path-Level Circuit Discovery

Chen, Hang, Zhu, Jiaying, Yang, Xinyu, Wang, Wenya

Circuit discovery with edge-level ablation has become a foundational framework for mechanism interpretability of language models. However, its focus on individual edges often overlooks the sequential, path-level causal relationships that underpin complex behaviors, thus potentially leading to misleading or incomplete circuit discoveries. To address this issue, we propose a novel path-level circuit discovery framework capturing how behaviors emerge through interconnected linear chain and build towards complex behaviors. Our framework is constructed upon a fully-disentangled linear combinations of ``memory circuits'' decomposed from the original model. To discover functional circuit paths, we leverage a 2-step pruning strategy by first reducing the computational graph to a faithful and minimal subgraph and then applying causal mediation to identify common paths of a specific skill, termed as skill paths. In contrast to circuit graph from existing works, we focus on the complete paths of a generic skill rather than on the fine-grained responses to individual components of the input. To demonstrate this, we explore three generic language skills, namely Previous Token Skill, Induction Skill and In-Context Learning Skill using our framework and provide more compelling evidence to substantiate stratification and inclusiveness of these skills.

artificial intelligence, machine learning, natural language, (20 more...)

2410.01334

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
Europe > France (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.67)
Media (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)

arXiv.org Machine LearningDec-15-2024

Minimax Regret Estimation for Generalizing Heterogeneous Treatment Effects with Multisite Data

Zhang, Yi, Huang, Melody, Imai, Kosuke

To test scientific theories and develop individualized treatment rules, researchers often wish to learn heterogeneous treatment effects that can be consistently found across diverse populations and contexts. We consider the problem of generalizing heterogeneous treatment effects (HTE) based on data from multiple sites. A key challenge is that a target population may differ from the source sites in unknown and unobservable ways. This means that the estimates from site-specific models lack external validity, and a simple pooled analysis risks bias. We develop a robust CATE (conditional average treatment effect) estimation methodology with multisite data from heterogeneous populations. We propose a minimax-regret framework that learns a generalizable CATE model by minimizing the worst-case regret over a class of target populations whose CATE can be represented as convex combinations of site-specific CATEs. Using robust optimization, the proposed methodology accounts for distribution shifts in both individual covariates and treatment effect heterogeneity across sites. We show that the resulting CATE model has an interpretable closed-form solution, expressed as a weighted average of site-specific CATE models. Thus, researchers can utilize a flexible CATE estimation method within each site and aggregate site-specific estimates to produce the final model. Through simulations and a real-world application, we show that the proposed methodology improves the robustness and generalizability of existing approaches.

artificial intelligence, cate model, machine learning, (17 more...)

arXiv.org Machine Learning

2412.11136

Country:

Asia > Mongolia (0.05)
Africa > Middle East > Morocco (0.05)
North America > Mexico (0.05)
(2 more...)

Genre: Research Report > Experimental Study (0.68)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)