AITopics

2502.10473

Genre:

Research Report (0.82)
Personal > Honors (0.46)

Industry: Energy > Oil & Gas > Upstream (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Kumar, Navdeep, Gupta, Adarsh, Elfatihi, Maxence Mohamed, Ramponi, Giorgia, Levy, Kfir Yehuda, Mannor, Shie

Dual Formulation for Non-Rectangular Lp Robust Markov Decision Processes

arXiv.org Artificial IntelligenceFeb-13-2025

We study robust Markov decision processes (RMDPs) with non-rectangular uncertainty sets, which capture interdependencies across states unlike traditional rectangular models. While non-rectangular robust policy evaluation is generally NP-hard, even in approximation, we identify a powerful class of $L_p$-bounded uncertainty sets that avoid these complexity barriers due to their structural simplicity. We further show that this class can be decomposed into infinitely many \texttt{sa}-rectangular $L_p$-bounded sets and leverage its structural properties to derive a novel dual formulation for $L_p$ RMDPs. This formulation provides key insights into the adversary's strategy and enables the development of the first robust policy evaluation algorithms for non-rectangular RMDPs. Empirical results demonstrate that our approach significantly outperforms brute-force methods, establishing a promising foundation for future investigation into non-rectangular robust MDPs.

artificial intelligence, machine learning, optimization problem, (16 more...)

2502.09432

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.45)

Mazaré, Pierre-Emmanuel, Szilvasy, Gergely, Lomeli, Maria, Massa, Francisco, Murray, Naila, Jégou, Hervé, Douze, Matthijs

Inference-time sparse attention with asymmetric indexing

arXiv.org Artificial IntelligenceFeb-12-2025

Self-attention in transformer models is an incremental associative memory that maps key vectors to value vectors. One way to speed up self-attention is to employ GPU-compliant vector search algorithms, yet the standard partitioning methods yield poor results in this context, because (1) keys and queries follow different distributions and (2) the effect of RoPE positional encoding. In this paper, we introduce SAAP (Self-Attention with Asymmetric Partitions), which overcomes these problems. It is an asymmetrical indexing technique that employs distinct partitions for keys and queries, thereby approximating self-attention with a data-adaptive sparsity pattern. It works on pretrained language models without finetuning, as it only requires to train (offline) a small query classifier. On a long context Llama 3.1-8b model, with sequences ranging from 100k to 500k tokens, our method typically reduces by a factor 20 the fraction of memory that needs to be looked-up, which translates to a time saving of 60\% when compared to FlashAttention-v2.

artificial intelligence, machine learning, natural language, (17 more...)

2502.08246

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Croella, Anna Livia, Piccialli, Veronica, Sudoso, Antonio M.

Strong bounds for large-scale Minimum Sum-of-Squares Clustering

arXiv.org Artificial IntelligenceFeb-12-2025

Clustering is a fundamental technique in data analysis and machine learning, used to group similar data points together. Among various clustering methods, the Minimum Sum-of-Squares Clustering (MSSC) is one of the most widely used. MSSC aims to minimize the total squared Euclidean distance between data points and their corresponding cluster centroids. Due to the unsupervised nature of clustering, achieving global optimality is crucial, yet computationally challenging. The complexity of finding the global solution increases exponentially with the number of data points, making exact methods impractical for large-scale datasets. Even obtaining strong lower bounds on the optimal MSSC objective value is computationally prohibitive, making it difficult to assess the quality of heuristic solutions. We address this challenge by introducing a novel method to validate heuristic MSSC solutions through optimality gaps. Our approach employs a divide-and-conquer strategy, decomposing the problem into smaller instances that can be handled by an exact solver. The decomposition is guided by an auxiliary optimization problem, the "anticlustering problem", for which we design an efficient heuristic. Computational experiments demonstrate the effectiveness of the method for large-scale instances, achieving optimality gaps below 3% in most cases while maintaining reasonable computational times. These results highlight the practicality of our approach in assessing feasible clustering solutions for large datasets, bridging a critical gap in MSSC evaluation.

algorithm, artificial intelligence, machine learning, (18 more...)

2502.08397

Country:

Europe > Italy (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

arXiv.org Artificial IntelligenceFeb-12-2025

LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search

Gao, Yang, Yang, Hong, Chen, Yizhi, Wu, Junxian, Zhang, Peng, Wang, Haishuai

Graph Neural Architecture Search (GNAS) facilitates the automatic design of Graph Neural Networks (GNNs) tailored to specific downstream graph learning tasks. However, existing GNAS approaches often require manual adaptation to new graph search spaces, necessitating substantial code optimization and domain-specific knowledge. To address this challenge, we present LLM4GNAS, a toolkit for GNAS that leverages the generative capabilities of Large Language Models (LLMs). LLM4GNAS includes an algorithm library for graph neural architecture search algorithms based on LLMs, enabling the adaptation of GNAS methods to new search spaces through the modification of LLM prompts. This approach reduces the need for manual intervention in algorithm adaptation and code modification. The LLM4GNAS toolkit is extensible and robust, incorporating LLM-enhanced graph feature engineering, LLM-enhanced graph neural architecture search, and LLM-enhanced hyperparameter optimization. Experimental results indicate that LLM4GNAS outperforms existing GNAS methods on tasks involving both homogeneous and heterogeneous graphs.

large language model, machine learning, natural language, (17 more...)

2502.10459

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > New South Wales > Sydney (0.05)
Asia > China > Guangdong Province > Guangzhou (0.05)
(8 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing SystemsFeb-11-2025, 22:29:38 GMT

Review for NeurIPS paper: Optimal visual search based on a model of target detectability in natural images

This paper presents a method to measure target detectability in natural images. It provides a visual search model (based on extracted features of a pre-trained CNN) to perform target detectability as a function of retinal eccentricity for human vision. Reviewers, including myself, appreciate that this paper tackles a topic that has not been well investigated in the visual search literature. The approach is well-motivated and paper is well written, and comparison with human data is a nice validation of the approach. There were issues concerning correctness of the approach, along with minor points, but the author's rebuttal has done an adequate job in addressing the concerns and I expect to see the camera ready version of the paper incorporate improvements to at will improve the clarity of the paper (esp with regards to reviewer's main concerns) using the extra page. I think this will be a nice addition to the NeurIPS2020 conference encouraging the community to look at a fresh topic, so I'm going to recommend we accept this work as a poster.

natural image, optimal visual search, target detectability, (1 more...)

Technology:

Information Technology > Information Management > Search (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)

Neural Information Processing SystemsFeb-11-2025, 21:19:29 GMT

Review for NeurIPS paper: Minimax Value Interval for Off-Policy Evaluation and Policy Optimization

Weaknesses: The study of bias issue is important, but I am not fully convinced the motivation of this so-called "confidence interval". Normally the confidence interval is designed for uncertain quantification and thus of great practical interest. However, although the authors explicitly point out they do not consider uncertainties, this will rule out all the important applications that typical CI could do (safe RL or else) (this CI will not be valid in practice due to estimation error). Thus, I can only view the contribution in this paper as sort of additional guarantee for the algorithm proposed in "Minimax Weight and Q-Function Learning for Off-Policy Evaluation" since the algorithms are the same. Solely quantifying a bias of an existing estimator may not be viewed as sufficiently significant.

minimax value interval, neurips paper, off-policy evaluation and policy optimization, (3 more...)

Genre: Summary/Review (0.41)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.64)

Neural Information Processing SystemsFeb-11-2025, 21:19:22 GMT

Review for NeurIPS paper: Minimax Value Interval for Off-Policy Evaluation and Policy Optimization

The paper provides a very general minimax framework for quantifying the bias/approximation error in off-policy evaluation, and the results apply to a range of OPE methods. Reviewers generally agree that this is a good paper and there is contribution. One potentially improvable direction would be to quantify the statistical noise in off-policy evaluation, which is nontrivial but extremely important. Reviewers, AC and SAC also agree that such analysis could be left for future work. We would also like to strongly suggest that the authors consider rephrase/explain the wording "confidence interval".

approximation error, minimax value interval, off-policy evaluation and policy optimization, (1 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)

Neural Information Processing SystemsFeb-11-2025, 20:53:34 GMT

Review for NeurIPS paper: Minimax Classification with 0-1 Loss and Performance Guarantees

Summary and Contributions: This paper presents minimax risk classifiers (MRCs) that do not rely on a choice of surrogate loss and family of rules. The goal of MRC is to find a classification rule that minimize the worst-case expected 0-1 loss with respect to a class of possible distributions. It first represents data, probability distributions and classification rules by matrices. The estimated classifier is cast as a linear optimization problem in which the uncertainty set is cast as the linear constraints. Some performance guarantees are proved, and numerical comparisons are conducted.

loss and performance guarantee, minimax classification, probability distribution, (6 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.64)

Neural Information Processing SystemsFeb-11-2025, 20:53:27 GMT

Review for NeurIPS paper: Minimax Classification with 0-1 Loss and Performance Guarantees

This paper presents an interesting new perspective on the design of learning methods: the idea is to choose a classifier that minimizes the risk function uniformly over a family of distributions, constructed based on an iid data set, with the guarantee that (with high probability) the true data-generating distribution is contained in the family. This inherently supplies an upper bound on the risk of the chosen classifier. The family of distributions is generated by constraints on the expectation of a function Phi of (x,y), using data-dependent confidence bounds on its true expectation to set the constraints. Thus, the method is highly dependent on the choice of the function Phi. One significant concern noted by the reviewers is that the paper doesn't seem to explore this dependence in much depth, such as providing an array of illustrative examples and design principles for Phi, discussion of how choices of Phi for a given sample size may relate to notions of expressiveness and overfitting, or checking whether the technique can provide guarantees competitive with known results obtained by more traditional approaches (e.g., kernel methods, or ERM guarantees from uniform convergence).

loss and performance guarantee, minimax classification, neurips paper, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.98)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)