AITopics | Aamand, Anders

Collaborating Authors

Aamand, Anders

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Learning-Augmented Frequent Directions

Aamand, Anders, Chen, Justin Y., Gollapudi, Siddharth, Silwal, Sandeep, Wu, Hao

arXiv.org Artificial IntelligenceMar-2-2025

An influential paper of Hsu et al. (ICLR'19) introduced the study of learning-augmented streaming algorithms in the context of frequency estimation. A fundamental problem in the streaming literature, the goal of frequency estimation is to approximate the number of occurrences of items appearing in a long stream of data using only a small amount of memory. Hsu et al. develop a natural framework to combine the worst-case guarantees of popular solutions such as CountMin and CountSketch with learned predictions of high frequency elements. They demonstrate that learning the underlying structure of data can be used to yield better streaming algorithms, both in theory and practice. We simplify and generalize past work on learning-augmented frequency estimation. Our first contribution is a learning-augmented variant of the Misra-Gries algorithm which improves upon the error of learned CountMin and learned CountSketch and achieves the state-of-the-art performance of randomized algorithms (Aamand et al., NeurIPS'23) with a simpler, deterministic algorithm. Our second contribution is to adapt learning-augmentation to a high-dimensional generalization of frequency estimation corresponding to finding important directions (top singular vectors) of a matrix given its rows one-by-one in a stream. We analyze a learning-augmented variant of the Frequent Directions algorithm, extending the theoretical and empirical understanding of learned predictions to matrix streaming.

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2503.00937

Country:

North America > United States (0.93)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

Statistical-Computational Trade-offs for Density Estimation

Aamand, Anders, Andoni, Alexandr, Chen, Justin Y., Indyk, Piotr, Narayanan, Shyam, Silwal, Sandeep, Xu, Haike

arXiv.org Machine LearningOct-30-2024

We study the density estimation problem defined as follows: given $k$ distributions $p_1, \ldots, p_k$ over a discrete domain $[n]$, as well as a collection of samples chosen from a ``query'' distribution $q$ over $[n]$, output $p_i$ that is ``close'' to $q$. Recently~\cite{aamand2023data} gave the first and only known result that achieves sublinear bounds in {\em both} the sampling complexity and the query time while preserving polynomial data structure space. However, their improvement over linear samples and time is only by subpolynomial factors. Our main result is a lower bound showing that, for a broad class of data structures, their bounds cannot be significantly improved. In particular, if an algorithm uses $O(n/\log^c k)$ samples for some constant $c>0$ and polynomial space, then the query time of the data structure must be at least $k^{1-O(1)/\log \log k}$, i.e., close to linear in the number of distributions $k$. This is a novel \emph{statistical-computational} trade-off for density estimation, demonstrating that any data structure must use close to a linear number of samples or take close to linear query time. The lower bound holds even in the realizable case where $q=p_i$ for some $i$, and when the distributions are flat (specifically, all distributions are uniform over half of the domain $[n]$). We also give a simple data structure for our lower bound instance with asymptotically matching upper bounds. Experiments show that the data structure is quite efficient in practice.

algorithm, artificial intelligence, data structure, (15 more...)

arXiv.org Machine Learning

2410.23087

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.82)

Add feedback

Improved Frequency Estimation Algorithms with and without Predictions

Aamand, Anders, Chen, Justin Y., Nguyen, Huy Lê, Silwal, Sandeep, Vakilian, Ali

arXiv.org Artificial IntelligenceDec-12-2023

Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketching algorithms to the specific data distribution they are being run on. In particular, their learning-augmented frequency estimation algorithm uses a learned heavy-hitter oracle which predicts which elements will appear many times in the stream. We give a novel algorithm, which in some parameter regimes, already theoretically outperforms the learning based algorithm of Hsu et al. without the use of any predictions. Augmenting our algorithm with heavy-hitter predictions further reduces the error and improves upon the state of the art. Empirically, our algorithms achieve superior performance in all experiments compared to prior approaches.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2312.07535

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)

Add feedback

Data Structures for Density Estimation

Aamand, Anders, Andoni, Alexandr, Chen, Justin Y., Indyk, Piotr, Narayanan, Shyam, Silwal, Sandeep

arXiv.org Artificial IntelligenceJun-20-2023

We study statistical/computational tradeoffs for the following density estimation problem: given $k$ distributions $v_1, \ldots, v_k$ over a discrete domain of size $n$, and sampling access to a distribution $p$, identify $v_i$ that is "close" to $p$. Our main result is the first data structure that, given a sublinear (in $n$) number of samples from $p$, identifies $v_i$ in time sublinear in $k$. We also give an improved version of the algorithm of Acharya et al. (2018) that reports $v_i$ in time linear in $k$. The experimental evaluation of the latter algorithm shows that it achieves a significant reduction in the number of operations needed to achieve a given accuracy compared to prior work.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2306.11312

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre:

Research Report (0.50)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.62)

Add feedback

Improved Space Bounds for Learning with Experts

Aamand, Anders, Chen, Justin Y., Nguyen, Huy Lê, Silwal, Sandeep

arXiv.org Artificial IntelligenceMar-2-2023

Understanding the performance of learning algorithms under information constraints is a fundamental research direction in machine learning. While performance notions such as regret in online learning have been well explored, a recent line of work explores additional constraints in learning, with a particular emphasis on limited memory [Sha14, WS19, MSSV22] (see also Section 3). In this paper, we focus on the online learning with experts problem, a general framework for sequential decision making, with memory constraints. In the online learning with experts problem, an algorithm must make predictions about the outcome of an event for T consecutive days based on the predictions of n experts. The predictions of the algorithm at a time t T can only depend on the information it has received in the previous days as well as the predictions of the experts for day t. After predictions are made, the true outcome is revealed and the algorithm and all experts receive some loss (likely depending on the accuracy of their predictions). In addition to the fact that the online experts problem has found numerous algorithmic applications [AHK12], studying the problem with memory constraints is especially interesting in light of the fact that existing algorithms explicitly track the cumulative loss of every expert and follow the advice of a leading expert, which requires Ω(n) memory. Motivated by this lack of understanding, the online learning with experts problem with memory constraints was recently introduced in [SWXZ22], which studied the case where the losses of the experts form an i.i.d.

algorithm, artificial intelligence, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2303.01453

Country:

North America > United States (1.00)
Europe (1.00)
North America > Canada (0.93)

Genre: Research Report (0.50)

Industry: Education > Educational Setting > Online (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.94)

Add feedback

Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks

Aamand, Anders, Chen, Justin Y., Indyk, Piotr, Narayanan, Shyam, Rubinfeld, Ronitt, Schiefer, Nicholas, Silwal, Sandeep, Wagner, Tal

arXiv.org Artificial IntelligenceDec-21-2022

Recent work shows that the expressive power of Graph Neural Networks (GNNs) in distinguishing non-isomorphic graphs is exactly the same as that of the Weisfeiler-Lehman (WL) graph test. In particular, they show that the WL test can be simulated by GNNs. However, those simulations involve neural networks for the 'combine' function of size polynomial or even exponential in the number of graph nodes $n$, as well as feature vectors of length linear in $n$. We present an improved simulation of the WL test on GNNs with \emph{exponentially} lower complexity. In particular, the neural network implementing the combine function in each node has only a polylogarithmic number of parameters in $n$, and the feature vectors exchanged by the nodes of GNN consists of only $O(\log n)$ bits. We also give logarithmic lower bounds for the feature vector length and the size of the neural networks, showing the (near)-optimality of our construction.

artificial intelligence, machine learning, neural network, (20 more...)

arXiv.org Artificial Intelligence

2211.03232

Country: North America > United States (0.68)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

(Optimal) Online Bipartite Matching with Degree Information

Aamand, Anders, Chen, Justin Y., Indyk, Piotr

arXiv.org Artificial IntelligenceNov-14-2022

We propose a model for online graph problems where algorithms are given access to an oracle that predicts (e.g., based on modeling assumptions or on past data) the degrees of nodes in the graph. Within this model, we study the classic problem of online bipartite matching, and a natural greedy matching algorithm called MinPredictedDegree, which uses predictions of the degrees of offline nodes. For the bipartite version of a stochastic graph model due to Chung, Lu, and Vu where the expected values of the offline degrees are known and used as predictions, we show that MinPredictedDegree stochastically dominates any other online algorithm, i.e., it is optimal for graphs drawn from this model. Since the "symmetric" version of the model, where all online nodes are identical, is a special case of the well-studied "known i.i.d. model", it follows that the competitive ratio of MinPredictedDegree on such inputs is at least 0.7299. For the special case of graphs with power law degree distributions, we show that MinPredictedDegree frequently produces matchings almost as large as the true maximum matching on such graphs. We complement these results with an extensive empirical evaluation showing that MinPredictedDegree compares favorably to state-of-the-art online algorithms for online matching.

algorithm, artificial intelligence, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2110.11439

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.92)

Industry: Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.67)

Add feedback