
Collaborating Authors

 Lugosi, Gabor


A note on estimating the dimension from a random geometric graph

arXiv.org Machine Learning

Let $G_n$ be a random geometric graph with vertex set $[n]$ based on $n$ i.i.d.\ random vectors $X_1,\ldots,X_n$ drawn from an unknown density $f$ on $\mathbb{R}^d$. An edge $(i,j)$ is present when $\|X_i -X_j\| \le r_n$, for a given threshold $r_n$ possibly depending on $n$, where $\| \cdot \|$ denotes Euclidean distance. We study the problem of estimating the dimension $d$ of the underlying space when we have access to the adjacency matrix of the graph but do not know $r_n$ or the vectors $X_i$. The main result of the paper is that there exists an estimator of $d$ that converges to $d$ in probability as $n \to \infty$ for all densities with $\int f^5 < \infty$ whenever $n^{3/2} r_n^d \to \infty$ and $r_n = o(1)$. The conditions allow very sparse graphs, since when $n^{3/2} r_n^d \to 0$ the graph contains only isolated edges, with high probability. We also show that, without any condition on the density, a consistent estimator of $d$ exists when $n r_n^d \to \infty$ and $r_n = o(1)$.
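
As a rough illustration of the observation model (not of the paper's estimator), the sketch below builds the adjacency matrix of such a random geometric graph; the standard Gaussian density used here is an arbitrary choice for illustration only.

```python
# Minimal simulation of the random geometric graph model: n i.i.d. points from
# an (illustrative) standard Gaussian density on R^d, connected when their
# Euclidean distance is at most r. The paper's dimension estimator is not
# reproduced here; this only constructs the observed adjacency matrix.
import numpy as np

def random_geometric_graph(n: int, d: int, r: float, seed: int = 0) -> np.ndarray:
    """Return the n x n adjacency matrix for points drawn from N(0, I_d)."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, d))                       # X_1, ..., X_n in R^d
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    A = (dist <= r).astype(int)
    np.fill_diagonal(A, 0)                                # no self-loops
    return A

if __name__ == "__main__":
    A = random_geometric_graph(n=500, d=3, r=0.5)
    print("average degree:", A.sum() / A.shape[0])
```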


Broadcasting in random recursive dags

arXiv.org Artificial Intelligence

A uniform $k$-DAG generalizes the uniform random recursive tree by picking $k$ parents uniformly at random from the existing nodes. It starts with $k$ "roots". Each of the $k$ roots is assigned a bit. These bits are propagated through a noisy channel: each parent's bit is flipped with probability $p$, and the receiving node takes a majority vote. When all nodes have received their bits, the $k$-DAG is shown without identifying the roots. The goal is to estimate the majority bit among the roots. We identify the threshold for $p$, as a function of $k$, below which the majority rule among all nodes yields an error $c+o(1)$ with $c<1/2$. Above the threshold, the majority rule errs with probability $1/2+o(1)$.
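
A minimal simulation of this broadcasting process, with illustrative assumptions not fixed by the abstract (root bits drawn i.i.d. uniform, parents sampled without replacement, ties broken toward 0):

```python
# Sketch of the k-dag broadcasting process: each new node picks k parents among
# the existing nodes, receives each parent's bit through a binary symmetric
# channel with flip probability p, and stores the majority of the received bits.
# The final estimate of the root majority is the majority vote over all nodes.
import random

def simulate_kdag(n: int, k: int, p: float, seed: int = 0) -> tuple[int, int]:
    """Return (majority bit among the k roots, majority vote over all n nodes)."""
    rng = random.Random(seed)
    bits = [rng.randint(0, 1) for _ in range(k)]             # root bits (assumed i.i.d. uniform)
    for _ in range(k, n):
        parents = rng.sample(range(len(bits)), k)            # k parents among existing nodes
        received = [bits[j] ^ (rng.random() < p) for j in parents]  # each bit flipped w.p. p
        bits.append(1 if 2 * sum(received) > k else 0)        # majority of received bits
    root_majority = 1 if 2 * sum(bits[:k]) > k else 0
    overall_vote = 1 if 2 * sum(bits) > len(bits) else 0
    return root_majority, overall_vote

if __name__ == "__main__":
    runs = [simulate_kdag(n=2000, k=5, p=0.1, seed=s) for s in range(50)]
    agree = sum(root == vote for root, vote in runs)
    print(f"majority vote recovered the root majority in {agree}/50 runs")
```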


Multivariate mean estimation with direction-dependent accuracy

arXiv.org Machine Learning

We consider the problem of estimating the mean of a random vector based on $N$ independent, identically distributed observations. We prove the existence of an estimator that has a near-optimal error in all directions in which the variance of the one-dimensional marginal of the random vector is not too small: with probability $1-\delta$, the procedure returns $\widehat{\mu}_N$ which satisfies, for every direction $u \in S^{d-1}$, \[ \langle \widehat{\mu}_N - \mu, u\rangle \le \frac{C}{\sqrt{N}} \left( \sigma(u)\sqrt{\log(1/\delta)} + \left(\mathbb{E}\|X-\mathbb{E} X\|_2^2\right)^{1/2} \right), \] where $\sigma^2(u) = \mathrm{Var}(\langle X,u\rangle)$ and $C$ is a constant. To achieve this, we require only slightly more than the existence of the covariance matrix, in the form of a certain moment-equivalence assumption. The proof relies on novel bounds for the ratio of empirical and true probabilities that hold uniformly over certain classes of random variables.
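
For intuition about the quantity being bounded, the sketch below evaluates the directional error $\langle \widehat{\mu}_N - \mu, u\rangle$ and the scale $\sigma(u)/\sqrt{N}$ for the plain empirical mean on synthetic data; the paper's estimator is different and is not reproduced here, and the data distribution is an arbitrary illustrative choice.

```python
# Numerical illustration of the directional error <mu_hat_N - mu, u> for the
# empirical mean (a baseline only), with sigma(u) computed per direction.
import numpy as np

rng = np.random.default_rng(0)
d, N = 10, 2000
mu = np.arange(d, dtype=float)                  # true mean (illustrative)
scales = np.linspace(0.1, 3.0, d)               # very different variances per coordinate
X = mu + scales * rng.standard_normal((N, d))   # i.i.d. sample
mu_hat = X.mean(axis=0)                         # empirical mean

for _ in range(3):
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)                      # random unit direction
    sigma_u = np.sqrt(u @ np.diag(scales**2) @ u)   # sigma(u) = sqrt(Var(<X, u>))
    print(f"<mu_hat - mu, u> = {float((mu_hat - mu) @ u):+.4f}, "
          f"sigma(u)/sqrt(N) = {sigma_u / np.sqrt(N):.4f}")
```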


Mean estimation and regression under heavy-tailed distributions--a survey

arXiv.org Machine Learning

Arguably the most fundamental problem of statistics is that of estimating the expected value µ of a random variable X based on a sample of n independent, identically distributed draws from the distribution of X. The obvious choice of an estimator is, of course, the empirical mean. Its properties are well understood by classical results of probability theory. However, from the early days on, statisticians have been concerned about the quality of the empirical mean, especially when the distribution may be heavy-tailed or outliers may be present in the data. This concern gave rise to the area of robust statistics that addresses the problem of mean estimation (and other statistical problems) for such data.
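
One classical robust alternative to the empirical mean studied in this literature is the median-of-means estimator: split the sample into blocks, average within each block, and return the median of the block means. A minimal univariate sketch, with an arbitrary heavy-tailed test distribution:

```python
# Median-of-means: median of the block means of k roughly equal blocks.
import numpy as np

def median_of_means(x: np.ndarray, k: int) -> float:
    """Return the median of the means of k roughly equal blocks of x."""
    blocks = np.array_split(x, k)
    return float(np.median([b.mean() for b in blocks]))

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = rng.standard_t(df=2.5, size=10_000)      # heavy-tailed sample with true mean 0
    print("empirical mean: ", x.mean())
    print("median of means:", median_of_means(x, k=30))
```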


Multiplayer bandits without observing collision information

arXiv.org Machine Learning

We study multiplayer stochastic multi-armed bandit problems in which the players cannot communicate, and if two or more players pull the same arm, a collision occurs and the involved players receive zero reward. We consider two feedback models: one in which the players can observe whether a collision has occurred, and a more difficult setup in which no collision information is available. We give the first theoretical guarantees for the second model: an algorithm with logarithmic regret, and an algorithm with a square-root-type regret bound that does not depend on the gaps between the means. For the first model, we give the first square-root regret bounds that do not depend on the gaps. Building on these ideas, we also give an algorithm for reaching approximate Nash equilibria quickly in stochastic anti-coordination games.
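
A minimal sketch of the environment and the two feedback models described above (the paper's algorithms are not reproduced here):

```python
# One round of the multiplayer bandit environment: each player pulls an arm;
# colliding players receive zero reward. With observe_collisions=True a player
# also sees whether it collided; otherwise it only sees its reward.
import numpy as np

def play_round(pulls: list[int], means: np.ndarray, rng, observe_collisions: bool):
    """Return per-player (reward, collision flag or None) for one round."""
    counts = np.bincount(pulls, minlength=len(means))
    feedback = []
    for arm in pulls:
        collided = counts[arm] >= 2
        reward = 0.0 if collided else float(rng.random() < means[arm])
        feedback.append((reward, bool(collided) if observe_collisions else None))
    return feedback

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    means = np.array([0.9, 0.8, 0.5, 0.3])
    pulls = [0, 0, 2]                        # players 1 and 2 collide on arm 0
    print(play_round(pulls, means, rng, observe_collisions=False))
```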


Boltzmann Exploration Done Right

Neural Information Processing Systems

Boltzmann exploration is a classic strategy for sequential decision-making under uncertainty, and is one of the most standard tools in Reinforcement Learning (RL). Despite its widespread use, there is virtually no theoretical understanding about the limitations or the actual benefits of this exploration scheme. Does it drive exploration in a meaningful way? Is it prone to misidentifying the optimal actions or spending too much time exploring the suboptimal ones? What is the right tuning for the learning rate? In this paper, we address several of these questions for the classic setup of stochastic multi-armed bandits. One of our main results is showing that the Boltzmann exploration strategy with any monotone learning-rate sequence will induce suboptimal behavior. As a remedy, we offer a simple non-monotone schedule that guarantees near-optimal performance, albeit only when given prior access to key problem parameters that are typically not available in practical situations (like the time horizon $T$ and the suboptimality gap $\Delta$). More importantly, we propose a novel variant that uses different learning rates for different arms, and achieves a distribution-dependent regret bound of order $\frac{K\log^2 T}{\Delta}$ and a distribution-independent bound of order $\sqrt{KT}\log K$ without requiring such prior knowledge. To demonstrate the flexibility of our technique, we also propose a variant that guarantees the same performance bounds even if the rewards are heavy-tailed.
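
For reference, a minimal sketch of Boltzmann (softmax) exploration in a stochastic bandit: the arm is drawn with probability proportional to exp(learning rate times empirical mean). The monotone schedule used below is only a placeholder; choosing such schedules, and their pitfalls, is precisely what the paper analyzes.

```python
# Softmax exploration over empirical means with a placeholder learning-rate
# schedule; returns the pseudo-regret against the best arm.
import numpy as np

def boltzmann_bandit(means: np.ndarray, T: int, seed: int = 0) -> float:
    """Run Boltzmann exploration for T rounds on Bernoulli arms with the given means."""
    rng = np.random.default_rng(seed)
    K = len(means)
    counts = np.ones(K)                              # pull each arm once to initialize
    sums = (rng.random(K) < means).astype(float)     # one Bernoulli reward per arm
    regret = float(K * means.max() - means.sum())
    for t in range(K + 1, T + 1):
        eta = np.log(t)                              # placeholder monotone learning rate
        logits = eta * (sums / counts)
        p = np.exp(logits - logits.max())
        p /= p.sum()                                 # softmax over empirical means
        arm = rng.choice(K, p=p)
        sums[arm] += float(rng.random() < means[arm])
        counts[arm] += 1.0
        regret += means.max() - means[arm]
    return regret

if __name__ == "__main__":
    print("pseudo-regret:", boltzmann_bandit(np.array([0.5, 0.45, 0.4]), T=20_000))
```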


Mirror Descent Meets Fixed Share (and feels no regret)

Neural Information Processing Systems

Mirror descent with an entropic regularizer is known to achieve shifting regret bounds that are logarithmic in the dimension. This is done using either a carefully designed projection or by a weight sharing technique. Via a novel unified analysis, we show that these two approaches deliver essentially equivalent bounds on a notion of regret generalizing shifting, adaptive, discounted, and other related regrets. Our analysis also captures and extends the generalized weight sharing technique of Bousquet and Warmuth, and can be refined in several ways, including improvements for small losses and adaptive tuning of parameters.
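
A minimal sketch of the Fixed Share update discussed here: an exponential-weights (entropically regularized mirror descent) step on the loss, followed by mixing a small fraction of the weight back to the uniform distribution, which is what yields shifting ("tracking") regret bounds. The step size and mixing rate below are arbitrary illustrative values.

```python
# One Fixed Share round: exponential-weights update, then uniform weight sharing.
import numpy as np

def fixed_share_step(w: np.ndarray, loss: np.ndarray, eta: float, alpha: float) -> np.ndarray:
    """Return the updated weight vector after observing `loss`."""
    v = w * np.exp(-eta * loss)
    v /= v.sum()                               # exponential-weights step
    return (1 - alpha) * v + alpha / len(v)    # fixed-share mixing toward uniform

if __name__ == "__main__":
    d = 4
    w = np.full(d, 1 / d)
    rng = np.random.default_rng(0)
    for _ in range(5):
        w = fixed_share_step(w, rng.random(d), eta=0.5, alpha=0.01)
    print(w, w.sum())
```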



Minimax Policies for Combinatorial Prediction Games

arXiv.org Machine Learning

We address the online linear optimization problem when the actions of the forecaster are represented by binary vectors. Our goal is to understand the magnitude of the minimax regret for the worst possible set of actions. We study the problem under three different feedback assumptions: full information, and the partial-information models of the so-called "semi-bandit" and "bandit" problems. We consider both $L_\infty$- and $L_2$-type restrictions on the losses assigned by the adversary. We formulate a general strategy using Bregman projections on top of a potential-based gradient descent, which generalizes the ones studied in the series of papers Gyorgy et al. (2007), Dani et al. (2008), Abernethy et al. (2008), Cesa-Bianchi and Lugosi (2009), Helmbold and Warmuth (2009), Koolen et al. (2010), Uchiya et al. (2010), Kale et al. (2010) and Audibert and Bubeck (2010). We provide simple proofs that recover most of the previous results. We propose new upper bounds for the semi-bandit game. Moreover, we derive lower bounds for all three feedback assumptions. With the exception of the bandit game, the upper and lower bounds are tight, up to a constant factor. Finally, we answer a question asked by Koolen et al. (2010) by showing that the exponentially weighted average forecaster is suboptimal against $L_{\infty}$ adversaries.
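
A minimal sketch of the exponentially weighted average forecaster mentioned above, over a small, explicitly enumerated combinatorial action set under full information; the action set (all binary vectors with exactly m ones), the losses, and the learning rate are illustrative choices only.

```python
# Exponential weights over a combinatorial action set: an action's loss in each
# round is the inner product of its binary vector with the adversary's loss vector.
import itertools
import numpy as np

def exp_weights_combinatorial(losses: np.ndarray, actions: np.ndarray, eta: float):
    """Yield the forecaster's distribution over `actions` before each round."""
    cum = np.zeros(len(actions))
    for loss in losses:                        # one adversarial loss vector per round
        w = np.exp(-eta * cum)
        yield w / w.sum()
        cum += actions @ loss                  # accumulate each action's loss

if __name__ == "__main__":
    d, m = 5, 2
    actions = np.array([v for v in itertools.product([0, 1], repeat=d) if sum(v) == m])
    rng = np.random.default_rng(0)
    losses = rng.random((10, d))               # L_infinity-bounded losses in [0, 1]^d
    dists = list(exp_weights_combinatorial(losses, actions, eta=0.3))
    print("distribution before the last round:\n", np.round(dists[-1], 3))
```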


Online Multi-task Learning with Hard Constraints

arXiv.org Machine Learning

We discuss multi-task online learning when a decision maker has to deal simultaneously with $M$ tasks. The tasks are related, which is modeled by imposing that the $M$-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing the problem to an online shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.