AITopics | Lattimore, Tor

Plotting

Lattimore, Tor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Bounded Regret for Finite-Armed Structured Bandits

Lattimore, Tor, Munos, Remi

Neural Information Processing SystemsDec-31-2014

We study a new type of K-armed bandit problem where the expected return of one arm may depend on the returns of other arms. We present a new algorithm for this general class of problems and show that under certain circumstances it is possible to achieve finite expected cumulative regret. We also give problem-dependent lower bounds on the cumulative regret showing that at least in special cases the new algorithm is nearly optimal.

artificial intelligence, big data, finite regret, (20 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta (0.14)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.51)

Add feedback

Concentration and Confidence for Discrete Bayesian Sequence Predictors

Lattimore, Tor, Hutter, Marcus, Sunehag, Peter

arXiv.org Machine LearningJun-29-2013

Bayesian sequence prediction is a simple technique for predicting future symbols sampled from an unknown measure on infinite sequences over a countable alphabet. While strong bounds on the expected cumulative error are known, there are only limited results on the distribution of this error. We prove tight high-probability bounds on the cumulative error, which is measured in terms of the Kullback-Leibler (KL) divergence. We also consider the problem of constructing upper confidence bounds on the KL and Hellinger errors similar to those constructed from Hoeffding-like bounds in the i.i.d. case. The new results are applied to show that Bayesian sequence prediction can be used in the Knows What It Knows (KWIK) framework with bounds that match the state-of-the-art.

artificial intelligence, ln 1, machine learning, (17 more...)

arXiv.org Machine Learning

1307.0127

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Time Consistent Discounting

Lattimore, Tor, Hutter, Marcus

arXiv.org Artificial IntelligenceJul-27-2011

A possibly immortal agent tries to maximise its summed discounted rewards over time, where discounting is used to avoid infinite utilities and encourage the agent to value current rewards more than future ones. Some commonly used discount functions lead to time-inconsistent behavior where the agent changes its plan over time. These inconsistencies can lead to very poor behavior. We generalise the usual discounted utility model to one where the discount function changes with the age of the agent. We then give a simple characterisation of time-(in)consistent discount functions and show the existence of a rational policy for an agent that knows its discount function is time-inconsistent.

artificial intelligence, discount matrix, game theory, (15 more...)

arXiv.org Artificial Intelligence

1107.5528

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Game Theory (0.94)

Add feedback

Asymptotically Optimal Agents

Lattimore, Tor, Hutter, Marcus

arXiv.org Artificial IntelligenceJul-27-2011

Artificial general intelligence aims to create agents capable of learning to solve arbitrary interesting problems. We define two versions of asymptotic optimality and prove that no agent can satisfy the strong version while in some cases, depending on discounting, there does exist a non-computable weak asymptotically optimal agent.

agent, artificial intelligence, discount function, (13 more...)

arXiv.org Artificial Intelligence

1107.5537

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback