Jorge, Emilio
Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret
Jorge, Emilio, Dimitrakakis, Christos, Basu, Debabrota
In Reinforcement Learning (RL) theory, we impose restrictive assumptions to design an algorithm with provably sublinear regret. Common assumptions, such as linear or RKHS models and Gaussian or log-concave posteriors over the models, do not explain the practical success of RL across a wider range of distributions and models. Thus, we study how to design RL algorithms with sublinear regret for isoperimetric distributions, specifically those satisfying the Log-Sobolev Inequality (LSI). LSI distributions include the standard setups of RL and others, such as many non-log-concave and perturbed distributions. First, we show that Posterior Sampling-based RL (PSRL) yields sublinear regret if the data distributions satisfy LSI, under some mild additional assumptions. Further, when we cannot compute or sample from an exact posterior, we propose a Langevin-sampling-based algorithm design, LaPSRL. We show that LaPSRL achieves order-optimal regret and subquadratic complexity per episode. Finally, we deploy LaPSRL with a Langevin sampler, SARAH-LD, and test it in different bandit and MDP environments. Experimental results validate the generality of LaPSRL across environments and its competitive performance with respect to the baselines.
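The core recipe is straightforward to prototype: keep the posterior-sampling loop but replace the exact posterior draw with a few steps of Langevin dynamics on the log-posterior. The sketch below is an illustrative toy for a Gaussian bandit, not the paper's LaPSRL or its SARAH-LD sampler; the environment, step sizes, and iteration counts are assumptions.

```python
# Thompson sampling where each posterior draw comes from unadjusted Langevin
# dynamics on the log-posterior rather than an exact (here conjugate) sampler.
import numpy as np

rng = np.random.default_rng(0)

K, sigma, T = 5, 1.0, 1000            # arms, reward noise std, horizon
true_means = rng.normal(0.0, 1.0, K)
counts = np.zeros(K)                  # number of pulls per arm
sums = np.zeros(K)                    # sum of rewards per arm

def langevin_sample(n, s, n_steps=100):
    """Approximate draw from one arm's posterior via unadjusted Langevin dynamics.

    Prior N(0, 1), likelihood N(theta, sigma^2); the negative log-posterior is
    U(theta) = theta^2 / 2 + (n * theta^2 - 2 * s * theta) / (2 * sigma^2) + const,
    so grad U(theta) = theta + (n * theta - s) / sigma^2.
    """
    curvature = 1.0 + n / sigma**2
    step = 0.5 / curvature            # keep the discretisation stable
    theta = rng.normal()              # initialise from the prior
    for _ in range(n_steps):
        grad = theta + (n * theta - s) / sigma**2
        theta += -step * grad + np.sqrt(2.0 * step) * rng.normal()
    return theta

regret = 0.0
for t in range(T):
    samples = [langevin_sample(counts[a], sums[a]) for a in range(K)]
    a = int(np.argmax(samples))       # act greedily w.r.t. the sampled model
    r = true_means[a] + sigma * rng.normal()
    counts[a] += 1
    sums[a] += r
    regret += true_means.max() - true_means[a]

print(f"cumulative pseudo-regret after {T} rounds: {regret:.1f}")
```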
Minimax-Bayes Reinforcement Learning
Buening, Thomas Kleine, Dimitrakakis, Christos, Eriksson, Hannes, Grover, Divya, Jorge, Emilio
While the Bayesian decision-theoretic framework offers an elegant solution to the problem of decision making under uncertainty, one question is how to appropriately select the prior distribution. One idea is to employ a worst-case prior. However, this is not as easy to specify in sequential decision making as in simple statistical estimation problems. This paper studies (sometimes approximate) minimax-Bayes solutions for various reinforcement learning problems to gain insights into the properties of the corresponding priors and policies. We find that while the worst-case prior depends on the setting, the corresponding minimax policies are more robust than those that assume a standard (i.e. uniform) prior.
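As a concrete, if much simpler, illustration of the worst-case-prior idea, the sketch below computes an approximately least favourable prior for a finite decision problem by treating it as a zero-sum game between nature, which picks the prior, and an agent that best-responds. The loss matrix, step size, and iteration count are made up for illustration; the paper itself studies the harder sequential RL setting.

```python
# Approximate worst-case prior for a finite loss matrix via multiplicative
# weights (nature) against Bayes best responses (agent).
import numpy as np

# loss[theta, action]: loss of taking `action` when the true parameter is `theta`
loss = np.array([
    [0.0, 1.0, 0.6],
    [1.0, 0.0, 0.6],
    [0.7, 0.7, 0.2],
])
n_theta, n_actions = loss.shape

prior = np.full(n_theta, 1.0 / n_theta)      # nature's current prior
policy_counts = np.zeros(n_actions)          # running tally of agent best responses
eta = 0.1                                    # multiplicative-weights step size

for _ in range(5000):
    # Agent: Bayes-optimal (best-response) action under the current prior.
    bayes_losses = prior @ loss
    a = int(np.argmin(bayes_losses))
    policy_counts[a] += 1
    # Nature: exponentiated-gradient step towards priors with higher Bayes risk.
    prior *= np.exp(eta * loss[:, a])
    prior /= prior.sum()

minimax_policy = policy_counts / policy_counts.sum()
print("approximate worst-case prior:", np.round(prior, 3))
print("approximate minimax policy:  ", np.round(minimax_policy, 3))
print("Bayes risk of that prior:    ", round(float((prior @ loss).min()), 3))
```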
Spectral Analysis of Kernel and Neural Embeddings: Optimization and Generalization
Jorge, Emilio, Chehreghani, Morteza Haghir, Dubhashi, Devdatt
Kernel methods are one of the pillars of machine learning, as they give us a flexible framework to model complex functional relationships in a principled way and also come with well-established statistical properties and theoretical guarantees. The interplay of kernels and data labellings has been addressed before, for example in the work on kernel-target alignment (Cristianini et al., 2001). Recently, (Belkin et al., 2018) also make the case that progress on understanding deep learning is unlikely [...] by a spectral analysis of representations corresponding to kernel and neural embeddings. They showed that in a simple single layer network, the alignment of the labels to the eigenvectors of the corresponding Gram matrix determines both the convergence of the optimization during training as well as the generalization properties. We show quantitatively that kernel and neural representations improve both optimization and generalization.
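The spectral quantity at the heart of this analysis, how much of the label vector lies in the leading eigen-directions of the Gram matrix, is easy to compute. The sketch below does so for an assumed RBF kernel on synthetic two-class data; both choices are illustrative and not the paper's experimental setup.

```python
# Label-eigenvector alignment: fraction of the label vector's energy captured
# by the top eigenvectors of a kernel Gram matrix.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic two-class data in 2-D.
n = 200
X = np.vstack([rng.normal(-1.0, 0.7, (n // 2, 2)),
               rng.normal(+1.0, 0.7, (n // 2, 2))])
y = np.r_[np.full(n // 2, -1.0), np.full(n // 2, +1.0)]

# RBF (Gaussian) kernel Gram matrix K_ij = exp(-||x_i - x_j||^2 / (2 h^2)).
h = 1.0
sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
K = np.exp(-sq_dists / (2.0 * h**2))

# Eigendecomposition of the symmetric Gram matrix, eigenvalues in decreasing order.
eigvals, eigvecs = np.linalg.eigh(K)
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Cumulative fraction of y's energy along the leading eigenvectors.
proj = eigvecs.T @ y                      # coordinates of y in the eigenbasis
energy = np.cumsum(proj**2) / (y @ y)
for k in (1, 5, 20):
    print(f"top-{k:2d} eigenvectors capture {energy[k - 1]:.2%} of the labels")
```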
Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence
Jorge, Emilio, Kågebäck, Mikael, Johansson, Fredrik D., Gustavsson, Emil
Acquiring your first language is an incredible feat and not easily duplicated. Learning to communicate using nothing but a few pictureless books, a corpus, would likely be impossible even for humans. Nevertheless, this is the dominant approach in most natural language processing today. As an alternative, we propose the use of situated interactions between agents as a driving force for communication, and the framework of Deep Recurrent Q-Networks for evolving a shared language grounded in the provided environment. We task the agents with interactive image search in the form of the game Guess Who?. The images from the game provide a non-trivial environment for the agents to discuss and a natural grounding for the concepts they decide to encode in their communication. Our experiments show that the agents learn not only to encode physical concepts in their words, i.e. grounding, but also to hold a multi-step dialogue, remembering the state of the dialogue from step to step.
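For readers unfamiliar with the agent architecture, the sketch below shows a minimal recurrent Q-network of the kind referred to above: an LSTM carries the dialogue state across steps and a linear head scores a discrete vocabulary of messages. The layer sizes, feature encoding, and vocabulary are assumptions, not the paper's actual model.

```python
# Minimal DRQN-style agent: an LSTM over (observation, last message) pairs
# produces Q-values over a discrete vocabulary of messages/actions.
import torch
import torch.nn as nn

class RecurrentQNet(nn.Module):
    def __init__(self, obs_dim: int, vocab_size: int, hidden_dim: int = 128):
        super().__init__()
        self.encoder = nn.Linear(obs_dim + vocab_size, hidden_dim)  # image features + one-hot last message
        self.lstm = nn.LSTM(hidden_dim, hidden_dim, batch_first=True)
        self.q_head = nn.Linear(hidden_dim, vocab_size)             # one Q-value per word/action

    def forward(self, obs, last_msg_onehot, hidden=None):
        # obs: (batch, steps, obs_dim); last_msg_onehot: (batch, steps, vocab_size)
        x = torch.relu(self.encoder(torch.cat([obs, last_msg_onehot], dim=-1)))
        out, hidden = self.lstm(x, hidden)   # `hidden` carries the dialogue state between steps
        return self.q_head(out), hidden

# Usage: greedy message selection for a single dialogue step.
net = RecurrentQNet(obs_dim=64, vocab_size=16)
obs = torch.randn(1, 1, 64)                  # stand-in for image features
msg = torch.zeros(1, 1, 16)                  # no message received yet
q_values, state = net(obs, msg)
action = q_values[:, -1].argmax(dim=-1)      # word/guess with the highest Q-value
```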