AITopics | Totaro, Simone

Collaborating Authors

Totaro, Simone

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A functional mirror ascent view of policy gradient methods with function approximation

Vaswani, Sharan, Bachem, Olivier, Totaro, Simone, Mueller, Robert, Geist, Matthieu, Machado, Marlos C., Castro, Pablo Samuel, Roux, Nicolas Le

arXiv.org Artificial IntelligenceAug-12-2021

We use functional mirror ascent to propose a general framework (referred to as FMA-PG) for designing policy gradient methods. The functional perspective distinguishes between a policy's functional representation (what are its sufficient statistics) and its parameterization (how are these statistics represented) and naturally results in computationally efficient off-policy updates. For simple policy parameterizations, the FMA-PG framework ensures that the optimal policy is a fixed point of the updates. It also allows us to handle complex policy parameterizations (e.g., neural networks) while guaranteeing policy improvement. Our framework unifies several PG methods and opens the way for designing sample-efficient variants of existing methods. Moreover, it recovers important implementation heuristics (e.g., using forward vs reverse KL divergence) in a principled way. With a softmax functional representation, FMA-PG results in a variant of TRPO with additional desirable properties. It also suggests an improved variant of PPO, whose robustness and efficiency we empirically demonstrate on MuJoCo. Via experiments on simple reinforcement learning problems, we evaluate algorithms instantiated by FMA-PG.

fuzzy logic, optimization problem, parameterization, (19 more...)

arXiv.org Artificial Intelligence

2108.05828

Country:

North America > Canada > Alberta (0.14)
North America > Canada > Quebec > Montreal (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.40)

Add feedback

Hierarchical Representation Learning for Markov Decision Processes

Steccanella, Lorenzo, Totaro, Simone, Jonsson, Anders

arXiv.org Artificial IntelligenceJun-3-2021

In this paper we present a novel method for learning hierarchical representations of Markov decision processes. Our method works by partitioning the state space into subsets, and defines subtasks for performing transitions between the partitions. We formulate the problem of partitioning the state space as an optimization problem that can be solved using gradient descent given a set of sampled trajectories, making our method suitable for high-dimensional problems with large state spaces. We empirically validate the method, by showing that it can successfully learn a useful hierarchical representation in a navigation domain. Once learned, the hierarchical representation can be used to solve different tasks in the given domain, thus generalizing knowledge across tasks.

artificial intelligence, neural network, transition, (18 more...)

arXiv.org Artificial Intelligence

2106.01655

Country:

North America > Canada > Quebec (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)

Add feedback

Hierarchical reinforcement learning for efficient exploration and transfer

Steccanella, Lorenzo, Totaro, Simone, Allonsius, Damien, Jonsson, Anders

arXiv.org Artificial IntelligenceNov-12-2020

Sparse-reward domains are challenging for reinforcement learning algorithms since significant exploration is needed before encountering reward for the first time. Hierarchical reinforcement learning can facilitate exploration by reducing the number of decisions necessary before obtaining a reward. In this paper, we present a novel hierarchical reinforcement learning framework based on the compression of an invariant state space that is common to a range of tasks. The algorithm introduces subtasks which consist of moving between the state partitions induced by the compression. Results indicate that the algorithm can successfully solve complex sparse-reward domains, and transfer knowledge to solve new, previously unseen tasks more quickly.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2011.06335

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Lifelong Control of Off-grid Microgrid with Model Based Reinforcement Learning

Totaro, Simone, Boukas, Ioannis, Jonsson, Anders, Cornélusse, Bertrand

arXiv.org Artificial IntelligenceMay-16-2020

The lifelong control problem of an off-grid microgrid Centralized microgrid control is usually decomposed in four is composed of two tasks, namely estimation tasks: i) estimating the parameters of the microgrid devices of the condition of the microgrid devices and operational (for instance the charge efficiency of a battery storage device planning accounting for the uncertainties as a function of the state of charge and temperature, or the by forecasting the future consumption and actual capacity of a battery after a number of cycles), ii) the renewable production. The main challenge forecasting the consumption and the renewable production, for the effective control arises from the various iii) operational planning to anticipate weather effects and changes that take place over time. In this paper, human activities, and iv) real-time control to adapt planned we present an open-source reinforcement framework decisions to the current situation. These tasks are preformed for the modeling of an off-grid microgrid sequentially during the lifetime of a microgrid in order to for rural electrification. The lifelong control problem achieve near optimal operation and to maximize the benefits of an isolated microgrid is formulated as a arising from distributed generation.

microgrid, optimization problem, renewable energy, (20 more...)

arXiv.org Artificial Intelligence

2005.08006

Country: Europe > Spain (0.28)

Genre: Research Report (1.00)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

Scardapane, Simone, Van Vaerenbergh, Steven, Comminiello, Danilo, Totaro, Simone, Uncini, Aurelio

arXiv.org Machine LearningJul-11-2018

Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we design a more flexible architecture, with a small number of adaptable parameters, which is able to model a wider range of gating functions than the classical one. To this end, we replace the sigmoid function in the standard gate with a non-parametric formulation extending the recently proposed kernel activation function (KAF), with the addition of a residual skip-connection. A set of experiments on sequential variants of the MNIST dataset shows that the adoption of this novel gate allows to improve accuracy with a negligible cost in terms of computational power and with a large speed-up in the number of training iterations.

architecture, deep learning, neural network, (18 more...)

arXiv.org Machine Learning

1807.04065

Country:

Europe > Spain (0.14)
Europe > Italy (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Kafnets: kernel-based non-parametric activation functions for neural networks

Scardapane, Simone, Van Vaerenbergh, Steven, Totaro, Simone, Uncini, Aurelio

arXiv.org Machine LearningNov-23-2017

Neural networks are generally built by interleaving (adaptable) linear layers with (fixed) nonlinear activation functions. To increase their flexibility, several authors have proposed methods for adapting the activation functions themselves, endowing them with varying degrees of flexibility. None of these approaches, however, have gained wide acceptance in practice, and research in this topic remains open. In this paper, we introduce a novel family of flexible activation functions that are based on an inexpensive kernel expansion at every neuron. Leveraging over several properties of kernel-based models, we propose multiple variations for designing and initializing these kernel activation functions (KAFs), including a multidimensional scheme allowing to nonlinearly combine information from different paths in the network. The resulting KAFs can approximate any mapping defined over a subset of the real line, either convex or nonconvex. Furthermore, they are smooth over their entire domain, linear in their parameters, and they can be regularized using any known scheme, including the use of $\ell_1$ penalties to enforce sparseness. To the best of our knowledge, no other known model satisfies all these properties simultaneously. In addition, we provide a relatively complete overview on alternative techniques for adapting the activation functions, which is currently lacking in the literature. A large set of experiments validates our proposal.

activation function, deep learning, neural network, (17 more...)

arXiv.org Machine Learning

1707.04035

Country: Europe > Spain > Cantabria (0.14)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback