Goto

Collaborating Authors

 fliess


Avoidance of an unexpected obstacle without reinforcement learning: Why not using advanced control-theoretic tools?

Join, Cédric, Fliess, Michel

arXiv.org Artificial Intelligence

This communication on collision avoidance with unexpected obstacles is motivated by some critical appraisals on reinforcement learning (RL) which "requires ridiculously large numbers of trials to learn any new task" (Yann LeCun). We use the classic Dubins' car in order to replace RL with flatness-based control, combined with the HEOL feedback setting, and the latest model-free predictive control approach. The two approaches lead to convincing computer experiments where the results with the model-based one are only slightly better. They exhibit a satisfactory robustness with respect to randomly generated mismatches/disturbances, which become excellent in the model-free case. Those properties would have been perhaps difficult to obtain with today's popular machine learning techniques in AI. Finally, we should emphasize that our two methods require a low computational burden.


Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs

Join, Cédric, Delaleau, Emmanuel, Fliess, Michel

arXiv.org Artificial Intelligence

Model predictive control (MPC) is a popular control engineering practice, but requires a sound knowledge of the model. Model-free predictive control (MFPC), a burning issue today, also related to reinforcement learning (RL) in AI, is reformulated here via a linear differential equation with constant coefficients, thanks to a new perspective on optimal control combined with recent advances in the field of model-free control. It is replacing Dynamic Programming, the Hamilton-Jacobi-Bellman equation, and Pontryagin's Maximum Principle. The computing burden is low. The implementation is straightforward. Two nonlinear examples, a chemical reactor and a two tank system, are illustrating our approach. A comparison with the HEOL setting, where some expertise of the process model is needed, shows only a slight superiority of the later. A recent identification of the two tank system via a complex ANN architecture might indicate that a full modeling and the corresponding machine learning mechanism are not always necessary neither in control, nor, more generally, in AI.


Systematic and multifactor risk models revisited

Fliess, Michel, Join, Cédric

arXiv.org Machine Learning

Systematic, or market, risk is one of the most studied risk models not only in financial engineering, but also in actuarial sciences, in business and corporate management, and in several other domains. It is associated to the beta (β) coefficient, which is familiar in the investment industry since Sharpe's capital asset pricing model (CAPM) [30]. The pitfalls and shortcomings of β have been detailed by a number of excellent authors.