Model selection for stochastic dynamics: a parsimonious and principled approach
This thesis focuses on the discovery of stochastic differential equations (SDEs) and stochastic partial differential equations (SPDEs) from noisy and discrete time series. A major challenge is selecting the simplest possible correct model from vast libraries of candidate models, where standard information criteria (AIC, BIC) are often limited. We introduce PASTIS (Parsimonious Stochastic Inference), a new information criterion derived from extreme value theory. Its penalty term, $n_\mathcal{B} \ln(n_0/p)$, explicitly incorporates the size of the initial library of candidate parameters ($n_0$), the number of parameters in the considered model ($n_\mathcal{B}$), and a significance threshold ($p$). This significance threshold represents the probability of selecting a model containing more parameters than necessary when comparing many models. Benchmarks on various systems (Lorenz, Ornstein-Uhlenbeck, Lotka-Volterra for SDEs; Gray-Scott for SPDEs) demonstrate that PASTIS outperforms AIC, BIC, cross-validation (CV), and SINDy (a competing method) in terms of exact model identification and predictive capability. Furthermore, real-world data can be subject to large sampling intervals ($Δt$) or measurement noise ($σ$), which can impair model learning and selection capabilities. To address this, we have developed robust variants of PASTIS, PASTIS-$Δt$ and PASTIS-$σ$, thus extending the applicability of the approach to imperfect experimental data. PASTIS thus provides a statistically grounded, validated, and practical methodological framework for discovering simple models for processes with stochastic dynamics.
Jul-8-2025
- Country:
- Asia > Japan (0.04)
- Europe
- France
- Occitanie > Hérault
- Montpellier (0.04)
- Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
- Marseille (0.04)
- Occitanie > Hérault
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Latvia > Riga Municipality
- Riga (0.04)
- Portugal (0.04)
- Switzerland > Vaud
- Lausanne (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- France
- North America > United States
- New York > Monroe County > Rochester (0.04)
- Genre:
- Personal > Honors (0.45)
- Research Report > New Finding (0.67)
- Summary/Review (1.00)
- Industry:
- Banking & Finance (0.92)
- Energy (0.92)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Technology: