Wasserstein Distributionally Robust Shallow Convex Neural Networks
Pallage, Julien, Lesage-Landry, Antoine
–arXiv.org Artificial Intelligence
In this work, we propose Wasserstein distributionally robust shallow convex neural networks (WaDiRo-SCNNs) to provide reliable nonlinear predictions when subject to adverse and corrupted datasets. Our approach is based on a new convex training program for ReLU shallow neural networks which allows us to cast the problem as an exact, tractable reformulation of its order-1 Wasserstein distributionally robust equivalent. Our training procedure is conservative by design, has low stochasticity, is solvable with open-source solvers, and is scalable to large industrial deployments. We provide out-of-sample performance guarantees and show that hard convex physical constraints can be enforced in the training program. WaDiRo-SCNN aims to make neural networks safer for critical applications, such as in the energy sector. Finally, we numerically demonstrate the performance of our model on a synthetic experiment and a real-world power system application, i.e., the prediction of nonresidential buildings' hourly energy consumption. The experimental results are convincing and showcase the strengths of the proposed model. Keywords: distributionally robust optimization, shallow convex neural networks, trustworthy machine learning, physics constrained, energy.
arXiv.org Artificial Intelligence
Jul-23-2024
- Country:
- Europe > United Kingdom
- England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- England
- North America
- Canada > Quebec
- Montreal (0.04)
- Trinidad and Tobago > Trinidad
- United States
- California > Orange County
- Irvine (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- California > Orange County
- Canada > Quebec
- Europe > United Kingdom
- Genre:
- Research Report (0.50)
- Industry:
- Education (1.00)
- Energy > Power Industry (1.00)
- Technology: