Achieving the Tightest Relaxation of Sigmoids for Formal Verification

Chevalier, Samuel, Starkenburg, Duncan, Dvijotham, Krishnamurthy

Aug-21-2024–arXiv.org Artificial Intelligence

In the field of formal verification, Neural Networks (NNs) are typically reformulated into equivalent mathematical programs which are optimized over. To overcome the inherent non-convexity of these reformulations, convex relaxations of nonlinear activation functions are typically utilized. Common relaxations (i.e., static linear cuts) of "S-shaped" activation functions, however, can be overly loose, slowing down the overall verification process. In this paper, we derive tuneable hyperplanes which upper and lower bound the sigmoid activation function. When tuned in the dual space, these affine bounds smoothly rotate around the nonlinear manifold of the sigmoid activation function. This approach, termed $\alpha$-sig, allows us to tractably incorporate the tightest possible, element-wise convex relaxation of the sigmoid activation function into a formal verification framework. We embed these relaxations inside of large verification tasks and compare their performance to LiRPA and $\alpha$-CROWN, a state-of-the-art verification duo.

activation function, artificial intelligence, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Aug-21-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Vermont (0.04)
  - Hawaii (0.04)

Genre:
- Research Report (0.64)

Industry:
- Energy > Power Industry (0.93)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found