Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations
Neural Information Processing Systems
Prior work on learning networks with ReLU activations assumes that the bias b is zero. To handle nonzero bias terms, our proposed algorithm robustly decomposes multiple higher-order tensors arising from the Hermite expansion of the function f(x). Using these ideas, we also establish identifiability of the network parameters under minimal assumptions.
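As a hedged illustration of the Hermite-expansion idea (a minimal sketch, not the paper's algorithm), the snippet below estimates the order-2 Hermite moment E[f(x)(xx^T - I)] for a planted depth-2 ReLU network f(x) = sum_i a_i relu(<w_i, x> + b_i) with Gaussian inputs, and checks it against the analytic form sum_i a_i c2(b_i) w_i w_i^T that holds for unit-norm w_i, where c2(b) = E[relu(g + b)(g^2 - 1)] over g ~ N(0, 1). The dimensions, weights, and biases are illustrative assumptions.

```python
# Sketch under assumptions: Gaussian inputs x ~ N(0, I), unit-norm weight
# rows w_i, and a planted network chosen arbitrarily for illustration.
import numpy as np

rng = np.random.default_rng(0)
d, k, n = 10, 2, 400_000

# Hypothetical planted depth-2 ReLU network with nonzero biases.
W = rng.standard_normal((k, d))
W /= np.linalg.norm(W, axis=1, keepdims=True)   # unit-norm rows
a = np.array([1.0, -0.5])
b = np.array([0.3, -0.2])

X = rng.standard_normal((n, d))                       # x ~ N(0, I_d)
f = (np.maximum(X @ W.T + b, 0.0) * a).sum(axis=1)    # network outputs

# Empirical order-2 Hermite moment: E[f(x) x x^T] - E[f(x)] I.
M2 = (X.T * f) @ X / n - f.mean() * np.eye(d)

# Analytic counterpart: c2(b_i) via 1-D Gauss-Hermite quadrature
# (probabilists' convention), then sum_i a_i c2(b_i) w_i w_i^T.
g, wts = np.polynomial.hermite_e.hermegauss(80)
wts = wts / wts.sum()                                 # normalize to N(0, 1)
c2 = np.array([np.sum(wts * np.maximum(g + bi, 0.0) * (g**2 - 1)) for bi in b])
M2_true = (W.T * (a * c2)) @ W

print("relative error:", np.linalg.norm(M2 - M2_true) / np.linalg.norm(M2_true))
```

The bias enters only through the scalar coefficients c2(b_i), which is why, at a high level, decomposing such Hermite moments can expose the directions w_i even when b is nonzero; the paper's actual guarantees rely on higher-order tensors and a robust decomposition step not shown here.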