Supplemental: Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete

A $\exists\mathbb{R}$-Membership

Membership in $\exists\mathbb{R}$ is already proven by Abrahamsen, Kleist and Miltzow [NeurIPS 2021].

Neural Information Processing Systems

For each line ℓ ∈ L, the change of the gradient of f when crossing ℓ is constant along ℓ. Then there is a fully connected two-layer neural network with m hidden neurons computing f. To see that this observation is true, consider the following construction. Describing all gadgets purely by their data points is tedious and obscures the relatively simple geometry enforced by these data points. A weak data point relaxes a regular data point and prescribes only a lower bound on the value of the label.
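The construction behind this observation can be sketched concretely: dedicate one hidden ReLU neuron to each breakline, with the output weight and neuron direction chosen so that the gradient jump across that line is reproduced, and absorb the remaining slope into an affine part. The following is a minimal illustration of that idea; the target function, variable names, and the specific lines are my own example, not taken from the paper.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def two_layer_net(x, hidden, affine):
    """Evaluate f(x) = w0 @ x + b0 + sum_i c_i * relu(a_i @ x + b_i).

    One hidden neuron per breakline {x : a_i @ x + b_i = 0}; the gradient
    of f jumps by c_i * a_i when crossing that line, matching the
    "constant gradient change along each line" condition.
    """
    w0, b0 = affine
    return w0 @ x + b0 + sum(c * relu(a @ x + b) for a, b, c in hidden)

# Illustrative target: f(x, y) = |x| + max(0, y), which is piecewise
# linear with breaklines x = 0 and y = 0 (so m = 2 hidden neurons).
hidden = [
    (np.array([1.0, 0.0]), 0.0, 2.0),  # gradient jump (2, 0) across x = 0
    (np.array([0.0, 1.0]), 0.0, 1.0),  # gradient jump (0, 1) across y = 0
]
affine = (np.array([-1.0, 0.0]), 0.0)  # -x + 2*relu(x) = |x|
```

Checking a couple of points confirms the construction: at (3, 2) the network gives -3 + 2·3 + 2 = 5 = |3| + max(0, 2), and at (-3, -2) it gives 3 + 0 + 0 = 3.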


Training Neural Networks is NP-Hard in Fixed Dimension

Froese, Vincent, Hertrich, Christoph

arXiv.org Artificial Intelligence

We study the parameterized complexity of training two-layer neural networks with respect to the dimension of the input data and the number of hidden neurons, considering ReLU and linear threshold activation functions. Although the computational complexity of these problems has been studied numerous times in recent years, several questions remain open. We answer questions by Arora et al. [ICLR '18] and Khalife and Basu [IPCO '22] by showing that both problems are NP-hard for two dimensions, which excludes any polynomial-time algorithm for constant dimension. We also answer a question by Froese et al. [JAIR '22] by proving W[1]-hardness for four ReLUs (or two linear threshold neurons) with zero training error. Finally, in the ReLU case, we show fixed-parameter tractability for the combined parameter number of dimensions and number of ReLUs if the network is assumed to compute a convex map. Our results settle the complexity status regarding these parameters almost completely.
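To make the problem being studied concrete: the training (decision) problem asks whether weights exist whose training error is at most a given target, and while finding such weights is hard, evaluating the error of a candidate network is straightforward. Below is a minimal sketch of that evaluation for a two-layer ReLU network; the function name and shape conventions are my own, not from the paper.

```python
import numpy as np

def training_error(W, b, v, X, y):
    """Squared training error of the two-layer ReLU network
    x -> v @ relu(W @ x + b).

    Shapes (illustrative convention): W is (k, d) hidden weights,
    b is (k,) biases, v is (k,) output weights, X is (n, d) data
    points, y is (n,) labels.
    """
    hidden = np.maximum(X @ W.T + b, 0.0)  # (n, k) hidden activations
    return float(np.sum((hidden @ v - y) ** 2))

# A single ReLU in dimension d = 2 fitting three points exactly
# (zero training error, the regime of the W[1]-hardness result).
W = np.array([[1.0, 0.0]])
b = np.array([0.0])
v = np.array([1.0])
X = np.array([[1.0, 0.0], [2.0, 0.0], [-1.0, 0.0]])
y = np.array([1.0, 2.0, 0.0])
```

The hardness results concern the search over (W, b, v), parameterized here by the input dimension d and the number of hidden neurons k.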


Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete

Bertschinger, Daniel, Hertrich, Christoph, Jungeblut, Paul, Miltzow, Tillmann, Weber, Simon

arXiv.org Artificial Intelligence

We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully connected neural network to fit a given set of data points. This problem is known as empirical risk minimization in the machine learning community. We show that the problem is $\exists\mathbb{R}$-complete. This complexity class can be defined as the set of algorithmic problems that are polynomial-time equivalent to finding real roots of a polynomial with integer coefficients. Furthermore, we show that arbitrary algebraic numbers are required as weights to be able to train some instances to optimality, even if all data points are rational. Our results hold even if the following restrictions are all added simultaneously.

$\bullet$ There are exactly two output neurons.
$\bullet$ There are exactly two input neurons.
$\bullet$ The data has only 13 different labels.
$\bullet$ The number of hidden neurons is a constant fraction of the number of data points.
$\bullet$ The target training error is zero.
$\bullet$ The ReLU activation function is used.

This shows that even very simple networks are difficult to train. The result explains why typical methods for $\mathsf{NP}$-complete problems, like mixed-integer programming or SAT-solving, cannot train neural networks to global optimality, unless $\mathsf{NP}=\exists\mathbb{R}$. We strengthen a recent result by Abrahamsen, Kleist and Miltzow [NeurIPS 2021].
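For contrast with the global-optimality question above, empirical risk minimization is routinely attacked in practice with local-search heuristics such as gradient descent, which carry no optimality guarantee. The sketch below runs plain gradient descent on a tiny two-layer ReLU instance; all sizes, the learning rate, and the synthetic target are illustrative choices of mine, and the loop only demonstrates that the training error decreases, not that it reaches the global optimum.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy instance: n = 32 points in R^2 with scalar labels, generated by a
# continuous piecewise linear target so a zero-error fit exists.
X = rng.normal(size=(32, 2))
y = np.maximum(X[:, 0], 0.0) + 0.5 * X[:, 1]

k = 8                                    # hidden ReLUs
W = rng.normal(scale=0.5, size=(k, 2))   # hidden weights
b = np.zeros(k)                          # hidden biases
v = rng.normal(scale=0.5, size=k)        # output weights

def loss(W, b, v):
    return float(np.mean((np.maximum(X @ W.T + b, 0.0) @ v - y) ** 2))

loss0 = loss(W, b, v)
lr = 0.02
for _ in range(3000):
    z = X @ W.T + b                  # (n, k) pre-activations
    h = np.maximum(z, 0.0)           # ReLU activations
    err = h @ v - y                  # residuals
    # Backpropagation by hand (gradients scaled by a constant factor).
    dv = h.T @ err / len(X)
    dz = np.outer(err, v) * (z > 0.0)
    dW = dz.T @ X / len(X)
    db = dz.mean(axis=0)
    v -= lr * dv
    W -= lr * dW
    b -= lr * db

final_loss = loss(W, b, v)
```

The $\exists\mathbb{R}$-completeness result explains why no such heuristic, and no exact method based on $\mathsf{NP}$ machinery, can be expected to certify global optimality: optimal weights may even require irrational algebraic numbers.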