
Adaptive Sampling for Continuous Group Equivariant Neural Networks

Inal, Berfin, Cesa, Gabriele

arXiv.org Artificial Intelligence

Steerable networks, which process data with intrinsic symmetries, often use Fourier-based nonlinearities that require sampling from the entire group, leading to a need for discretization in continuous groups. As the number of samples increases, both performance and equivariance improve, yet this also leads to higher computational costs. To address this, we introduce an adaptive sampling approach that dynamically adjusts the sampling process to the symmetries in the data, reducing the number of required group samples and lowering the computational demands. We explore various implementations and their effects on model performance, equivariance, and computational efficiency. Our findings demonstrate improved model performance and a marginal increase in memory efficiency.


Lie Neurons: Adjoint-Equivariant Neural Networks for Semisimple Lie Algebras

Lin, Tzu-Yuan, Zhu, Minghan, Ghaffari, Maani

arXiv.org Artificial Intelligence

This paper proposes an adjoint-equivariant neural network that takes Lie algebra data as input. Various types of equivariant neural networks have been proposed in the literature, which treat the input data as elements in a vector space carrying certain types of transformations. In comparison, we aim to process inputs that are transformations between vector spaces. The change of basis on transformation is described by conjugations, inducing the adjoint-equivariance relationship that our model is designed to capture. Leveraging the invariance property of the Killing form, the proposed network is a general framework that works for arbitrary semisimple Lie algebras. Our network possesses a simple structure that can be viewed as a Lie algebraic generalization of a multi-layer perceptron (MLP). This work extends the application of equivariant feature learning. Respecting the symmetry in data is essential for deep learning models to understand the underlying objects.


Specification-Driven Neural Network Reduction for Scalable Formal Verification

Ladner, Tobias, Althoff, Matthias

arXiv.org Artificial Intelligence

Formal verification of neural networks is essential before their deployment in safety-critical settings. However, existing methods for formally verifying neural networks are not yet scalable enough to handle practical problems that involve a large number of neurons. In this work, we propose a novel approach to address this challenge: a conservative neural network reduction approach that ensures that the verification of the reduced network implies the verification of the original network. Our approach constructs the reduction on-the-fly, while simultaneously verifying the original network and its specifications. The reduction merges all neurons of a nonlinear layer with similar outputs and is applicable to neural networks with any type of activation function, such as ReLU, sigmoid, and tanh. Our evaluation shows that our approach can reduce a network to less than 5% of its original number of neurons, reducing verification time by a similar degree.


Deep Learning Architectures

#artificialintelligence

There are several types of deep learning architectures, also known as artificial neural networks with multiple nonlinear layers. The characteristics of the input data and the objective of the research determine which deep learning architecture should be used and when. Deep Neural Network (DNN): the various deep learning architectures in this family are designed from the basic building blocks of neural networks. These include the Multilayer Perceptron (MLP), which uses perceptrons; the Stacked Auto-Encoder (SAE), which uses auto-encoders; and Deep Belief Networks (DBNs), which use Restricted Boltzmann Machines (RBMs). Convolutional Neural Network (CNN): CNN architectures consist of different layer types, such as convolution layers, nonlinear layers, and pooling layers.
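As a minimal sketch of the three CNN layer types named above (convolution, a nonlinearity, and pooling), the toy functions below apply them in sequence to a tiny 2D input. This is illustrative only, not code from any of the papers listed here; the input image and the 2x2 filter are made up for the example, and real frameworks implement these layers far more efficiently.

```python
def conv2d(image, kernel):
    """Valid 2D cross-correlation of `image` with `kernel` (the convolution layer)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + u][j + v] * kernel[u][v]
                for u in range(kh) for v in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]

def relu(fmap):
    """Element-wise ReLU (the nonlinear layer)."""
    return [[max(0, x) for x in row] for row in fmap]

def max_pool(fmap, size=2):
    """Non-overlapping max pooling with a `size` x `size` window (the pooling layer)."""
    return [
        [
            max(fmap[i + u][j + v] for u in range(size) for v in range(size))
            for j in range(0, len(fmap[0]) - size + 1, size)
        ]
        for i in range(0, len(fmap) - size + 1, size)
    ]

# Toy 5x5 input and a hypothetical 2x2 filter.
image = [
    [1, 2, 0, 1, 3],
    [0, 1, 2, 3, 1],
    [1, 0, 1, 2, 0],
    [2, 1, 0, 1, 1],
    [0, 2, 1, 0, 2],
]
edge_kernel = [[1, -1], [-1, 1]]

# Convolution -> nonlinearity -> pooling, the ordering described above.
features = max_pool(relu(conv2d(image, edge_kernel)))
print(features)  # -> [[3, 0], [3, 2]]
```

Stacking several such convolution/nonlinearity/pooling stages, followed by an MLP-style classifier head, yields the standard CNN layout.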


Adaptive deep density approximation for Fokker-Planck equations

Tang, Kejun, Wan, Xiaoliang, Liao, Qifeng

arXiv.org Machine Learning

In this paper we present a novel adaptive deep density approximation strategy based on KRnet (ADDA-KR) for solving the steady-state Fokker-Planck equation. It is known that this equation typically has high-dimensional spatial variables posed on unbounded domains, which limit the application of traditional grid-based numerical methods. With the Knothe-Rosenblatt rearrangement, our newly proposed flow-based generative model, called KRnet, provides a family of probability density functions to serve as effective solution candidates of the Fokker-Planck equation, which have weaker dependence on dimensionality than traditional computational approaches. To generate effective stochastic collocation points for training KRnet, we develop an adaptive sampling procedure, where samples are generated iteratively using KRnet at each iteration. In addition, we give a detailed discussion of KRnet and show that it can efficiently estimate general high-dimensional density functions. We present a general mathematical framework of ADDA-KR, validate its accuracy, and demonstrate its efficiency with numerical experiments.


Kernel-Based Smoothness Analysis of Residual Networks

Tirer, Tom, Bruna, Joan, Giryes, Raja

arXiv.org Machine Learning

A major factor in the success of deep neural networks is the use of sophisticated architectures rather than the classical multilayer perceptron (MLP). Residual networks (ResNets) stand out among these powerful modern architectures. Previous works focused on the optimization advantages of deep ResNets over deep MLPs. In this paper, we show another distinction between the two models, namely, a tendency of ResNets to promote smoother interpolations than MLPs. We analyze this phenomenon via the neural tangent kernel (NTK) approach. First, we compute the NTK for a considered ResNet model and prove its stability during gradient descent training. Then, we show by various evaluation methodologies that the NTK of ResNet, and its kernel regression results, are smoother than the ones of MLP. The better smoothness observed in our analysis may explain the better generalization ability of ResNets and the practice of moderately attenuating the residual blocks.