Evolving Normalization-Activation Layers

May-27-2025, 07:06:56 GMT–Neural Information Processing Systems

Normalization layers and activation functions are fundamental components in deep networks and are typically placed next to each other. Here we propose to design them using an automated approach. Instead of designing them separately, we unify them into a single tensor-to-tensor computation graph, and evolve its structure starting from basic mathematical functions. Examples of such mathematical functions are addition, multiplication and statistical moments. The use of low-level mathematical functions, in contrast to the use of high-level modules in mainstream neural architecture search (NAS), leads to a highly sparse and large search space which can be challenging for search methods.
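To make the idea of a single tensor-to-tensor computation graph built from low-level functions more concrete, here is a minimal sketch in plain Python. Everything in it is an illustrative assumption rather than the authors' implementation: the primitive set, the graph encoding as a list of `(op, inputs)` pairs, the names `PRIMITIVES`, `ARITY`, `evaluate` and `mutate`, the hand-written `candidate` graph, and the use of flat lists of floats in place of real tensors.

```python
import math
import random

def _mean(x):
    return sum(x) / len(x)

# Hypothetical primitive set: elementwise ops plus statistical moments.
# Moments are broadcast back to the input length so every node maps
# tensors to a tensor of the same shape.
PRIMITIVES = {
    "add": lambda a, b: [u + v for u, v in zip(a, b)],
    "mul": lambda a, b: [u * v for u, v in zip(a, b)],
    "sigmoid": lambda a: [1.0 / (1.0 + math.exp(-u)) for u in a],
    # abs() keeps the sketch defined for arbitrary (mutated) graphs.
    "rsqrt": lambda a: [1.0 / math.sqrt(abs(u) + 1e-5) for u in a],
    "mean": lambda a: [_mean(a)] * len(a),
    "var": lambda a: [_mean([(u - _mean(a)) ** 2 for u in a])] * len(a),
}
ARITY = {"add": 2, "mul": 2, "sigmoid": 1, "rsqrt": 1, "mean": 1, "var": 1}

def evaluate(graph, x):
    """Evaluate a graph given as a list of (op, input_node_indices).

    Node 0 is the input tensor; each later node reads only earlier nodes.
    The value of the last node is the layer output.
    """
    values = [x]
    for op, inputs in graph:
        values.append(PRIMITIVES[op](*(values[i] for i in inputs)))
    return values[-1]

def mutate(graph, rng):
    """Return a copy with one randomly chosen node replaced by a random primitive."""
    child = list(graph)
    k = rng.randrange(len(child))
    op = rng.choice(sorted(ARITY))
    # List entry k is graph node k + 1, so it may read nodes 0..k.
    child[k] = (op, tuple(rng.randrange(k + 1) for _ in range(ARITY[op])))
    return child

# A hand-written candidate (not a layer reported in the source):
# x * sigmoid(x) / sqrt(var(x) + eps)
candidate = [
    ("sigmoid", (0,)),  # node 1
    ("mul", (0, 1)),    # node 2
    ("var", (0,)),      # node 3
    ("rsqrt", (3,)),    # node 4
    ("mul", (2, 4)),    # node 5 (output)
]

out = evaluate(candidate, [-1.0, 0.0, 1.0, 2.0])
child = mutate(candidate, random.Random(0))
child_out = evaluate(child, [-1.0, 0.0, 1.0, 2.0])
```

A search method would repeatedly apply something like `mutate` and keep the children that train well. Because most random rewirings of such low-level nodes produce poor layers, the sketch also hints at why the search space is described as sparse.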

artificial intelligence, evolving normalization-activation layer, machine learning

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.89)
  - Representation & Reasoning > Search (0.61)