AITopics | modern network


modern network


4588e674d3f0faf985047d4c3f13ed0d-Supplemental.pdf

Neural Information Processing Systems | Apr-25-2026, 16:05:31 GMT

artificial intelligence, latexit sha1, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)


Towards Deeper Deep Reinforcement Learning with Spectral Normalization

Neural Information Processing Systems | Apr-25-2026, 16:05:27 GMT

In computer vision and natural language processing, innovations in model architecture that increase model capacity have reliably translated into gains in performance. In stark contrast with this trend, state-of-the-art reinforcement learning (RL) algorithms often use small MLPs, and gains in performance typically originate from algorithmic innovations. It is natural to hypothesize that small datasets in RL necessitate simple models to avoid overfitting; however, this hypothesis is untested. In this paper we investigate how RL agents are affected by exchanging the small MLPs with larger modern networks with skip connections and normalization, focusing specifically on actor-critic algorithms. We empirically verify that naïvely adopting such architectures leads to instabilities and poor performance, likely contributing to the popularity of simple models in practice. However, we show that dataset size is not the limiting factor, and instead argue that instability from taking gradients through the critic is the culprit. We demonstrate that spectral normalization (SN) can mitigate this issue and enable stable training with large modern architectures. After smoothing with SN, larger models yield significant performance improvements -- suggesting that more "easy" gains may be had by focusing on model architectures in addition to algorithmic innovations.
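For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of spectral normalization applied to a single weight matrix, using power iteration to estimate the largest singular value. It is not the authors' implementation: the function names (`spectral_norm`, `spectrally_normalize`), the iteration count, and the fixed seed are illustrative assumptions, and the abstract does not say which layers are normalized.

```python
import math
import random


def matvec(W, v):
    """Return W @ v for a matrix stored as a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def matvec_t(W, u):
    """Return W^T @ u."""
    return [sum(W[i][j] * u[i] for i in range(len(W))) for j in range(len(W[0]))]


def normalize(x):
    """Scale a nonzero vector to unit Euclidean length."""
    norm = math.sqrt(sum(a * a for a in x))
    return [a / norm for a in x]


def spectral_norm(W, n_iters=100, seed=0):
    """Estimate the largest singular value of W by power iteration (n_iters >= 1)."""
    rng = random.Random(seed)  # fixed seed: an assumption, for reproducibility
    v = normalize([rng.gauss(0.0, 1.0) for _ in range(len(W[0]))])
    for _ in range(n_iters):
        u = normalize(matvec(W, v))    # approximate left singular vector
        v = normalize(matvec_t(W, u))  # approximate right singular vector
    # sigma is approximately u^T W v once u and v have converged
    return sum(a * b for a, b in zip(u, matvec(W, v)))


def spectrally_normalize(W, n_iters=100):
    """Divide W by its estimated spectral norm so the result has norm close to 1."""
    sigma = spectral_norm(W, n_iters)
    return [[w / sigma for w in row] for row in W]


W = [[3.0, 0.0], [0.0, 1.0]]
W_sn = spectrally_normalize(W)
```

Dividing by the largest singular value bounds how much the layer can stretch its input, which is the smoothing effect the abstract attributes to SN. Deep learning frameworks typically run only one or a few power-iteration steps per training update and reuse the vectors between updates; the large iteration count here just keeps the standalone example accurate.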

arxiv preprint arxiv, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)


0d1a9651497a38d8b1c3871c84528bd4-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-11-2026, 10:56:47 GMT

architecture, kernel, revision, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)


4588e674d3f0faf985047d4c3f13ed0d-Supplemental.pdf

Neural Information Processing Systems | Feb-8-2026, 10:35:46 GMT

latexit sha1, modern network, spectral normalization, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)


We thank the reviewers for their time and constructive feedback on the submission, which we will incorporate to improve our manuscript

Neural Information Processing Systems | Oct-2-2025, 01:30:57 GMT

We find that they are positive-definite as expected. [...] Supervised Differentiable Programming" by Chizat and Bach is an important contribution and we will absolutely [...] Sec 2.2 in V1, V2) are restricted to single-hidden-layer networks. Which factors drive these performance gaps is still an open research question. We will expand the discussion around this.

artificial intelligence, machine learning, time and constructive feedback, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)


Reviews: Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent

Neural Information Processing Systems | Jan-21-2025, 12:27:27 GMT

The paper was proofread, well-structured, and very clear. The experiments were clearly described in detail, and provided relevant results. Below we outline some detailed comments on the results. In particular, Chizat and Bach prove that the training of an NTK parameterized network is closely modeled by "lazy training" (their terminology for a linearized model). This paper is not referenced in the related work section.
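For context, the "linearized model" mentioned in the review is usually written as the first-order Taylor expansion of the network output around its initial parameters, with the associated tangent kernel defined from the same gradients. The notation below ($f$ for a scalar-output network, $\theta_0$ for the initial parameters, $\Theta_0$ for the kernel at initialization) is assumed here for illustration and is not taken from the review text.

```latex
f^{\mathrm{lin}}(x;\theta) = f(x;\theta_0) + \nabla_\theta f(x;\theta_0)^{\top}(\theta-\theta_0),
\qquad
\Theta_0(x,x') = \nabla_\theta f(x;\theta_0)^{\top}\,\nabla_\theta f(x';\theta_0)
```

Under this reading, "lazy training" describes the regime in which the parameters stay close enough to $\theta_0$ that the trained network and $f^{\mathrm{lin}}$ remain close.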

chizat and bach, modern network, wide neural network, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.40)


A path-norm toolkit for modern networks: consequences, promises and challenges

Gonon, Antoine, Brisebarre, Nicolas, Riccietti, Elisa, Gribonval, Rémi

arXiv.org Machine Learning | Nov-24-2023

This work introduces the first toolkit around path-norms that is fully able to encompass general DAG ReLU networks with biases, skip connections and any operation based on the extraction of order statistics: max pooling, GroupSort etc. This toolkit notably allows us to establish generalization bounds for modern neural networks that are not only the most widely applicable path-norm based ones, but also recover or beat the sharpest known bounds of this type. These extended path-norms further enjoy the usual benefits of path-norms: ease of computation, invariance under the symmetries of the network, and improved sharpness on feedforward networks compared to the product of operators' norms, another complexity measure most commonly used. The versatility of the toolkit and its ease of implementation allow us to challenge the concrete promises of path-norm-based generalization bounds, by numerically evaluating the sharpest known bounds for ResNets on ImageNet.
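As a concrete illustration of the quantity the abstract refers to, the sketch below computes the L1 path-norm in the simplest case only: a bias-free feedforward ReLU network given as a list of weight matrices. In that case the sum, over all input-to-output paths, of the product of absolute weights can be obtained with one forward pass of the all-ones vector through the absolute-valued weights. The paper's toolkit covers general DAGs, biases, skip connections, and pooling, none of which this sketch attempts; the function names and the example weights are hypothetical.

```python
def l1_path_norm(weights):
    """L1 path-norm of a bias-free feedforward network.

    weights[k][i][j] is the weight from unit j of layer k to unit i of layer k + 1.
    """
    acc = [1.0] * len(weights[0][0])  # one unit of "mass" per input coordinate
    for W in weights:
        # Propagate through |W|: acc[i] sums |weight products| over paths ending at unit i.
        acc = [sum(abs(w) * a for w, a in zip(row, acc)) for row in W]
    return sum(acc)


def rescale_hidden_neuron(weights, layer, neuron, lam):
    """Multiply a hidden neuron's incoming weights by lam > 0 and its outgoing weights by 1 / lam.

    For ReLU units this leaves the network function unchanged.
    """
    new = [[row[:] for row in W] for W in weights]
    new[layer][neuron] = [w * lam for w in new[layer][neuron]]
    for row in new[layer + 1]:
        row[neuron] /= lam
    return new


# Hypothetical 2-2-1 network.
net = [
    [[1.0, -2.0], [3.0, 4.0]],  # first layer: 2 inputs -> 2 hidden units
    [[0.5, -1.0]],              # second layer: 2 hidden units -> 1 output
]
rescaled = rescale_hidden_neuron(net, layer=0, neuron=0, lam=4.0)

original_norm = l1_path_norm(net)
rescaled_norm = l1_path_norm(rescaled)
```

Both values agree, which illustrates the invariance under network symmetries mentioned in the abstract: rescaling changes individual layer norms but not the product of weights along any path.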

artificial intelligence, machine learning, neuron, (19 more...)

arXiv.org Machine Learning

2310.01225

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(9 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)


Artificial intelligence and machine learning are necessary to run a modern network -- Kerravala

#artificialintelligence | Jan-6-2023, 15:40:56 GMT

Kerravala says AI/ML is the only path forward for a business that relies on its network to deliver customer and employee experiences, which is almost every company.

artificial intelligence and machine, europe government, machine learning, (3 more...)

#artificialintelligence

Country: Europe (0.22)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)


Why do Modern Networks Require AIOps?

#artificialintelligence | Aug-27-2022, 00:50:18 GMT

Over the past decade, network operations teams have had to deal with a number of issues in their networks--from increased complexity to more distributed environments. With AIOps, you can start optimizing your networks now and prepare for the future. AIOps lets you manage your network like never before. According to Gartner, AIOps combines big data and machine learning to automate IT operations processes such as event correlation, anomaly detection, and causality determination to name a few. It can be defined as the application of machine learning (ML) and data science to IT operations problems.

accurate information, aiop, modern network, (12 more...)

#artificialintelligence

Industry: Information Technology (0.31)

Technology: