Reviewers find our proposed method, claims, and empirical methodology to be correct (R1

Neural Information Processing Systems

We would like to thank the reviewers for their comments and positive outlook on the paper. We hope our response clarifies all concerns. We thank the reviewer for the valuable suggestion. As shown, our method outperforms previous work by a large margin while using fewer parameters; VB-Routing therefore still suffers from the efficiency drawbacks mentioned in Section 1.1.


Multi-Objective Optimization for Sparse Deep Multi-Task Learning

Hotegni, S. S., Berkemeier, M., Peitz, S.

arXiv.org Artificial Intelligence

Different conflicting optimization criteria arise naturally in various Deep Learning scenarios. These can address different main tasks (i.e., in the setting of Multi-Task Learning), but also main and secondary tasks such as loss minimization versus sparsity. The usual approach is a simple weighting of the criteria, which formally only works in the convex setting. In this paper, we present a Multi-Objective Optimization algorithm using a modified Weighted Chebyshev scalarization for training Deep Neural Networks (DNNs) with respect to several tasks. By employing this scalarization technique, the algorithm can identify all optimal solutions of the original problem while reducing its complexity to a sequence of single-objective problems. The simplified problems are then solved using an Augmented Lagrangian method, enabling the use of popular optimization techniques such as Adam and Stochastic Gradient Descent while effectively handling constraints. Our work aims to address the (economic as well as ecological) sustainability issue of DNN models, with a particular focus on Deep Multi-Task models, which are typically designed with a very large number of weights to perform equally well on multiple tasks. Through experiments conducted on two Machine Learning datasets, we demonstrate the possibility of adaptively sparsifying the model during training without significantly impacting its performance, provided we are willing to apply task-specific adaptations to the network weights. The code is available at https://github.com/salomonhotegni/MDMTN.
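
As a rough illustration of the pipeline the abstract describes, below is a minimal PyTorch sketch of an epigraph-reformulated Weighted Chebyshev scalarization solved with an Augmented Lagrangian and Adam. It is not the authors' MDMTN implementation: the toy objectives, the weights w, the reference point z, the penalty parameter rho, and the multiplier update schedule are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Toy setup: a single linear model with two conflicting objectives,
# a main task loss (f1) and an L1 sparsity measure of the weights (f2).
model = torch.nn.Linear(10, 1)
x, y = torch.randn(64, 10), torch.randn(64, 1)

def objectives():
    f1 = F.mse_loss(model(x), y)                          # main task loss
    f2 = sum(p.abs().mean() for p in model.parameters())  # sparsity objective
    return torch.stack([f1, f2])

w = torch.tensor([0.7, 0.3])   # Chebyshev preference weights (assumed)
z = torch.tensor([0.0, 0.0])   # reference (ideal) point (assumed)

# Epigraph reformulation of  min_theta max_i w_i (f_i - z_i):
#   min_{theta, t} t   subject to   g_i = w_i (f_i - z_i) - t <= 0.
t = torch.zeros((), requires_grad=True)
lam = torch.zeros(2)           # Lagrange multiplier estimates
rho = 10.0                     # fixed penalty parameter (a simplification)

opt = torch.optim.Adam(list(model.parameters()) + [t], lr=1e-2)

for step in range(1, 501):
    opt.zero_grad()
    g = w * (objectives() - z) - t
    # Augmented Lagrangian term for the inequality constraints g_i <= 0.
    loss = t + (rho / 2) * ((torch.clamp(lam / rho + g, min=0) ** 2
                             - (lam / rho) ** 2).sum())
    loss.backward()
    opt.step()
    if step % 50 == 0:         # periodic first-order multiplier update
        with torch.no_grad():
            lam = torch.clamp(lam + rho * g.detach(), min=0)
```

The fixed penalty and update frequency are placeholders; the point of the sketch is only how the min-max scalarization becomes a sequence of smooth single-objective problems that standard optimizers such as Adam can handle.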