


Generalization in multitask deep neural classifiers: a statistical physics approach

Neural Information Processing Systems

We would first like to thank all three reviewers for their thorough, constructive, and considered reviews. As discussed in Appendix A, our model is a nonequilibrium variant of Derrida's Random Energy Model; we will update the final manuscript to describe this analogy more explicitly. As such, this is still a matter of active research. Conditions claimed in L181-184: we will amend the manuscript to indicate that the equation directly preceding eqn.
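For readers unfamiliar with the reference, the standard equilibrium Random Energy Model can be stated in a few lines. The sketch below is the textbook model of Derrida, included for orientation only; it is not the nonequilibrium variant developed in the paper. One draws $2^N$ configuration energies i.i.d. from a Gaussian,

$$P(E) = \frac{1}{\sqrt{\pi N J^2}}\, e^{-E^2/(N J^2)},$$

forms the partition function $Z = \sum_{i=1}^{2^N} e^{-\beta E_i}$, and finds a freezing transition at $T_c = J/(2\sqrt{\ln 2})$, below which the Gibbs measure condenses onto a few low-energy states.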





Reviews: Generalization in multitask deep neural classifiers: a statistical physics approach

Neural Information Processing Systems

The experiments on multitask learning are informative. I wish the experiments and theory were a bit more integrated; see my comments below for more details. The authors moved many details to the appendix while keeping the main conclusions in the main submission to ease understanding. Here are some examples: (a) L181-184: which equation shows that $(s_A - \tilde{s}_A)$ depends on the said four things; (b) L185-186: when labelled data is scarce, why is $\overline{s_A\,g(s_A)} - \widetilde{s_A\,g(s_A)}$ 0; (c) L189-190: why does $\overline{s_A\,g(s_A)} - \widetilde{s_A\,g(s_A)}$ tend to 0 when training data is abundant.


Reviews: Generalization in multitask deep neural classifiers: a statistical physics approach

Neural Information Processing Systems

This paper is a nice combination of theoretical understanding and simple experiments to verify it in the case of multitask learning in neural nets. Given that not much is known in this space, this work can be impactful. I suggest the authors add a few multitask experiments with real datasets to verify their understanding.




Generalization in multitask deep neural classifiers: a statistical physics approach

Ndirango, Anthony, Lee, Tyler

Neural Information Processing Systems

A proper understanding of the striking generalization abilities of deep neural networks presents an enduring puzzle. Recently, there has been a growing body of numerically-grounded theoretical work that has contributed important insights to the theory of learning in deep neural nets. There has also been a recent interest in extending these analyses to understanding how multitask learning can further improve the generalization capacity of deep neural nets. These studies deal almost exclusively with regression tasks which are amenable to existing analytical techniques. We develop an analytic theory of the nonlinear dynamics of generalization of deep neural networks trained to solve classification tasks using softmax outputs and cross-entropy loss, addressing both single task and multitask settings.
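The classification setting the abstract describes can be made concrete with a short sketch: a shared trunk feeding task-specific softmax heads, each trained with cross-entropy loss. The snippet below is a minimal illustration assuming PyTorch; the layer sizes, task count, and dummy data are placeholders, not the architecture or experiments from the paper.

# Minimal multitask classifier sketch: shared trunk, two softmax heads,
# cross-entropy loss per task. Assumes PyTorch; sizes and data are
# illustrative placeholders, not the paper's experimental setup.
import torch
import torch.nn as nn

class MultitaskClassifier(nn.Module):
    def __init__(self, in_dim=784, hidden=256, n_classes_a=10, n_classes_b=5):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())  # shared representation
        self.head_a = nn.Linear(hidden, n_classes_a)  # task-A softmax head
        self.head_b = nn.Linear(hidden, n_classes_b)  # task-B softmax head

    def forward(self, x):
        h = self.trunk(x)
        return self.head_a(h), self.head_b(h)

model = MultitaskClassifier()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()  # applies log-softmax internally

x = torch.randn(32, 784)           # dummy input batch
y_a = torch.randint(0, 10, (32,))  # dummy task-A labels
y_b = torch.randint(0, 5, (32,))   # dummy task-B labels

logits_a, logits_b = model(x)
loss = loss_fn(logits_a, y_a) + loss_fn(logits_b, y_b)  # summed task losses
optimizer.zero_grad()
loss.backward()
optimizer.step()

Summing the per-task cross-entropy losses through a shared trunk is the simplest form of the multitask coupling whose generalization dynamics the paper analyzes.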