Great Minds Think Alike: The Universal Convergence Trend of Input Salience
–Neural Information Processing Systems
Stochastic training algorithms introduce uncertainty into optimized DNNs, so the optimized models form a specific distribution, and training a model can be viewed as drawing a random sample from that distribution of optimized models. In this work, we study the distribution of optimized DNNs as a family of functions via a pointwise approach. We focus on input saliency maps, since the input gradient field is central to a model's mathematical behavior. Our investigation of saliency maps reveals a counter-intuitive trend: two stochastically optimized models tend to resemble each other more closely as the capacity of either one increases. We therefore hypothesize several properties of these distributions, suggesting that (1) within the same model architecture (e.g., CNNs, ResNets), different family variants (e.g., of varying capacity) tend to align in the population mean directions of their input salience.
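The alignment of saliency directions the abstract describes can be illustrated with a minimal sketch (not the paper's code). It assumes a hypothetical linear scorer f(x) = w·x, whose input-gradient saliency map is simply w, so that agreement between two independently "trained" models reduces to the cosine similarity of their weight vectors, here drawn around a shared population mean direction to mimic sampling from a distribution of optima:

```python
import numpy as np

def saliency_linear(w, x):
    # For f(x) = w @ x the input gradient is constant: w itself.
    return w

def cosine(u, v):
    # Cosine similarity: the alignment measure between saliency directions.
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

rng = np.random.default_rng(0)
d = 64
# Two models sampled around a shared population mean direction, mimicking
# two independent runs of a stochastic optimizer (an illustrative assumption).
mean_dir = rng.normal(size=d)
w1 = mean_dir + 0.1 * rng.normal(size=d)
w2 = mean_dir + 0.1 * rng.normal(size=d)

x = rng.normal(size=d)
s1, s2 = saliency_linear(w1, x), saliency_linear(w2, x)
print(cosine(s1, s2))  # close to 1: the two saliency directions align
```

Shrinking the noise scale (the stand-in for increased capacity tightening the distribution) drives the cosine similarity toward 1, mirroring the hypothesized convergence of population mean salience directions.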
Mar-23-2025, 06:13:56 GMT