AITopics | sparse structure

Collaborating Authors

sparse structure

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Integrating Bayesian and Discriminative Sparse Kernel Machines for Multi-class Active Learning

Weishi Shi, Qi Yu

Neural Information Processing Systems | Feb-13-2026, 21:01:52 GMT

Neural Information Processing Systems http://nips.cc/

data sample, decision boundary, learning, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

0dfd8a39e2a5dd536c185e19a804a73b-Paper.pdf

Neural Information Processing Systems | Feb-7-2026, 11:56:13 GMT

Despite tremendous success in many application scenarios, the training and inference costs of deep learning are rapidly increasing over time. The lottery ticket hypothesis (LTH) emerges as a promising framework to leverage a special sparse subnetwork (i.e., a winning ticket) instead of a full model for both training and inference, which can lower both costs without sacrificing performance.
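The abstract does not say how a winning ticket is found. As a minimal sketch only, assuming plain magnitude pruning (a common way to obtain a sparse subnetwork, not necessarily this paper's method), a binary mask over a flat weight list could look like this. The names `magnitude_mask` and `apply_mask` and the example weights are hypothetical.

```python
def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes out the smallest-magnitude weights.

    `sparsity` is the fraction of weights to remove (0.0 keeps all).
    Assumption: magnitude pruning stands in for the ticket search.
    """
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by absolute value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    return [0 if i in pruned else 1 for i in range(len(weights))]


def apply_mask(weights, mask):
    """Element-wise product: the masked weights form the sparse subnetwork."""
    return [w * m for w, m in zip(weights, mask)]


# Hypothetical weights, for illustration only.
weights = [0.8, -0.05, 0.3, -0.9, 0.01, 0.4]
mask = magnitude_mask(weights, sparsity=0.5)
sparse_weights = apply_mask(weights, mask)
```

With `sparsity=0.5`, the three smallest-magnitude entries are masked out and only the surviving weights take part in training and inference.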

artificial intelligence, machine learning, ticket, (15 more...)

Neural Information Processing Systems

Genre: Contests & Prizes (0.37)

Industry: Leisure & Entertainment (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Neural Information Processing Systems | Dec-24-2025, 21:47:20 GMT

Few-shot learning for neural networks (NNs) is an important problem that aims to train NNs with only a few data samples. The main challenge is avoiding overfitting, since over-parameterized NNs can easily overfit to such small datasets.

few-shot learning, meta-ticket, subnetwork, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Crisis-Resilient Portfolio Management via Graph-based Spatio-Temporal Learning

Li, Zan, Fan, Rui

arXiv.org Artificial Intelligence | Oct-27-2025

Financial time series forecasting faces a fundamental challenge: predicting optimal asset allocations requires understanding regime-dependent correlation structures that transform during crisis periods. Existing graph-based spatio-temporal learning approaches rely on predetermined graph topologies--correlation thresholds, sector classifications--that fail to adapt when market dynamics shift across different crisis mechanisms: credit contagion, pandemic shocks, or inflation-driven selloffs. We present CRISP (Crisis-Resilient Investment through Spatio-temporal Patterns), a graph-based spatio-temporal learning framework that encodes spatial relationships via Graph Convolutional Networks and temporal dynamics via BiLSTM with self-attention, then learns sparse structures through multi-head Graph Attention Networks. Unlike fixed-topology methods, CRISP discovers which asset relationships matter through attention mechanisms, filtering 92.5% of connections as noise while preserving crisis-relevant dependencies for accurate regime-specific predictions. Trained on 2005--2021 data encompassing credit and pandemic crises, CRISP demonstrates robust generalization to 2022--2024 inflation-driven markets--a fundamentally different regime--by accurately forecasting regime-appropriate correlation structures. This enables adaptive portfolio allocation that maintains profitability during downturns, achieving Sharpe ratio 3.76: 707% improvement over equal-weight baselines and 94% improvement over static graph methods. Learned attention weights provide interpretable regime detection, with defensive cluster attention strengthening 49% during crises versus 31% market-wide--emergent behavior from learning to forecast rather than imposing assumptions.
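To make the idea of filtering connections concrete, here is a minimal sketch that keeps only the highest-weight edges of a dense attention map. It is an illustration under stated assumptions, not CRISP itself: the abstract says the sparse structure is learned through multi-head Graph Attention Networks, whereas this sketch simply ranks fixed weights after the fact. The function `sparsify_edges`, the asset labels, and the weights are hypothetical.

```python
def sparsify_edges(attention, keep_fraction):
    """Keep only the highest-weight edges of a dense attention map.

    `attention` maps (source, target) pairs to attention weights.
    Assumption: a simple top-k cut replaces the learned sparsification.
    """
    n_keep = max(1, round(len(attention) * keep_fraction))
    ranked = sorted(attention.items(), key=lambda kv: kv[1], reverse=True)
    return dict(ranked[:n_keep])


# Hypothetical attention weights between four placeholder assets.
attention = {
    ("A", "B"): 0.42, ("A", "C"): 0.03, ("A", "D"): 0.01,
    ("B", "C"): 0.35, ("B", "D"): 0.02, ("C", "D"): 0.17,
}
kept = sparsify_edges(attention, keep_fraction=0.5)
filtered_share = 1 - len(kept) / len(attention)
```

Filtering 92.5% of connections, as reported in the abstract, would correspond to `keep_fraction=0.075` on a much larger graph.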

artificial intelligence, forecasting, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2510.20868

Genre: Research Report (0.40)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training

Neural Information Processing Systems | Aug-22-2025, 00:44:16 GMT

We show that both techniques (layer freezing and data sieving) can be incorporated into the sparse training algorithm to form a generic framework, which we dub SpFDE.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Integrating Bayesian and Discriminative Sparse Kernel Machines for Multi-class Active Learning

Weishi Shi, Qi Yu

Neural Information Processing Systems | Aug-20-2025, 00:45:58 GMT

We propose a novel active learning (AL) model that integrates Bayesian and discriminative kernel machines for fast and accurate multi-class data sampling.

data sample, decision boundary, learning, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks

Neural Information Processing Systems | Aug-17-2025, 08:06:26 GMT

The main challenge is avoiding overfitting, since over-parameterized NNs can easily overfit to such small datasets.

artificial intelligence, machine learning, meta-ticket, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry: Leisure & Entertainment (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization

Li, Guanchen, Xu, Yixing, Li, Zeping, Liu, Ji, Yin, Xuanwu, Li, Dong, Barsoum, Emad

arXiv.org Artificial Intelligence | Mar-17-2025

Structural pruning enhances hardware-agnostic inference efficiency for large language models (LLMs) but often struggles to maintain performance. Local pruning performs efficient layer-by-layer compression but ignores global topology. Global pruning has the potential to find the optimal solution but is resource-intensive. However, existing methods tend to rank structural saliency uniformly, ignoring inter-structure dependencies and failing to achieve end-to-end optimization. To address these limitations, we propose Týr-the-Pruner, an efficient end-to-end search-based global structural pruning framework. This framework constructs a supernet by repeatedly applying local pruning across a range of sparsity ratios to each layer in an LLM, with the core goal of determining the optimal sparsity distribution under a target overall sparsity ratio. Concretely, we introduce an effective local pruning and an expectation error accumulation approach to improve supernet construction. Furthermore, we employ an iterative prune-and-search strategy with coarse-to-fine sparsity granularity to ensure efficient search convergence. Experimental results show that Týr-the-Pruner achieves state-of-the-art structural pruning, retaining 97% of the dense model's performance while removing a challenging 50% of Llama-3.1-70B's parameters.
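The constraint behind a "sparsity distribution under a target overall sparsity ratio" can be sketched as a parameter-weighted average of per-layer sparsity ratios. This is only an illustration of the constraint, not the paper's search procedure. The helper names, the layer sizes, and the candidate distributions are hypothetical.

```python
def overall_sparsity(layer_params, layer_sparsity):
    """Parameter-weighted average of per-layer sparsity ratios."""
    removed = sum(n * s for n, s in zip(layer_params, layer_sparsity))
    return removed / sum(layer_params)


def meets_target(layer_params, layer_sparsity, target, tol=1e-9):
    """Check that a candidate distribution hits the overall target."""
    return abs(overall_sparsity(layer_params, layer_sparsity) - target) <= tol


# Hypothetical parameter counts for a four-layer model.
layer_params = [100, 100, 200, 400]
uniform = [0.5, 0.5, 0.5, 0.5]
# A non-uniform candidate: prune small layers less, the large layer more.
non_uniform = [0.25, 0.25, 0.5, 0.625]
```

Both candidates remove half of the parameters overall, which shows why a search over distributions is meaningful: many per-layer allocations satisfy the same global budget, and they can differ in how much accuracy they preserve.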

large language model, machine learning, pruning, (17 more...)

arXiv.org Artificial Intelligence

2503.09657

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
(10 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

On the Interplay Between Sparsity and Training in Deep Reinforcement Learning

Davelouis, Fatima, Martin, John D., Bowling, Michael

arXiv.org Artificial Intelligence | Feb-1-2025

We study the benefits of different sparse architectures for deep reinforcement learning. In particular, we focus on image-based domains where spatially-biased and fully-connected architectures are common. Using these and several other architectures of equal capacity, we show that sparse structure has a significant effect on learning performance. We also observe that choosing the best sparse architecture for a given domain depends on whether the hidden layer weights are fixed or learned.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2501.16729

Country:

North America > Canada > Alberta (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Italy > Campania > Naples (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
