AITopics | state dimension

Details

Neural Information Processing SystemsApr-24-2026, 16:49:22 GMT

We formalize and generalize the notation of Section 4.1 and prove the results. For the remainder of this section, we fix a dimension D (e.g. The D-dimensional SSM will be a map from a function u: D! to y: D!. Definition 2 (Indexing notation). Given a subset I [D], let I denote its complement [D]\I. Given a 2 N1 and b 2 N2, let a b 2 N1 N2 be defined as (a b)n1,n2 = an1bn2.

artificial intelligence, machine learning, resolution, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

Appendix of Temporal Conditioning Spiking Latent Variable Models of the Neural Response to Natural Visual Scenes A Hidden State and Latent Space Experiments

Neural Information Processing SystemsApr-24-2026, 16:48:00 GMT

After completely excluding the temporal dimension from the model parameter space, we introduced the temporal conditioning operation to handle the temporal information. In particular, this operation enables memory-dependent processing as in biological coding circuits. Figure 6: Performances under di erent hidden state and latent space dimension settings on Movie 2 Retina 2 data. For hidden state experiments, the latent space dimension is set to 32. And for latent space experiments, the hidden state dimension is 64.

artificial intelligence, machine learning, teco-lvm model, (14 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Layer-Adaptive State Pruning for Deep State Space Models

Neural Information Processing SystemsMar-18-2026, 08:27:24 GMT

Due to the lack of state dimension optimization methods, deep state space models (SSMs) have sacrificed model capacity, training search space, or stability to alleviate computational costs caused by high state dimensions. In this work, we provide a structured pruning method for SSMs, Layer-Adaptive STate pruning (LAST), which reduces the state dimension of each layer in minimizing model-level output energy loss by extending modal truncation for a single system. LAST scores are evaluated using the $\mathcal{H}_{\infty}$ norms of subsystems and layer-wise energy normalization. The scores serve as global pruning criteria, enabling cross-layer comparison of states and layer-adaptive pruning.

artificial intelligence, name change, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

cf4356f994917177213c55ff438ddf71-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 00:21:29 GMT

change factor, experiment, half-cheetah, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

f89394c979b34a25cc4ff8e11234fbfb-Supplemental.pdf

Neural Information Processing SystemsFeb-12-2026, 00:08:21 GMT

ground truth, log likelihood, trajectory, (15 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

a862f5788fd09bb6843c694d8120d50c-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 05:31:27 GMT

dimension, implementation, transition, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

14730e0dd6ac1c4a5765310909fd51b1-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 06:56:14 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

13388efc819c09564c66ab2dc8463809-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 13:43:34 GMT

base resolution, experiment, resolution, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Action-Free Offline-to-Online RL via Discretised State Policies

Neggatu, Natinael Solomon, Houssineau, Jeremie, Montana, Giovanni

arXiv.org Machine LearningFeb-3-2026

Most existing offline RL methods presume the availability of action labels within the dataset, but in many practical scenarios, actions may be missing due to privacy, storage, or sensor limitations. We formalise the setting of action-free offline-to-online RL, where agents must learn from datasets consisting solely of $(s,r,s')$ tuples and later leverage this knowledge during online interaction. To address this challenge, we propose learning state policies that recommend desirable next-state transitions rather than actions. Our contributions are twofold. First, we introduce a simple yet novel state discretisation transformation and propose Offline State-Only DecQN (\algo), a value-based algorithm designed to pre-train state policies from action-free data. \algo{} integrates the transformation to scale efficiently to high-dimensional problems while avoiding instability and overfitting associated with continuous state prediction. Second, we propose a novel mechanism for guided online learning that leverages these pre-trained state policies to accelerate the learning of online agents. Together, these components establish a scalable and practical framework for leveraging action-free datasets to accelerate online RL. Empirical results across diverse benchmarks demonstrate that our approach improves convergence speed and asymptotic performance, while analyses reveal that discretisation and regularisation are critical to its effectiveness.

machine learning, reinforcement learning, state policy, (15 more...)

arXiv.org Machine Learning

2602.00629

Country: