Goto

Collaborating Authors

 timestep


Consistency Regularised Gradient Flows for Inverse Problems

arXiv.org Machine Learning

Vision-Language Latent Diffusion Models (LDMs) (Rombach et al., 2022) provide powerful generative priors for inverse problems. However, existing LDM-based inverse solvers typically require a large number of neural function evaluations (NFEs) and backpropagation through large pretrained components, leading to substantial computational costs and, in some cases, degraded reconstruction quality. We propose a unified Euclidean-Wasserstein-2 gradient-flow framework that jointly performs posterior sampling and prompt optimization in the latent space through a single flow that aligns the prior and posterior with the observed data. Combined with few-step latent text-to-image models, this formulation enables low-NFE inference without backpropagation through autoencoders. Experiments across several canonical imaging inverse problems show that our method achieves state-of-the-art performance with significantly reduced computational cost.


Neural Data Transformer 2: Multi-context Pretraining for Neural Spiking Activity

Neural Information Processing Systems

The neural population spiking activity recorded by intracortical brain-computer interfaces (iBCIs) contain rich structure. Current models of such spiking activity are largely prepared for individual experimental contexts, restricting data volume to that collectable within a single session and limiting the effectiveness of deep neural networks (DNNs). The purported challenge in aggregating neural spiking data is the pervasiveness of context-dependent shifts in the neural data distributions. However, large scale unsupervised pretraining by nature spans heterogeneous data, and has proven to be a fundamental recipe for successful representation learning across deep learning. We thus develop Neural Data Transformer 2 (NDT2), a spatiotemporal Transformer for neural spiking activity, and demonstrate that pretraining can leverage motor BCI datasets that span sessions, subjects, and experimental tasks. NDT2 enables rapid adaptation to novel contexts in downstream decoding tasks and opens the path to deployment of pretrained DNNs for iBCI control.



PALMER: Perception-Action Loop with Memory for Long-Horizon Planning

Neural Information Processing Systems

To achieve autonomy in a priori unknown real-world scenarios, agents should be able to: i) act from high-dimensional sensory observations (e.g., images), ii) learn from past experience to adapt and improve, and iii) be capable of long horizon planning.


Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting

Neural Information Processing Systems

Diffusion models have achieved state-of-the-art performance in generative modeling tasks across various domains. Prior works on time series diffusion models have primarily focused on developing conditional models tailored to specific forecasting or imputation tasks. In this work, we explore the potential of taskagnostic, unconditional diffusion models for several time series applications. We propose TSDiff, an unconditionally-trained diffusion model for time series. Our proposed self-guidance mechanism enables conditioning TSDiff for downstream tasks during inference, without requiring auxiliary networks or altering the training procedure. We demonstrate the effectiveness of our method on three different time series tasks: forecasting, refinement, and synthetic data generation. First, we show that TSDiff is competitive with several task-specific conditional forecasting methods (predict). Second, we leverage the learned implicit probability density of TSDiff to iteratively refine the predictions of base forecasters with reduced computational overhead over reverse diffusion (refine). Notably, the generative performance of the model remains intact -- downstream forecasters trained on synthetic samples from TSDiff outperform forecasters that are trained on samples from other state-of-the-art generative time series models, occasionally even outperforming models trained on real data (synthesize).


5812f92450ccaf17275500841c70924a-Supplemental.pdf

Neural Information Processing Systems

We present a brief proof about the local optimality of one-hot encodings in the decision-theoretic framework presented in Section 3.2. We seek to prove that, under assumptions of an identity reward matrix, tokens constrained to a unit hypercube, and gaussian additive noise, one-hot tokens are an optimally robust communication strategy. We only seek to prove local optimality, as one many trivially generate multiple, equally optimal tokens by, for example, flipping all bits. The following derivation uses Karush-Kuhn-Tucker (KKT) conditions, a generalization of Lagrange multipliers [17]. We maximize the function, subject to constraints. T>j Ti Ti + ||Tj||2 Ti # ~ยตi + ~ฮปi = ~0 (13) (14) We seek to show that one-hot vectors are an optimum, so we now show that one-hot vectors indeed respect the constraints and set the derivatives to zero.


Controlled object Main model Outputfunk(hm) CB(hm) = hห†Lfunk(hs,ds) CF(hs) Inputhmhmhs, dshs

Neural Information Processing Systems

There are no explicit equations for the cerebellum traditionally also has access to a desired state ds (in particular, one can consider this a and forward DNI, respectively; L denotes the loss function. In addition, the inverse model of the of a motor area and sensory area, respectively; CB,CF denotes the computation of backward DNI Notation is largely consistent with section 2 of the main text: hm,hs denotes the hidden activity properties of the inverse model of the cerebellum can be set against those of forward DNI (red). Controller Neocortex Main model Cerebellum Synthesiser Forward Model Backward DNIInverse Model Forward DNI be summarised in table S1. In general, the likeness in formulation between DNI and the cerebellar internal model hypothesis can backward DNI where the main model is an motor-associated RNN. In fact, it was recently suggested that the cerebellum out that though the temporal case of forward DNI was not originally considered in [5], there remain learns to mimic the forward computations which then take place in the neocortex.


as decoupling neural interfaces Cortico-cerebellar networks

Neural Information Processing Systems

Overall, our work offers a novel perspective on the cerebellum as a brainneuronal observations while making several testable predictions across multiple mental observations. Moreover, our model also explains recent behavioural and learning while reducing ataxia-like behaviours, consistent with classical experishown to be cerebellar-dependent. In all tasks, we observe that ccRNNs facilitates and cognitive tasks (pattern recognition and caption generation) that have been network (ccRNN) model on a number of sensorimotor (line and digit drawing) tions from a cerebellar module. We test this cortico-cerebellar recurrent neural in which a recurrent cortical network receives online temporal feedback predicdemonstrate the potential of this framework we introduce a systems-level model lum, helps the cerebral cortex solve similar locking problems akin to DNIs.