
Sequential Memory with Temporal Predictive Coding

Neural Information Processing Systems

Forming accurate memory of sequential stimuli is a fundamental function of biological agents. However, the computational mechanism underlying sequential memory in the brain remains unclear. Inspired by neuroscience theories and recent successes in applying predictive coding (PC) to static memory tasks, in this work we propose a novel PC-based model for sequential memory, called temporal predictive coding (tPC). We show that our tPC models can memorize and retrieve sequential inputs accurately with a biologically plausible neural implementation. Importantly, our analytical study reveals that tPC can be viewed as a classical Asymmetric Hopfield Network (AHN) with an implicit statistical whitening process, which leads to more stable performance in sequential memory tasks of structured inputs. Moreover, we find that tPC exhibits properties consistent with behavioral observations and theories in neuroscience, thereby strengthening its biological relevance. Our work establishes a possible computational mechanism underlying sequential memory in the brain that can also be theoretically interpreted using existing memory model frameworks.
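
A rough NumPy sketch of the core tPC idea, for intuition only: a weight matrix learns to predict the next pattern from the previous one, and the weights are updated to shrink the temporal prediction error. The dimensions, learning rate, and nonlinearity below are illustrative assumptions, not the paper's actual model.

    import numpy as np

    rng = np.random.default_rng(0)
    N, T, lr = 64, 20, 0.05                  # neurons, sequence length, learning rate (illustrative)
    seq = rng.standard_normal((T, N))        # toy sequence of patterns to memorize

    W = np.zeros((N, N))                     # recurrent weights predicting the next state
    for _ in range(200):                     # learning: reduce the temporal prediction error
        for t in range(1, T):
            err = seq[t] - W @ np.tanh(seq[t - 1])        # prediction error at step t
            W += lr * np.outer(err, np.tanh(seq[t - 1]))  # local, error-driven (Hebbian-like) update

    x = seq[0]                               # retrieval: cue with the first pattern
    recalled = [x]
    for t in range(1, T):
        x = W @ np.tanh(x)                   # roll the learned dynamics forward
        recalled.append(x)

Roughly speaking, in the linear case the least-squares solution for W is the AHN weight matrix multiplied by the inverse covariance of the stored patterns, which is the implicit statistical whitening the abstract refers to.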


From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction

Neural Information Processing Systems

Recently, deep feedforward neural networks have achieved considerable success in modeling biological sensory processing, in terms of reproducing the input-output map of sensory neurons. However, such models raise profound questions about the very nature of explanation in neuroscience. Are we simply replacing one complex system (a biological circuit) with another (a deep network), without understanding either? Moreover, beyond neural representations, are the deep network's computational mechanisms for generating neural responses the same as those in the brain? Without a systematic approach to extracting and understanding computational mechanisms from deep neural network models, it can be difficult both to assess the degree of utility of deep learning approaches in neuroscience, and to extract experimentally testable hypotheses from deep networks. We develop such a systematic approach by combining dimensionality reduction and modern attribution methods for determining the relative importance of interneurons for specific visual computations. We apply this approach to deep network models of the retina, revealing a conceptual understanding of how the retina acts as a predictive feature extractor that signals deviations from expectations for diverse spatiotemporal stimuli. For each stimulus, our extracted computational mechanisms are consistent with prior scientific literature, and in one case yield a new mechanistic hypothesis. Thus, overall, this work not only yields insights into the computational mechanisms underlying the striking predictive capabilities of the retina, but also places the framework of deep networks as neuroscientific models on firmer theoretical foundations, by providing a new roadmap to go beyond comparing neural representations toward extracting and understanding computational mechanisms.
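
The following PyTorch sketch conveys the general flavor of attribution-based importance for hidden units, using a toy stand-in network and a simple gradient-times-activation score; it is not the paper's retina model or its specific attribution method, and all names and sizes are hypothetical.

    import torch
    import torch.nn as nn

    # Toy stand-in for a CNN model of the retina: stimulus -> "interneuron" channels -> ganglion-cell rate.
    class RetinaCNN(nn.Module):
        def __init__(self):
            super().__init__()
            self.interneurons = nn.Conv2d(1, 8, kernel_size=15)  # hidden channels standing in for interneurons
            self.readout = nn.Linear(8, 1)                       # ganglion-cell output

        def forward(self, x):
            h = torch.relu(self.interneurons(x)).mean(dim=(2, 3))  # spatially pooled interneuron activity
            return self.readout(h), h

    model = RetinaCNN()
    stimulus = torch.randn(1, 1, 50, 50)

    rate, h = model(stimulus)
    h.retain_grad()                          # keep gradients at the hidden layer
    rate.sum().backward()

    # Gradient x activation as a crude per-interneuron importance score for this stimulus;
    # ranking and reducing such scores across stimuli is the kind of analysis the paper builds on.
    importance = (h.grad * h).abs().squeeze()
    print(importance)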



Extracting computational mechanisms from neural data using low-rank RNNs

Neural Information Processing Systems

Our contributions can be summarized in three steps: first, we verify the consistency of LINT by applying it to data simulated from lrRNNs, and show that it recovers the effective part of the connectivity that reproduces the dynamics and computations.
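
For orientation, here is a minimal NumPy sketch of the low-rank RNN family that such fits operate on: the recurrent connectivity is constrained to a rank-R outer product, so the dynamics evolve in an R-dimensional latent space. The sizes, noise level, and latent readout are placeholders; this is not the LINT fitting procedure itself.

    import numpy as np

    rng = np.random.default_rng(1)
    N, R, dt, T = 200, 2, 0.1, 500           # units, rank, Euler step, number of steps (placeholders)
    m = rng.standard_normal((N, R))          # left connectivity vectors
    n = rng.standard_normal((N, R))          # right connectivity vectors
    W = m @ n.T / N                          # rank-R recurrent connectivity

    x = np.zeros(N)
    latents = []                             # R-dimensional latent trajectory
    for _ in range(T):
        x = x + dt * (-x + W @ np.tanh(x) + 0.1 * rng.standard_normal(N))
        latents.append(m.T @ x / N)          # projections onto the rank-defining directions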




Computation Mechanism Behind LLM Position Generalization

Han, Chi; Ji, Heng

arXiv.org Artificial Intelligence

Most written natural languages are composed of sequences of words and sentences. Similar to humans, large language models (LLMs) exhibit flexibility in handling textual positions - a phenomenon we term position generalization. They can understand texts with position perturbations and, with the latest techniques, generalize to longer texts than those encountered during training. These phenomena suggest that LLMs handle positions tolerantly, but how LLMs computationally process positional relevance remains largely unexplored. This work connects the linguistic phenomenon with LLMs' computational mechanisms. We show how LLMs enforce certain computational mechanisms that provide the aforementioned tolerance to position perturbations. Despite the complex design of the self-attention mechanism, this work reveals that LLMs learn a counterintuitive disentanglement of attention logits. Their values show a 0.959 linear correlation with an approximation of the arithmetic sum of positional relevance and semantic importance. Furthermore, we identify a prevalent pattern in intermediate features, which we prove theoretically enables this effect. The pattern differs from how randomly initialized parameters would behave, suggesting that it is a learned behavior rather than a natural consequence of the model architecture. Based on these findings, we provide computational explanations and criteria for LLMs' position flexibility. This work takes a pioneering step in linking position generalization with modern LLMs' internal mechanisms.
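
The claimed disentanglement can be illustrated with a toy check like the one below: synthetic attention logits are correlated with the sum of a distance-based positional term and a content-based semantic term. In the paper these components are estimated inside a real LLM; here they are placeholders, so the printed number carries no empirical weight.

    import numpy as np

    rng = np.random.default_rng(2)
    L = 64                                                    # sequence length
    idx = np.arange(L)
    positional = -0.05 * np.abs(np.subtract.outer(idx, idx))  # synthetic distance-based positional term
    semantic = rng.standard_normal((L, L))                    # synthetic content-based semantic term
    logits = positional + semantic + 0.1 * rng.standard_normal((L, L))  # "observed" attention logits

    # How well does the additive positional + semantic approximation explain the logits?
    r = np.corrcoef((positional + semantic).ravel(), logits.ravel())[0, 1]
    print(f"linear correlation: {r:.3f}")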


Reviews: From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction

Neural Information Processing Systems

This manuscript attacks an interesting problem, namely how one could obtain mechanistic insights from a CNN model fit to neural responses. The writing is generally clear, although it would benefit from toning down some statements to more accurately reflect the actual contributions. Overall, the manuscript could be an interesting contribution to the field. However, I am skeptical about various claims made in the paper. My main issues with this manuscript are three-fold: 1. The results are rather incremental relative to refs. [2] and [9, 10].

