neural encoding
A Tractable Approximation to Optimal Point Process Filtering: Application to Neural Encoding
The process of dynamic state estimation (filtering) based on point process observations is in general intractable. Numerical sampling techniques are often practically useful, but lead to limited conceptual insight about optimal encoding/decoding strategies, which are of significant relevance to Computational Neuroscience. We develop an analytically tractable Bayesian approximation to optimal filtering based on point process observations, which allows us to introduce distributional assumptions about sensory cell properties, that greatly facilitates the analysis of optimal encoding in situations deviating from common assumptions of uniform coding. The analytic framework leads to insights which are difficult to obtain from numerical algorithms, and is consistent with experiments about the distribution of tuning curve centers. Interestingly, we find that the information gained from the absence of spikes may be crucial to performance.
Tuning In to Neural Encoding: Linking Human Brain and Artificial Supervised Representations of Language
Sun, Jingyuan, Zhang, Xiaohan, Moens, Marie-Francine
To understand the algorithm that supports the human brain's language representation, previous research has attempted to predict neural responses to linguistic stimuli using embeddings generated by artificial neural networks (ANNs), a process known as neural encoding. However, most of these studies have focused on probing neural representations of Germanic languages, such as English, with unsupervised ANNs. In this paper, we propose to bridge the gap between human brain and supervised ANN representations of the Chinese language. Specifically, we investigate how task tuning influences a pretained Transformer for neural encoding and which tasks lead to the best encoding performances. We generate supervised representations on eight Natural Language Understanding (NLU) tasks using prompt-tuning, a technique that is seldom explored in neural encoding for language. We demonstrate that prompt-tuning yields representations that better predict neural responses to Chinese stimuli than traditional fine-tuning on four tasks. Furthermore, we discover that tasks that require a fine-grained processing of concepts and entities lead to representations that are most predictive of brain activation patterns. Additionally, we reveal that the proportion of tuned parameters highly influences the neural encoding performance of fine-tuned models. Overall, our experimental findings could help us better understand the relationship between supervised artificial and brain language representations.
Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*
Rodrigues, João, Gomes, Luís, Silva, João, Branco, António, Santos, Rodrigo, Cardoso, Henrique Lopes, Osório, Tomás
In recent years, the field of Artificial Intelligence has come to successfully exploit the paradigm of deep learning, a machine learning approach based on large artificial neural networks [LeCun et al., 2015]. Applied to Natural Language Processing (NLP), deep learning gained outstanding traction with notable breakthroughs under the distributional semantics approach, namely with word embedding techniques [Mikolov et al., 2013] and the Transformer neural architecture [Vaswani et al., 2017]. These neural models acquire semantic representations from massive amounts of data in a self-supervised learning process that ultimately results in the so-called Foundation Models [Bommasani et al., 2021]. Self-supervision is accomplished in NLP through language modeling [Bengio et al., 2000] and was initially adopted in shallow neural network models such as Word2Vec [Mikolov et al., 2013] for the creation of word embeddings. Over time, this approach was scaled beyond the single-token level to sequence transduction with encoding-decoding models based on recurrent [Sutskever et al., 2014] or convolution neural networks and occasionally supported by attention mechanisms [Bahdanau et al., 2015]. A particular neural network architecture, the Transformer, has stood out among all others, showing superior performance by a large margin, sometimes even surpassing human-level performance [Wang et al., 2018, Wang et al., 2019], and became mainstream in virtually every NLP task and application [Bommasani et al., 2021]. Several variants have spun out from the base Transformer architecture (encoder-decoder), including the landmark encoder BERT [Devlin et al., 2019] and the outstanding decoder GPT [Brown et al., 2020], which have been most successfully adapted to downstream
A Tractable Approximation to Optimal Point Process Filtering: Application to Neural Encoding
Harel, Yuval, Meir, Ron, Opper, Manfred
The process of dynamic state estimation (filtering) based on point process observations is in general intractable. Numerical sampling techniques are often practically useful, but lead to limited conceptual insight about optimal encoding/decoding strategies, which are of significant relevance to Computational Neuroscience. We develop an analytically tractable Bayesian approximation to optimal filtering based on point process observations, which allows us to introduce distributional assumptions about sensory cell properties, that greatly facilitates the analysis of optimal encoding in situations deviating from common assumptions of uniform coding. The analytic framework leads to insights which are difficult to obtain from numerical algorithms, and is consistent with experiments about the distribution of tuning curve centers. Interestingly, we find that the information gained from the absence of spikes may be crucial to performance.