AITopics

doi: 10.1088/1751-8121/acfeb7

2201.12305

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
(5 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.82)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.82)

Cabannes, Vivien, Dohmatob, Elvis, Bietti, Alberto

Scaling Laws for Associative Memories

arXiv.org Machine LearningOct-4-2023

Learning arguably involves the discovery and memorization of abstract rules. The aim of this paper is to study associative memory mechanisms. Our model is based on high-dimensional matrices consisting of outer products of embeddings, which relates to the inner layers of transformer language models. We derive precise scaling laws with respect to sample size and parameter size, and discuss the statistical efficiency of different estimators, including optimization-based algorithms. We provide extensive numerical experiments to validate and interpret theoretical results, including fine-grained visualizations of the stored memory associations.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2310.02984

Country:

North America > United States (0.14)
Europe > Portugal > Lisbon > Lisbon (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(3 more...)

arXiv.org Artificial IntelligenceSep-28-2023

Memory in Plain Sight: A Survey of the Uncanny Resemblances between Diffusion Models and Associative Memories

Hoover, Benjamin, Strobelt, Hendrik, Krotov, Dmitry, Hoffman, Judy, Kira, Zsolt, Chau, Duen Horng

Diffusion Models (DMs) have recently set state-of-the-art on many generation benchmarks. However, there are myriad ways to describe them mathematically, which makes it difficult to develop a simple understanding of how they work. In this survey, we provide a concise overview of DMs from the perspective of dynamical systems and Ordinary Differential Equations (ODEs) which exposes a mathematical connection to the highly related yet often overlooked class of energy-based models, called Associative Memories (AMs). Energy-based AMs are a theoretical framework that behave much like denoising DMs, but they enable us to directly compute a Lyapunov energy function on which we can perform gradient descent to denoise data. We then summarize the 40 year history of energy-based AMs, beginning with the original Hopfield Network, and discuss new research directions for AMs and DMs that are revealed by characterizing the extent of their similarities and differences

diffusion model and associative memory, plain sight, uncanny resemblance

2309.1675

Genre: Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.60)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.60)

Pickering, Lynn, Almajano, Tereso Del Rio, England, Matthew, Cohen, Kelly

Explainable AI Insights for Symbolic Computation: A case study on selecting the variable ordering for cylindrical algebraic decomposition

arXiv.org Artificial IntelligenceAug-29-2023

In recent years there has been increased use of machine learning (ML) techniques within mathematics, including symbolic computation where it may be applied safely to optimise or select algorithms. This paper explores whether using explainable AI (XAI) techniques on such ML models can offer new insight for symbolic computation, inspiring new implementations within computer algebra systems that do not directly call upon AI tools. We present a case study on the use of ML to select the variable ordering for cylindrical algebraic decomposition. It has already been demonstrated that ML can make the choice well, but here we show how the SHAP tool for explainability can be used to inform new heuristics of a size and complexity similar to those human-designed heuristics currently commonly used in symbolic computation.

avg, cylindrical algebraic decomposition, polynomial, (15 more...)

doi: 10.1016/j.jsc.2023.102276

2304.12154

Country:

North America > United States > Ohio (0.04)
Oceania > Nauru (0.04)
Europe > United Kingdom > England > West Midlands (0.04)
Europe > Slovenia (0.04)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Simas, Rodrigo, Sa-Couto, Luis, Wichert, Andreas

Classification and Generation of real-world data with an Associative Memory Model

arXiv.org Artificial IntelligenceJul-13-2023

Drawing from memory the face of a friend you have not seen in years is a difficult task. However, if you happen to cross paths, you would easily recognize each other. The biological memory is equipped with an impressive compression algorithm that can store the essential, and then infer the details to match perception. The Willshaw Memory is a simple abstract model for cortical computations which implements mechanisms of biological memories. Using our recently proposed sparse coding prescription for visual patterns, this model can store and retrieve an impressive amount of real-world data in a fault-tolerant manner. In this paper, we extend the capabilities of the basic Associative Memory Model by using a Multiple-Modality framework. In this setting, the memory stores several modalities (e.g., visual, or textual) of each pattern simultaneously. After training, the memory can be used to infer missing modalities when just a subset is perceived. Using a simple encoder-memory-decoder architecture, and a newly proposed iterative retrieval algorithm for the Willshaw Model, we perform experiments on the MNIST dataset. By storing both the images and labels as modalities, a single Memory can be used not only to retrieve and complete patterns but also to classify and generate new ones. We further discuss how this model could be used for other learning tasks, thus serving as a biologically-inspired framework for learning.

information, modality, vector, (14 more...)

doi: 10.1016/j.neucom.2023.126514

2207.04827

Country:

Europe > Portugal > Lisbon > Lisbon (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Saarland (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.62)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.62)

arXiv.org Artificial IntelligenceMay-5-2023

Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

Zhang, Hanlin, Huang, Jiani, Li, Ziyang, Naik, Mayur, Xing, Eric

Pre-trained large language models (LMs) struggle to perform logical reasoning reliably despite advances in scale and compositionality. In this work, we tackle this challenge through the lens of symbolic programming. We propose DSR-LM, a Differentiable Symbolic Reasoning framework where pre-trained LMs govern the perception of factual knowledge, and a symbolic module performs deductive reasoning. In contrast to works that rely on hand-crafted logic rules, our differentiable symbolic reasoning framework efficiently learns weighted rules and applies semantic loss to further improve LMs. DSR-LM is scalable, interpretable, and allows easy integration of prior knowledge, thereby supporting extensive symbolic programming to robustly derive a logical conclusion. The results of our experiments suggest that DSR-LM improves the logical reasoning abilities of pre-trained language models, resulting in a significant increase in accuracy of over 20% on deductive reasoning benchmarks. Furthermore, DSR-LM outperforms a variety of competitive baselines when faced with systematic changes in sequence length.

large language model, machine learning, natural language, (21 more...)

2305.03742

Country: North America > United States > Pennsylvania (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
(2 more...)

Neural Information Processing SystemsApr-6-2023, 20:09:50 GMT

Performance Measures for Associative Memories that Learn and Forget

Recently, many modifications to the McCulloch/Pitts model have been proposed where both learning and forgetting occur. Given that the network never saturates (ceases to function effectively due to an overload of information), the learning updates can con(cid:173) tinue indefinitely. For these networks, we need to introduce performance measmes in addi(cid:173) tion to the information capacity to evaluate the different networks. We mathematically define quantities such as the plasticity of a network, the efficacy of an information vector, and the probability of network saturation. From these quantities we analytically compare different networks.

associative memory, learn and forget, performance measure, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.40)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

Neural Information Processing SystemsApr-6-2023, 20:09:43 GMT

Capacity for Patterns and Sequences in Kanerva's SDM as Compared to Other Associative Memory Models

The information capacity of Kanerva's Sparse, Distributed Memory (SDM) and Hopfield-type neural networks is investigated. Under the approximations used here, it is shown that the to(cid:173) tal information stored in these systems is proportional to the number connections in the net(cid:173) work. The proportionality constant is the same for the SDM and HopJreld-type models in(cid:173) dependent of the particular model, or the order of the model. The approximations are checked numerically. This same analysis can be used to show that the SDM can store se(cid:173) quences of spatiotemporal patterns, and the addition of time-delayed connections allows the retrieval of context dependent temporal patterns.

associative memory model, kanerva, pattern and sequence, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.40)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.31)

Neural Information Processing SystemsApr-6-2023, 20:09:18 GMT

The Capacity of the Kanerva Associative Memory is Exponential

It is shown by sphere packing arguments that as the address length increases. This exponential grovth in capacity can actually be achieved by the Kanerva associative memory. Formulas for these op.timal values are provided. The exponential grovth in capacity for the Kanerva associative memory contrasts sharply vith the sub-linear grovth in capacity for the Hopfield associative memory. Our model of an associative memory is the folloving.

associative memory, kanerva associative memory, sequence, (8 more...)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Neural Information Processing SystemsApr-6-2023, 20:04:37 GMT

Invariant Object Recognition Using a Distributed Associative Memory

This paper describes an approach to 2-dimensional object recognition. Complex-log con(cid:173) formal mapping is combined with a distributed associative memory to create a system which recognizes objects regardless of changes in rotation or scale. Recalled information from the memorized database is used to classify an object, reconstruct the memorized ver(cid:173) sion of the object, and estimate the magnitude of changes in scale or rotation. The system response is resistant to moderate amounts of noise and occlusion. Several experiments, us(cid:173) ing real, gray scale images, are presented to show the feasibility of our approach.

associative memory, cid, invariant object recognition

Technology:

Information Technology > Artificial Intelligence > Vision (0.69)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.69)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.69)