AITopics

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Kobayashi, Mutsumi, Watanabe, Hiroshi

Learning and composing of classical music using restricted Boltzmann machines

arXiv.org Artificial IntelligenceDec-1-2025

We investigate how machine learning models acquire the ability to compose music and how musical information is internally represented within such models. We develop a composition algorithm based on a restricted Boltzmann machine (RBM), a simple generative model capable of producing musical pieces of arbitrary length. We convert musical scores into piano-roll image representations and train the RBM in an unsupervised manner. We confirm that the trained RBM can generate new musical pieces; however, by analyzing the model's responses and internal structure, we find that the learned information is not stored in a form directly interpretable by humans. This study contributes to a better understanding of how machine learning models capable of music composition may internally represent musical structure and highlights issues related to the interpretability of generative models in creative tasks.

artificial intelligence, machine learning, rbm, (18 more...)

2509.04899

Country: Asia > Japan (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.63)

Gerlach, Thore, Schenk, Michael, Kain, Verena

Quantum Boltzmann Machines for Sample-Efficient Reinforcement Learning

arXiv.org Artificial IntelligenceNov-10-2025

We introduce theoretically grounded Continuous Semi-Quantum Boltzmann Machines (CSQBMs) that supports continuous-action reinforcement learning. By combining exponential-family priors over visible units with quantum Boltzmann distributions over hidden units, CSQBMs yield a hybrid quantum-classical model that reduces qubit requirements while retaining strong expressiveness. Crucially, gradients with respect to continuous variables can be computed analytically, enabling direct integration into Actor-Critic algorithms. Building on this, we propose a continuous Q-learning framework that replaces global maximization by efficient sampling from the CSQBM distribution, thereby overcoming instability issues in continuous control.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

2511.04856

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Takayuki Osogami, Makoto Otsuka

Restricted Boltzmann machines modeling human choice

Neural Information Processing SystemsOct-9-2025, 14:33:18 GMT

We extend the multinomial logit model to represent some of the empirical phenomena that are frequently observed in the choices made by humans. These phenomena include the similarity effect, the attraction effect, and the compromise effect. We formally quantify the strength of these phenomena that can be represented by our choice model, which illuminates the flexibility of our choice model. We then show that our choice model can be represented as a restricted Boltzmann machine and that its parameters can be learned effectively from data. Our numerical experiments with real data of human choices suggest that we can train our choice model in such a way that it represents the typical phenomena of choice.

choice model, rbm choice model, typical choice phenomenon, (14 more...)

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.61)

Neural Information Processing SystemsOct-2-2025, 19:46:36 GMT

From Boltzmann Machines to Neural Networks and Back Again

Graphical models are powerful tools for modeling high-dimensional data, but learning graphical models in the presence of latent variables is well-known to be difficult. In this work we give new results for learning Restricted Boltzmann Machines, probably the most well-studied class of latent variable models.

algorithm, artificial intelligence, machine learning, (18 more...)

Country:

North America > United States (0.46)
Europe > United Kingdom > England (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

arXiv.org Artificial IntelligenceFeb-24-2025

Expressive equivalence of classical and quantum restricted Boltzmann machines

Demidik, Maria, Tüysüz, Cenk, Piatkowski, Nico, Grossi, Michele, Jansen, Karl

Quantum computers offer the potential for efficiently sampling from complex probability distributions, attracting increasing interest in generative modeling within quantum machine learning. This surge in interest has driven the development of numerous generative quantum models, yet their trainability and scalability remain significant challenges. A notable example is a quantum restricted Boltzmann machine (QRBM), which is based on the Gibbs state of a parameterized non-commuting Hamiltonian. While QRBMs are expressive, their non-commuting Hamiltonians make gradient evaluation computationally demanding, even on fault-tolerant quantum computers. In this work, we propose a semi-quantum restricted Boltzmann machine (sqRBM), a model designed for classical data that mitigates the challenges associated with previous QRBM proposals. The sqRBM Hamiltonian is commuting in the visible subspace while remaining non-commuting in the hidden subspace. This structure allows us to derive closed-form expressions for both output probabilities and gradients. Leveraging these analytical results, we demonstrate that sqRBMs share a close relationship with classical restricted Boltzmann machines (RBM). Our theoretical analysis predicts that, to learn a given probability distribution, an RBM requires three times as many hidden units as an sqRBM, while both models have the same total number of parameters. We validate these findings through numerical simulations involving up to 100 units. Our results suggest that sqRBMs could enable practical quantum machine learning applications in the near future by significantly reducing quantum resource requirements.

boltzmann machine, hamiltonian, sqrbm, (14 more...)

2502.17562

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Oregon (0.04)
(5 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Takayuki Osogami, Makoto Otsuka

Restricted Boltzmann machines modeling human choice

Neural Information Processing SystemsFeb-8-2025, 23:38:25 GMT

artificial intelligence, choice model, machine learning, (17 more...)

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.61)

Neural Information Processing SystemsFeb-6-2025, 10:10:08 GMT

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Detailed comments: 40 - You might be interested in [MacKay, David. Artificial Intelligence] is one earlier example of its use that I'm familiar with. A paper by Salakhutdinov suggests that [H. Stat., 22:400-407, 1951.] is the earliest reference, though I have not read this older paper. Doesn't m have a single fixed point?

author feedback and meta-review, discussion, export review, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Fournier, Samantha J., Urbani, Pierfrancesco

Generative modeling through internal high-dimensional chaotic activity

arXiv.org Artificial IntelligenceMay-17-2024

Generative models aim to create samples statistically similar to those belonging to a training dataset: their goal is to fit the probability distribution from which the datapoints supposedly come from. In a generic setting, this probability distribution takes the form of a Boltzmann factor. The corresponding Energy Based Models (EBMs) fit the parameters of the Hamiltonian of the Boltzmann distribution and can be viewed as maximum entropy models, where the statistical properties of the dataset are imposed as constraints to low degree correlation functions [1, 2], (see [3, 4] for recent reviews). The resulting learning rule can be viewed as a gradient ascent on the Log-Likelihood (LL). However, running the training dynamics is a notoriously challenging task: at each epoch, the evaluation of the gradient of the LL requires the computation of the correlation functions of the degrees of freedom as predicted from the current estimation of the model's probability distribution. This is typically an intractable problem from an analytical point of view and is generally tackled numerically through parallel Monte Carlo Markov Chain (MCMC) simulations.

architecture, dynamical system, generative model, (15 more...)

2405.10822

Country:

Europe > France (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

Böttcher, Lucas, Wheeler, Gregory

Statistical Mechanics and Artificial Neural Networks: Principles, Models, and Applications

arXiv.org Artificial IntelligenceApr-5-2024

The field of neuroscience and the development of artificial neural networks (ANNs) have mutually influenced each other, drawing from and contributing to many concepts initially developed in statistical mechanics. Notably, Hopfield networks and Boltzmann machines are versions of the Ising model, a model extensively studied in statistical mechanics for over a century. In the first part of this chapter, we provide an overview of the principles, models, and applications of ANNs, highlighting their connections to statistical mechanics and statistical learning theory. Artificial neural networks can be seen as high-dimensional mathematical functions, and understanding the geometric properties of their loss landscapes (i.e., the high-dimensional space on which one wishes to find extrema or saddles) can provide valuable insights into their optimization behavior, generalization abilities, and overall performance. Visualizing these functions can help us design better optimization methods and improve their generalization abilities. Thus, the second part of this chapter focuses on quantifying geometric properties and visualizing loss functions associated with deep ANNs.

curvature, neural network, projection, (16 more...)

2405.10957

Country:

North America > United States > New York > Richmond County > New York City (0.14)
North America > United States > New York > Queens County > New York City (0.14)
North America > United States > New York > New York County > New York City (0.14)
(31 more...)

Genre:

Overview (0.68)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.92)