AITopics | yoshua bengio

Collaborating Authors

yoshua bengio

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

On scalable and efficient training of diffusion samplers

Neural Information Processing Systems | Jun-15-2026, 17:30:34 GMT

We address the challenge of training diffusion models to sample from unnormalized energy distributions in the absence of data, the so-called diffusion samplers. Although these approaches have shown promise, they struggle to scale in more demanding scenarios where energy evaluations are expensive and the sampling space is high-dimensional. To address this limitation, we propose a scalable and sample-efficient framework that combines a powerful classical sampling method with the diffusion sampler. Specifically, we use Markov chain Monte Carlo (MCMC) samplers as a Searcher to collect off-policy samples, adding a novelty-based auxiliary energy function that steers the search toward modes the diffusion sampler rarely visits. These off-policy samples are then combined with on-policy data to train the diffusion sampler, thereby expanding its coverage of the energy landscape. Furthermore, we identify primacy bias, i.e., the preference of samplers for early experience during training, as the main cause of mode collapse, and introduce a periodic re-initialization trick to resolve this issue. Our method significantly improves sample efficiency on standard benchmarks for diffusion samplers and also excels at higher-dimensional problems and real-world molecular conformer generation.

artificial intelligence, machine learning, sampler, (17 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.66)
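
The abstract of "On scalable and efficient training of diffusion samplers" describes an MCMC Searcher that uses an auxiliary energy to reach modes the diffusion sampler rarely visits. The sketch below is one possible illustration of that idea and is not the authors' implementation: it runs a plain random-walk Metropolis chain on a toy 1D double-well energy, with a kernel-based novelty penalty centred on points the sampler has already produced. The energy function, the penalty form, and all names and parameter values (double_well_energy, novelty_penalty, metropolis_search, bandwidth, scale, step_size) are assumptions made for this example. Training the diffusion sampler and the periodic re-initialization step are not shown.

```python
import math
import random


def double_well_energy(x):
    """Hypothetical unnormalized energy with modes near x = -2 and x = +2."""
    return (x * x - 4.0) ** 2 / 4.0


def novelty_penalty(x, visited, bandwidth=0.5, scale=1.0):
    """Hypothetical auxiliary energy: large near already-visited points."""
    if not visited:
        return 0.0
    kernel = sum(
        math.exp(-((x - v) ** 2) / (2.0 * bandwidth ** 2)) for v in visited
    )
    return scale * kernel / len(visited)


def metropolis_search(energy, x0, n_steps, visited=(), step_size=0.5, seed=0):
    """Random-walk Metropolis on energy(x) + novelty_penalty(x, visited)."""
    rng = random.Random(seed)
    visited = list(visited)

    def total_energy(x):
        return energy(x) + novelty_penalty(x, visited)

    x, e = x0, total_energy(x0)
    samples, accepted = [], 0
    for _ in range(n_steps):
        proposal = x + rng.gauss(0.0, step_size)
        e_new = total_energy(proposal)
        # Accept with probability min(1, exp(-(e_new - e))).
        if e_new <= e or rng.random() < math.exp(e - e_new):
            x, e = proposal, e_new
            accepted += 1
        samples.append(x)
    return samples, accepted / n_steps


# Hypothetical on-policy samples, all stuck in the left mode.
on_policy = [-2.1, -1.9, -2.0]

# The Searcher is pushed away from the region the sampler already covers.
off_policy, acceptance_rate = metropolis_search(
    double_well_energy, x0=-2.0, n_steps=2000, visited=on_policy
)

# Off-policy and on-policy samples are pooled into one training set.
training_batch = on_policy + off_policy
```

Because the penalty is added to the target energy, the chain samples from a modified distribution rather than the original one; in this sketch its samples only serve as extra off-policy training data, which is how the abstract describes their role.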

Towards Deep Conversational Recommendations

Raymond Li, Samira Ebrahimi Kahou, Hannes Schulz, Vincent Michalski, Laurent Charlin, Chris Pal

Neural Information Processing Systems | Mar-15-2026, 18:59:07 GMT

For each participant it […] three labels: the "suggested" label (binary), the "seen" label (categorical with three classes), and the "liked" label (categorical with three classes), for a total of 14 dimensions.

machine learning, natural language, yoshua bengio, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.05)
Oceania > New Zealand (0.04)
North America > United States (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.70)
Information Technology > Communications > Social Media (0.49)
Information Technology > Artificial Intelligence > Natural Language (0.48)

Global Sparse Momentum SGD for Pruning Very Deep Neural Networks

Xiaohan Ding, Guiguang Ding, Xiangxin Zhou, Yuchen Guo, Jungong Han, Ji Liu

Neural Information Processing Systems | Feb-15-2026, 01:53:14 GMT

DNN pruning is an approach for deep model compression, which aims at eliminating some parameters with tolerable performance degradation.

artificial intelligence, machine learning, pruning, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement

Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan

Neural Information Processing Systems | Feb-14-2026, 23:36:59 GMT

In contrast to Learning from Demonstration (LfD), which involves both action and state supervision, LfO is more practical in leveraging previously inapplicable resources (e.g.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > China (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Adaptive Cross-Modal Few-shot Learning

Chen Xing, Negar Rostamzadeh, Boris Oreshkin, Pedro O. O. Pinheiro

Neural Information Processing Systems | Feb-14-2026, 11:18:16 GMT

Neural Information Processing Systems http://nips.cc/

latexit sha1, learning, modality, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.05)
North America > United States > California (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Multi-Task Zipping via Layer-wise Neuron Sharing

Xiaoxi He, Zimu Zhou, Lothar Thiele

Neural Information Processing Systems | Feb-14-2026, 03:26:18 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, wal, (11 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Visualizing the Loss Landscape of Neural Nets

Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein

Neural Information Processing Systems | Feb-14-2026, 00:27:42 GMT

For surface plots of ResNet-56, see Figure 1.

artificial intelligence, figure 2, machine learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland (0.04)
North America > Canada > Quebec > Montreal (0.04)
Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.65)

Visualizing the PHATE of Neural Networks

Scott Gigante, Adam S. Charles, Smita Krishnaswamy, Gal Mishne

Neural Information Processing Systems | Feb-13-2026, 17:55:29 GMT

We demonstrate that our visualization provides intuitive, detailed summaries of the learning dynamics beyond simple global measures (i.e., validation loss and accuracy), without the need to access validation data. Furthermore, M-PHATE better captures both the dynamics and community structure of the hidden units as compared to visualization based on standard dimensionality reduction methods (e.g., ISOMAP, t-SNE).

artificial intelligence, machine learning, urlhttp, (18 more...)

Neural Information Processing Systems

Country: