

GAUCHE: A Library for Gaussian Processes in Chemistry

Neural Information Processing Systems

We introduce GAUCHE, an open-source library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to molecular representations, however, necessitates kernels defined over structured inputs such as graphs, strings and bit vectors. By providing such kernels in a modular, robust and easy-to-use framework, we seek to enable expert chemists and materials scientists to make use of state-of-the-art black-box optimisation techniques. Motivated by scenarios frequently encountered in practice, we showcase applications for GAUCHE in molecular discovery, chemical reaction optimisation and protein design.
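Bit-vector kernels are the simplest of the structured inputs mentioned above: molecular fingerprints are binary vectors, and one kernel GAUCHE provides for them is the Tanimoto (Jaccard) kernel. The following is a library-independent sketch in plain NumPy, not the GAUCHE/GPyTorch API; the toy fingerprints are illustrative:

```python
import numpy as np

def tanimoto_kernel(X, Y):
    """Tanimoto similarity between binary fingerprint matrices.

    X: (n, d) and Y: (m, d) arrays of 0/1 fingerprints.
    Returns an (n, m) kernel matrix with entries
    k(x, y) = <x, y> / (|x|^2 + |y|^2 - <x, y>).
    """
    cross = X @ Y.T                        # <x, y> for every pair
    x_norm = (X * X).sum(axis=1)[:, None]  # |x|^2 per row of X
    y_norm = (Y * Y).sum(axis=1)[None, :]  # |y|^2 per row of Y
    return cross / (x_norm + y_norm - cross)

# Two toy 8-bit "fingerprints"; identical inputs give similarity 1.0.
X = np.array([[1, 0, 1, 1, 0, 0, 1, 0],
              [1, 1, 0, 0, 0, 1, 1, 0]])
K = tanimoto_kernel(X, X)
```

The resulting matrix is symmetric and positive semi-definite, so it can be dropped into a standard Gaussian-process regression in place of, say, an RBF kernel.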


A Mixture Of Surprises for Unsupervised Reinforcement Learning

Neural Information Processing Systems

Unsupervised reinforcement learning aims at learning a generalist policy in a reward-free manner for fast adaptation to downstream tasks. Most of the existing methods propose to provide an intrinsic reward based on surprise. Maximizing or minimizing surprise drives the agent to either explore or gain control over its environment. However, both strategies rely on a strong assumption: the entropy of the environment's dynamics is either high or low. This assumption may not always hold in real-world scenarios, where the entropy of the environment's dynamics may be unknown. Hence, choosing between the two objectives is a dilemma. We propose a novel yet simple mixture of policies to address this concern, allowing us to optimize an objective that simultaneously maximizes and minimizes the surprise. Concretely, we train one mixture component whose objective is to maximize the surprise and another whose objective is to minimize the surprise. Hence, our method does not make assumptions about the entropy of the environment's dynamics.
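The sign-flip construction at the heart of this mixture can be sketched in a few lines. This is an illustrative sketch, not the authors' implementation: the surprise measure (e.g. a dynamics-model prediction error) is left abstract, and the mixing probability `p_max` is a placeholder:

```python
import random

def intrinsic_reward(surprise, component):
    """Two-component mixture: component 0 is rewarded for high surprise
    (exploration), component 1 for low surprise (control)."""
    return surprise if component == 0 else -surprise

def sample_component(p_max=0.5, rng=random):
    """Draw which mixture component acts for the coming episode."""
    return 0 if rng.random() < p_max else 1

# The same surprise value yields opposite intrinsic rewards, so the two
# components optimize the two objectives simultaneously.
rewards = [intrinsic_reward(2.5, c) for c in (0, 1)]
```

Because each component only ever sees its own signed reward, no assumption about the environment's dynamics entropy is baked into the objective.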


MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling

Zhang, Yu, Zhen, Hui-Ling, Yuan, Mingxuan, Yu, Bei

arXiv.org Artificial Intelligence

Training large language models with FP8 formats offers significant efficiency gains. However, the reduced numerical precision of FP8 poses challenges for stable and accurate training. Current frameworks preserve training performance using mixed-granularity quantization, i.e., applying per-group quantization for activations and per-tensor/block quantization for weights. While effective, per-group quantization requires scaling along the inner dimension of matrix multiplication, introducing additional dequantization overhead. Moreover, these frameworks often rely on just-in-time scaling to dynamically adjust scaling factors based on the current data distribution. However, this online quantization is inefficient for FP8 training, as it involves multiple memory reads and writes that negate the performance benefits of FP8. To overcome these limitations, we propose MOSS, a novel FP8 training framework that ensures both efficiency and numerical stability. MOSS introduces two key innovations: (1) a two-level microscaling strategy for quantizing sensitive activations, which balances precision and dequantization cost by combining a high-precision global scale with compact, power-of-two local scales; and (2) automatic scaling for weights in linear layers, which eliminates the need for costly max-reduction operations by predicting and adjusting scaling factors during training. Leveraging these techniques, MOSS enables efficient FP8 training of a 7B parameter model, achieving performance comparable to the BF16 baseline while delivering up to 34% higher training throughput. Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks, including reasoning, language understanding, and generation (Achiam et al., 2023; Grattafiori et al., 2024; Liu et al., 2024; Adler et al., 2024).
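The two-level idea can be illustrated numerically: one high-precision scale maps the whole tensor into FP8 range, and each group stores only a power-of-two correction exponent, which is cheap to hold and to apply (an exponent add rather than a multiply). The sketch below is illustrative, not the MOSS API: plain NumPy, round-and-clip standing in for an actual FP8 E4M3 cast, a hypothetical `group_size`, and a nonzero input tensor whose length divides evenly into groups:

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest representable magnitude in FP8 E4M3

def two_level_quantize(x, group_size=32):
    """Illustrative two-level scaling: a high-precision global scale per
    tensor plus a power-of-two local exponent per group.

    Returns (q, global_scale, local_exp) so that, per group,
    x ~= q * global_scale * 2**local_exp.
    """
    x = x.reshape(-1, group_size)                  # assumes size % group_size == 0
    global_scale = np.abs(x).max() / FP8_E4M3_MAX  # high-precision global scale
    group_amax = np.abs(x).max(axis=1, keepdims=True)
    # Local scale is the smallest power of two that keeps the group in range;
    # the floor avoids log2(0) for all-zero groups.
    ratio = np.maximum(group_amax / (global_scale * FP8_E4M3_MAX), 2.0 ** -126)
    local_exp = np.ceil(np.log2(ratio))
    scale = global_scale * 2.0 ** local_exp
    # Round-and-clip stands in for the actual FP8 cast.
    q = np.clip(np.round(x / scale), -FP8_E4M3_MAX, FP8_E4M3_MAX)
    return q, global_scale, local_exp

x = np.linspace(-1.0, 1.0, 64)
q, gs, le = two_level_quantize(x)
x_hat = (q * gs * 2.0 ** le).reshape(-1)  # dequantize
```

Because the local exponents are never positive and the global scale absorbs the tensor's magnitude, every group fits in the FP8 range while groups with small values keep finer resolution.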


On Instability of Minimax Optimal Optimism-Based Bandit Algorithms

Praharaj, Samya, Khamaru, Koulik

arXiv.org Machine Learning

Statistical inference from data generated by multi-armed bandit (MAB) algorithms is challenging due to their adaptive, non-i.i.d. nature. A classical manifestation is that sample averages of arm rewards under bandit sampling may fail to satisfy a central limit theorem. Lai and Wei's stability condition provides a sufficient, and essentially necessary, criterion for asymptotic normality in bandit problems. While the celebrated Upper Confidence Bound (UCB) algorithm satisfies this stability condition, it is not minimax optimal, raising the question of whether minimax optimality and statistical stability can be achieved simultaneously. In this paper, we analyze the stability properties of a broad class of bandit algorithms that are based on the optimism principle. We establish general structural conditions under which such algorithms violate the Lai-Wei stability criterion. As a consequence, we show that widely used minimax-optimal UCB-style algorithms, including MOSS, Anytime-MOSS, Vanilla-MOSS, ADA-UCB, OC-UCB, KL-MOSS, KL-UCB++, KL-UCB-SWITCH, and Anytime KL-UCB-SWITCH, are unstable. We further complement our theoretical results with numerical simulations demonstrating that, in all these cases, the sample means fail to exhibit asymptotic normality. Overall, our findings suggest a fundamental tension between stability and minimax optimal regret, raising the question of whether it is possible to design bandit algorithms that achieve both. Understanding whether such simultaneously stable and minimax optimal strategies exist remains an important open direction.
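For concreteness, the MOSS index (the first algorithm in the list above, due to Audibert and Bubeck) augments each arm's empirical mean with a bonus that vanishes once an arm has been pulled more than about n/K times; this aggressive exploitation is the kind of behaviour the stability analysis targets. A minimal sketch of the index, following the standard formulation:

```python
import math

def moss_index(mean, pulls, horizon, n_arms):
    """MOSS index: empirical mean plus the exploration bonus
    sqrt(max(0, ln(horizon / (n_arms * pulls))) / pulls)."""
    bonus = math.sqrt(max(0.0, math.log(horizon / (n_arms * pulls))) / pulls)
    return mean + bonus

def choose_arm(means, pulls, horizon):
    """Pull each arm once, then play the arm with the largest MOSS index."""
    K = len(means)
    for i, t in enumerate(pulls):
        if t == 0:
            return i
    indices = [moss_index(means[i], pulls[i], horizon, K) for i in range(K)]
    return max(range(K), key=indices.__getitem__)
```

Note that once `pulls > horizon / n_arms`, the logarithm goes negative and the bonus is clipped to zero, so the index collapses to the bare sample mean.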


Moss survived 283 days in space, shocking biologists

Popular Science

After defying multiple mass extinctions on Earth, the hardy plant passes an intergalactic test. While it may appear humble, Earth's moss is built darn tough. It thrives in extreme environments, from the bitter cold, low-oxygen air of the Himalayas down to the parched sands of Death Valley. Some species even make their home among the lava fields of active volcanoes.


A Additional Implementation Details

Neural Information Processing Systems

These hyperparameters are fixed throughout all domains. Tab. 1 details the hyper-parameters used in MOSS, which are taken directly from prior work. We include the environment renders in Figure ??.

Table 2: Hyperparameters for MOSS and DQN. These hyperparameters are fixed throughout all domains.
  Action repeat: 1
  Frame repeat: 12
  Seed frames: 4000
  n-step returns: 3
  Mini-batch size: 1048
  Discount (γ): 0.99
  Optimizer: Adam
  Learning rate: 0.0001
  Agent update frequency: 2
  Critic target EMA rate (τ):

We made modifications to MOSS to evaluate in discrete action settings. Tab. 2 details the hyper-parameters used for Double DQN and MOSS in the ViZDoom environment.




Moss can be a key witness in murder investigations

Popular Science

Botanists say detectives are overlooking a potentially vital source of crime scene evidence. Moss is one of the world's oldest and most basic plants. Part of the bryophyte family, the estimated 12,000 known moss species have evolved over millions of years to flourish without seeds, leaves, stems, or even roots. This allows the sturdy plants to absorb all their water and nutrients from the environment around them.


In 1925, seven students went 60 hours without sleep--for science

Popular Science

Scientists were out to prove sleep was just a waste of time. Among the students who participated in the sleep deprivation study was the future head of the psychology department at George Washington University. The grueling Medical College Admission Test, or MCAT, was first devised in the 1920s by George Washington University professor Frederick August Moss. Originally called the Scholastic Aptitude Test for Medical Students, the readiness test was developed by Moss as a way to curb high dropout rates in medical schools.