AITopics | sgb

Does Stochastic Gradient really succeed for bandits?

Neural Information Processing Systems, Jun-13-2026, 16:07:31 GMT

Recent works of Mei et al. (2023, 2024) have deepened the theoretical understanding of the *Stochastic Gradient Bandit* (SGB) policy, showing that using a constant learning rate guarantees asymptotic convergence to the optimal policy, and that sufficiently *small* learning rates can yield logarithmic regret. However, whether logarithmic regret holds beyond small learning rates remains unclear. In this work, we take a step towards characterizing the regret *regimes* of SGB as a function of its learning rate. For two-armed bandits, we identify a sharp threshold, scaling with the sub-optimality gap $\Delta$, below which SGB achieves *logarithmic* regret on all instances, and above which it can incur *polynomial* regret on some instances. This result highlights the necessity of knowing (or estimating) $\Delta$ to ensure logarithmic regret with a constant learning rate. For general $K$-armed bandits, we further show the learning rate must scale inversely with $K$ to avoid polynomial regret. We introduce novel techniques to derive regret upper bounds for SGB, laying the groundwork for future advances in the theory of gradient-based bandit algorithms.
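For readers unfamiliar with the policy discussed in this abstract, the following is a minimal sketch of a softmax gradient bandit with a constant learning rate. It is an illustration under stated assumptions, not the authors' code: the update rule is the textbook gradient bandit step without a baseline, rewards are assumed Bernoulli, and the names `run_sgb`, `means`, and `eta` are hypothetical.

```python
import math
import random


def softmax(theta):
    """Numerically stable softmax over a list of arm preferences."""
    m = max(theta)
    exps = [math.exp(t - m) for t in theta]
    z = sum(exps)
    return [e / z for e in exps]


def run_sgb(means, eta, horizon, seed=0):
    """Simulate a softmax gradient bandit with constant learning rate eta.

    Assumptions (not from the source): Bernoulli rewards with the given
    means, and the baseline-free update theta += eta * r * (e_a - pi).
    Returns the final preferences and the cumulative pseudo-regret.
    """
    rng = random.Random(seed)
    k = len(means)
    theta = [0.0] * k          # uniform initial policy
    best = max(means)
    regret = 0.0
    for _ in range(horizon):
        pi = softmax(theta)
        arm = rng.choices(range(k), weights=pi)[0]
        reward = 1.0 if rng.random() < means[arm] else 0.0
        # Stochastic gradient step on the expected reward of the softmax policy.
        for i in range(k):
            indicator = 1.0 if i == arm else 0.0
            theta[i] += eta * reward * (indicator - pi[i])
        regret += best - means[arm]
    return theta, regret


if __name__ == "__main__":
    # Two-armed instance with gap 0.2; compare a small and a large learning rate.
    for eta in (0.1, 2.0):
        _, regret = run_sgb([0.6, 0.4], eta=eta, horizon=5000, seed=1)
        print(f"eta={eta}: cumulative pseudo-regret {regret:.1f}")
```

Running the script with several values of `eta` and several seeds gives an informal feel for how the learning rate affects regret, though a simulation like this cannot confirm the threshold stated in the abstract.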

artificial intelligence, machine learning, proceedings, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Fed-GraB: Federated Long-tailed Learning with Self-Adjusting Gradient Balancer

Neural Information Processing Systems, Feb-17-2026, 23:41:50 GMT

For instance, patients' diagnosis varies substantially across medical centers but collaboratively form long-tailed distributions for

artificial intelligence, fed-grab, machine learning, (12 more...)

Country:

Asia > Singapore (0.04)
Asia > China > Zhejiang Province (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.68)

Industry:

Information Technology > Security & Privacy (0.68)
Health & Medicine > Health Care Providers & Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Vision (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Appendix A Performance on real-world based instances

Neural Information Processing Systems, Feb-8-2026, 09:26:27 GMT

We further evaluate SGBS+EAS on nine real-world based instance sets from [15]. Each instance set consists of 20 instances that have similar characteristics (i.e., they have been sampled from the same underlying distribution). To account for this new evaluation setting, we always perform 10 runs in parallel for EAS and SGBS+EAS. This improves the solution quality, while leading only to a slight increase of the required runtime. For SGBS+EAS we set (β, γ) = (35, 5), the learning rate α = 0.005 and λ = 0.05.

artificial intelligence, candidate solution, machine learning, (19 more...)

Country: North America > United States (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Simulation-guided Beam Search for Neural Combinatorial Optimization

Neural Information Processing Systems, Feb-8-2026, 09:26:24 GMT

Neural approaches for combinatorial optimization (CO) equip a learning mechanism to discover powerful heuristics for solving complex real-world problems. While neural approaches capable of high-quality solutions in a single shot are emerging, state-of-the-art approaches are often unable to take full advantage of the solving time available to them. In contrast, hand-crafted heuristics perform highly effective search and exploit the computation time given to them, but contain heuristics that are difficult to adapt to a dataset being solved.

artificial intelligence, deep learning, machine learning, (19 more...)

Country: Europe > Germany (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Simulation-guided Beam Search for Neural Combinatorial Optimization

Neural Information Processing Systems, Dec-24-2025, 01:13:25 GMT

Neural approaches for combinatorial optimization (CO) equip a learning mechanism to discover powerful heuristics for solving complex real-world problems. While neural approaches capable of high-quality solutions in a single shot are emerging, state-of-the-art approaches are often unable to take full advantage of the solving time available to them. In contrast, hand-crafted heuristics perform highly effective search and exploit the computation time given to them, but contain heuristics that are difficult to adapt to a dataset being solved. With the goal of providing a powerful search procedure to neural CO approaches, we propose simulation-guided beam search (SGBS), which examines candidate solutions within a fixed-width tree search that both a neural net-learned policy and a simulation (rollout) identify as promising. We further hybridize SGBS with efficient active search (EAS), where SGBS enhances the quality of solutions backpropagated in EAS, and EAS improves the quality of the policy used in SGBS. We evaluate our methods on well-known CO benchmarks and show that SGBS significantly improves the quality of the solutions found under reasonable runtime assumptions.
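The search loop described in this abstract can be sketched on a toy routing problem. This is a rough illustration under stated assumptions, not the authors' implementation: a nearest-neighbour ranking stands in for the learned policy, the loop is organized as expand, simulate, prune based on one reading of the abstract, and all names (`sgbs`, `ranked_children`, `greedy_rollout`, `beta`, `gamma`) are hypothetical. EAS is not modelled.

```python
import math


def tour_length(points, tour):
    """Length of the closed tour visiting points in the given order."""
    n = len(tour)
    return sum(math.dist(points[tour[i]], points[tour[(i + 1) % n]])
               for i in range(n))


def ranked_children(points, partial):
    """Hypothetical stand-in for a learned policy: nearer unvisited nodes first."""
    last = points[partial[-1]]
    unvisited = [j for j in range(len(points)) if j not in partial]
    return sorted(unvisited, key=lambda j: math.dist(last, points[j]))


def greedy_rollout(points, partial):
    """Complete a partial tour by always following the top-ranked child."""
    tour = list(partial)
    while len(tour) < len(points):
        tour.append(ranked_children(points, tour)[0])
    return tour


def sgbs(points, beta, gamma):
    """Beam search of width beta whose pruning is guided by rollouts.

    Each beam node is expanded into its gamma top-ranked children; every
    child is scored by a greedy rollout; the beta best children survive.
    """
    beam = [[0]]                                   # all tours start at node 0
    best = greedy_rollout(points, beam[0])
    best_len = tour_length(points, best)
    while len(beam[0]) < len(points):
        scored = []
        for node in beam:
            for child in ranked_children(points, node)[:gamma]:  # expansion
                candidate = node + [child]
                rollout = greedy_rollout(points, candidate)      # simulation
                length = tour_length(points, rollout)
                scored.append((length, candidate))
                if length < best_len:
                    best, best_len = rollout, length
        scored.sort(key=lambda item: item[0])
        beam = [cand for _, cand in scored[:beta]]               # pruning
    return best, best_len


if __name__ == "__main__":
    pts = [(0, 0), (3, 1), (1, 4), (5, 5), (2, 2), (6, 0), (4, 3)]
    tour, length = sgbs(pts, beta=3, gamma=2)
    print(tour, round(length, 3))
```

Because the initial greedy rollout is kept as the incumbent, the returned tour in this sketch is never longer than the plain greedy tour; larger `beta` and `gamma` trade runtime for a wider search.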

name change, neural combinatorial optimization, simulation-guided beam search, (3 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)

Fed-GraB: Federated Long-tailed Learning with Self-Adjusting Gradient Balancer

Neural Information Processing Systems, Oct-9-2025, 11:54:23 GMT

For instance, patients' diagnosis varies substantially across medical centers but collaboratively form long-tailed distributions for

artificial intelligence, fed-grab, machine learning, (12 more...)

Country:

Asia > Singapore (0.04)
Asia > China > Zhejiang Province (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.68)

Industry:

Information Technology > Security & Privacy (0.68)
Health & Medicine > Health Care Providers & Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Vision (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Minimal Variance Sampling in Stochastic Gradient Boosting

Bulat Ibragimov, Gleb Gusev

Neural Information Processing Systems, Oct-2-2025, 19:38:49 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (18 more...)

Country: Europe > Russia (0.14)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.51)

Appendix A Performance on real-world based instances

Neural Information Processing Systems, Aug-14-2025, 06:37:44 GMT

We further evaluate SGBS+EAS on nine real-world based instance sets from [15]. Each instance set consists of 20 instances that have similar characteristics (i.e., they have been sampled from the same underlying distribution). The instance sets differ significantly in terms of several structural properties, for example, the number of customers n and their position (e.g., clustered vs. random positions). A more detailed description of instance sets can be found in [15]. One major advantage of neural combinatorial optimization approaches over traditional handcrafted optimization methods is their ability to quickly learn customized heuristics for new problem settings.

algorithm, candidate solution, search method, (17 more...)

Country: North America > United States (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.92)

39b9b60f0d149eabd1fff2d7c7d5afc4-Paper-Conference.pdf

Neural Information Processing Systems, Aug-14-2025, 06:37:41 GMT

neural network, node, sgb, (14 more...)

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Europe > Germany (0.04)

Genre: Research Report (0.93)

Industry:

Information Technology (0.46)
Transportation (0.31)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Simulation-guided Beam Search for Neural Combinatorial Optimization

Neural Information Processing Systems, Oct-10-2024, 16:40:03 GMT

Neural approaches for combinatorial optimization (CO) equip a learning mechanism to discover powerful heuristics for solving complex real-world problems. While neural approaches capable of high-quality solutions in a single shot are emerging, state-of-the-art approaches are often unable to take full advantage of the solving time available to them. In contrast, hand-crafted heuristics perform highly effective search and exploit the computation time given to them, but contain heuristics that are difficult to adapt to a dataset being solved. With the goal of providing a powerful search procedure to neural CO approaches, we propose simulation-guided beam search (SGBS), which examines candidate solutions within a fixed-width tree search that both a neural net-learned policy and a simulation (rollout) identify as promising. We further hybridize SGBS with efficient active search (EAS), where SGBS enhances the quality of solutions backpropagated in EAS, and EAS improves the quality of the policy used in SGBS.

neural combinatorial optimization, sgb, simulation-guided beam search

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
