AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Phan, Buu, Khisti, Ashish

Channel Simulation and Distributed Compression with Ensemble Rejection Sampling

arXiv.org Artificial IntelligenceOct-8-2025

We study channel simulation and distributed matching, two fundamental problems with several applications to machine learning, using a recently introduced generalization of the standard rejection sampling (RS) algorithm known as Ensemble Rejection Sampling (ERS). For channel simulation, we propose a new coding scheme based on ERS that achieves a near-optimal coding rate. In this process, we demonstrate that standard RS can also achieve a near-optimal coding rate and generalize the result of Braverman and Garg (2014) to the continuous alphabet setting. Next, as our main contribution, we present a distributed matching lemma for ERS, which serves as the rejection sampling counterpart to the Poisson Matching Lemma (PML) introduced by Li and Anantharam (2021). Our result also generalizes a recent work on importance matching lemma (Phan et al, 2024) and, to our knowledge, is the first result on distributed matching in the family of rejection sampling schemes where the matching probability is close to PML. We demonstrate the practical significance of our approach over prior works by applying it to distributed compression. The effectiveness of our proposed scheme is validated through experiments involving synthetic Gaussian sources and distributed image compression using the MNIST dataset.

artificial intelligence, machine learning, probability, (16 more...)

2510.05552

Country:

North America > Canada > Ontario (0.27)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.27)

Genre: Research Report > New Finding (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Parys, Paweł, Vaidya, Sairam, Berg-Kirkpatrick, Taylor, D'Antoni, Loris

Constrained Adaptive Rejection Sampling

arXiv.org Artificial IntelligenceOct-3-2025

Language Models (LMs) are increasingly used in applications where generated outputs must satisfy strict semantic or syntactic constraints. Existing approaches to constrained generation fall along a spectrum: greedy constrained decoding methods enforce validity during decoding but distort the LM's distribution, while rejection sampling (RS) preserves fidelity but wastes computation by discarding invalid outputs. Both extremes are problematic in domains such as program fuzzing, where both validity and diversity of samples are essential. We present Constrained Adaptive Rejection Sampling (CARS), an approach that strictly improves the sample-efficiency of RS without distributional distortion. CARS begins with unconstrained LM sampling and adaptively rules out constraint-violating continuations by recording them in a trie and subtracting their probability mass from future draws. This adaptive pruning ensures that prefixes proven invalid are never revisited, acceptance rates improve monotonically, and the resulting samples exactly follow the constrained distribution. In experiments on a variety of domains -- e.g., program fuzzing and molecular generation -- CARS consistently achieves higher efficiency -- measured in the number of LM forward passes per valid sample -- while also producing stronger sample diversity than both GCD and methods that approximate the LM's distribution.

large language model, machine learning, natural language, (17 more...)

2510.01902

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Industry: Materials > Chemicals > Commodity Chemicals (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

arXiv.org Artificial IntelligenceJun-12-2025

Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling

Xiao, Tim Z., Zenn, Johannes, Liu, Zhen, Liu, Weiyang, Bamler, Robert, Schölkopf, Bernhard

Large language models (LLMs) can often accurately describe probability distributions using natural language, yet they still struggle to generate faithful samples from them. This mismatch limits their use in tasks requiring reliable stochasticity, such as Monte Carlo methods, agent-based simulations, and randomized decision-making. We investigate this gap between knowledge and sampling in the context of Bernoulli distributions. We introduce Verbalized Rejection Sampling (VRS), a natural-language adaptation of classical rejection sampling that prompts the LLM to reason about and accept or reject proposed samples. Despite relying on the same Bernoulli mechanism internally, VRS substantially reduces sampling bias across models. We provide theoretical analysis showing that, under mild assumptions, VRS improves over direct sampling, with gains attributable to both the algorithm and prompt design. More broadly, our results show how classical probabilistic tools can be verbalized and embedded into LLM workflows to improve reliability, without requiring access to model internals or heavy prompt engineering.

large language model, machine learning, natural language, (18 more...)

2506.09998

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

arXiv.org Artificial IntelligenceFeb-17-2025

FastMCTS: A Simple Sampling Strategy for Data Synthesis

Li, Peiji, Lv, Kai, Shao, Yunfan, Ma, Yichuan, Li, Linyang, Zheng, Xiaoqing, Qiu, Xipeng, Guo, Qipeng

Synthetic high-quality multi-step reasoning data can significantly enhance the performance of large language models on various tasks. However, most existing methods rely on rejection sampling, which generates trajectories independently and suffers from inefficiency and imbalanced sampling across problems of varying difficulty. In this work, we introduce FastMCTS, an innovative data synthesis strategy inspired by Monte Carlo Tree Search. FastMCTS provides a more efficient sampling method for multi-step reasoning data, offering step-level evaluation signals and promoting balanced sampling across problems of different difficulty levels. Experiments on both English and Chinese reasoning datasets demonstrate that FastMCTS generates over 30\% more correct reasoning paths compared to rejection sampling as the number of generated tokens scales up. Furthermore, under comparable synthetic data budgets, models trained on FastMCTS-generated data outperform those trained on rejection sampling data by 3.9\% across multiple benchmarks. As a lightweight sampling strategy, FastMCTS offers a practical and efficient alternative for synthesizing high-quality reasoning data. Our code will be released soon.

large language model, machine learning, node, (20 more...)

2502.11476

Country:

Asia (0.68)
North America > United States (0.28)
Europe > Austria > Vienna (0.15)

Genre: Research Report (0.82)

Industry: Education > Educational Setting (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Neural Information Processing SystemsFeb-5-2025, 04:05:00 GMT

Review for NeurIPS paper: Fast and Accurate k -means++ via Rejection Sampling

Additional Feedback: Overall: Why only 3 trees are sufficient for Lemma 3.1? Three looks like a magic number after reading the paper. L90-92 you explain the known results that a single tree metric does not suffice, but why three trees? What are the space requirements of the proposed algorithm? L36-41: In your main contribution, you should *not* Use \tilde{O} without defining explicitly the hidden terms.

algorithm, failure probability, rejection sampling, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.40)

Neural Information Processing SystemsFeb-5-2025, 04:04:53 GMT

Review for NeurIPS paper: Fast and Accurate k -means++ via Rejection Sampling

The paper presents a new algorithm for speeding up k-means algorithms with rigorous theoretical guarantees. It is quite surprising that they can improve the running time to \tilde{O}(nd n {1 \eps}) when even one round of k-means algorithm takes O(ndk) time. The main shortcoming is the performance gain is only visible for large k. However, I think the large k regime is very interesting and does appear in practice. The authors should add discussion about aspect ratio and the new experiments as pointed out by them in the rebuttal.

algorithm, neurips paper, rejection sampling

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Neural Information Processing SystemsOct-11-2024, 05:26:46 GMT

Fast and Accurate k -means++ via Rejection Sampling

Despite its wide adoption, k -means sometimes suffers from being slow on large data-sets so a natural question has been to obtain more efficient algorithms with similar guarantees. Interestingly our algorithm obtains the same theoretical guarantees as k -means and significantly improves earlier results on fast k -means seeding. Moreover, we show empirically that our algorithm is significantly faster than k -means and obtains solutions of equivalent quality.

algorithm, rejection sampling

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Neural Information Processing SystemsOct-11-2024, 00:06:22 GMT

TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets

neural architecture search, resource constraint, tabnas, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)

arXiv.org Artificial IntelligenceFeb-15-2024

RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models

Khaki, Saeed, Li, JinJin, Ma, Lan, Yang, Liu, Ramachandra, Prathap

Reinforcement learning from human feedback (RLHF) has been extensively employed to align large language models with user intent. However, proximal policy optimization (PPO) based RLHF is occasionally unstable requiring significant hyperparameter finetuning, and computationally expensive to maximize the estimated reward during alignment. Recently, direct preference optimization (DPO) is proposed to address those challenges. However, DPO relies on contrastive responses generated from human annotator and alternative LLM, instead of the policy model, limiting the effectiveness of the RLHF. In this paper, we addresses both challenges by systematically combining rejection sampling (RS) and DPO. Our proposed method, RS-DPO, initiates with the development of a supervised fine-tuned policy model (SFT). A varied set of k responses per prompt are sampled directly from the SFT model. RS-DPO identifies pairs of contrastive samples based on their reward distribution. Finally, we apply DPO with the contrastive samples to align the model to human preference. Our experiments indicate that our proposed method effectively fine-tunes LLMs with limited resource environments, leading to improved alignment with user intent. Furthermore, it outperforms existing methods, including RS, PPO, and DPO.

dataset, proposed method pythia-6, reward model, (14 more...)

2402.10038

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > Pennsylvania (0.04)
North America > United States > Maryland > Prince George's County > College Park (0.04)

Genre: Research Report (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.93)
Food & Agriculture (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)