AITopics | bart

Semi-Parametric Bayesian Additive Regression Trees for Risk Prediction with High-Dimensional Epigenetic Signatures and Low-Dimensional Covariates

Bhandari, Saurabh, Bhatti, Parveen, Chiu, Brian C. -H., Ji, Yuan

arXiv.org Machine LearningMay-25-2026

In the era of precision medicine, genome-wide epigenetic modifications offer rich data that could inform risk prediction. However, these data are high-dimensional and exhibit complex dependence structures, which makes it difficult to jointly model them with low-dimensional covariates when the goal is to obtain interpretable effect estimates for covariate adjustment. Standard Bayesian additive regression trees (BART) provide strong predictive performance but treat all predictors uniformly within the tree ensemble, obscuring the contributions of significant covariates and complicating variable selection in high-dimensional settings. We propose a semi-parametric BART model (spBART) that addresses this limitation by modeling low-dimensional covariates through a parametric component with interpretable coefficients, while capturing complex nonlinear associations among high-dimensional predictors through the tree ensemble. To perform stable variable selection, we develop a cross-validation-based procedure that aggregates posterior inclusion probabilities across folds and applies Bayesian false discovery rate control. We apply the proposed method to a pooled case--control analysis of high-dimensional genome-wide 5-hydroxymethylcytosine profiles derived from circulating cell-free DNA in two multiple myeloma studies ($N = 869$). The approach identifies a parsimonious set of candidate loci and achieves strong out-of-sample discrimination (AUC $= 0.96$) in a held-out validation set. Overall, spBART provides a unified framework for combining interpretable covariate inference with flexible modeling and variable selection in high-dimensional biomedical studies.

artificial intelligence, machine learning, semi-parametric bart, (17 more...)

arXiv.org Machine Learning

2605.20143

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Epidemiology (1.00)
Health & Medicine > Therapeutic Area > Hematology (0.89)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Add feedback

d60e14c19cd6e0fc38556ad29ac8fbc9-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-29-2026, 22:17:37 GMT

artificial intelligence, computation time, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)

Add feedback

00b76fddeaaa7d8c2c43d504b2babd8a-Supplemental.pdf

Neural Information Processing SystemsApr-24-2026, 09:48:55 GMT

artificial intelligence, bast, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.16)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

00b76fddeaaa7d8c2c43d504b2babd8a-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 09:48:51 GMT

artificial intelligence, machine learning, partition, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.14)

Technology:

Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

d60e14c19cd6e0fc38556ad29ac8fbc9-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 08:12:14 GMT

artificial intelligence, computation time, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)

Add feedback

8deff86036ee6a73cda883dc7d2d6b07-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 13:39:01 GMT

logic & formal reasoning, machine learning, prediction, (20 more...)

Neural Information Processing Systems

Country:

Asia > Vietnam > Hanoi > Hanoi (0.05)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
(2 more...)

Add feedback

8eff4196f50c43eda7bcf0f0cf87a0d0-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 20:28:59 GMT

artificial intelligence, bayesian inference, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

A Appendix

Neural Information Processing SystemsFeb-11-2026, 15:36:15 GMT

A.1 Summary of Commonly Used Metrics for T ext Generation Table 1: Summary of commonly used metrics for text generation. For settings and tasks, we only list the ones justified by the original paper for each metric. We conduct experiments on WMT19, and the results are shown in Tab. 2. We don't observe A.3 Prompt Set In Tab. 3, we list the full prompt set for both s h direction and h r direction. Prompt Set s h Last Tersely Succinctly In summation To put it succinctly After In brief All in all To summarize Bringing up the rear Behind In short In outline In a nutshell To come to the point Lastly Concisely In closing In conclusion In the final analysis In sum In precis In passing In winding up Without wasting words To end In a word To conclude Last in order At the end of the day Curtly Compactly Summarising In a few words Without waste of words Crisply Summarily In the rear As a final point Finally yet importantly At last To sum up Summarizing Not least of all To put it in a nutshell Pithily Basically Laconically To put it briefly When all is said and done Shortly In the end At the rear Not to mince words To cut a long story short In fine At the end To be brief Last but not least Not to beat about the bush Finally In essence Last of all Just as importantly In drawing things to a close Briefly Ultimately Elliptically To put it concisely Not to put too fine a point on ith r As To wit As it were Case in point As an illustration sc. That is Especially That is to say To give an example i.e.

artificial intelligence, metric, natural language, (9 more...)

Neural Information Processing Systems

Country: Europe > Denmark > Capital Region > Copenhagen (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.30)

Add feedback

BFTS: Thompson Sampling with Bayesian Additive Regression Trees

Deng, Ruizhe, Chakraborty, Bibhas, Chen, Ran, Tan, Yan Shuo

arXiv.org Machine LearningFeb-10-2026

Contextual bandits are a core technology for personalized mobile health interventions, where decision-making requires adapting to complex, non-linear user behaviors. While Thompson Sampling (TS) is a preferred strategy for these problems, its performance hinges on the quality of the underlying reward model. Standard linear models suffer from high bias, while neural network approaches are often brittle and difficult to tune in online settings. Conversely, tree ensembles dominate tabular data prediction but typically rely on heuristic uncertainty quantification, lacking a principled probabilistic basis for TS. We propose Bayesian Forest Thompson Sampling (BFTS), the first contextual bandit algorithm to integrate Bayesian Additive Regression Trees (BART), a fully probabilistic sum-of-trees model, directly into the exploration loop. We prove that BFTS is theoretically sound, deriving an information-theoretic Bayesian regret bound of $\tilde{O}(\sqrt{T})$. As a complementary result, we establish frequentist minimax optimality for a "feel-good" variant, confirming the structural suitability of BART priors for non-parametric bandits. Empirically, BFTS achieves state-of-the-art regret on tabular benchmarks with near-nominal uncertainty calibration. Furthermore, in an offline policy evaluation on the Drink Less micro-randomized trial, BFTS improves engagement rates by over 30% compared to the deployed policy, demonstrating its practical effectiveness for behavioral interventions.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Machine Learning

2602.07767

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > North Carolina > Wake County > Raleigh (0.04)
(6 more...)

Genre: Research Report > Experimental Study (0.86)

Industry: Health & Medicine (0.92)

Add feedback

NeurIPS Rebuttal for " Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks "

Neural Information Processing SystemsFeb-8-2026, 18:52:44 GMT

NeurIPS Rebuttal for "Retrieval-Augmented Generation for Knowledge-Intensive NLP T asks" We thank reviewers for their thoughtful, detailed reviews. "information retrieval strategy to improve the the generation Pre-trained seq2seq models have only become available in the last year (T5, BART) or two (GPT2). We study two RAG models. RAG-Sequence's formulation is similar to REALM, but RAG-Token is novel and Further, we explore novel decoding strategies for these models. "contribution [...] is not very specific, since R1 suggested that "A figure or example about P AG-Sequence Model and P AG-Token Model is needed", and R3 mentions "description of the model is quite concise (due to space restrictions)".

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Filters

Collaborating Authors

bart

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Semi-Parametric Bayesian Additive Regression Trees for Risk Prediction with High-Dimensional Epigenetic Signatures and Low-Dimensional Covariates

d60e14c19cd6e0fc38556ad29ac8fbc9-Supplemental-Conference.pdf

00b76fddeaaa7d8c2c43d504b2babd8a-Supplemental.pdf

00b76fddeaaa7d8c2c43d504b2babd8a-Paper.pdf

d60e14c19cd6e0fc38556ad29ac8fbc9-Supplemental-Conference.pdf

8deff86036ee6a73cda883dc7d2d6b07-Paper-Conference.pdf

8eff4196f50c43eda7bcf0f0cf87a0d0-Paper-Conference.pdf

A Appendix

BFTS: Thompson Sampling with Bayesian Additive Regression Trees

NeurIPS Rebuttal for " Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks "