AITopics | dlogt

Collaborating Authors

dlogt

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ContextualRecommendationsandLow-Regret Cutting-PlaneAlgorithms AnonymousAuthor(s) Affiliation Address email

Neural Information Processing SystemsFeb-10-2026, 23:01:43 GMT

We wish to2 learn a hiddend-dimensional valuew . Every round, we are presented with a3 subset Xt Rd of possible actions.

algorithm, artificial intelligence, recommendation, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.93)

Add feedback

ContextualRecommendationsandLow-Regret Cutting-PlaneAlgorithms

Neural Information Processing SystemsFeb-10-2026, 23:01:39 GMT

We wish to learn a hiddend-dimensional valuew . Every round, we are presented with a subset Xt Rd of possible actions.

algorithm, artificial intelligence, recommendation, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.69)

Add feedback

Provable Acceleration for Diffusion Models under Minimal Assumptions

Li, Gen, Cai, Changxiao

arXiv.org Machine LearningNov-3-2024

While score-based diffusion models have achieved exceptional sampling quality, their sampling speeds are often limited by the high computational burden of score function evaluations. Despite the recent remarkable empirical advances in speeding up the score-based samplers, theoretical understanding of acceleration techniques remains largely limited. To bridge this gap, we propose a novel training-free acceleration scheme for stochastic samplers. Under minimal assumptions -- namely, $L^2$-accurate score estimates and a finite second-moment condition on the target distribution -- our accelerated sampler provably achieves $\varepsilon$-accuracy in total variation within $\widetilde{O}(d^{5/4}/\sqrt{\varepsilon})$ iterations, thereby significantly improving upon the $\widetilde{O}(d/\varepsilon)$ iteration complexity of standard score-based samplers. Notably, our convergence theory does not rely on restrictive assumptions on the target distribution or higher-order score estimation guarantees.

arxiv preprint arxiv, diffusion model, dlogt, (13 more...)

arXiv.org Machine Learning

2410.23285

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)

Genre:

Research Report (0.49)
Workflow (0.46)
Overview (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

$O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions

Li, Gen, Yan, Yuling

arXiv.org Machine LearningSep-27-2024

Score-based diffusion models, which generate new data by learning to reverse a diffusion process that perturbs data from the target distribution into noise, have achieved remarkable success across various generative tasks. Despite their superior empirical performance, existing theoretical guarantees are often constrained by stringent assumptions or suboptimal convergence rates. In this paper, we establish a fast convergence theory for a popular SDE-based sampler under minimal assumptions. Our analysis shows that, provided $\ell_{2}$-accurate estimates of the score functions, the total variation distance between the target and generated distributions is upper bounded by $O(d/T)$ (ignoring logarithmic factors), where $d$ is the data dimensionality and $T$ is the number of steps. This result holds for any target distribution with finite first-order moment. To our knowledge, this improves upon existing convergence theory for both the SDE-based sampler and another ODE-based sampler, while imposing minimal assumptions on the target data distribution and score estimates. This is achieved through a novel set of analytical tools that provides a fine-grained characterization of how the error propagates at each step of the reverse process.

diffusion model, dlogt, sampler, (14 more...)

arXiv.org Machine Learning

2409.18959

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
Asia > China > Hong Kong (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:

Workflow (0.86)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.40)

Add feedback

Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models

Li, Gen, Wei, Yuting, Chen, Yuxin, Chi, Yuejie

arXiv.org Machine LearningOct-1-2023

Diffusion models, which convert noise into new data instances by learning to reverse a Markov diffusion process, have become a cornerstone in contemporary generative modeling. While their practical power has now been widely recognized, the theoretical underpinnings remain far from mature. In this work, we develop a suite of non-asymptotic theory towards understanding the data generation process of diffusion models in discrete time, assuming access to $\ell_2$-accurate estimates of the (Stein) score functions. For a popular deterministic sampler (based on the probability flow ODE), we establish a convergence rate proportional to $1/T$ (with $T$ the total number of steps), improving upon past results; for another mainstream stochastic sampler (i.e., a type of the denoising diffusion probabilistic model), we derive a convergence rate proportional to $1/\sqrt{T}$, matching the state-of-the-art theory. Imposing only minimal assumptions on the target data distribution (e.g., no smoothness assumption is imposed), our results characterize how $\ell_2$ score estimation errors affect the quality of the data generation processes. In contrast to prior works, our theory is developed based on an elementary yet versatile non-asymptotic approach without resorting to toolboxes for SDEs and ODEs. Further, we design two accelerated variants, improving the convergence to $1/T^2$ for the ODE-based sampler and $1/T$ for the DDPM-type sampler, which might be of independent theoretical and empirical interest.

artificial intelligence, dlogt, machine learning, (18 more...)

arXiv.org Machine Learning

2306.09251

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Asia > China > Hong Kong (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(2 more...)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Add feedback