Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent
Balasubramanian, Krishnakumar, Banerjee, Sayan, Ghosal, Promit
We provide finite-particle convergence rates for the Stein Variational Gradient Descent (SVGD) algorithm in the Kernel Stein Discrepancy ($\mathsf{KSD}$) and Wasserstein-2 metrics. Our key insight is the observation that the time derivative of the relative entropy between the joint density of $N$ particle locations and the $N$-fold product target measure, starting from a regular initial distribution, splits into a dominant `negative part' proportional to $N$ times the expected $\mathsf{KSD}^2$ and a smaller `positive part'. This observation leads to $\mathsf{KSD}$ rates of order $1/\sqrt{N}$, providing a near optimal double exponential improvement over the recent result by~\cite{shi2024finite}. Under mild assumptions on the kernel and potential, these bounds also grow linearly in the dimension $d$. By adding a bilinear component to the kernel, the above approach is used to further obtain Wasserstein-2 convergence. For the case of `bilinear + Mat\'ern' kernels, we derive Wasserstein-2 rates that exhibit a curse-of-dimensionality similar to the i.i.d. setting. We also obtain marginal convergence and long-time propagation of chaos results for the time-averaged particle laws.
Sep-12-2024
- Country:
- Asia
- Japan > Honshū
- Kantō > Kanagawa Prefecture (0.05)
- Middle East > Jordan (0.04)
- Japan > Honshū
- Europe > Switzerland
- North America > United States
- California > Yolo County
- Davis (0.04)
- Illinois > Cook County
- Chicago (0.04)
- North Carolina > Orange County
- Chapel Hill (0.04)
- California > Yolo County
- Asia
- Genre:
- Research Report (0.40)
- Technology: