Goto

Collaborating Authors

 svm result and time consumption


Scalable Sobolev IPM for Probability Measures on a Graph

arXiv.org Machine Learning

We investigate the Sobolev IPM problem for probability measures supported on a graph metric space. Sobolev IPM is an important instance of integral probability metrics (IPM), and is obtained by constraining a critic function within a unit ball defined by the Sobolev norm. In particular, it has been used to compare probability measures and is crucial for several theoretical works in machine learning. However, to our knowledge, there are no efficient algorithmic approaches to compute Sobolev IPM effectively, which hinders its practical applications. In this work, we establish a relation between Sobolev norm and weighted $L^p$-norm, and leverage it to propose a \emph{novel regularization} for Sobolev IPM. By exploiting the graph structure, we demonstrate that the regularized Sobolev IPM provides a \emph{closed-form} expression for fast computation. This advancement addresses long-standing computational challenges, and paves the way to apply Sobolev IPM for practical applications, even in large-scale settings. Additionally, the regularized Sobolev IPM is negative definite. Utilizing this property, we design positive-definite kernels upon the regularized Sobolev IPM, and provide preliminary evidences of their advantages on document classification and topological data analysis for measures on a graph.


Scalable Unbalanced Sobolev Transport for Measures on a Graph

arXiv.org Artificial Intelligence

Optimal transport (OT) is a popular and powerful tool for comparing probability measures. However, OT suffers a few drawbacks: (i) input measures required to have the same mass, (ii) a high computational complexity, and (iii) indefiniteness which limits its applications on kernel-dependent algorithmic approaches. To tackle issues (ii)--(iii), Le et al. (2022) recently proposed Sobolev transport for measures on a graph having the same total mass by leveraging the graph structure over supports. In this work, we consider measures that may have different total mass and are supported on a graph metric space. To alleviate the disadvantages (i)--(iii) of OT, we propose a novel and scalable approach to extend Sobolev transport for this unbalanced setting where measures may have different total mass. We show that the proposed unbalanced Sobolev transport (UST) admits a closed-form formula for fast computation, and it is also negative definite. Additionally, we derive geometric structures for the UST and establish relations between our UST and other transport distances. We further exploit the negative definiteness to design positive definite kernels and evaluate them on various simulations to illustrate their fast computation and comparable performances against other transport baselines for unbalanced measures on a graph.


Sobolev Transport: A Scalable Metric for Probability Measures with Graph Metrics

arXiv.org Machine Learning

However, evaluating the OT incurs a high computational complexity in Optimal transport (OT) is a popular measure general (Peyré and Cuturi, 2019) which leads to several to compare probability distributions. However, proposals in the recent literature to address this drawback OT suffers a few drawbacks such as (i) of OT, e.g., approximate using entropic regularization a high complexity for computation, (ii) indefiniteness (Cuturi, 2013), or exploit geometric structure which limits its applicability to of supports (Rabin et al., 2011; Le et al., 2019; Le and kernel machines. In this work, we consider Nguyen, 2021). Among them, tree-Wasserstein (Evans probability measures supported on a graph and Matsen, 2012; Le et al., 2019) (TW) leverages the metric space and propose a novel Sobolev tree structure over supports to obtain a closed-form transport metric. We show that the Sobolev for fast computation. However, the requirement about transport metric yields a closed-form formula tree structure for supports may be restricted in applications.


Entropy Partial Transport with Tree Metrics: Theory and Practice

arXiv.org Machine Learning

Optimal transport (OT) theory provides powerful tools to compare probability measures. However, OT is limited to nonnegative measures having the same mass, and suffers serious drawbacks about its computation and statistics. This leads to several proposals of regularized variants of OT in the recent literature. In this work, we consider an \textit{entropy partial transport} (EPT) problem for nonnegative measures on a tree having different masses. The EPT is shown to be equivalent to a standard complete OT problem on a one-node extended tree. We derive its dual formulation, then leverage this to propose a novel regularization for EPT which admits fast computation and negative definiteness. To our knowledge, the proposed regularized EPT is the first approach that yields a \textit{closed-form} solution among available variants of unbalanced OT. For practical applications without priori knowledge about the tree structure for measures, we propose tree-sliced variants of the regularized EPT, computed by averaging the regularized EPT between these measures using random tree metrics, built adaptively from support data points. Exploiting the negative definiteness of our regularized EPT, we introduce a positive definite kernel, and evaluate it against other baselines on benchmark tasks such as document classification with word embedding and topological data analysis. In addition, we empirically demonstrate that our regularization also provides effective approximations.