Kernel Choice and Classifiability for RKHS Embeddings of Probability Distributions
Fukumizu, Kenji, Gretton, Arthur, Lanckriet, Gert R., Schölkopf, Bernhard, Sriperumbudur, Bharath K.
–Neural Information Processing Systems
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a straightforward and practical means of representing and comparing probabilities.In particular, the distance between embeddings (the maximum mean discrepancy, or MMD) has several key advantages over many classical metrics on distributions, namely easy computability, fast convergence and low bias of finite sample estimates. An important requirement of the embedding RKHS is that it be characteristic: in this case, the MMD between two distributions is zero if and only if the distributions coincide. Three new results on the MMD are introduced inthe present study. First, it is established that MMD corresponds to the optimal risk of a kernel classifier, thus forming a natural link between the distance between distributions and their ease of classification. An important consequence is that a kernel must be characteristic to guarantee classifiability between distributions inthe RKHS.
Neural Information Processing Systems
Dec-31-2009
- Country:
- North America > United States (1.00)
- Europe (0.68)
- Genre:
- Research Report > New Finding (0.66)
- Technology: