comparison theorem
Neural Expectation Operators
This paper introduces \textbf{Measure Learning}, a paradigm for modeling ambiguity via non-linear expectations. We define Neural Expectation Operators as solutions to Backward Stochastic Differential Equations (BSDEs) whose drivers are parameterized by neural networks. The main mathematical contribution is a rigorous well-posedness theorem for BSDEs whose drivers satisfy a local Lipschitz condition in the state variable $y$ and quadratic growth in its martingale component $z$. This result circumvents the classical global Lipschitz assumption, is applicable to common neural network architectures (e.g., with ReLU activations), and holds for exponentially integrable terminal data, which is the sharp condition for this setting. Our primary innovation is to build a constructive bridge between the abstract, and often restrictive, assumptions of the deep theory of quadratic BSDEs and the world of machine learning, demonstrating that these conditions can be met by concrete, verifiable neural network designs. We provide constructive methods for enforcing key axiomatic properties, such as convexity, by architectural design. The theory is extended to the analysis of fully coupled Forward-Backward SDE systems and to the asymptotic analysis of large interacting particle systems, for which we establish both a Law of Large Numbers (propagation of chaos) and a Central Limit Theorem. This work provides the foundational mathematical framework for data-driven modeling under ambiguity.
Comparison theorems on large-margin learning
Classification is a very important research topic in statist ical machine learning. There are a large amount of literature on various classification methods, ran ging from the very classical distribution-based likelihood approaches such as Fisher linear discriminant analysis (LDA) and logistic regression [3], to the margin-based approaches such as the well-known s upport vector machine (SVM) [1, 2]. Each type of classifiers has their own merits. Recently, Liu a nd his coauthors proposed in [4] the so-called large-margin unified machines (LUMs) which es tablish a unique transition between these two types of classifiers. As noted in [5], SVM may suffer fr om data piling problems in the high-dimension low-sample size (HDLSS) settings, that is, the support vectors will pile up on top of each other at the margin boundaries when projected onto th e normal vector of the separating hyperplane.
Efficient Localized Inference for Large Graphical Models
Chen, Jinglin, Peng, Jian, Liu, Qiang
We propose a new localized inference algorithm for answering marginalization queries in large graphical models with the correlation decay property. Given a query variable and a large graphical model, we define a much smaller model in a local region around the query variable in the target model so that the marginal distribution of the query variable can be accurately approximated. We introduce two approximation error bounds based on the Dobrushin's comparison theorem and apply our bounds to derive a greedy expansion algorithm that efficiently guides the selection of neighbor nodes for localized inference. We verify our theoretical bounds on various datasets and demonstrate that our localized inference algorithm can provide fast and accurate approximation for large graphical models.