Goto

Collaborating Authors

 smin


Supplementary to " Approximation with CNNs in Sobolev Space: with Applications to Classification "

Neural Information Processing Systems

In the Supplementary materials, we include detailed descriptions on convex surrogate losses,convolutional neural networks, non-asymptotic error bounds for commonly used loss functions, and prove Theorems 2.1,2.2, A toy example on the numerical performance of CNN approximation is presented in Appendix D. We next give a brief review of the convex surrogate loss functions and discuss in details on the connection between the excess risk with respect to the ฯ•-loss and that of 0-1 loss [28, 4]. Let ฯ•be a given convex univariate function ฯ•: R [0,). Instead of minimizing the excess risk R over H, we consider minimizing the risk with respect to the loss ฯ•(ฯ•-risk) R(f):= E{ฯ•(Yf(X))} over a certain class of functions F, where ฯ•: R [0,) is some generic loss function. For the special case when H = {h: h(x) = sign(f(x)),f F} and ฯ•() is a step function, i.e., ฯ•(x) = 1 Guohao Shen and Yuling Jiao contributed equally to this work Corresponding authors 36th Conference on Neural Information Processing Systems (NeurIPS 2022). As shown in [28] and [4], for a properly chosen ฯ•, ห†fn can indeed help reduce the 0-1 excess risk R (ห†hn) R (h0). More precisely, let R0:= inff measurable R(f), then for a proper ฯ•, we have ฯˆ(R (ห†hn) R (h0)) R(ห†fn) R(f0), where ฯˆ: [ 1,1] [0,)is a nonnegative continuous function, invertible on [0,1], and achieves its minimum at 0 with ฯˆ(0) = 0. A wide variety of popular classification methods are based on this tactic.


ddd808772c035aed516d42ad3559be5f-Supplemental.pdf

Neural Information Processing Systems

We study the problem of learning an optimal regression function subject to a fairness constraint. It requires that, conditionally on the sensitive feature, the distribution of the function output remains the same. This constraint naturally extends the notion of demographic parity, often used in classification, to the regression setting. We tackle this problem by leveraging on a proxy-discretized version, for which we derive an explicit expression of the optimal fair predictor. This result naturally suggests a two stage approach, in which we first estimate the (unconstrained) regression function from a set of labeled data and then we recalibrate it with another set of unlabeled data.



Are Cluster Validity Measures (In)valid?

arXiv.org Artificial Intelligence

Internal cluster validity measures (such as the Calinski-Harabasz, Dunn, or Davies-Bouldin indices) are frequently used for selecting the appropriate number of partitions a dataset should be split into. In this paper we consider what happens if we treat such indices as objective functions in unsupervised learning activities. Is the optimal grouping with regards to, say, the Silhouette index really meaningful? It turns out that many cluster (in)validity indices promote clusterings that match expert knowledge quite poorly. We also introduce a new, well-performing variant of the Dunn index that is built upon OWA operators and the near-neighbour graph so that subspaces of higher density, regardless of their shapes, can be separated from each other better.


Social Recommendation with Self-Supervised Metagraph Informax Network

arXiv.org Artificial Intelligence

In recent years, researchers attempt to utilize online social information to alleviate data sparsity for collaborative filtering, based on the rationale that social networks offers the insights to understand the behavioral patterns. However, due to the overlook of inter-dependent knowledge across items (e.g., categories of products), existing social recommender systems are insufficient to distill the heterogeneous collaborative signals from both user and item sides. In this work, we propose a Self-Supervised Metagraph Infor-max Network (SMIN) which investigates the potential of jointly incorporating social- and knowledge-aware relational structures into the user preference representation for recommendation. To model relation heterogeneity, we design a metapath-guided heterogeneous graph neural network to aggregate feature embeddings from different types of meta-relations across users and items, em-powering SMIN to maintain dedicated representations for multi-faceted user- and item-wise dependencies. Additionally, to inject high-order collaborative signals, we generalize the mutual information learning paradigm under the self-supervised graph-based collaborative filtering. This endows the expressive modeling of user-item interactive patterns, by exploring global-level collaborative relations and underlying isomorphic transformation property of graph topology. Experimental results on several real-world datasets demonstrate the effectiveness of our SMIN model over various state-of-the-art recommendation methods. We release our source code at https://github.com/SocialRecsys/SMIN.