Goto

Collaborating Authors

 nullz



A Discussion of the generative model 1

Neural Information Processing Systems

Thus, we verify that the random effects estimator is equivalent to the generative model (1). Specifically, if u(x) = 1 for all x X, we use ( X, P,ψ) for simplicity. Due to the separability of ψ, we have the following coreset definition. Definitions 2.2 and 2.3, the regression objectives of OLSE and GLSE can be decomposed into Thus, we can apply the above definition to define coresets for OLSE and GLSE. Now we are ready to describe the FL framework in the language of a query space. We first prove Theorem C.1 and propose the corresponding algorithm that constructs an Next, we prove Theorem C.2 and propose the corresponding algorithm that constructs an accurate Caratheodory's Theorem, there must exist at most To accelerate the running time, Jubran et al. [ By the Caratheodory's Theorem, there must exist at most In this section, we complete the proofs for GLSE.





Inexact Moreau Envelope Lagrangian Method for Non-Convex Constrained Optimization under Local Error Bound Conditions on Constraint Functions

Huang, Yankun, Lin, Qihang, Xu, Yangyang

arXiv.org Artificial Intelligence

In this paper, we study the inexact Moreau envelope Lagrangian (iMELa) method for solving smooth non-convex optimization problems over a simple polytope with additional convex inequality constraints. By incorporating a proximal term into the traditional Lagrangian function, the iMELa method approximately solves a convex optimization subproblem over the polyhedral set at each main iteration. Under the assumption of a local error bound condition for subsets of the feasible set defined by subsets of the constraints, we establish that the iMELa method can find an $\epsilon$-Karush-Kuhn-Tucker point with $\tilde O(\epsilon^{-2})$ gradient oracle complexity.


On the statistical complexity of quantum circuits

Bu, Kaifeng, Koh, Dax Enshan, Li, Lu, Luo, Qingxian, Zhang, Yaobo

arXiv.org Machine Learning

In theoretical machine learning, the statistical complexity is a notion that measures the richness of a hypothesis space. In this work, we apply a particular measure of statistical complexity, namely the Rademacher complexity, to the quantum circuit model in quantum computation and study how the statistical complexity depends on various quantum circuit parameters. In particular, we investigate the dependence of the statistical complexity on the resources, depth, width, and the number of input and output registers of a quantum circuit. To study how the statistical complexity scales with resources in the circuit, we introduce a resource measure of magic based on the $(p,q)$ group norm, which quantifies the amount of magic in the quantum channels associated with the circuit. These dependencies are investigated in the following two settings: (i) where the entire quantum circuit is treated as a single quantum channel, and (ii) where each layer of the quantum circuit is treated as a separate quantum channel. The bounds we obtain can be used to constrain the capacity of quantum neural networks in terms of their depths and widths as well as the resources in the network.


Motivic clustering schemes for directed graphs

Pinto, Guilherme Vituri F., Mémoli, Facundo

arXiv.org Machine Learning

Motivated by the concept of network motifs we construct certain clustering methods (functors) which are parametrized by a given collection of motifs (or representers).


Robust Aggregation for Federated Learning

Pillutla, Krishna, Kakade, Sham M., Harchaoui, Zaid

arXiv.org Machine Learning

We present a robust aggregation approach to make federated learning robust to settings when a fraction of the devices may be sending corrupted updates to the server. The proposed approach relies on a robust secure aggregation oracle based on the geometric median, which returns a robust aggregate using a constant number of calls to a regular non-robust secure average oracle. The robust aggregation oracle is privacy-preserving, similar to the secure average oracle it builds upon. We provide experimental results of the proposed approach with linear models and deep networks for two tasks in computer vision and natural language processing. The robust aggregation approach is agnostic to the level of corruption; it outperforms the classical aggregation approach in terms of robustness when the level of corruption is high, while being competitive in the regime of low corruption.


High dimensional regression for regenerative time-series: an application to road traffic modeling

Bouchouia, Mohammed, Portier, François

arXiv.org Machine Learning

This paper investigates statistical models for road traffic modeling. The proposed methodology considers road traffic as a (i) highdimensional time-series for which (ii) regeneration occurs at the end of each day. Since (ii), prediction is based on a daily modeling of the road traffic using a vector autoregressive model that combines linearly the past observations of the day. Considering (i), the learning algorithm follows from an l1-penalization of the regression coefficients. Excess risk bounds are established under the high-dimensional framework in which the number of road sections goes to infinity with the number of observed days. Considering floating car data observed in an urban area, the approach is compared to state-of-the-art methods including neural networks. In addition of being very competitive in terms of prediction, it enables to identify the most determinant sections of the road network.