Goto

Collaborating Authors

 Learning Management


Review for NeurIPS paper: Online learning with dynamics: A minimax perspective

Neural Information Processing Systems

Post-rebuttal: I am satisfied with the rebuttal. I am interested to know more about "One reason why such rates are common in online learning is the connection of the sequential Rademacher complexity with uniform convergence of martingale difference sequences in the corresponding Banach space (see [2] for details)." If the paper is accepted and space is allowed, I suggest to elaborate on this more thoroughly.


Review for NeurIPS paper: Online learning with dynamics: A minimax perspective

Neural Information Processing Systems

One of the main issues of this paper is clarity, however. We trust that you will work very hard to improve clarity for the final submission, as suggested in the reviews.


Optimizing Decentralized Online Learning for Supervised Regression and Classification Problems

arXiv.org Artificial Intelligence

Decentralized learning networks aim to synthesize a single network inference from a set of raw inferences provided by multiple participants. To determine the combined inference, these networks must adopt a mapping from historical participant performance to weights, and to appropriately incentivize contributions they must adopt a mapping from performance to fair rewards. Despite the increased prevalence of decentralized learning networks, there exists no systematic study that performs a calibration of the associated free parameters. Here we present an optimization framework for key parameters governing decentralized online learning in supervised regression and classification problems. These parameters include the slope of the mapping between historical performance and participant weight, the timeframe for performance evaluation, and the slope of the mapping between performance and rewards. These parameters are optimized using a suite of numerical experiments that mimic the design of the Allora Network, but have been extended to handle classification tasks in addition to regression tasks. This setup enables a comparative analysis of parameter tuning and network performance optimization (loss minimization) across both problem types. We demonstrate how the optimal performance-weight mapping, performance timeframe, and performance-reward mapping vary with network composition and problem type. Our findings provide valuable insights for the optimization of decentralized learning protocols, and we discuss how these results can be generalized to optimize any inference synthesis-based, decentralized AI network.


Review for NeurIPS paper: Temporal Variability in Implicit Online Learning

Neural Information Processing Systems

This paper considers the implicit update algorithm for online learning (a.k.a. It is shown that the algorithm achieves a regret bound that is adapted to the variability of the sequence of loss functions. This holds even without the smoothness of the loss. I believe this is a firm contribution to the fields of online learning and stochastic optimization. Firstly, Implicit updates are known to have practical advantages, but their theoretical understanding has been limited to the fact that they enjoy the same worst-case regret guarantees as their explicit counterparts. This is one of a very few works (if not the first one) which shows a nontrivial advantages of the implicit methods and thus makes a significant progress in better understanding of their behavior.


Review for NeurIPS paper: Metric-Free Individual Fairness in Online Learning

Neural Information Processing Systems

The paper concerns a new online learning problem subject to the constraint of individual fairness. It provides a framework that reduces online classification in the considered model to standard online classification, obtaining an algorithm with sublinear regret both in terms of accuracy and fairness, as well as strong generalization bounds in the i.i.d. All the reviewers liked the paper and the proposed metric-free approach. The appreciated an interesting problem formulation and a clean reduction technique to a known online learning problem. The paper received very high uniform scores of 8 from each reviewer. The reviewers found some issues with the presentation, and I hope the authors will address them in the final version of the manuscript.


Reviews: Private Learning Implies Online Learning: An Efficient Reduction

Neural Information Processing Systems

A theory paper showing that a reduction from online learning to differentially private PAC learning, e.g., a differentially private PAC learner for concept class H can be used in a black-box fashion to obtain a low regret bound for online learning with 0/1 loss for concept class H. The key contribution here is that this is a computationally efficient reduction, which resolves an open problem of Neel/Roth/Wu and improves on the information theoretic result of Feldman/Xiao. In addition to the strong result, the reviewers point out that the paper contains many interesting observations along the way (e.g., an innovative use of online boosting).


Generation of reusable learning objects from digital medical collections: An analysis based on the MASMDOA framework

arXiv.org Artificial Intelligence

Learning Objects represent a widespread approach to structuring instructional materials in a large variety of educational contexts. The main aim of this work consists of analyzing from a qualitative point of view the process of generating reusable learning objects (RLOs) followed by Clavy, a tool that can be used to retrieve data from multiple medical knowledge sources and reconfigure such sources in diverse multimedia-based structures and organizations. From these organizations, Clavy is able to generate learning objects which can be adapted to various instructional healthcare scenarios with several types of user profiles and distinct learning requirements. Moreover, Clavy provides the capability of exporting these learning objects through educational standard specifications, which improves their reusability features. The analysis insights highlight the importance of having a tool able to transfer knowledge from the available digital medical collections to learning objects that can be easily accessed by medical students and healthcare practitioners through the most popular e-learning platforms.


Reviews: Optimal Stochastic and Online Learning with Individual Iterates

Neural Information Processing Systems

This paper proposes an online stochastic optimization algorithm (similar to SGD) that has optimal convergence rate of the last iterate in two settings (O(1/sqrt(T)) for Lipschitz convex functions and O(1/T) strongly convex functions), and additionally it allows an arbitrary non-smooth regularizer (e.g. Many subsets of the properties are achieved by prior works. Namely, it was known how to achieve these results up to O(log T) factors. It was known how to achieve the optimal rates with averaging, which, however, destroys sparsity. However, this paper has the first algorithm that has all the properties simultaneously and removes the log factors. The paper has rigorous proofs of the convergence rates and extensive numerical experiments.



Reviews: Dual Space Gradient Descent for Online Learning

Neural Information Processing Systems

The rebuttal answered all my technical questions, in particular the one about probability and expectations in Theorems 1(ii) and 3. My scores (which were given assuming the technicalities can be fixed) remain the same. The theoretical results do not feel that exciting. The regret bounds seem more or less what one might expect, and there is no comparison to bounds for other algorithms. I understand that there are no directly comparable bounds since you have both budget size and dimensionality of random projections as significant parameters, but still some discussion would be nice. The dependence on budget B in your bounds could perhaps be pointed out more explicitly.