AITopics | online gradient descent algorithm

Collaborating Authors

online gradient descent algorithm

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

Neural Information Processing SystemsDec-24-2025, 16:41:49 GMT

Pairwise learning refers to learning tasks where the loss function depends on a pair of instances.

name change, online gradient descent algorithm, simple stochastic, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning

Neural Information Processing SystemsJan-18-2025, 13:09:29 GMT

Pairwise learning refers to learning tasks where the loss function depends on a pair of instances. A popular approach to handle streaming data in pairwise learning is an online gradient descent (OGD) algorithm, where one needs to pair the current instance with a buffering set of previous instances with a sufficiently large size and therefore suffers from a scalability issue. In this paper, we propose simple stochastic and online gradient descent methods for pairwise learning. A notable difference from the existing studies is that we only pair the current instance with the previous one in building a gradient direction, which is efficient in both the storage and computational complexity. We develop novel stability results, optimization, and generalization error bounds for both convex and nonconvex as well as both smooth and nonsmooth problems.

online gradient descent algorithm, pairwise learning, simple stochastic

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.89)

Add feedback

Continuous Online Learning and New Insights to Online Imitation Learning

Lee, Jonathan, Cheng, Ching-An, Goldberg, Ken, Boots, Byron

arXiv.org Machine LearningDec-3-2019

Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture certain regularity in online problems in practice. Motivated by this, we establish a new setup, called Continuous Online Learning (COL), where the gradient of online loss function changes continuously across rounds with respect to the learner's decisions. We show that COL covers and more appropriately describes many interesting applications, from general equilibrium problems (EPs) to optimization in episodic MDPs. Using this new setup, we revisit the difficulty of achieving sublinear dynamic regret. We prove that there is a fundamental equivalence between achieving sublinear dynamic regret in COL and solving certain EPs, and we present a reduction from dynamic regret to both static regret and convergence rate of the associated EP. At the end, we specialize these new insights into online imitation learning and show improved understanding of its learning stability.

algorithm, dynamic regret, sublinear dynamic regret, (15 more...)

arXiv.org Machine Learning

1912.01261

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada (0.04)

Genre: Research Report (0.50)

Industry: Education > Educational Setting > Online (0.85)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback