AITopics | Education

Education

Learning to learn by gradient descent by gradient descent

Marcin Andrychowicz, Misha Denil, Sergio Gómez, Matthew W. Hoffman, David Pfau, Tom Schaul, Nando de Freitas

Neural Information Processing Systems | Apr-22-2026, 14:32:21 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, optimizer, (14 more...)

Neural Information Processing Systems

Country: Europe (0.46)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.88)

Lifelong Learning with Weighted Majority Votes

Anastasia Pentina, Ruth Urner

Neural Information Processing Systems | Apr-22-2026, 13:03:24 GMT

Better understanding of the potential benefits of information transfer and representation learning is an important step towards the goal of building intelligent systems that are able to persist in the world and learn over time. In this work, we consider a setting where the learner encounters a stream of tasks but is able to retain only limited information from each encountered task, such as a learned predictor. In contrast to most previous works analyzing this scenario, we do not make any distributional assumptions on the task generating process. Instead, we formulate a complexity measure that captures the diversity of the observed tasks. We provide a lifelong learning algorithm with error guarantees for every observed task (rather than on average). We show sample complexity reductions in comparison to solving every task in isolation in terms of our task complexity measure. Further, our algorithmic framework can naturally be viewed as learning a representation from encountered tasks with a neural network.

artificial intelligence, hypothesis, machine learning, (14 more...)

Neural Information Processing Systems

Country: Europe (0.68)

Genre: Instructional Material (0.65)

Industry: Education > Educational Setting > Continuing Education (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

My Partner Just Got Laid Off From His Job of 12 Years. What He's Doing Now Boggles the Mind.

Slate | Apr-22-2026, 10:00:00 GMT

My partner, a 36-year-old man, is being let go from his job. He was informed that his company would be cutting him and his entire department. The only person staying is his boss, who will be overseeing the new AI customer service client they are replacing the real-life people with. But that's not what this is about, even if AI is going to be the death of humanity.

artificial intelligence, slate shop game newsletter sign, social media, (10 more...)

Slate

Industry:

Marketing (1.00)
Education (1.00)
Banking & Finance > Financial Services (0.52)

Technology:

Information Technology > Artificial Intelligence (0.35)
Information Technology > Communications > Social Media (0.31)

dc4c44f624d600aa568390f1f1104aa0-Paper.pdf

Neural Information Processing Systems | Apr-22-2026, 09:42:48 GMT

artificial intelligence, constraint, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.48)

Industry: Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

db116b39f7a3ac5366079b1d9fe249a5-Paper.pdf

Neural Information Processing Systems | Apr-22-2026, 09:41:55 GMT

artificial intelligence, bernstein condition, machine learning, (16 more...)

Neural Information Processing Systems

Country: Europe > Netherlands (0.28)

Industry: Education > Educational Setting > Online (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Mistake Bounds for Binary Matrix Completion

Mark Herbster, Stephen Pasteris, Massimiliano Pontil

Neural Information Processing Systems | Apr-22-2026, 07:04:29 GMT

We study the problem of completing a binary matrix in an online learning setting. On each trial we predict a matrix entry and then receive the true entry. We propose a Matrix Exponentiated Gradient algorithm [1] to solve this problem. We provide a mistake bound for the algorithm, which scales with the margin complexity [2, 3] of the underlying matrix. The bound suggests an interpretation where each row of the matrix is a prediction task over a finite set of objects, the columns. Using this we show that the algorithm makes a number of mistakes which is comparable up to a logarithmic factor to the number of mistakes made by the Kernel Perceptron with an optimal kernel in hindsight. We discuss applications of the algorithm to predicting as well as the best biclustering and to the problem of predicting the labeling of a graph without knowing the graph in advance.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe (0.68)
North America > United States (0.46)

Industry:

Government (0.47)
Education > Educational Setting > Online (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.36)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.35)

DECOrrelated feature space partitioning for distributed sparse regression

Xiangyu Wang, David B. Dunson, Chenlei Leng

Neural Information Processing Systems | Apr-22-2026, 07:03:00 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, subset, (17 more...)

Neural Information Processing Systems

Industry: Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Without-Replacement Sampling for Stochastic Gradient Methods

Ohad Shamir, Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel (ohad.shamir@weizmann.ac.il)

Neural Information Processing Systems | Apr-22-2026, 03:44:07 GMT

Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled with replacement. In contrast, sampling without replacement is far less understood, yet in practice it is very common, often easier to implement, and usually performs better. In this paper, we provide competitive convergence guarantees for without-replacement sampling under several scenarios, focusing on the natural regime of few passes over the data. Moreover, we describe a useful application of these results in the context of distributed optimization with randomly-partitioned data, yielding a nearly-optimal algorithm for regularized least squares (in terms of both communication complexity and runtime complexity) under broad parameter regimes. Our proof techniques combine ideas from stochastic optimization, adversarial online learning and transductive learning theory, and can potentially be applied to other stochastic optimization and learning problems.
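The two sampling schemes contrasted in this abstract can be illustrated with a minimal sketch. It is not the paper's algorithm or analysis: the function name `sgd_least_squares`, the one-dimensional least-squares objective, the step size, and the toy data are all assumptions chosen for illustration. Without replacement, each pass visits a fresh random permutation of the data; with replacement, each step draws an index independently.

```python
import random

def sgd_least_squares(xs, ys, lr=0.05, epochs=20, without_replacement=True, seed=0):
    """Fit y ~ w * x by single-sample SGD (hypothetical toy setup)."""
    rng = random.Random(seed)
    n = len(xs)
    w = 0.0
    for _ in range(epochs):
        if without_replacement:
            # One pass over a random permutation: every point is used exactly once.
            order = list(range(n))
            rng.shuffle(order)
        else:
            # n independent draws: some points may repeat, others may be skipped.
            order = [rng.randrange(n) for _ in range(n)]
        for i in order:
            # Gradient of 0.5 * (w * x_i - y_i)^2 with respect to w.
            grad = (w * xs[i] - ys[i]) * xs[i]
            w -= lr * grad
    return w

# Toy data generated by y = 2x, so the minimizer is w = 2.
xs = [0.5, 1.0, 1.5, 2.0]
ys = [1.0, 2.0, 3.0, 4.0]
w_perm = sgd_least_squares(xs, ys, without_replacement=True)
w_iid = sgd_least_squares(xs, ys, without_replacement=False)
print(w_perm, w_iid)
```

On this noiseless toy problem both variants move toward w = 2; the sketch shows only how the index sequences differ, not the convergence rates the paper establishes.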

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.40)
Europe (0.28)

Industry: Education (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.74)

Stochastic Online AUC Maximization

Yiming Ying, Longyin Wen, Siwei Lyu

Neural Information Processing Systems | Apr-22-2026, 03:42:40 GMT

Area under ROC (AUC) is a metric which is widely used for measuring the classification performance for imbalanced data. It is of theoretical and practical interest to develop online learning algorithms that maximize AUC for large-scale data. A specific challenge in developing an online AUC maximization algorithm is that the learning objective function is usually defined over a pair of training examples of opposite classes, and existing methods achieve online processing with higher space and time complexity. In this work, we propose a new stochastic online algorithm for AUC maximization. In particular, we show that AUC optimization can be equivalently formulated as a convex-concave saddle point problem. From this saddle representation, a stochastic online algorithm (SOLAM) is proposed which has time and space complexity of one datum. We establish theoretical convergence of SOLAM with high probability and demonstrate its effectiveness on standard benchmark datasets.
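To make the pairwise nature of the objective concrete, the following minimal sketch computes AUC directly as the fraction of positive-negative pairs that a scorer orders correctly, counting ties as half. This is the standard batch definition, not SOLAM; the function name `pairwise_auc` and the toy scores are assumptions for illustration.

```python
def pairwise_auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y != 1]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    wins = 0.0
    # Every positive is compared with every negative: len(pos) * len(neg) pairs.
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5  # ties count as half a correct ordering
    return wins / (len(pos) * len(neg))

scores = [0.9, 0.8, 0.3, 0.6, 0.2]
labels = [1, 0, 1, 1, 0]
auc = pairwise_auc(scores, labels)
print(auc)  # 4 of the 6 positive-negative pairs are ordered correctly
```

Because the loop touches all opposite-class pairs, a naive online version would need to store past examples, which is the cost the abstract says the saddle point formulation avoids.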

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry: Education > Educational Setting > Online (0.36)

Technology: