ARUBA: Learning-to-Learn with Less Regret


Figure 1: Illustration of the meta-learning process as applied to personalized next-word prediction. Each mobile device corresponds to a different next-word prediction task, with the test task not seen during meta-training (Step 1).

In the classical machine learning setup, we aim to learn a single model for a single task, given many training samples drawn from the same distribution. In many practical applications, however, we are exposed to several distinct yet related tasks, each with only a few examples. Because the data now come from different distributions, simply learning a single global model, e.g., via stochastic gradient descent (SGD), may perform poorly on each individual task.
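The failure mode above can be seen in a toy sketch (not from the paper; all task slopes and hyperparameters below are illustrative assumptions): several related 1-D regression tasks, each with only a few samples, where a single SGD-trained global model is compared against models fit per task.

```python
import random

# Toy setup (hypothetical): each task i is y = a_i * x with its own
# slope a_i, and only a few samples per task.
random.seed(0)
num_tasks, samples_per_task = 5, 4
slopes = [1.0, 1.5, 2.0, 2.5, 3.0]  # distinct but related tasks

tasks = []
for a in slopes:
    xs = [random.uniform(-1, 1) for _ in range(samples_per_task)]
    tasks.append([(x, a * x) for x in xs])

def sgd_fit(data, w=0.0, lr=0.1, epochs=200):
    """Fit y ~ w * x by plain SGD on squared error."""
    for _ in range(epochs):
        for x, y in data:
            w -= lr * 2 * (w * x - y) * x  # gradient of (w*x - y)^2
    return w

def mse(w, data):
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

# One global model trained on the pooled data from all tasks.
pooled = [pt for task in tasks for pt in task]
w_global = sgd_fit(pooled)

# Separate models, each trained only on its own task's few samples.
per_task_ws = [sgd_fit(task) for task in tasks]

global_err = sum(mse(w_global, t) for t in tasks) / num_tasks
per_task_err = sum(mse(w, t) for w, t in zip(per_task_ws, tasks)) / num_tasks
print(f"global model avg MSE:    {global_err:.4f}")
print(f"per-task models avg MSE: {per_task_err:.4f}")
```

The global model is forced toward a compromise slope, so its average per-task error stays high, while each per-task fit recovers its own slope; meta-learning aims to get the best of both by sharing structure across tasks while still adapting to each one.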