AITopics | Greater Victoria

Collaborating Authors

Greater Victoria

Improving Line Search Methods for Large Scale Neural Network Training

Kenneweg, Philip, Kenneweg, Tristan, Hammer, Barbara

arXiv.org Artificial IntelligenceMar-27-2024

In recent studies, line search methods have shown significant improvements in the performance of traditional stochastic gradient descent techniques, eliminating the need for a specific learning rate schedule. In this paper, we identify existing issues in state-of-the-art line search methods, propose enhancements, and rigorously evaluate their effectiveness. We test these methods on larger datasets and more complex data domains than before. Specifically, we improve the Armijo line search by integrating the momentum term from ADAM in its search direction, enabling efficient large-scale training, a task that was previously prone to failure using Armijo line search methods. Our optimization approach outperforms both the previous Armijo implementation and tuned learning rate schedules for Adam. Our evaluation focuses on Transformers and CNNs in the domains of NLP and image data. Our work is publicly available as a Python package, which provides a hyperparameter free Pytorch optimizer.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ACDSA59508.2024.10467724

2403.18519

Country:

North America > United States > California > San Diego County > San Diego (0.04)
Europe > Germany > North Rhine-Westphalia (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Africa > Seychelles > Greater Victoria > English River > Victoria (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Bogoclu, Can, Vosshall, Robert, Cremanns, Kevin, Roos, Dirk

arXiv.org Machine LearningMar-23-2024

Probabilistic world models increase data efficiency of model-based reinforcement learning (MBRL) by guiding the policy with their epistemic uncertainty to improve exploration and acquire new samples. Moreover, the uncertainty-aware learning procedures in probabilistic approaches lead to robust policies that are less sensitive to noisy observations compared to uncertainty unaware solutions. We propose to combine trajectory sampling and deep Gaussian covariance network (DGCN) for a data-efficient solution to MBRL problems in an optimal control setting. We compare trajectory sampling with density-based approximation for uncertainty propagation using three different probabilistic world models; Gaussian processes, Bayesian neural networks, and DGCNs. We provide empirical evidence using four different well-known test environments, that our method improves the sample-efficiency over other combinations of uncertainty propagation methods and probabilistic models. During our tests, we place particular emphasis on the robustness of the learned policies with respect to noisy initial states.

artificial intelligence, machine learning, trajectory, (17 more...)

arXiv.org Machine Learning

doi: 10.1109/ACDSA59508.2024.10467448

2403.15908

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Washington > King County > Bellevue (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(5 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback