Fast Efficient Hyperparameter Tuning for Policy Gradient Methods

Neural Information Processing Systems

The performance of policy gradient methods is sensitive to hyperparameter settings that must be tuned for any new application. Widely used grid search methods for tuning hyperparameters are sample inefficient and computationally expensive. More advanced methods like Population Based Training that learn optimal schedules for hyperparameters instead of fixed settings can yield better results, but are also sample inefficient and computationally expensive. In this paper, we propose Hyperparameter Optimisation on the Fly (HOOF), a gradient-free algorithm that requires no more than one training run to automatically adapt the hyperparameters that affect the policy update directly through the gradient. The main idea is to use existing trajectories sampled by the policy gradient method to optimise a one-step improvement objective, yielding a sample and computationally efficient algorithm that is easy to implement. Our experimental results across multiple domains and algorithms show that using HOOF to learn these hyperparameter schedules leads to faster learning with improved performance.
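The core idea described in the abstract — scoring candidate hyperparameter settings against trajectories already sampled by the policy gradient method, rather than running fresh rollouts for each candidate — can be sketched with weighted importance sampling. The helper names below (`hoof_step`, `log_prob`) are hypothetical and the code is a minimal illustration of the idea, not the paper's implementation; it assumes the hyperparameter being tuned is the learning rate of a plain gradient-ascent update.

```python
import numpy as np

def hoof_step(theta, grad, candidate_lrs, trajectories, log_prob):
    """Pick the learning rate whose one-step update maximises a
    weighted-importance-sampling (WIS) estimate of policy value.

    trajectories: list of (states, actions, total_return) tuples
        collected under the current policy parameters `theta`.
    log_prob(params, states, actions): summed log-likelihood of the
        actions under the policy with the given parameters.
    (Hypothetical interface -- a sketch of the one-step improvement
    objective, assuming gradient ascent with learning rate `lr`.)
    """
    best_lr, best_value = None, -np.inf
    for lr in candidate_lrs:
        theta_new = theta + lr * grad  # candidate one-step update
        # Importance weights of the old trajectories under the
        # candidate policy (no new samples are drawn).
        log_w = np.array([log_prob(theta_new, s, a) - log_prob(theta, s, a)
                          for s, a, _ in trajectories])
        w = np.exp(log_w - log_w.max())   # subtract max for stability
        w /= w.sum()                      # self-normalised (WIS) weights
        returns = np.array([r for _, _, r in trajectories])
        value = float(w @ returns)        # WIS estimate of candidate value
        if value > best_value:
            best_lr, best_value = lr, value
    return best_lr, theta + best_lr * grad
```

Because every candidate is evaluated on the same batch of trajectories, the search over hyperparameters costs no additional environment samples — which is what makes the approach sample efficient relative to grid search.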


Dinosaur 'mummies' prove some dinos had hooves

Popular Science

'Edmontosaurus annectens' stormed around North America during the Late Cretaceous. For the first time, paleontologists have successfully reconstructed the profiles of two massive, duck-billed dinosaurs, right down to their pebbled skin and unexpected hooves. Based in part on remains recovered decades ago in the badlands of Wyoming, the pair of specimens were preserved only thanks to an extremely rare, delicate "mummification" process. At around 39 feet long and weighing about 6.2 tons, Edmontosaurus annectens was one of the largest and most common dinosaurs in present-day North America during the Late Cretaceous period.



743c41a921516b04afde48bb48e28ce6-AuthorFeedback.pdf

Neural Information Processing Systems

HOOF is robust to settings within this range. We could not present results for Ant and Walker due to space constraints. Thus we are restricted to zero-order optimisers. For natural gradients like TNPG, HOOF does not add any new hyperparameters beyond those used by grid search. Other methods like PBT introduce more hyperparameters than these.




As a Data Scientist, You Should Know About a Clever Horse Called Hans

#artificialintelligence

In 1904, math teacher Wilhelm von Osten presented his horse Hans to the public in Berlin, Germany. Von Osten claimed that Hans was smart enough to answer complex questions. For example, Hans could read the time. He could also identify the composers of music and the painters of famous paintings. Additionally, Hans could solve math problems. As shown in Figure 1, Hans did this by tapping a certain number of times with his front hoof on a footstep.


How Do You Make a Robot Walk on Mars? It's a Steep Challenge

WIRED

From the Sojourner rover, which landed on Mars in 1997, to Perseverance, which touched down in February, the robots of the Red Planet share a defining feature: wheels. Rolling is far more stable and energy efficient than walking, which even robots on Earth still struggle to master. After all, NASA would hate for its very expensive Martian explorer to topple over and flail around like a turtle on its back. The problem with wheels, though, is that they limit where rovers can go: To explore complicated Martian terrain like steep hills, you need the kinds of legs that evolution gave animals on Earth. So a team of scientists from ETH Zurich in Switzerland and the Max Planck Institute for Solar System Research in Germany has been playing around with a small quadrupedal robot called SpaceBok, designed to mimic an antelope known as a springbok.


Fast Efficient Hyperparameter Tuning for Policy Gradient Methods

Paul, Supratik, Kurin, Vitaly, Whiteson, Shimon

Neural Information Processing Systems

The performance of policy gradient methods is sensitive to hyperparameter settings that must be tuned for any new application. Widely used grid search methods for tuning hyperparameters are sample inefficient and computationally expensive. More advanced methods like Population Based Training that learn optimal schedules for hyperparameters instead of fixed settings can yield better results, but are also sample inefficient and computationally expensive. In this paper, we propose Hyperparameter Optimisation on the Fly (HOOF), a gradient-free algorithm that requires no more than one training run to automatically adapt the hyperparameters that affect the policy update directly through the gradient. The main idea is to use existing trajectories sampled by the policy gradient method to optimise a one-step improvement objective, yielding a sample and computationally efficient algorithm that is easy to implement.


Fast Efficient Hyperparameter Tuning for Policy Gradients

Paul, Supratik, Kurin, Vitaly, Whiteson, Shimon

arXiv.org Machine Learning

The performance of policy gradient methods is sensitive to hyperparameter settings that must be tuned for any new application. Widely used grid search methods for tuning hyperparameters are sample inefficient and computationally expensive. More advanced methods like Population Based Training (Jaderberg et al., 2017) that learn optimal schedules for hyperparameters instead of fixed settings can yield better results, but are also sample inefficient and computationally expensive. In this paper, we propose Hyperparameter Optimisation on the Fly (HOOF), a gradient-free meta-learning algorithm that can automatically learn an optimal schedule for hyperparameters that affect the policy update directly through the gradient. The main idea is to use existing trajectories sampled by the policy gradient method to optimise a one-step improvement objective, yielding a sample and computationally efficient algorithm that is easy to implement. Our experimental results across multiple domains and algorithms show that using HOOF to learn these hyperparameter schedules leads to faster learning with improved performance.