A Reward Net Algorithm

Neural Information Processing Systems

In this section, we present the detailed procedures of MRN in Algorithm 1. In Section 4.2, the implicit derivative at iteration k is calculated via the chain of inequalities above: the first step follows from the Cauchy-Schwarz inequality, and the last inequality holds by the definition of Lipschitz smoothness. Lemma 2 states that, under the smoothness assumption on the outer loss, its gradient is Lipschitz continuous; Theorems 1 and 2 rest on the same assumption on the outer loss. Even worse, it might be difficult for human experts to give preferences over trajectory pairs (e.g., a pair of poor trajectories), which significantly reduces the efficiency of feedback in the initial stage.
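The implicit derivative mentioned above differentiates an outer (reward) loss through an inner gradient step. As a hedged illustration of that mechanism only, not the paper's actual MRN code, the following toy sketch computes a one-step meta-gradient for scalar quadratic losses, where the chain rule through the inner update can be written in closed form and checked against finite differences (all function names here are hypothetical):

```python
# Toy one-step meta-gradient: the inner learner takes a gradient step
# on an inner loss that depends on outer parameter psi, and the outer
# update differentiates through that step (the "implicit derivative").

def inner_grad(theta, psi):
    # inner loss: 0.5 * (theta - psi)^2, so d/dtheta = theta - psi
    return theta - psi

def inner_step(theta, psi, alpha):
    # one SGD step on the inner loss
    return theta - alpha * inner_grad(theta, psi)

def outer_loss(theta, target):
    return 0.5 * (theta - target) ** 2

def hypergradient(theta, psi, alpha, target):
    # chain rule through the inner step:
    # dL_out/dpsi = L_out'(theta') * dtheta'/dpsi, and for this
    # quadratic inner loss dtheta'/dpsi = alpha exactly.
    theta_new = inner_step(theta, psi, alpha)
    return (theta_new - target) * alpha

# central finite-difference check of the analytic hypergradient
theta0, psi0, alpha, target, eps = 0.0, 1.0, 0.1, 2.0, 1e-6
g_analytic = hypergradient(theta0, psi0, alpha, target)
g_fd = (outer_loss(inner_step(theta0, psi0 + eps, alpha), target)
        - outer_loss(inner_step(theta0, psi0 - eps, alpha), target)) / (2 * eps)
print(abs(g_analytic - g_fd) < 1e-6)  # True
```

In MRN-style bi-level reward learning the same structure appears with the reward network as the outer parameters and the critic as the inner learner; the Lipschitz conditions in the lemma guarantee this hypergradient is well behaved.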


A Experimental Details

Neural Information Processing Systems

A.1 Environments and Tasks We provide the details about the environments and tasks used in our experiments in Table 1. Table 1: Environments and tasks from the DeepMind control suite [29] used in our experiments. Near-expert data: Same as the above near-expert dataset, but we include only 2M steps of experience (2K episodes in total) for each task. Goal-MLP Training We adapt the training of Goal-MLP so that it learns to reach goals with varying time budgets. MaskDP is designed to be accessible to the RL research community.


Guiding Skill Discovery with Foundation Models

Yang, Zhao, Moerland, Thomas M., Preuss, Mike, Plaat, Aske, François-Lavet, Vincent, Hu, Edward S.

arXiv.org Artificial Intelligence

Learning diverse skills without hand-crafted reward functions could accelerate reinforcement learning in downstream tasks. However, existing skill discovery methods focus solely on maximizing the diversity of skills without considering human preferences, which leads to undesirable behaviors and possibly dangerous skills. For instance, a cheetah robot trained using previous methods learns to roll in all directions to maximize skill diversity, whereas we would prefer it to run without flipping or entering hazardous areas. In this work, we propose a Foundation model Guided (FoG) skill discovery method, which incorporates human intentions into skill discovery through foundation models. Specifically, FoG extracts a score function from foundation models to evaluate states based on human intentions, assigning higher values to desirable states and lower values to undesirable ones. These scores are then used to re-weight the rewards of skill discovery algorithms. By optimizing the re-weighted skill discovery rewards, FoG successfully learns to eliminate undesirable behaviors, such as flipping or rolling, and to avoid hazardous areas in both state-based and pixel-based tasks. Interestingly, we show that FoG can discover skills involving behaviors that are difficult to define. Interactive visualisations are available from https://sites.google.com/view/submission-fog.
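The re-weighting step the abstract describes can be sketched in a few lines. This is a hedged illustration only, not the paper's implementation: `desirability_score` is a hypothetical stand-in for a foundation-model query (here a hand-coded rule penalizing flipped states), and the skill-discovery reward is simply scaled by that score.

```python
# Sketch of foundation-model-guided reward re-weighting: a score in
# [0, 1] rates each state against human intentions, and the skill
# discovery reward is multiplied by it, so undesirable states stop
# contributing to the skill objective.

def desirability_score(state):
    # hypothetical scorer: treat |pitch| > 1.5 rad as flipped/undesirable
    return 0.0 if abs(state["pitch"]) > 1.5 else 1.0

def reweighted_reward(state, skill_reward):
    # maximizing this re-weighted reward suppresses flipped states
    return desirability_score(state) * skill_reward

upright = {"pitch": 0.2}
flipped = {"pitch": 3.0}
print(reweighted_reward(upright, 1.0))  # 1.0
print(reweighted_reward(flipped, 1.0))  # 0.0
```

In the paper's setting the score would come from querying a foundation model about the state rather than from a hand-written threshold; the re-weighting mechanics are the same.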


Optimistic Task Inference for Behavior Foundation Models

Rupf, Thomas, Bagatella, Marco, Vlastelica, Marin, Krause, Andreas

arXiv.org Artificial Intelligence

Behavior Foundation Models (BFMs) are capable of retrieving a high-performing policy for any reward function specified directly at test-time, commonly referred to as zero-shot reinforcement learning (RL). While this is a very efficient process in terms of compute, it can be less so in terms of data: as a standard assumption, BFMs require computing rewards over a non-negligible inference dataset, assuming either access to a functional form of the reward or significant labeling effort. To alleviate these limitations, we tackle the problem of task inference purely through interaction with the environment at test-time. We propose OpTI-BFM, an optimistic decision criterion that directly models uncertainty over reward functions and guides BFMs in data collection for task inference. Formally, we provide a regret bound for well-trained BFMs through a direct connection to upper-confidence algorithms for linear bandits. Empirically, we evaluate OpTI-BFM on established zero-shot benchmarks, and observe that it enables successor-features-based BFMs to identify and optimize an unseen reward function in a handful of episodes with minimal compute overhead. Code is available at https://github.com/ThomasRupf/opti-bfm.
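The connection to upper-confidence linear bandits can be sketched concretely. The following is an illustrative toy under the assumed linear-reward form only, not the authors' code: rewards are modeled as r = w^T phi, a ridge estimate of w and its design matrix define a confidence ellipsoid, and the candidate policy whose expected features (successor features) score highest under the upper confidence bound is selected.

```python
import numpy as np

# Optimistic task inference over linear reward weights (toy sketch).
rng = np.random.default_rng(0)
d, lam, beta = 3, 1.0, 2.0
w_true = np.array([1.0, -0.5, 0.2])   # unknown reward weights

# expected feature vectors (successor features) of candidate policies
candidates = rng.normal(size=(5, d))

A = lam * np.eye(d)                    # regularized design matrix
b = np.zeros(d)
for _ in range(30):                    # interact: observe noisy rewards
    psi = candidates[rng.integers(len(candidates))]
    r = psi @ w_true + 0.01 * rng.normal()
    A += np.outer(psi, psi)
    b += r * psi

w_hat = np.linalg.solve(A, b)          # ridge estimate of w
A_inv = np.linalg.inv(A)
# UCB score: estimated value + exploration bonus from the ellipsoid
ucb = candidates @ w_hat + beta * np.sqrt(
    np.einsum("id,dk,ik->i", candidates, A_inv, candidates))
best = int(np.argmax(ucb))             # optimistic policy choice
print(bool(np.all(ucb >= candidates @ w_true)))  # True (optimism holds)
```

With beta chosen larger than the confidence-set radius (here roughly sqrt(lam)·||w|| plus a small noise term), the UCB upper-bounds every candidate's true value, which is the property the regret analysis for such upper-confidence methods relies on.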




SecONNds: Secure Outsourced Neural Network Inference on ImageNet

Balla, Shashank

arXiv.org Artificial Intelligence

The widespread adoption of outsourced neural network inference presents significant privacy challenges, as sensitive user data is processed on untrusted remote servers. Secure inference offers a privacy-preserving solution, but existing frameworks suffer from high computational overhead and communication costs, rendering them impractical for real-world deployment. We introduce SecONNds, a non-intrusive secure inference framework optimized for large ImageNet-scale Convolutional Neural Networks. SecONNds integrates a novel fully Boolean Goldreich-Micali-Wigderson (GMW) protocol for secure comparison -- addressing Yao's millionaires' problem -- using preprocessed Beaver's bit triples generated from Silent Random Oblivious Transfer. Our novel protocol achieves an online speedup of 17$\times$ in nonlinear operations compared to state-of-the-art solutions while reducing communication overhead. To further enhance performance, SecONNds employs Number Theoretic Transform (NTT) preprocessing and leverages GPU acceleration for homomorphic encryption operations, resulting in speedups of 1.6$\times$ on CPU and 2.2$\times$ on GPU for linear operations. We also present SecONNds-P, a bit-exact variant that ensures verifiable full-precision results in secure computation, matching the results of plaintext computations. Evaluated on a 37-bit quantized SqueezeNet model, SecONNds achieves an end-to-end inference time of 2.8 s on GPU and 3.6 s on CPU, with a total communication of just 420 MiB. SecONNds' efficiency and reduced computational load make it well-suited for deploying privacy-sensitive applications in resource-constrained environments. SecONNds is open source and can be accessed from: https://github.com/shashankballa/SecONNds.
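The preprocessed Beaver bit triples at the core of the comparison protocol can be illustrated in plain Python. This is a plaintext two-party simulation of the arithmetic only, not a secure implementation: shares are XOR shares of single bits, and a triple (a, b, c) with c = a AND b lets the parties compute an AND of secret bits after opening two masked values.

```python
import secrets

# Beaver bit-triple AND under 2-party XOR secret sharing (toy simulation).

def share(bit):
    # split a bit into two XOR shares
    r = secrets.randbelow(2)
    return r, bit ^ r

def beaver_bit_triple():
    # preprocessing: random a, b and shares of a, b, and c = a AND b
    a, b = secrets.randbelow(2), secrets.randbelow(2)
    return share(a), share(b), share(a & b)

def shared_and(x_sh, y_sh):
    (a0, a1), (b0, b1), (c0, c1) = beaver_bit_triple()
    # both parties open d = x ^ a and e = y ^ b; this leaks nothing
    # about x, y because a, b are uniformly random masks
    d = (x_sh[0] ^ a0) ^ (x_sh[1] ^ a1)
    e = (y_sh[0] ^ b0) ^ (y_sh[1] ^ b1)
    # over GF(2): x*y = d*e ^ d*b ^ e*a ^ c
    z0 = c0 ^ (d & b0) ^ (e & a0) ^ (d & e)  # only party 0 adds d & e
    z1 = c1 ^ (d & b1) ^ (e & a1)
    return z0, z1

for x in (0, 1):
    for y in (0, 1):
        z0, z1 = shared_and(share(x), share(y))
        assert z0 ^ z1 == x & y
print("ok")
```

In a real GMW-style protocol the two shares live on different machines and the triples come from an oblivious-transfer phase (Silent Random OT in SecONNds); the reconstruction identity is the same.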


Performance Optimization of Ratings-Based Reinforcement Learning

Rose, Evelyn, White, Devin, Wu, Mingkang, Lawhern, Vernon, Waytowich, Nicholas R., Cao, Yongcan

arXiv.org Artificial Intelligence

This paper explores multiple optimization methods to improve the performance of rating-based reinforcement learning (RbRL). RbRL, a method based on the idea of human ratings, has been developed to infer reward functions in reward-free environments for subsequent policy learning via standard reinforcement learning, which requires the availability of reward functions. Specifically, RbRL minimizes a cross-entropy loss that quantifies the differences between human ratings and estimated ratings derived from the inferred reward. Hence, a low loss means a high degree of consistency between human ratings and estimated ratings. Despite its simple form, RbRL has various hyperparameters and can be sensitive to various factors. It is therefore critical to provide comprehensive experiments to understand the impact of these hyperparameters on the performance of RbRL. This paper is a work in progress that provides users with some general guidelines on how to select hyperparameters in RbRL.
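The cross-entropy objective described above can be sketched with a simplified stand-in for the rating model (not the authors' exact formulation): an inferred reward assigns each segment a normalized return, a softmax over distances to hypothetical per-class anchor points turns that return into a rating distribution, and the cross-entropy against the human rating is the loss being minimized.

```python
import numpy as np

# Simplified rating-based cross-entropy: segments whose normalized
# return matches the human rating class should incur low loss.

def rating_probs(norm_return, anchors, temperature=0.1):
    # classes whose anchor lies near the normalized return get high mass
    logits = -np.abs(norm_return - anchors) / temperature
    z = np.exp(logits - logits.max())
    return z / z.sum()

def cross_entropy(norm_return, human_rating, anchors):
    p = rating_probs(norm_return, anchors)
    return -np.log(p[human_rating] + 1e-12)

anchors = np.array([0.1, 0.5, 0.9])        # 3 rating classes (hypothetical)
good_fit = cross_entropy(0.88, 2, anchors)  # return consistent with rating 2
bad_fit = cross_entropy(0.12, 2, anchors)   # return contradicts rating 2
print(good_fit < bad_fit)  # True: consistency yields lower loss
```

Hyperparameters such as the temperature and the placement of class boundaries are exactly the kind of knobs whose sensitivity the paper's experiments set out to characterize.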