AITopics | versatility

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

Neural Information Processing SystemsMar-22-2026, 22:51:33 GMT

Sequential decision-making can be formulated as a conditional generation process, with targets for alignment with human intents and versatility across various tasks. Previous return-conditioned diffusion models manifest comparable performance but rely on well-defined reward functions, which requires amounts of human efforts and faces challenges in multi-task settings. Preferences serve as an alternative but recent work rarely considers preference learning given multiple tasks. To facilitate the alignment and versatility in multi-task preference learning, we adopt multi-task preferences as a unified framework. In this work, we propose to learn preference representations aligned with preference labels, which are then used as conditions to guide the conditional generation process of diffusion models. The traditional classifier-free guidance paradigm suffers from the inconsistency between the conditions and generated trajectories.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

CPM-Nets: Cross Partial Multi-View Networks

Neural Information Processing SystemsDec-25-2025, 01:02:32 GMT

Despite multi-view learning progressed fast in past decades, it is still challenging due to the difficulty in modeling complex correlation among different views, especially under the context of view missing. To address the challenge, we propose a novel framework termed Cross Partial Multi-View Networks (CPM-Nets). In this framework, we first give a formal definition of completeness and versatility for multi-view representation and then theoretically prove the versatility of the latent representation learned from our algorithm. To achieve the completeness, the task of learning latent multi-view representation is specifically translated to degradation process through mimicking data transmitting, such that the optimal tradeoff between consistence and complementarity across different views could be achieved. In contrast with methods that either complete missing views or group samples according to view-missing patterns, our model fully exploits all samples and all views to produce structured representation for interpretability.

artificial intelligence, machine learning, proceedings, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion

Liao, Qiayuan, Truong, Takara E., Huang, Xiaoyu, Gao, Yuman, Tevet, Guy, Sreenath, Koushil, Liu, C. Karen

arXiv.org Artificial IntelligenceNov-14-2025

The human-like form of humanoid robots positions them uniquely to achieve the agility and versatility in motor skills that humans possess. Learning from human demonstrations offers a scalable approach to acquiring these capabilities. However, prior works either produce unnatural motions or rely on motion-specific tuning to achieve satisfactory naturalness. Furthermore, these methods are often motion- or goal-specific, lacking the versatility to compose diverse skills, especially when solving unseen tasks. We present BeyondMimic, a framework that scales to diverse motions and carries the versatility to compose them seamlessly in tackling unseen downstream tasks. At heart, a compact motion-tracking formulation enables mastering a wide range of radically agile behaviors, including aerial cartwheels, spin-kicks, flip-kicks, and sprinting, with a single setup and shared hyperparameters, all while achieving state-of-the-art human-like performance. Moving beyond the mere imitation of existing motions, we propose a unified latent diffusion model that empowers versatile goal specification, seamless task switching, and dynamic composition of these agile behaviors. Leveraging classifier guidance, a diffusion-specific technique for test-time optimization toward novel objectives, our model extends its capability to solve downstream tasks never encountered during training, including motion inpainting, joystick teleoperation, and obstacle avoidance, and transfers these skills zero-shot to real hardware. This work opens new frontiers for humanoid robots by pushing the limits of scalable human-like motor skill acquisition from human motion and advancing seamless motion synthesis that achieves generalization and versatility beyond training setups.

diffusion model, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2508.08241

Country: North America > United States > California (0.46)

Genre: Research Report > New Finding (0.46)

Industry:

Education (0.87)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (0.68)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

ATRos: Learning Energy-Efficient Agile Locomotion for Wheeled-legged Robots

Sun, Jingyuan, Ji, Hongyu, Qu, Zihan, Wang, Chaoran, Zhang, Mingyu

arXiv.org Artificial IntelligenceOct-14-2025

Hybrid locomotion of wheeled-legged robots has recently attracted increasing attention due to their advantages of combining the agility of legged locomotion and the efficiency of wheeled motion. But along with expanded performance, the whole-body control of wheeled-legged robots remains challenging for hybrid locomotion. In this paper, we present ATRos, a reinforcement learning (RL)-based hybrid locomotion framework to achieve hybrid walking-driving motions on the wheeled-legged robot. Without giving predefined gait patterns, our planner aims to intelligently coordinate simultaneous wheel and leg movements, thereby achieving improved terrain adaptability and improved energy efficiency. Based on RL techniques, our approach constructs a prediction policy network that could estimate external environmental states from proprioceptive sensory information, and the outputs are then fed into an actor critic network to produce optimal joint commands. The feasibility of the proposed framework is validated through both simulations and real-world experiments across diverse terrains, including flat ground, stairs, and grassy surfaces. The hybrid locomotion framework shows robust performance over various unseen terrains, highlighting its generalization capability.

artificial intelligence, locomotion, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2510.0998

Country: Asia > China (0.48)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Add feedback

CPM-Nets: Cross Partial Multi-View Networks

Neural Information Processing SystemsOct-2-2025, 03:31:06 GMT

In the real-word applications, data is usually represented in different views, including multiple modalities or multiple types of features.

artificial intelligence, machine learning, representation, (17 more...)

Neural Information Processing Systems

Country: Asia > Middle East > UAE (0.28)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

11b9842e0a271ff252c1903e7132cd68-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 03:30:51 GMT

artificial intelligence, machine learning, nielsen and witten, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

Appendix A Versatility of the neuron model In our neuron model, depending on the decay coefficients

Neural Information Processing SystemsAug-16-2025, 23:41:10 GMT

The SRM-based back-propagation can be summarized using the relationship between the potentials as follows. Hyper-parameters used for loss landscape estimation (Section 3.4) and random spike-train matching Some of the hyper-parameters were not mentioned in the paper. Table A1: Hyper-parameters used for loss landscape estimation (Section 3.4) and random spike-train matching

neuron model, rnn-like description, spike, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

Neural Information Processing SystemsMay-27-2025, 21:57:57 GMT

Sequential decision-making can be formulated as a conditional generation process, with targets for alignment with human intents and versatility across various tasks. Previous return-conditioned diffusion models manifest comparable performance but rely on well-defined reward functions, which requires amounts of human efforts and faces challenges in multi-task settings. Preferences serve as an alternative but recent work rarely considers preference learning given multiple tasks. To facilitate the alignment and versatility in multi-task preference learning, we adopt multi-task preferences as a unified framework. In this work, we propose to learn preference representations aligned with preference labels, which are then used as conditions to guide the conditional generation process of diffusion models.

artificial intelligence, machine learning, regularized conditional diffusion model, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The best Arduino starter kits of 2024

We may earn revenue from the products available on this page and participate in affiliate programs. Arduino kits are great for teaching students about science, technology, engineering, and math. The Interaction Design Institute in Turin, Italy, created Arduino in 2005 to provide people of all ages with an easy, inexpensive way to build electronic devices and control them with rudimentary code. By making Arduino an open-source platform, the Institute made the technology freely available to anyone, which led to a vast array of starter kits. Today, everyone from young children to seasoned professional techies uses them to build everything from simple devices that turn on the lights to robots controlled remotely via Wi-Fi. Given their popularity, there are a lot of kits available, like our best overall pick, the Official Arduino Starter Kit.

arduino starter kit, beginner, starter kit, (13 more...)

Popular Science

Country: Europe > Italy > Piedmont > Turin Province > Turin (0.25)

Technology:

Information Technology > Software (0.50)
Information Technology > Artificial Intelligence > Robots (0.36)

Add feedback

CPM-Nets: Cross Partial Multi-View Networks

Neural Information Processing SystemsOct-9-2024, 13:34:25 GMT

Despite multi-view learning progressed fast in past decades, it is still challenging due to the difficulty in modeling complex correlation among different views, especially under the context of view missing. To address the challenge, we propose a novel framework termed Cross Partial Multi-View Networks (CPM-Nets). In this framework, we first give a formal definition of completeness and versatility for multi-view representation and then theoretically prove the versatility of the latent representation learned from our algorithm. To achieve the completeness, the task of learning latent multi-view representation is specifically translated to degradation process through mimicking data transmitting, such that the optimal tradeoff between consistence and complementarity across different views could be achieved. In contrast with methods that either complete missing views or group samples according to view-missing patterns, our model fully exploits all samples and all views to produce structured representation for interpretability.

cpm-net, cross partial multi-view network, representation, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.44)

Add feedback