AITopics | ess-infogail

Imitation learning aims to reproduce expert behaviors without relying on an explicit reward signal. However, real-world demonstrations often present challenges, such as multi-modal, data imbalance, and expensive labeling processes. In this work, we propose a novel semi-supervised imitation learning architecture that learns disentangled behavior representations from imbalanced demonstrations using limited labeled data. Specifically, our method consists of three key components. First, we adapt the concept of semi-supervised generative adversarial networks to the imitation learning context. Second, we employ a learnable latent distribution to align the generated and expert data distributions. Finally, we utilize a regularized information maximization approach in conjunction with an approximate label prior to further improve the semi-supervised learning performance. Experimental results demonstrate the efficiency of our method in learning multi-modal behaviors from imbalanced demonstrations compared to baseline methods.

ess-infogail, name change, semi-supervised imitation learning, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.61)

Add feedback

Supplementary A Properties of the InfoGAIL

Neural Information Processing SystemsOct-9-2025, 06:06:12 GMT

I ( x; y; c) can be decomposed as I (x; y; c) = I ( y; x) + I ( c; x) I ( y, c; x) = I ( y; x) + I ( c; x) H (y, c) + H (y, c |x) = I ( y; c) I (y; c |x). I ( s, a; s, a) is finally increased as well. The main parameters for training Ess-InfoGAIL are listed in Table 4. To minimize computational time, we restrict the update of the latent skill distribution to only the first iteration of policy updates. Our experiments demonstrate that this approach does not result in significant performance degradation.

artificial intelligence, behavior mode, machine learning, (11 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

bcf26768143c94bd36e363cd4bf5daf0-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 06:06:09 GMT

demonstration, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
(4 more...)

Add feedback

Ess-InfoGAIL: Semi-supervised Imitation Learning from Imbalanced Demonstrations

Neural Information Processing SystemsJan-19-2025, 20:46:30 GMT

Imitation learning aims to reproduce expert behaviors without relying on an explicit reward signal. However, real-world demonstrations often present challenges, such as multi-modal, data imbalance, and expensive labeling processes. In this work, we propose a novel semi-supervised imitation learning architecture that learns disentangled behavior representations from imbalanced demonstrations using limited labeled data. Specifically, our method consists of three key components. First, we adapt the concept of semi-supervised generative adversarial networks to the imitation learning context. Second, we employ a learnable latent distribution to align the generated and expert data distributions.

ess-infogail, imbalanced demonstration, semi-supervised imitation learning

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.46)

Add feedback