

f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Neural Information Processing Systems

Imitation learning (IL) aims to learn a policy from expert demonstrations that minimizes the discrepancy between the learner and expert behaviors. Various imitation learning algorithms have been proposed with different pre-determined divergences to quantify the discrepancy. This naturally gives rise to the following question: Given a set of expert demonstrations, which divergence can recover the expert policy more accurately with higher data efficiency? In this work, we propose f-GAIL - a new generative adversarial imitation learning model - that automatically learns a discrepancy measure from the f-divergence family as well as a policy capable of producing expert-like behaviors. Compared with IL baselines with various predefined divergence measures, f-GAIL learns better policies with higher data efficiency in six physics-based control tasks.
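The discrepancy measures f-GAIL searches over come from the f-divergence family, which is typically estimated in adversarial methods via a variational lower bound involving the convex conjugate f* of the generator function f (the f-GAN-style bound). As a minimal illustration, not the authors' implementation, the sketch below estimates that lower bound from samples for a fixed critic; in f-GAIL both the critic and the shape of f itself would be learned. The names `T`, `f_star_kl`, and `f_div_lower_bound` are illustrative.

```python
import numpy as np

# Variational lower bound on an f-divergence (f-GAN-style):
#   D_f(P || Q) >= E_{x~P}[T(x)] - E_{x~Q}[f*(T(x))]
# where f* is the convex conjugate of the generator function f.

def f_star_kl(t):
    """Convex conjugate of f(u) = u*log(u), which generates the KL divergence."""
    return np.exp(t - 1.0)

def f_div_lower_bound(T, expert_samples, learner_samples, f_star=f_star_kl):
    """Sample estimate of the lower bound on D_f(expert || learner) for a critic T."""
    return T(expert_samples).mean() - f_star(T(learner_samples)).mean()

rng = np.random.default_rng(0)
expert = rng.normal(0.0, 1.0, size=10_000)   # stand-in for expert occupancy samples
learner = rng.normal(0.5, 1.0, size=10_000)  # stand-in for learner occupancy samples

# A crude fixed critic for illustration; a learned critic would tighten the bound.
T = lambda x: 0.5 * x
bound = f_div_lower_bound(T, expert, learner)
```

For these two Gaussians the true KL divergence is 0.125, and any critic yields an estimate at or below it; maximizing the bound over critics (and, in f-GAIL, over the choice of f) is what drives the adversarial training.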


Review for NeurIPS paper: f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Neural Information Processing Systems

Additional Feedback: My other main concern is that the objective in Eq. (5) is poorly motivated and its implications are underexplored. The imitation learning objective is notoriously ill-defined, and a large part of the literature focuses on introducing objectives that produce good behavior. The notion of finding the "best" f-divergence therefore requires us to state what we are optimizing for, which the authors don't do very explicitly. On line 38, the authors mention that an imitation learning method that uses a fixed divergence is likely to learn a sub-optimal policy, but the notion of optimality does not exist without a given divergence. For example, whether mode-seeking or mode-covering behavior is better is entirely dependent on context that the agent does not have. Either solution could be better.


Review for NeurIPS paper: f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Neural Information Processing Systems

After reading the authors' rebuttal, the reviewers discussed their concerns about this paper. Ultimately, a consensus was not reached, as reviewer #3 feels that some of her/his concerns were not properly addressed in the authors' feedback. The other reviewers are positive about the paper (especially thanks to the promising experimental results), but they share one of the concerns of reviewer #3, i.e., the definition of the "optimal f-divergence" and the convergence properties of the proposed approach. I agree with them that the paper has merits and the ideas contained in the paper are interesting, so I propose to accept it, but I recommend that the authors take the issues raised in the reviews seriously and address them carefully in the final version of the paper.

