AITopics | conditional maximum entropy model

Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models

Neural Information Processing Systems | Apr-6-2023, 14:03:17 GMT

Training conditional maximum entropy models on massive data requires significant time and computational resources. In this paper, we investigate three common distributed training strategies: distributed gradient, majority voting ensembles, and parameter mixtures. We analyze the worst-case runtime and resource costs of each and present a theoretical foundation for the convergence of parameters under parameter mixtures, the most efficient strategy. We present large-scale experiments comparing the different strategies and demonstrate that parameter mixtures over independent models use fewer resources and achieve loss comparable to that of standard approaches.
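The parameter-mixture strategy named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names (`train_maxent`, `parameter_mixture`), the synthetic data, the uniform mixture weights, and the hyperparameters are all assumptions chosen for illustration. Each shard trains its own L2-regularized softmax (conditional maximum entropy) model with no communication, and the resulting weight matrices are averaged once at the end.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_maxent(X, y, n_classes, l2=1e-2, lr=0.5, steps=300):
    """Fit one softmax-regression model by batch gradient descent (illustrative)."""
    W = np.zeros((X.shape[1], n_classes))
    Y = np.eye(n_classes)[y]  # one-hot labels
    for _ in range(steps):
        P = softmax(X @ W)
        grad = X.T @ (P - Y) / len(X) + 2 * l2 * W  # log-loss gradient + L2 term
        W -= lr * grad
    return W

def parameter_mixture(X, y, n_classes, n_shards=4, **kw):
    """Train one model per shard independently, then average weights uniformly."""
    # Assumes the rows are already in random order; shuffle real data first.
    shards = np.array_split(np.arange(len(X)), n_shards)
    weights = [train_maxent(X[idx], y[idx], n_classes, **kw) for idx in shards]
    return np.mean(weights, axis=0)

def log_loss(W, X, y):
    P = softmax(X @ W)
    return -np.mean(np.log(P[np.arange(len(y)), y]))

# Synthetic data (hypothetical): 2000 examples, 5 features, 3 classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 5))
true_W = rng.normal(size=(5, 3))
y = np.array([rng.choice(3, p=p) for p in softmax(X @ true_W)])

W_mix = parameter_mixture(X, y, n_classes=3)   # four independent shard models, averaged
W_full = train_maxent(X, y, n_classes=3)       # single model on all data, for comparison
print(log_loss(W_mix, X, y), log_loss(W_full, X, y))
```

In this sketch the only cross-worker step is the final average, which is what makes the strategy cheap in network terms compared with exchanging gradients at every iteration.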

conditional maximum entropy model, efficient large-scale, parameter mixture

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Maximum Entropy (0.69)

Deep Adaptive Multi-Intention Inverse Reinforcement Learning

Bighashdel, Ariyan, Meletis, Panagiotis, Jancura, Pavol, Dubbelman, Gijs

arXiv.org Artificial Intelligence | Jul-14-2021

This paper presents a deep Inverse Reinforcement Learning (IRL) framework that can learn an a priori unknown number of nonlinear reward functions from unlabeled expert demonstrations. For this purpose, we employ tools from Dirichlet processes and propose an adaptive approach that simultaneously accounts for both the complexity and the unknown number of reward functions. Using the conditional maximum entropy principle, we model the experts' multi-intention behaviors as a mixture of latent intention distributions and derive two algorithms to estimate the parameters of the deep reward network along with the number of experts' intentions from unlabeled demonstrations. The proposed algorithms are evaluated on three benchmarks, two of which have been specifically extended in this study for multi-intention IRL, and compared with well-known baselines. We demonstrate through several experiments the advantages of our algorithms over existing approaches and the benefits of inferring the number of experts' intentions online rather than fixing it beforehand.

demonstration, intention, reward function, (14 more...)

arXiv.org Artificial Intelligence

2107.06692

Country:

Europe > Netherlands > North Brabant > Eindhoven (0.04)
Asia > Middle East > Jordan (0.04)
Europe > France (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models

McDonald, Ryan, Mohri, Mehryar, Silberman, Nathan, Walker, Dan, Mann, Gideon S.

Neural Information Processing Systems | Mar-19-2020, 21:02:08 GMT

Training conditional maximum entropy models on massive data requires significant time and computational resources. In this paper, we investigate three common distributed training strategies: distributed gradient, majority voting ensembles, and parameter mixtures. We analyze the worst-case runtime and resource costs of each and present a theoretical foundation for the convergence of parameters under parameter mixtures, the most efficient strategy. We present large-scale experiments comparing the different strategies and demonstrate that parameter mixtures over independent models use fewer resources and achieve loss comparable to that of standard approaches. Papers published at the Neural Information Processing Systems Conference.

conditional maximum entropy model, efficient large-scale, parameter mixture

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Maximum Entropy (0.69)

Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models

McDonald, Ryan, Mohri, Mehryar, Silberman, Nathan, Walker, Dan, Mann, Gideon S.

Neural Information Processing Systems | Dec-31-2009

Training conditional maximum entropy models on massive data requires significant time and computational resources. In this paper, we investigate three common distributed training strategies: distributed gradient, majority voting ensembles, and parameter mixtures. We analyze the worst-case runtime and resource costs of each and present a theoretical foundation for the convergence of parameters under parameter mixtures, the most efficient strategy. We present large-scale experiments comparing the different strategies and demonstrate that parameter mixtures over independent models use fewer resources and achieve loss comparable to that of standard approaches.
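For readers unfamiliar with the terminology, the objects the abstract refers to can be written down in standard notation. The symbols below (feature map $\Phi$, regularization weight $\lambda$, mixture weights $\mu_k$, shard count $p$) are assumptions chosen for illustration and are not taken from this listing; the paper itself may use a different parameterization.

```latex
% Conditional maximum entropy model over labels y given input x
p_{w}(y \mid x) = \frac{\exp\bigl(w \cdot \Phi(x, y)\bigr)}{Z(x)},
\qquad
Z(x) = \sum_{y'} \exp\bigl(w \cdot \Phi(x, y')\bigr)

% L2-regularized training objective on a sample of m labeled examples
F(w) = \lambda \lVert w \rVert^{2} - \frac{1}{m} \sum_{i=1}^{m} \log p_{w}(y_i \mid x_i)

% Parameter mixture of p independently trained weight vectors w_1, ..., w_p
w_{\mu} = \sum_{k=1}^{p} \mu_k \, w_k,
\qquad
\mu_k \ge 0, \quad \sum_{k=1}^{p} \mu_k = 1
```

With uniform weights $\mu_k = 1/p$ the mixture reduces to a plain average of the per-shard solutions.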

artificial intelligence, machine learning, natural language, (13 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Maximum Entropy (0.63)
