Reviews: Exponentially Weighted Imitation Learning for Batched Historical Data
–Neural Information Processing Systems
A method for learning deep policies from data recorded in demonstrations is introduced. The method uses exponentially weighted learning that can learn policies from data generated by another policy The proposed approach is interesting and well presented. Theat would be even more interesting than this presented imitation learning scheme, however the paper gives the introduction, background and discussion for that future work. How is generated the data for the HFO environment? Why is not used PG, PGIS in the experiments with Torcs and king of Glory?
Neural Information Processing Systems
Oct-7-2024, 09:41:15 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.78)
- Robots (0.66)
- Information Technology > Artificial Intelligence