Changing Model Behavior at Test-Time Using Reinforcement Learning

Odena, Augustus; Lawson, Dieterich; Olah, Christopher

Feb-24-2017–arXiv.org Machine Learning

A computer vision model operating on an embedded device may need to perform real-time inference; a translation model operating on a cell phone may wish to bound its average compute time in order to be power-efficient. In these cases, there is often a tension between satisfying the constraint and achieving acceptable model performance. These constraints need not be restricted to speed and accuracy, but can reflect preferences for model simplicity or other desiderata. One way to deal with constraints is to build them into models explicitly at training time. This has two major disadvantages: First, it requires manually designing and retraining a new model for each use case.

artificial intelligence, machine learning, reinforcement learning

Genre:
- Research Report (0.42)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.41)
