Not enough data to create a plot.
Try a different view from the menu above.
Trofin, Mircea
Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization
Marinov, Teodor V., Agarwal, Alekh, Trofin, Mircea
This work studies a Reinforcement Learning (RL) problem in which we are given a set of trajectories collected with K baseline policies. Each of these policies can be quite suboptimal in isolation, and have strong performance in complementary parts of the state space. The goal is to learn a policy which performs as well as the best combination of baselines on the entire state space. We propose a simple imitation learning based algorithm, show a sample complexity bound on its accuracy and prove that the the algorithm is minimax optimal by showing a matching lower bound. Further, we apply the algorithm in the setting of machine learning guided compiler optimization to learn policies for inlining programs with the objective of creating a small binary. We demonstrate that we can learn a policy that outperforms an initial policy learned via standard RL through a few iterations of our approach.
The Next 700 ML-Enabled Compiler Optimizations
VenkataKeerthy, S., Jain, Siddharth, Kalvakuntla, Umesh, Gorantla, Pranav Sai, Chitale, Rajiv Shailesh, Brevdo, Eugene, Cohen, Albert, Trofin, Mircea, Upadrasta, Ramakrishna
There is a growing interest in enhancing compiler optimizations with ML models, yet interactions between compilers and ML frameworks remain challenging. Some optimizations require tightly coupled models and compiler internals,raising issues with modularity, performance and framework independence. Practical deployment and transparency for the end-user are also important concerns. We propose ML-Compiler-Bridge to enable ML model development within a traditional Python framework while making end-to-end integration with an optimizing compiler possible and efficient. We evaluate it on both research and production use cases, for training and inference, over several optimization problems, multiple compilers and its versions, and gym infrastructures.