Enhancing Deployment-Time Predictive Model Robustness for Code Analysis and Optimization
Wang, Huanting, Lenihan, Patrick, Wang, Zheng
–arXiv.org Artificial Intelligence
Supervised machine learning techniques have shown promising results in code analysis and optimization problems. However, a learning-based solution can be brittle because minor changes in hardware or application workloads -- such as facing a new CPU architecture or code pattern -- may jeopardize decision accuracy, ultimately undermining model robustness. We introduce Prom, an open-source library to enhance the robustness and performance of predictive models against such changes during deployment. Prom achieves this by using statistical assessments to identify test samples prone to mispredictions and using feedback on these samples to improve a deployed model. We showcase Prom by applying it to 13 representative machine learning models across 5 code analysis and optimization tasks. Our extensive evaluation demonstrates that Prom can successfully identify an average of 96% (up to 100%) of mispredictions. By relabeling up to 5% of the Prom-identified samples through incremental learning, Prom can help a deployed model achieve a performance comparable to that attained during its model training phase.
arXiv.org Artificial Intelligence
Dec-31-2024
- Country:
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom > England
- West Yorkshire > Leeds (0.04)
- Ireland > Leinster
- North America
- Canada > Alberta
- United States
- Arizona > Maricopa County
- Phoenix (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Nevada > Clark County
- Las Vegas (0.05)
- New York > New York County
- New York City (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Arizona > Maricopa County
- Europe
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (0.93)
- Technology:
- Information Technology
- Artificial Intelligence > Machine Learning
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (1.00)
- Data Science > Data Mining (1.00)
- Modeling & Simulation (1.00)
- Artificial Intelligence > Machine Learning
- Information Technology