Test & Evaluation Best Practices for Machine Learning-Enabled Systems
Chandrasekaran, Jaganmohan, Cody, Tyler, McCarthy, Nicola, Lanus, Erin, Freeman, Laura
arXiv.org Artificial Intelligence
Machine learning (ML)-based software systems are rapidly gaining adoption across various domains, making it increasingly essential to ensure they perform as intended. This report presents best practices for the Test and Evaluation (T&E) of ML-enabled software systems across their lifecycle. We categorize the lifecycle of ML-enabled software systems into three stages: component, integration and deployment, and post-deployment. At the component stage, the primary objective is to test and evaluate the ML model as a standalone component. Next, in the integration and deployment stage, the goal is to evaluate an integrated ML-enabled system consisting of both ML and non-ML components. Finally, once the ML-enabled software system is deployed and operationalized, the T&E objective is to ensure the system performs as intended. Maintenance activities span the entire lifecycle and involve maintaining the various assets of ML-enabled software systems. Given the unique characteristics of these systems, their T&E is challenging. While significant research has been reported on T&E at the component stage, limited work has been reported on T&E in the remaining two stages. Furthermore, in many cases, systematic T&E strategies are lacking throughout the ML-enabled system's lifecycle. This leads practitioners to resort to ad hoc T&E practices, which can undermine user confidence in the reliability of ML-enabled software systems. New systematic testing approaches, adequacy measurements, and metrics are required to address the T&E challenges across all stages of the ML-enabled system lifecycle.
Oct-10-2023
- Country:
  - Asia
  - Europe > Russia
  - North America
    - Canada (0.04)
    - United States
      - California > San Francisco County
        - San Francisco (0.14)
      - Texas (0.04)
      - Virginia (0.05)
- Genre:
  - Overview (0.93)
  - Research Report > New Finding (0.67)
- Industry:
  - Automobiles & Trucks (0.93)
  - Government
  - Information Technology > Security & Privacy (1.00)
  - Transportation > Ground
    - Road (0.93)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning
        - Neural Networks > Deep Learning (1.00)
        - Performance Analysis > Accuracy (0.67)
        - Statistical Learning (0.92)
      - Robots > Autonomous Vehicles (0.68)
    - Software (1.00)