Model Monitoring in the Absence of Labeled Data via Feature Attributions Distributions
arXiv.org Artificial Intelligence
Model monitoring involves analyzing AI algorithms once they have been deployed and detecting changes in their behaviour. This thesis explores the monitoring of machine learning (ML) models before their predictions impact real-world decisions or users. This step is characterized by one particular condition: the absence of labelled data at test time, which makes it challenging, and often impossible, to calculate performance metrics. The thesis is structured around two main themes: (i) AI alignment, measuring whether AI models behave in a manner consistent with human values, and (ii) performance monitoring, measuring whether the models achieve specific accuracy goals or desiderata. A common methodology unifies all its sections: it explores feature attribution distributions for both monitoring dimensions. The theoretical properties of these feature attribution explanations can be exploited to derive guarantees and insights for model monitoring.
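To make the idea of monitoring through feature attribution distributions concrete, here is a minimal sketch. It is not the thesis's implementation, and everything in it is an assumption made for illustration: the model is a linear one whose attribution for feature j is w_j * (x_j - mean_j), and the reference and new attribution distributions are compared with a hand-written two-sample Kolmogorov-Smirnov statistic, chosen here only for simplicity. The names `linear_attributions`, `ks_statistic` and `attribution_shift` are hypothetical. No labels are used at any point.

```python
import random


def linear_attributions(weights, baseline_means, x):
    """Attribution of each feature for a linear model: w_j * (x_j - mean_j).

    Assumption for this sketch: a linear model, so attributions have a
    closed form and no explanation library is needed.
    """
    return [w * (xj - m) for w, xj, m in zip(weights, x, baseline_means)]


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        v = min(a[i], b[j])
        # Advance both pointers past every value <= v, then compare the CDFs at v.
        while i < len(a) and a[i] <= v:
            i += 1
        while j < len(b) and b[j] <= v:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


def attribution_shift(weights, reference, new):
    """Per-feature KS statistic between reference and new attribution distributions."""
    n_features = len(weights)
    means = [sum(row[j] for row in reference) / len(reference) for j in range(n_features)]
    ref_attr = [linear_attributions(weights, means, x) for x in reference]
    new_attr = [linear_attributions(weights, means, x) for x in new]
    return [
        ks_statistic([r[j] for r in ref_attr], [n[j] for n in new_attr])
        for j in range(n_features)
    ]


# Synthetic demo (made-up data): both input features shift in the new sample,
# but the hypothetical model only uses the first one (its second weight is 0).
random.seed(0)
weights = [2.0, 0.0]
reference = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(500)]
new = [[random.gauss(3, 1), random.gauss(3, 1)] for _ in range(500)]
scores = attribution_shift(weights, reference, new)
print(scores)
```

In this toy setup the first score is large while the second is exactly zero: a shift in a feature the model ignores leaves the attribution distribution unchanged, whereas a test on the raw inputs would flag both features. Whether a given score should raise an alert depends on a threshold or significance test that this sketch does not choose.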
Jan-25-2025
- Country:
- North America
- Montserrat (0.04)
- United States
- West Virginia (0.04)
- North Carolina (0.04)
- District of Columbia > Washington (0.04)
- Texas > Travis County
- Austin (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.13)
- Illinois > Cook County
- Chicago (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California
- San Francisco County > San Francisco (0.13)
- Los Angeles County > Long Beach (0.13)
- Orange County > Irvine (0.04)
- Alameda County > Berkeley (0.04)
- Santa Clara County
- New York > New York County
- New York City (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Quebec > Montreal (0.04)
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- France (0.04)
- Germany (0.04)
- Italy > Sardinia (0.04)
- United Kingdom
- Northern Ireland (0.04)
- England
- Cambridgeshire > Cambridge (0.14)
- Oxfordshire > Oxford (0.04)
- Sweden
- Stockholm > Stockholm (0.04)
- Västerbotten County > Umeå (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Asia
- India (0.04)
- Middle East > Jordan (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Genre:
- Research Report
- Promising Solution (1.00)
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Law > Civil Rights & Constitutional Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (1.00)
- Education > Educational Setting (0.92)
- Government > Regional Government
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language > Explanation & Argumentation (1.00)
- Issues > Social & Ethical Issues (1.00)
- Representation & Reasoning > Expert Systems (0.67)
- Machine Learning
- Performance Analysis > Accuracy (1.00)
- Statistical Learning > Regression (0.93)
- Neural Networks > Deep Learning (0.93)
- Decision Tree Learning (0.68)