An Efficient Framework for Monitoring Subgroup Performance of Machine Learning Systems