Towards Safe Semi-Supervised Learning for Multivariate Performance Measures