Functional trustworthiness of AI systems by statistically valid testing