Towards Trustworthy Amortized Bayesian Model Comparison