The Ability of Image-Language Explainable Models to Resemble Domain Expertise