Are medical AI devices evaluated appropriately?