Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction