A Comparative Analysis of Faithfulness Metrics and Humans in Citation Evaluation