Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation