Comparing Hallucination Detection Metrics for Multilingual Generation