When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification