Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models