CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations