FLAME: Financial Large-Language Model Assessment and Metrics Evaluation