FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models