Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control