Coarse-to-Fine Dual Encoders are Better Frame Identification Learners