Masked Frequency Modeling for Self-Supervised Visual Pre-Training