Gradient-Guided Importance Sampling for Learning Binary Energy-Based Models