Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning