Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection