Audio tagging with noisy labels and minimal supervision