Towards falsifiable interpretability research