A single $T$-gate makes distribution learning hard