Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
