APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding

Open in new window