Pre$^3$: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation

Open in new window