Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling