SS1: Accelerating Inference with Fast and Expressive Sketch Structured Transform

Neural Information Processing Systems 

Structured Transform(SS1), an expressive and GPU-friendly operator that accelerates inference.