NN-LUT: Neural Approximation of Non-Linear Operations for Efficient Transformer Inference

Open in new window