EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models

Open in new window