Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones
