Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits

Open in new window