Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models
