Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models

Open in new window