RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference

Open in new window