ODIA: Oriented Distillation for Inline Acceleration of LLM-based Function Calling
