CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions