Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System