Developing a Tutoring Dialog Dataset to Optimize LLMs for Educational Use

Oct-24-2024–arXiv.org Artificial Intelligence

Recent advances in large language models (LLMs) have shown promise for scalable educational applications, but their use in dialog-based tutoring systems remains challenging due to the need for effective pedagogical strategies and the high costs associated with expert-curated datasets. Our study explores the use of smaller, more affordable LLMs for one-on-one tutoring in the context of solving reading comprehension problems. We developed a synthetic tutoring dialog dataset, evaluated by human teachers, and fine-tuned a smaller LLM using this dataset. Furthermore, we conducted an interactive experiment comparing the performance of the fine-tuned model with a larger model in real-world tutoring scenarios. Our results show that the fine-tuned model performs on par with the larger model but at a lower cost, demonstrating a viable, cost-effective approach for implementing LLM-based tutoring systems in educational settings.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

Oct-24-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.04)
- Europe > Russia (0.04)
- Africa (0.04)
- North America > United States
  - Washington > King County > Seattle (0.04)
- Asia
  - Middle East > Jordan (0.05)
  - Russia (0.04)
  - China (0.04)
  - Japan > Kyūshū & Okinawa
    - Kyūshū > Fukuoka Prefecture > Fukuoka (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Education
  - Educational Setting (1.00)
  - Educational Technology > Educational Software (0.54)
  - Curriculum > Subject-Specific Education (0.46)
  - Assessment & Standards > Student Performance (0.35)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found