How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?
Maity, Subhankar, Deroy, Aniket, Sarkar, Sudeshna
arXiv.org Artificial Intelligence
We evaluate the effectiveness of GPT-4 Turbo in generating educational questions from NCERT textbooks in zero-shot mode. Our study highlights GPT-4 Turbo's ability to generate questions that require higher-order thinking skills, especially at the "understanding" level according to Bloom's Revised Taxonomy. While we find a notable consistency between questions generated by GPT-4 Turbo and those assessed by humans in terms of complexity, there are occasional differences. Our evaluation also uncovers variations in how humans and machines evaluate question quality, with a trend inversely related to Bloom's Revised Taxonomy levels. These findings suggest that while GPT-4 Turbo is a promising tool for educational question generation, its efficacy varies across different cognitive levels, indicating a need for further refinement to fully meet educational standards.
Jun-21-2024
- Country:
  - Asia > India > West Bengal (0.15)
  - North America > United States > Texas (0.14)
- Genre:
  - Research Report > New Finding (0.48)
- Industry:
  - Education > Assessment & Standards (0.66)
  - Education > Educational Setting (0.65)
- Technology: