How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?
Maity, Subhankar, Deroy, Aniket, Sarkar, Sudeshna
–arXiv.org Artificial Intelligence
We evaluate the effectiveness of GPT-4 Turbo in generating educational questions from NCERT textbooks in zero-shot mode. Our study highlights GPT-4 Turbo's ability to generate questions that require higher-order thinking skills, especially at the "understanding" level according to Bloom's Revised Taxonomy. While we find a notable consistency between questions generated by GPT-4 Turbo and those assessed by humans in terms of complexity, there are occasional differences. Our evaluation also uncovers variations in how humans and machines evaluate question quality, with a trend inversely related to Bloom's Revised Taxonomy levels. These findings suggest that while GPT-4 Turbo is a promising tool for educational question generation, its efficacy varies across different cognitive levels, indicating a need for further refinement to fully meet educational standards.
arXiv.org Artificial Intelligence
Jun-21-2024
- Country:
- North America > United States
- District of Columbia > Washington (0.05)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Switzerland (0.04)
- Italy (0.04)
- Asia
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- India > West Bengal
- Kharagpur (0.05)
- Myanmar > Tanintharyi Region
- North America > United States
- Genre:
- Instructional Material (0.68)
- Research Report > New Finding (0.48)
- Industry:
- Education
- Assessment & Standards (0.66)
- Educational Setting (0.65)
- Education
- Technology: