Analyzing Examinee Comments using DistilBERT and Machine Learning to Ensure Quality Control in Exam Content
–arXiv.org Artificial Intelligence
To ensure that the items are of sufficient quality to be included in the test, multiple rounds of item review are conducted both before and after the test is administered. Typically, once the testing period has ended, psychometricians will analyze the response data using var ious methods to identify any items that require further review based on their statistical properties (e.g., p - value, point - biserial correlation, etc.). For example, one item with a low point - biserial correlation value can be flagged for further review due to poor discrimination. While flagging items using their statistics can help identify potentially problematic items, it does not guarantee that the flagged items actually contain issues. Therefore, subject matter experts (SMEs) need to review the flagged items to determine whether they indeed pose any problems.
arXiv.org Artificial Intelligence
Apr-10-2025
- Country:
- North America > United States
- California > San Francisco County
- San Francisco (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Oregon > Washington County
- Beaverton (0.04)
- California > San Francisco County
- North America > United States
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Leisure & Entertainment (0.46)
- Media > Film (0.68)
- Technology: