Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays

Nov-20-2025–arXiv.org Artificial Intelligence

The majority of summarized essays fall well below the 512 - token limit (marked by the red dashed line), indicating that the summarization process effectively compressed the original texts while maintaining consistency in length. The smooth decline beyond 300 tokens and the sparse occurrence of samples approaching the upper l imit suggest that v ery few summaries exceeded the intended compression threshold. Overall, this distribution demonstrates that the GPT - 5 - mini summarizer produced concise and length - stable representations, ensuring efficient model input handling and minimizing the risk of truncation in downstream processing.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Nov-20-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States > Maryland (0.28)

Genre:
- Research Report (1.00)

Industry:
- Food & Agriculture > Agriculture (0.68)
- Education
  - Assessment & Standards > Student Performance (0.73)
  - Educational Technology > Educational Software
    - Computer-Aided Assessment (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found