Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays
Hua, Haowei, Jiao, Hong, Wang, Xinyi
–arXiv.org Artificial Intelligence
The majority of summarized essays fall well below the 512 - token limit (marked by the red dashed line), indicating that the summarization process effectively compressed the original texts while maintaining consistency in length. The smooth decline beyond 300 tokens and the sparse occurrence of samples approaching the upper l imit suggest that v ery few summaries exceeded the intended compression threshold. Overall, this distribution demonstrates that the GPT - 5 - mini summarizer produced concise and length - stable representations, ensuring efficient model input handling and minimizing the risk of truncation in downstream processing.
arXiv.org Artificial Intelligence
Nov-20-2025
- Country:
- North America > United States
- California > Los Angeles County
- Long Beach (0.04)
- Colorado > Denver County
- Denver (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Maryland > Prince George's County
- College Park (0.14)
- New Jersey > Mercer County
- Princeton (0.04)
- New York (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- California > Los Angeles County
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Technology: