NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates Hexuan Deng 1 Min Zhang