We build strong baseline models upon large pretrained language models, including GPT -3 and T5. Our benchmark is an ongoing effort, and this paper presents real-time evaluation results over the past year.
The rapid evolution of large language models (LLMs) has expanded their capabilities across various data modalities, extending from well-established image data to increasingly popular graph data.