Supplementary material for CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence

1 Dataset Documentations
1.1 Hosted URLs
–Neural Information Processing Systems
This task assesses the LLMs' ability to evaluate the severity of [...] This task tests the LLMs' capability to [...]

The dataset consists of 5 TSV files, each corresponding to a different task. Each file includes a "Prompt" column used to pose questions to the LLM. Most files also include a "GT" column that [...]

[...] of LLMs to understand and analyze various aspects of open-source CTI. The dataset includes URLs indicating the sources from which the data was collected. A permanent DOI is associated with the dataset: AI4Sec (2024).
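The file layout described above (one TSV per task, a "Prompt" column, and a "GT" column in most files) can be read with a few lines of Python. The following is a minimal sketch, not the authors' loader: the function name `load_task_rows` is hypothetical, the sample rows are made-up placeholders rather than dataset content, and it assumes each file has a header row and parses cleanly with the standard `csv` module.

```python
import csv
import io


def load_task_rows(stream):
    """Return a list of (prompt, ground_truth) pairs from one task TSV.

    `stream` is any text file object. The column names "Prompt" and "GT"
    come from the dataset description; everything else is an assumption.
    """
    reader = csv.DictReader(stream, delimiter="\t")
    rows = []
    for record in reader:
        prompt = record["Prompt"]
        # Not every file has a "GT" column, so fall back to None.
        ground_truth = record.get("GT")
        rows.append((prompt, ground_truth))
    return rows


# Made-up placeholder content standing in for a real task file.
sample_with_gt = "Prompt\tGT\nexample question 1\texample answer 1\n"
sample_without_gt = "Prompt\nexample question 2\n"

rows_with_gt = load_task_rows(io.StringIO(sample_with_gt))
rows_without_gt = load_task_rows(io.StringIO(sample_without_gt))
```

To read a downloaded file instead of an in-memory sample, the same function can be given a handle opened with `open(path, newline="", encoding="utf-8")`, where `path` points to one of the five TSV files. If the prompts contain embedded quotes or line breaks, the default `csv` quoting rules may need adjusting.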
Oct-10-2025, 03:35:26 GMT