Supplementary material for CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence
Neural Information Processing Systems
This task assesses the LLMs' ability to evaluate the severity of …
This task tests the LLMs' capability to …
The dataset consists of 5 TSV files, each corresponding to a different task. The files include a "Prompt" column used to pose questions to the LLM. Most files also include a "GT" column that … The dataset includes URLs indicating the sources from which the data was collected. A permanent DOI is associated with the dataset: AI4Sec (2024).
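As a rough illustration of the file layout described above, the following sketch reads one tab-separated task file and yields its prompts together with the ground truth when a "GT" column is present. The function name `read_task_rows` and the sample rows are hypothetical and made up for this example; only the "Prompt" and "GT" column names come from the description. It also assumes the files parse with Python's default `csv` quoting rules, which may need adjusting for the real data.

```python
import csv
import io


def read_task_rows(handle):
    """Yield (prompt, ground_truth) pairs from a tab-separated task file.

    ground_truth is None when the file has no "GT" column.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    for row in reader:
        # "Prompt" is assumed to be present; "GT" is optional.
        yield row["Prompt"], row.get("GT")


# In-memory stand-in for one task file (illustrative rows, not benchmark data).
sample = io.StringIO(
    "Prompt\tGT\n"
    "What is 2 + 2?\t4\n"
    "Name a primary color.\tred\n"
)
rows = list(read_task_rows(sample))

# A file without a "GT" column yields None as the ground truth.
no_gt_rows = list(read_task_rows(io.StringIO("Prompt\nhello\n")))
```

To read a file from disk, the same function could be passed a handle opened with `open(path, newline="", encoding="utf-8")`, where `path` points at one of the five TSV files.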