DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph

Oct-10-2025, 21:35:56 GMT–Neural Information Processing Systems

The current paradigm of evaluating Large Language Models (LLMs) through static benchmarks comes with significant limitations, such as vulnerability to data contamination and a lack of adaptability to the evolving capabilities of LLMs.

arxiv preprint arxiv, language model, reasoning graph, (12 more...)

Neural Information Processing Systems

Oct-10-2025, 21:35:56 GMT

Conferences PDF

Add feedback

Country:
- South America > Colombia
  - Meta Department > Villavicencio (0.04)
- North America
  - United States > California
    - Santa Clara County > Palo Alto (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Ukraine > Kyiv Oblast
    - Kyiv (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia
  - Singapore (0.04)
  - Indonesia > Bali (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
  - China > Guangxi Province
    - Nanning (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Law (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph

Similar Docs Excel Report more

Title	Similarity	Source
None found