Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models

Feb-18-2026, 04:41:38 GMT–Neural Information Processing Systems

The predominant de facto paradigm of testing ML models relies on either using only held-out data to compute aggregate evaluation metrics or by assessing the performance on different subgroups.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Feb-18-2026, 04:41:38 GMT

Conferences PDF

Country:
- North America > United States (0.04)
- Europe > United Kingdom
  - Wales (0.04)
  - Scotland (0.04)
  - England
    - Cambridgeshire > Cambridge (0.04)
    - West Midlands (0.04)
    - East Midlands (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.67)

Industry:
- Education (1.00)
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Consumer Health (0.92)
  - Diagnostic Medicine (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (1.00)
  - Cognitive Science (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Performance Analysis > Accuracy (0.94)
    - Statistical Learning (0.93)

Duplicate Docs Excel Report

Title
Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models

Similar Docs Excel Report more

Title	Similarity	Source
None found