ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
