Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models

Open in new window