Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks