$\texttt{metabench}$ -- A Sparse Benchmark to Measure General Ability in Large Language Models