ByteSpan: Information-Driven Subword Tokenisation