From Language Models over Tokens to Language Models over Characters