Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence Modeling