SPARTAN: A Sparse Transformer Learning Local Causation