NOTA: Multimodal Music Notation Understanding for Visual Large Language Model