Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model