Molecular Joint Representation Learning via Multi-modal Information