Veagle: Advancements in Multimodal Representation Learning