Enhancing Advanced Visual Reasoning Ability of Large Language Models