ContextNav: Towards Agentic Multimodal In-Context Learning