Domain-Aware RAG: MoL-Enhanced RL for Efficient Training and Scalable Retrieval