Towards Smarter Sensing: 2D Clutter Mitigation in RL-Driven Cognitive MIMO Radar