Clustering-Based Weight Orthogonalization for Stabilizing Deep Reinforcement Learning