Doubly Mild Generalization for Offline Reinforcement Learning