Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning