Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization