Dual policy as self-model for planning