Transferable Post-training via Inverse Value Learning