Failure-aware Policy Learning for Self-assessable Robotics Tasks