Task-aware world model learning with meta weighting via bi-level optimization