State Abstraction in MAXQ Hierarchical Reinforcement Learning