Learning Human Reaching Optimality Principles from Minimal Observation Inverse Reinforcement Learning