Using General Value Functions to Learn Domain-Backed Inventory Management Policies