A Generalized Reinforcement Learning Algorithm for Online 3D Bin-Packing