IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition