LAVA: Long-horizon Visual Action based Food Acquisition