Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory