Asymptotically Optimal Multi-Armed Bandit Policies under a Cost Constraint