Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions