SAIL: Self-Improving Efficient Online Alignment of Large Language Models