Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment