Direct Behavior Optimization: Unlocking the Potential of Lightweight LLMs