Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models