Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models