Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models