Beyond Words: Evaluating Large Language Models in Transportation Planning