Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark