Towards Sustainable NLP: Insights from Benchmarking Inference Energy in Large Language Models