A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks