Evaluating the Effectiveness of Cost-Efficient Large Language Models in Benchmark Biomedical Tasks