Efficacy of Large Language Models in Systematic Reviews