Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review