EVALOOOP: A Self-Consistency-Centered Framework for Assessing Large Language Model Robustness in Programming

Open in new window