Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems

Open in new window