GPT-4 is judged more human than humans in displaced and inverted Turing tests

Rathi, Ishika, Taylor, Sydney, Bergen, Benjamin K., Jones, Cameron R.

arXiv.org Artificial Intelligence 

Everyday AI detection requires differentiating between people and AI in informal, online conversations. In many cases, people will not interact directly with AI systems but instead read conversations between AI systems and other people. We measured how well people and large language models can discriminate using Figure 1: A summary of our experimental design. Transcripts two modified versions of the Turing test: inverted were sampled from an interactive Turing test, and displaced. GPT-3.5, GPT-4, and where a human judge interrogates a witness to determine displaced human adjudicators judged whether if they are human or AI. In an inverted Turing test, an agent was human or AI on the basis of a we present transcripts to AI models, who judge whether Turing test transcript. We found that both AI the same witnesses are human or AI. In a displaced and displaced human judges were less accurate Turing test, a separate group of human participants read than interactive interrogators, with below the same transcripts and make this judgement.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found