ChatGPT creator confirms a bug allowed some users to snoop on others' chat histories

Daily Mail - Science & tech 

OpenAI states that its ChatGPT model, trained using a machine learning technique called Reinforcement Learning from Human Feedback (RLHF), can simulate dialogue, answer follow-up questions, admit mistakes, challenge incorrect premises and reject inappropriate requests. Initial development involved human AI trainers providing the model with conversations in which they played both sides: the user and an AI assistant. The version of the bot available for public testing attempts to understand questions posed by users and responds with in-depth answers resembling human-written text in a conversational format. A tool like ChatGPT could be used in real-world applications such as digital marketing, online content creation and answering customer service queries or, as some users have found, even to help debug code. The bot can respond to a wide range of questions while imitating human speaking styles.