Generative AI
The Download: unpacking OpenAI Q* hype, and X's financial woes
While we still don't know all the details, there have been reports that researchers at OpenAI had made a "breakthrough" in AI that alarmed staff members. The claim is that they came up with a new way to make powerful AI systems and had created a new model, called Q* (pronounced Q star), that was able to perform grade-school level math. Some at OpenAI reportedly believe this could be a breakthrough in the company's quest to build artificial general intelligence, a much-hyped concept of an AI system that is smarter than humans. And why is grade-school math such a big deal? Our senior AI reporter Melissa Heikkilรค called some experts to find out how big of a deal any such breakthrough would really be.
Unpacking the hype around OpenAI's rumored new Q* model
While we still don't know all the details, there have been reports that researchers at OpenAI had made a "breakthrough" in AI that had alarmed staff members. Reuters and The Information both report that researchers had come up with a new way to make powerful AI systems and had created a new model, called Q* (pronounced Q star), that was able to perform grade-school-level math. According to the people who spoke to Reuters, some at OpenAI believe this could be a milestone in the company's quest to build artificial general intelligence, a much-hyped concept referring to an AI system that is smarter than humans. The company declined to comment on Q*. Social media is full of speculation and excessive hype, so I called some experts to find out how big a deal any breakthrough in math and AI would really be.
The real story of the OpenAI debacle is the tyranny of big tech Sarah Radsch
The theatrics of OpenAI's seeming implosion amid the firing of its CEO and co-founder Sam Altman, Microsoft's dramatic offer to poach its top executives and staff, and Altman's triumphant return following the ouster of the board has all the trappings of a Hollywood blockbuster. But the drama unfolding should put the spotlight on the tyranny of the tech titans that control critical aspects of the AI ecosystem. OpenAI has developed some of the most advanced large-language models and pioneering artificial-intelligence products, such as the text generator ChatGPT and image generator Dall-E, which have been responsible for making generative AI into a household term and discussion about AI risks into dinnertime conversation. Although OpenAI is in the spotlight, however, Microsoft has played a leading role in the unfolding drama. Microsoft swooped in to scoop up the ousted executives and create a new AI research division for Altman to lead, with hundreds of staff reportedly ready to follow them.
Finding value in generative AI for financial services
According to a McKinsey report, generative AI could add $2.6 trillion to $4.4 trillion annually in value to the global economy. The banking industry was highlighted as among sectors that could see the biggest impact (as a percentage of their revenues) from generative AI. The technology "could deliver value equal to an additional $200 billion to $340 billion annually if the use cases were fully implemented," says the report. For businesses from every sector, the current challenge is to separate the hype that accompanies any new technology from the real and lasting value it may bring. This is a pressing issue for firms in financial services.
Real Customization or Just Marketing: Are Customized Versions of Chat GPT Useful?
Garrido-Merchรกn, Eduardo C., Arroyo-Barrigรผete, Jose L., Borrรกs-Pala, Francisco, Escobar-Torres, Leandro, de Ibarreta, Carlos Martรญnez, Ortiz-Lozano, Jose Marรญa, Rua-Vieites, Antonio
Large Language Models (LLMs), as the case of OpenAI ChatGPT-4 Turbo, are revolutionizing several industries, including higher education. In this context, LLMs can be personalized through a fine-tuning process to meet the student demands on every particular subject, like statistics. Recently, OpenAI has launched the possibility to fine-tune their model with a natural language web interface, enabling the possibility to create customized GPT version deliberately conditioned to meet the demands of a specific task. The objective of this research is to assess the potential of the customized GPTs that have recently been launched by OpenAI. After developing a Business Statistics Virtual Professor (BSVP), tailored for students at the Universidad Pontificia Comillas, its behavior was evaluated and compared with that of ChatGPT-4 Turbo. The results lead to several conclusions. Firstly, a substantial modification in the style of communication was observed. Following the instructions it was trained with, BSVP provided responses in a more relatable and friendly tone, even incorporating a few minor jokes. Secondly, and this is a matter of relevance, when explicitly asked for something like, "I would like to practice a programming exercise similar to those in R practice 4," BSVP was capable of providing a far superior response: having access to contextual documentation, it could fulfill the request, something beyond ChatGPT-4 Turbo's capabilities. On the downside, the response times were generally higher. Lastly, regarding overall performance, quality, depth, and alignment with the specific content of the course, no statistically significant differences were observed in the responses between BSVP and ChatGPT-4 Turbo. It appears that customized assistants trained with prompts present advantages as virtual aids for students, yet they do not constitute a substantial improvement over ChatGPT-4 Turbo.
chatGPT for generating questions and assessments based on accreditations
This research aims to take advantage of artificial intelligence techniques in producing students assessment that is compatible with the different academic accreditations of the same program. The possibility of using generative artificial intelligence technology was studied to produce an academic accreditation compliant test the National Center for Academic Accreditation of Kingdom of Saudi Arabia and Accreditation Board for Engineering and Technology. A novel method was introduced to map the verbs used to create the questions introduced in the tests. The method allows a possibility of using the generative artificial intelligence technology to produce and check the validity of questions that measure educational outcomes. A questionnaire was distributed to ensure that the use of generative artificial intelligence to create exam questions is acceptable by the faculty members, as well as to ask about the acceptance of assistance in validating questions submitted by faculty members and amending them in accordance with academic accreditations. The questionnaire was distributed to faculty members of different majors in the Kingdom of Saudi Arabias universities. one hundred twenty responses obtained with eight five percentile approval percentage for generate complete exam questions by generative artificial intelligence . Whereas ninety eight percentage was the approval percentage for editing and improving already existed questions.
Student Mastery or AI Deception? Analyzing ChatGPT's Assessment Proficiency and Evaluating Detection Strategies
Wang, Kevin, Akins, Seth, Mohammed, Abdallah, Lawrence, Ramon
Generative AI systems such as ChatGPT have a disruptive effect on learning and assessment. Computer science requires practice to develop skills in problem solving and programming that are traditionally developed using assignments. Generative AI has the capability of completing these assignments for students with high accuracy, which dramatically increases the potential for academic integrity issues and students not achieving desired learning outcomes. This work investigates the performance of ChatGPT by evaluating it across three courses (CS1,CS2,databases). ChatGPT completes almost all introductory assessments perfectly. Existing detection methods, such as MOSS and JPlag (based on similarity metrics) and GPTzero (AI detection), have mixed success in identifying AI solutions. Evaluating instructors and teaching assistants using heuristics to distinguish between student and AI code shows that their detection is not sufficiently accurate. These observations emphasize the need for adapting assessments and improved detection methods.
Generative AI and US Intellectual Property Law
The rapidity with which generative AI has been adopted and advanced has raised legal and ethical questions related to the impact on artists rights, content production, data collection, privacy, accuracy of information, and intellectual property rights. Recent administrative and case law challenges have shown that generative AI software systems do not have independent intellectual property rights in the content that they generate. It remains to be seen whether human content creators can retain their intellectual property rights against generative AI software, its developers, operators, and owners for the misappropriation of the work of human creatives, given the metes and bounds of existing law. Early signs from various courts are mixed as to whether and to what degree the results generated by AI models meet the legal standards of infringement under existing law.
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Avrahami, Omri, Hertz, Amir, Vinker, Yael, Arar, Moab, Fruchter, Shlomi, Fried, Ohad, Cohen-Or, Daniel, Lischinski, Dani
Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, these models struggle with generation of consistent characters, a crucial aspect for numerous real-world applications such as story visualization, game development asset design, advertising, and more. Current methods typically rely on multiple pre-existing images of the target character or involve labor-intensive manual processes. In this work, we propose a fully automated solution for consistent character generation, with the sole input being a text prompt. We introduce an iterative procedure that, at each stage, identifies a coherent set of images sharing a similar identity and extracts a more consistent identity from this set. Our quantitative analysis demonstrates that our method strikes a better balance between prompt alignment and identity consistency compared to the baseline methods, and these findings are reinforced by a user study. To conclude, we showcase several practical applications of our approach. Project page is available at https://omriavrahami.com/the-chosen-one
The Guardian view on OpenAI's board shake-up: changes deliver more for shareholders than for humanity Editorial
In the 1983 movie WarGames, the US defence department runs a superintelligent central computer that is hacked into by a teenager, who unwittingly almost causes a nuclear Armageddon. The end of the world is averted when the computer, known as Joshua, learns, after playing tic-tac-toe with the teenager, that nuclear war cannot have a winner. The insight causes him to rescind missile launch orders with the comment: "A strange game. The only winning move is not to play." Joshua embodied the idea that a superintelligent AI would have an anthropomorphic mindset.