Generative AI
A Survey on ChatGPT: AI-Generated Contents, Challenges, and Solutions
Wang, Yuntao, Pan, Yanghe, Yan, Miao, Su, Zhou, Luan, Tom H.
With the widespread use of large artificial intelligence (AI) models such as ChatGPT, AI-generated content (AIGC) has garnered increasing attention and is leading a paradigm shift in content creation and knowledge representation. AIGC uses generative large AI algorithms to assist or replace humans in creating massive, high-quality, and human-like content at a faster pace and lower cost, based on user-provided prompts. Despite the recent significant progress in AIGC, security, privacy, ethical, and legal challenges still need to be addressed. This paper presents an in-depth survey of working principles, security and privacy threats, state-of-the-art solutions, and future challenges of the AIGC paradigm. Specifically, we first explore the enabling technologies, general architecture of AIGC, and discuss its working modes and key characteristics. Then, we investigate the taxonomy of security and privacy threats to AIGC and highlight the ethical and societal implications of GPT and AIGC technologies. Furthermore, we review the state-of-the-art AIGC watermarking approaches for regulatable AIGC paradigms regarding the AIGC model and its produced content. Finally, we identify future challenges and open research directions related to AIGC.
Science in the Era of ChatGPT, Large Language Models and Generative AI: Challenges for Research Ethics and How to Respond
Since the release of popular large language models (LLMs) such as ChatGPT, the transformative impact of artificial intelligence (AI) on broader society has been unprecedented. This is particularly alarming for science and its conquest of truth (Chomsky et al., 2023). Generative AI and, particularly, conversational AI based on language models has set new ethical dilemmas for knowledge, epistemology and research practice. From authorship, to misinformation, biases, fairness and safety of interactions with human subjects, research ethics boards need to adapt to this new era in order to protect research integrity and set high-quality ethical standards for research conduct (van Dis et al., 2023). This paper focuses on reviewing these challenges with the aim of laying foundations for a timely and effective response. ChatGPT is an AI chatbot released in November 2022 by OpenAI. It is a Generative Pre-trained Transformer (GPT), a type of artificial deep neural network with a number of parameters in the order of billions. It is designed to process sequential input data, i.e. natural language, without labeling (self-supervised learning), but with remarkable capabilities for parallelization that significantly reduce training time. The model is further enhanced by a combination of supervised and reinforcement learning based on past conversations as well as human feedback to fine-tune the model and its responses (Stiennon et al., 2020; Gao,
Tech Companies' Friendly New Strategy to Destroy One Another
More than a decade ago, in a prescient essay for Scientific American, the inventor of the World Wide Web denounced what Facebook and other tech giants were doing to his signature invention. "Why should you care?" Tim Berners-Lee wrote at the time. "Because the Web is yours." These companies, he warned, were restructuring the web itself, turning an expanse of interconnected websites all built on the same open infrastructure into a series of "fragmented islands" where users were kept hostage. On Facebook's island, he wrote, people give over their entire digital life for the chance to connect with their friends, but have no way to transfer their information to any other platform.
Multilingual Tourist Assistance using ChatGPT: Comparing Capabilities in Hindi, Telugu, and Kannada
This research investigates the effectiveness of ChatGPT, an AI language model by OpenAI, in translating English into Hindi, Telugu, and Kannada languages, aimed at assisting tourists in India's linguistically diverse environment. To measure the translation quality, a test set of 50 questions from diverse fields such as general knowledge, food, and travel was used. These were assessed by five volunteers for accuracy and fluency, and the scores were subsequently converted into a BLEU score. The BLEU score evaluates the closeness of a machine-generated translation to a human translation, with a higher score indicating better translation quality. The Hindi translations outperformed others, showcasing superior accuracy and fluency, whereas Telugu translations lagged behind. Human evaluators rated both the accuracy and fluency of translations, offering a comprehensive perspective on the language model's performance.
Beyond Reality: The Pivotal Role of Generative AI in the Metaverse
Chamola, Vinay, Bansal, Gaurang, Das, Tridib Kumar, Hassija, Vikas, Reddy, Naga Siva Sai, Wang, Jiacheng, Zeadally, Sherali, Hussain, Amir, Yu, F. Richard, Guizani, Mohsen, Niyato, Dusit
Imagine stepping into a virtual world that's as rich, dynamic, and interactive as our physical one. This is the promise of the Metaverse, and it's being brought to life by the transformative power of Generative Artificial Intelligence (AI). This paper offers a comprehensive exploration of how generative AI technologies are shaping the Metaverse, transforming it into a dynamic, immersive, and interactive virtual world. We delve into the applications of text generation models like ChatGPT and GPT-3, which are enhancing conversational interfaces with AI-generated characters. We explore the role of image generation models such as DALL-E and MidJourney in creating visually stunning and diverse content. We also examine the potential of 3D model generation technologies like Point-E and Lumirithmic in creating realistic virtual objects that enrich the Metaverse experience. But the journey doesn't stop there. We also address the challenges and ethical considerations of implementing these technologies in the Metaverse, offering insights into the balance between user control and AI automation. This paper is not just a study, but a guide to the future of the Metaverse, offering readers a roadmap to harnessing the power of generative AI in creating immersive virtual worlds.
A Critical Review of Large Language Models: Sensitivity, Bias, and the Path Toward Specialized AI
Hajikhani, Arash, Cole, Carolyn
In the realm of Artificial Intelligence (AI), the rise of Large Language Models (LLMs) such as OpenAI's Generative Pretrained Transformer (GPT) series has introduced unprecedented capabilities in text summarization and classification (Min et al., 2021; Yoo et al., 2021). These AI juggernauts can dissect vast quantities of text, distill key points, and even classify documents with a level of speed and accuracy that leaves human ability far behind (Jiang et al., 2022). While we applaud these advancements, it's imperative to keep a clear perspective on their inner workings, particularly their training data and decision making procedures. The advent of LLMs has undoubtedly revolutionized text analytics, but it has also introduced novel challenges concerning sensitivity and potential biases (Albrecht et al., 2022; Liang et al., 2021). Inherent in the training of these models is their susceptibility to embed the biases present in the training data, a subtle yet pervasive issue that can later be extremely difficult to detect and rectify (Alvi et al., 2019; Zhang & Verma, 2021). It's crucial, therefore, to scrutinize not only the LLMs themselves but also the mechanisms that train them. The broad and diverse nature of subjects that these models deal with, ranging from mundane queries to sensitive matters, necessitates a systematic and rigorous training approach.
The European AI Liability Directives -- Critique of a Half-Hearted Approach and Lessons for the Future
As ChatGPT et al. conquer the world, the optimal liability framework for AI systems remains an unsolved problem across the globe. In a much-anticipated move, the European Commission advanced two proposals outlining the European approach to AI liability in September 2022: a novel AI Liability Directive and a revision of the Product Liability Directive. They constitute the final cornerstone of EU AI regulation. Crucially, the liability proposals and the EU AI Act are inherently intertwined: the latter does not contain any individual rights of affected persons, and the former lack specific, substantive rules on AI development and deployment. Taken together, these acts may well trigger a Brussels Effect in AI regulation, with significant consequences for the US and beyond. This paper makes three novel contributions. First, it examines in detail the Commission proposals and shows that, while making steps in the right direction, they ultimately represent a half-hearted approach: if enacted as foreseen, AI liability in the EU will primarily rest on disclosure of evidence mechanisms and a set of narrowly defined presumptions concerning fault, defectiveness and causality. Hence, second, the article suggests amendments, which are collected in an Annex at the end of the paper. Third, based on an analysis of the key risks AI poses, the final part of the paper maps out a road for the future of AI liability and regulation, in the EU and beyond. This includes: a comprehensive framework for AI liability; provisions to support innovation; an extension to non-discrimination/algorithmic fairness, as well as explainable AI; and sustainability. I propose to jump-start sustainable AI regulation via sustainability impact assessments in the AI Act and sustainable design defects in the liability regime. In this way, the law may help spur not only fair AI and XAI, but potentially also sustainable AI (SAI).
The Download: protecting photos from AI, and air-conditioning's dilemma
There's currently nothing stopping someone taking the selfie you posted online last week and editing it using powerful generative AI systems. Even worse, it might be impossible to prove that the resulting image is fake. The good news is that a new tool, created by researchers at MIT, could prevent this. The tool, called PhotoGuard, works like a protective shield by altering photos in tiny ways that are invisible to the human eye but prevent them from being manipulated. If someone tries to use an editing app based on a generative AI model to manipulate an image that has been "immunized" by PhotoGuard, the result will look unrealistic or warped.
Fact-Checking of AI-Generated Reports
Mahmood, Razi, Wang, Ge, Kalra, Mannudeep, Yan, Pingkun
With advances in generative artificial intelligence (AI), it is now possible to produce realistic-looking automated reports for preliminary reads of radiology images. This can expedite clinical workflows, improve accuracy and reduce overall costs. However, it is also well-known that such models often hallucinate, leading to false findings in the generated reports. In this paper, we propose a new method of fact-checking of AI-generated reports using their associated images. Specifically, the developed examiner differentiates real and fake sentences in reports by learning the association between an image and sentences describing real or potentially fake findings. To train such an examiner, we first created a new dataset of fake reports by perturbing the findings in the original ground truth radiology reports associated with images. Text encodings of real and fake sentences drawn from these reports are then paired with image encodings to learn the mapping to real/fake labels. The utility of such an examiner is demonstrated for verifying automatically generated reports by detecting and removing fake sentences. Future generative AI approaches can use the resulting tool to validate their reports leading to a more responsible use of AI in expediting clinical workflows.
Performance of ChatGPT on USMLE: Unlocking the Potential of Large Language Models for AI-Assisted Medical Education
Sharma, Prabin, Thapa, Kisan, Thapa, Dikshya, Dhakal, Prastab, Upadhaya, Mala Deep, Adhikari, Santosh, Khanal, Salik Ram
Artificial intelligence is gaining traction in more ways than ever before. The popularity of language models and AI-based businesses has soared since ChatGPT was made available to the general public via OpenAI. It is becoming increasingly common for people to use ChatGPT both professionally and personally. Considering the widespread use of ChatGPT and the reliance people place on it, this study determined how reliable ChatGPT can be for answering complex medical and clinical questions. Harvard University gross anatomy along with the United States Medical Licensing Examination (USMLE) questionnaire were used to accomplish the objective. The paper evaluated the obtained results using a 2-way ANOVA and posthoc analysis. Both showed systematic covariation between format and prompt. Furthermore, the physician adjudicators independently rated the outcome's accuracy, concordance, and insight. As a result of the analysis, ChatGPT-generated answers were found to be more context-oriented and represented a better model for deductive reasoning than regular Google search results. Furthermore, ChatGPT obtained 58.8% on logical questions and 60% on ethical questions. This means that the ChatGPT is approaching the passing range for logical questions and has crossed the threshold for ethical questions. The paper believes ChatGPT and other language learning models can be invaluable tools for e-learners; however, the study suggests that there is still room to improve their accuracy. In order to improve ChatGPT's performance in the future, further research is needed to better understand how it can answer different types of questions.