Generative AI
Alteration-free and Model-agnostic Origin Attribution of Generated Images
Wang, Zhenting, Chen, Chen, Zeng, Yi, Lyu, Lingjuan, Ma, Shiqing
Recently, there has been a growing attention in image generation models. However, concerns have emerged regarding potential misuse and intellectual property (IP) infringement associated with these models. Therefore, it is necessary to analyze the origin of images by inferring if a specific image was generated by a particular model, i.e., origin attribution. Existing methods are limited in their applicability to specific types of generative models and require additional steps during training or generation. This restricts their use with pre-trained models that lack these specific operations and may compromise the quality of image generation. To overcome this problem, we first develop an alteration-free and model-agnostic origin attribution method via input reverse-engineering on image generation models, i.e., inverting the input of a particular model for a specific image. Given a particular model, we first analyze the differences in the hardness of reverse-engineering tasks for the generated images of the given model and other images. Based on our analysis, we propose a method that utilizes the reconstruction loss of reverse-engineering to infer the origin. Our proposed method effectively distinguishes between generated images from a specific generative model and other images, including those generated by different models and real images.
ProcessGPT: Transforming Business Process Management with Generative Artificial Intelligence
Beheshti, Amin, Yang, Jian, Sheng, Quan Z., Benatallah, Boualem, Casati, Fabio, Dustdar, Schahram, Nezhad, Hamid Reza Motahari, Zhang, Xuyun, Xue, Shan
Generative Pre-trained Transformer (GPT) is a state-of-the-art machine learning model capable of generating human-like text through natural language processing (NLP). GPT is trained on massive amounts of text data and uses deep learning techniques to learn patterns and relationships within the data, enabling it to generate coherent and contextually appropriate text. This position paper proposes using GPT technology to generate new process models when/if needed. We introduce ProcessGPT as a new technology that has the potential to enhance decision-making in data-centric and knowledge-intensive processes. ProcessGPT can be designed by training a generative pre-trained transformer model on a large dataset of business process data. This model can then be fine-tuned on specific process domains and trained to generate process flows and make decisions based on context and user input. The model can be integrated with NLP and machine learning techniques to provide insights and recommendations for process improvement. Furthermore, the model can automate repetitive tasks and improve process efficiency while enabling knowledge workers to communicate analysis findings, supporting evidence, and make decisions. ProcessGPT can revolutionize business process management (BPM) by offering a powerful tool for process augmentation, automation and improvement. Finally, we demonstrate how ProcessGPT can be a powerful tool for augmenting data engineers in maintaining data ecosystem processes within large bank organizations. Our scenario highlights the potential of this approach to improve efficiency, reduce costs, and enhance the quality of business operations through the automation of data-centric and knowledge-intensive processes. These results underscore the promise of ProcessGPT as a transformative technology for organizations looking to improve their process workflows.
AI will indeed be, but its rise will be mundane not apocalyptic John Naughton
Cheered by the news that OpenAI, the company behind ChatGPT, had released a free iPhone app for the language model, I went to the Apple app store to download it, only to find that it was nowhere to be found. This is because – as I belatedly discovered – it's currently only available via the US app store and will be rolled out to other jurisdictions in due course. Despite that, though, the UK store was positively groaning with "ChatGPT" apps – of which I counted 25 before losing the will to live. For example, there's AI Chat – Chatbot AI Assistant ("Experience the power of AI! Create Essays, Emails, Resumes or Any Text!"). Or Chat AI – Ask Open Chatbot ("The ultimate AI chat app that can assist you with anything and everything you need")?
Tech stocks surge as wave of interest in AI drives $4tn rally
A rush of interest in artificial intelligence (AI) has helped to fuel a $4tn (£3.2tn) rally in technology stocks this year, with the US Nasdaq exchange reaching its highest level since last August in a week that saw the chipmaker Nvidia poised to become the next trillion-dollar company. Some stocks seen as AI winners – such as semiconductor makers and software developers – have more than doubled in value as traders bet on massive growth in the industry, even as fears mount over waves of job losses as everyday tasks become automated. On Friday, the combined value of technology companies listed on the Nasdaq Composite share index reached $22tn, according to the international data firm Refinitiv, up from $18tn at the end of 2022. The AI rally has helped lift the index 23% so far this year. Nvidia, whose high-end chips are used to power the datacentres used by the new wave of generative AI products such as ChatGPT, could soon become the first chipmaker to be valued at more than $1tn.
Where Memory Ends and Generative AI Begins
In late March, a well-funded artificial intelligence startup hosted what it said was the first ever AI film festival at the Alamo Drafthouse theater in San Francisco. The startup, called Runway, is best known for cocreating Stable Diffusion, the standout text-to-image AI tool that captured imaginations in 2022. Then, in February of this year, Runway released a tool that could change the entire style of an existing video with just a simple prompt. Runway told budding filmmakers to have at it and later selected 10 short films to showcase at the fest. The short films were mostly demonstrations of technology; well-constructed narratives took a backseat.
AI will make humans more creative, not replace them, predict entertainment executives
People in Texas sounded off on AI job displacement, with half of people who spoke to Fox News convinced that the tech will rob them of work. With new developments in generative artificial intelligence bringing the technology to the forefront of public conversation, concerns about how it will affect jobs in the entertainment industry have risen, even contributing in a writer strike in Hollywood. But, founders of Web3 animation studio Toonstar have been using artificial intelligence in their studio for years, and told Fox News Digital it serves as an aid in the creative process. AI can "unlock creativity" and give animators a "head start" in terms of creativity, Luisa Huang, COO and co-founder of Toonstar told Fox News Digital. "But I have yet to see AI be able to put output anything … that is ready for production," she added.
Attention Paper: How Generative AI Reshapes Digital Shadow Industry?
Wang, Qichao, Ma, Huan, Wei, Wentao, Li, Hangyu, Chen, Liang, Zhao, Peilin, Zhao, Binwen, Hu, Bo, Zhang, Shu, Zheng, Zibin, Wu, Bingzhe
The rapid development of digital economy has led to the emergence of various black and shadow internet industries, which pose potential risks that can be identified and managed through digital risk management (DRM) that uses different techniques such as machine learning and deep learning. The evolution of DRM architecture has been driven by changes in data forms. However, the development of AI-generated content (AIGC) technology, such as ChatGPT and Stable Diffusion, has given black and shadow industries powerful tools to personalize data and generate realistic images and conversations for fraudulent activities. This poses a challenge for DRM systems to control risks from the source of data generation and to respond quickly to the fast-changing risk environment. This paper aims to provide a technical analysis of the challenges and opportunities of AIGC from upstream, midstream, and downstream paths of black/shadow industries and suggest future directions for improving existing risk control systems. The paper will explore the new black and shadow techniques triggered by generative AI technology and provide insights for building the next-generation DRM system.
Stereotypes and Smut: The (Mis)representation of Non-cisgender Identities by Text-to-Image Models
Ungless, Eddie L., Ross, Björn, Lauscher, Anne
Cutting-edge image generation has been praised for producing high-quality images, suggesting a ubiquitous future in a variety of applications. However, initial studies have pointed to the potential for harm due to predictive bias, reflecting and potentially reinforcing cultural stereotypes. In this work, we are the first to investigate how multimodal models handle diverse gender identities. Concretely, we conduct a thorough analysis in which we compare the output of three image generation models for prompts containing cisgender vs. non-cisgender identity terms. Our findings demonstrate that certain non-cisgender identities are consistently (mis)represented as less human, more stereotyped and more sexualised. We complement our experimental analysis with (a)~a survey among non-cisgender individuals and (b) a series of interviews, to establish which harms affected individuals anticipate, and how they would like to be represented. We find respondents are particularly concerned about misrepresentation, and the potential to drive harmful behaviours and beliefs. Simple heuristics to limit offensive content are widely rejected, and instead respondents call for community involvement, curated training data and the ability to customise. These improvements could pave the way for a future where change is led by the affected community, and technology is used to positively ``[portray] queerness in ways that we haven't even thought of'' rather than reproducing stale, offensive stereotypes.
Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Gris, Lucas Rafael Stefanel, Marcacini, Ricardo, Junior, Arnaldo Candido, Casanova, Edresson, Soares, Anderson, Aluísio, Sandra Maria
Automatic speech recognition (ASR) systems play a key role in applications involving human-machine interactions. Despite their importance, ASR models for the Portuguese language proposed in the last decade have limitations in relation to the correct identification of punctuation marks in automatic transcriptions, which hinder the use of transcriptions by other systems, models, and even by humans. However, recently Whisper ASR was proposed by OpenAI, a general-purpose speech recognition model that has generated great expectations in dealing with such limitations. This chapter presents the first study on the performance of Whisper for punctuation prediction in the Portuguese language. We present an experimental evaluation considering both theoretical aspects involving pausing points (comma) and complete ideas (exclamation, question, and fullstop), as well as practical aspects involving transcript-based topic modeling - an application dependent on punctuation marks for promising performance. We analyzed experimental results from videos of Museum of the Person, a virtual museum that aims to tell and preserve people's life histories, thus discussing the pros and cons of Whisper in a real-world scenario. Although our experiments indicate that Whisper achieves state-of-the-art results, we conclude that some punctuation marks require improvements, such as exclamation, semicolon and colon.
ChatGPT: A Study on its Utility for Ubiquitous Software Engineering Tasks
Sridhara, Giriprasad, G., Ranjani H., Mazumdar, Sourav
ChatGPT (Chat Generative Pre-trained Transformer) is a chatbot launched by OpenAI on November 30, 2022. OpenAI's GPT-3 family of large language models serve as the foundation for ChatGPT. ChatGPT is fine-tuned with both supervised and reinforcement learning techniques and has received widespread attention for its articulate responses across diverse domains of knowledge. In this study, we explore how ChatGPT can be used to help with common software engineering tasks. Many of the ubiquitous tasks covering the breadth of software engineering such as ambiguity resolution in software requirements, method name suggestion, test case prioritization, code review, log summarization can potentially be performed using ChatGPT. In this study, we explore fifteen common software engineering tasks using ChatGPT. We juxtapose and analyze ChatGPT's answers with the respective state of the art outputs (where available) and/or human expert ground truth. Our experiments suggest that for many tasks, ChatGPT does perform credibly and the response from it is detailed and often better than the human expert output or the state of the art output. However, for a few other tasks, ChatGPT in its present form provides incorrect answers and hence is not suited for such tasks.