Generative AI
Qualcomm brings on-device AI to mobile and PC
Qualcomm is no stranger in running artificial intelligence and machine learning systems on-device and without an internet connection. They've been doing it with their camera chipsets for years. But on Tuesday at Snapdragon Summit 2023, the company announced that on-device AI is finally coming to mobile devices and Windows 11 PCs as part of the new Snapdragon 8 Gen 3 and X Elite chips. Both chipsets were built from the ground up with generative AI capabilities in mind and are able to support a variety of large language models (LLM), language vision models (LVM), and transformer network-based automatic speech recognition (ASR) models, up to 10 billion parameters for the SD8 gen 3 and 13 billion parameters for the X Elite, entirely on-device. That means you'll be able to run anything from Baidu's ERNIE 3.5 to OpenAI's Whisper, Meta's Llama 2 or Google's Gecko on your phone or laptop, without an internet connection.
Qualcomm's Snapdragon 8 Gen 3 brings on-device generative AI to more Android phones
At its annual Snapdragon Summit on Tuesday, Qualcomm revealed its latest mobile chipset. Perhaps the biggest change in the Snapdragon 8 Gen 3 is the introduction of on-device generative AI (akin to Google's Tensor G3). The chipset's AI Engine supports multi-modal generative AI models and what Qualcomm claims is the world's fastest Stable Diffusion system with the ability to generate an image in under a second. So, you should be able to whip up backgrounds and images for social media posts in a flash. Because GAI requests are handled on-device, Qualcomm says they remain private.
How AI Can Be Regulated Like Nuclear Energy
Prominent AI researchers and figures have consistently dominated headlines by invoking comparisons that AI risk is on par with the existential and safety risks that were posed with the coming of the nuclear age. From statements that AI should be subject to regulation akin to nuclear energy, to declarations paralleling the risk of human extinction to that of nuclear war, the analogies drawn between AI and nuclear have been consistent. The argument for such extinction risk has hinged on the hypothetical and unproven risk of an Artificial General Intelligence (AGI) imminently arising from current Large Language Models (e.g., ChatGPT), necessitating increased caution with their creation and deployment. Sam Altman, the CEO of OpenAI, has even referenced to the well established nuclear practice of "licensing", deemed anti-competitive by some. He has called on the creation of a federal agency that can grant licenses to create AI models above a certain threshold of capabilities.
Robo-Insight #6
Source: OpenAI's DALLยทE 2 with prompt "a hyperrealistic picture of a robot reading the news on a laptop at a coffee shop" Welcome to the 6th edition of Robo-Insight, a robotics news update! In this post, we are excited to share a range of new advancements in the field and highlight robots' progress in areas like medical assistance, prosthetics, robot flexibility, joint movement, work performance, AI design, and household cleanliness. In the medical world, researchers from Germany have developed a robotic system designed to help nurses relieve the physical strain associated with patient care. Their work explores how robotic technology can assist in such tasks by remotely anchoring patients in a lateral position. The results indicate that the system improved the working posture of nurses by an average of 11.93% and was rated as user-friendly.
The Download: poisoning generative AI, and heat-storing batteries
What's happening: A new tool lets artists make invisible changes to the pixels in their art before they upload it online so that if it's scraped into an AI training set, it can cause the resulting model to break in chaotic and unpredictable ways. Why it matters: The tool, called Nightshade, is intended as a way to fight back against AI companies that use artists' work to train their models without the creator's permission. Using it to "poison" this training data could damage future iterations of image-generating AI models, such as DALL-E, Midjourney, and Stable Diffusion, by rendering some of their outputs useless. How it works: Nightshade exploits a security vulnerability in generative AI models, one arising from the fact that they are trained on vast amounts of data--in this case, images that have been hoovered from the internet. Poisoned data samples can manipulate models into learning, for example, that images of hats are cakes, and images of handbags are toasters.
Do Stochastic Parrots have Feelings Too? Improving Neural Detection of Synthetic Text via Emotion Recognition
Cowap, Alan, Graham, Yvette, Foster, Jennifer
Recent developments in generative AI have shone a spotlight on high-performance synthetic text generation technologies. The now wide availability and ease of use of such models highlights the urgent need to provide equally powerful technologies capable of identifying synthetic text. With this in mind, we draw inspiration from psychological studies which suggest that people can be driven by emotion and encode emotion in the text they compose. We hypothesize that pretrained language models (PLMs) have an affective deficit because they lack such an emotional driver when generating text and consequently may generate synthetic text which has affective incoherence i.e. lacking the kind of emotional coherence present in human-authored text. We subsequently develop an emotionally aware detector by fine-tuning a PLM on emotion. Experiment results indicate that our emotionally-aware detector achieves improvements across a range of synthetic text generators, various sized models, datasets, and domains. Finally, we compare our emotionally-aware synthetic text detector to ChatGPT in the task of identification of its own output and show substantial gains, reinforcing the potential of emotion as a signal to identify synthetic text. Code, models, and datasets are available at https: //github.com/alanagiasi/emoPLMsynth
Content-Based Search for Deep Generative Models
Lu, Daohan, Wang, Sheng-Yu, Kumari, Nupur, Agarwal, Rohan, Tang, Mia, Bau, David, Zhu, Jun-Yan
The growing proliferation of customized and pretrained generative models has made it infeasible for a user to be fully cognizant of every model in existence. To address this need, we introduce the task of content-based model search: given a query and a large set of generative models, finding the models that best match the query. As each generative model produces a distribution of images, we formulate the search task as an optimization problem to select the model with the highest probability of generating similar content as the query. We introduce a formulation to approximate this probability given the query from different modalities, e.g., image, sketch, and text. Furthermore, we propose a contrastive learning framework for model retrieval, which learns to adapt features for various query modalities. We demonstrate that our method outperforms several baselines on Generative Model Zoo, a new benchmark we create for the model retrieval task.
GPT-4 gave advice on planning terrorist attacks when asked in Zulu
Safeguards designed to prevent OpenAI's GPT-4 artificial intelligence from answering harmful prompts failed when it received requests in languages such as Scots Gaelic or Zulu. This allowed researchers to get AI-generated answers on how to build a homemade bomb or perform insider trading. The vulnerability demonstrated in the large language model involves instructing the AI in languages that are mostly absent from its training data.
This new data poisoning tool lets artists fight back against generative AI
Meta, Google, Stability AI, and OpenAI did not respond to MIT Technology Review's request for comment on how they might respond. Zhao's team also developed Glaze, a tool that allows artists to "mask" their own personal style to prevent it from being scraped by AI companies. It works in a similar way to Nightshade: by changing the pixels of images in subtle ways that are invisible to the human eye but manipulate machine-learning models to interpret the image as something different from what it actually shows. The team intends to integrate Nightshade into Glaze, and artists can choose whether they want to use the data-poisoning tool or not. The team is also making Nightshade open source, which would allow others to tinker with it and make their own versions.
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index
Chakraborty, Megha, Tonmoy, S. M Towhidul Islam, Zaman, S M Mehedi, Sharma, Krish, Barman, Niyar R, Gupta, Chandan, Gautam, Shreya, Kumar, Tanay, Jain, Vinija, Chadha, Aman, Sheth, Amit P., Das, Amitava
With the rise of prolific ChatGPT, the risk and consequences of AI-generated text has increased alarmingly. To address the inevitable question of ownership attribution for AI-generated artifacts, the US Copyright Office released a statement stating that 'If a work's traditional elements of authorship were produced by a machine, the work lacks human authorship and the Office will not register it'. Furthermore, both the US and the EU governments have recently drafted their initial proposals regarding the regulatory framework for AI. Given this cynosural spotlight on generative AI, AI-generated text detection (AGTD) has emerged as a topic that has already received immediate attention in research, with some initial methods having been proposed, soon followed by emergence of techniques to bypass detection. This paper introduces the Counter Turing Test (CT^2), a benchmark consisting of techniques aiming to offer a comprehensive evaluation of the robustness of existing AGTD techniques. Our empirical findings unequivocally highlight the fragility of the proposed AGTD methods under scrutiny. Amidst the extensive deliberations on policy-making for regulating AI development, it is of utmost importance to assess the detectability of content generated by LLMs. Thus, to establish a quantifiable spectrum facilitating the evaluation and ranking of LLMs according to their detectability levels, we propose the AI Detectability Index (ADI). We conduct a thorough examination of 15 contemporary LLMs, empirically demonstrating that larger LLMs tend to have a higher ADI, indicating they are less detectable compared to smaller LLMs. We firmly believe that ADI holds significant value as a tool for the wider NLP community, with the potential to serve as a rubric in AI-related policy-making.