Goto

Collaborating Authors

 Generative AI


AI career coaches are here. Should you trust them?

Washington Post - Technology News

At a time when people worry they could be replaced by AI in their jobs, some are using that same tech to help guide them at work. People already turn to generative AI for the kind of advice provided by professionals -- things like dating tips, trip planning and how to deal with toxic people.


Artificial Intelligence Index Report 2024

arXiv.org Artificial Intelligence

The 2024 Index is our most comprehensive to date and arrives at an important moment when AI's influence on society has never been more pronounced. This year, we have broadened our scope to more extensively cover essential trends such as technical advancements in AI, public perceptions of the technology, and the geopolitical dynamics surrounding its development. Featuring more original data than ever before, this edition introduces new estimates on AI training costs, detailed analyses of the responsible AI landscape, and an entirely new chapter dedicated to AI's impact on science and medicine. The AI Index report tracks, collates, distills, and visualizes data related to artificial intelligence (AI). Our mission is to provide unbiased, rigorously vetted, broadly sourced data in order for policymakers, researchers, executives, journalists, and the general public to develop a more thorough and nuanced understanding of the complex field of AI. The AI Index is recognized globally as one of the most credible and authoritative sources for data and insights on artificial intelligence. Previous editions have been cited in major newspapers, including the The New York Times, Bloomberg, and The Guardian, have amassed hundreds of academic citations, and been referenced by high-level policymakers in the United States, the United Kingdom, and the European Union, among other places. This year's edition surpasses all previous ones in size, scale, and scope, reflecting the growing significance that AI is coming to hold in all of our lives.


Analysing the Public Discourse around OpenAI's Text-To-Video Model 'Sora' using Topic Modeling

arXiv.org Artificial Intelligence

Announced on February 15, 2024, it instantly caught the public's attention by demonstrating the ability to generate dynamic and realistic video clips from text prompts, similar to how OpenAI's DALL-E generates images from text. While Sora is still in a pre-release phase, its potential to revolutionize content creation and disrupt various industries be it media, entertainment, or advertising, has already ignited discussions across online communities. Subreddits such as r/OpenAI, r/technology and r/ChatGPT have emerged as epicentres for technology enthusiasts and critics to openly discuss and share narratives about the latest advancements in AI technologies. Previous studies have explored public perceptions of large language models like ChatGPT and image generators such as DALL-E through analysing online forums. For instance, Talafidaryani and Mora (2024) employed topic modeling techniques on Reddit data to uncover dominant themes surrounding ChatGPT, including its capabilities, limitations, and ethical considerations. Similarly, Zhou and Nabus (2023) investigated discussions on DALL-E, revealing discourse on creative applications, risks of misuse, and comparisons to human artists. However, due to Sora's relatively recent emergence, there is still a lack of research on the narratives and themes emerging from Reddit conversations about this novel technology. By conducting topic modeling analysis on a large corpus of Reddit comments, the study aims to feel that gap and uncover the main topics and themes users are discussing about Sora. These narratives can provide valuable insights into public perceptions, areas of excitement, as well as societal and ethical concerns surrounding around the advent of new generative AI technologies.


On Perception of Prevalence of Cheating and Usage of Generative AI

arXiv.org Artificial Intelligence

This report investigates the perceptions of teaching staff on the prevalence of student cheating and the impact of Generative AI on academic integrity. Data was collected via an anonymous survey of teachers at the Department of Information Technology at Uppsala University and analyzed alongside institutional statistics on cheating investigations from 2004 to 2023. The results indicate that while teachers generally do not view cheating as highly prevalent, there is a strong belief that its incidence is increasing, potentially due to the accessibility of Generative AI. Most teachers do not equate AI usage with cheating but acknowledge its widespread use among students. Furthermore, teachers' perceptions align with objective data on cheating trends, highlighting their awareness of the evolving landscape of academic dishonesty.


The ethical situation of DALL-E 2

arXiv.org Artificial Intelligence

A hot topic of Artificial Intelligence right now is image generation from prompts. DALL-E 2 is one of the biggest names in this domain, as it allows people to create images from simple text inputs, to even more complicated ones. The company that made this possible, OpenAI, has assured everyone that visited their website that "Our mission is to ensure that artificial general intelligence benefits all humanity". A noble idea in our opinion, that also stood as the motive behind us choosing this subject. This paper analyzes the ethical implications of an AI image generative system, with an emphasis on how society is responding to it, how it probably will and how it should if all the right measures are taken.


Risks and Opportunities of Open-Source Generative AI

arXiv.org Artificial Intelligence

Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation is likely to put at risk the budding field of open-source generative AI. Using a three-stage framework for Gen AI development (near, mid and long-term), we analyze the risks and opportunities of open-source generative AI models with similar capabilities to the ones currently available (near to mid-term) and with greater capabilities (long-term). We argue that, overall, the benefits of open-source Gen AI outweigh its risks. As such, we encourage the open sourcing of models, training and evaluation data, and provide a set of recommendations and best practices for managing risks associated with open-source generative AI.


CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning

arXiv.org Artificial Intelligence

Data selection has emerged as a core issue for large-scale visual-language model pretaining (e.g., CLIP), particularly with noisy web-curated datasets. Three main data selection approaches are: (1) leveraging external non-CLIP models to aid data selection, (2) training new CLIP-style embedding models that are more effective at selecting high-quality data than the original OpenAI CLIP model, and (3) designing better metrics or strategies universally applicable to any CLIP embedding without requiring specific model properties (e.g., CLIPScore is one popular metric). While the first two approaches have been extensively studied, the third remains under-explored. In this paper, we advance the third approach by proposing two new methods. Firstly, instead of classical CLIP scores that only consider the alignment between two modalities from a single sample, we introduce negCLIPLoss, a CLIP loss-inspired method that adds the alignment between one sample and its contrastive pairs as an extra normalization term for better quality measurement. Secondly, when downstream tasks are known, we propose a new norm-based metric, NormSim, to measure the similarity between pretraining data and target data. We test our methods on the data selection benchmark, DataComp~\cite{gadre2023datacomp}. Compared to the best baseline using only OpenAI's CLIP-L/14, our methods achieve a 5.3\% improvement on ImageNet-1k and a 2.8\% improvement on 38 downstream evaluation tasks. Moreover, both negCLIPLoss and NormSim are compatible with existing techniques. By combining our methods with the current best methods DFN~\cite{fang2023data} and HYPE~\cite{kim2024hype}, we can boost average performance on downstream tasks by 0.9\%, achieving a new state-of-the-art.


Leveraging Generative AI for Smart City Digital Twins: A Survey on the Autonomous Generation of Data, Scenarios, 3D City Models, and Urban Designs

arXiv.org Artificial Intelligence

The digital transformation of modern cities by integrating advanced information, communication, and computing technologies has marked the epoch of data-driven smart city applications for efficient and sustainable urban management. Despite their effectiveness, these applications often rely on massive amounts of high-dimensional and multi-domain data for monitoring and characterizing different urban sub-systems, presenting challenges in application areas that are limited by data quality and availability, as well as costly efforts for generating urban scenarios and design alternatives. As an emerging research area in deep learning, Generative Artificial Intelligence (AI) models have demonstrated their unique values in data and code generation. This survey paper aims to explore the innovative integration of generative AI techniques and urban digital twins to address challenges in the realm of smart cities in various urban sectors, such as transportation and mobility management, energy system operations, building and infrastructure management, and urban design. The survey starts with the introduction of popular generative AI models with their application areas, followed by a structured review of the existing urban science applications that leverage the autonomous capability of the generative AI techniques to facilitate (a) data augmentation for promoting urban monitoring and predictive analytics, (b) synthetic data and scenario generation, (c) automated 3D city modeling, and (d) generative urban design and optimization. Based on the review, this survey discusses potential opportunities and technical strategies that integrate generative AI models into the next-generation urban digital twins for more reliable, scalable, and automated management of smart cities.


Participation in the age of foundation models

arXiv.org Artificial Intelligence

Growing interest and investment in the capabilities of foundation models has positioned such systems to impact a wide array of public services. Alongside these opportunities is the risk that these systems reify existing power imbalances and cause disproportionate harm to marginalized communities. Participatory approaches hold promise to instead lend agency and decision-making power to marginalized stakeholders. But existing approaches in participatory AI/ML are typically deeply grounded in context - how do we apply these approaches to foundation models, which are, by design, disconnected from context? Our paper interrogates this question. First, we examine existing attempts at incorporating participation into foundation models. We highlight the tension between participation and scale, demonstrating that it is intractable for impacted communities to meaningfully shape a foundation model that is intended to be universally applicable. In response, we develop a blueprint for participatory foundation models that identifies more local, application-oriented opportunities for meaningful participation. In addition to the "foundation" layer, our framework proposes the "subfloor'' layer, in which stakeholders develop shared technical infrastructure, norms and governance for a grounded domain, and the "surface'' layer, in which affected communities shape the use of a foundation model for a specific downstream task. The intermediate "subfloor'' layer scopes the range of potential harms to consider, and affords communities more concrete avenues for deliberation and intervention. At the same time, it avoids duplicative effort by scaling input across relevant use cases. Through three case studies in clinical care, financial services, and journalism, we illustrate how this multi-layer model can create more meaningful opportunities for participation than solely intervening at the foundation layer.


OpenAI's board allegedly learned about ChatGPT launch on Twitter

Engadget

Helen Toner, one of OpenAI's former board members who was responsible for firing CEO Sam Altman last year, revealed that the company's board didn't know about the launch of ChatGPT until it was released in November 2022. "[The] board was not informed in advance of that," Toner said on Tuesday on a podcast called The Ted AI Show. "We learned about ChatGPT on Twitter." Toner's comments came just two days after criticized the way OpenAI was governed in an Economist piece published on Sunday that she co-wrote with Tasha McCauley, another former OpenAI board member. This is the first time that Toner has spoken openly about the circumstances that led to Altman's dramatic ouster from the company he co-founded in 2015, and his quick reinstatement following protests from employees.