Goto

Collaborating Authors

 Generative AI


Bias in Generative AI

arXiv.org Artificial Intelligence

This study analyzed images generated by three popular generative artificial intelligence (AI) tools - Midjourney, Stable Diffusion, and DALLE 2 - representing various occupations to investigate potential bias in AI generators. Our analysis revealed two overarching areas of concern in these AI generators, including (1) systematic gender and racial biases, and (2) subtle biases in facial expressions and appearances. Firstly, we found that all three AI generators exhibited bias against women and African Americans. Moreover, we found that the evident gender and racial biases uncovered in our analysis were even more pronounced than the status quo when compared to labor force statistics or Google images, intensifying the harmful biases we are actively striving to rectify in our society. Secondly, our study uncovered more nuanced prejudices in the portrayal of emotions and appearances. For example, women were depicted as younger with more smiles and happiness, while men were depicted as older with more neutral expressions and anger, posing a risk that generative AI models may unintentionally depict women as more submissive and less competent than men. Such nuanced biases, by their less overt nature, might be more problematic as they can permeate perceptions unconsciously and may be more difficult to rectify. Although the extent of bias varied depending on the model, the direction of bias remained consistent in both commercial and open-source AI generators. As these tools become commonplace, our study highlights the urgency to identify and mitigate various biases in generative AI, reinforcing the commitment to ensuring that AI technologies benefit all of humanity for a more inclusive future.


Knowledge Graphs as Context Sources for LLM-Based Explanations of Learning Recommendations

arXiv.org Artificial Intelligence

In the era of personalized education, the provision of comprehensible explanations for learning recommendations is of a great value to enhance the learner's understanding and engagement with the recommended learning content. Large language models (LLMs) and generative AI in general have recently opened new doors for generating human-like explanations, for and along learning recommendations. However, their precision is still far away from acceptable in a sensitive field like education. To harness the abilities of LLMs, while still ensuring a high level of precision towards the intent of the learners, this paper proposes an approach to utilize knowledge graphs (KG) as a source of factual context, for LLM prompts, reducing the risk of model hallucinations, and safeguarding against wrong or imprecise information, while maintaining an application-intended learning context. We utilize the semantic relations in the knowledge graph to offer curated knowledge about learning recommendations. With domain-experts in the loop, we design the explanation as a textual template, which is filled and completed by the LLM. Domain experts were integrated in the prompt engineering phase as part of a study, to ensure that explanations include information that is relevant to the learner. We evaluate our approach quantitatively using Rouge-N and Rouge-L measures, as well as qualitatively with experts and learners. Our results show an enhanced recall and precision of the generated explanations compared to those generated solely by the GPT model, with a greatly reduced risk of generating imprecise information in the final learning explanation.


Roundtables: The AI Economy

MIT Technology Review

There's no doubt that generative AI will impact the economy--but how, exactly, remains an open question. Despite fears that these AI tools will upend workers and exacerbate wealth inequality, early evidence suggests the technology could actually help level the playing field for some. Meanwhile, the demand for chips that underpin modern AI including generative tools is expected to grow significantly. And the US is spending billions to reshore the industry. Global competition for these chips is fierce, with both countries and companies now making unprecedented investments in the sector.


Roundtables: How does AI work?

MIT Technology Review

Everyone's talking about large language models and image generators built on artificial intelligence. Many people have tested out tools like ChatGPT or DALL-E 2 and been amazed at the results, or disturbed by their tendency to hallucinate. But how do the algorithms underpinning these new generative tools actually work? And what's the best way to evaluate their capabilities?


Large Language Models in Fire Engineering: An Examination of Technical Questions Against Domain Knowledge

arXiv.org Artificial Intelligence

This communication presents preliminary findings from comparing two recent chatbots, OpenAI's ChatGPT and Google's Bard, in the context of fire engineering by evaluating their responses in handling fire safety related queries. A diverse range of fire engineering questions and scenarios were created and examined, including structural fire design, fire prevention strategies, evacuation, building code compliance, and fire suppression systems (some of which resemble those commonly present in the Fire Protection exam (FPE)). The results reveal some key differences in the performance of the chatbots, with ChatGPT demonstrating a relatively superior performance. Then, this communication highlights the potential for chatbot technology to revolutionize fire engineering practices by providing instant access to critical information while outlining areas for further improvement and research. Evidently, and when it matures, this technology will likely be elemental to our engineers' practice and education.


Large Language Models and Video Games: A Preliminary Scoping Review

arXiv.org Artificial Intelligence

LLMs are powerful tools for language processing and prediction, pre-trained on vast collections of natural language, and capable of performing diverse language analysis and generation tasks [23]. The release of ChatGPT, along with the many other available LLMs (e.g., GPT-4, LLaMa, Codex, BERT) has opened new doors to research and development potential, which has seen a recent increase in related research. Like many fields, LLMs hold interesting possibilities for video games, which has prompted many researchers to hasten to investigate the potential for applying LLMs to various aspects of video game research and development. Although the concept of generative AI is not new to video games, with decades of prior work in AI-powered generation of game content [26, 46], LLMs have the potential to revolutionise generation and co-creation of video game content, along with game development tools and processes, and games research approaches. As research and development of LLMs and games is occurring and evolving quickly, it is difficult to capture a full picture of how LLMs are being used in games research. The aim of this paper is to provide a preliminary scoping review of LLMs and video games, surveying the related research conducted between 2020 and 2023. We aim to identify the ways in which researchers have been exploring the use of LLMs for game development and research to date. To identify the relevant papers, we conducted a Google Scholar search for papers published between 2020-2023 (and very early 2024). We identified 76 relevant papers from 2260 results returned in the search.


Robert F. Kennedy Jr.'s Microsoft-Powered Chatbot Just Disappeared

WIRED

Since Robert F. Kennedy Jr. first announced his longshot presidential bid, his campaign has leaned into a variety of unorthodox digital strategies. He's appeared on countless podcasts and has collabed with popular influencers to reach voters online. More recently, the Kennedy campaign has experimented with an AI chatbot that used an apparent loophole to get around OpenAI's restrictions on political use. On Sunday, after inquiries from WIRED, the chatbot disappeared. The loophole in question is an apparent result of the tight relationship between Microsoft and OpenAI.


Exploring the Design of Generative AI in Supporting Music-based Reminiscence for Older Adults

arXiv.org Artificial Intelligence

Music-based reminiscence has the potential to positively impact the psychological well-being of older adults. However, the aging process and physiological changes, such as memory decline and limited verbal communication, may impede the ability of older adults to recall their memories and life experiences. Given the advanced capabilities of generative artificial intelligence (AI) systems, such as generated conversations and images, and their potential to facilitate the reminiscing process, this study aims to explore the design of generative AI to support music-based reminiscence in older adults. This study follows a user-centered design approach incorporating various stages, including detailed interviews with two social workers and two design workshops (involving ten older adults). Our work contributes to an in-depth understanding of older adults' attitudes toward utilizing generative AI for supporting music-based reminiscence and identifies concrete design considerations for the future design of generative AI to enhance the reminiscence experience of older adults.


SARD: A Human-AI Collaborative Story Generation

arXiv.org Artificial Intelligence

Generative artificial intelligence (GenAI) has ushered in a new era for storytellers, providing a powerful tool to ignite creativity and explore uncharted narrative territories. As technology continues to advance, the synergy between human creativity and AI-generated content holds the potential to redefine the landscape of storytelling. In this work, we propose SARD, a drag-and-drop visual interface for generating a multi-chapter story using large language models. Our evaluation of the usability of SARD and its creativity support shows that while node-based visualization of the narrative may help writers build a mental model, it exerts unnecessary mental overhead to the writer and becomes a source of distraction as the story becomes more elaborated. We also found that AI generates stories that are less lexically diverse, irrespective of the complexity of the story. We identified some patterns and limitations of our tool that can guide the development of future human-AI co-writing tools.


Regeneration Based Training-free Attribution of Fake Images Generated by Text-to-Image Generative Models

arXiv.org Artificial Intelligence

Text-to-image generative models have recently garnered significant attention due to their ability to generate images based on prompt descriptions. While these models have shown promising performance, concerns have been raised regarding the potential misuse of the generated fake images. In response to this, we have presented a simple yet effective training-free method to attribute fake images generated by text-to-image models to their source models. Given a test image to be attributed, we first inverse the textual prompt of the image, and then put the reconstructed prompt into different candidate models to regenerate candidate fake images. By calculating and ranking the similarity of the test image and the candidate images, we can determine the source of the image. This attribution allows model owners to be held accountable for any misuse of their models. Note that our approach does not limit the number of candidate text-to-image generative models. Comprehensive experiments reveal that (1) Our method can effectively attribute fake images to their source models, achieving comparable attribution performance with the state-of-the-art method; (2) Our method has high scalability ability, which is well adapted to real-world attribution scenarios. (3) The proposed method yields satisfactory robustness to common attacks, such as Gaussian blurring, JPEG compression, and Resizing. We also analyze the factors that influence the attribution performance, and explore the boost brought by the proposed method as a plug-in to improve the performance of existing SOTA. We hope our work can shed some light on the solutions to addressing the source of AI-generated images, as well as to prevent the misuse of text-to-image generative models.