AITopics

2405.00958

Country: Europe > United Kingdom (0.28)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Industry: Energy > Oil & Gas (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.69)

Clemm, Christian, Stobbe, Lutz, Wimalawarne, Kishan, Druschke, Jan

Towards Green AI: Current status and future research

arXiv.org Artificial IntelligenceMay-1-2024

We are in the midst of an explosive growth of the The rapidly growing computational requirements of AI development and integration of artificial intelligence (AI)- models necessitate increasingly powerful hardware to provide based systems into all aspects of human activities that has the computational infrastructure required for the training and been speculated to be'as transformative as the industrial inference of AI models. Graphics processing units (GPU) revolution' and could incur profound social and economic provide the parallel processing capabilities and are employed changes [1]. The release of'generative AI' applications, in server systems operated in globally distributed data centers notably the text generator ChatGPT, text-to-image generators ('the cloud'). The energy needs of the compute hardware and like Midjourney, and text-to-video models like Sora have required heating, ventilation, and air conditioning (HVAC) in recently brought public attention to the rapidly progressing data centers are ever-increasing. The IEA projects the technological capabilities.

carbon footprint, consumption, environmental impact, (13 more...)

2407.10237

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Austria > Vienna (0.14)
Europe > Germany > Berlin (0.04)
(7 more...)

Genre:

Overview (1.00)
Research Report (0.82)

Industry:

Information Technology > Services (1.00)
Energy > Renewable (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

arXiv.org Artificial IntelligenceMay-1-2024

Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

Wan, Yixin, Subramonian, Arjun, Ovalle, Anaelia, Lin, Zongyu, Suvarna, Ashima, Chance, Christina, Bansal, Hritik, Pattichis, Rebecca, Chang, Kai-Wei

The recent advancement of large and powerful models with Text-to-Image (T2I) generation abilities -- such as OpenAI's DALLE-3 and Google's Gemini -- enables users to generate high-quality images from textual prompts. However, it has become increasingly evident that even simple prompts could cause T2I models to exhibit conspicuous social bias in generated images. Such bias might lead to both allocational and representational harms in society, further marginalizing minority groups. Noting this problem, a large body of recent works has been dedicated to investigating different dimensions of bias in T2I systems. However, an extensive review of these studies is lacking, hindering a systematic understanding of current progress and research gaps. We present the first extensive survey on bias in T2I generative models. In this survey, we review prior studies on dimensions of bias: Gender, Skintone, and Geo-Culture. Specifically, we discuss how these works define, evaluate, and mitigate different aspects of bias. We found that: (1) while gender and skintone biases are widely studied, geo-cultural bias remains under-explored; (2) most works on gender and skintone bias investigated occupational association, while other aspects are less frequently studied; (3) almost all gender bias works overlook non-binary identities in their studies; (4) evaluation datasets and metrics are scattered, with no unified framework for measuring biases; and (5) current mitigation methods fail to resolve biases comprehensively. Based on current limitations, we point out future research directions that contribute to human-centric definitions, evaluations, and mitigation of biases. We hope to highlight the importance of studying biases in T2I systems, as well as encourage future efforts to holistically understand and tackle biases, building fair and trustworthy T2I technologies for everyone.

arxiv preprint arxiv, computational linguistic, proceedings, (14 more...)

2404.0103

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > Canada > Ontario > Toronto (0.04)
(9 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

The GuardianApr-30-2024, 18:29:03 GMT

Eight US newspapers sue OpenAI and Microsoft for copyright infringement

The New York Daily News, Chicago Tribune, Denver Post and other papers filed the lawsuit on Tuesday in a New York federal court. "We've spent billions of dollars gathering information and reporting news at our publications, and we can't allow OpenAI and Microsoft to expand the Big Tech playbook of stealing our work to build their own businesses at our expense," said a written statement from Frank Pine, executive editor for the MediaNews Group and Tribune Publishing. The other newspapers that are part of the lawsuit are MediaNews Group's Mercury News, Denver Post, Orange County Register and St Paul Pioneer-Press, and Tribune Publishing's Orlando Sentinel and South Florida Sun Sentinel. All of the newspapers are owned by Alden Global Capital. Microsoft declined to comment on Tuesday.

large language model, machine learning, openai and microsoft, (12 more...)

The Guardian

Country:

North America > United States > New York (0.53)
North America > United States > Illinois > Cook County > Chicago (0.28)
North America > United States > Florida (0.28)

Industry:

Media > News (1.00)
Law > Litigation (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.78)

Washington Post - Technology NewsApr-30-2024, 17:24:53 GMT

8 major newspapers join legal backlash against OpenAI, Microsoft

The publications were joined in the suit by South Florida's Sun Sentinel, the Denver Post, Orange County (Calif.) The lawsuit alleges that OpenAI and Microsoft used their news articles to train and run their AI tools, including OpenAI's ChatGPT. All eight newspapers are owned by New York City-based hedge fund Alden Global Capital.

large language model, machine learning, natural language, (6 more...)

Washington Post - Technology News

Country:

North America > United States > Illinois > Cook County > Chicago (0.40)
North America > United States > New York (0.38)
North America > United States > Florida (0.38)
North America > United States > California > Orange County (0.38)

Industry:

Media > News (1.00)
Banking & Finance > Trading (0.82)
Law > Litigation (0.74)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

MIT Technology ReviewApr-30-2024, 09:23:07 GMT

My deepfake shows how valuable our data is in the age of AI

Synthesia has managed to create AI avatars that are remarkably humanlike after only one year of tinkering with the latest generation of generative AI. It's equally exciting and daunting thinking about where this technology is going. It will soon be very difficult to differentiate between what is real and what is not, and this is a particularly acute threat given the record number of elections happening around the world this year. We are not ready for what is coming. If people become too skeptical about the content they see, they might stop believing in anything at all, which could enable bad actors to take advantage of this trust vacuum and lie about the authenticity of real content.

artificial intelligence, deepfake show, machine learning, (10 more...)

MIT Technology Review

Country: Europe > Sweden > Uppsala County > Uppsala (0.06)

Industry: Information Technology > Security & Privacy (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.59)

Adelani, David Ifeoluwa, Doğruöz, A. Seza, Shode, Iyanuoluwa, Aremu, Anuoluwapo

Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages

Naija is the Nigerian-Pidgin spoken by approx. 120M speakers in Nigeria and it is a mixed language (e.g., English, Portuguese and Indigenous languages). Although it has mainly been a spoken language until recently, there are currently two written genres (BBC and Wikipedia) in Naija. Through statistical analyses and Machine Translation experiments, we prove that these two genres do not represent each other (i.e., there are linguistic differences in word order and vocabulary) and Generative AI operates only based on Naija written in the BBC genre. In other words, Naija written in Wikipedia genre is not represented in Generative AI.

bbc genre, naija, wikipedia genre, (10 more...)

2404.19442

Country:

Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Africa > West Africa (0.04)
Africa > Nigeria > Plateau State > Jos (0.04)
(12 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.89)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.81)

Buongiorno, Steph, Clark, Corey

A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications

External knowledge graphs (KGs) can be used to augment large language models (LLMs), while simultaneously providing an explainable knowledge base of facts that can be inspected by a human. This approach may be particularly valuable in domains where explainability is critical, like human trafficking data analysis. However, creating KGs can pose challenges. KGs parsed from documents may comprise explicit connections (those directly stated by a document) but miss implicit connections (those obvious to a human although not directly stated). To address these challenges, this preliminary research introduces the GAME-KG framework, standing for "Gaming for Augmenting Metadata and Enhancing Knowledge Graphs." GAME-KG is a federated approach to modifying explicit as well as implicit connections in KGs by using crowdsourced feedback collected through video games. GAME-KG is shown through two demonstrations: a Unity test scenario from Dark Shadows, a video game that collects feedback on KGs parsed from US Department of Justice (DOJ) Press Releases on human trafficking, and a following experiment where OpenAI's GPT-4 is prompted to answer questions based on a modified and unmodified KG. Initial results suggest that GAME-KG can be an effective framework for enhancing KGs, while simultaneously providing an explainable set of structured facts verified by humans.

computation gaming, information, knowledge, (13 more...)

2404.19729

Country:

North America > United States > Texas > Dallas County > Dallas (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)
(3 more...)

Genre: Research Report (0.70)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.71)

Social Life Simulation for Non-Cognitive Skills Learning

Yan, Zihan, Xiang, Yaohong, Huang, Yun

Non-cognitive skills are crucial for personal and social life well-being, and such skill development can be supported by narrative-based (e.g., storytelling) technologies. While generative AI enables interactive and role-playing storytelling, little is known about how users engage with and perceive the use of AI in social life simulation for non-cognitive skills learning. To this end, we introduced SimuLife++, an interactive platform enabled by a large language model (LLM). The system allows users to act as protagonists, creating stories with one or multiple AI-based characters in diverse social scenarios. In particular, we expanded the Human-AI interaction to a Human-AI-AI collaboration by including a sage agent, who acts as a bystander to provide users with more insightful perspectives on their choices and conversations. Through a within-subject user study, we found that the inclusion of the sage agent significantly enhanced narrative immersion, according to the narrative transportation scale, leading to more messages, particularly in group chats. Participants' interactions with the sage agent were also associated with significantly higher scores in their perceived motivation, self-perceptions, and resilience and coping, indicating positive impacts on non-cognitive skills reflection. Participants' interview results further explained the sage agent's aid in decision-making, solving ethical dilemmas, and problem-solving; on the other hand, they suggested improvements in user control and balanced responses from multiple characters. We provide design implications on the application of generative AI in narrative solutions for non-cognitive skill development in broader social contexts.

participant, sage agent, storytelling, (10 more...)

2405.00273

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(13 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Education (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.68)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
(2 more...)

Interrante-Grant, Alexander, Davis, Andy, Preslier, Heather, Leek, Tim

On Training a Neural Network to Explain Binaries

In this work, we begin to investigate the possibility of training a deep neural network on the task of binary code understanding. Specifically, the network would take, as input, features derived directly from binaries and output English descriptions of functionality to aid a reverse engineer in investigating the capabilities of a piece of closed-source software, be it malicious or benign. Given recent success in applying large language models (generative AI) to the task of source code summarization, this seems a promising direction. However, in our initial survey of the available datasets, we found nothing of sufficiently high quality and volume to train these complex models. Instead, we build our own dataset derived from a capture of Stack Overflow containing 1.1M entries. A major result of our work is a novel dataset evaluation method using the correlation between two distances on sample pairs: one distance in the embedding space of inputs and the other in the embedding space of outputs. Intuitively, if two samples have inputs close in the input embedding space, their outputs should also be close in the output embedding space. We found this Embedding Distance Correlation (EDC) test to be highly diagnostic, indicating that our collected dataset and several existing open-source datasets are of low quality as the distances are not well correlated. We proceed to explore the general applicability of EDC, applying it to a number of qualitatively known good datasets and a number of synthetically known bad ones and found it to be a reliable indicator of dataset value.

correlation, dataset, explanation, (14 more...)

2404.19631

Country: North America > United States > Massachusetts > Middlesex County > Lexington (0.04)

Genre: Research Report > Experimental Study (0.69)

Industry:

Government > Regional Government (0.69)
Information Technology > Security & Privacy (0.68)
Government > Military (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)