AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

'Hey Siri, can you win the AI race?' How Apple Intelligence could be a game-changer.

Christian Science Monitor | ScienceJun-12-2024, 00:48:56 GMT

In rebranding artificial intelligence as Apple Intelligence, Apple Inc is banking on the idea that people by and large won't buy the powerful A.I. software that its rivals are developing. Instead, they'll want really cool hardware that incorporates A.I. It's a compelling but risky strategy for a company that specializes in seamlessly integrating hardware and software into must-have products. "It's the next big step for Apple," Apple CEO Tim Cook said Monday in unveiling Apple Intelligence at the company's developers conference. Apple is diving into artificial intelligence – focused on the idea of a "virtual personal assistant" - as a potential must-have app for consumers. Since it lacks its own cutting-edge version of the predictive, sounds-like-a-human technology known as generative A.I., Apple will license that technology from other companies, starting with OpenAI.

artificial intelligence, machine learning, natural language, (7 more...)

Christian Science Monitor | Science

Industry: Information Technology (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.63)

Add feedback

Standard Language Ideology in AI-Generated Language

Smith, Genevieve, Fleisig, Eve, Bossi, Madeline, Rustagi, Ishita, Yin, Xavier

arXiv.org Artificial IntelligenceJun-12-2024

In this position paper, we explore standard language ideology in language generated by large language models (LLMs). First, we outline how standard language ideology is reflected and reinforced in LLMs. We then present a taxonomy of open problems regarding standard language ideology in AI-generated language with implications for minoritized language communities. We introduce the concept of standard AI-generated language ideology, the process by which AI-generated language regards Standard American English (SAE) as a linguistic default and reinforces a linguistic bias that SAE is the most "appropriate" language. Finally, we discuss tensions that remain, including reflecting on what desirable system behavior looks like, as well as advantages and drawbacks of generative AI tools imitating--or often not--different English language varieties. Throughout, we discuss standard language ideology as a manifestation of existing global power structures in and through AI-generated language before ending with questions to move towards alternative, more emancipatory digital futures.

language ideology, language variety, minoritized variety, (12 more...)

arXiv.org Artificial Intelligence

2406.08726

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > California > Alameda County > Berkeley (0.05)
Europe > United Kingdom > England > Greater London > London (0.05)
(14 more...)

Genre: Research Report (0.50)

Industry:

Media (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.51)

Add feedback

Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation

Tang, Raphael, Zhang, Xinyu, Xu, Lixinyu, Lu, Yao, Li, Wenyan, Stenetorp, Pontus, Lin, Jimmy, Ture, Ferhan

arXiv.org Artificial IntelligenceJun-12-2024

Diffusion models are the state of the art in text-to-image generation, but their perceptual variability remains understudied. In this paper, we examine how prompts affect image variability in black-box diffusion-based models. We propose W1KP, a human-calibrated measure of variability in a set of images, bootstrapped from existing image-pair perceptual distances. Current datasets do not cover recent diffusion models, thus we curate three test sets for evaluation. Our best perceptual distance outperforms nine baselines by up to 18 points in accuracy, and our calibration matches graded human judgements 78% of the time. Using W1KP, we study prompt reusability and show that Imagen prompts can be reused for 10-50 random seeds before new images become too similar to already generated images, while Stable Diffusion XL and DALL-E 3 can be reused 50-200 times. Lastly, we analyze 56 linguistic features of real prompts, finding that the prompt's length, CLIP embedding norm, concreteness, and word senses influence variability most. As far as we are aware, we are the first to analyze diffusion variability from a visuolinguistic perspective. Our project page is at http://w1kp.com

arXiv.org Artificial Intelligence

2406.08482

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.68)

Industry: Transportation (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.51)

Add feedback

Advancing High Resolution Vision-Language Models in Biomedicine

Chen, Zekai, Pekis, Arda, Brown, Kevin

arXiv.org Artificial IntelligenceJun-12-2024

Multi-modal learning has significantly advanced generative AI, especially in vision-language modeling. Innovations like GPT-4V and open-source projects such as LLaVA have enabled robust conversational agents capable of zero-shot task completions. However, applying these technologies in the biomedical field presents unique challenges. Recent initiatives like LLaVA-Med have started to adapt instruction-tuning for biomedical contexts using large datasets such as PMC-15M. Our research offers three key contributions: (i) we present a new instruct dataset enriched with medical image-text pairs from Claude3-Opus and LLaMA3 70B, (ii) we propose a novel image encoding strategy using hierarchical representations to improve fine-grained biomedical visual comprehension, and (iii) we develop the Llama3-Med model, which achieves state-of-the-art zero-shot performance on biomedical visual question answering benchmarks, with an average performance improvement of over 10% compared to previous methods. These advancements provide more accurate and reliable tools for medical professionals, bridging gaps in current multi-modal conversational assistants and promoting further innovations in medical AI.

dataset, llama3-med, llava-med, (14 more...)

arXiv.org Artificial Intelligence

2406.09454

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion

Cazenavette, George, Sud, Avneesh, Leung, Thomas, Usman, Ben

arXiv.org Artificial IntelligenceJun-12-2024

Due to the high potential for abuse of GenAI systems, the task of detecting synthetic images has recently become of great interest to the research community. Unfortunately, existing image-space detectors quickly become obsolete as new high-fidelity text-to-image models are developed at blinding speed. In this work, we propose a new synthetic image detector that uses features obtained by inverting an open-source pre-trained Stable Diffusion model. We show that these inversion features enable our detector to generalize well to unseen generators of high visual fidelity (e.g., DALL-E 3) even when the detector is trained only on lower fidelity fake images generated via Stable Diffusion. This detector achieves new state-of-the-art across multiple training and evaluation setups. Moreover, we introduce a new challenging evaluation protocol that uses reverse image search to mitigate stylistic and thematic biases in the detector evaluation. We show that the resulting evaluation scores align well with detectors' in-the-wild performance, and release these datasets as public benchmarks for future research.

dataset, detector, real image, (15 more...)

arXiv.org Artificial Intelligence

2406.08603

Country:

Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
North America > United States > Massachusetts (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.53)

Add feedback

Tailoring Generative AI Chatbots for Multiethnic Communities in Disaster Preparedness Communication: Extending the CASA Paradigm

Zhao, Xinyan, Sun, Yuan, Liu, Wenlin, Wong, Chau-Wai

arXiv.org Artificial IntelligenceJun-12-2024

This study is among the first to develop different prototypes of generative AI (GenAI) chatbots powered by GPT 4 to communicate hurricane preparedness information to diverse residents. Drawing from the Computers Are Social Actors (CASA) paradigm and the literature on disaster vulnerability and cultural tailoring, this study conducted a between-subjects experiment with 441 Black, Hispanic, and Caucasian residents of Florida. A computational analysis of chat logs (N = 7,848) shows that anthropomorphism and personalization are key communication topics in GenAI chatbot-user interactions. SEM results (N = 441) suggest that GenAI chatbots varying in tone formality and cultural tailoring significantly predict bot perceptions and, subsequently, hurricane preparedness outcomes. These results highlight the potential of using GenAI chatbots to improve diverse communities' disaster preparedness.

chatbot, communication, information, (14 more...)

arXiv.org Artificial Intelligence

2406.08411

Country:

North America > United States > North Carolina > Orange County > Chapel Hill (0.04)
North America > United States > New York (0.04)
North America > United States > Connecticut > Fairfield County > Westport (0.04)
Asia > Japan (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (0.93)

Industry:

Health & Medicine (1.00)
Government (0.93)
Information Technology (0.93)
Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.85)

Add feedback

Global AI Governance in Healthcare: A Cross-Jurisdictional Regulatory Analysis

Chakraborty, Attrayee, Karhade, Mandar

arXiv.org Artificial IntelligenceJun-12-2024

Artificial Intelligence (AI) is being adopted across the world and promises a new revolution in healthcare. While AI-enabled medical devices in North America dominate 42.3% of the global market, the use of AI-enabled medical devices in other countries is still a story waiting to be unfolded. We aim to delve deeper into global regulatory approaches towards AI use in healthcare, with a focus on how common themes are emerging globally. We compare these themes to the World Health Organization's (WHO) regulatory considerations and principles on ethical use of AI for healthcare applications. Our work seeks to take a global perspective on AI policy by analyzing 14 legal jurisdictions including countries representative of various regions in the world (North America, South America, South East Asia, Middle East, Africa, Australia, and the Asia-Pacific). Our eventual goal is to foster a global conversation on the ethical use of AI in healthcare and the regulations that will guide it. We propose solutions to promote international harmonization of AI regulations and examine the requirements for regulating generative AI, using China and Singapore as examples of countries with well-developed policies in this area.

healthcare, intelligence, regulation, (15 more...)

arXiv.org Artificial Intelligence

2406.08695

Country:

Asia > Singapore (0.35)
Oceania > Australia (0.25)
Asia > East Asia (0.24)
(19 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Law > Statutes (1.00)
Law > Civil Rights & Constitutional Law (1.00)
Information Technology > Security & Privacy (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Applied AI (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.49)

Add feedback

Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification

Bucher, Martin Juan José, Martini, Marco

arXiv.org Artificial IntelligenceJun-12-2024

Generative AI offers a simple, prompt-based alternative to fine-tuning smaller BERT-style LLMs for text classification tasks. This promises to eliminate the need for manually labeled training data and task-specific model training. However, it remains an open question whether tools like ChatGPT can deliver on this promise. In this paper, we show that smaller, fine-tuned LLMs (still) consistently and significantly outperform larger, zero-shot prompted models in text classification. We compare three major generative AI models (ChatGPT with GPT-3.5/GPT-4 and Claude Opus) with several fine-tuned LLMs across a diverse set of classification tasks (sentiment, approval/disapproval, emotions, party positions) and text categories (news, tweets, speeches). We find that fine-tuning with application-specific training data achieves superior performance in all cases. To make this approach more accessible to a broader audience, we provide an easy-to-use toolkit alongside this paper. Our toolkit, accompanied by non-technical step-by-step guidance, enables users to select and fine-tune BERT-like LLMs for any classification task with minimal technical and computational effort.

classification, classification task, training data, (13 more...)

arXiv.org Artificial Intelligence

2406.0866

Country:

Europe > United Kingdom (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (0.93)
Law (0.93)
Government > Regional Government > Europe Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.92)

Add feedback

Elon Musk drops lawsuit against OpenAI

Washington Post - Technology NewsJun-11-2024, 23:21:59 GMT

Musk originally filed his lawsuit at the beginning of March, arguing that OpenAI had breached its commitment to early investors and the public to build AI for the benefit of humanity when it began making money. At the time, OpenAI executives blamed Musk's lawsuit on his not being a part of the company as it was seeing massive success. Musk did not immediately respond to a request for comment Tuesday. A spokesperson for OpenAI declined to comment.

large language model, lawsuit, machine learning, (6 more...)

Washington Post - Technology News

Industry: Law > Litigation (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Elon Musk Drops Suit Accusing OpenAI of Breaching Founding Mission

TIME - TechJun-11-2024, 22:47:47 GMT

Elon Musk dropped a lawsuit alleging OpenAI and its chief executive officer Sam Altman breached a founding promise last year by prioritizing profits over humanity. The billionaire withdrew his complaint a day before a California judge was set to hear OpenAI's request for dismissal. Musk had accused the company of becoming a "de facto subsidiary" of Microsoft Corp. in violation of a founding agreement to be a non-profit that developed artificial intelligence "for the benefit of humanity." OpenAI and Musk have been engaged in a well-publicized battle since well before the court case. Musk was an early backer of the startup and part of its founding team, before he had a falling out with the company.

large language model, machine learning, natural language, (8 more...)

TIME - Tech

Country: North America > United States > California > San Francisco County > San Francisco (0.08)

Industry:

Law > Litigation (0.61)
Information Technology (0.41)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback