AITopics | Generative AI

Collaborating Authors

Generative AI

News Overviews Instructional Materials AI-Alerts Classics

Contrastive Language-Vision AI Models Pretrained on Web-Scraped Multimodal Data Exhibit Sexual Objectification Bias

Wolfe, Robert, Yang, Yiwei, Howe, Bill, Caliskan, Aylin

arXiv.org Artificial IntelligenceMay-15-2023

Nine language-vision AI models trained on web scrapes with the Contrastive Language-Image Pretraining (CLIP) objective are evaluated for evidence of a bias studied by psychologists: the sexual objectification of girls and women, which occurs when a person's human characteristics, such as emotions, are disregarded and the person is treated as a body. We replicate three experiments in psychology quantifying sexual objectification and show that the phenomena persist in AI. A first experiment uses standardized images of women from the Sexual OBjectification and EMotion Database, and finds that human characteristics are disassociated from images of objectified women: the model's recognition of emotional state is mediated by whether the subject is fully or partially clothed. Embedding association tests (EATs) return significant effect sizes for both anger (d >0.80) and sadness (d >0.50), associating images of fully clothed subjects with emotions. GRAD-CAM saliency maps highlight that CLIP gets distracted from emotional expressions in objectified images. A second experiment measures the effect in a representative application: an automatic image captioner (Antarctic Captions) includes words denoting emotion less than 50% as often for images of partially clothed women than for images of fully clothed women. A third experiment finds that images of female professionals (scientists, doctors, executives) are likely to be associated with sexual descriptions relative to images of male professionals. A fourth experiment shows that a prompt of "a [age] year old girl" generates sexualized images (as determined by an NSFW classifier) up to 73% of the time for VQGAN-CLIP and Stable Diffusion; the corresponding rate for boys never surpasses 9%. The evidence indicates that language-vision AI models trained on web scrapes learn biases of sexual objectification, which propagate to downstream applications.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3593013.3594072

2212.11261

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Unveiling the Latent Space Geometry of Push-Forward Generative Models

Issenhuth, Thibaut, Tanielian, Ugo, Mary, Jérémie, Picard, David

arXiv.org Artificial IntelligenceMay-15-2023

Many deep generative models are defined as a push-forward of a Gaussian measure by a continuous generator, such as Generative Adversarial Networks (GANs) or Variational Auto-Encoders (VAEs). This work explores the latent space of such deep generative models. A key issue with these models is their tendency to output samples outside of the support of the target distribution when learning disconnected distributions. We investigate the relationship between the performance of these models and the geometry of their latent space. Building on recent developments in geometric measure theory, we prove a sufficient condition for optimality in the case where the dimension of the latent space is larger than the number of modes. Through experiments on GANs, we demonstrate the validity of our theoretical results and gain new insights into the latent space geometry of these models. Additionally, we propose a truncation method that enforces a simplicial cluster structure in the latent space and improves the performance of GANs.

artificial intelligence, latent space, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2207.10541

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.44)

Add feedback

Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems

Stappen, Lukas, Dillmann, Jeremy, Striegel, Serena, Vögel, Hans-Jörg, Flores-Herr, Nicolas, Schuller, Björn W.

arXiv.org Artificial IntelligenceMay-15-2023

This paper aims to serve as a comprehensive guide for researchers and practitioners, offering insights into the current state, potential applications, and future research directions for generative artificial intelligence and foundation models within the context of intelligent vehicles. As the automotive industry progressively integrates AI, generative artificial intelligence technologies hold the potential to revolutionize user interactions, delivering more immersive, intuitive, and personalised in-car experiences. We provide an overview of current applications of generative artificial intelligence in the automotive domain, emphasizing speech, audio, vision, and multimodal interactions. We subsequently outline critical future research areas, including domain adaptability, alignment, multimodal integration and others, as well as, address the challenges and risks associated with ethics. By fostering collaboration and addressing these research areas, generative artificial intelligence can unlock its full potential, transforming the driving experience and shaping the future of intelligent vehicles.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2305.17137

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Middle East > Jordan (0.04)
(5 more...)

Genre:

Overview (0.88)
Research Report (0.83)

Industry:

Transportation > Ground > Road (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Facebook pivoted to the metaverse. Now it wants to show off its AI.

Washington Post - Technology NewsMay-14-2023, 12:02:37 GMT

In February, the company announced it was forming a new product group to "turbocharge" its use of generative AI. The group, led by former Apple executive Ahmed Al-Dahle, aims to bring together key teams from research and consumer-focused groups to create new products, according to the company. Zuckerberg has said he expects to build generative AI-powered chat experiences in WhatsApp and Messenger as well as innovations in business messaging and customer support.

facebook pivoted, metaverse

Washington Post - Technology News

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Microsoft's Satya Nadella Doesn't Think Now Is the Time to Stop on AI

TIME - TechMay-14-2023, 11:00:12 GMT

The last year has been characterized by a rush of new artificial intelligence (AI) programs being released into the world since OpenAI, a lab backed by Microsoft, launched ChatGPT in November 2022. Both Microsoft and Google rolled out products in March that they say will use AI to transform work, and IBM's CEO Arvind Krishna said the company's AI tool will be able to reduce 30 to 50% of repetitive office work. Since taking the helm at Microsoft in 2014, at a time when its market dominance with traditional software offerings was waning, Satya Nadella has focused on ensuring the company remains relevant. . The company has invested heavily in Azure, its cloud computing platform, and in AI, pouring at least $13 billion in the leading lab OpenAI. Microsoft's share price has risen nearly tenfold since Nadella became CEO, outperforming the S&P 500, which has merely doubled its value over the same time.

conséquence, economic growth, microsoft, (16 more...)

TIME - Tech

Country:

Europe (0.16)
North America > United States (0.05)

Industry:

Information Technology (1.00)
Banking & Finance > Trading (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.55)

Add feedback

The Curious Case of the Missing Google Assistant

WIREDMay-14-2023, 11:00:00 GMT

Google executives hosted the company's I/O developer conference this week, an annual ritual that has in recent years centered on artificial intelligence. With OpenAI's ChatGPT and Microsoft's Bing chatbot seen as challenging Google's search domination, Google CEO Sundar Pichai seemed intent on projecting the message that his company is still the leader in AI--and is speeding up deployment of the technology. Google's own chatty large language model, Bard, was the headliner, and it is now publicly available in 180 countries. Following along behind came about a dozen generative AI product features and experiments that can do things like help programmers write code, draft emails, or generate speaker notes for Google Slides presentations. But hardly a word was said about Google Assistant, the clunkily named and voice-centric AI assistant that was the company's previous AI champion.

curious case, google assistant, missing google assistant, (9 more...)

WIRED

Industry: Information Technology > Services (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.63)

Add feedback

Toyota Leaked Vehicle Data of 2 Million Customers

WIREDMay-13-2023, 13:00:00 GMT

SafeGraph, the data broker famous for selling location data linked to abortion clinic visits, is now a US military contractor. Documents obtained by WIRED reveal that the company landed an initial contract with the US Air Force and is hoping the Pentagon will buy a tool that SafeGraph says will pinpoint locations not to bomb, like schools and hospitals. Your data is, of course, everywhere--likely including in the training data of generative AI tools like ChatGPT. Fortunately, at least some users can request that OpenAI, which created the tool, delete their data. It's also possible to delete your chat history with ChatGPT.

large language model, machine learning, natural language, (13 more...)

WIRED

Country:

North America > United States (1.00)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.06)
Asia > Japan (0.06)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.79)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.57)

Add feedback

AI chatbots aren't search engines. They're crypto bros

PCWorldMay-12-2023, 13:00:00 GMT

Over the last few months, AI chatbots have exploded in popularity off the surging success of OpenAI's revolutionary ChatGPT--which, amazingly, only burst onto the scene around December. But when Microsoft seized the opportunity to hitch its wagon to OpenAI's rising star for a steep $10 billion dollars, it chose to do so by introducing a GPT-4-powered chatbot under the guise of Bing, its swell-but-also-ran search engine, in a bid to upend Google's search dominance. Google quickly followed suit with its own homegrown Bard AI and unleashed plans to put AI answers before traditional search results, an utterly monumental alteration to one of the most significant places on the Internet. Both are touted as experiments. And these "AI chatbots" are truly wondrous advancements--I've spent many nights with my kids joyously creating fantastic stuff-of-your-dreams artwork with Bing Chat's Dall-E integration and prompting sick raps about wizards who think lizards are the source of all magic, and seeing them come to life in mere moments with these fantastic tools.

chatbot, search engine, turley, (14 more...)

PCWorld

Country: North America > United States (0.05)

Industry:

Law (0.73)
Banking & Finance > Trading (0.42)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.78)

Add feedback

Anthropic says its Claude AI can now read a whole book in under a minute

EngadgetMay-12-2023, 12:01:14 GMT

Anthropic says it has vastly expanded the amount of information its generative AI, Claude, is able to process. Claude has gone from having a limit of 9,000 tokens to 100,000 tokens, which corresponds to roughly 75,000 words. To put that into perspective, Claude now has the ability to easily read and finish Ernest Hemingway's A Farewell to Arms (74,240 words), Mary Shelley's Frankenstein (74,800 words) and Mark Twain's The Adventures of Tom Sawyer (69,000 words). And, as The Verge notes, the company says Claude can read and analyze information from each book in under a minute. Generative AIs like Claude are still limited by the number of "tokens" they can process.

anthropic, claude, claude ai, (2 more...)

Engadget

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.61)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.43)

Add feedback

Google Is Opening the AI Floodgates

WIREDMay-12-2023, 12:00:00 GMT

Google would like you to know that it has been at the forefront of machine intelligence for decades, actually. Never mind that it was beaten to the generative AI hype party by the likes of OpenAI and Microsoft Bing, because Google has big plans. At its I/O developer conference this week, in addition to announcing some new hardware (including a folding phone), Google turned on the firehose of AI. During a two-hour presentation, the company showed how it's busily building generative technologies into nearly everything it does. Chatbots, text generators, and content creation tools will soon be embedded in Google's devices, search pages, Android apps, and Google's Workspace suite of productivity apps like Gmail, Docs, and Sheets.

ai floodgate, gadget lab, google, (2 more...)

WIRED

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.59)

Add feedback