
Collaborating Authors: laion


Into the LAION's Den: Investigating Hate in Multimodal Datasets

Neural Information Processing Systems

'Scale the model, scale the data, scale the compute' is the reigning sentiment in the world of generative AI today. While the impact of model scaling has been extensively studied, we are only beginning to scratch the surface of data scaling and its consequences. This is of especially critical importance in the context of vision-language datasets such as LAION. These datasets are continually growing in size and are built from large-scale internet dumps such as Common Crawl, which is known to have numerous drawbacks spanning quality, legality, and content. The datasets then serve as the backbone for large generative models, contributing to the operationalization and perpetuation of harmful societal and historical biases and stereotypes. In this paper, we investigate the effect of dataset scaling on hateful content through a comparative audit of two datasets: LAION-400M and LAION-2B.
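
A minimal sketch of what such a comparative audit might look like, assuming the pysentimiento hate-speech analyzer as the scoring tool and placeholder caption lists standing in for alt-text sampled from each dataset:

    # Hypothetical audit sketch: compare the rate at which an
    # off-the-shelf hate-speech classifier flags caption samples
    # drawn from each dataset snapshot.
    from pysentimiento import create_analyzer

    analyzer = create_analyzer(task="hate_speech", lang="en")

    samples = {
        "LAION-400M": ["placeholder alt-text 1", "placeholder alt-text 2"],
        "LAION-2B": ["placeholder alt-text 1", "placeholder alt-text 2"],
    }

    for name, captions in samples.items():
        # predict() returns a multi-label result; a non-empty label
        # list means the caption was flagged (e.g. hateful, aggressive).
        flagged = sum(bool(analyzer.predict(c).output) for c in captions)
        print(f"{name}: {flagged}/{len(captions)} captions flagged")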


Topological Perspectives on Optimal Multimodal Embedding Spaces

Abdul Aziz A. B., A. B. Abdul Rahim

arXiv.org Artificial Intelligence

Recent strides in multimodal model development have ignited a paradigm shift in the realm of text-to-image generation. Among these advancements, CLIP stands out as a remarkable achievement: a sophisticated dual-encoder model adept at encoding both textual and visual information within a unified latent space. This paper delves into a comparative analysis between CLIP and its recent counterpart, CLOOB. To unravel the intricate distinctions within the embedding spaces crafted by these models, we employ topological data analysis. Our approach encompasses a comprehensive examination of the drivers of the modality gap, the clustering structures existing across both high and low dimensions, and the pivotal role that dimension collapse plays in shaping their respective embedding spaces. Empirical experiments substantiate the implications of our analyses on downstream performance across various contextual scenarios. Through this investigation, we aim to shed light on the nuanced intricacies that underlie the comparative efficacy of CLIP and CLOOB, offering insights into their respective strengths and weaknesses, and providing a foundation for further refinement and advancement in multimodal model research.
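
The modality gap mentioned above can be probed directly from a model's embeddings. Below is a minimal sketch, assuming the open_clip package and a couple of illustrative (image, caption) pairs; it uses one common formulation of the gap, the distance between the centroids of the normalized image and text embeddings:

    # Sketch: estimate the modality gap of a CLIP-style model as the
    # distance between the image- and text-embedding centroids.
    import torch
    from PIL import Image
    import open_clip

    model, _, preprocess = open_clip.create_model_and_transforms(
        "ViT-B-32", pretrained="laion2b_s34b_b79k")
    tokenizer = open_clip.get_tokenizer("ViT-B-32")

    # Illustrative pairs; substitute real files and captions.
    pairs = [("cat.jpg", "a photo of a cat"),
             ("dog.jpg", "a photo of a dog")]

    with torch.no_grad():
        images = torch.stack([preprocess(Image.open(p)) for p, _ in pairs])
        img_emb = model.encode_image(images)
        txt_emb = model.encode_text(tokenizer([c for _, c in pairs]))

    # Normalize so both modalities lie on the unit hypersphere, then
    # compare the two cluster centers.
    img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
    txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
    gap = (img_emb.mean(0) - txt_emb.mean(0)).norm().item()
    print(f"modality gap (centroid distance): {gap:.4f}")

The same measurement applies unchanged to a CLOOB checkpoint, which is what makes a side-by-side comparison of the two embedding spaces straightforward.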


Can AI image generators be policed to prevent explicit deepfakes of children?

The Guardian

Child abusers are creating AI-generated "deepfakes" of their targets in order to blackmail them into filming their own abuse, beginning a cycle of sextortion that can last for years. Creating simulated child abuse imagery is illegal in the UK, and Labour and the Conservatives have aligned on the desire to ban all explicit AI-generated images of real people. But there is little global agreement on how the technology should be policed. Worse, no matter how strongly governments take action, the creation of more images will always be a press of a button away – explicit imagery is built into the foundations of AI image generation. In December, researchers at Stanford University made a disturbing discovery: buried among the billions of images making up one of the largest training sets for AI image generators were hundreds, maybe thousands, of instances of child sexual abuse material (CSAM).


AI image generators trained on pictures of child sexual abuse, study finds

The Guardian

Hidden inside the foundation of popular artificial intelligence (AI) image generators are thousands of images of child sexual abuse, according to new research published on Wednesday. The operators of some of the largest and most-used sets of images utilized to train AI shut off access to them in response to the study. The Stanford Internet Observatory found more than 3,200 images of suspected child sexual abuse in the giant AI database LAION, an index of online images and captions that's been used to train leading AI image-makers such as Stable Diffusion. The watchdog group based at Stanford University worked with the Canadian Centre for Child Protection and other anti-abuse charities to identify the illegal material and report the original photo links to law enforcement. More than 1,000 of the suspected images were confirmed as child sexual abuse material.


See inside the stereotyping machines pushing American bias across the internet

Washington Post - Technology News

Artificial intelligence image tools have a tendency to spin up disturbing clichés, such as the notion that Asian women are hypersexual. These stereotypes don't reflect the real world; they stem from the data that trains the technology. Grabbed from the internet, these troves can be toxic -- rife with pornography, misogyny, violence and bigotry. Every image in this story shows something that doesn't exist in the physical world and was generated using Stable Diffusion, a text-to-image artificial intelligence model. Stability AI, maker of the popular image generator Stable Diffusion XL, told The Washington Post it had made a significant investment in reducing bias in its latest model, which was released in July.


EU urged to protect grassroots AI research or risk losing out to US

The Guardian

The EU has been warned that it risks handing control of artificial intelligence to US tech firms if it does not act to protect grassroots research in its forthcoming AI bill. In an open letter coordinated by the German research group Laion, or Large-scale AI Open Network, the European parliament was told that "one-size-fits-all" rules risked eliminating open research and development. "Rules that require a researcher or developer to monitor or control downstream use could make it impossible to release open-source AI in Europe," which would "entrench large firms" and "hamper efforts to improve transparency, reduce competition, limit academic freedom, and drive investment in AI overseas", the letter says. It adds: "Europe cannot afford to lose AI sovereignty. Eliminating open-source R&D will leave the European scientific community and economy critically dependent on a handful of foreign and proprietary firms for essential AI infrastructure."


The future of AI relies on a high school teacher's free database

The Japan Times

In front of a suburban house on the outskirts of the northern German city of Hamburg, a single word -- "LAION" -- is scrawled in pencil across a mailbox. It's the only indication that the home belongs to the person behind a massive data-gathering effort central to the artificial intelligence boom that has seized the world's attention. That person is high school teacher Christoph Schuhmann, and LAION, short for "Large-scale AI Open Network," is his passion project. When Schuhmann isn't teaching physics and computer science to German teens, he works with a small team of volunteers building the world's biggest free AI training dataset, which has already been used in text-to-image generators such as Google's Imagen and Stable Diffusion. Databases like LAION are central to AI text-to-image generators, which rely on them for the enormous amounts of visual material used to deconstruct and create new images.


OpenCLIP for Image Search and Automatic Captioning

#artificialintelligence

I have been using and writing about OpenAI's CLIP system since it came out in 2021 [1]. It consists of image and text encoding models that can be used for various forms of cross-modal comparison, like using a text query to find the best matching image in a library quickly. In December 2022, an independent group of researchers known as LAION released a paper called "Reproducible scaling laws for contrastive language-image learning" [2] that describes how they first reimplemented and trained a model similar to CLIP and then experimented with improving the system by training with a larger dataset and using new ML techniques. They call their new model OpenCLIP. In this article, I will provide some background info on the original CLIP, describe how LAION improved the model, and show some results from my experiments with the two systems using images from the Library of Congress's Flickr photostream.
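
As a concrete illustration of the text-to-image search workflow described above, here is a minimal sketch assuming the open_clip package and a local folder of library images (the folder path, query string, and model name are illustrative, not the article's exact setup):

    # Sketch: rank a small image library against a text query by
    # cosine similarity of OpenCLIP embeddings.
    import glob
    import torch
    from PIL import Image
    import open_clip

    model, _, preprocess = open_clip.create_model_and_transforms(
        "ViT-B-32", pretrained="laion2b_s34b_b79k")
    tokenizer = open_clip.get_tokenizer("ViT-B-32")

    paths = sorted(glob.glob("library/*.jpg"))  # placeholder folder

    with torch.no_grad():
        images = torch.stack([preprocess(Image.open(p)) for p in paths])
        index = model.encode_image(images)
        index = index / index.norm(dim=-1, keepdim=True)

        query = model.encode_text(tokenizer(["a lighthouse at dusk"]))
        query = query / query.norm(dim=-1, keepdim=True)

    scores = (index @ query.T).squeeze(1)  # cosine similarity per image
    best = scores.argmax().item()
    print(f"best match: {paths[best]} (score={scores[best].item():.3f})")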


A ChatGPT Alternative Is Now Available As Open Source

#artificialintelligence

What will RLHF-based PaLM apps be able to accomplish? As the model scales up, performance keeps improving across tasks, creating more opportunities. PaLM can be scaled to as many as 540 billion parameters; GPT-3, by comparison, has about 175 billion. Now the first open-source ChatGPT equivalent appears to have arrived.