Media
From shrimp Jesus to erotic tractors: how viral AI slop took over the internet
Clockwise from top left: Shrimp Jesus, Nayib Bukele, Justin Bieber and Super Cat League. Clockwise from top left: Shrimp Jesus, Nayib Bukele, Justin Bieber and Super Cat League. In the algorithm-driven economy of 2025, one man's shrimp Jesus is another man's side hustle. AI slop - the low-quality, surreal content flooding social media platforms, designed to farm views - is a phenomenon, some would say the phenomenon of the 2024 and 2025 internet. Merriam-Webster's word of the year this year is "slop", referring exclusively to the internet variety.
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Existing full text datasets of U.S. public domain newspapers do not recognize the often complex layouts of newspaper scans, and as a result the digitized content scrambles texts from articles, headlines, captions, advertisements, and other layout regions. OCR quality can also be low. This study develops a novel, deep learning pipeline for extracting full article texts from newspaper images and applies it to the nearly 20 million scans in Library of Congress's public domain Chronicling America collection. The pipeline includes layout detection, legibility classification, custom OCR, and association of article texts spanning multiple bounding boxes. To achieve high scalability, it is built with efficient architectures designed for mobile phones. The resulting American Stories dataset provides high quality data that could be used for pre-training a large language model to achieve better understanding of historical English and historical world knowledge. The dataset could also be added to the external database of a retrieval-augmented language model to make historical information - ranging from interpretations of political events to minutiae about the lives of people's ancestors - more widely accessible. Furthermore, structured article texts facilitate using transformer-based methods for popular social science applications like topic classification, detection of reproduced content, and news story clustering. Finally, American Stories provides a massive silver quality dataset for innovating multimodal layout analysis models and other multimodal applications.
Constructing Non-isotropic Gaussian Diffusion Model Using Isotropic Gaussian Diffusion Model for Image Editing
Score-based diffusion models (SBDMs) have achieved state-of-the-art results in image generation. In this paper, we propose a Non-isotropic Gaussian Diffusion Model (NGDM) for image editing, which requires editing the source image while preserving the image regions irrelevant to the editing task. We construct NGDM by adding independent Gaussian noises with different variances to different image pixels.
Both of these influencers are successful - but only one is human
In some ways, Gigi is like any other young social media influencer. With perfect hair and makeup, she logs on and talks to her fans. She shares clips: eating, doing skin care, putting on lipstick. She even has a cute baby who appears in some videos. But after a few seconds, something may seem a little off.