Questioning the Survey Responses of Large Language Models

Neural Information Processing Systems

Surveys have recently gained popularity as a tool to study large language models. By comparing models' survey responses to those of different human reference populations, researchers aim to infer the demographics, political opinions, or values best represented by current language models. In this work, we critically examine language models' survey responses on the basis of the well-established American Community Survey by the U.S. Census Bureau. Evaluating 43 different language models using de-facto standard prompting methodologies, we establish two dominant patterns. First, models' responses are governed by ordering and labeling biases, for example, towards survey responses labeled with the letter "A".


The Download: OpenAI is building a fully automated researcher, and a psychedelic trial blind spot

MIT Technology Review

Plus: OpenAI is also creating a super app. OpenAI has a new grand challenge: building an AI researcher--a fully automated agent-based system capable of tackling large, complex problems by itself. The San Francisco firm said the new goal will be its "north star" for the next few years. By September, the company plans to build "an autonomous AI research intern" that can take on a small number of specific research problems. The intern will be the precursor to the fully automated multi-agent system, which is slated to debut in 2028. In an exclusive interview this week, OpenAI's chief scientist, Jakub Pachocki, talked me through the plans.


OpenAI is throwing everything into building a fully automated researcher

MIT Technology Review

OpenAI is refocusing its research efforts and throwing its resources into a new grand challenge. The San Francisco firm has set its sights on building what it calls an AI researcher, a fully automated agent-based system that will be able to go off and tackle large, complex problems by itself. OpenAI says that this new research goal will be its "North Star" for the next few years, pulling together multiple research strands, including work on reasoning models, agents, and interpretability.


Blue Origin also wants to put AI data centers in space

Engadget

It filed a request with the FCC to deploy almost 52,000 satellites. Blue Origin has revealed its plans for an orbital AI data center system in a new filing with the Federal Communications Commission. The company has asked the agency for permission to deploy 51,600 satellites. Called Project Sunrise, the initiative aims to launch and operate a constellation of satellites that can deliver computing capacity for artificial intelligence uses. Project Sunrise's satellites will be placed in sun-synchronous orbits at altitudes between 311 and 1,118 miles.


MALT Powers Up Adversarial Attacks

Neural Information Processing Systems

Current adversarial attacks for multi-class classifiers choose potential adversarial target classes naively based on the classifier's confidence levels. We present a novel adversarial targeting method, MALT - Mesoscopic Almost Linearity Targeting, based on local almost linearity assumptions. Our attack wins over the current state-of-the-art AutoAttack on the standard benchmark datasets CIFAR-100 and ImageNet and for different robust models. In particular, our attack uses a five times faster attack strategy than AutoAttack's while successfully matching AutoAttack's successes and attacking additional samples that were previously out of reach. We additionally prove formally and demonstrate empirically that our targeting method, although inspired by linear predictors, also applies to non-linear models.


China Approves the First Brain Chips for Sale--and Has a Plan to Dominate the Industry

WIRED

While the United States and Europe are moving cautiously forward with clinical trials, China is racing toward the commercialization of brain implants. China has made history by becoming the first nation to approve a commercially available brain chip to treat a disability. NEO, the implant developed by Neuracle Medical Technology, translates the thoughts of a person with paralysis into movements of an assistive robotic hand. After 18 months of testing that proved its safety, China's National Medical Products Administration authorized the implant for people aged 19 to 60 with paralysis caused by neck or spinal cord injuries that prevent them from moving their limbs. According to Nature, the implant embedded in the skull is about the size of a coin.


NEWT GINGRICH, JASON HAYES: There's a nuclear solution to recharging American industry

FOX News

Small modular reactors and microreactors could power AI data centers and factories, but outdated rules and public fears are stalling America's nuclear energy future.


A Clarinetist, a High School Student, and Some Climate Deniers Write a Science Paper

Mother Jones



Machine learning framework to predict global imperilment status of freshwater fish

AIHub

Researchers spent five years developing an AI-based model to protect freshwater fish worldwide from extinction, with a particular focus on identifying threats to fish before they become endangered. "People sometimes go in to protect species when it's already too late," said Ivan Arismendi, an associate professor in Oregon State University's Department of Fisheries, Wildlife, and Conservation Sciences. "With our model, decision makers can deploy resources in advance before a species becomes imperiled." The findings were recently published in the journal Nature Communications. Nearly one-third of freshwater fish species face possible extinction, threatening food supplies, ecosystems and outdoor recreation.


Resident Evil at 30: how Capcom's horror opus has survived

The Guardian

To many of us playing and writing about video games in the 1990s, Resident Evil seemed to come out of nowhere. The emerging PlayStation and Saturn consoles were all about slick, bright arcade conversions - the shiny thrills of Daytona and Tekken - and Japanese publisher Capcom was in a rut of coin-op conversions and endless sequels to Street Fighter and Mega Man. Scary games were rare at the time and mostly confined to the PC. So when the news of a horror title named Biohazard (the Japanese name for the series) started to emerge in 1995, it caught the attention of games journalists as it seemed radically out of step with prevailing trends.