Goto

Collaborating Authors

 Law



Inside the Messy, Accidental Kryptos Reveal

WIRED

After 35 years, the secretive CIA sculpture finally gave up its mystery, thanks to a novelist, a playwright, and some misplaced documents. But the chase to decode continues. Jim Sanborn couldn't believe it. He was weeks away from auctioning off the answer to Kryptos, the sculpture he created for the CIA that had defied solution for 35 years. As always, wannabe solvers kept on paying him a $50 fee to offer their guesses to the remaining unsolved portion of the 1,800-character encrypted message, known as K4--wrong without exception.


Armed police in US handcuff teen after AI mistakes crisp packet for gun

BBC News

A US teenager was handcuffed by armed police after an artificial intelligence (AI) system mistakenly said he was carrying a gun - when really he was holding a packet of crisps. Police showed up, like eight cop cars, and then they all came out with guns pointed at me talking about getting on the ground, 16-year-old Baltimore pupil Taki Allen told local outlet WMAR-2 News . Baltimore County Police Department said their officers responded appropriately and proportionally based on the information provided at the time. It said the AI alert was sent to human reviewers who found no threat - but the principal missed this and contacted the school's safety team, who ultimately called the police. But the incident has prompted calls by some for the schools' procedures around the use of such technology to be reviewed.


Clippy is BACK! Microsoft's paperclip mascot delights users as it returns - 18 years after it was axed from Office

Daily Mail - Science & tech

European diplomats reveal the'tough guy' US negotiator leading the charge on Greenland: 'He hates us' A former Marine was unmasked as the'Zodiac killer' after a bombshell new investigation. I suffered a horrific side effect of a drug used by millions of Americans... and my face'melted off' The ICE backlash isn't the end of Kristi Noem It may have just saved her career FedEx driver accused of abducting and killing little girl while delivering her Christmas present says he shouldn't be executed because he has autism Senator accused of steamy affair with her bodyguard in bombshell lawsuit from his WIFE: 'Bring MDMA so I can guide you' Hunter Biden's stripper baby mama asks for him to be ARRESTED over claims he is still failing to pay her child support Family of Tyler Robinson's transgender lover speaks out for first time since Charlie Kirk assassination and reveals where he is now Dodgers agree with Kyle Tucker'on $240m deal' as champs beat out Mets, Blue Jays for top free agent World's sexiest hockey star and OnlyFans model Mikayla Demaiter spills out of little dress in latest post Nicole Richie addresses her daughter's new identity after unveiling transformation on her 18th birthday Trump gushes over'young beautiful' hockey players and teases rebranding of famed presidential wall Trump's AG secretary sparks mockery with tone-deaf $3 dinner advice as food costs soar Karoline Leavitt reveals the thinking behind Trump's call to cancel elections Microsoft's paperclip mascot delights users as it returns - 18 years after it was axed from Office It was the original virtual assistant, released years before Siri, Alexa, and Bixby. Now, almost two decades after it was axed, Microsoft's Clippy is officially back. The friendly anthropomorphic paper clip has been spotted as an Easter egg in Microsoft's latest announcement about a new AI companion called Mico. Mico - whose name is a nod to Microsoft Copilot - is a small blob with a friendly smiley face, and doesn't look much like its much-loved predecessor.


The Download: carbon removal's future, and measuring pain using an app

MIT Technology Review

Plus: Meta's lawyers advised staff to remove parts of their research After years of growth that spawned hundreds of startups, the nascent carbon removal sector appears to be facing a reckoning. Running Tide, a promising aquaculture company, shut down its operations last summer, and a handful of other companies have shuttered, downsized, or pivoted in recent months as well. And the collective industry hasn't made a whole lot more progress toward Running Tide's ambitious plans to sequester a billion tons of carbon dioxide by this year. The hype phase is over and the sector is sliding into the turbulent business trough that follows, experts warn. And the open question is: If the carbon removal sector is heading into a painful if inevitable clearing-out cycle, where will it go from there? This story is part of MIT Technology Review's What's Next series, which looks across industries, trends, and technologies to give you a first look at the future.



The 'Surge' of Troops May Not Come to San Francisco, but the City Is Ready Anyway

WIRED

The'Surge' of Troops May Not Come to San Francisco, but the City Is Ready Anyway San Francisco is preparing for federal law enforcement's invasion of the Bay Area, whether it happens or not. Citizens protesting the threat of federal troop deployments in the San Francisco Bay Area held a rally on Thursday at SF City Hall. After months of deployments by US Immigration and Customs Enforcement and the National Guard across American cities, federal agents have been preparing to descend into San Francisco . Local resistance groups have been coordinating with activists in other cities across the country that have been besieged by federal law enforcement. Thousands of volunteers, coordinating through Signal group chats, Zoom calls, and social media posts, planned protests and spread the word that federal troops are on their way to San Francisco.


Relative-Based Scaling Law for Neural Language Models

arXiv.org Artificial Intelligence

Scaling laws aim to accurately predict model performance across different scales. Existing scaling-law studies almost exclusively rely on cross-entropy as the evaluation metric. However, cross-entropy provides only a partial view of performance: it measures the absolute probability assigned to the correct token, but ignores the relative ordering between correct and incorrect tokens. Yet, relative ordering is crucial for language models, such as in greedy-sampling scenario. To address this limitation, we investigate scaling from the perspective of relative ordering. We first propose the Relative-Based Probability (RBP) metric, which quantifies the probability that the correct token is ranked among the top predictions. Building on this metric, we establish the Relative-Based Scaling Law, which characterizes how RBP improves with increasing model size. Through extensive experiments on four datasets and four model families spanning five orders of magnitude, we demonstrate the robustness and accuracy of this law. Finally, we illustrate the broad application of this law with two examples, namely providing a deeper explanation of emergence phenomena and facilitating finding fundamental theories of scaling laws. In summary, the Relative-Based Scaling Law complements the cross-entropy perspective and contributes to a more complete understanding of scaling large language models. Thus, it offers valuable insights for both practical development and theoretical exploration.


Machine Unlearning under Overparameterization

arXiv.org Artificial Intelligence

Machine unlearning algorithms aim to remove the influence of specific training samples, ideally recovering the model that would have resulted from training on the remaining data alone. We study unlearning in the overparameterized setting, where many models interpolate the data, and defining the solution as any loss minimizer over the retained set$\unicode{x2013}$as in prior work in the underparameterized setting$\unicode{x2013}$is inadequate, since the original model may already interpolate the retained data and satisfy this condition. In this regime, loss gradients vanish, rendering prior methods based on gradient perturbations ineffective, motivating both new unlearning definitions and algorithms. For this setting, we define the unlearning solution as the minimum-complexity interpolator over the retained data and propose a new algorithmic framework that only requires access to model gradients on the retained set at the original solution. We minimize a regularized objective over perturbations constrained to be orthogonal to these model gradients, a first-order relaxation of the interpolation condition. For different model classes, we provide exact and approximate unlearning guarantees and demonstrate that an implementation of our framework outperforms existing baselines across various unlearning experiments.


LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

arXiv.org Artificial Intelligence

Legal consultation is essential for safeguarding individual rights and ensuring access to justice, yet remains costly and inaccessible to many individuals due to the shortage of professionals. While recent advances in Large Language Models (LLMs) offer a promising path toward scalable, low-cost legal assistance, current systems fall short in handling the interactive and knowledge-intensive nature of real-world consultations. To address these challenges, we introduce LeCoDe, a real-world multi-turn benchmark dataset comprising 3,696 legal consultation dialogues with 110,008 dialogue turns, designed to evaluate and improve LLMs' legal consultation capability. With LeCoDe, we innovatively collect live-streamed consultations from short-video platforms, providing authentic multi-turn legal consultation dialogues. The rigorous annotation by legal experts further enhances the dataset with professional insights and expertise. Furthermore, we propose a comprehensive evaluation framework that assesses LLMs' consultation capabilities in terms of (1) clarification capability and (2) professional advice quality. This unified framework incorporates 12 metrics across two dimensions. Through extensive experiments on various general and domain-specific LLMs, our results reveal significant challenges in this task, with even state-of-the-art models like GPT-4 achieving only 39.8% recall for clarification and 59% overall score for advice quality, highlighting the complexity of professional consultation scenarios. Based on these findings, we further explore several strategies to enhance LLMs' legal consultation abilities. Our benchmark contributes to advancing research in legal domain dialogue systems, particularly in simulating more real-world user-expert interactions.