Goto

Collaborating Authors

 prc



EditDistanceRobustWatermarksforLanguage Models

Neural Information Processing Systems

Earlier schemes could only handle stochastic substitutions anddeletions, andthus weareaiming foramore natural and appealing robustness guarantee that holds with respect to editdistance.



Ideal Attribution and Faithful Watermarks for Language Models

Song, Min Jae, Shahabi, Kameron

arXiv.org Machine Learning

We introduce ideal attribution mechanisms, a formal abstraction for reasoning about attribution decisions over strings. At the core of this abstraction lies the ledger, an append-only log of the prompt-response interaction history between a model and its user. Each mechanism produces deterministic decisions based on the ledger and an explicit selection criterion, making it well-suited to serve as a ground truth for attribution. We frame the design goal of watermarking schemes as faithful representation of ideal attribution mechanisms. This novel perspective brings conceptual clarity, replacing piecemeal probabilistic statements with a unified language for stating the guarantees of each scheme. It also enables precise reasoning about desiderata for future watermarking schemes, even when no current construction achieves them, since the ideal functionalities are specified first. In this way, the framework provides a roadmap that clarifies which guarantees are attainable in an idealized setting and worth pursuing in practice.



Pose Estimation of a Cable-Driven Serpentine Manipulator Utilizing Intrinsic Dynamics via Physical Reservoir Computing

Tanaka, Kazutoshi, Takahashi, Tomoya, Hamaya, Masashi

arXiv.org Artificial Intelligence

Cable-driven serpentine manipulators hold great potential in unstructured environments, offering obstacle avoidance, multi-directional force application, and a lightweight design. By placing all motors and sensors at the base and employing plastic links, we can further reduce the arm's weight. To demonstrate this concept, we developed a 9-degree-of-freedom cable-driven serpentine manipulator with an arm length of 545 mm and a total mass of only 308 g. However, this design introduces flexibility-induced variations, such as cable slack, elongation, and link deformation. These variations result in discrepancies between analytical predictions and actual link positions, making pose estimation more challenging. To address this challenge, we propose a physical reservoir computing based pose estimation method that exploits the manipulator's intrinsic nonlinear dynamics as a high-dimensional reservoir. Experimental results show a mean pose error of 4.3 mm using our method, compared to 4.4 mm with a baseline long short-term memory network and 39.5 mm with an analytical approach. This work provides a new direction for control and perception strategies in lightweight cable-driven serpentine manipulators leveraging their intrinsic dynamics.


One Size Fits None: Rethinking Fairness in Medical AI

Roller, Roland, Hahn, Michael, Ravichandran, Ajay Madhavan, Osmanodja, Bilgin, Oetke, Florian, Sassi, Zeineb, Burchardt, Aljoscha, Netter, Klaus, Budde, Klemens, Herrmann, Anne, Strapatsas, Tobias, Dabrock, Peter, Möller, Sebastian

arXiv.org Artificial Intelligence

Machine learning (ML) models are increasingly used to support clinical decision-making. However, real-world medical datasets are often noisy, incomplete, and imbalanced, leading to performance disparities across patient subgroups. These differences raise fairness concerns, particularly when they reinforce existing disadvantages for marginalized groups. In this work, we analyze several medical prediction tasks and demonstrate how model performance varies with patient characteristics. While ML models may demonstrate good overall performance, we argue that subgroup-level evaluation is essential before integrating them into clinical workflows. By conducting a performance analysis at the subgroup level, differences can be clearly identified-allowing, on the one hand, for performance disparities to be considered in clinical practice, and on the other hand, for these insights to inform the responsible development of more effective models. Thereby, our work contributes to a practical discussion around the subgroup-sensitive development and deployment of medical ML models and the interconnectedness of fairness and transparency.


Outperformance Score: A Universal Standardization Method for Confusion-Matrix-Based Classification Performance Metrics

Zhao, Ningsheng, Bui, Trang, Yu, Jia Yuan, Dzieciolowski, Krzysztof

arXiv.org Machine Learning

Many classification performance metrics exist, each suited to a specific application. However, these metrics often differ in scale and can exhibit varying sensitivity to class imbalance rates in the test set. As a result, it is difficult to use the nominal values of these metrics to interpret and evaluate classification performances, especially when imbalance rates vary. To address this problem, we introduce the outperformance score function, a universal standardization method for confusion-matrix-based classification performance (CMBCP) metrics. It maps any given metric to a common scale of $[0,1]$, while providing a clear and consistent interpretation. Specifically, the outperformance score represents the percentile rank of the observed classification performance within a reference distribution of possible performances. This unified framework enables meaningful comparison and monitoring of classification performance across test sets with differing imbalance rates. We illustrate how the outperformance scores can be applied to a variety of commonly used classification performance metrics and demonstrate the robustness of our method through experiments on real-world datasets spanning multiple classification applications.


White House: US will lead in AI, but China is catching up

FOX News

Kurt'CyberGuy' Knutsson on President-elect Trump's plan to deregulate cryptocurrency and A.I. in his second administration. EXCLUSIVE: China's innovation in artificial intelligence is "accelerating," according to Michael Kratsios, director of the White House Office of Science and Technology. He told Fox News Digital that the United States' "promote and protect" strategy will solidify its standing as the world's dominant power in AI. Kratsios, who served as chief technology officer during the first Trump administration, sat for an exclusive interview with Fox News Digital on Monday. FLASHBACK: US TECHNOLOGY CHIEF WARNS CHINA'TWISTING' ARTIFICIAL INTELLIGENCE TO TARGET CRITICS, AS AMERICA JOINS GLOBAL PACT "The White House in the first Trump administration redefined national tech policy to focus on American leadership in emerging technologies, and those were technologies like artificial intelligence, quantum computing and 5G, [which] were big back then," Kratsios said.


Securing the AI future: How President Trump's action plan can position America for success

FOX News

The Trump administration is prioritizing the critical role of artificial intelligence in creating and upholding freedom. Just three weeks in, Vice President JD Vance declared at a global AI summit in Paris that AI "will make people more productive, more prosperous, and more free. The United States of America is the leader in AI, and our administration plans to keep it that way." To achieve this, the White House is working toward an AI action plan and calling on leading American AI companies to submit our best ideas. OpenAI is pleased to submit proposals today on a range of important considerations for AI from national security, to infrastructure and energy, to the federal government's own use of AI.