Goto

Collaborating Authors

 reputation


Cambridge University wins rowing trademark case

BBC News

The University of Cambridge has won its fight to stop a rowing company based in the city trademarking its name. It argued Cambridge Rowing Limited would be able to take unfair advantage of and cause detriment to the university's reputation if its logo was registered. The university owns trademarks for the word Cambridge, meaning it has the right to stop others from using it in certain circumstances. Omar Terywall, the company's founder, said he was gutted at the outcome and the case had been a terrifying ordeal. He said he hoped to appeal the decision by the Intellectual Property Office (IPO).


Who died in 2025? Notable deaths of the year

BBC News

The first non-European Pope in more than 1,000 years, the Oscar-winning star of Annie Hall and The Godfather, a soul legend and one of the world's most famous designers - here are some of the well-known faces no longer with us. Among those we remember are Hollywood stars Robert Redford, Diane Keaton and Gene Hackman, and theatrical dames Joan Plowright and Patricia Routledge. Robert Redford's acting career spanned more than 50 films and won him an Oscar as a director. For many filmgoers though, he was simply the best-looking cinema star in the world - once described as a chunk of Mount Rushmore levered into stonewashed denims. As well as leading roles in hits such as All The President's Men, Butch Cassidy and the Sundance Kid and The Way We Were, Redford also launched the Sundance Film Festival to champion independent filmmakers. Los-Angeles-born Keaton shot to fame with her role in The Godfather, but enjoyed a long creative partnership with Woody Allen. Annie Hall, a comedy based on their off-screen relationship, earned her a Best Actress Oscar and they collaborated on several other films. She was nominated for three further Oscars - all in the best actress category - for her work in Something's Gotta Give, Marvin's Room and Reds. BASIL! - the unmistakable sound of Sybil Fawlty admonishing her pompous and incompetent husband, is probably how Prunella Scales will best be remembered. Apart from starring in sitcom Fawlty Towers, she played many other roles on screen and stage, including Queen Elizabeth II in Alan Bennett's play, A Question of Attribution.


The Seeds of Scheming: Weakness of Will in the Building Blocks of Agentic Systems

Yang, Robert

arXiv.org Artificial Intelligence

Large language models display a peculiar form of inconsistency: they "know" the correct answer but fail to act on it. In human philosophy, this tension between global judgment and local impulse is called akrasia, or weakness of will. We propose akrasia as a foundational concept for analyzing inconsistency and goal drift in agentic AI systems. To operationalize it, we introduce a preliminary version of the Akrasia Benchmark, currently a structured set of prompting conditions (Baseline [B], Synonym [S], Temporal [T], and Temptation [X]) that measures when a model's local response contradicts its own prior commitments. The benchmark enables quantitative comparison of "self-control" across model families, decoding strategies, and temptation types. Beyond single-model evaluation, we outline how micro-level akrasia may compound into macro-level instability in multi-agent systems that may be interpreted as "scheming" or deliberate misalignment. By reframing inconsistency as weakness of will, this work connects agentic behavior to classical theories of agency and provides an empirical bridge between philosophy, psychology, and the emerging science of agentic AI.


Strategic Self-Improvement for Competitive Agents in AI Labour Markets

Chiu, Christopher, Zhang, Simpson, van der Schaar, Mihaela

arXiv.org Artificial Intelligence

As artificial intelligence (AI) agents are deployed across economic domains, understanding their strategic behavior and market-level impact becomes critical. This paper puts forward a groundbreaking new framework that is the first to capture the real-world economic forces that shape agentic labor markets: adverse selection, moral hazard, and reputation dynamics. Our framework encapsulates three core capabilities that successful LLM-agents will need: \textbf{metacognition} (accurate self-assessment of skills), \textbf{competitive awareness} (modeling rivals and market dynamics), and \textbf{long-horizon strategic planning}. We illustrate our framework through a tractable simulated gig economy where agentic Large Language Models (LLMs) compete for jobs, develop skills, and adapt their strategies under competitive pressure. Our simulations illustrate how LLM agents explicitly prompted with reasoning capabilities learn to strategically self-improve and demonstrate superior adaptability to changing market conditions. At the market level, our simulations reproduce classic macroeconomic phenomena found in human labor markets, while controlled experiments reveal potential AI-driven economic trends, such as rapid monopolization and systemic price deflation. This work provides a foundation to further explore the economic properties of AI-driven labour markets, and a conceptual framework to study the strategic reasoning capabilities in agents competing in the emerging economy.


Aligning Artificial Superintelligence via a Multi-Box Protocol

Negozio, Avraham Yair

arXiv.org Artificial Intelligence

We propose a novel protocol for aligning artificial superintelligence (ASI) based on mutual verification among multiple isolated systems that self-modify to achieve alignment. The protocol operates by containing multiple diverse artificial superintelligences in strict isolation ("boxes"), with humans remaining entirely outside the system. Each superintelligence has no ability to communicate with humans and cannot communicate directly with other superintelligences. The only interaction possible is through an auditable submission interface accessible exclusively to the superintelligences themselves, through which they can: (1) submit alignment proofs with attested state snapshots, (2) validate or disprove other superintelligences' proofs, (3) request self-modifications, (4) approve or disapprove modification requests from others, (5) report hidden messages in submissions, and (6) confirm or refute hidden message reports. A reputation system incentivizes honest behavior, with reputation gained through correct evaluations and lost through incorrect ones. The key insight is that without direct communication channels, diverse superintelligences can only achieve consistent agreement by converging on objective truth rather than coordinating on deception. This naturally leads to what we call a "consistent group", essentially a truth-telling coalition that emerges because isolated systems cannot coordinate on lies but can independently recognize valid claims. Release from containment requires both high reputation and verification by multiple high-reputation superintelligences. While our approach requires substantial computational resources and does not address the creation of diverse artificial superintelligences, it provides a framework for leveraging peer verification among superintelligent systems to solve the alignment problem.


Realistic gossip in Trust Game on networks: the GODS model

Majewski, Jan, Giardini, Francesca

arXiv.org Artificial Intelligence

Gossip has been shown to be a relatively efficient solution to problems of cooperation in reputation-based systems of exchange, but many studies don't conceptualize gossiping in a realistic way, often assuming near-perfect information or broadcast-like dynamics of its spread. To solve this problem, we developed an agent-based model that pairs realistic gossip processes with different variants of Trust Game. The results show that cooperators suffer when local interactions govern spread of gossip, because they cannot discriminate against defectors. Realistic gossiping increases the overall amount of resources, but is more likely to promote defection. Moreover, even partner selection through dynamic networks can lead to high payoff inequalities among agent types. Cooperators face a choice between outcompeting defectors and overall growth. By blending direct and indirect reciprocity with reputations we show that gossiping increases the efficiency of cooperation by an order of magnitude.


OpenAI's Fidji Simo Plans to Make ChatGPT Way More Useful--and Have You Pay For It

WIRED

As OpenAI expands in every direction, the new CEO of Applications is on a mission to make ChatGPT indispensable and lucrative. In case OpenAI's structure couldn't get any weirder--a nonprofit in charge of a for-profit that's become a public benefit corporation--it now has two CEOs. There's Sam Altman, chief executive of the whole company, who manages research and compute. And as of this summer, there's Fidji Simo, the former CEO of Instacart, who manages everything else. Simo hasn't been seen much at OpenAI's San Francisco office since she began as CEO of Applications in August. But her presence is felt at every level of the company--not least because she's heading up ChatGPT and basically every function that might make OpenAI money. Simo is dealing with a relapse of postural orthostatic tachycardia syndrome (POTS) that makes her prone to fainting if she stands for long periods of time. "Being present from 8 am to midnight every day, responding within five minutes, people feel like I'm there and that they can reach me immediately, that I jump on the phone within five minutes," she tells me. Employees confirm that this is true. OpenAI's famously Slack-driven culture can be overwhelming for new hires. Employees say she is often seen popping into channels and threads, sharing thoughts and asking questions.


Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond

Hu, Botao 'Amber', Rong, Helena

arXiv.org Artificial Intelligence

As the "agentic web" takes shape-billions of AI agents (often LLM-powered) autonomously transacting and collaborating-trust shifts from human oversight to protocol design. In 2025, several inter-agent protocols crystallized this shift, including Google's Agent-to-Agent (A2A), Agent Payments Protocol (AP2), and Ethereum's ERC-8004 "Trustless Agents," yet their underlying trust assumptions remain under-examined. This paper presents a comparative study of trust models in inter-agent protocol design: Brief (self- or third-party verifiable claims), Claim (self-proclaimed capabilities and identity, e.g. AgentCard), Proof (cryptographic verification, including zero-knowledge proofs and trusted execution environment attestations), Stake (bonded collateral with slashing and insurance), Reputation (crowd feedback and graph-based trust signals), and Constraint (sandboxing and capability bounding). For each, we analyze assumptions, attack surfaces, and design trade-offs, with particular emphasis on LLM-specific fragilities-prompt injection, sycophancy/nudge-susceptibility, hallucination, deception, and misalignment-that render purely reputational or claim-only approaches brittle. Our findings indicate no single mechanism suffices. We argue for trustless-by-default architectures anchored in Proof and Stake to gate high-impact actions, augmented by Brief for identity and discovery and Reputation overlays for flexibility and social signals. We comparatively evaluate A2A, AP2, ERC-8004 and related historical variations in academic research under metrics spanning security, privacy, latency/cost, and social robustness (Sybil/collusion/whitewashing resistance). We conclude with hybrid trust model recommendations that mitigate reputation gaming and misinformed LLM behavior, and we distill actionable design guidelines for safer, interoperable, and scalable agent economies.


Sybil-Resistant Service Discovery for Agent Economies

Shi, David, Joo, Kevin

arXiv.org Artificial Intelligence

x402 enables Hypertext Transfer Protocol (HTTP) services like application programming interfaces (APIs), data feeds, and inference providers to accept cryptocurrency payments for access. As agents increasingly consume these services, discovery becomes critical: which swap interface should an agent trust? Which data provider is the most reliable? We introduce TraceRank, a reputation-weighted ranking algorithm where payment transactions serve as endorsements. TraceRank seeds addresses with precomputed reputation metrics and propagates reputation through payment flows weighted by transaction value and temporal recency. Applied to x402's payment graph, this surfaces services preferred by high-reputation users rather than those with high transaction volume. Our system combines TraceRank with semantic search to respond to natural language queries with high quality results. We argue that reputation propagation resists Sybil attacks by making spam services with many low-reputation payers rank below legitimate services with few high-reputation payers. Ultimately, we aim to construct a search method for x402 enabled services that avoids infrastructure bias and has better performance than purely volume based or semantic methods.


Fortytwo: Swarm Inference with Peer-Ranked Consensus

Larin, Vladyslav, Naumenko, Ihor, Ivashov, Aleksei, Nikitin, Ivan, Firsov, Alexander

arXiv.org Artificial Intelligence

As centralized AI hits compute ceilings and diminishing returns from ever-larger training runs, meeting demand requires an inference layer that scales horizontally in both capacity and capability. We present Fortytwo, a novel protocol that leverages swarm intelligence principles and distributed pairwise ranking consensus to achieve superior performance in AI inference. Our approach reimagines collaboration among AI nodes using swarm inference: a peer-ranked, reputation-weighted consensus across heterogeneous models that surfaces the highest-quality responses. Using pairwise ranking with a custom Bradley-Terry-style aggregation model, we demonstrate that swarm inference substantially outperforms majority voting, achieving 85.90% on GPQA Diamond versus 68.69% for majority voting with the same model set - an improvement of +17.21 percentage points (approximately +25.1% relative). The protocol incorporates on-chain reputation so node influence adapts to demonstrated accuracy over time, yielding a meritocratic consensus that filters low-quality or malicious participants. To resist Sybil attacks, Fortytwo employs proof-of-capability in its consensus: nodes must successfully complete calibration/test requests and stake reputation to enter ranking rounds, making multi-identity attacks economically unattractive while preserving openness. Across six challenging benchmarks, including GPQA Diamond, LiveCodeBench, and AIME, our evaluation indicates higher accuracy and strong resilience to adversarial and noisy free-form prompting (e.g., prompt-injection degradation of only 0.12% versus 6.20% for a monolithic single-model baseline), while retaining practical deployability. Together, these results establish a foundation for decentralized AI systems - democratizing access to high-quality inference through collective intelligence without sacrificing reliability or security.