AITopics | reputation

Collaborating Authors

reputation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

693e00827fd44bdfca210801fe1e6439-Paper-Position_Paper_Track.pdf

Neural Information Processing SystemsJun-18-2026, 02:04:10 GMT

The meteoric rise of Artificial Intelligence (AI), with its rapidly expanding market capitalization, presents both transformative opportunities and critical challenges. Chief among these is the urgent need for a new, unified paradigm for trustworthy evaluation, as current benchmarks increasingly reveal critical vulnerabilities. Issues like data contamination and selective reporting by model developers fuel hype, while inadequate data quality control can lead to biased evaluations that, even if unintentionally, may favor specific approaches. As a flood of participants enters the AI space, this "Wild West" of assessment makes distinguishing genuine progress from exaggerated claims exceptionally difficult. Such ambiguity blurs scientific signals and erodes public confidence, much as unchecked claims would destabilize financial markets reliant on credible oversight from agencies like Moody's. In high-stakes human examinations (e.g., SAT, GRE), substantial effort is devoted to ensuring fairness and credibility; why settle for less in evaluating AI, especially given its profound societal impact? This position paper argues that a laissezfaire approach is untenable. For true and sustainable AI advancement, we call for a paradigm shift to a unified, live, and quality-controlled benchmarking framework--robust by construction rather than reliant on courtesy or goodwill.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.28)

Industry:

Social Sector (0.66)
Information Technology > Security & Privacy (0.46)
Banking & Finance > Trading (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

163 surrendered rats seek new homes in Massachusetts

Popular ScienceMar-6-2026, 16:02:00 GMT

'Rats have a bad reputation, but they actually make really great companion pets.' Rats are much more clean than their reputation suggests. Breakthroughs, discoveries, and DIY tips sent six days a week. A non-profit organization in Massachusetts received a boatload of pet rats in need of new homes. An individual in northeastern Massachusetts surrendered 163 rats in early February. That's almost 60 percent more than the total number of rats that were adopted from the Massachusetts Society for the Prevention of Cruelty to Animals-Angell (MSPCA-Angell) in 2025 alone.

artificial intelligence, mspca-angell, physics popular science video space, (11 more...)

Popular Science

Country:

North America > United States > Massachusetts (1.00)
North America > United States > New Hampshire (0.07)
Oceania > Australia > South Australia (0.05)
(5 more...)

Genre: Research Report > New Finding (0.36)

Industry: Media > Photography (0.32)

Technology: Information Technology > Artificial Intelligence (0.36)

Add feedback

Cambridge University wins rowing trademark case

BBC NewsFeb-10-2026, 06:13:28 GMT

The University of Cambridge has won its fight to stop a rowing company based in the city trademarking its name. It argued Cambridge Rowing Limited would be able to take unfair advantage of and cause detriment to the university's reputation if its logo was registered. The university owns trademarks for the word Cambridge, meaning it has the right to stop others from using it in certain circumstances. Omar Terywall, the company's founder, said he was gutted at the outcome and the case had been a terrifying ordeal. He said he hoped to appeal the decision by the Intellectual Property Office (IPO).

artificial intelligence, cambridge rowing, university, (10 more...)

BBC News

Country:

North America (1.00)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.91)

Industry: Law > Intellectual Property & Technology Law (1.00)

Technology:

Information Technology > Communications (0.32)
Information Technology > Artificial Intelligence (0.31)

Add feedback

Who died in 2025? Notable deaths of the year

BBC NewsDec-31-2025, 06:01:53 GMT

The first non-European Pope in more than 1,000 years, the Oscar-winning star of Annie Hall and The Godfather, a soul legend and one of the world's most famous designers - here are some of the well-known faces no longer with us. Among those we remember are Hollywood stars Robert Redford, Diane Keaton and Gene Hackman, and theatrical dames Joan Plowright and Patricia Routledge. Robert Redford's acting career spanned more than 50 films and won him an Oscar as a director. For many filmgoers though, he was simply the best-looking cinema star in the world - once described as a chunk of Mount Rushmore levered into stonewashed denims. As well as leading roles in hits such as All The President's Men, Butch Cassidy and the Sundance Kid and The Way We Were, Redford also launched the Sundance Film Festival to champion independent filmmakers. Los-Angeles-born Keaton shot to fame with her role in The Godfather, but enjoyed a long creative partnership with Woody Allen. Annie Hall, a comedy based on their off-screen relationship, earned her a Best Actress Oscar and they collaborated on several other films. She was nominated for three further Oscars - all in the best actress category - for her work in Something's Gotta Give, Marvin's Room and Reds. BASIL! - the unmistakable sound of Sybil Fawlty admonishing her pompous and incompetent husband, is probably how Prunella Scales will best be remembered. Apart from starring in sitcom Fawlty Towers, she played many other roles on screen and stage, including Queen Elizabeth II in Alan Bennett's play, A Question of Attribution.

actress, getty image, novelist, (15 more...)

BBC News

Country:

Europe > France (0.46)
North America > United States > California > Los Angeles County > Los Angeles (0.24)
Europe > United Kingdom > Northern Ireland (0.14)
(30 more...)

Genre:

Personal > Obituary (0.88)
Personal > Honors (0.68)

Industry:

Media > Film (1.00)
Leisure & Entertainment > Sports > Soccer (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
(2 more...)

Technology:

Information Technology > Communications (0.46)
Information Technology > Artificial Intelligence (0.46)

Add feedback

The Seeds of Scheming: Weakness of Will in the Building Blocks of Agentic Systems

Yang, Robert

arXiv.org Artificial IntelligenceDec-8-2025

Large language models display a peculiar form of inconsistency: they "know" the correct answer but fail to act on it. In human philosophy, this tension between global judgment and local impulse is called akrasia, or weakness of will. We propose akrasia as a foundational concept for analyzing inconsistency and goal drift in agentic AI systems. To operationalize it, we introduce a preliminary version of the Akrasia Benchmark, currently a structured set of prompting conditions (Baseline [B], Synonym [S], Temporal [T], and Temptation [X]) that measures when a model's local response contradicts its own prior commitments. The benchmark enables quantitative comparison of "self-control" across model families, decoding strategies, and temptation types. Beyond single-model evaluation, we outline how micro-level akrasia may compound into macro-level instability in multi-agent systems that may be interpreted as "scheming" or deliberate misalignment. By reframing inconsistency as weakness of will, this work connects agentic behavior to classical theories of agency and provides an empirical bridge between philosophy, psychology, and the emerging science of agentic AI.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2512.05449

Country:

North America > United States (0.93)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.50)

Industry: Law (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.97)

Add feedback

Strategic Self-Improvement for Competitive Agents in AI Labour Markets

Chiu, Christopher, Zhang, Simpson, van der Schaar, Mihaela

arXiv.org Artificial IntelligenceDec-5-2025

As artificial intelligence (AI) agents are deployed across economic domains, understanding their strategic behavior and market-level impact becomes critical. This paper puts forward a groundbreaking new framework that is the first to capture the real-world economic forces that shape agentic labor markets: adverse selection, moral hazard, and reputation dynamics. Our framework encapsulates three core capabilities that successful LLM-agents will need: \textbf{metacognition} (accurate self-assessment of skills), \textbf{competitive awareness} (modeling rivals and market dynamics), and \textbf{long-horizon strategic planning}. We illustrate our framework through a tractable simulated gig economy where agentic Large Language Models (LLMs) compete for jobs, develop skills, and adapt their strategies under competitive pressure. Our simulations illustrate how LLM agents explicitly prompted with reasoning capabilities learn to strategically self-improve and demonstrate superior adaptability to changing market conditions. At the market level, our simulations reproduce classic macroeconomic phenomena found in human labor markets, while controlled experiments reveal potential AI-driven economic trends, such as rapid monopolization and systemic price deflation. This work provides a foundation to further explore the economic properties of AI-driven labour markets, and a conceptual framework to study the strategic reasoning capabilities in agents competing in the emerging economy.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2512.04988

Genre:

Research Report > Experimental Study (0.68)
Research Report > Strength High (0.54)
Research Report > New Finding (0.46)

Industry:

Banking & Finance > Trading (1.00)
Banking & Finance > Economy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Aligning Artificial Superintelligence via a Multi-Box Protocol

Negozio, Avraham Yair

arXiv.org Artificial IntelligenceDec-1-2025

We propose a novel protocol for aligning artificial superintelligence (ASI) based on mutual verification among multiple isolated systems that self-modify to achieve alignment. The protocol operates by containing multiple diverse artificial superintelligences in strict isolation ("boxes"), with humans remaining entirely outside the system. Each superintelligence has no ability to communicate with humans and cannot communicate directly with other superintelligences. The only interaction possible is through an auditable submission interface accessible exclusively to the superintelligences themselves, through which they can: (1) submit alignment proofs with attested state snapshots, (2) validate or disprove other superintelligences' proofs, (3) request self-modifications, (4) approve or disapprove modification requests from others, (5) report hidden messages in submissions, and (6) confirm or refute hidden message reports. A reputation system incentivizes honest behavior, with reputation gained through correct evaluations and lost through incorrect ones. The key insight is that without direct communication channels, diverse superintelligences can only achieve consistent agreement by converging on objective truth rather than coordinating on deception. This naturally leads to what we call a "consistent group", essentially a truth-telling coalition that emerges because isolated systems cannot coordinate on lies but can independently recognize valid claims. Release from containment requires both high reputation and verification by multiple high-reputation superintelligences. While our approach requires substantial computational resources and does not address the creation of diverse artificial superintelligences, it provides a framework for leveraging peer verification among superintelligent systems to solve the alignment problem.

artificial intelligence, machine learning, superintelligence, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.70777/si.v2i5.15579

2511.21779

Genre: Research Report (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Realistic gossip in Trust Game on networks: the GODS model

Majewski, Jan, Giardini, Francesca

arXiv.org Artificial IntelligenceNov-26-2025

Gossip has been shown to be a relatively efficient solution to problems of cooperation in reputation-based systems of exchange, but many studies don't conceptualize gossiping in a realistic way, often assuming near-perfect information or broadcast-like dynamics of its spread. To solve this problem, we developed an agent-based model that pairs realistic gossip processes with different variants of Trust Game. The results show that cooperators suffer when local interactions govern spread of gossip, because they cannot discriminate against defectors. Realistic gossiping increases the overall amount of resources, but is more likely to promote defection. Moreover, even partner selection through dynamic networks can lead to high payoff inequalities among agent types. Cooperators face a choice between outcompeting defectors and overall growth. By blending direct and indirect reciprocity with reputations we show that gossiping increases the efficiency of cooperation by an order of magnitude.

artificial intelligence, game theory, gossip, (18 more...)

arXiv.org Artificial Intelligence

2511.20248

Country: Europe (0.68)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Game Theory (0.88)

Add feedback

OpenAI's Fidji Simo Plans to Make ChatGPT Way More Useful--and Have You Pay For It

WIREDNov-17-2025, 11:00:00 GMT

As OpenAI expands in every direction, the new CEO of Applications is on a mission to make ChatGPT indispensable and lucrative. In case OpenAI's structure couldn't get any weirder--a nonprofit in charge of a for-profit that's become a public benefit corporation--it now has two CEOs. There's Sam Altman, chief executive of the whole company, who manages research and compute. And as of this summer, there's Fidji Simo, the former CEO of Instacart, who manages everything else. Simo hasn't been seen much at OpenAI's San Francisco office since she began as CEO of Applications in August. But her presence is felt at every level of the company--not least because she's heading up ChatGPT and basically every function that might make OpenAI money. Simo is dealing with a relapse of postural orthostatic tachycardia syndrome (POTS) that makes her prone to fainting if she stands for long periods of time. "Being present from 8 am to midnight every day, responding within five minutes, people feel like I'm there and that they can reach me immediately, that I jump on the phone within five minutes," she tells me. Employees confirm that this is true. OpenAI's famously Slack-driven culture can be overwhelming for new hires. Employees say she is often seen popping into channels and threads, sharing thoughts and asking questions.

large language model, machine learning, natural language, (21 more...)

WIRED

Country:

North America > United States > California > San Francisco County > San Francisco (0.24)
North America > United States > California > Los Angeles County > Los Angeles (0.04)
Europe > Slovakia (0.04)
(2 more...)

Genre: Personal > Interview (0.46)

Industry:

Information Technology (1.00)
Media (0.94)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond

Hu, Botao 'Amber', Rong, Helena

arXiv.org Artificial IntelligenceNov-6-2025

As the "agentic web" takes shape-billions of AI agents (often LLM-powered) autonomously transacting and collaborating-trust shifts from human oversight to protocol design. In 2025, several inter-agent protocols crystallized this shift, including Google's Agent-to-Agent (A2A), Agent Payments Protocol (AP2), and Ethereum's ERC-8004 "Trustless Agents," yet their underlying trust assumptions remain under-examined. This paper presents a comparative study of trust models in inter-agent protocol design: Brief (self- or third-party verifiable claims), Claim (self-proclaimed capabilities and identity, e.g. AgentCard), Proof (cryptographic verification, including zero-knowledge proofs and trusted execution environment attestations), Stake (bonded collateral with slashing and insurance), Reputation (crowd feedback and graph-based trust signals), and Constraint (sandboxing and capability bounding). For each, we analyze assumptions, attack surfaces, and design trade-offs, with particular emphasis on LLM-specific fragilities-prompt injection, sycophancy/nudge-susceptibility, hallucination, deception, and misalignment-that render purely reputational or claim-only approaches brittle. Our findings indicate no single mechanism suffices. We argue for trustless-by-default architectures anchored in Proof and Stake to gate high-impact actions, augmented by Brief for identity and discovery and Reputation overlays for flexibility and social signals. We comparatively evaluate A2A, AP2, ERC-8004 and related historical variations in academic research under metrics spanning security, privacy, latency/cost, and social robustness (Sybil/collusion/whitewashing resistance). We conclude with hybrid trust model recommendations that mitigate reputation gaming and misinformed LLM behavior, and we distill actionable design guidelines for safer, interoperable, and scalable agent economies.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.03434

Country:

Europe (0.68)
South America > Brazil (0.28)

Genre: Research Report (0.84)

Industry:

Information Technology > Security & Privacy (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback