Goto

Collaborating Authors

 vote


Welcome to the dark side of crypto's permissionless dream

MIT Technology Review

Jean-Paul Thorbjornsen is a leader of THORChain, a blockchain that is not supposed to have any leaders--and is reeling from a series of expensive controversies. We can do whatever we want," Jean-Paul Thorbjornsen tells me from the pilot's seat of his Aston Martin helicopter. As we fly over suburbs outside Melbourne, Australia, it's becoming clear that doing whatever he wants is Thorbjornsen's MO. Upper-middle-class homes give way to vineyards, and Thorbjornsen points out our landing spot outside a winery. "They're going to ask for a shot now," he says, used to the attention drawn by his luxury helicopter, emblazoned with the tail letters "BTC" for bitcoin (the price tag of $5 million in Australian dollars--$3.5 million in US dollars today--was perhaps reasonable for someone who claims a previous crypto project made more than AU$400 million, although he also says those funds were tied up in the company). Thorbjornsen is a founder of THORChain, a blockchain through which users can swap ...



US says it shot down Iranian drone flying towards aircraft carrier

BBC News

An Iranian drone was shot down as it aggressively approached an American aircraft carrier in the Arabian Sea on Tuesday, a US military spokesman has said. An F-35C stealth fighter jet which took off from the USS Abraham Lincoln warship shot down the drone in self-defence to protect the aircraft carrier and its personnel, US Central Command spokesman Capt Tim Hawkins said. The ship was approximately 500 miles from the Iranian coast when the drone approached it with unclear intent. No US service members were harmed and no equipment was damaged. It comes as the US continues to build up a military presence in the region, with tensions high between Washington and Tehran.


Tesla sees first annual revenue drop as it shifts to AI and robots

BBC News

Tesla says its annual revenue has fallen for the first time as the electric vehicle (EV) maker shifts it focus to artificial intelligence (AI) and robotics. The company, which is run by multi-billionaire Elon Musk, reported a 3% decline in total revenues in 2025, while profits fell 61% in the last three months of the year. Tesla also announced plans to end production of its Model S and Model X vehicles. It will now use the manufacturing plant in California that made those cars to produce its line of humanoid robots - known as Optimus. In January, China's BYD overtook Tesla as the world's biggest EV maker, while Musk's involvement in politics both in the US and abroad has proved controversial.


GenAI Arena: An Open Evaluation Platform for Generative Models

Neural Information Processing Systems

Generative AI has made remarkable strides to revolutionize fields such as image and video generation. These advancements are driven by innovative algorithms, architecture, and data. However, the rapid proliferation of generative models has highlighted a critical gap: the absence of trustworthy evaluation metrics. Current automatic assessments such as FID, CLIP, FVD, etc often fail to capture the nuanced quality and user satisfaction associated with generative outputs. This paper proposes an open platform GenAI-Arena to evaluate different image and video generative models, where users can actively participate in evaluating these models.


Efficient and Thrifty Voting by Any Means Necessary

Neural Information Processing Systems

We take an unorthodox view of voting by expanding the design space to include both the elicitation rule, whereby voters map their (cardinal) preferences to votes, and the aggregation rule, which transforms the reported votes into collective decisions. Intuitively, there is a tradeoff between the communication requirements of the elicitation rule (i.e., the number of bits of information that voters need to provide about their preferences) and the efficiency of the outcome of the aggregation rule, which we measure through distortion (i.e., how well the utilitarian social welfare of the outcome approximates the maximum social welfare in the worst case). Our results chart the Pareto frontier of the communication-distortion tradeoff.


AI-Generated Compromises for Coalition Formation: Modeling, Simulation, and a Textual Case Study

Briman, Eyal, Shapiro, Ehud, Talmon, Nimrod

arXiv.org Artificial Intelligence

The challenge of finding compromises between agent proposals is fundamental to AI sub-fields such as argumentation, mediation, and negotiation. Building on this tradition, Elkind et al. (2021) introduced a process for coalition formation that seeks majority-supported proposals preferable to the status quo, using a metric space where each agent has an ideal point. The crucial step in this iterative process involves identifying compromise proposals around which agent coalitions can unite. How to effectively find such compromise proposals, however, remains an open question. We address this gap by formalizing a holistic model that encompasses agent bounded rationality and uncertainty and developing AI models to generate such compromise proposals. We focus on the domain of collaboratively writing text documents -- e.g., to enable the democratic creation of a community constitution. We apply NLP (Natural Language Processing) techniques and utilize LLMs (Large Language Models) to create a semantic metric space for text and develop algorithms to suggest suitable compromise points. To evaluate the effectiveness of our algorithms, we simulate various coalition formation processes and demonstrate the potential of AI to facilitate large-scale democratic text editing, such as collaboratively drafting a constitution, an area where traditional tools are limited.


Could ChatGPT Secretly Tell You How to Vote?

The Atlantic - Technology

Could ChatGPT Secretly Tell You How to Vote? In the months leading up to last year's presidential election, more than 2,000 Americans, roughly split across partisan lines, were recruited for an experiment: Could an AI model influence their political inclinations? The premise was straightforward--let people spend a few minutes talking with a chatbot designed to stump for Kamala Harris or Donald Trump, then see if their voting preferences changed at all. After talking with a pro-Trump bot, one in 35 people who initially said they would not vote for Trump flipped to saying they would. The number who flipped after talking with a pro-Harris bot was even higher, at one in 21.


Where did I put it? Loss of vital crypto key voids election

New Scientist

Feedback is entertained by the commotion at the International Association for Cryptologic Research's recent elections, where results could not be decrypted after an honest but unfortunate human mistake The phrase "you couldn't make it up", Feedback feels, is often misunderstood. It doesn't mean there are limits to the imagination, but rather that there are some developments you can't include in a fictional story because people would say "oh come on, that would never happen". The trouble is, those people are wrong, because real life is frequently ridiculous. In the world of codes and ciphers, one of the more important organisations is the International Association for Cryptologic Research, described as " a non-profit organization devoted to supporting the promotion of the science of cryptology ". The IACR recently held elections to choose new officers and directors and to tweak its bylaws.


Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge

Dadkhahi, Hamid, Trabelsi, Firas, Riley, Parker, Juraska, Juraj, Mirzazadeh, Mehdi

arXiv.org Artificial Intelligence

Thinking Large Language Models (LLMs) used as judges for pairwise preferences remain noisy at the single-sample level, and common aggregation rules (majority vote, soft self-consistency, or instruction-based self-aggregation) are inconsistent when ties are allowed. We study inference-time compute (ITC) for evaluators that generate n independent thinking-rating samples per item, and propose a principled, distribution-calibrated aggregation scheme. Our method models three-way preferences with a Bradley-Terry-Davidson formulation on rating counts, leveraging both polarity (margin among non-ties) and decisiveness (non-tie rate) to distinguish narrow margins from strong consensus. Across various evaluation benchmarks, our approach consistently reduces MAE and increases pairwise accuracy versus standard baselines, and when evaluated against human-consensus meta-labels, matches or exceeds individual human raters. These results show that carefully allocating ITC and aggregating with distribution-aware methods turns noisy individual model judgments into reliable ratings for evaluation. Thinking large language models (LLMs) are increasingly being employed as automated judges for evaluating the output of other generative systems, a paradigm known as "Thinking-LLM-as-a-Judge" (Saha et al., 2025). This approach offers a scalable and cost-effective alternative to human evaluation, which is often slow and expensive. To mitigate the inherent stochasticity and noise of single-pass judgments, a common strategy is to leverage inference-time compute (ITC) Snell et al. (2024) by generating multiple independent reasoning and rating samples for each item being evaluated. However, the reliability of the final judgment hinges critically on how these multiple outputs are aggregated. Current aggregation methods, such as majority voting (Self-Consistency (Wang et al., 2023b)) or heuristics based on model confidence scores or LLM generated aggregators, are often brittle and statistically suboptimal.