clash
- Europe > Austria > Vienna (0.14)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > Strength High (0.93)
- Research Report > Strength Medium (0.93)
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.93)
Thailand accuses Cambodia of breaking newly signed ceasefire deal
Thailand's army has accused Cambodia of breaching a newly-signed ceasefire deal reached after weeks of deadly clashes that forced nearly one million people from their homes. In a statement, the Thai army said that more than 250 unmanned aerial vehicles (UAVs) were detected flying from the Cambodian side on Sunday night. The ceasefire took effect at noon local time (05:00 GMT) on Saturday. Both sides agreed to freeze the front lines where they are now, ban reinforcements and allow civilians living in border areas to return as soon as possible. It had been seen as a breakthrough, which came after days of talks between both countries, with diplomatic encouragement from China and the US.
- Asia > Cambodia (0.81)
- Asia > Thailand (0.74)
- North America > United States (0.50)
- Leisure & Entertainment (0.75)
- Government > Regional Government (0.49)
- Law > Criminal Law (0.31)
- Media > Film (0.30)
Should I Stop or Should I Go: Early Stopping with Heterogeneous Populations
Randomized experiments often need to be stopped prematurely due to the treatment having an unintended harmful effect. Existing methods that determine when to stop an experiment early are typically applied to the data in aggregate and do not account for treatment effect heterogeneity. In this paper, we study the early stopping of experiments for harm on heterogeneous populations. We first establish that current methods often fail to stop experiments when the treatment harms a minority group of participants. We then use causal machine learning to develop CLASH, the first broadly-applicable method for heterogeneous early stopping. We demonstrate CLASH's performance on simulated and real data and show that it yields effective early stopping for both clinical trials and A/B tests.
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.93)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Vision (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Artificial Intelligence > Natural Language (0.67)
- Europe > Austria > Vienna (0.14)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > Strength High (0.93)
- Research Report > Strength Medium (0.93)
CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives
Ayoung Lee, Ryan Sungmo Kwon, Peter Railton, Lu Wang
Navigating dilemmas involving conflicting values is challenging even for humans in high-stakes domains, let alone for AI, yet prior work has been limited to everyday scenarios. To close this gap, we introduce CLASH (Character perspective-based LLM Assessments in Situations with High-stakes), a meticulously curated dataset consisting of 345 high-impact dilemmas along with 3,795 individual perspectives of diverse values. CLASH enables the study of critical yet underexplored aspects of value-based decision-making processes, including understanding of decision ambivalence and psychological discomfort as well as capturing the temporal shifts of values in the perspectives of characters. By benchmarking 14 non-thinking and thinking models, we uncover several key findings. Instead, new failure patterns emerge, including early commitment and overcommitment. This paper aims to address a core question: Can LLMs make proper judgments in high-stakes dilemmas according to different perspectives?
- North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- Health & Medicine > Therapeutic Area (0.68)
- Law (0.67)
- Education > Educational Setting > K-12 Education (0.45)
Musk's Grok AI bot falsely suggests police misrepresented footage of far-right rally in London
The Metropolitan police has had to counter false suggestions by the artificial intelligence on Elon Musk's X platform that the force passed off footage from 2020 as being from Saturday's far-right rally in the city. The claim by the chatbot Grok was in answer to an X user's query about where and when footage of police clashing with crowds was filmed. Grok, which has had a track record of giving false and misleading answers, replied: "This footage appears to be from an anti-lockdown protest in London's Trafalgar Square on 26 September 2020, during clashes between demonstrators and police over Covid restrictions."
- North America > United States (0.16)
- Europe > Ukraine (0.06)
- Africa > South Africa (0.05)
- Government > Regional Government (1.00)
- Leisure & Entertainment > Sports (0.71)
- Media > News (0.70)
Europe's clash with Musk's xAI escalates after Grok's rants
The clash between billionaire Elon Musk's xAI empire and European officials is intensifying with leaders in Poland and Germany calling for more aggressive action against the company. German lawmaker Ralf Stegner, responding to antisemitic comments that xAI's chatbot Grok made Tuesday on Musk's social media platform, X, said the posts "must not be tolerated under any circumstances" and called for sanctions in an interview with the German newspaper Handelsblatt. Poland's government separately urged the European Union to investigate and possibly fine xAI following lewd comments made by Grok about the country's politicians. The European Union is already investigating Musk's social media platform under a relatively new content-moderation policy known as the Digital Services Act and had been weighing a fine ahead of its summer recess in August. The regulator is reportedly considering calculating the fine by including revenue from Musk's other businesses, including SpaceX and Neuralink, an approach that would significantly increase the potential penalties.
- Government > Regional Government > Europe Government > Poland Government (0.30)
- Government > Regional Government > Europe Government > Germany Government (0.30)