Goto

Collaborating Authors

 Africa


LLM Output Homogenization is Task Dependent

arXiv.org Artificial Intelligence

A large language model can be less helpful if it exhibits output response homogenization. But whether two responses are considered homogeneous, and whether such homogenization is problematic, both depend on the task category. For instance, in objective math tasks, we often expect no variation in the final answer but anticipate variation in the problem-solving strategy. Whereas, for creative writing tasks, we may expect variation in key narrative components (e.g. plot, genre, setting, etc), beyond the vocabulary or embedding diversity produced by temperature-sampling. Previous work addressing output homogenization often fails to conceptualize diversity in a task-dependent way. We address this gap in the literature directly by making the following contributions. (1) We present a task taxonomy comprised of eight task categories that each have distinct concepts of output homogenization. (2) We introduce task-anchored functional diversity to better evaluate output homogenization. (3) We propose a task-anchored sampling technique that increases functional diversity for task categories where homogenization is undesired, while preserving it where it is desired. (4) We challenge the perceived existence of a diversity-quality trade-off by increasing functional diversity while maintaining response quality. Overall, we demonstrate how task dependence improves the evaluation and mitigation of output homogenization.


Democratic or Authoritarian? Probing a New Dimension of Political Biases in Large Language Models

arXiv.org Artificial Intelligence

As Large Language Models (LLMs) become increasingly integrated into everyday life and information ecosystems, concerns about their implicit biases continue to persist. While prior work has primarily examined socio-demographic and left--right political dimensions, little attention has been paid to how LLMs align with broader geopolitical value systems, particularly the democracy--authoritarianism spectrum. In this paper, we propose a novel methodology to assess such alignment, combining (1) the F-scale, a psychometric tool for measuring authoritarian tendencies, (2) FavScore, a newly introduced metric for evaluating model favorability toward world leaders, and (3) role-model probing to assess which figures are cited as general role-models by LLMs. We find that LLMs generally favor democratic values and leaders, but exhibit increased favorability toward authoritarian figures when prompted in Mandarin. Further, models are found to often cite authoritarian figures as role models, even outside explicit political contexts. These results shed light on ways LLMs may reflect and potentially reinforce global political ideologies, highlighting the importance of evaluating bias beyond conventional socio-political axes. Our code is available at: https://github.com/irenestrauss/Democratic-Authoritarian-Bias-LLMs.


Waymo runs into safety concerns and competition as it expands in the US

Al Jazeera

The sidewalk outside Majed Zeidan's grocery store in San Francisco's Mission District has stayed filled with flowers, candles, memorials and pictures since his cat was crushed under a Waymo in late October. A month later, a Waymo reportedly crushed a dog. Amid the pictures of the cat, a visitor had placed a poster that said, "save the cat, kill the car". That's when Zeidan knew Kit Kat, his bodega cat, had become the face of the simmering discontent over San Francisco's growing number of self-driving cars. Residents became increasingly comfortable riding one, costumed Halloween parade goers clambered on its rooftops and danced, and pedestrians occasionally banged its bonnet to get it to give way to them.


Pioneering new treatment reverses incurable blood cancer in some patients

BBC News

A therapy that would once have been considered a feat of science fiction has reversed aggressive and incurable blood cancers in some patients, doctors report. The treatment involves precisely editing the DNA in white blood cells to transform them into a cancer-fighting living drug. The first girl to be treated, whose story we reported in 2022, is still free of the disease and now plans to become a cancer scientist. Now eight more children and two adults with T-cell acute lymphoblastic leukaemia have been treated, with almost two thirds (64%) of patients in remission. T-cells are supposed to be the body's guardians - seeking out and destroying threats - but in this form of leukaemia, they grow out of control.


Sweaty Betty in new dispute over ad slogans

BBC News

Activewear brand Sweaty Betty has become involved in a new dispute over advertising slogans, which a period underwear company claims were copied. Kelly Newton said Sweaty Betty's use of two taglines that were very similar to her firm Nixi Body's seemed a little off, and while she could not get them trademarked she felt Sweaty Betty was taking from other female founders. Sweaty Betty said the No ifs. Ms Newton said she was speaking out after seeing personal trainer Georgina Cox reveal Sweaty Betty had offered her a settlement over a disputed slogan . Ms Newton, who co-founded Nixi Body in 2019, said the company has advertised its leak-proof period underwear with the lines Keeping you moving through menstruation, motherhood and menopause and No leaks.


Ukraine firefighters rush to rescue people, pets after Russian strike

Al Jazeera

What is in the 28-point US plan for Ukraine? 'Ukraine is running out of men, money and time' Can the US get all sides to end the war? Why is Europe opposing Trump's peace plan? Firefighters evacuated residents and their pets from a nine-storey apartment building in Ukraine's Sumy region after a Russian drone strike. The strikes come as Ukrainian President Volodymyr Zelenskyy met with leaders of the UK, France and Germany in London to discuss the US peace plan.


How AI Is Reshaping Diplomacy and Global Affairs

TIME - Tech

With artificial intelligence putting productivity on hyperspeed, the painstaking but often slow nature of dealing with other countries, as well as policymaking, is also forced to speed up. But a panel at the forefront of these changes at the BRIDGE Summit in Abu Dhabi--which convenes creators, policymakers, investors, technologists, media institutions, and cultural leaders around the world to discuss the future of media--said that breaking things fast is not without consequences. "Decision makers are being asked to make decisions very quickly on the basis of information that may not be verified or verifiable," Elizabeth Churchill, a professor of Human-Computer Interaction from the Mohamed Bin Zayed University of Artificial Intelligence, told moderator Nikhil Kumar, an executive editor at TIME, which is a media partner of the BRIDGE Summit. Churchill, who held senior roles in firms like Google and Yahoo, said she returned to academia to explore transparent and "interrogable" AI tools and content that is effectively watermarked--so that decision-makers know at a glance if information is trustworthy. She said current shortfalls in information quality are "very much a design problem that sits at the surface of all of the tools that we use and in diplomacy conversations many different people are using."


Thieves snatch eight Matisse artworks from library in Brazil

BBC News

Two armed men have stolen eight engravings by French artist Matisse and at least another five by Brazilian painter Cรขndido Portinari from a library in Sรฃo Paulo. Brazilian officials say the thieves held up a security guard and an elderly couple who were visiting the library before making off with the artworks on foot. They reportedly entered the library by the main entrance at 10:00 (13:00 GMT) on Sunday, and left by the same route, heading towards the nearest metro station. The heist comes less than two months after the art world was rocked by a brazen break-in at the Louvre museum in Paris, where thieves made off with priceless jewels. The engravings stolen from Biblioteca Mรกrio de Andrade on Sunday formed part of a joint exhibition with the Sรฃo Paulo Museum of Modern Art.


Trump to sign 'one-rule' executive order on AI to bypass US state approvals

Al Jazeera

What are the implications of Trump's Somali'garbage' comments? What happens if the US attacks Venezuela? Does'America First' make the US weaker? What we know about the DC pipe bomb suspect Brian Cole Jr. Trump to sign'one-rule' executive order on AI to bypass US state approvals United States President Donald Trump has said he will sign an executive order creating "one rulebook" for artificial intelligence (AI) development. The announcement on Monday via Trump's Truth Social account represented the US president's latest effort to remove AI barriers, a priority of his administration that has raised concerns related to oversight of the transformative technology.


New mpox strain identified in England

BBC News

A new strain of mpox, previously called monkeypox, has been detected in a person in England, say UK health officials. The virus is a mix of two major types of the mpox virus, and was found in someone who recently returned from travelling in Asia. Officials say they are still assessing the significance of the new strain. The UK Health Security Agency (UKHSA) says it is normal for viruses to evolve. Getting vaccinated remains the best way to protect against severe disease - although an mpox infection is mild for many.