Goto

Collaborating Authors

 Law


Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages

arXiv.org Artificial Intelligence

The advancement of Large Language Models (LLMs) has transformed natural language processing; however, their safety mechanisms remain under-explored in low-resource, multilingual settings. Here, we aim to bridge this gap. In particular, we introduce \textsf{SGToxicGuard}, a novel dataset and evaluation framework for benchmarking LLM safety in Singapore's diverse linguistic context, including Singlish, Chinese, Malay, and Tamil. SGToxicGuard adopts a red-teaming approach to systematically probe LLM vulnerabilities in three real-world scenarios: \textit{conversation}, \textit{question-answering}, and \textit{content composition}. We conduct extensive experiments with state-of-the-art multilingual LLMs, and the results uncover critical gaps in their safety guardrails. By offering actionable insights into cultural sensitivity and toxicity mitigation, we lay the foundation for safer and more inclusive AI systems in linguistically diverse environments.\footnote{Link to the dataset: https://github.com/Social-AI-Studio/SGToxicGuard.} \textcolor{red}{Disclaimer: This paper contains sensitive content that may be disturbing to some readers.}


Hurricane forecasters warn of double storm nearing East Coast in just DAYS

Daily Mail - Science & tech

Jimmy Kimmel's big TV comeback strangled as SEVENTY ABC affiliates refuse to air tonight's show Wall Street delivers clear verdict on Trump's Tylenol claims I was a devout Catholic... until I died. I'm the doctor on the cusp of an autism breakthrough... we're using an everyday $2.50 pill to reverse children's symptoms Secret Service foils'espionage' plot in NYC ahead of UN General Assembly that could have crashed Big Apple's phone network The'marry me' sex move that'll make even the most commitment-phobic of men beg to see you again... and it worked for THREE of my friends Sarah Ferguson sent Jeffrey Epstein fawning apology email'after he threatened to destroy her' in'Hannibal Lector-like' phone call The six hidden messages in the texts between Charlie Kirk's'assassin' and his trans lover DECODED Awkward moment Emmanuel Macron rings Trump for help after his motorcade is stopped by cops in New York... but ends up having to get out and walk Kate Middleton delivers a'mic drop' moment in dazzling gold dress identical to the late monarch - giving a glimpse of the Queen she plans to be William is urging his father to disown Fergie and Andrew over Epstein scandal... but King fears they could go rogue and values their loyalty Insiders speak out on Barack and Michelle Obama's secretive yacht vacation amid's**t show': 'They NEEDED this trip' MORE: Gabrielle surging into major hurricane as forecasters warn of'life-threatening' impact to East Coast Hurricane forecasters warn that the US East Coast could feel the wrath of two tropical cyclones at the same time next week. The National Hurricane Center (NHC) revealed their updated tracks for a pair of tropical disturbances brewing in the Atlantic, giving both a high likelihood of turning into named storms within seven days . One of these threatening weather systems, currently dubbed Invest 94L, is expected to form right off the coast of Florida . The other potential storm, called Invest 93L, has been developing in the middle of the Atlantic and forecasters unveiled an initial path that could take it towards the Carolinas by the end of the week.


Why Amazon Fresh failed: As all 19 remaining UK stores close, experts blame 'dystopian' concept that stoked fears of Big Brother surveillance

Daily Mail - Science & tech

Jimmy Kimmel's big TV comeback strangled as SEVENTY ABC affiliates refuse to air tonight's show Wall Street delivers clear verdict on Trump's Tylenol claims I was a devout Catholic... until I died. I'm the doctor on the cusp of an autism breakthrough... we're using an everyday $2.50 pill to reverse children's symptoms Secret Service foils'espionage' plot in NYC ahead of UN General Assembly that could have crashed Big Apple's phone network The'marry me' sex move that'll make even the most commitment-phobic of men beg to see you again... and it worked for THREE of my friends Sarah Ferguson sent Jeffrey Epstein fawning apology email'after he threatened to destroy her' in'Hannibal Lector-like' phone call The six hidden messages in the texts between Charlie Kirk's'assassin' and his trans lover DECODED Awkward moment Emmanuel Macron rings Trump for help after his motorcade is stopped by cops in New York... but ends up having to get out and walk Kate Middleton delivers a'mic drop' moment in dazzling gold dress identical to the late monarch - giving a glimpse of the Queen she plans to be William is urging his father to disown Fergie and Andrew over Epstein scandal... but King fears they could go rogue and values their loyalty Insiders speak out on Barack and Michelle Obama's secretive yacht vacation amid's**t show': 'They NEEDED this trip' Why Amazon Fresh failed: As all 19 remaining UK stores close, experts blame'dystopian' concept that stoked fears of Big Brother surveillance READ MORE: How artificial intelligence is ALREADY patrolling Britain's shops It was supposed to trigger a revolution in how we shop on the high street. But Amazon's'Fresh' shopping concept - which uses AI to track customers - may instead be remembered as a failed experiment in mass surveillance. The tech giant has announced that it has made the'difficult decision' to close all 19 Amazon Fresh stores in the UK - putting up to 250 workers at risk. Amazon claims the closures are the result of a'thorough evaluation of business operations and the very substantial growth opportunities in online delivery'.


Shake It Off! How Taylor Swift has ditched her Southern drawl in favour of a northern American accent, revealed

Daily Mail - Science & tech

Jimmy Kimmel's big TV comeback strangled as SEVENTY ABC affiliates refuse to air tonight's show Wall Street delivers clear verdict on Trump's Tylenol claims I was a devout Catholic... until I died. I'm the doctor on the cusp of an autism breakthrough... we're using an everyday $2.50 pill to reverse children's symptoms Secret Service foils'espionage' plot in NYC ahead of UN General Assembly that could have crashed Big Apple's phone network The'marry me' sex move that'll make even the most commitment-phobic of men beg to see you again... and it worked for THREE of my friends Sarah Ferguson sent Jeffrey Epstein fawning apology email'after he threatened to destroy her' in'Hannibal Lector-like' phone call The six hidden messages in the texts between Charlie Kirk's'assassin' and his trans lover DECODED Awkward moment Emmanuel Macron rings Trump for help after his motorcade is stopped by cops in New York... but ends up having to get out and walk Kate Middleton delivers a'mic drop' moment in dazzling gold dress identical to the late monarch - giving a glimpse of the Queen she plans to be William is urging his father to disown Fergie and Andrew over Epstein scandal... but King fears they could go rogue and values their loyalty Insiders speak out on Barack and Michelle Obama's secretive yacht vacation amid's**t show': 'They NEEDED this trip' Shake It Off! How Taylor Swift has ditched her Southern drawl in favour of a northern American accent, revealed In the 19 years since she released her first song, Taylor Swift's music been through several transitions, changing from country, to pop, to indie folk, and almost every genre in between. Now, a study has revealed how it's not just Swift's music that has evolved. Scientists from the University of Minnesota say the chart-topping singer's accent has also changed over time. In their study, the team analysed years of Swift's recorded interviews to track how her dialect has transformed.


Scientists think we may have received a signal from a parallel universe via a WORMHOLE

Daily Mail - Science & tech

Jimmy Kimmel's big TV comeback strangled as SEVENTY ABC affiliates refuse to air tonight's show Wall Street delivers clear verdict on Trump's Tylenol claims I was a devout Catholic... until I died. I'm the doctor on the cusp of an autism breakthrough... we're using an everyday $2.50 pill to reverse children's symptoms Secret Service foils'espionage' plot in NYC ahead of UN General Assembly that could have crashed Big Apple's phone network The'marry me' sex move that'll make even the most commitment-phobic of men beg to see you again... and it worked for THREE of my friends Sarah Ferguson sent Jeffrey Epstein fawning apology email'after he threatened to destroy her' in'Hannibal Lector-like' phone call The six hidden messages in the texts between Charlie Kirk's'assassin' and his trans lover DECODED Awkward moment Emmanuel Macron rings Trump for help after his motorcade is stopped by cops in New York... but ends up having to get out and walk Kate Middleton delivers a'mic drop' moment in dazzling gold dress identical to the late monarch - giving a glimpse of the Queen she plans to be William is urging his father to disown Fergie and Andrew over Epstein scandal... but King fears they could go rogue and values their loyalty Insiders speak out on Barack and Michelle Obama's secretive yacht vacation amid's**t show': 'They NEEDED this trip' In 2019, gravitational wave detectors on Earth picked up a signal that left scientists baffled. Gravitational waves are ripples in the fabric of space and time, usually created when massive, dense objects like black holes collide. But at less than a tenth of a second long, this sudden burst was far shorter than the drawn-out chirps normally produced by merging black holes. Now, researchers think this strange signal, dubbed GW190521, could have arrived from a parallel universe.


Drone attack kills eight children in Haiti's capital

Al Jazeera

Drone attack kills eight children in Haiti's capital NewsFeed Drone attack kills eight children in Haiti's capital At least 11 people, including eight children, were reported killed in a drone attack in a gang-controlled area of Haiti's capital Port-au-Prince. Relatives of the victims said police were targeting a suspected gang leader. Video: Parents carry children's bodies after Israeli attack on Gaza City Protesters outside UN say recognising Palestine isn't enough


"I think this is fair'': Uncovering the Complexities of Stakeholder Decision-Making in AI Fairness Assessment

arXiv.org Artificial Intelligence

Assessing fairness in artificial intelligence (AI) typically involves AI experts who select protected features, fairness metrics, and set fairness thresholds. However, little is known about how stakeholders, particularly those affected by AI outcomes but lacking AI expertise, assess fairness. To address this gap, we conducted a qualitative study with 30 stakeholders without AI expertise, representing potential decision subjects in a credit rating scenario, to examine how they assess fairness when placed in the role of deciding on features with priority, metrics, and thresholds. We reveal that stakeholders' fairness decisions are more complex than typical AI expert practices: they considered features far beyond legally protected features, tailored metrics for specific contexts, set diverse yet stricter fairness thresholds, and even preferred designing customized fairness. Our results extend the understanding of how stakeholders can meaningfully contribute to AI fairness governance and mitigation, underscoring the importance of incorporating stakeholders' nuanced fairness judgments.


Causal Fuzzing for Verifying Machine Unlearning

arXiv.org Artificial Intelligence

As machine learning models become increasingly embedded in decision-making systems, the ability to "unlearn" targeted data or features is crucial for enhancing model adaptability, fairness, and privacy in models which involves expensive training. To effectively guide machine unlearning, a thorough testing is essential. Existing methods for verification of machine unlearning provide limited insights, often failing in scenarios where the influence is indirect. In this work, we propose CAFร‰, a new causality based framework that unifies datapoint- and feature-level unlearning for verification of black-box ML models. CAFร‰ evaluates both direct and indirect effects of unlearning targets through causal dependencies, providing actionable insights with fine-grained analysis. Our evaluation across five datasets and three model architectures demonstrates that CAFร‰ successfully detects residual influence missed by baselines while maintaining computational efficiency.


Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans

arXiv.org Artificial Intelligence

Large Language Models (LLMs) are increasingly deployed in socially complex, interaction-driven tasks, yet their ability to mirror human behavior in emotionally and strategically complex contexts remains underexplored. This study assesses the behavioral alignment of personality-prompted LLMs in adversarial dispute resolution by simulating multi-turn conflict dialogues that incorporate negotiation. Each LLM is guided by a matched Five-Factor personality profile to control for individual variation and enhance realism. We evaluate alignment across three dimensions: linguistic style, emotional expression (e.g., anger dynamics), and strategic behavior. GPT-4.1 achieves the closest alignment with humans in linguistic style and emotional dynamics, while Claude-3.7-Sonnet best reflects strategic behavior. Nonetheless, substantial alignment gaps persist. Our findings establish a benchmark for alignment between LLMs and humans in socially complex interactions, underscoring both the promise and the limitations of personality conditioning in dialogue modeling.


Secure Confidential Business Information When Sharing Machine Learning Models

arXiv.org Artificial Intelligence

Model-sharing offers significant business value by enabling firms with well-established Machine Learning (ML) models to monetize and share their models with others who lack the resources to develop ML models from scratch. However, concerns over data confidentiality remain a significant barrier to model-sharing adoption, as Confidential Property Inference (CPI) attacks can exploit shared ML models to uncover confidential properties of the model provider's private model training data. Existing defenses often assume that CPI attacks are non-adaptive to the specific ML model they are targeting. This assumption overlooks a key characteristic of real-world adversaries: their responsiveness, i.e., adversaries' ability to dynamically adjust their attack models based on the information of the target and its defenses. To overcome this limitation, we propose a novel defense method that explicitly accounts for the responsive nature of real-world adversaries via two methodological innovations: a novel Responsive CPI attack and an attack-defense arms race framework. The former emulates the responsive behaviors of adversaries in the real world, and the latter iteratively enhances both the target and attack models, ultimately producing a secure ML model that is robust against responsive CPI attacks. Furthermore, we propose and integrate a novel approximate strategy into our defense, which addresses a critical computational bottleneck of defense methods and improves defense efficiency. Through extensive empirical evaluations across various realistic model-sharing scenarios, we demonstrate that our method outperforms existing defenses by more effectively defending against CPI attacks, preserving ML model utility, and reducing computational overhead.